CHAPTER 8 ONCOGENIC MARKER DETECTION FROM P53 MUTANT AMINO-ACID SUBSTITUTIONS

Size: px
Start display at page:

Download "CHAPTER 8 ONCOGENIC MARKER DETECTION FROM P53 MUTANT AMINO-ACID SUBSTITUTIONS"

Transcription

1 134 CHAPTER 8 ONCOGENIC MARKER DETECTION FROM P53 MUTANT AMINO-ACID SUBSTITUTIONS The recent past has witnessed a rapid rise in the utilization of computational techniques to aid and accelerate biological experiments by providing improved focus for wet-lab investigations that would otherwise be a heavy labour and resource-intensive task (Onisko et al, 2011;Huang et al, 2011; Familil et al, 2012). This chapter details the Genetic Mutant Marker Extraction (GMME) methodology based on Naïve Bayes learning to detect the co-occurring mutations that have the capacity to reinstate P53 to normalcy. The proposed GMME methodology aimed at predicting the possible hot spot cancer, strong rescue and weak rescue mutants that have the capacity to reinstate normal P53 transcriptional activity as detailed in this chapter. 8.1 P53 AMINO-ACID SUSBTITUTIONS The training P53 mutation (amino-acid substitutions) data adopted for this research was obtained by yeast P53 functional assays (Brachmann, 2005; Danziger et al, 2006). The mutation data comprised of amino-acid substitutions from 1-site until 6-site. The data required pre-processing before execution of the proposed GMME methodology. The proposed framework for P53 amino-acid substitution detection is detailed in the next section.

2 8.2 ONCOGENIC MARKER EXTRACTION FRAMEWORK FOR CANCER PREDICTION 135 The methodology adopted in this research work to predict the P53 amino-acid substitutions is diagrammatically presented in Figure 8.1. Figure 8.1 Oncomutation marker extraction framework for cancer prediction

3 136 In order to formulate the GMME technique, three major requirements were to be met. Primarily, the raw P53 mutation data had to be pre-processed to retrieve the mutations on a site-wise basis. The data was pre-processed to manipulate the large dataset comprising of P53 cancer mutants and their cooccurring (2-site, 3-site, 4-site) mutations spanning nearly 16,700 records. Secondly, a suitable prediction technique had to be chosen to classify the P53 amino-acid substitutions as active/inactive. For this reason, a comparison on the predictive performance was drawn among the well-known and outstanding classifiers to detect the status of site-wise P53 amino-acid substitutions. Finally a Naïve Bayes-based learning approach was devised to identify and extract the hot spot cancer/strong rescue/weak rescue mutations Data Pre-Processing of P53 Amino-Acid Substitutions Initially the P53 amino-acid substitution data contained innumerable missing values. Hence, all the records that had well-defined values for each of the sites were filtered separately as 1-site, 2-site, 3-site, 4-site, 5-site and 6-site mutation data. This pre-processed data held precise information on the cancer mutants and their co-occurring mutation sites (1-site to 6-site mutations). The data comprised of 16,699 P53 mutant records each depicting amino acid substitutions at the different codon positions as depicted in Table 8.1. Table 8.1 Site-wise P53 amino-acid substitution records Site Active Records Inactive Records Total No.of Records ,496 16, Training records 16,699 The mutations sites varied from one to six. The 1-site data represented the independent P53 mutations while the 2-site to 6-site data represented the co-occurring mutations that have the capacity to turn the

4 137 previous site mutations cancerous (inactive) or non-cancerous (active). However, rescue markers could be detected only from the second, third and fourth site recombination records since 1-site data were independent mutations while the 5-site and 6-site site data contained very few records that led to reasonably low cross-validation accuracy and hence believed to be insufficient to train the classifier. The 1-site data constituted fifty-four hot spot cancer mutants of which fourteen were rescueable as revealed by the proposed GMME technique. The second, third and fourth site mutation records constituted the training dataset Learning Methodology for Cancer Prediction from P53 Amino- Acid Substitutions A comparison of five prediction techniques viz, Naïve Bayes, Nearest Neighbor, Decision Tree, Random Forest and Ababoost with Decision Stump learning was carried out on the P53 training dataset. The Naïve Bayes classifier generated optimal accuracy, MCC, AUC on the 2-site, 3-site and 4-site recombinations in comparatively less time. This led to the formulation of a general data mining framework to categorize the mutant types as hot spot cancer, strong rescue and weak rescue mutants. The Naïve Bayes Learning algorithm presented a supervised learning methodology that depicted reasonably good performance in prediction of P53 activity from amino-acid substitutions. This guided the research focus to utilize the Naïve Bayesian probabilities to formulate the GMME methodology Genetic Mutant Marker Extraction Methodology The P53 mutant marker prediction hierarchy is depicted in Figure 8.2. After executing the Naïve Bayes algorithm on the site-wise P53 amino-acid substitutions, the GMME methodology was applied as described below.

5 138 Training Data P53 Mutants (Amino-acid Substitutions) Naïve Bayes Learning Inactive Figure Active GMME Technique Hot Spot Cancer Mutants Strong Rescue Mutants Weak Rescue Mutants Figure 8.2 P53 mutant marker prediction hierarchy The GMME methodology to relate and extract the cancer/rescue mutations at each recombination site is as follows. The Conditional Probability Values (CPV) and Classification Function Values (CFV) for each amino acid substitution in the 2-site, 3-site and 4-site training data were calculated according to Eq. (8.1) and Eq. (8.2) respectively (Han and Kamber, 2006). CPV=P (X CL) = (8.1) CFV= P (CL X) =P (CL) * (8.2) P53 mutation (amino-acid substitution) and CL denotes the target class (Active/Inactive). The number of recombinant mutation 8.1) and (8.2) correspond to take up both positive

6 139 and negative values while CPV can span only positive values ranging from zero to one. The algorithmic presentation of the GMME methodology is depicted below. Input: Pre-processed training mutation data (1-site, 2-site, 3-site, 4-site) Output: Cancer/Strong Rescue/Weak Rescue mutations Algorithm: under consideration (4 in this research).. Step 3: Let i=1; // To detect cancer mutations Step 4: X=i+1; Step 5: For each data record in the subset Step 5.2: Move to next record. Step 5.3 Else store as NON-CANCER mutant and repeat steps and Step 6: i=i+1; Step 7: While (i>1 && i, j=i+1<n) //Co-occurring mutations (N=4) Step 7.1: for (k=1, m=1; k<=i; k++, m++) xm = k; //x1-site 1, x2-site 2 Step 7.2: Obtain the conditional probability values (c.p.v) value for each record in each xm in this training subset as follows: P (X CL) = Step 7.3: Obtain the classification function distribution (c.f.v) for each record in each xm. P (CL X) =P (CL) *

7 140 Step 7.4: If at site i for xm) then Step 7.4.1: Store mutation as STRONG RESCUE mutations at site i. Step 7.4.2: Verify with co-occurring mutations in the higher sites (up to site 4). Step 7.4.3: If ((c.f.v) && (c.p.v) values are high for class ( in higher sites, then Step : Store corresponding mutation sites as WEAK RESCUE. Step : Remove such mutations from STRONG RESCUE list. Step 7.5: Else If c.p.v value for mutation in site-i is low then ignore // It has no rescue mutant in this subset. Step7.6: Else If c.p.v value for site i is low For every combination in this subset Step 7.6.1: Detect mutations that have high CPV Step 7.6.2: Store as Strong Rescue mutant for mutation in site-i. Step 7.6.3: Repeat steps and Step7.7: Else If c.p.v value is HIGH for both classes then Step 7.7.1: Detect mutations that have high CPV Step 7.7.2: Store as Strong Rescue mutant for mutation in site-i. Step 7.7.3: Repeat steps and End While Step 8: End The first step in GMME was to detect the Naïve Bayes probabilities defined by the Conditional Probability Value (CPV) and Classification Function

8 141 Value (CFV) of each amino-acid substitution with respect to the target (Active/Inactive) class. The probability values indicated clearly the contribution P53 mutation. The CPV and CFV took up one of the two values: Probability status High (PSH) for CFV is depicted by bability status Low (PSL). A low value indicated least contribution to the target class and a high value indicated the most contribution to the target class. The amino acid substitutions in each site were individually investigated in the GMME technique. A co-occurring amino-acid substitution is i CFV were high (PSH) with class Active at that particular site. A co-occurring rescue mutation for a hot spot cancer mutant was added to the Strong Rescue all higher order recombinant sites (i+1<=4). A co-occurring rescue mutation for a cancer mutant was added to the Weak Rescue mutant list iff the CFV or CPV recombination site (i+1<=4). The amino-acid substitutions that tend to be tes were termed to be hot spot cancer mutants. The CPV The obtained values were compared with respect to the target class x i -acid substitution at mutants, [C] denote the set of hot spot Cancer mutants and [WR], [SR] denote the set of Weak Rescue mutants and Strong Rescue mutants respectively. Let s/class of aminothe Inactive status/class. The principle adopted in the GMME methodology to categorize the amino acid substitutions under the different mutant types can be concisely denoted by the following equations:

9 x i 142 [ R] X : ( xicpv && xicfv == PSH): Active Class (A) (8.3) [ SR] x j [ R] : 4 j i 1x j ( x && jcpv x == PSH): Active Class (A) (8.4) jcfv x j [ WR] [ R] : 4 j i 1x j ( x jcpv x == PSH) : Inactive Class (IA) (8.5) jcfv x i [ C] X : 4 i 2 x i ( x && icpv xicfv == PSH): Inactive Class (IA) (8.6) x icpv, x icfv, denote the CPV and CFV of the amino acid substitutions at site í. The probability status of CPV for both the Active and Inactive class may take up only positive values (0 to 1). The PSH for both the Active and Inactive classes for the CPV is given by values greater than zero (>0) while the PSL is given by zero (=0). Eq. (8.3) depicted the co-occurring amino-acid substitutions that mutants [R]. Eq. (8.4) revealed the set of Rescue mutants[r] that generated PSH with the Active class (A) for all the recombinant sites (j=i+1 <=4) thus constituting the set of Strong Rescue [SR] mutants. Eq. (8.5) extracted the set of Rescue mutants[r] if there existed at least one recombinant amino acid substitution at a site (j=i+1 <=4) with the Inactive class (IA) thus constituting the set of Weak Rescue [WR] mutants. Hot spot Cancer mutants [C] from Eq. (8.6) are obtained by detecting the amino acid substitutions that revealed PSH with the Inactive class for all sites (until 4). The experimental results and the validations performed are discussed in the following section.

10 RESULTS AND DISCUSSION The experimental results are discussed as follows. Section details the data pre-processing outcome while Section summarizes the evaluation results of the supervised learning approaches on the P53 amino-acid substitution data. Section highlights the results of identifying the hot spot cancer/strong rescue/weak rescue mutants from the conditional probabilities and reports the novel rescue mutants for fourteen hot-spot cancer mutants at each site Pre-Processed P53 Amino-Acid Substitution Data The site-specific P53 mutation data records for tumour marker prediction are tabulated in Table 8.2. The 2-site, 3-site and 4-site mutation records were utilized as the training data for the subsequent prediction techniques. Table 8.2 P53 mutation sites sample training samples 1-site 2-site 3-site 4-site 5- site 6- site Class a119e X x X x x inactive a161t X x X x x inactive c135y X x X x x inactive c141y X x X x x inactive c141y d228e n235k n239l y236n x active c141y i232v h233r y234f n235k n239l active c141y n235k x X x x active c141y n235k n239y X x x active c141y n235k s240t X x x active c176f x x x x x inactive c176y x x x x x inactive The rows containing 2-site, 3-site and 4-site mutations were segregated to obtain the P53 mutation training data from which the genetic

11 markers could be extracted at the respective site-wise recombinations. Table 8.3 holds the P53 amino-acid substitutions at 2-site. 144 Table 8.3 Sample P53 mutation 2-site training samples 1- site 2-site Class a119e l125p inactive c135y e285m inactive c135y e285v inactive The total number of records spanning the P53 amino-acid substitutions at all sites summed up to 16,669 records of which nearly 16, 553 records constituted the 2-site data. Table 8.4 Sample P53 mutation 3-site training samples 1- site 2-site 3-site Class a119e r283k a353v inactive c135y n235k n239y inactive c141y n235k n239y active c141y n235k s240t active c176f n235k n239y inactive The 3-site data constituted 112 records while the 4-site mutation data held 31 records, the samples of which are shown in Table 8.4 and Table 8.5 respectively. Table 8.5 Sample P53 mutation 4-site training samples 1-site 2-site 3-site 4-site Class c141y d228a n235k n239m inactive g245s i232m n239i s240k inactive g245s m237l s240w s241a inactive r249m n235k n239y s227t active r249s k139r h168r n239y active r249s v122i c124s h168r inactive Once the data was pre-processed, a comparison was drawn among the predictor methods to identify the most optimal method to detect the

12 oncomutations. The performance of the classifier models is discussed in the next section Classifier Performance on P53 Amino-Acid Substitution Data Five classification methods namely Naïve Bayes, Nearest-Neighbor, Decision Tree, Random Forest and Adaboost learning methods were utilized for comparison based on their reported optimal performance on biological datasets. The results are tabulated in Table 8.6 and Table 8.7. Table 8.6. Classifier performance on P53 amino acid substitution data Classification Algorithms ACC (%) MCC 2-site 3-site 4-site 2-site 3-site 4-site Naïve Bayes Nearest Neighbor Decision Tree Random Forest Adaboost In order to ensure the efficiency of the Naïve Bayes algorithm in P53 functionality prediction, the AUC and computation time of the algorithms were also investigated. Table 8.7. Area Under ROC Curve of classification algorithms on P53 mutation data S.No Classification AUC Computation Time Algorithms 2-site 3-site 4-site 2-site 3-site 4-site 1. Naïve Bayes Nearest Neighbor Decision Tree Random Forest Adaboost

13 146 The Naïve Bayes algorithm was found to be most suitable for the prediction of P53 mutant status based on the MCC, AUC and the computation time. The related sensitivity (True Positive Rate-TPR) and False Positive Rate (FPR) values for ROC analysis are tabulated in Table 8.8. Table 8.8. Sensitivity and specificity values for ROC curve analysis Algorithm 2-Site 3-Site 4-Site TPR FPR TPR FPR TPR FPR Naïve Bayes Nearest-Neighbor Decision Tree Random Forest Adaboost FPR is measured as 1- Specificity. Although the Random Forest and SVM classifiers also showed comparable accuracy and MCC in prediction, the execution time of the algorithms was heavy compared to Naïve Bayes learning. The predicted P53 cancer/rescue mutants are discussed in the next section Prediction of Rescue Markers from P53 Amino-Acid Substitution Data The P53 cancer mutations and their corresponding rescue mutants obtained via wet-lab assay experiments have been reported in previous studies. The 2-site suppressor mutations are tabulated in Table 8.9. Rescue markers previously reported by Brachmann were brought out by the proposed GMME technique too: C141Y - I232V - H233R - Y234F - N235K - N239L and R272L - D228Q - N235K - N239Y - S240R - C229W.

14 147 Table 8.9 P53 suppressor mutations at 2-site recombination S.No Cancer Mutant 2-Site Rescue Mutants 1. C141Y N235K 2. G245S S240Y, T231Y 3. P152L D207E,H115D,H115E,K101E,L114P,L137M,Q100A,T118Y, T123A,T123G,T123P,T123S 4 R273H C242R,S240N,S240Q 5. V272M N235K,N239Y The results were tabulated according to the sites at which the mutants were rescued. The graphical representation of the cancer-rescue mutant relationship is depicted in Figure 8.3. Figure 8.3 P53 cancer-rescue mutants at 2-site recombination The cancer mutants are depicted in green color in Figure 8.3, Figure 8.4 and Figure 8.5. The cancer mutants are depicted as diamonds while the rescue mutants are portrayed as discs. E286K is weakly rescued by N235K at the second site but later inactivated P53 at codon 239 re-combining as N239W and N239Y at the 3rd

15 148 site.t123a is a weak active mutant that turns P53 inactive due to combination with H168R/S240R in the second site. Hence, they were not included in the list of rescue mutants. Table 8.10 lists the P53 suppressor mutations at the 3-site recombination. Table 8.10 P53 suppressor mutations at 3-site recombination S.No Cancer 2-Site-3- Site Rescue Recombination Mutant 1. C141Y N235K-N239Y 2. G245C N235K-N239Y 3. G245S H115A-T123P,H115D-T123P,H115P-T123P,H115R- T123P,H115S-T123P,H115T-T123P,H233C-N239F,N235K- N239Y,N239F-S240N,N239Y-S99H,N239Y-T102S,N239Y- T230S,Y234F-S240Y 4. R158L D228E-N239R,H233L-N239F,H233Y-N239L,N235K- N239Y, N235T-N239R,S227F-N239Y 5 R249M N235K-N239Y 6. R273C D281G-E285G 7. R273H G226D-S240R,N239R-S240N 8. V157F N235K-N239Y,N239R-N235K 9. V173L N239Y-S240T,S227T-N239Y,N235K-N239Y 10. V272M N235K-N239Y 11. Y205C D207V-N239W,D228E-N239Y,N235K-N239Y 12. Y220C D228E-N239Y,N235K-N239L,N235K-N239Y,T230C- N239Y,Y234F-N239L The proposed GMME methodology extracted the hot-spot cancer, strong rescue and weak rescue mutants by relating the conditional probabilities and classification function values at the site-wise recombination beginning at the 2- site. The rescue mutants were thus identified corresponding to their active functionality at the respective site. The rescue mutants at the 3-site recombination are graphically represented in Figure 8.4 along with the cancer mutants they rescue.

16 149 Figure 8.4 P53 cancer-rescue mutants at 3-site recombination The 4-site P53 suppressor mutations are tabulated in Table It is to be noted that the major number of rescue mutations exist in the surface regions spanning the 2-site, 3-site and 4-site data. Table 8.11 P53 suppressor mutations at 4-site recombination S.No Cancer Mutant 2-Site-3-Site-4-Site Rescue Recombination 1. R249S K139R - H168R - N239Y 2 V173L T231I-N235K-S240N 3. Y205C D207V-H233N-N239W The graphical representation of the rescue mutants at the 4-site recombination is depicted in Figure 8.5. Only three rescue mutants have been detected from this site. This calls for more training samples from wet-lab investigations to identify possible rescues for other cancer mutants.

17 150 Figure 8.5 P53 cancer-rescue mutants at 4-site recombination The cancer mutants that could not be rescued according to our findings are tabulated in Table These cancer mutants were not rescued by any other mutation at any higher order site recombination. Table 8.12 Hot Spot Cancer Mutants S.No Mutant S.No Mutant S.No Mutant S.No Mutant 1. A119E 11. F113L 21. H214R 31. R248W 2 A161T 12. G117Q 22. I195T 32. R280K 3. C135Y 13. G244S 23. M237I 33. R280T 4. C176F 14. G245D 24. P151S 34. R282W 5. C176Y 15. G245V 25. P278L 35. S183T 6. C238Y 16. G266E 26. P278S 36. S227P 7. C242F 17. G266R 27. R158H 37. S240R 8. C275Y 18. H179R 28. R175H 38 S241F 9. E258K 19. H179Y 29. R248L 39. V172M 10. E285K 20. H193R 30. R248Q 40. Y234C Such mutants need further clinical investigations and stronger applications of permissible external forces to induce possible mutations for rescue. The proposed GMME methodology directly targeted the individual P53 cancer mutants that have been validated by yeast assays and have revealed

18 151 rescue markers for fourteen cancer mutants. No existing feature selection algorithm could be utilized for categorizing the mutants since all the features (1- site to 4-site mutations) need to be considered to categorize and predict the mutants as hot-spot cancer, strong rescue and weak rescue mutants. In addition, no existing classification algorithm could categorize the mutants as the only information that was available on their status was whether they were active or inactive. The proposed GMME methodology targeted this drawback and attempted to detect the Bayesian probabilities and classification function values of the mutants in the respective categories (active/inactive), then related them using the site-wise recombination, and then categorized them as hot spot cancer, weak rescue and strong rescue mutants. 8.4 SUMMARY This chapter details the proposed Genetic Mutant Marker Extraction (GMME) technique to extract and categorize the hot spot cancer, strong rescue and weak rescue mutants at the respective mutation sites from P53 amino-acid substitutions. The importance of oncoprotein activity motivated further research on utilizing data mining techniques to predict novel oncoprotein interaction patterns. This revealed the downsides of the existing approaches that were restricted in their application to supervised datasets. The next chapter details the proposed Association Rule Mining based approach to detect new oncoprotein interaction patterns from available HIV1-human protein-protein interacting pairs to predict yet unknown viral-human protein pair interactions through unsupervised learning.

CHAPTER 3 PROBLEM STATEMENT AND RESEARCH METHODOLOGY

CHAPTER 3 PROBLEM STATEMENT AND RESEARCH METHODOLOGY 64 CHAPTER 3 PROBLEM STATEMENT AND RESEARCH METHODOLOGY 3.1 PROBLEM DEFINITION Clinical data mining (CDM) is a rising field of research that aims at the utilization of data mining techniques to extract

More information

Supplementary Figure 1. FACS analysis of cells infected with TY93/H5N1 GFP-627E,

Supplementary Figure 1. FACS analysis of cells infected with TY93/H5N1 GFP-627E, Supplementary Figure 1. FACS analysis of cells infected with TY93/H5N1 GFP-627E, TY93/H5N1 GFP-627K, or the TY93/H5N1 PB2(588-759) virus library. To establish our GFP- FACS screening platform, we compared

More information

A Comprehensive Study of TP53 Mutations in Chronic Lymphocytic Leukemia: Analysis of 1,287 Diagnostic CLL Samples

A Comprehensive Study of TP53 Mutations in Chronic Lymphocytic Leukemia: Analysis of 1,287 Diagnostic CLL Samples A Comprehensive Study of TP53 Mutations in Chronic Lymphocytic Leukemia: Analysis of 1,287 Diagnostic CLL Samples Sona Pekova, MD., PhD. Chambon Ltd., Laboratory for molecular diagnostics, Prague, Czech

More information

To test the possible source of the HBV infection outside the study family, we searched the Genbank

To test the possible source of the HBV infection outside the study family, we searched the Genbank Supplementary Discussion The source of hepatitis B virus infection To test the possible source of the HBV infection outside the study family, we searched the Genbank and HBV Database (http://hbvdb.ibcp.fr),

More information

Supplementary Figure 1

Supplementary Figure 1 Count Count Supplementary Figure 1 Coverage per amplicon for error-corrected sequencing experiments. Errorcorrected consensus sequence (ECCS) coverage was calculated for each of the 568 amplicons in the

More information

Predicting Breast Cancer Survivability Rates

Predicting Breast Cancer Survivability Rates Predicting Breast Cancer Survivability Rates For data collected from Saudi Arabia Registries Ghofran Othoum 1 and Wadee Al-Halabi 2 1 Computer Science, Effat University, Jeddah, Saudi Arabia 2 Computer

More information

Virological failure to Protease inhibitors in Monotherapy is linked to the presence of signature mutations in Gag without changes in HIV-1 replication

Virological failure to Protease inhibitors in Monotherapy is linked to the presence of signature mutations in Gag without changes in HIV-1 replication Virological failure to Protease inhibitors in Monotherapy is linked to the presence of signature mutations in Gag without changes in HIV-1 replication Oscar Blanch-Lombarte Rome, 7-9 June, 2017 European

More information

modified dye uptake assay including formazan test EC 90 not tested plaque reduction assay

modified dye uptake assay including formazan test EC 90 not tested plaque reduction assay Sauerbrei A, Bohn-Wippert K, Kaspar M, Krumbholz A, Karrasch M, Zell R. 2015. Database on natural polymorphisms and resistance-related non-synonymous mutations in thymidine kinase and DNA polymerase genes

More information

HCV NS3 Protease Drug Resistance

HCV NS3 Protease Drug Resistance test code: cpt code: 10000 87902 category: Infectious Disease HCV genotype 1a CODON Glecaprevir 1a Grazoprevir 1a Paritaprevir 1a Simeprevir 1a Voxilaprevir 1a V36A S 13 R 10 Comment V36C V36G V36I V36L

More information

Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data

Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data Dhouha Grissa, Mélanie Pétéra, Marion Brandolini, Amedeo Napoli, Blandine Comte and Estelle Pujos-Guillot

More information

SUPPLEMENTARY INFORMATION. Rare independent mutations in renal salt handling genes contribute to blood pressure variation

SUPPLEMENTARY INFORMATION. Rare independent mutations in renal salt handling genes contribute to blood pressure variation SUPPLEMENTARY INFORMATION Rare independent mutations in renal salt handling genes contribute to blood pressure variation Weizhen Ji, Jia Nee Foo, Brian J. O Roak, Hongyu Zhao, Martin G. Larson, David B.

More information

Pharmacogenomics in Rare Diseases: Development Strategy for Ivacaftor as a Therapy for Cystic Fibrosis

Pharmacogenomics in Rare Diseases: Development Strategy for Ivacaftor as a Therapy for Cystic Fibrosis Pharmacogenomics in Rare Diseases: Development Strategy for Ivacaftor as a Therapy for Cystic Fibrosis Federico Goodsaid Vice President Strategic Regulatory Intelligence Vertex Pharmaceuticals Is there

More information

Supplementary Figure 1 Weight and body temperature of ferrets inoculated with

Supplementary Figure 1 Weight and body temperature of ferrets inoculated with Supplementary Figure 1 Weight and body temperature of ferrets inoculated with A/Anhui/1/2013 (H7N9) influenza virus. (a) Body temperature and (b) weight change of ferrets after intranasal inoculation with

More information

Mature microrna identification via the use of a Naive Bayes classifier

Mature microrna identification via the use of a Naive Bayes classifier Mature microrna identification via the use of a Naive Bayes classifier Master Thesis Gkirtzou Katerina Computer Science Department University of Crete 13/03/2009 Gkirtzou K. (CSD UOC) Mature microrna identification

More information

SUPPLEMENTAL MATERIAL

SUPPLEMENTAL MATERIAL SUPPLEMENTAL MATERIAL SUPPLEMENTAL MATERIAL Supplemental Methods Phenotype Prediction Analyses In order to assess the phylogenetic properties of nssnvs, sequence conservation analysis was conducted using

More information

Deep-Sequencing of HIV-1

Deep-Sequencing of HIV-1 Deep-Sequencing of HIV-1 The quest for true variants Alexander Thielen, Martin Däumer 09.05.2015 Limitations of drug resistance testing by standard-sequencing Blood plasma RNA extraction RNA Reverse Transcription/

More information

HCV NS3 Protease Drug Resistance

HCV NS3 Protease Drug Resistance test code: cpt code: 10000 87902 category: Infectious Disease HCV genotypes 1a CODON Simeprevir 1a Boceprevir 1a Telaprevir 1a Paritaprevir 1a Grazoprevir 1a V36A Comment R 3 R 10, 11 R 16 S 24 V36C V36G

More information

Kalydeco. Kalydeco (ivacaftor) Description

Kalydeco. Kalydeco (ivacaftor) Description Federal Employee Program 1310 G Street, N.W. Washington, D.C. 20005 202.942.1000 Fax 202.942.1125 5.45.03 Subject: Kalydeco Page: 1 of 6 Last Review Date: November 30, 2018 Kalydeco Description Kalydeco

More information

PREDICTION OF BREAST CANCER USING STACKING ENSEMBLE APPROACH

PREDICTION OF BREAST CANCER USING STACKING ENSEMBLE APPROACH PREDICTION OF BREAST CANCER USING STACKING ENSEMBLE APPROACH 1 VALLURI RISHIKA, M.TECH COMPUTER SCENCE AND SYSTEMS ENGINEERING, ANDHRA UNIVERSITY 2 A. MARY SOWJANYA, Assistant Professor COMPUTER SCENCE

More information

History (August 2010) Therapy for Experienced Patients. History (September 2010) History (November 2010) 12/2/11

History (August 2010) Therapy for Experienced Patients. History (September 2010) History (November 2010) 12/2/11 (August 2010) Therapy for Experienced Patients Hiroyu Hatano, MD, MHS Assistant Professor of Medicine University of California San Francisco Medical Management of AIDS December 2011 42M HIV (CD4=450, VL=6250,

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature13898 Supplementary Information Table 1 Kras mutation status of carcinogen-induced mouse lung adenomas Tumour Treatment Strain Grade Genotype Kras status (WES)* Kras status (Sanger) 32T1

More information

Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project

Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project Computational Identification and Prediction of Tissue-Specific Alternative Splicing in H. Sapiens. Eric Van Nostrand CS229 Final Project Introduction RNA splicing is a critical step in eukaryotic gene

More information

Resistance Workshop. 3rd European HIV Drug

Resistance Workshop. 3rd European HIV Drug 3rd European HIV Drug Resistance Workshop March 30-April 1 st, 2005 Christine Hughes, PharmD Clinical Associate Professor Faculty of Pharmacy & Pharmaceutical Sciences University of Alberta Tenofovir resistance

More information

Stage-Specific Predictive Models for Cancer Survivability

Stage-Specific Predictive Models for Cancer Survivability University of Wisconsin Milwaukee UWM Digital Commons Theses and Dissertations December 2016 Stage-Specific Predictive Models for Cancer Survivability Elham Sagheb Hossein Pour University of Wisconsin-Milwaukee

More information

A NOVEL VARIABLE SELECTION METHOD BASED ON FREQUENT PATTERN TREE FOR REAL-TIME TRAFFIC ACCIDENT RISK PREDICTION

A NOVEL VARIABLE SELECTION METHOD BASED ON FREQUENT PATTERN TREE FOR REAL-TIME TRAFFIC ACCIDENT RISK PREDICTION OPT-i An International Conference on Engineering and Applied Sciences Optimization M. Papadrakakis, M.G. Karlaftis, N.D. Lagaros (eds.) Kos Island, Greece, 4-6 June 2014 A NOVEL VARIABLE SELECTION METHOD

More information

Supplementary Materials

Supplementary Materials Supplementary Materials July 2, 2015 1 EEG-measures of consciousness Table 1 makes explicit the abbreviations of the EEG-measures. Their computation closely follows Sitt et al. (2014) (supplement). PE

More information

A Comparison of Collaborative Filtering Methods for Medication Reconciliation

A Comparison of Collaborative Filtering Methods for Medication Reconciliation A Comparison of Collaborative Filtering Methods for Medication Reconciliation Huanian Zheng, Rema Padman, Daniel B. Neill The H. John Heinz III College, Carnegie Mellon University, Pittsburgh, PA, 15213,

More information

International Journal of Pharma and Bio Sciences A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS ABSTRACT

International Journal of Pharma and Bio Sciences A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS ABSTRACT Research Article Bioinformatics International Journal of Pharma and Bio Sciences ISSN 0975-6299 A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS D.UDHAYAKUMARAPANDIAN

More information

NNRTI Resistance NORTHWEST AIDS EDUCATION AND TRAINING CENTER

NNRTI Resistance NORTHWEST AIDS EDUCATION AND TRAINING CENTER NORTHWEST AIDS EDUCATION AND TRAINING CENTER NNRTI Resistance Brian R. Wood, MD Medical Director, NW AETC ECHO Assistant Professor of Medicine, University of Washington Presentation prepared by: Brian

More information

The role of E148Q in FMF. Elon Pras Institute of Human Genetics Sheba Medical Center

The role of E148Q in FMF. Elon Pras Institute of Human Genetics Sheba Medical Center The role of E148Q in FMF Elon Pras Institute of Human Genetics Sheba Medical Center Familial Mediterranean Fever (FMF) Acute attacks of fever accompanied by: Peritonitis Pleuritis Arthritis Erysipelas

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017 RESEARCH ARTICLE Classification of Cancer Dataset in Data Mining Algorithms Using R Tool P.Dhivyapriya [1], Dr.S.Sivakumar [2] Research Scholar [1], Assistant professor [2] Department of Computer Science

More information

CANCER DIAGNOSIS USING DATA MINING TECHNOLOGY

CANCER DIAGNOSIS USING DATA MINING TECHNOLOGY CANCER DIAGNOSIS USING DATA MINING TECHNOLOGY Muhammad Shahbaz 1, Shoaib Faruq 2, Muhammad Shaheen 1, Syed Ather Masood 2 1 Department of Computer Science and Engineering, UET, Lahore, Pakistan Muhammad.Shahbaz@gmail.com,

More information

Modeling Sentiment with Ridge Regression

Modeling Sentiment with Ridge Regression Modeling Sentiment with Ridge Regression Luke Segars 2/20/2012 The goal of this project was to generate a linear sentiment model for classifying Amazon book reviews according to their star rank. More generally,

More information

Medical Policy An independent licensee of the Blue Cross Blue Shield Association

Medical Policy An independent licensee of the Blue Cross Blue Shield Association Cystic Fibrosis Transmembrane Page 1 of 11 Medical Policy An independent licensee of the Blue Cross Blue Shield Association Title: Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) Prime Therapeutics

More information

J. A. Mayfield et al. FIGURE S1. Methionine Salvage. Methylthioadenosine. Methionine. AdoMet. Folate Biosynthesis. Methylation SAH.

J. A. Mayfield et al. FIGURE S1. Methionine Salvage. Methylthioadenosine. Methionine. AdoMet. Folate Biosynthesis. Methylation SAH. FIGURE S1 Methionine Salvage Methionine Methylthioadenosine AdoMet Folate Biosynthesis Methylation SAH Homocysteine Homocystine CBS Cystathionine Cysteine Glutathione Figure S1 Biochemical pathway of relevant

More information

Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes

Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes J. Sujatha Research Scholar, Vels University, Assistant Professor, Post Graduate

More information

7.1 Grading Diabetic Retinopathy

7.1 Grading Diabetic Retinopathy Chapter 7 DIABETIC RETINOPATHYGRADING -------------------------------------------------------------------------------------------------------------------------------------- A consistent approach to the

More information

A case study in Pulse Polio Immunisation

A case study in Pulse Polio Immunisation Chapter 5 A case study in Pulse Polio Immunisation 5.1 Introduction This chapter features a worked example of Bayesian graphical modelling. Our data for this exercise are (i) the antibody measurements

More information

Vertical Magnetic Separation of Circulating Tumor Cells and Somatic Genomic-Alteration Analysis in Lung Cancer Patients

Vertical Magnetic Separation of Circulating Tumor Cells and Somatic Genomic-Alteration Analysis in Lung Cancer Patients Vertical Magnetic Separation of Circulating Cells and Somatic Genomic-Alteration Analysis in Lung Cancer Patients Chang Eun Yoo 1,2#, Jong-Myeon Park 3#, Hui-Sung Moon 1,2, Je-Gun Joung 2, Dae-Soon Son

More information

Evaluating Classifiers for Disease Gene Discovery

Evaluating Classifiers for Disease Gene Discovery Evaluating Classifiers for Disease Gene Discovery Kino Coursey Lon Turnbull khc0021@unt.edu lt0013@unt.edu Abstract Identification of genes involved in human hereditary disease is an important bioinfomatics

More information

Feature Diminution by Ant Colonized Relative Reduct Algorithm for improving the Success Rate for IVF Treatment

Feature Diminution by Ant Colonized Relative Reduct Algorithm for improving the Success Rate for IVF Treatment Feature Diminution by Ant Colonized Relative Reduct for improving the Success Rate for IVF Treatment Dr. M. Durairaj 1 Assistant Professor School of Comp. Sci., Engg, & Applications, Bharathidasan University,

More information

Cardiac Arrest Prediction to Prevent Code Blue Situation

Cardiac Arrest Prediction to Prevent Code Blue Situation Cardiac Arrest Prediction to Prevent Code Blue Situation Mrs. Vidya Zope 1, Anuj Chanchlani 2, Hitesh Vaswani 3, Shubham Gaikwad 4, Kamal Teckchandani 5 1Assistant Professor, Department of Computer Engineering,

More information

Gene Selection for Tumor Classification Using Microarray Gene Expression Data

Gene Selection for Tumor Classification Using Microarray Gene Expression Data Gene Selection for Tumor Classification Using Microarray Gene Expression Data K. Yendrapalli, R. Basnet, S. Mukkamala, A. H. Sung Department of Computer Science New Mexico Institute of Mining and Technology

More information

When Overlapping Unexpectedly Alters the Class Imbalance Effects

When Overlapping Unexpectedly Alters the Class Imbalance Effects When Overlapping Unexpectedly Alters the Class Imbalance Effects V. García 1,2, R.A. Mollineda 2,J.S.Sánchez 2,R.Alejo 1,2, and J.M. Sotoca 2 1 Lab. Reconocimiento de Patrones, Instituto Tecnológico de

More information

Transmission of integrase resistance HIV

Transmission of integrase resistance HIV Transmission of integrase resistance HIV Charles Boucher, MD, PhD Clinical Virology, Dept. Viroscience, Erasmus Medical Center, Erasmus Universiy, The Netherlands Major resistance mutations (Stanford)

More information

Supporting Information Identification of Amino Acids with Sensitive Nanoporous MoS 2 : Towards Machine Learning-Based Prediction

Supporting Information Identification of Amino Acids with Sensitive Nanoporous MoS 2 : Towards Machine Learning-Based Prediction Supporting Information Identification of Amino Acids with Sensitive Nanoporous MoS 2 : Towards Machine Learning-Based Prediction Amir Barati Farimani, Mohammad Heiranian, Narayana R. Aluru 1 Department

More information

A scored AUC Metric for Classifier Evaluation and Selection

A scored AUC Metric for Classifier Evaluation and Selection A scored AUC Metric for Classifier Evaluation and Selection Shaomin Wu SHAOMIN.WU@READING.AC.UK School of Construction Management and Engineering, The University of Reading, Reading RG6 6AW, UK Peter Flach

More information

A Naïve Bayesian Classifier for Educational Qualification

A Naïve Bayesian Classifier for Educational Qualification Indian Journal of Science and Technology, Vol 8(16, DOI: 10.17485/ijst/2015/v8i16/62055, July 2015 ISSN (Print : 0974-6846 ISSN (Online : 0974-5645 A Naïve Bayesian Classifier for Educational Qualification

More information

3D Deep Learning for Multi-modal Imaging-Guided Survival Time Prediction of Brain Tumor Patients

3D Deep Learning for Multi-modal Imaging-Guided Survival Time Prediction of Brain Tumor Patients 3D Deep Learning for Multi-modal Imaging-Guided Survival Time Prediction of Brain Tumor Patients Dong Nie 1,2, Han Zhang 1, Ehsan Adeli 1, Luyan Liu 1, and Dinggang Shen 1(B) 1 Department of Radiology

More information

Rajiv Gandhi College of Engineering, Chandrapur

Rajiv Gandhi College of Engineering, Chandrapur Utilization of Data Mining Techniques for Analysis of Breast Cancer Dataset Using R Keerti Yeulkar 1, Dr. Rahila Sheikh 2 1 PG Student, 2 Head of Computer Science and Studies Rajiv Gandhi College of Engineering,

More information

HIV-1 resistance testing from proviral DNA

HIV-1 resistance testing from proviral DNA HIV-1 resistance testing from proviral DNA Alexander Thielen AREVIR 2018 Resistance testing from proviral DNA? amount of samples with low viral loads increasing desire to switch under successful therapy

More information

Applying Data Mining for Epileptic Seizure Detection

Applying Data Mining for Epileptic Seizure Detection Applying Data Mining for Epileptic Seizure Detection Ying-Fang Lai 1 and Hsiu-Sen Chiang 2* 1 Department of Industrial Education, National Taiwan Normal University 162, Heping East Road Sec 1, Taipei,

More information

Machine learning II. Juhan Ernits ITI8600

Machine learning II. Juhan Ernits ITI8600 Machine learning II Juhan Ernits ITI8600 Hand written digit recognition 64 Example 2: Face recogition Classification, regression or unsupervised? How many classes? Example 2: Face recognition Classification,

More information

Analysis of Classification Algorithms towards Breast Tissue Data Set

Analysis of Classification Algorithms towards Breast Tissue Data Set Analysis of Classification Algorithms towards Breast Tissue Data Set I. Ravi Assistant Professor, Department of Computer Science, K.R. College of Arts and Science, Kovilpatti, Tamilnadu, India Abstract

More information

Susanne Schnittger. Workflow of molecular investigations in JAK2-negative MPNs - the Munich experience

Susanne Schnittger. Workflow of molecular investigations in JAK2-negative MPNs - the Munich experience Susanne Schnittger Workflow of molecular investigations in JAK2negative MPNs the Munich experience Cohort single centre experience to apply new markers in a daily diagnostic work flow total: 20,547 cases

More information

Deliverable 2.1 List of relevant genetic variants for pre-emptive PGx testing

Deliverable 2.1 List of relevant genetic variants for pre-emptive PGx testing GA N 668353 H2020 Research and Innovation Deliverable 2.1 List of relevant genetic variants for pre-emptive PGx testing WP N and Title: WP2 - Towards shared European Guidelines for PGx Lead beneficiary:

More information

4. Model evaluation & selection

4. Model evaluation & selection Foundations of Machine Learning CentraleSupélec Fall 2017 4. Model evaluation & selection Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr

More information

Dynamic Stochastic Synapses as Computational Units

Dynamic Stochastic Synapses as Computational Units Dynamic Stochastic Synapses as Computational Units Wolfgang Maass Institute for Theoretical Computer Science Technische Universitat Graz A-B01O Graz Austria. email: maass@igi.tu-graz.ac.at Anthony M. Zador

More information

Comparing Multifunctionality and Association Information when Classifying Oncogenes and Tumor Suppressor Genes

Comparing Multifunctionality and Association Information when Classifying Oncogenes and Tumor Suppressor Genes 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

COMPARISON OF HIV DRUG-RESISTANT MUTANT DETECTION BY NGS WITH AND WITHOUT UNIQUE MOLECULAR IDENTIFIERS (UMI)

COMPARISON OF HIV DRUG-RESISTANT MUTANT DETECTION BY NGS WITH AND WITHOUT UNIQUE MOLECULAR IDENTIFIERS (UMI) COMPARISON OF HIV DRUG-RESISTANT MUTANT DETECTION BY NGS WITH AND WITHOUT UNIQUE MOLECULAR IDENTIFIERS (UMI) Kevin D McCormick; Kerri J Penrose; Rahil Sethi; Jacob Waldman; William Schwarzmann; Uma Chandran;

More information

CSE 258 Lecture 1.5. Web Mining and Recommender Systems. Supervised learning Regression

CSE 258 Lecture 1.5. Web Mining and Recommender Systems. Supervised learning Regression CSE 258 Lecture 1.5 Web Mining and Recommender Systems Supervised learning Regression What is supervised learning? Supervised learning is the process of trying to infer from labeled data the underlying

More information

Comparative Analysis of Machine Learning Algorithms for Chronic Kidney Disease Detection using Weka

Comparative Analysis of Machine Learning Algorithms for Chronic Kidney Disease Detection using Weka I J C T A, 10(8), 2017, pp. 59-67 International Science Press ISSN: 0974-5572 Comparative Analysis of Machine Learning Algorithms for Chronic Kidney Disease Detection using Weka Milandeep Arora* and Ajay

More information

Fabry Disease X-linked genetic, multi-organ disorder. Fabry disease screening program in Hypertrophic p Cardiomyopathy: preliminary results.

Fabry Disease X-linked genetic, multi-organ disorder. Fabry disease screening program in Hypertrophic p Cardiomyopathy: preliminary results. Fabry Disease X-linked genetic, multi-organ disorder Fabry disease screening program in Hypertrophic p Cardiomyopathy: y preliminary results. Globotriaosylceramide, GL3 Brain -galactosidase A Eyes Lactosylceramide

More information

Classification of Smoking Status: The Case of Turkey

Classification of Smoking Status: The Case of Turkey Classification of Smoking Status: The Case of Turkey Zeynep D. U. Durmuşoğlu Department of Industrial Engineering Gaziantep University Gaziantep, Turkey unutmaz@gantep.edu.tr Pınar Kocabey Çiftçi Department

More information

How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection

How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection Esma Nur Cinicioglu * and Gülseren Büyükuğur Istanbul University, School of Business, Quantitative Methods

More information

Classification and Predication of Breast Cancer Risk Factors Using Id3

Classification and Predication of Breast Cancer Risk Factors Using Id3 The International Journal Of Engineering And Science (IJES) Volume 5 Issue 11 Pages PP 29-33 2016 ISSN (e): 2319 1813 ISSN (p): 2319 1805 Classification and Predication of Breast Cancer Risk Factors Using

More information

The Development and Application of Bayesian Networks Used in Data Mining Under Big Data

The Development and Application of Bayesian Networks Used in Data Mining Under Big Data 2017 International Conference on Arts and Design, Education and Social Sciences (ADESS 2017) ISBN: 978-1-60595-511-7 The Development and Application of Bayesian Networks Used in Data Mining Under Big Data

More information

Statistical Analysis Using Machine Learning Approach for Multiple Imputation of Missing Data

Statistical Analysis Using Machine Learning Approach for Multiple Imputation of Missing Data Statistical Analysis Using Machine Learning Approach for Multiple Imputation of Missing Data S. Kanchana 1 1 Assistant Professor, Faculty of Science and Humanities SRM Institute of Science & Technology,

More information

Changing demographics of smoking and its effects during therapy

Changing demographics of smoking and its effects during therapy Changing demographics of smoking and its effects during therapy Egbert F. Smit MD PhD. Dept. Pulmonary Diseases, Vrije Universiteit Medical Centre, Amsterdam, The Netherlands Smoking prevalence adults

More information

Automated Medical Diagnosis using K-Nearest Neighbor Classification

Automated Medical Diagnosis using K-Nearest Neighbor Classification (IMPACT FACTOR 5.96) Automated Medical Diagnosis using K-Nearest Neighbor Classification Zaheerabbas Punjani 1, B.E Student, TCET Mumbai, Maharashtra, India Ankush Deora 2, B.E Student, TCET Mumbai, Maharashtra,

More information

3. Model evaluation & selection

3. Model evaluation & selection Foundations of Machine Learning CentraleSupélec Fall 2016 3. Model evaluation & selection Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr

More information

INTRODUCTION TO MACHINE LEARNING. Decision tree learning

INTRODUCTION TO MACHINE LEARNING. Decision tree learning INTRODUCTION TO MACHINE LEARNING Decision tree learning Task of classification Automatically assign class to observations with features Observation: vector of features, with a class Automatically assign

More information

AN INVESTIGATION OF MULTI-LABEL CLASSIFICATION TECHNIQUES FOR PREDICTING HIV DRUG RESISTANCE IN RESOURCE-LIMITED SETTINGS

AN INVESTIGATION OF MULTI-LABEL CLASSIFICATION TECHNIQUES FOR PREDICTING HIV DRUG RESISTANCE IN RESOURCE-LIMITED SETTINGS AN INVESTIGATION OF MULTI-LABEL CLASSIFICATION TECHNIQUES FOR PREDICTING HIV DRUG RESISTANCE IN RESOURCE-LIMITED SETTINGS by Pascal Brandt Submitted in fulfilment of the academic requirements for the degree

More information

Recognition of HIV-1 subtypes and antiretroviral drug resistance using weightless neural networks

Recognition of HIV-1 subtypes and antiretroviral drug resistance using weightless neural networks Recognition of HIV-1 subtypes and antiretroviral drug resistance using weightless neural networks Caio R. Souza 1, Flavio F. Nobre 1, Priscila V.M. Lima 2, Robson M. Silva 2, Rodrigo M. Brindeiro 3, Felipe

More information

Yeast Cells Classification Machine Learning Approach to Discriminate Saccharomyces cerevisiae Yeast Cells Using Sophisticated Image Features.

Yeast Cells Classification Machine Learning Approach to Discriminate Saccharomyces cerevisiae Yeast Cells Using Sophisticated Image Features. Yeast Cells Classification Machine Learning Approach to Discriminate Saccharomyces cerevisiae Yeast Cells Using Sophisticated Image Features. Mohamed Tleis Supervisor: Fons J. Verbeek Leiden University

More information

Medical Policy An independent licensee of the Blue Cross Blue Shield Association

Medical Policy An independent licensee of the Blue Cross Blue Shield Association Cystic Fibrosis Transmembrane Page 1 of 13 Medical Policy An independent licensee of the Blue Cross Blue Shield Association Title: Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) Prime Therapeutics

More information

Measuring Focused Attention Using Fixation Inner-Density

Measuring Focused Attention Using Fixation Inner-Density Measuring Focused Attention Using Fixation Inner-Density Wen Liu, Mina Shojaeizadeh, Soussan Djamasbi, Andrew C. Trapp User Experience & Decision Making Research Laboratory, Worcester Polytechnic Institute

More information

Natural Scene Statistics and Perception. W.S. Geisler

Natural Scene Statistics and Perception. W.S. Geisler Natural Scene Statistics and Perception W.S. Geisler Some Important Visual Tasks Identification of objects and materials Navigation through the environment Estimation of motion trajectories and speeds

More information

PA Update: Oral Cystic Fibrosis Modulators

PA Update: Oral Cystic Fibrosis Modulators Copyright 2012 Oregon State University. All Rights Reserved Drug Use Research & Management Program Oregon State University, 500 Summer Street NE, E35 Salem, Oregon 97301-1079 Phone 503-947-5220 Fax 503-947-1119

More information

Using AUC and Accuracy in Evaluating Learning Algorithms

Using AUC and Accuracy in Evaluating Learning Algorithms 1 Using AUC and Accuracy in Evaluating Learning Algorithms Jin Huang Charles X. Ling Department of Computer Science The University of Western Ontario London, Ontario, Canada N6A 5B7 fjhuang, clingg@csd.uwo.ca

More information

Fast Affinity Propagation Clustering based on Machine Learning

Fast Affinity Propagation Clustering based on Machine Learning www.ijcsi.org 302 Fast Affinity Propagation Clustering based on Machine Learning Shailendra Kumar Shrivastava 1, Dr. J.L. Rana 2 and Dr. R.C. Jain 3 1 Samrat Ashok Technological Institute Vidisha, Madhya

More information

Identification of Tissue Independent Cancer Driver Genes

Identification of Tissue Independent Cancer Driver Genes Identification of Tissue Independent Cancer Driver Genes Alexandros Manolakos, Idoia Ochoa, Kartik Venkat Supervisor: Olivier Gevaert Abstract Identification of genomic patterns in tumors is an important

More information

Binary Classification of Cognitive Disorders: Investigation on the Effects of Protein Sequence Properties in Alzheimer s and Parkinson s Disease

Binary Classification of Cognitive Disorders: Investigation on the Effects of Protein Sequence Properties in Alzheimer s and Parkinson s Disease Binary Classification of Cognitive Disorders: Investigation on the Effects of Protein Sequence Properties in Alzheimer s and Parkinson s Disease Shomona Gracia Jacob, Member, IAENG, Tejeswinee K Abstract

More information

Statistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes.

Statistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes. Final review Based in part on slides from textbook, slides of Susan Holmes December 5, 2012 1 / 1 Final review Overview Before Midterm General goals of data mining. Datatypes. Preprocessing & dimension

More information

CPTR title slide. A Standardized System for Grading Mutations in Mycobacterium tuberculosis for Association with Drug Resistance

CPTR title slide. A Standardized System for Grading Mutations in Mycobacterium tuberculosis for Association with Drug Resistance CPTR title slide A Standardized System for Grading Mutations in Mycobacterium tuberculosis for Association with Drug Resistance PAOLO MIOTTO CPTR 2017 Workshop, March 20 23 The Need A lack of user-friendly

More information

Detection of Abnormalities of Retina Due to Diabetic Retinopathy and Age Related Macular Degeneration Using SVM

Detection of Abnormalities of Retina Due to Diabetic Retinopathy and Age Related Macular Degeneration Using SVM Science Journal of Circuits, Systems and Signal Processing 2016; 5(1): 1-7 http://www.sciencepublishinggroup.com/j/cssp doi: 10.11648/j.cssp.20160501.11 ISSN: 2326-9065 (Print); ISSN: 2326-9073 (Online)

More information

A Genetic Approach to the Treatment of Cystic Fibrosis

A Genetic Approach to the Treatment of Cystic Fibrosis A Genetic Approach to the Treatment of Cystic Fibrosis Peter Mueller, PhD Chief Scientific Officer and Executive Vice President Global Research and Development Vertex Pharmaceuticals, Incorporated March

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature23291 Supplementary discussion part 1 Our model suggests that, in tumours that express Class 3 BRAF mutants and activated RAS, both BRAF MUT /RAF WT heterodimers and WT RAF dimers are

More information

VISUAL SALIENCY BASED BRIGHT LESION DETECTION AND DISCRIMINATION IN RETINAL IMAGES

VISUAL SALIENCY BASED BRIGHT LESION DETECTION AND DISCRIMINATION IN RETINAL IMAGES VISUAL SALIENCY BASED BRIGHT LESION DETECTION AND DISCRIMINATION IN RETINAL IMAGES by ujjwal, Sai Deepak, Arunava Chakravarty, Jayanthi Sivaswamy in IEEE 10th International Symposium on Biomedical Imaging

More information

Resistance to Integrase Strand Transfer Inhibitors

Resistance to Integrase Strand Transfer Inhibitors NORTHWEST AIDS EDUCATION AND TRAINING CENTER Resistance to Integrase Strand Transfer Inhibitors David Spach, MD Clinical Director, Northwest AETC Professor of Medicine, Division of Infectious Diseases

More information

Contents. Just Classifier? Rules. Rules: example. Classification Rule Generation for Bioinformatics. Rule Extraction from a trained network

Contents. Just Classifier? Rules. Rules: example. Classification Rule Generation for Bioinformatics. Rule Extraction from a trained network Contents Classification Rule Generation for Bioinformatics Hyeoncheol Kim Rule Extraction from Neural Networks Algorithm Ex] Promoter Domain Hybrid Model of Knowledge and Learning Knowledge refinement

More information

FUZZY DATA MINING FOR HEART DISEASE DIAGNOSIS

FUZZY DATA MINING FOR HEART DISEASE DIAGNOSIS FUZZY DATA MINING FOR HEART DISEASE DIAGNOSIS S.Jayasudha Department of Mathematics Prince Shri Venkateswara Padmavathy Engineering College, Chennai. ABSTRACT: We address the problem of having rigid values

More information

Transfer Learning Approaches to Improve Drug Sensitivity Prediction in Multiple Myeloma Patients

Transfer Learning Approaches to Improve Drug Sensitivity Prediction in Multiple Myeloma Patients Received January 27, 217, accepted March 21, 217, date of publication April 24, 217, date of current version June 7, 217. Digital Object Identifier 1.119/ACCESS.217.2696523 Transfer Learning Approaches

More information

A DATA MINING APPROACH FOR PRECISE DIAGNOSIS OF DENGUE FEVER

A DATA MINING APPROACH FOR PRECISE DIAGNOSIS OF DENGUE FEVER A DATA MINING APPROACH FOR PRECISE DIAGNOSIS OF DENGUE FEVER M.Bhavani 1 and S.Vinod kumar 2 International Journal of Latest Trends in Engineering and Technology Vol.(7)Issue(4), pp.352-359 DOI: http://dx.doi.org/10.21172/1.74.048

More information

Technical Specifications

Technical Specifications Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically

More information

Mapping Evolutionary Pathways of HIV-1 Drug Resistance. Christopher Lee, UCLA Dept. of Chemistry & Biochemistry

Mapping Evolutionary Pathways of HIV-1 Drug Resistance. Christopher Lee, UCLA Dept. of Chemistry & Biochemistry Mapping Evolutionary Pathways of HIV-1 Drug Resistance Christopher Lee, UCLA Dept. of Chemistry & Biochemistry Stalemate: We React to them, They React to Us E.g. a virus attacks us, so we develop a drug,

More information

A Bayesian Network Analysis of Eyewitness Reliability: Part 1

A Bayesian Network Analysis of Eyewitness Reliability: Part 1 A Bayesian Network Analysis of Eyewitness Reliability: Part 1 Jack K. Horner PO Box 266 Los Alamos NM 87544 jhorner@cybermesa.com ICAI 2014 Abstract In practice, many things can affect the verdict in a

More information

An Improved Algorithm To Predict Recurrence Of Breast Cancer

An Improved Algorithm To Predict Recurrence Of Breast Cancer An Improved Algorithm To Predict Recurrence Of Breast Cancer Umang Agrawal 1, Ass. Prof. Ishan K Rajani 2 1 M.E Computer Engineer, Silver Oak College of Engineering & Technology, Gujarat, India. 2 Assistant

More information

Sudden Cardiac Arrest Prediction Using Predictive Analytics

Sudden Cardiac Arrest Prediction Using Predictive Analytics Received: February 14, 2017 184 Sudden Cardiac Arrest Prediction Using Predictive Analytics Anurag Bhatt 1, Sanjay Kumar Dubey 1, Ashutosh Kumar Bhatt 2 1 Amity University Uttar Pradesh, Noida, India 2

More information

PROFILE SIMILARITY IN BIOEQUIVALENCE TRIALS

PROFILE SIMILARITY IN BIOEQUIVALENCE TRIALS Sankhyā : The Indian Journal of Statistics Special Issue on Biostatistics 2000, Volume 62, Series B, Pt. 1, pp. 149 161 PROFILE SIMILARITY IN BIOEQUIVALENCE TRIALS By DAVID T. MAUGER and VERNON M. CHINCHILLI

More information