REINVENTING THE BIOMARKER PANEL DISCOVERY EXPERIENCE


Biomarker discovery has opened new realms in the medical industry, from patient diagnosis and treatment to drug development and testing. However, despite these advances, the capacity to discover biomarker panels has often been constrained by the methodologies employed.

Current approaches to biomarker panel discovery

A number of different machine learning, clustering and statistical approaches can be used for biomarker selection, including traditional methods such as top scoring pair (TSP), decision trees (DT), naïve Bayes (NB), prediction analysis of microarrays (PAM), support vector machines (SVM) and others. But these traditional methods can be difficult to interpret, use many biomarkers, and yield low accuracy, sensitivity and specificity. For the medical industry, from diagnostics companies to pharmaceutical developers to labs, this translates into a costly process and a harder path through regulatory approval.

One weakness of traditional biomarker discovery techniques is the univariate approach. Testing individual biomarkers one at a time is not only cumbersome and costly; it also neglects the complex, interrelated nature of those markers. Capturing the relationships between multiple biomarkers allows a more nuanced and precise evaluation, one that takes into account the interactions between potential biomarkers in determining patient outcomes.

Another weakness of traditional biomarker discovery lies in the statistical techniques typically employed. These methods rest on numerous assumptions, which can limit how much of the information embedded in the data is actually used and cloud the results.
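The point about relationships between markers can be illustrated with a small sketch. The data below are made up for illustration, and the marker names and thresholds are hypothetical; this is not SimplicityBio data or the BOSS method. Each marker taken alone separates the classes no better than chance, while a rule combining both separates them completely.

```python
# Toy, made-up data: two hypothetical markers whose *combination* predicts class.
samples = [
    # (marker_a, marker_b, label)
    (0.1, 0.2, 0), (0.2, 0.1, 0), (0.9, 0.8, 0), (0.8, 0.9, 0),
    (0.1, 0.9, 1), (0.2, 0.8, 1), (0.9, 0.1, 1), (0.8, 0.2, 1),
]


def accuracy(predict):
    """Fraction of samples for which predict(a, b) equals the label."""
    hits = sum(1 for a, b, y in samples if predict(a, b) == y)
    return hits / len(samples)


def best_single_threshold(index):
    """Best accuracy reachable by thresholding one marker alone, in either direction."""
    best = 0.0
    for t in (0.0, 0.15, 0.5, 0.85, 1.0):
        for flip in (False, True):
            def predict(a, b, t=t, flip=flip):
                high = (a, b)[index] > t
                return int(high != flip)
            best = max(best, accuracy(predict))
    return best


def joint_rule(a, b):
    """Multivariate rule: class 1 when exactly one of the two markers is high."""
    return int((a > 0.5) != (b > 0.5))


univariate_a = best_single_threshold(0)
univariate_b = best_single_threshold(1)
multivariate = accuracy(joint_rule)
print(univariate_a, univariate_b, multivariate)
```

In this constructed case the two classes have identical value distributions on each marker, so no single-marker threshold can help; only the interaction carries the signal.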
The SimplicityBio Biomarker Optimization Software System

A new multivariable approach to biomarker discovery has emerged to resolve these weaknesses. Using SimplicityBio's proprietary Biomarker Optimization Software System (BOSS), we are able to balance accuracy against the number of biomarkers. The core of the system is the co-evolutionary fuzzy modeling method Fuzzy CoCo [1]. Around this method, several steps are performed to select the best combination of biomarkers. BOSS performs two phases:

1st, Exploratory modeling: Potential signatures are created by testing billions of panels of biomarkers with Fuzzy CoCo. Fuzzy CoCo uses an artificial evolution approach, which allows populations of signatures to evolve, mate, and migrate, with only the most robust signatures surviving at the end.

2nd, Signature selection: A reduced number of signatures are selected. This family of signatures covers several characteristics, so it is possible to have signatures that are more specific, more sensitive, or with fewer variables than others. The final selection is made taking into account the needs of the client.
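As a rough illustration of the evolutionary idea behind the exploratory phase, the sketch below evolves a population of candidate panels on synthetic data. It is a plain genetic algorithm with a nearest-centroid classifier, not Fuzzy CoCo: there are no fuzzy rules, no co-evolving species and no migration. The data, the informative marker indices, the size penalty and all function names are assumptions made for the example.

```python
import random

rng = random.Random(0)  # fixed seed so the sketch is repeatable

N_FEATURES = 8
INFORMATIVE = {2, 5}  # hypothetical: only these toy markers carry signal


def make_sample(label):
    """Toy sample: informative markers shift with the label, the rest are noise."""
    return [
        (label if j in INFORMATIVE else 0.0) + rng.gauss(0.0, 0.3)
        for j in range(N_FEATURES)
    ]


DATA = [(make_sample(i % 2), i % 2) for i in range(40)]


def centroid(rows, panel):
    """Mean value of each panel marker over the given rows."""
    return [sum(r[j] for r in rows) / len(rows) for j in panel]


def accuracy(panel):
    """Resubstitution accuracy of a nearest-centroid rule restricted to `panel`."""
    if not panel:
        return 0.0
    c0 = centroid([x for x, y in DATA if y == 0], panel)
    c1 = centroid([x for x, y in DATA if y == 1], panel)
    hits = 0
    for x, y in DATA:
        d0 = sum((x[j] - c) ** 2 for j, c in zip(panel, c0))
        d1 = sum((x[j] - c) ** 2 for j, c in zip(panel, c1))
        hits += int((d1 < d0) == bool(y))
    return hits / len(DATA)


def fitness(mask):
    """Reward accuracy and penalize panel size (the 0.02 weight is arbitrary)."""
    panel = [j for j, used in enumerate(mask) if used]
    return accuracy(panel) - 0.02 * len(panel)


def mutate(mask, rate=0.1):
    """Flip each bit of the panel mask with probability `rate`."""
    return tuple(bit ^ (rng.random() < rate) for bit in mask)


def crossover(a, b):
    """'Mate' two panels by splicing their masks at a random cut point."""
    cut = rng.randrange(1, N_FEATURES)
    return a[:cut] + b[cut:]


def evolve(pop_size=20, generations=30):
    population = [
        tuple(rng.randint(0, 1) for _ in range(N_FEATURES)) for _ in range(pop_size)
    ]
    history = []  # best fitness seen at the start of each generation
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        survivors = population[: pop_size // 2]  # only the fittest half survives
        children = [
            mutate(crossover(rng.choice(survivors), rng.choice(survivors)))
            for _ in range(pop_size - len(survivors))
        ]
        population = survivors + children
    return max(population, key=fitness), history


best_mask, history = evolve()
best_panel = [j for j, used in enumerate(best_mask) if used]
print(best_panel, round(fitness(best_mask), 3))
```

Because the fittest half is carried over unchanged, the best fitness never decreases from one generation to the next. The size penalty is what pushes the search toward compact panels.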

The success of BOSS lies in its ability to minimize the number of rules and variables used in multivariate signatures while maintaining high accuracy, sensitivity and specificity. The method yields a family of models, from which individual models can be selected to meet the specific needs of the client. Reducing the number of rules and variables in each of the family's signatures reduces testing costs, both on the development end and on the consumer end. A cleaner, more concise model can also aid developers in navigating the regulatory approval process.

Testing SimplicityBio's Biomarker Optimization Software

To test its efficacy, BOSS was compared with other biomarker discovery methods such as TSP, k-TSP, DT, NB, K-NN, PAM, SVM, MOE, Bagging C4.5, AdaBoost C4.5, KEM Biomarker from Ariana Pharma, AHC, Single C4.5, fsvm, and Fuzzy Logic on six seminal, published datasets. In these comparisons, BOSS consistently yields lower numbers of variables while matching or exceeding the accuracy of the other methods. Across the six datasets, BOSS achieved an accuracy of 94.14% or higher, which met or exceeded the accuracy of every other method it was compared to. But the key to BOSS's superiority is not just its accuracy; it is its ability to constrain the number of variables in each model.

LEUKEMIA (Golub et al. [2])

Includes 38 observations, each of which is described by the gene expression levels of 7,129 genes and a class attribute with two distinct labels: acute myeloid leukemia and lymphoblastic leukemia.

Acute myeloid and lymphoblastic leukemia (Golub et al.)

Method | Accuracy | Variables | Source
BOSS | 100.00% | 2 | SimplicityBio [1]
NB | 100.00% | * | Tan et al. [8]
SVM | 100.00% | 8 | Guyon et al. [9]
PAM | 97.22% | 2296 | Tan et al. [8]
k-TSP | 95.83% | 18 | Tan et al. [8]
K-NN | 84.82% | * | Tan et al. [8]
Fuzzy logic | 79.00% | 2 | Ohno-Machado et al. [10]
DT | 73.81% | 2 | Tan et al. [8]
In the comparison using the leukemia dataset, BOSS achieved an accuracy of 100% using 2 variables. SVM, another method that achieved this level of accuracy, used 8 variables. Other methods which used only 2 variables, Fuzzy logic and DT, only achieved accuracies of 79% and 73.81% respectively.
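Since the comparisons throughout are reported in terms of accuracy, sensitivity and specificity, a minimal sketch of how these three figures are derived from predictions may help. The labels and predictions below are invented for illustration and are unrelated to the datasets discussed here; label 1 is assumed to mean "disease" and 0 "control".

```python
def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives, treating label 1 as 'positive'."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def metrics(y_true, y_pred):
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),  # all correct calls
        "sensitivity": tp / (tp + fn),                # diseased samples detected
        "specificity": tn / (tn + fp),                # controls correctly cleared
    }


# Invented example: 4 diseased samples followed by 6 controls.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
result = metrics(y_true, y_pred)
print(result)
```

With unbalanced classes, accuracy alone can hide a weak sensitivity or specificity, which is why the three figures are usually read together.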

COLON CANCER (Alon et al. [3])

Includes 62 observations made up of 40 tumor samples and 22 normal samples. There are approximately 6,000 genes represented in each sample in the dataset.

Colon cancer (Alon et al.)

Method | Accuracy | Variables | Source
BOSS | 94.14% | 27 | SimplicityBio
TSP | 91.10% | 2 | Tan et al. [8]
k-TSP | 90.30% | 2 | Tan et al. [8]
Fuzzy logic | 90.00% | 17 | Huerta et al. [11]
PAM | 85.48% | 15 | Tan et al. [8]
SVM | 82.26% | * | Tan et al. [8]
DT | 80.65% | 3 | Tan et al. [8]
K-NN | 74.19% | * | Tan et al. [8]
NB | 58.06% | * | Tan et al. [8]

Despite using more variables, BOSS outperforms the other methods in terms of accuracy.

PROSTATE CANCER (Singh et al. [4])

Includes 52 prostate tumor samples and 50 non-tumor prostate samples with a total of 12,600 genes.

Prostate tumor (Singh et al.)

Method | Accuracy | Variables | Source
BOSS | 97.29% | 2 | SimplicityBio
TSP | 95.00% | 2 | Tan et al. [8]
MC-SVM | 92.00% | * | Statnikov et al. [12]
k-TSP | 91.00% | 2 | Tan et al. [8]
PAM | 91.00% | 47 | Tan et al. [8]
SVM | 91.00% | * | Tan et al. [8]
NN | 91.00% | * | Statnikov et al. [12]
DT | 87.00% | 4 | Tan et al. [8]
K-NN | 85.00% | * | Statnikov et al. [12]
NB | 62.00% | * | Tan et al. [8]

BOSS achieves an accuracy of 97.29% for the prostate cancer dataset using 2 variables. The other methods that use as few variables, TSP and k-TSP, reach lower accuracies of 95.00% and 91.00%.

LUNG CANCER (Gordon et al. [5])

Includes 181 tissue samples, made up of 31 malignant pleural mesothelioma samples and 150 lung adenocarcinoma samples, with a total of 12,533 genes.

Lung cancer (Gordon et al.)

Method | Accuracy | Variables | Source
BOSS | 100.00% | 2 | SimplicityBio
PAM | 99.45% | 15 | Tan et al. [8]
SVM | 99.45% | * | Tan et al. [8]
k-TSP | 98.90% | 2 | Tan et al. [8]
K-NN | 98.34% | * | Tan et al. [8]
TSP | 98.30% | 2 | Tan et al. [8]
NB | 97.79% | * | Tan et al. [8]
DT | 96.13% | 3 | Tan et al. [8]
MOE | 91.00% | 2 | Wang & Palade [13]

BOSS is the only method tested on the lung cancer dataset to reach 100% accuracy, and it does so with 2 variables. Three methods use the same number of variables (k-TSP, TSP, and MOE) and DT uses 3, but they have lower accuracies of 98.90%, 98.30%, 91.00% and 96.13% respectively.

BREAST CANCER (van de Vijver et al. [6])

Includes 295 samples made up of 151 with lymph-node negative disease and 144 with lymph-node positive disease, with a total of 70 genes.

Breast cancer (van de Vijver et al. [6])

Method | Accuracy | Variables | Source
BOSS | 95.83% | 31 | SimplicityBio
Bagging C4.5 | % | * | Tan & Gilbert [14]
AdaBoost C4.5 | % | * | Tan & Gilbert [14]
BOSS | 87.50% | 9 | SimplicityBio
KEM Biomarker | 85.89% | 13 | Guergova-Kuras et al. [15]
AHC | 83.33% | 70 | van de Vijver et al. [6]
Single C4.5 | | * | Tan & Gilbert [14]
Single C4.5 | % | * | Tan & Gilbert [14]

Two signatures discovered by BOSS are presented here. The first has the highest accuracy (95.83%) but not the lowest number of variables. The second has fewer variables (9) and a higher accuracy than KEM Biomarker, which has the lowest number of variables among the other methods.
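The trade-off between the two BOSS signatures can be read as a choice among non-dominated options. The sketch below uses the four breast cancer rows that report both an accuracy and a variable count, and keeps only the signatures for which no other signature is at least as accurate with at most as many variables. The dominance rule, the variable budget and the function names are assumptions for illustration; the source does not describe how BOSS performs its own signature selection.

```python
# (name, accuracy in %, number of variables), taken from the breast cancer comparison.
signatures = [
    ("BOSS (31 variables)", 95.83, 31),
    ("BOSS (9 variables)", 87.50, 9),
    ("KEM Biomarker", 85.89, 13),
    ("AHC", 83.33, 70),
]


def dominates(a, b):
    """True if `a` is no worse than `b` on both criteria and strictly better on one."""
    _, acc_a, n_a = a
    _, acc_b, n_b = b
    return acc_a >= acc_b and n_a <= n_b and (acc_a > acc_b or n_a < n_b)


def pareto_front(items):
    """Keep the signatures that no other signature dominates."""
    return [s for s in items if not any(dominates(o, s) for o in items)]


def pick(front, max_variables):
    """Hypothetical client need: the most accurate signature within a variable budget."""
    eligible = [s for s in front if s[2] <= max_variables]
    return max(eligible, key=lambda s: s[1]) if eligible else None


front = pareto_front(signatures)
choice = pick(front, max_variables=10)
print([name for name, _, _ in front], choice)
```

Both BOSS signatures remain on the front because neither beats the other on accuracy and size at once; a client constrained to ten variables would take the 9-variable signature.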

OVARIAN CANCER (Zhou et al. [7])

Includes 94 samples, made up of 44 samples from women diagnosed with serous papillary ovarian cancer and 50 from healthy women, with a total of 3,017 mass spectrometry signatures.

Ovarian cancer (Zhou et al. [7])

Method | Accuracy | Variables | Source
BOSS | 100.00% | 10 | SimplicityBio [1]
fsvm | 100.00% | 3017 | Zhou et al. [7]
KEM Biomarker | 92.97% | 13 | Guergova-Kuras et al. [15]

The ovarian cancer dataset exemplifies the importance of reducing the number of variables used in modeling. While fsvm matches BOSS with an accuracy of 100%, it uses roughly 300 times as many variables (3,017 versus 10).

As exemplified by these six datasets, BOSS consistently has the highest accuracy of any method tested, with lower or comparable numbers of variables used. Even when BOSS uses slightly more variables, an increase of 1 to 2 variables is a modest tradeoff for higher accuracy. When minimizing the number of variables is the goal, BOSS can still produce high accuracy.

Summary

BOSS is the next stage in the evolution of biomarker discovery technology. The co-evolutionary engine behind BOSS continually drives discovery models toward more elegant, simple, and powerful solutions to better meet the needs of clients.

About SimplicityBio

SimplicityBio is a Swiss biomarker panel discovery company. SimplicityBio's Biomarker Optimization Software System (BOSS) allows you to take full advantage of multiple data types and unbalanced data sets, while answering your production, regulatory and IP requirements. To do so, BOSS discovers robust, highly specific and sensitive biomarker panels, leaving you to choose the one that answers your needs. BOSS brings a unique and powerful combination of machine learning, evolutionary algorithms and fuzzy logic to the biological world, and is thus able to discover new robust multi-biomarker panels and improve existing ones. Our clients and partners range from research institutions to diagnostic, companion diagnostic, prognostic and pharmaceutical companies.

Contact us:
Route de l'ile-aux-bois 1A
1870 Monthey
Switzerland
info@simplicitybio.com
visit:

References

[1] Barreto-Sanz, M. A., Bujard, A., & Pena-Reyes, C. A. (2012, November). Evolving very-compact fuzzy models for gene expression data analysis. In Bioinformatics & Bioengineering (BIBE), 2012 IEEE 12th International Conference on. IEEE.
[2] Golub, T. R., Slonim, D. K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J. P., ... & Lander, E. S. (1999). Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science, 286(5439).
[3] Alon, U., Barkai, N., Notterman, D. A., Gish, K., Ybarra, S., Mack, D., & Levine, A. J. (1999). Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proceedings of the National Academy of Sciences, 96(12).
[4] Singh, D., Febbo, P. G., Ross, K., Jackson, D. G., Manola, J., Ladd, C., ... & Sellers, W. R. (2002). Gene expression correlates of clinical prostate cancer behavior. Cancer Cell, 1(2).
[5] Gordon, G. J., Jensen, R. V., Hsiao, L. L., Gullans, S. R., Blumenstock, J. E., Ramaswamy, S., ... & Bueno, R. (2002). Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma. Cancer Research, 62(17).
[6] Van De Vijver, M. J., He, Y. D., van't Veer, L. J., Dai, H., Hart, A. A., Voskuil, D. W., ... & Bernards, R. (2002). A gene-expression signature as a predictor of survival in breast cancer. New England Journal of Medicine, 347(25).
[7] Zhou, M., Guan, W., Walker, L. D., Mezencev, R., Benigno, B. B., Gray, A., ... & McDonald, J. F. (2010). Rapid mass spectrometric metabolic profiling of blood sera detects ovarian cancer with high accuracy. Cancer Epidemiology Biomarkers & Prevention, 19(9).
[8] Tan, A. C., Naiman, D. Q., Xu, L., Winslow, R. L., & Geman, D. (2005). Simple decision rules for classifying human cancers from gene expression profiles. Bioinformatics, 21(20).
[9] Guyon, I., Weston, J., Barnhill, S., & Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Machine Learning, 46(1-3).
[10] Ohno-Machado, L., Vinterbo, S., & Weber, G. (2002). Classification of gene expression data using fuzzy logic. Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology, 12(1).
[11] Huerta, E., Duval, B., & Hao, J. K. (2008). Fuzzy logic for elimination of redundant information of microarray data. Genomics, Proteomics & Bioinformatics, 6(2).
[12] Statnikov, A., Aliferis, C. F., Tsamardinos, I., Hardin, D., & Levy, S. (2005). A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis. Bioinformatics, 21(5).
[13] Wang, Z., & Palade, V. (2010, December). Multi-objective evolutionary algorithms based interpretable fuzzy models for microarray gene expression data analysis. In Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on. IEEE.
[14] Tan, A. C., & Gilbert, D. (2003). Ensemble machine learning on gene expression data for cancer classification.
[15] Guergova-Kuras, M., Schneider, M. P., Jullian, N., & Afshar, M. (2014). 667: Shorter multimarker signatures: a new tool to facilitate cancer diagnosis. European Journal of Cancer, (50), S160.

APPENDIX A

Acronym | Technique or Platform
TSP | Top scoring pair
k-TSP | k-Top scoring pair
DT | C4.5 decision trees
NB | Naïve Bayes
K-NN | K-nearest neighbor
PAM | Prediction analysis of microarrays
SVM | Support vector machines
MOE | Multi-objective evolutionary algorithms and fuzzy logic
Bagging C4.5 |
AdaBoost C4.5 |
KEM Biomarker from Ariana Pharma | Knowledge Extraction and Management
AHC | Agglomerative hierarchical clustering algorithm
Single C4.5 |
fsvm | Functional support vector machine
Fuzzy Logic |
MC-SVM | Multiclass support vector machine
BOSS | Biomarker Optimization Software System


More information

Comparing Multifunctionality and Association Information when Classifying Oncogenes and Tumor Suppressor Genes

Comparing Multifunctionality and Association Information when Classifying Oncogenes and Tumor Suppressor Genes 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Cancer Gene Extraction Based on Stepwise Regression

Cancer Gene Extraction Based on Stepwise Regression Mathematical Computation Volume 5, 2016, PP.6-10 Cancer Gene Extraction Based on Stepwise Regression Jie Ni 1, Fan Wu 1, Meixiang Jin 1, Yixing Bai 1, Yunfei Guo 1 1. Mathematics Department, Yanbian University,

More information

Machine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017

Machine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017 Machine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017 A.K.A. Artificial Intelligence Unsupervised learning! Cluster analysis Patterns, Clumps, and Joining

More information

SVM-Kmeans: Support Vector Machine based on Kmeans Clustering for Breast Cancer Diagnosis

SVM-Kmeans: Support Vector Machine based on Kmeans Clustering for Breast Cancer Diagnosis SVM-Kmeans: Support Vector Machine based on Kmeans Clustering for Breast Cancer Diagnosis Walaa Gad Faculty of Computers and Information Sciences Ain Shams University Cairo, Egypt Email: walaagad [AT]

More information

MACHINE LEARNING BASED APPROACHES FOR CANCER CLASSIFICATION USING GENE EXPRESSION DATA

MACHINE LEARNING BASED APPROACHES FOR CANCER CLASSIFICATION USING GENE EXPRESSION DATA MACHINE LEARNING BASED APPROACHES FOR CANCER CLASSIFICATION USING GENE EXPRESSION DATA Amit Bhola 1 and Arvind Kumar Tiwari 2 1 Department of CSE, Kashi Institute of Technology, Varanasi, U.P., India 2

More information

Using CART to Mine SELDI ProteinChip Data for Biomarkers and Disease Stratification

Using CART to Mine SELDI ProteinChip Data for Biomarkers and Disease Stratification Using CART to Mine SELDI ProteinChip Data for Biomarkers and Disease Stratification Kenna Mawk, D.V.M. Informatics Product Manager Ciphergen Biosystems, Inc. Outline Introduction to ProteinChip Technology

More information

A Comparison of Collaborative Filtering Methods for Medication Reconciliation

A Comparison of Collaborative Filtering Methods for Medication Reconciliation A Comparison of Collaborative Filtering Methods for Medication Reconciliation Huanian Zheng, Rema Padman, Daniel B. Neill The H. John Heinz III College, Carnegie Mellon University, Pittsburgh, PA, 15213,

More information

Journal of Engineering Technology

Journal of Engineering Technology New approaches for gene selection and cancer diagnosis based on microarray gene expression profiling Sara Haddou Bouazza 1, Khalid Auhmani 2, Abdelouhab Zeroual 1 1 Department of Physics, Faculty of Sciences

More information

Detection of Cognitive States from fmri data using Machine Learning Techniques

Detection of Cognitive States from fmri data using Machine Learning Techniques Detection of Cognitive States from fmri data using Machine Learning Techniques Vishwajeet Singh, K.P. Miyapuram, Raju S. Bapi* University of Hyderabad Computational Intelligence Lab, Department of Computer

More information

International Journal of Advance Engineering and Research Development A THERORETICAL SURVEY ON BREAST CANCER PREDICTION USING DATA MINING TECHNIQUES

International Journal of Advance Engineering and Research Development A THERORETICAL SURVEY ON BREAST CANCER PREDICTION USING DATA MINING TECHNIQUES Scientific Journal of Impact Factor (SJIF): 4.14 e-issn: 2348-4470 p-issn: 2348-6406 International Journal of Advance Engineering and Research Development Volume 4, Issue 02 February -2018 A THERORETICAL

More information

Molecular classi cation of cancer types from microarray data using the combination of genetic algorithms and support vector machines

Molecular classi cation of cancer types from microarray data using the combination of genetic algorithms and support vector machines FEBS Letters 555 (2003) 358^362 FEBS 27869 Molecular classi cation of cancer types from microarray data using the combination of genetic algorithms and support vector machines Sihua Peng a, Qianghua Xu

More information

A Fuzzy Improved Neural based Soft Computing Approach for Pest Disease Prediction

A Fuzzy Improved Neural based Soft Computing Approach for Pest Disease Prediction International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 13 (2014), pp. 1335-1341 International Research Publications House http://www. irphouse.com A Fuzzy Improved

More information

Accurate molecular classification of cancer using simple rules.

Accurate molecular classification of cancer using simple rules. University of Nebraska Medical Center DigitalCommons@UNMC Journal Articles: Genetics, Cell Biology & Anatomy Genetics, Cell Biology & Anatomy 10-30-2009 Accurate molecular classification of cancer using

More information

Predicting Malignancy from Mammography Findings and Image Guided Core Biopsies

Predicting Malignancy from Mammography Findings and Image Guided Core Biopsies Predicting Malignancy from Mammography Findings and Image Guided Core Biopsies 2 nd Breast Cancer Workshop 2015 April 7 th 2015 Porto, Portugal Pedro Ferreira Nuno A. Fonseca Inês Dutra Ryan Woods Elizabeth

More information

CANCER DIAGNOSIS USING DATA MINING TECHNOLOGY

CANCER DIAGNOSIS USING DATA MINING TECHNOLOGY CANCER DIAGNOSIS USING DATA MINING TECHNOLOGY Muhammad Shahbaz 1, Shoaib Faruq 2, Muhammad Shaheen 1, Syed Ather Masood 2 1 Department of Computer Science and Engineering, UET, Lahore, Pakistan Muhammad.Shahbaz@gmail.com,

More information

National Surgical Adjuvant Breast and Bowel Project (NSABP) Foundation Annual Progress Report: 2009 Formula Grant

National Surgical Adjuvant Breast and Bowel Project (NSABP) Foundation Annual Progress Report: 2009 Formula Grant National Surgical Adjuvant Breast and Bowel Project (NSABP) Foundation Annual Progress Report: 2009 Formula Grant Reporting Period July 1, 2011 June 30, 2012 Formula Grant Overview The National Surgical

More information

Predicting Breast Cancer Recurrence Using Machine Learning Techniques

Predicting Breast Cancer Recurrence Using Machine Learning Techniques Predicting Breast Cancer Recurrence Using Machine Learning Techniques Umesh D R Department of Computer Science & Engineering PESCE, Mandya, Karnataka, India Dr. B Ramachandra Department of Electrical and

More information

An Efficient Diseases Classifier based on Microarray Datasets using Clustering ANOVA Extreme Learning Machine (CAELM)

An Efficient Diseases Classifier based on Microarray Datasets using Clustering ANOVA Extreme Learning Machine (CAELM) www.ijcsi.org 8 An Efficient Diseases Classifier based on Microarray Datasets using Clustering ANOVA Extreme Learning Machine (CAELM) Shamsan Aljamali 1, Zhang Zuping 2 and Long Jun 3 1 School of Information

More information

Package golubesets. August 16, 2014

Package golubesets. August 16, 2014 Package golubesets August 16, 2014 Version 1.6.0 Title exprsets for golub leukemia data Author Todd Golub Maintainer Vince Carey Description representation

More information

Gene expression analysis. Roadmap. Microarray technology: how it work Applications: what can we do with it Preprocessing: Classification Clustering

Gene expression analysis. Roadmap. Microarray technology: how it work Applications: what can we do with it Preprocessing: Classification Clustering Gene expression analysis Roadmap Microarray technology: how it work Applications: what can we do with it Preprocessing: Image processing Data normalization Classification Clustering Biclustering 1 Gene

More information

Automatic Detection of Epileptic Seizures in EEG Using Machine Learning Methods

Automatic Detection of Epileptic Seizures in EEG Using Machine Learning Methods Automatic Detection of Epileptic Seizures in EEG Using Machine Learning Methods Ying-Fang Lai 1 and Hsiu-Sen Chiang 2* 1 Department of Industrial Education, National Taiwan Normal University 162, Heping

More information

Inter-session reproducibility measures for high-throughput data sources

Inter-session reproducibility measures for high-throughput data sources Inter-session reproducibility measures for high-throughput data sources Milos Hauskrecht, PhD, Richard Pelikan, MSc Computer Science Department, Intelligent Systems Program, Department of Biomedical Informatics,

More information

Classification with microarray data

Classification with microarray data Classification with microarray data Aron Charles Eklund eklund@cbs.dtu.dk DNA Microarray Analysis - #27612 January 8, 2010 The rest of today Now: What is classification, and why do we do it? How to develop

More information

Class discovery in Gene Expression Data: Characterizing Splits by Support Vector Machines

Class discovery in Gene Expression Data: Characterizing Splits by Support Vector Machines Class discovery in Gene Expression Data: Characterizing Splits by Support Vector Machines Florian Markowetz and Anja von Heydebreck Max-Planck-Institute for Molecular Genetics Computational Molecular Biology

More information

Certificate Courses in Biostatistics

Certificate Courses in Biostatistics Certificate Courses in Biostatistics Term I : September December 2015 Term II : Term III : January March 2016 April June 2016 Course Code Module Unit Term BIOS5001 Introduction to Biostatistics 3 I BIOS5005

More information

ABSTRACT I. INTRODUCTION. Mohd Thousif Ahemad TSKC Faculty Nagarjuna Govt. College(A) Nalgonda, Telangana, India

ABSTRACT I. INTRODUCTION. Mohd Thousif Ahemad TSKC Faculty Nagarjuna Govt. College(A) Nalgonda, Telangana, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 1 ISSN : 2456-3307 Data Mining Techniques to Predict Cancer Diseases

More information

A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer

A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer Hautaniemi, Sampsa; Ringnér, Markus; Kauraniemi, Päivikki; Kallioniemi, Anne; Edgren, Henrik; Yli-Harja, Olli; Astola,

More information

Prediction Models of Diabetes Diseases Based on Heterogeneous Multiple Classifiers

Prediction Models of Diabetes Diseases Based on Heterogeneous Multiple Classifiers Int. J. Advance Soft Compu. Appl, Vol. 10, No. 2, July 2018 ISSN 2074-8523 Prediction Models of Diabetes Diseases Based on Heterogeneous Multiple Classifiers I Gede Agus Suwartane 1, Mohammad Syafrullah

More information

IN SPITE of a very quick development of medicine within

IN SPITE of a very quick development of medicine within INTL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 21, VOL. 6, NO. 3, PP. 281-286 Manuscript received July 1, 21: revised September, 21. DOI: 1.2478/v1177-1-37-9 Application of Density Based Clustering

More information

Colon cancer subtypes from gene expression data

Colon cancer subtypes from gene expression data Colon cancer subtypes from gene expression data Nathan Cunningham Giuseppe Di Benedetto Sherman Ip Leon Law Module 6: Applied Statistics 26th February 2016 Aim Replicate findings of Felipe De Sousa et

More information

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 1.852

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 1.852 IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY Performance Analysis of Brain MRI Using Multiple Method Shroti Paliwal *, Prof. Sanjay Chouhan * Department of Electronics & Communication

More information

A NOVEL VARIABLE SELECTION METHOD BASED ON FREQUENT PATTERN TREE FOR REAL-TIME TRAFFIC ACCIDENT RISK PREDICTION

A NOVEL VARIABLE SELECTION METHOD BASED ON FREQUENT PATTERN TREE FOR REAL-TIME TRAFFIC ACCIDENT RISK PREDICTION OPT-i An International Conference on Engineering and Applied Sciences Optimization M. Papadrakakis, M.G. Karlaftis, N.D. Lagaros (eds.) Kos Island, Greece, 4-6 June 2014 A NOVEL VARIABLE SELECTION METHOD

More information

BIOINFORMATICS ORIGINAL PAPER

BIOINFORMATICS ORIGINAL PAPER BIOINFORMATICS ORIGINAL PAPER Vol. 2 no. 4 25, pages 34 32 doi:.93/bioinformatics/bti483 Gene expression Ensemble dependence model for classification and prediction of cancer and normal gene expression

More information

Intelligent Patient Profiling for Diagnosis, Staging and Treatment Selection in Colon Cancer

Intelligent Patient Profiling for Diagnosis, Staging and Treatment Selection in Colon Cancer Intelligent Patient Profiling for Diagnosis, Staging and Treatment Selection in Colon Cancer Yorgos Goletsis, Member, IEEE, Themis P. Exarchos, Student member, IEEE, Nikolaos Giannakeas, Student member,

More information

Comparison Classifier: Support Vector Machine (SVM) and K-Nearest Neighbor (K-NN) In Digital Mammogram Images

Comparison Classifier: Support Vector Machine (SVM) and K-Nearest Neighbor (K-NN) In Digital Mammogram Images JUISI, Vol. 02, No. 02, Agustus 2016 35 Comparison Classifier: Support Vector Machine (SVM) and K-Nearest Neighbor (K-NN) In Digital Mammogram Images Jeklin Harefa 1, Alexander 2, Mellisa Pratiwi 3 Abstract

More information

VeriStrat Poor Patients Show Encouraging Overall Survival and Progression Free Survival Signal; Confirmatory Phase 2 Study Planned by Year-End

VeriStrat Poor Patients Show Encouraging Overall Survival and Progression Free Survival Signal; Confirmatory Phase 2 Study Planned by Year-End AVEO and Biodesix Announce Exploratory Analysis of VeriStrat-Selected Patients with Non-Small Cell Lung Cancer in Phase 2 Study of Ficlatuzumab Presented at ESMO 2014 Congress VeriStrat Poor Patients Show

More information

Ensemble methods for classification of patients for personalized. medicine with high-dimensional data

Ensemble methods for classification of patients for personalized. medicine with high-dimensional data Ensemble methods for classification of patients for personalized medicine with high-dimensional data Hojin Moon 1, Hongshik Ahn, Ralph L. Kodell 1, Songjoon Baek 1, Chien-Ju Lin 1, Taewon Lee 1 and James

More information

Improved Intelligent Classification Technique Based On Support Vector Machines

Improved Intelligent Classification Technique Based On Support Vector Machines Improved Intelligent Classification Technique Based On Support Vector Machines V.Vani Asst.Professor,Department of Computer Science,JJ College of Arts and Science,Pudukkottai. Abstract:An abnormal growth

More information

International Journal of Pharma and Bio Sciences A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS ABSTRACT

International Journal of Pharma and Bio Sciences A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS ABSTRACT Research Article Bioinformatics International Journal of Pharma and Bio Sciences ISSN 0975-6299 A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS D.UDHAYAKUMARAPANDIAN

More information

Analysis of Classification Algorithms towards Breast Tissue Data Set

Analysis of Classification Algorithms towards Breast Tissue Data Set Analysis of Classification Algorithms towards Breast Tissue Data Set I. Ravi Assistant Professor, Department of Computer Science, K.R. College of Arts and Science, Kovilpatti, Tamilnadu, India Abstract

More information

NAÏVE BAYESIAN CLASSIFIER FOR ACUTE LYMPHOCYTIC LEUKEMIA DETECTION

NAÏVE BAYESIAN CLASSIFIER FOR ACUTE LYMPHOCYTIC LEUKEMIA DETECTION NAÏVE BAYESIAN CLASSIFIER FOR ACUTE LYMPHOCYTIC LEUKEMIA DETECTION Sriram Selvaraj 1 and Bommannaraja Kanakaraj 2 1 Department of Biomedical Engineering, P.S.N.A College of Engineering and Technology,

More information

Extraction of Informative Genes from Microarray Data

Extraction of Informative Genes from Microarray Data Extraction of Informative Genes from Microarray Data Topon Kumar Paul Department of Frontier Informatics The University of Tokyo Chiba 277-8561, Japan topon@iba.k.u-tokyo.ac.jp Hitoshi Iba Department of

More information

Clinical Utility of Diagnostic Tests

Clinical Utility of Diagnostic Tests Clinical Utility of Diagnostic Tests David A. Eberhard MD, PhD Director, Pre-Clinical Genomic Pathology, Lineberger Comprehensive Cancer Center Associate Professor, Depts. of Pathology and Pharmacology

More information

SubLasso:a feature selection and classification R package with a. fixed feature subset

SubLasso:a feature selection and classification R package with a. fixed feature subset SubLasso:a feature selection and classification R package with a fixed feature subset Youxi Luo,3,*, Qinghan Meng,2,*, Ruiquan Ge,2, Guoqin Mai, Jikui Liu, Fengfeng Zhou,#. Shenzhen Institutes of Advanced

More information

Diagnosis Of Ovarian Cancer Using Artificial Neural Network

Diagnosis Of Ovarian Cancer Using Artificial Neural Network Diagnosis Of Ovarian Cancer Using Artificial Neural Network B.Rosiline Jeetha #1, M.Malathi *2 1 Research Supervisor, 2 Research Scholar, Assistant Professor RVS College of Arts And Science Department

More information

NIH Public Access Author Manuscript Bioanalysis. Author manuscript; available in PMC 2011 March 16.

NIH Public Access Author Manuscript Bioanalysis. Author manuscript; available in PMC 2011 March 16. NIH Public Access Author Manuscript Published in final edited form as: Bioanalysis. 2010 May ; 2(5): 855 862. doi:10.4155/bio.10.35. Derivation of cancer diagnostic and prognostic signatures from gene

More information

A NOVEL CLASSIFICATION MODEL FOR ANALYSIS OF A CRIME USING NAÏVE BYES AND KNN IN DATA MINING

A NOVEL CLASSIFICATION MODEL FOR ANALYSIS OF A CRIME USING NAÏVE BYES AND KNN IN DATA MINING A NOVEL CLASSIFICATION MODEL FOR ANALYSIS OF A CRIME USING NAÏVE BYES AND KNN IN DATA MINING SHIVRAJ SINGH DEOPA 1, ABHISHEK KUMAR 2, KUNEEK GUPTA 3 Dr. SHASHI KANT SINGH 4 Galgotias college of engineering

More information