REINVENTING THE BIOMARKER PANEL DISCOVERY EXPERIENCE
|
|
- Ashley Hicks
- 5 years ago
- Views:
Transcription
1 REINVENTING THE BIOMARKER PANEL DISCOVERY EXPERIENCE
2 REINVENTING THE BIOMARKER PANEL DISCOVERY EXPERIENCE 1 Biomarker discovery has opened new realms in the medical industry, from patient diagnosis and treatment, to drug development and testing. However, through these advances the capacity to discover biomarkers panels has often been constrained by the employed methodologies. Current approaches to biomarker panel discovery A number of different machine learning, clustering and statistical approaches can be used for biomarker selection, including traditional methods such as: top scoring pair (TSP), decision trees (DT), naïve bayes (NB), prediction analysis of microarrays (PAM), support vector machine (SVM) and others. But these traditional methods can be difficult to interpret, use many biomarkers, and yield low accuracies, including sensitivity and specificity. For the medical industry, from diagnostics, to pharmaceutical developers, to labs, this translates into a costly process that leads to a harder path through regulatory approval. One weakness of traditional biomarker discovery techniques is the invariant approach. Testing for individual biomarkers, one at a time, is not only cumbersome and costly; it neglects the complex, interrelated nature of those markers. By capturing the relationships between multiple biomarkers, a more nuanced and precise evaluation can be conducted, which takes into account the interactions between potential biomarkers in determining patient outcomes. Another weakness of traditional biomarker discovery is the constraints of the statistical techniques typically employed. Inherent to these methods are numerous assumptions, which can constrain the potential information embedded in the data, clouding the potential results. The SimplicityBio Biomarker Optimization Software System A new multivariable approach to biomarker discovery has emerged to resolve these weaknesses, using SimplicityBio s proprietary Biomarker Optimization Software System (BOSS) we are able to find the perfect balance between accuracy and quantity of biomarkers. The core of this is the co-evolutionary fuzzy modeling method Fuzzy CoCo1. Around this method several steps are performed to select the best combination of biomarkers. BOSS performs two phases: 1 st 2 nd Exploratory-modeling: Potential signatures are created by testing billions of panels of biomarkers with Fuzzy CoCo. Fuzzy CoCo uses an artificial evolution approach, which allows populations of signatures to evolve, mate, and migrate, with only the most robust signatures surviving at the end. Signature-selection: A reduced number of signatures are. This family of signatures represents several characteristics, so it is possible to have signatures that are more sensible, sensitive, or with fewer variables than others. The final selection is made taking into account the needs of the client.
3 REINVENTING THE BIOMARKER PANEL DISCOVERY EXPERIENCE 2 The success of BOSS lies in its ability to minimize the number of rules and variables used in multivariate signatures, while maintaining exceptional accuracy, including sensitivity and specificity. The method yields a family of models, which can be isolated to meet the specific needs of the client. By reducing the number of rules and variables in each of the family s signature, testing costs will be reduced, both on the development end and consumer end. A cleaner, more concise resultant model can also aid developers in navigating the regulatory approval process. Testing SimplicityBio s Biomarker Optimization Software To test its efficacy, BOSS was compared with other biomarker discovery methods and s such as TSP, k-tsp, DT, NB, K-NN, PAM, SVM, MOE, Bagging C4.5, AdaBoost C4.5, KEM Biomarker from Ariana Pharma, AHC, Single C4.5, fsvm, and Fuzzy Logic for six seminal, published datasets. In comparing SimplicityBio s biomarker discovery with other methods for published datasets, BOSS consistently yields lower numbers of variables, while matching or exceeding the accuracy of the other methods. Across the six datasets, BOSS achieved an accuracy of 95.83% or higher which exceeded or met the accuracy of every other method it was compared to. But the key to BOSS s superiority is not just its exceptional accuracy, it is its ability to constrain the number of variables in each model. LEUKEMIA (Golub et al.2) Includes 38 observations, each of which is described by the gene expression levels of 7,129 genes and a class attribute with the two distinct labels of acute myeloid leukemia and lymphoblastic leukemia. Acute myeloid and lymphoblastic leukemia (Golub et al.) BOSS % 2 SimplicityBio1 NB % * Tan et al.8 SVM % 8 Guyon et al.9 PAM 97.22% 2296 Tan et al. k-tsp 95.83% 18 Tan et al. K-NN 84.82% * Tan et al. Fuzzy logic 79.00% 2 Ohno-Machado et al.10 DT 73.81% 2 Tan et al. In the comparison using the leukemia dataset, BOSS achieved an accuracy of 100% using 2 variables. SVM, another method that achieved this level of accuracy, used 8 variables. Other methods which used only 2 variables, Fuzzy logic and DT, only achieved accuracies of 79% and 73.81% respectively.
4 REINVENTING THE BIOMARKER PANEL DISCOVERY EXPERIENCE 3 COLON CANCER (Alon et al.3) Includes 62 observations made up of 40 tumor samples and 22 normal samples. There are approximately 6,000 genes represented in each sample in the dataset. Colon Cancer (Alon et al.) BOSS 94.14% 27 SimplicityBio TSP 91.10% 2 Tan et al. k-tsp 90.30% 2 Tan et al. Fuzzy logic 90.00% 17 Huerta et al.11 PAM 85.48% 15 Tan et al. SVM 82.26% * Tan et al. DT 80.65% 3 Tan et al. K-NN 74.19% * Tan et al. NB 58.06% * Tan et al. Despite using more variables, BOSS outperforms the other datasets in terms of accuracy. PROSTATE CANCER (Singh et al.4) Includes 52 prostate tumor samples and 50 non-tumor prostate samples with a total of 12,600 genes. Prostate Tumor (Singh et al.) BOSS 97.29% 2 SimplicityBio TSP 95.00% 2 Tan et al. MC-SVM 92.00% * Statnikov et al.12 k-tsp 91.00% 2 Tan et al. PAM 91.00% 47 Tan et al. SVM 91.00% * Tan et al. NN 91.00% * Statnikov et al. DT 87.00% 4 Tan et al. KNN 85.00% * Statnikov et al. NB 62.00% * Tan et al. BOSS achieves an accuracy of 97.29% for the prostate cancer dataset using 2 variables. The only method using fewer variables TSP compromises accuracy to do so.
5 REINVENTING THE BIOMARKER PANEL DISCOVERY EXPERIENCE 4 LUNG CANCER (Gordon et al.5) Includes 52 prostate tumor samples and 50 non-tumor prostate samples with a total of 12,600 genes. Lung cancer (Gordon et al.) BOSS % 2 SimplicityBio PAM 99.45% 15 Tan et al. SVM 99.45% * Tan et al. k-tsp 98.90% 2 Tan et al. K-NN 98.34% * Tan et al. TSP 98.30% 2 Tan et al. NB 97.79% * Tan et al. DT 96.13% 3 Tan et al. MOE 91.00% 2 Wang & Palade13 With the only 100% accuracy result for the methods tested in the lung cancer dataset, BOSS uses 2 variables. Four methods use the same or fewer variables k-tsp, TSP, DT, and MOE however they have significantly lower accuracies of 98.90%, 98.30%, 96.13% and 91.00% respectively. Breast CANCER (Van de Vijver et al.6) Includes 295 samples made up of 151 lymph-node negative disease and 144 with lymph-node positive disease with a total of 70 genes. Breast cancer (van de Vijver et al.6) BOSS 95.83% 31 SimplicityBio Bagging C % * Tan & Gilbert14 AdaBoost C % * Tan & Gilbert BOSS 87.50% 9 SimplicityBio KEM Biomarker 85.89% 13 Guergova-Kuras et al.15 AHC 83.33% 70 van de Vijver et al. Single C * Tan & Gilbert Single C % * Tan & Gilbert Here are presented two signatures discovered by BOSS. The first one has the highest accuracy (95.83%) but not the lowest number of variables. The second one has a lower number of variables (9) and an accuracy superior to KEM Biomarker who presents the lower number of variables among the other methods.
6 REINVENTING THE BIOMARKER PANEL DISCOVERY EXPERIENCE 5 Ovarian CANCER (Zhou et al.7) Includes 94 samples made up of 44 samples from women diagnosed with serous papillary ovarian cancer and 50 healthy women with a total of 3,017 mass spectrometry signatures. Ovarian Cancer (Zhou et al.7) BOSS % 10 SimplicityBio1 fsvm % 3017 Zhou et al. KEM Biomarker 92.97% 13 Guergova-Kuras et al. The ovarian cancer dataset exemplifies the importance of reducing the number of variables used in modeling. While fsvm achieves an accuracy of 100% to match that of BOSS it uses 300x the number of variables. As exemplified by these six datasets, BOSS consistently has the highest accuracy of any method tested, with lower or comparable numbers of variables used. Even when BOSS uses slightly more variables, an increase of 1 to 2 variables is a modest tradeoff for higher accuracy. When minimizing the number of variables used is the goal, BOSS can still produce exceptional accuracy results. Summary BOSS is the next stage in the evolution of biomarker discovery technology. The co-evolutionary engine behind BOSS continually drives discovery models toward more elegant, simple, and powerful solutions to better meet the needs of clients.
7 REINVENTING THE BIOMARKER PANEL DISCOVERY EXPERIENCE 6 About SimplicityBio SimplicityBio is a Swiss biomarker panel discovery company. SimplicityBio s Biomarker Optimization Software System (BOSS) allows you to take full advantage of multiple data types and unbalanced data sets, while answering your production, regulatory and IP requirements. To do so, our discovers robust, highly specific and sensitive biomarker panels. Leaving you to choose the one that answers your needs. BOSS brings a unique and powerful combination of machine learning, evolutionary algorithms and fuzzy logic to the biological world, and is thus able to discover new robust multi-biomarker panels and improve existing ones. Our clients and partners range from research institutions, to diagnostic, companion diagnostic, prognostic and pharmaceutical companies. Contact us: Route de l'ile-aux-bois 1A 1870 Monthey Switzerland info@simplicitybio.com visit:
8 REINVENTING THE BIOMARKER PANEL DISCOVERY EXPERIENCE 7 s: [1] Barreto-Sanz, M. A., Bujard, A., & Pena-Reyes, C. A. (2012, November). Evolving very-compact fuzzy models for gene expression data analysis. InBioinformatics & Bioengineering (BIBE), 2012 IEEE 12th International Conference on (pp ). IEEE. [2] Golub, T. R., Slonim, D. K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J. P.,... & Lander, E. S. (1999). Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. science,286(5439), [3] Alon, U., Barkai, N., Notterman, D. A., Gish, K., Ybarra, S., Mack, D., & Levine, A. J. (1999). Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proceedings of the National Academy of Sciences, 96(12), [4] Singh, D., Febbo, P. G., Ross, K., Jackson, D. G., Manola, J., Ladd, C.,... & Sellers, W. R. (2002). Gene expression correlates of clinical prostate cancer behavior. Cancer cell, 1(2), [5] Gordon, G. J., Jensen, R. V., Hsiao, L. L., Gullans, S. R., Blumenstock, J. E., Ramaswamy, S.,... & Bueno, R. (2002). Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma. Cancer research, 62(17), [6] Van De Vijver, M. J., He, Y. D., van't Veer, L. J., Dai, H., Hart, A. A., Voskuil, D. W.,... & Bernards, R. (2002). A gene-expression signature as a predictor of survival in breast cancer. New England Journal of Medicine, 347(25), [7] Zhou, M., Guan, W., Walker, L. D., Mezencev, R., Benigno, B. B., Gray, A.,... & McDonald, J. F. (2010). Rapid mass spectrometric metabolic profiling of blood sera detects ovarian cancer with high accuracy. Cancer Epidemiology Biomarkers & Prevention, 19(9), [8] Tan, A. C., Naiman, D. Q., Xu, L., Winslow, R. L., & Geman, D. (2005). Simple decision rules for classifying human cancers from gene expression profiles.bioinformatics, 21(20), [9] Guyon, I., Weston, J., Barnhill, S., & Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Machine learning, 46(1-3), [10] Ohno-Machado, L., Vinterbo, S., & Weber, G. (2002). Classification of gene expression data using fuzzy logic. Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology, 12(1), [11] Huerta, E., Duval, B., & Hao, J. K. (2008). Fuzzy logic for elimination of redundant information of microarray data. Genomics, proteomics & bioinformatics, 6(2), [12] Statnikov, A., Aliferis, C. F., Tsamardinos, I., Hardin, D., & Levy, S. (2005). A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis. Bioinformatics, 21(5), [13] Wang, Z., & Palade, V. (2010, December). Multi-objective evolutionary algorithms based interpretable fuzzy models for microarray gene expression data analysis. In Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on (pp ). IEEE. [14] Tan, A. C., & Gilbert, D. (2003). Ensemble machine learning on gene expression data for cancer classification. [15] Guergova-Kuras, M., Schneider, M. P., Jullian, N., & Afshar, M. (2014). 667: Shorter multimarker signatures: a new tool to facilitate cancer diagnosis.european Journal of Cancer, (50), S160.
9 REINVENTING THE BIOMARKER PANEL DISCOVERY EXPERIENCE 8 APPENDIX A Acronym TSP k-tsp DT NB K-NN PAM SVM MOE Bagging C4.5 AdaBoost C4.5 KEM Biomarker from Ariana Pharma AHC Single C4.5 fsvm Fuzzy Logic MC-SVM BOSS Technique of Platform Top scoring pair k- Top scoring pair C4.5 decision trees Naïve Bayes K-nearest neighbor Prediction analysis of microarrays Support Vector Machines Multi-objectiive Evolucionary Algorithms and Fuzzy Logic Knowledge Extraction and Management Aglomerative hierchical clutering algorithm Functional Support Vector Machine Multiclass support vector machine Biomarker Optimization Software System
Predictive Biomarkers
Uğur Sezerman Evolutionary Selection of Near Optimal Number of Features for Classification of Gene Expression Data Using Genetic Algorithms Predictive Biomarkers Biomarker: A gene, protein, or other change
More informationFUZZY C-MEANS AND ENTROPY BASED GENE SELECTION BY PRINCIPAL COMPONENT ANALYSIS IN CANCER CLASSIFICATION
FUZZY C-MEANS AND ENTROPY BASED GENE SELECTION BY PRINCIPAL COMPONENT ANALYSIS IN CANCER CLASSIFICATION SOMAYEH ABBASI, HAMID MAHMOODIAN Department of Electrical Engineering, Najafabad branch, Islamic
More informationA hierarchical two-phase framework for selecting genes in cancer datasets with a neuro-fuzzy system
Technology and Health Care 24 (2016) S601 S605 DOI 10.3233/THC-161187 IOS Press S601 A hierarchical two-phase framework for selecting genes in cancer datasets with a neuro-fuzzy system Jongwoo Lim, Bohyun
More informationAnalyzing Gene Expression Data: Fuzzy Decision Tree Algorithm applied to the Classification of Cancer Data
Analyzing Gene Expression Data: Fuzzy Decision Tree Algorithm applied to the Classification of Cancer Data Simone A. Ludwig Department of Computer Science North Dakota State University Fargo, ND, USA simone.ludwig@ndsu.edu
More informationActive Learning with Support Vector Machine Applied to Gene Expression Data for Cancer Classification
1936 J. Chem. Inf. Comput. Sci. 2004, 44, 1936-1941 Active Learning with Support Vector Machine Applied to Gene Expression Data for Cancer Classification Ying Liu* Georgia Institute of Technology, College
More informationAlgorithms Implemented for Cancer Gene Searching and Classifications
Algorithms Implemented for Cancer Gene Searching and Classifications Murad M. Al-Rajab and Joan Lu School of Computing and Engineering, University of Huddersfield Huddersfield, UK {U1174101,j.lu}@hud.ac.uk
More informationCANCER CLASSIFICATION USING SINGLE GENES
179 CANCER CLASSIFICATION USING SINGLE GENES XIAOSHENG WANG 1 OSAMU GOTOH 1,2 david@genome.ist.i.kyoto-u.ac.jp o.gotoh@i.kyoto-u.ac.jp 1 Department of Intelligence Science and Technology, Graduate School
More informationHybridized KNN and SVM for gene expression data classification
Mei, et al, Hybridized KNN and SVM for gene expression data classification Hybridized KNN and SVM for gene expression data classification Zhen Mei, Qi Shen *, Baoxian Ye Chemistry Department, Zhengzhou
More informationA Biclustering Based Classification Framework for Cancer Diagnosis and Prognosis
A Biclustering Based Classification Framework for Cancer Diagnosis and Prognosis Baljeet Malhotra and Guohui Lin Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada T6G 2E8
More informationMulticlass microarray data classification based on confidence evaluation
Methodology Multiclass microarray data classification based on confidence evaluation H.L. Yu 1, S. Gao 1, B. Qin 1 and J. Zhao 2 1 School of Computer Science and Engineering, Jiangsu University of Science
More informationSimple Decision Rules for Classifying Human Cancers from Gene Expression Profiles
Simple Decision Rules for Classifying Human Cancers from Gene Expression Profiles Aik Choon TAN Post-Doc Research Fellow actan@jhu.edu Prof. Raimond L. Winslow rwinslow@jhu.edu, Director, ICM & CCBM, Prof.
More informationPackage propoverlap. R topics documented: February 20, Type Package
Type Package Package propoverlap February 20, 2015 Title Feature (gene) selection based on the Proportional Overlapping Scores Version 1.0 Date 2014-09-15 Author Osama Mahmoud, Andrew Harrison, Aris Perperoglou,
More informationA COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION
5-9 JATIT. All rights reserved. A COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION 1 H. Mahmoodian, M. Hamiruce Marhaban, 3 R. A. Rahim, R. Rosli, 5 M. Iqbal Saripan 1 PhD student, Department
More informationGene expression correlates of clinical prostate cancer behavior
Gene expression correlates of clinical prostate cancer behavior Cancer Cell 2002 1: 203-209. Singh D, Febbo P, Ross K, Jackson D, Manola J, Ladd C, Tamayo P, Renshaw A, D Amico A, Richie J, Lander E, Loda
More informationGene Selection for Tumor Classification Using Microarray Gene Expression Data
Gene Selection for Tumor Classification Using Microarray Gene Expression Data K. Yendrapalli, R. Basnet, S. Mukkamala, A. H. Sung Department of Computer Science New Mexico Institute of Mining and Technology
More informationClassification of cancer profiles. ABDBM Ron Shamir
Classification of cancer profiles 1 Background: Cancer Classification Cancer classification is central to cancer treatment; Traditional cancer classification methods: location; morphology, cytogenesis;
More informationAUTHOR PROOF COPY ONLY
REVIEW Ensemble machine learning on gene expression data for cancer classification Aik Choon Tan and David Gilbert Bioinformatics Research Centre, Department of Computing Science, University of Glasgow,
More informationInternational Journal of Pure and Applied Mathematics
Volume 119 No. 12 2018, 12505-12513 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Analysis of Cancer Classification of Gene Expression Data A Scientometric Review 1 Joseph M. De Guia,
More informationGene Expression Based Leukemia Sub Classification Using Committee Neural Networks
Bioinformatics and Biology Insights M e t h o d o l o g y Open Access Full open access to this and thousands of other papers at http://www.la-press.com. Gene Expression Based Leukemia Sub Classification
More informationEfficacy of the Extended Principal Orthogonal Decomposition Method on DNA Microarray Data in Cancer Detection
202 4th International onference on Bioinformatics and Biomedical Technology IPBEE vol.29 (202) (202) IASIT Press, Singapore Efficacy of the Extended Principal Orthogonal Decomposition on DA Microarray
More informationHYBRID SUPPORT VECTOR MACHINE BASED MARKOV CLUSTERING FOR TUMOR DETECTION FROM BIO-MOLECULAR DATA
HYBRID SUPPORT VECTOR MACHINE BASED MARKOV CLUSTERING FOR TUMOR DETECTION FROM BIO-MOLECULAR DATA S. SubashChandraBose 1 and T. Christopher 2 1 Department of Computer Science, PG and Research Department,
More informationTHE gene expression profiles that are obtained from
, July 3-5, 2013, London, U.K. A Study of Cancer Microarray Gene Expression Profile: Objectives and Approaches Hala M. Alshamlan, Ghada H. Badr, and Yousef Alohali Abstract Cancer is one of the dreadful
More informationAn Improved Algorithm To Predict Recurrence Of Breast Cancer
An Improved Algorithm To Predict Recurrence Of Breast Cancer Umang Agrawal 1, Ass. Prof. Ishan K Rajani 2 1 M.E Computer Engineer, Silver Oak College of Engineering & Technology, Gujarat, India. 2 Assistant
More informationAn entropy-based improved k-top scoring pairs (TSP) method for classifying human cancers
African Journal of Biotechnology Vol. 11(45), pp. 10438-10445, 5 June, 2012 Available online at http://www.academicjournals.org/ajb DOI:10.5897/AJB11.1016 ISSN 1684 5315 2012 Academic Journals Full Length
More informationIntroduction to Discrimination in Microarray Data Analysis
Introduction to Discrimination in Microarray Data Analysis Jane Fridlyand CBMB University of California, San Francisco Genentech Hall Auditorium, Mission Bay, UCSF October 23, 2004 1 Case Study: Van t
More informationStatistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes.
Final review Based in part on slides from textbook, slides of Susan Holmes December 5, 2012 1 / 1 Final review Overview Before Midterm General goals of data mining. Datatypes. Preprocessing & dimension
More informationAugmented Medical Decisions
Machine Learning Applied to Biomedical Challenges 2016 Rulex, Inc. Intelligible Rules for Reliable Diagnostics Rulex is a predictive analytics platform able to manage and to analyze big amounts of heterogeneous
More informationIncreasing Efficiency of Microarray Analysis by PCA and Machine Learning Methods
56 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 Increasing Efficiency of Microarray Analysis by PCA and Machine Learning Methods Jing Sun 1, Kalpdrum Passi 1, Chakresh Jain 2 1 Department
More informationNIH Public Access Author Manuscript Best Pract Res Clin Haematol. Author manuscript; available in PMC 2010 June 1.
NIH Public Access Author Manuscript Published in final edited form as: Best Pract Res Clin Haematol. 2009 June ; 22(2): 271 282. doi:10.1016/j.beha.2009.07.001. Analysis of DNA Microarray Expression Data
More informationNearest Shrunken Centroid as Feature Selection of Microarray Data
Nearest Shrunken Centroid as Feature Selection of Microarray Data Myungsook Klassen Computer Science Department, California Lutheran University 60 West Olsen Rd, Thousand Oaks, CA 91360 mklassen@clunet.edu
More informationCase Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD
Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Department of Biomedical Informatics Department of Computer Science and Engineering The Ohio State University Review
More informationApplications of Causal Discovery Methods in Biomedicine
Applications of Causal Discovery Methods in Biomedicine Sisi Ma Sisi.Ma@nyumc.org New York University School of Medicine NYU Center for Health Informatics & Bioinformatics Alexander Statnikov; NYU Psychiatry
More informationPublished in the Russian Federation Modeling of Artificial Intelligence Has been issued since ISSN: Vol. 6, Is. 2, pp.
Copyright 2015 by Academic Publishing House Researcher Published in the Russian Federation Modeling of Artificial Intelligence Has been issued since 2014. ISSN: 2312-0355 Vol. 6, Is. 2, pp. 171-182, 2015
More informationPredicting Breast Cancer Survivability Rates
Predicting Breast Cancer Survivability Rates For data collected from Saudi Arabia Registries Ghofran Othoum 1 and Wadee Al-Halabi 2 1 Computer Science, Effat University, Jeddah, Saudi Arabia 2 Computer
More informationIdentifying Thyroid Carcinoma Subtypes and Outcomes through Gene Expression Data Kun-Hsing Yu, Wei Wang, Chung-Yu Wang
Identifying Thyroid Carcinoma Subtypes and Outcomes through Gene Expression Data Kun-Hsing Yu, Wei Wang, Chung-Yu Wang Abstract: Unlike most cancers, thyroid cancer has an everincreasing incidence rate
More informationComparison of discrimination methods for the classification of tumors using gene expression data
Comparison of discrimination methods for the classification of tumors using gene expression data Sandrine Dudoit, Jane Fridlyand 2 and Terry Speed 2,. Mathematical Sciences Research Institute, Berkeley
More informationTissue Classification Based on Gene Expression Data
Chapter 6 Tissue Classification Based on Gene Expression Data Many diseases result from complex interactions involving numerous genes. Previously, these gene interactions have been commonly studied separately.
More informationDevelopment of Soft-Computing techniques capable of diagnosing Alzheimer s Disease in its pre-clinical stage combining MRI and FDG-PET images.
Development of Soft-Computing techniques capable of diagnosing Alzheimer s Disease in its pre-clinical stage combining MRI and FDG-PET images. Olga Valenzuela, Francisco Ortuño, Belen San-Roman, Victor
More informationAccuracy-Rejection Curves (ARCs) for Comparing Classification Methods with a Reject Option
JMLR: Workshop and Conference Proceedings 8: 65-81 Machine Learning in Systems Biology Accuracy-Rejection Curves (ARCs) for Comparing Classification Methods with a Reject Option Malik Sajjad Ahmed Nadeem
More informationT. R. Golub, D. K. Slonim & Others 1999
T. R. Golub, D. K. Slonim & Others 1999 Big Picture in 1999 The Need for Cancer Classification Cancer classification very important for advances in cancer treatment. Cancers of Identical grade can have
More informationCancer is the fourth most common disease and the. Genomic Processing for Cancer Classification and Prediction
[ Peng Qiu, Z. Jane Wang, and K.J. Ray Liu ] Genomic Processing for Cancer Classification and Prediction [A broad review of the recent advances in model-based genomic and proteomic signal processing for
More informationClassification consistency analysis for bootstrapping gene selection
Neural Comput & Applic (27) 6:527 539 DOI.7/s52-7-- ICONIP26 Classification consistency analysis for bootstrapping gene selection Shaoning Pang Æ Ilkka Havukkala Æ Yingjie Hu Æ Nikola Kasabov Received:
More informationA Survey on Detection and Classification of Brain Tumor from MRI Brain Images using Image Processing Techniques
A Survey on Detection and Classification of Brain Tumor from MRI Brain Images using Image Processing Techniques Shanti Parmar 1, Nirali Gondaliya 2 1Student, Dept. of Computer Engineering, AITS-Rajkot,
More informationInternational Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017
RESEARCH ARTICLE Classification of Cancer Dataset in Data Mining Algorithms Using R Tool P.Dhivyapriya [1], Dr.S.Sivakumar [2] Research Scholar [1], Assistant professor [2] Department of Computer Science
More informationA novel approach to feature extraction from classification models based on information gene pairs
Pattern Recognition 41 (2008) 1975 1984 www.elsevier.com/locate/pr A novel approach to feature extraction from classification models based on information gene pairs J. Li, X. Tang, J. Liu, J. Huang, Y.
More informationRoadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23:
Roadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23:7332-7341 Presented by Deming Mi 7/25/2006 Major reasons for few prognostic factors to
More informationA DATA MINING APPROACH FOR PRECISE DIAGNOSIS OF DENGUE FEVER
A DATA MINING APPROACH FOR PRECISE DIAGNOSIS OF DENGUE FEVER M.Bhavani 1 and S.Vinod kumar 2 International Journal of Latest Trends in Engineering and Technology Vol.(7)Issue(4), pp.352-359 DOI: http://dx.doi.org/10.21172/1.74.048
More informationCANCER PREDICTION SYSTEM USING DATAMINING TECHNIQUES
CANCER PREDICTION SYSTEM USING DATAMINING TECHNIQUES K.Arutchelvan 1, Dr.R.Periyasamy 2 1 Programmer (SS), Department of Pharmacy, Annamalai University, Tamilnadu, India 2 Associate Professor, Department
More informationEfficient Classification of Cancer using Support Vector Machines and Modified Extreme Learning Machine based on Analysis of Variance Features
American Journal of Applied Sciences 8 (12): 1295-1301, 2011 ISSN 1546-9239 2011 Science Publications Efficient Classification of Cancer using Support Vector Machines and Modified Extreme Learning Machine
More informationApplying Machine Learning Techniques to Analysis of Gene Expression Data: Cancer Diagnosis
Applying Machine Learning Techniques to Analysis of Gene Expression Data: Cancer Diagnosis Kyu-Baek Hwang, Dong-Yeon Cho, Sang-Wook Park Sung-Dong Kim, and Byoung-Tak Zhang Artificial Intelligence Lab
More informationPredicting Kidney Cancer Survival from Genomic Data
Predicting Kidney Cancer Survival from Genomic Data Christopher Sauer, Rishi Bedi, Duc Nguyen, Benedikt Bünz Abstract Cancers are on par with heart disease as the leading cause for mortality in the United
More informationEvaluating Classifiers for Disease Gene Discovery
Evaluating Classifiers for Disease Gene Discovery Kino Coursey Lon Turnbull khc0021@unt.edu lt0013@unt.edu Abstract Identification of genes involved in human hereditary disease is an important bioinfomatics
More informationData complexity measures for analyzing the effect of SMOTE over microarrays
ESANN 216 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. Bruges (Belgium), 27-29 April 216, i6doc.com publ., ISBN 978-2878727-8. Data complexity
More informationComparing Multifunctionality and Association Information when Classifying Oncogenes and Tumor Suppressor Genes
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationCancer Gene Extraction Based on Stepwise Regression
Mathematical Computation Volume 5, 2016, PP.6-10 Cancer Gene Extraction Based on Stepwise Regression Jie Ni 1, Fan Wu 1, Meixiang Jin 1, Yixing Bai 1, Yunfei Guo 1 1. Mathematics Department, Yanbian University,
More informationMachine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017
Machine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017 A.K.A. Artificial Intelligence Unsupervised learning! Cluster analysis Patterns, Clumps, and Joining
More informationSVM-Kmeans: Support Vector Machine based on Kmeans Clustering for Breast Cancer Diagnosis
SVM-Kmeans: Support Vector Machine based on Kmeans Clustering for Breast Cancer Diagnosis Walaa Gad Faculty of Computers and Information Sciences Ain Shams University Cairo, Egypt Email: walaagad [AT]
More informationMACHINE LEARNING BASED APPROACHES FOR CANCER CLASSIFICATION USING GENE EXPRESSION DATA
MACHINE LEARNING BASED APPROACHES FOR CANCER CLASSIFICATION USING GENE EXPRESSION DATA Amit Bhola 1 and Arvind Kumar Tiwari 2 1 Department of CSE, Kashi Institute of Technology, Varanasi, U.P., India 2
More informationUsing CART to Mine SELDI ProteinChip Data for Biomarkers and Disease Stratification
Using CART to Mine SELDI ProteinChip Data for Biomarkers and Disease Stratification Kenna Mawk, D.V.M. Informatics Product Manager Ciphergen Biosystems, Inc. Outline Introduction to ProteinChip Technology
More informationA Comparison of Collaborative Filtering Methods for Medication Reconciliation
A Comparison of Collaborative Filtering Methods for Medication Reconciliation Huanian Zheng, Rema Padman, Daniel B. Neill The H. John Heinz III College, Carnegie Mellon University, Pittsburgh, PA, 15213,
More informationJournal of Engineering Technology
New approaches for gene selection and cancer diagnosis based on microarray gene expression profiling Sara Haddou Bouazza 1, Khalid Auhmani 2, Abdelouhab Zeroual 1 1 Department of Physics, Faculty of Sciences
More informationDetection of Cognitive States from fmri data using Machine Learning Techniques
Detection of Cognitive States from fmri data using Machine Learning Techniques Vishwajeet Singh, K.P. Miyapuram, Raju S. Bapi* University of Hyderabad Computational Intelligence Lab, Department of Computer
More informationInternational Journal of Advance Engineering and Research Development A THERORETICAL SURVEY ON BREAST CANCER PREDICTION USING DATA MINING TECHNIQUES
Scientific Journal of Impact Factor (SJIF): 4.14 e-issn: 2348-4470 p-issn: 2348-6406 International Journal of Advance Engineering and Research Development Volume 4, Issue 02 February -2018 A THERORETICAL
More informationMolecular classi cation of cancer types from microarray data using the combination of genetic algorithms and support vector machines
FEBS Letters 555 (2003) 358^362 FEBS 27869 Molecular classi cation of cancer types from microarray data using the combination of genetic algorithms and support vector machines Sihua Peng a, Qianghua Xu
More informationA Fuzzy Improved Neural based Soft Computing Approach for Pest Disease Prediction
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 13 (2014), pp. 1335-1341 International Research Publications House http://www. irphouse.com A Fuzzy Improved
More informationAccurate molecular classification of cancer using simple rules.
University of Nebraska Medical Center DigitalCommons@UNMC Journal Articles: Genetics, Cell Biology & Anatomy Genetics, Cell Biology & Anatomy 10-30-2009 Accurate molecular classification of cancer using
More informationPredicting Malignancy from Mammography Findings and Image Guided Core Biopsies
Predicting Malignancy from Mammography Findings and Image Guided Core Biopsies 2 nd Breast Cancer Workshop 2015 April 7 th 2015 Porto, Portugal Pedro Ferreira Nuno A. Fonseca Inês Dutra Ryan Woods Elizabeth
More informationCANCER DIAGNOSIS USING DATA MINING TECHNOLOGY
CANCER DIAGNOSIS USING DATA MINING TECHNOLOGY Muhammad Shahbaz 1, Shoaib Faruq 2, Muhammad Shaheen 1, Syed Ather Masood 2 1 Department of Computer Science and Engineering, UET, Lahore, Pakistan Muhammad.Shahbaz@gmail.com,
More informationNational Surgical Adjuvant Breast and Bowel Project (NSABP) Foundation Annual Progress Report: 2009 Formula Grant
National Surgical Adjuvant Breast and Bowel Project (NSABP) Foundation Annual Progress Report: 2009 Formula Grant Reporting Period July 1, 2011 June 30, 2012 Formula Grant Overview The National Surgical
More informationPredicting Breast Cancer Recurrence Using Machine Learning Techniques
Predicting Breast Cancer Recurrence Using Machine Learning Techniques Umesh D R Department of Computer Science & Engineering PESCE, Mandya, Karnataka, India Dr. B Ramachandra Department of Electrical and
More informationAn Efficient Diseases Classifier based on Microarray Datasets using Clustering ANOVA Extreme Learning Machine (CAELM)
www.ijcsi.org 8 An Efficient Diseases Classifier based on Microarray Datasets using Clustering ANOVA Extreme Learning Machine (CAELM) Shamsan Aljamali 1, Zhang Zuping 2 and Long Jun 3 1 School of Information
More informationPackage golubesets. August 16, 2014
Package golubesets August 16, 2014 Version 1.6.0 Title exprsets for golub leukemia data Author Todd Golub Maintainer Vince Carey Description representation
More informationGene expression analysis. Roadmap. Microarray technology: how it work Applications: what can we do with it Preprocessing: Classification Clustering
Gene expression analysis Roadmap Microarray technology: how it work Applications: what can we do with it Preprocessing: Image processing Data normalization Classification Clustering Biclustering 1 Gene
More informationAutomatic Detection of Epileptic Seizures in EEG Using Machine Learning Methods
Automatic Detection of Epileptic Seizures in EEG Using Machine Learning Methods Ying-Fang Lai 1 and Hsiu-Sen Chiang 2* 1 Department of Industrial Education, National Taiwan Normal University 162, Heping
More informationInter-session reproducibility measures for high-throughput data sources
Inter-session reproducibility measures for high-throughput data sources Milos Hauskrecht, PhD, Richard Pelikan, MSc Computer Science Department, Intelligent Systems Program, Department of Biomedical Informatics,
More informationClassification with microarray data
Classification with microarray data Aron Charles Eklund eklund@cbs.dtu.dk DNA Microarray Analysis - #27612 January 8, 2010 The rest of today Now: What is classification, and why do we do it? How to develop
More informationClass discovery in Gene Expression Data: Characterizing Splits by Support Vector Machines
Class discovery in Gene Expression Data: Characterizing Splits by Support Vector Machines Florian Markowetz and Anja von Heydebreck Max-Planck-Institute for Molecular Genetics Computational Molecular Biology
More informationCertificate Courses in Biostatistics
Certificate Courses in Biostatistics Term I : September December 2015 Term II : Term III : January March 2016 April June 2016 Course Code Module Unit Term BIOS5001 Introduction to Biostatistics 3 I BIOS5005
More informationABSTRACT I. INTRODUCTION. Mohd Thousif Ahemad TSKC Faculty Nagarjuna Govt. College(A) Nalgonda, Telangana, India
International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 1 ISSN : 2456-3307 Data Mining Techniques to Predict Cancer Diseases
More informationA Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer
A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer Hautaniemi, Sampsa; Ringnér, Markus; Kauraniemi, Päivikki; Kallioniemi, Anne; Edgren, Henrik; Yli-Harja, Olli; Astola,
More informationPrediction Models of Diabetes Diseases Based on Heterogeneous Multiple Classifiers
Int. J. Advance Soft Compu. Appl, Vol. 10, No. 2, July 2018 ISSN 2074-8523 Prediction Models of Diabetes Diseases Based on Heterogeneous Multiple Classifiers I Gede Agus Suwartane 1, Mohammad Syafrullah
More informationIN SPITE of a very quick development of medicine within
INTL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 21, VOL. 6, NO. 3, PP. 281-286 Manuscript received July 1, 21: revised September, 21. DOI: 1.2478/v1177-1-37-9 Application of Density Based Clustering
More informationColon cancer subtypes from gene expression data
Colon cancer subtypes from gene expression data Nathan Cunningham Giuseppe Di Benedetto Sherman Ip Leon Law Module 6: Applied Statistics 26th February 2016 Aim Replicate findings of Felipe De Sousa et
More informationIJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 1.852
IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY Performance Analysis of Brain MRI Using Multiple Method Shroti Paliwal *, Prof. Sanjay Chouhan * Department of Electronics & Communication
More informationA NOVEL VARIABLE SELECTION METHOD BASED ON FREQUENT PATTERN TREE FOR REAL-TIME TRAFFIC ACCIDENT RISK PREDICTION
OPT-i An International Conference on Engineering and Applied Sciences Optimization M. Papadrakakis, M.G. Karlaftis, N.D. Lagaros (eds.) Kos Island, Greece, 4-6 June 2014 A NOVEL VARIABLE SELECTION METHOD
More informationBIOINFORMATICS ORIGINAL PAPER
BIOINFORMATICS ORIGINAL PAPER Vol. 2 no. 4 25, pages 34 32 doi:.93/bioinformatics/bti483 Gene expression Ensemble dependence model for classification and prediction of cancer and normal gene expression
More informationIntelligent Patient Profiling for Diagnosis, Staging and Treatment Selection in Colon Cancer
Intelligent Patient Profiling for Diagnosis, Staging and Treatment Selection in Colon Cancer Yorgos Goletsis, Member, IEEE, Themis P. Exarchos, Student member, IEEE, Nikolaos Giannakeas, Student member,
More informationComparison Classifier: Support Vector Machine (SVM) and K-Nearest Neighbor (K-NN) In Digital Mammogram Images
JUISI, Vol. 02, No. 02, Agustus 2016 35 Comparison Classifier: Support Vector Machine (SVM) and K-Nearest Neighbor (K-NN) In Digital Mammogram Images Jeklin Harefa 1, Alexander 2, Mellisa Pratiwi 3 Abstract
More informationVeriStrat Poor Patients Show Encouraging Overall Survival and Progression Free Survival Signal; Confirmatory Phase 2 Study Planned by Year-End
AVEO and Biodesix Announce Exploratory Analysis of VeriStrat-Selected Patients with Non-Small Cell Lung Cancer in Phase 2 Study of Ficlatuzumab Presented at ESMO 2014 Congress VeriStrat Poor Patients Show
More informationEnsemble methods for classification of patients for personalized. medicine with high-dimensional data
Ensemble methods for classification of patients for personalized medicine with high-dimensional data Hojin Moon 1, Hongshik Ahn, Ralph L. Kodell 1, Songjoon Baek 1, Chien-Ju Lin 1, Taewon Lee 1 and James
More informationImproved Intelligent Classification Technique Based On Support Vector Machines
Improved Intelligent Classification Technique Based On Support Vector Machines V.Vani Asst.Professor,Department of Computer Science,JJ College of Arts and Science,Pudukkottai. Abstract:An abnormal growth
More informationInternational Journal of Pharma and Bio Sciences A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS ABSTRACT
Research Article Bioinformatics International Journal of Pharma and Bio Sciences ISSN 0975-6299 A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS D.UDHAYAKUMARAPANDIAN
More informationAnalysis of Classification Algorithms towards Breast Tissue Data Set
Analysis of Classification Algorithms towards Breast Tissue Data Set I. Ravi Assistant Professor, Department of Computer Science, K.R. College of Arts and Science, Kovilpatti, Tamilnadu, India Abstract
More informationNAÏVE BAYESIAN CLASSIFIER FOR ACUTE LYMPHOCYTIC LEUKEMIA DETECTION
NAÏVE BAYESIAN CLASSIFIER FOR ACUTE LYMPHOCYTIC LEUKEMIA DETECTION Sriram Selvaraj 1 and Bommannaraja Kanakaraj 2 1 Department of Biomedical Engineering, P.S.N.A College of Engineering and Technology,
More informationExtraction of Informative Genes from Microarray Data
Extraction of Informative Genes from Microarray Data Topon Kumar Paul Department of Frontier Informatics The University of Tokyo Chiba 277-8561, Japan topon@iba.k.u-tokyo.ac.jp Hitoshi Iba Department of
More informationClinical Utility of Diagnostic Tests
Clinical Utility of Diagnostic Tests David A. Eberhard MD, PhD Director, Pre-Clinical Genomic Pathology, Lineberger Comprehensive Cancer Center Associate Professor, Depts. of Pathology and Pharmacology
More informationSubLasso:a feature selection and classification R package with a. fixed feature subset
SubLasso:a feature selection and classification R package with a fixed feature subset Youxi Luo,3,*, Qinghan Meng,2,*, Ruiquan Ge,2, Guoqin Mai, Jikui Liu, Fengfeng Zhou,#. Shenzhen Institutes of Advanced
More informationDiagnosis Of Ovarian Cancer Using Artificial Neural Network
Diagnosis Of Ovarian Cancer Using Artificial Neural Network B.Rosiline Jeetha #1, M.Malathi *2 1 Research Supervisor, 2 Research Scholar, Assistant Professor RVS College of Arts And Science Department
More informationNIH Public Access Author Manuscript Bioanalysis. Author manuscript; available in PMC 2011 March 16.
NIH Public Access Author Manuscript Published in final edited form as: Bioanalysis. 2010 May ; 2(5): 855 862. doi:10.4155/bio.10.35. Derivation of cancer diagnostic and prognostic signatures from gene
More informationA NOVEL CLASSIFICATION MODEL FOR ANALYSIS OF A CRIME USING NAÏVE BYES AND KNN IN DATA MINING
A NOVEL CLASSIFICATION MODEL FOR ANALYSIS OF A CRIME USING NAÏVE BYES AND KNN IN DATA MINING SHIVRAJ SINGH DEOPA 1, ABHISHEK KUMAR 2, KUNEEK GUPTA 3 Dr. SHASHI KANT SINGH 4 Galgotias college of engineering
More information