Class discovery in Gene Expression Data: Characterizing Splits by Support Vector Machines
|
|
- Robyn Lucinda Norris
- 5 years ago
- Views:
Transcription
1 Class discovery in Gene Expression Data: Characterizing Splits by Support Vector Machines Florian Markowetz and Anja von Heydebreck Max-Planck-Institute for Molecular Genetics Computational Molecular Biology Berlin 26th Annual Conference of the Gesellschaft für Klassifikation GfKl 2002: Mannheim 1
2 Overview I II III IV Characterization of tumor types by gene expression profiles The SVM score ISIS: the algorithm Output of ISIS Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 2
3 Microarray Experiments Hybridization (red label) total mrna standard condition Target Probe Hybridization convert into numerical values total mrna (green label) gene 1 gene 2 gene 3 gene 4 gene 5 standard 14,243 5,323 10,300 1, ,232 experiment 12,154 27,152 1,407 3, ,993 interesting condition Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 3
4 Characterization of tumor types Between two different, but related tumor tissue types, often relatively few genes are clearly differentially expressed. Usual cluster algorithms are based on a global measure of similarity between samples, where (possibly conflicting) signals from different groups of genes are intermingled. Our approach: we try to find interesting bi partitions of the set of tissue samples and measure how well the two groups are separated by expression levels of a suitable small subset of genes. This gives a score for each partition. Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 4
5 The SVM score The SVM margin score S(B) of a bipartition B = {M, M} is the margin of a (linear) Support Vector Machine separating M from M. Samples in M Margin Separating Hyperplane Samples in M Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 5
6 Significance of the SVM score Comparison to 10,000 random partitions of the data: Dataset Class distance to SVM SVM margin local max. margin at local max. Leukemia AML 3 (59.1%) 1.6 (0%) 2.2 (0%) T-cell ALL 0 (0.5%) 2.6 (0%) 2.6 (0%) B-cell ALL 1 (13.3%) 1.9 (0%) 2.8 (0%) Lymphoma / CLL 2 (11.8%) 2.0 (0%) 2.8 (0%) leukemia FL 0 (0%) 2.4 (0%) 2.4 (0%) DLBCL-G 3 (24.3%) 1.7 (0%) 2.0 (6.2%) DLBCL-A 5 (49.5%) 1.7 (0%) 2.9 (0%) Melanoma cluster of 1 (8.8%) 1.2 (0%) 2.0 (10.2%) 19 samples Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 6
7 Input: Example data sets Dataset # samples # genes classes: # samples total used (total) Leukemia 72 4,000 AML: 25 Golub et al. (1999) (6,817) B-cell ALL: 38 T-cell ALL: 9 Lymphoma / 62 4,026 CLL: 11 leukemia (17,856) FL: 9 Alizadeh et al. (2000) DLBCL-G: 21 DLBCL-A: 21 Melanoma 31 3,613 cluster: 19 Bittner et al. (2000) (6,971) remaining: 12 Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 7
8 Step 1: Finding candidate partitions 1. Data matrix X = (x gj ) samples j = 1,..., n and genes g = 1,..., k. Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 8
9 Step 1: Finding candidate partitions 1. Data matrix X = (x gj ) samples j = 1,..., n and genes g = 1,..., k. 2. Compute average expression profiles of clusters of genes. = new data matrix Y = (y ij ) Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 8
10 Step 1: Finding candidate partitions 1. Data matrix X = (x gj ) samples j = 1,..., n and genes g = 1,..., k. 2. Compute average expression profiles of clusters of genes. = new data matrix Y = (y ij ) 3. For every gene cluster i and every sample j : the value y ij defines a bipartition given by the subsets M = {j y ij y ij } and M + = {j y ij > y ij }. Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 8
11 Step 1: Finding candidate partitions 1. Data matrix X = (x gj ) samples j = 1,..., n and genes g = 1,..., k. 2. Compute average expression profiles of clusters of genes. = new data matrix Y = (y ij ) 3. For every gene cluster i and every sample j : the value y ij defines a bipartition given by the subsets M = {j y ij y ij } and M + = {j y ij > y ij }. 4. Order splits by value of two-sample t-statistic 5. Choose highest scoring splits as candidates. Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 8
12 Step 2: Feature selection For each split B: select the 50 genes with highest absolute value of the two sample t statistic t g (B) t g (B) = µ gm µ g M (m 1)σgM 2 + ( m 1)σ2 g M m m(n 2) n, where m = M and m = n m. Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 9
13 Step 3: Local Search for Maxima Greedy Search: Start at B choose the neighboring split with the highest SVM score continue until a local maximum is reached Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 10
14 Output: Leukemia dataset S(J) AML T cell ALL B cell ALL Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 11
15 Output: Melanoma dataset S(J) clustered samples 12 other samples Florian Markowetz, Class Discovery in Gene Expression Data, GfKl 2002, Mannheim 12
Lecture #4: Overabundance Analysis and Class Discovery
236632 Topics in Microarray Data nalysis Winter 2004-5 November 15, 2004 Lecture #4: Overabundance nalysis and Class Discovery Lecturer: Doron Lipson Scribes: Itai Sharon & Tomer Shiran 1 Differentially
More informationGene expression analysis. Roadmap. Microarray technology: how it work Applications: what can we do with it Preprocessing: Classification Clustering
Gene expression analysis Roadmap Microarray technology: how it work Applications: what can we do with it Preprocessing: Image processing Data normalization Classification Clustering Biclustering 1 Gene
More informationComparison of discrimination methods for the classification of tumors using gene expression data
Comparison of discrimination methods for the classification of tumors using gene expression data Sandrine Dudoit, Jane Fridlyand 2 and Terry Speed 2,. Mathematical Sciences Research Institute, Berkeley
More informationGene Selection for Tumor Classification Using Microarray Gene Expression Data
Gene Selection for Tumor Classification Using Microarray Gene Expression Data K. Yendrapalli, R. Basnet, S. Mukkamala, A. H. Sung Department of Computer Science New Mexico Institute of Mining and Technology
More informationClassification. Methods Course: Gene Expression Data Analysis -Day Five. Rainer Spang
Classification Methods Course: Gene Expression Data Analysis -Day Five Rainer Spang Ms. Smith DNA Chip of Ms. Smith Expression profile of Ms. Smith Ms. Smith 30.000 properties of Ms. Smith The expression
More informationApplying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem
Oral Presentation at MIE 2011 30th August 2011 Oslo Applying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem Kirsi
More informationEfficient Classification of Cancer using Support Vector Machines and Modified Extreme Learning Machine based on Analysis of Variance Features
American Journal of Applied Sciences 8 (12): 1295-1301, 2011 ISSN 1546-9239 2011 Science Publications Efficient Classification of Cancer using Support Vector Machines and Modified Extreme Learning Machine
More informationA hierarchical two-phase framework for selecting genes in cancer datasets with a neuro-fuzzy system
Technology and Health Care 24 (2016) S601 S605 DOI 10.3233/THC-161187 IOS Press S601 A hierarchical two-phase framework for selecting genes in cancer datasets with a neuro-fuzzy system Jongwoo Lim, Bohyun
More informationT. R. Golub, D. K. Slonim & Others 1999
T. R. Golub, D. K. Slonim & Others 1999 Big Picture in 1999 The Need for Cancer Classification Cancer classification very important for advances in cancer treatment. Cancers of Identical grade can have
More informationNearest Shrunken Centroid as Feature Selection of Microarray Data
Nearest Shrunken Centroid as Feature Selection of Microarray Data Myungsook Klassen Computer Science Department, California Lutheran University 60 West Olsen Rd, Thousand Oaks, CA 91360 mklassen@clunet.edu
More informationResearch Supervised clustering of genes Marcel Dettling and Peter Bühlmann
http://genomebiology.com/22/3/2/research/69. Research Supervised clustering of genes Marcel Dettling and Peter Bühlmann Address: Seminar für Statistik, Eidgenössische Technische Hochschule (ETH) Zürich,
More informationHybridized KNN and SVM for gene expression data classification
Mei, et al, Hybridized KNN and SVM for gene expression data classification Hybridized KNN and SVM for gene expression data classification Zhen Mei, Qi Shen *, Baoxian Ye Chemistry Department, Zhengzhou
More informationClassifica4on. CSCI1950 Z Computa4onal Methods for Biology Lecture 18. Ben Raphael April 8, hip://cs.brown.edu/courses/csci1950 z/
CSCI1950 Z Computa4onal Methods for Biology Lecture 18 Ben Raphael April 8, 2009 hip://cs.brown.edu/courses/csci1950 z/ Binary classifica,on Given a set of examples (x i, y i ), where y i = + 1, from unknown
More informationStatistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes.
Final review Based in part on slides from textbook, slides of Susan Holmes December 5, 2012 1 / 1 Final review Overview Before Midterm General goals of data mining. Datatypes. Preprocessing & dimension
More informationIntroduction to Discrimination in Microarray Data Analysis
Introduction to Discrimination in Microarray Data Analysis Jane Fridlyand CBMB University of California, San Francisco Genentech Hall Auditorium, Mission Bay, UCSF October 23, 2004 1 Case Study: Van t
More informationPredictive Biomarkers
Uğur Sezerman Evolutionary Selection of Near Optimal Number of Features for Classification of Gene Expression Data Using Genetic Algorithms Predictive Biomarkers Biomarker: A gene, protein, or other change
More informationEfficacy of the Extended Principal Orthogonal Decomposition Method on DNA Microarray Data in Cancer Detection
202 4th International onference on Bioinformatics and Biomedical Technology IPBEE vol.29 (202) (202) IASIT Press, Singapore Efficacy of the Extended Principal Orthogonal Decomposition on DA Microarray
More informationGene-microRNA network module analysis for ovarian cancer
Gene-microRNA network module analysis for ovarian cancer Shuqin Zhang School of Mathematical Sciences Fudan University Oct. 4, 2016 Outline Introduction Materials and Methods Results Conclusions Introduction
More informationEvaluation of Gene Selection Using Support Vector Machine Recursive Feature Elimination
Evaluation of Gene Selection Using Support Vector Machine Recursive Feature Elimination Committee: Advisor: Dr. Rosemary Renaut Dr. Adrienne C. Scheck Dr. Kenneth Hoober Dr. Bradford Kirkman-Liff John
More informationPredicting Sleep Using Consumer Wearable Sensing Devices
Predicting Sleep Using Consumer Wearable Sensing Devices Miguel A. Garcia Department of Computer Science Stanford University Palo Alto, California miguel16@stanford.edu 1 Introduction In contrast to the
More informationEvaluating Classifiers for Disease Gene Discovery
Evaluating Classifiers for Disease Gene Discovery Kino Coursey Lon Turnbull khc0021@unt.edu lt0013@unt.edu Abstract Identification of genes involved in human hereditary disease is an important bioinfomatics
More informationA Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer
A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer Hautaniemi, Sampsa; Ringnér, Markus; Kauraniemi, Päivikki; Kallioniemi, Anne; Edgren, Henrik; Yli-Harja, Olli; Astola,
More informationInformative Gene Selection for Leukemia Cancer Using Weighted K-Means Clustering
IOSR Journal of Pharmacy and Biological Sciences (IOSR-JPBS) e-issn: 2278-3008, p-issn:2319-7676. Volume 9, Issue 4 Ver. V (Jul -Aug. 2014), PP 12-16 Informative Gene Selection for Leukemia Cancer Using
More informationColon cancer subtypes from gene expression data
Colon cancer subtypes from gene expression data Nathan Cunningham Giuseppe Di Benedetto Sherman Ip Leon Law Module 6: Applied Statistics 26th February 2016 Aim Replicate findings of Felipe De Sousa et
More informationPredicting Breast Cancer Recurrence Using Machine Learning Techniques
Predicting Breast Cancer Recurrence Using Machine Learning Techniques Umesh D R Department of Computer Science & Engineering PESCE, Mandya, Karnataka, India Dr. B Ramachandra Department of Electrical and
More informationPredicting Breast Cancer Survival Using Treatment and Patient Factors
Predicting Breast Cancer Survival Using Treatment and Patient Factors William Chen wchen808@stanford.edu Henry Wang hwang9@stanford.edu 1. Introduction Breast cancer is the leading type of cancer in women
More information7.1 Grading Diabetic Retinopathy
Chapter 7 DIABETIC RETINOPATHYGRADING -------------------------------------------------------------------------------------------------------------------------------------- A consistent approach to the
More informationKnowledge Discovery and Data Mining I
Ludwig-Maximilians-Universität München Lehrstuhl für Datenbanksysteme und Data Mining Prof. Dr. Thomas Seidl Knowledge Discovery and Data Mining I Winter Semester 2018/19 Introduction What is an outlier?
More informationClassification with microarray data
Classification with microarray data Aron Charles Eklund eklund@cbs.dtu.dk DNA Microarray Analysis - #27612 January 8, 2010 The rest of today Now: What is classification, and why do we do it? How to develop
More informationCANCER DIAGNOSIS USING DATA MINING TECHNOLOGY
CANCER DIAGNOSIS USING DATA MINING TECHNOLOGY Muhammad Shahbaz 1, Shoaib Faruq 2, Muhammad Shaheen 1, Syed Ather Masood 2 1 Department of Computer Science and Engineering, UET, Lahore, Pakistan Muhammad.Shahbaz@gmail.com,
More informationClassification of cancer profiles. ABDBM Ron Shamir
Classification of cancer profiles 1 Background: Cancer Classification Cancer classification is central to cancer treatment; Traditional cancer classification methods: location; morphology, cytogenesis;
More informationData mining for Obstructive Sleep Apnea Detection. 18 October 2017 Konstantinos Nikolaidis
Data mining for Obstructive Sleep Apnea Detection 18 October 2017 Konstantinos Nikolaidis Introduction: What is Obstructive Sleep Apnea? Obstructive Sleep Apnea (OSA) is a relatively common sleep disorder
More informationA Biclustering Based Classification Framework for Cancer Diagnosis and Prognosis
A Biclustering Based Classification Framework for Cancer Diagnosis and Prognosis Baljeet Malhotra and Guohui Lin Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada T6G 2E8
More informationWen et al. (1998) PNAS, 95:
Large-scale temporal gene expression mapping of central nervous system development Fluctuations in mrna expression of 2 genes during rat central nervous system development, focusing on the cervical spinal
More informationA COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION
5-9 JATIT. All rights reserved. A COMBINATORY ALGORITHM OF UNIVARIATE AND MULTIVARIATE GENE SELECTION 1 H. Mahmoodian, M. Hamiruce Marhaban, 3 R. A. Rahim, R. Rosli, 5 M. Iqbal Saripan 1 PhD student, Department
More informationData complexity measures for analyzing the effect of SMOTE over microarrays
ESANN 216 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. Bruges (Belgium), 27-29 April 216, i6doc.com publ., ISBN 978-2878727-8. Data complexity
More informationTissue Classification Based on Gene Expression Data
Chapter 6 Tissue Classification Based on Gene Expression Data Many diseases result from complex interactions involving numerous genes. Previously, these gene interactions have been commonly studied separately.
More informationMRI Image Processing Operations for Brain Tumor Detection
MRI Image Processing Operations for Brain Tumor Detection Prof. M.M. Bulhe 1, Shubhashini Pathak 2, Karan Parekh 3, Abhishek Jha 4 1Assistant Professor, Dept. of Electronics and Telecommunications Engineering,
More informationCancer Gene Extraction Based on Stepwise Regression
Mathematical Computation Volume 5, 2016, PP.6-10 Cancer Gene Extraction Based on Stepwise Regression Jie Ni 1, Fan Wu 1, Meixiang Jin 1, Yixing Bai 1, Yunfei Guo 1 1. Mathematics Department, Yanbian University,
More informationMachine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017
Machine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017 A.K.A. Artificial Intelligence Unsupervised learning! Cluster analysis Patterns, Clumps, and Joining
More informationFUZZY C-MEANS AND ENTROPY BASED GENE SELECTION BY PRINCIPAL COMPONENT ANALYSIS IN CANCER CLASSIFICATION
FUZZY C-MEANS AND ENTROPY BASED GENE SELECTION BY PRINCIPAL COMPONENT ANALYSIS IN CANCER CLASSIFICATION SOMAYEH ABBASI, HAMID MAHMOODIAN Department of Electrical Engineering, Najafabad branch, Islamic
More informationA Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China
A Vision-based Affective Computing System Jieyu Zhao Ningbo University, China Outline Affective Computing A Dynamic 3D Morphable Model Facial Expression Recognition Probabilistic Graphical Models Some
More informationsimulatorapms October 25, 2011 An example of an ISI using apcomplex on simex
simulatorapms October 25, 2011 APComEX An example of an ISI using apcomplex on simex Using simex, vbaitsex, no deformed nor sticky baits, APComEX is an example output of ap- Complex. This is the bipartite
More informationL. Ziaei MS*, A. R. Mehri PhD**, M. Salehi PhD***
Received: 1/16/2004 Accepted: 8/1/2005 Original Article Application of Artificial Neural Networks in Cancer Classification and Diagnosis Prediction of a Subtype of Lymphoma Based on Gene Expression Profile
More informationAn Improved Algorithm To Predict Recurrence Of Breast Cancer
An Improved Algorithm To Predict Recurrence Of Breast Cancer Umang Agrawal 1, Ass. Prof. Ishan K Rajani 2 1 M.E Computer Engineer, Silver Oak College of Engineering & Technology, Gujarat, India. 2 Assistant
More informationTHE data used in this project is provided. SEIZURE forecasting systems hold promise. Seizure Prediction from Intracranial EEG Recordings
1 Seizure Prediction from Intracranial EEG Recordings Alex Fu, Spencer Gibbs, and Yuqi Liu 1 INTRODUCTION SEIZURE forecasting systems hold promise for improving the quality of life for patients with epilepsy.
More informationBIOINFORMATICS ORIGINAL PAPER
BIOINFORMATICS ORIGINAL PAPER Vol. 2 no. 4 25, pages 34 32 doi:.93/bioinformatics/bti483 Gene expression Ensemble dependence model for classification and prediction of cancer and normal gene expression
More informationEmpirical Mode Decomposition based Feature Extraction Method for the Classification of EEG Signal
Empirical Mode Decomposition based Feature Extraction Method for the Classification of EEG Signal Anant kulkarni MTech Communication Engineering Vellore Institute of Technology Chennai, India anant8778@gmail.com
More informationNature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.
Supplementary Figure 1 Behavioral training. a, Mazes used for behavioral training. Asterisks indicate reward location. Only some example mazes are shown (for example, right choice and not left choice maze
More informationGender Based Emotion Recognition using Speech Signals: A Review
50 Gender Based Emotion Recognition using Speech Signals: A Review Parvinder Kaur 1, Mandeep Kaur 2 1 Department of Electronics and Communication Engineering, Punjabi University, Patiala, India 2 Department
More informationCS 453X: Class 18. Jacob Whitehill
CS 453X: Class 18 Jacob Whitehill More on k-means Exercise: Empty clusters (1) Assume that a set of distinct data points { x (i) } are initially assigned so that none of the k clusters is empty. How can
More informationA novel approach to feature extraction from classification models based on information gene pairs
Pattern Recognition 41 (2008) 1975 1984 www.elsevier.com/locate/pr A novel approach to feature extraction from classification models based on information gene pairs J. Li, X. Tang, J. Liu, J. Huang, Y.
More informationPart [2.1]: Evaluation of Markers for Treatment Selection Linking Clinical and Statistical Goals
Part [2.1]: Evaluation of Markers for Treatment Selection Linking Clinical and Statistical Goals Patrick J. Heagerty Department of Biostatistics University of Washington 174 Biomarkers Session Outline
More informationVariable Features Selection for Classification of Medical Data using SVM
Variable Features Selection for Classification of Medical Data using SVM Monika Lamba USICT, GGSIPU, Delhi, India ABSTRACT: The parameters selection in support vector machines (SVM), with regards to accuracy
More informationHYBRID SUPPORT VECTOR MACHINE BASED MARKOV CLUSTERING FOR TUMOR DETECTION FROM BIO-MOLECULAR DATA
HYBRID SUPPORT VECTOR MACHINE BASED MARKOV CLUSTERING FOR TUMOR DETECTION FROM BIO-MOLECULAR DATA S. SubashChandraBose 1 and T. Christopher 2 1 Department of Computer Science, PG and Research Department,
More informationVL Network Analysis ( ) SS2016 Week 3
VL Network Analysis (19401701) SS2016 Week 3 Based on slides by J Ruan (U Texas) Tim Conrad AG Medical Bioinformatics Institut für Mathematik & Informatik, Freie Universität Berlin 1 Motivation 2 Lecture
More informationPAIRED AND UNPAIRED COMPARISON AND CLUSTERING WITH GENE EXPRESSION DATA
Statistica Sinica 12(2002), 87-110 PAIRED AND UNPAIRED COMPARISON AND CLUSTERING WITH GENE EXPRESSION DATA Jenny Bryan 1, Katherine S. Pollard 2 and Mark J. van der Laan 2 1 University of British Columbia
More informationData Mining in Bioinformatics Day 4: Text Mining
Data Mining in Bioinformatics Day 4: Text Mining Karsten Borgwardt February 25 to March 10 Bioinformatics Group MPIs Tübingen Karsten Borgwardt: Data Mining in Bioinformatics, Page 1 What is text mining?
More informationComparing Multifunctionality and Association Information when Classifying Oncogenes and Tumor Suppressor Genes
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationMulticlass microarray data classification based on confidence evaluation
Methodology Multiclass microarray data classification based on confidence evaluation H.L. Yu 1, S. Gao 1, B. Qin 1 and J. Zhao 2 1 School of Computer Science and Engineering, Jiangsu University of Science
More informationEnhanced Detection of Lung Cancer using Hybrid Method of Image Segmentation
Enhanced Detection of Lung Cancer using Hybrid Method of Image Segmentation L Uma Maheshwari Department of ECE, Stanley College of Engineering and Technology for Women, Hyderabad - 500001, India. Udayini
More informationDIABETIC RISK PREDICTION FOR WOMEN USING BOOTSTRAP AGGREGATION ON BACK-PROPAGATION NEURAL NETWORKS
International Journal of Computer Engineering & Technology (IJCET) Volume 9, Issue 4, July-Aug 2018, pp. 196-201, Article IJCET_09_04_021 Available online at http://www.iaeme.com/ijcet/issues.asp?jtype=ijcet&vtype=9&itype=4
More information38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16
38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 PGAR: ASD Candidate Gene Prioritization System Using Expression Patterns Steven Cogill and Liangjiang Wang Department of Genetics and
More information14. Linear Mixed-Effects Models for Data from Split-Plot Experiments
14. Linear Mixed-Effects Models for Data from Split-Plot Experiments opyright c 219 Dan Nettleton (Iowa State University) 14. Statistics 51 1 / 3 Start with a Field Field opyright c 219 Dan Nettleton (Iowa
More informationBrain Tumor segmentation and classification using Fcm and support vector machine
Brain Tumor segmentation and classification using Fcm and support vector machine Gaurav Gupta 1, Vinay singh 2 1 PG student,m.tech Electronics and Communication,Department of Electronics, Galgotia College
More informationSVM-based Discriminative Accumulation Scheme for Place Recognition
SVM-based Discriminative Accumulation Scheme for Place Recognition Andrzej Pronobis CAS/CVAP, KTH Stockholm, Sweden pronobis@csc.kth.se Óscar Martínez Mozos AIS, University Of Freiburg Freiburg, Germany
More informationVega: Variational Segmentation for Copy Number Detection
Vega: Variational Segmentation for Copy Number Detection Sandro Morganella Luigi Cerulo Giuseppe Viglietto Michele Ceccarelli Contents 1 Overview 1 2 Installation 1 3 Vega.RData Description 2 4 Run Vega
More informationUtilizing Posterior Probability for Race-composite Age Estimation
Utilizing Posterior Probability for Race-composite Age Estimation Early Applications to MORPH-II Benjamin Yip NSF-REU in Statistical Data Mining and Machine Learning for Computer Vision and Pattern Recognition
More informationRoadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23:
Roadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23:7332-7341 Presented by Deming Mi 7/25/2006 Major reasons for few prognostic factors to
More informationInternational Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017
RESEARCH ARTICLE Classification of Cancer Dataset in Data Mining Algorithms Using R Tool P.Dhivyapriya [1], Dr.S.Sivakumar [2] Research Scholar [1], Assistant professor [2] Department of Computer Science
More informationCase Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD
Case Studies on High Throughput Gene Expression Data Kun Huang, PhD Raghu Machiraju, PhD Department of Biomedical Informatics Department of Computer Science and Engineering The Ohio State University Review
More informationThe Handling of Disjunctive Fuzzy Information via Neural Network Approach
From: AAAI Technical Report FS-97-04. Compilation copyright 1997, AAAI (www.aaai.org). All rights reserved. The Handling of Disjunctive Fuzzy Information via Neural Network Approach Hahn-Ming Lee, Kuo-Hsiu
More informationSubLasso:a feature selection and classification R package with a. fixed feature subset
SubLasso:a feature selection and classification R package with a fixed feature subset Youxi Luo,3,*, Qinghan Meng,2,*, Ruiquan Ge,2, Guoqin Mai, Jikui Liu, Fengfeng Zhou,#. Shenzhen Institutes of Advanced
More informationIdentification of Tissue Independent Cancer Driver Genes
Identification of Tissue Independent Cancer Driver Genes Alexandros Manolakos, Idoia Ochoa, Kartik Venkat Supervisor: Olivier Gevaert Abstract Identification of genomic patterns in tumors is an important
More informationRadiotherapy Outcomes
in partnership with Outcomes Models with Machine Learning Sarah Gulliford PhD Division of Radiotherapy & Imaging sarahg@icr.ac.uk AAPM 31 st July 2017 Making the discoveries that defeat cancer Radiotherapy
More information10CS664: PATTERN RECOGNITION QUESTION BANK
10CS664: PATTERN RECOGNITION QUESTION BANK Assignments would be handed out in class as well as posted on the class blog for the course. Please solve the problems in the exercises of the prescribed text
More informationDiagnosis of multiple cancer types by shrunken centroids of gene expression
Diagnosis of multiple cancer types by shrunken centroids of gene expression Robert Tibshirani, Trevor Hastie, Balasubramanian Narasimhan, and Gilbert Chu PNAS 99:10:6567-6572, 14 May 2002 Nearest Centroid
More informationData analysis and binary regression for predictive discrimination. using DNA microarray data. (Breast cancer) discrimination. Expression array data
West Mike of Statistics & Decision Sciences Institute Duke University wwwstatdukeedu IPAM Functional Genomics Workshop November Two group problems: Binary outcomes ffl eg, ER+ versus ER ffl eg, lymph node
More informationPredicting Kidney Cancer Survival from Genomic Data
Predicting Kidney Cancer Survival from Genomic Data Christopher Sauer, Rishi Bedi, Duc Nguyen, Benedikt Bünz Abstract Cancers are on par with heart disease as the leading cause for mortality in the United
More informationSCIENCE & TECHNOLOGY
Pertanika J. Sci. & Technol. 25 (S): 241-254 (2017) SCIENCE & TECHNOLOGY Journal homepage: http://www.pertanika.upm.edu.my/ Fuzzy Lambda-Max Criteria Weight Determination for Feature Selection in Clustering
More informationGIANT: Geo-Informative Attributes for Location Recognition and Exploration
GIANT: Geo-Informative Attributes for Location Recognition and Exploration Quan Fang, Jitao Sang, Changsheng Xu Institute of Automation, Chinese Academy of Sciences October 23, 2013 Where is this? La Sagrada
More informationTissue Classi cation with Gene Expression Pro les ABSTRACT
JOURNAL OF COMPUTATIONAL BIOLOGY Volume 7, Numbers 3/4, 2000 Mary Ann Liebert, Inc. Pp. 559 583 Tissue Classi cation with Gene Expression Pro les AMIR BEN-DOR, 1 LAURAKAY BRUHN, 1 NIR FRIEDMAN, 2 IFTACH
More informationA Fuzzy Improved Neural based Soft Computing Approach for Pest Disease Prediction
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 13 (2014), pp. 1335-1341 International Research Publications House http://www. irphouse.com A Fuzzy Improved
More informationCase Studies of Signed Networks
Case Studies of Signed Networks Christopher Wang December 10, 2014 Abstract Many studies on signed social networks focus on predicting the different relationships between users. However this prediction
More informationModeling Sentiment with Ridge Regression
Modeling Sentiment with Ridge Regression Luke Segars 2/20/2012 The goal of this project was to generate a linear sentiment model for classifying Amazon book reviews according to their star rank. More generally,
More informationLarge-Scale Statistical Modelling via Machine Learning Classifiers
J. Stat. Appl. Pro. 2, No. 3, 203-222 (2013) 203 Journal of Statistics Applications & Probability An International Journal http://dx.doi.org/10.12785/jsap/020303 Large-Scale Statistical Modelling via Machine
More informationGene Expression Based Leukemia Sub Classification Using Committee Neural Networks
Bioinformatics and Biology Insights M e t h o d o l o g y Open Access Full open access to this and thousands of other papers at http://www.la-press.com. Gene Expression Based Leukemia Sub Classification
More informationPackage propoverlap. R topics documented: February 20, Type Package
Type Package Package propoverlap February 20, 2015 Title Feature (gene) selection based on the Proportional Overlapping Scores Version 1.0 Date 2014-09-15 Author Osama Mahmoud, Andrew Harrison, Aris Perperoglou,
More informationIntroduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018
Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this
More informationPerformance Based Evaluation of Various Machine Learning Classification Techniques for Chronic Kidney Disease Diagnosis
Performance Based Evaluation of Various Machine Learning Classification Techniques for Chronic Kidney Disease Diagnosis Sahil Sharma Department of Computer Science & IT University Of Jammu Jammu, India
More informationCOMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION
Journal of Engineering Science and Technology Vol. 11, No. 9 (2016) 1221-1233 School of Engineering, Taylor s University COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION
More informationDetection and Classification of Leukaemia using Artificial Neural Network
Detection and Classification of Leukaemia using Artificial Neural Network Prasidhi G. Fal Desai 1, Geeta Shet 2 1, 2 Department of Electronics and Telecommunication, Goa College of Engineering Abstract:
More informationBiomarker adaptive designs in clinical trials
Review Article Biomarker adaptive designs in clinical trials James J. Chen 1, Tzu-Pin Lu 1,2, Dung-Tsa Chen 3, Sue-Jane Wang 4 1 Division of Bioinformatics and Biostatistics, National Center for Toxicological
More informationCSE 255 Assignment 9
CSE 255 Assignment 9 Alexander Asplund, William Fedus September 25, 2015 1 Introduction In this paper we train a logistic regression function for two forms of link prediction among a set of 244 suspected
More informationDesign of Multi-Class Classifier for Prediction of Diabetes using Linear Support Vector Machine
Design of Multi-Class Classifier for Prediction of Diabetes using Linear Support Vector Machine Akshay Joshi Anum Khan Omkar Kulkarni Department of Computer Engineering Department of Computer Engineering
More informationIntegrative Methods for Gene Data Analysis and Knowledge Discovery on the Case Study of KEDRI s Brain Gene Ontology. Yuepeng Wang
Integrative Methods for Gene Data Analysis and Knowledge Discovery on the Case Study of KEDRI s Brain Gene Ontology Yuepeng Wang A thesis submitted to Auckland University of Technology in partial fulfilment
More informationBREAST CANCER EARLY DETECTION USING X RAY IMAGES
Volume 119 No. 15 2018, 399-405 ISSN: 1314-3395 (on-line version) url: http://www.acadpubl.eu/hub/ http://www.acadpubl.eu/hub/ BREAST CANCER EARLY DETECTION USING X RAY IMAGES Kalaichelvi.K 1,Aarthi.R
More informationShu Kong. Department of Computer Science, UC Irvine
Ubiquitous Fine-Grained Computer Vision Shu Kong Department of Computer Science, UC Irvine Outline 1. Problem definition 2. Instantiation 3. Challenge 4. Fine-grained classification with holistic representation
More informationInferring Biological Meaning from Cap Analysis Gene Expression Data
Inferring Biological Meaning from Cap Analysis Gene Expression Data HRYSOULA PAPADAKIS 1. Introduction This project is inspired by the recent development of the Cap analysis gene expression (CAGE) method,
More informationA Comparison of Collaborative Filtering Methods for Medication Reconciliation
A Comparison of Collaborative Filtering Methods for Medication Reconciliation Huanian Zheng, Rema Padman, Daniel B. Neill The H. John Heinz III College, Carnegie Mellon University, Pittsburgh, PA, 15213,
More information