Diagnosis of multiple cancer types by shrunken centroids of gene expression
|
|
- Geraldine Phelps
- 6 years ago
- Views:
Transcription
1 Diagnosis of multiple cancer types by shrunken centroids of gene expression Robert Tibshirani, Trevor Hastie, Balasubramanian Narasimhan, and Gilbert Chu PNAS 99:10: , 14 May 2002
2 Nearest Centroid Classifier Calculate a centroid for each class x ik = j Ck x ij /n k Calculation of the distance between a test sample and each class centroid Class prediction by squared distance d 2 (x *,x k) = (x * i x ik) 2 i
3 'Nearest Shrunken Centroid' General Idea: Shrink by 'threshold' amount Advantages: Reduce noise Gene selection
4 Class Centroids Mean expression of gene i in class k: x ik = j Ck x ij /n k i th component of overall centroid x i = n x /n j=1 ij
5 Normalize Standard Deviation Let d ik = (x ik x i) m k (s i + s 0 ) where s i is pooled within-class standard deviation for gene i: and s i 2 = 1 n K K (x ij x ik) 2 k=1 j C k m k = 1/n k 1/n
6 Shrink the d ik by Soft-Thresholding New shrunken centroids: x'ik = x i + m k (s i + s 0 )d' ik d' ik = sign(d ik )( d ik ) +
7 Soft vs. Hard Thresholding Hard thresholding: d' ik = d ik I( d ik > ) More jumpy Higher minimum test error Higher gene expression estimation error
8 Contrasting the Shrunken Centroids Tibshirani et al, Stat Sci (2003)
9 Choose? by Cross-Validation Use k-fold cross validation Divide data into k roughly equal parts Fit model for many values of? and use CV to determine error Choose value of? that gives smallest error Note: Assume a separate test set
10 Linear Discriminant Analysis δ k LDA (x * ) = (x * x k) T W 1 (x * x k) 2logπ k Compute distance to centroids W is pooled within-class covariance matrix Shrunken centroid method can be seen as 'a heavily restricted form of LDA, necessary to cope with the large number of variables (genes)'
11 LDA vs. Nearest Centroid Equivalent if within-class covariance matrix is restricted to diagonal and prior is ignored Relative performance depends on correlation structure of the samples Tibshirani et al, Stat Sci (2003)
12 Class Probabilities and Discriminant Functions Purpose: Correct for relative numbers of samples in each class Expression levels: x * = (x 1 *, x 2 *,..., x p * ) Discriminant Score: δ k (x * ) = p i=1 (x i * x' ik ) 2 (s i + s 0 ) 2 2logπ k
13 Class Probabilities and Discriminant Functions New classification rule C(x * ) = l if δ l (x * ) = minδ k (x * ) k Gaussian Linear Discriminant Analysis pk (x* ) = e (1/ 2)δ k (x * ) K e (1/ 2)δ l (x * ) l=1
14 Adaptive Threshold Scaling Define scaling vector (θ 1, θ 2,, θ K ) Include in d ik : d ik = x ik x i m k θ k s i Adaptive procedure: start with all θ k =1, reduce by 10% for class k with largest training error, repeat Can dramatically reduce total number of genes used without increasing error rate
15 Overall Model Predictive Analysis of Microarrays Typically accurate classifier Minimizes number of genes and error Results simple to understand Software available at:
16 Diagnosis of multiple cancer types by shrunken centroids of gene expression
17 Goal 'Classify and predict the diagnostic category of a sample on the basis of its gene expression profile' Use a simple approach that performs well and is easy to interpret
18 Small Round Blue Cell Tumors Occur in children and young adults with a male preference Presents as a large mass within the abdomen, usually the pelvic region Aggressive tumor with a poor prognosis
19 Experimental Data Expression measurements on 2,308 genes from cdna microarrys 4 tumor classes: Burkitt lymphoma (BL) Ewing sarcoma (EWS) Neuroblastoma (NB) Rhabdomyosarcoma (RMS) 88 total samples 63 training samples 25 test samples (including 5 control samples)
20 'Reference 5' Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks Javed Khan, Jun S. Wei, Markus Ringner, Lao H. Saal, Marc Ladanyi, Frank Westermann, Frank Berthold, Manfred Schwab, Cristina R. Antonescu, Carsten Peterson, and Paul S. Meltzer Nature Medicine, June 2001: 7: 6: 673-9
21 The Artificial Neural Network
22 Khan et al Conclusions Development of a linear network Report 0% training and test errors Decided that 96 genes were 'important' 61 genes specifically expressed in a cancer type 41 had not been previously reported as associated with these diseases
23 Apply PAM Method Utilize nearest shrunken centroid method Eliminate noisy genes
24 Shrunken Centroids
25 Determination of? Soft thresholding using shrinkage parameter: d' ik = sign(d ik )( d ik ) + 10-fold cross validation: error minimized for? = 4.34 tr = training set cv = crossvalidated set te = test set
26 The Genes That Matter
27 The Genes That Matter
28 Heat Map Comparisons Tibshirani - 43 Genes Shrunken Centroid Method Khan - 96 Genes Artificial Neural Network Method
29 Class Probabilities Classified by True Class Classified by Predicted Class 'All 63 of the training samples and all 20 of the test samples known to be SRBCT are correctly classified'
30 Findings 43 important genes identified 27 also found by neural network method 1 of 8 presently considered to be diagnostic for SBRCTs Discusses other genes that play oncogenic roles
31 Conclusions 'The method of nearest shrunken centroids was successful in finding genes that accurately predict classes' 'The efficiency of our method in finding a relatively small number of predictive genes will facilitate the search for new diagnostic tools' 'The success of our methodology has implications for improving the diagnosis of cancer'
32 Leukemia Example Another example of shrunken centroid classification Data presents a 2-class problem 7,129 total genes and 34 total samples 20 acute lymphocytic leukemia (ALL) 14 acute mylogenous leukemia (AML) Data has previously been classified by Golub et al using a linear scoring procedure
33 Golub Gene Selection Correlation Measure: Weighted Vote: i = x i1 x i2 s i1 + s i2 G(x * ) = i x * i x i1 + x i2 2 i S(m ) = i S(m) (x i1 x i2 )x i * s i1 + s i2 i S(m) (x i1 x i2) s i1 + s i2 (x i1 + x i2) 2
34 2-Class Discriminant Scores Original discriminant score equation: δ k (x * ) = p i=1 2 class equation: (x i * x' ik ) 2 (s i + s 0 ) 2 2logπ k l(x * ) =? δ 1 (x * )? δ 2 (x * ) = i S( ) * ( x i1 x i2)x i (s i + s 0 ) 2 i S( ) ( x i1 x i2) ( x i1 (s i + s 0 ) 2 2 x i2) + log π 1 π 2
35 Methodological Comparison Variance vs. Standard Deviation Hard vs. soft thresholding Cross-validation (m vs.? ) Added features
36 Leukemia Classification tr = training set cv = crossvalidated set te = test set
37 Findings Significant genes shrunk from 50 to 21 Halved test error Some incorporation of marker genes
38 Overall Method Conclusions Classifier is potentially useful in high-dimensional classification problems Straightforward computations Simultaneous minimizing of error and number of genes used Questionable potential for gene identification Can also be applied in conjunction with unsupervised methods
Nearest Shrunken Centroid as Feature Selection of Microarray Data
Nearest Shrunken Centroid as Feature Selection of Microarray Data Myungsook Klassen Computer Science Department, California Lutheran University 60 West Olsen Rd, Thousand Oaks, CA 91360 mklassen@clunet.edu
More informationClassification. Methods Course: Gene Expression Data Analysis -Day Five. Rainer Spang
Classification Methods Course: Gene Expression Data Analysis -Day Five Rainer Spang Ms. Smith DNA Chip of Ms. Smith Expression profile of Ms. Smith Ms. Smith 30.000 properties of Ms. Smith The expression
More informationCANCER DIAGNOSIS USING DATA MINING TECHNOLOGY
CANCER DIAGNOSIS USING DATA MINING TECHNOLOGY Muhammad Shahbaz 1, Shoaib Faruq 2, Muhammad Shaheen 1, Syed Ather Masood 2 1 Department of Computer Science and Engineering, UET, Lahore, Pakistan Muhammad.Shahbaz@gmail.com,
More informationComparison of discrimination methods for the classification of tumors using gene expression data
Comparison of discrimination methods for the classification of tumors using gene expression data Sandrine Dudoit, Jane Fridlyand 2 and Terry Speed 2,. Mathematical Sciences Research Institute, Berkeley
More informationIntroduction to Discrimination in Microarray Data Analysis
Introduction to Discrimination in Microarray Data Analysis Jane Fridlyand CBMB University of California, San Francisco Genentech Hall Auditorium, Mission Bay, UCSF October 23, 2004 1 Case Study: Van t
More informationGene expression analysis. Roadmap. Microarray technology: how it work Applications: what can we do with it Preprocessing: Classification Clustering
Gene expression analysis Roadmap Microarray technology: how it work Applications: what can we do with it Preprocessing: Image processing Data normalization Classification Clustering Biclustering 1 Gene
More informationT. R. Golub, D. K. Slonim & Others 1999
T. R. Golub, D. K. Slonim & Others 1999 Big Picture in 1999 The Need for Cancer Classification Cancer classification very important for advances in cancer treatment. Cancers of Identical grade can have
More informationRoadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23:
Roadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23:7332-7341 Presented by Deming Mi 7/25/2006 Major reasons for few prognostic factors to
More informationStatistical Applications in Genetics and Molecular Biology
Statistical Applications in Genetics and Molecular Biology Volume 8, Issue 1 2009 Article 13 Detecting Outlier Samples in Microarray Data Albert D. Shieh Yeung Sam Hung Harvard University, shieh@fas.harvard.edu
More informationStatistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes.
Final review Based in part on slides from textbook, slides of Susan Holmes December 5, 2012 1 / 1 Final review Overview Before Midterm General goals of data mining. Datatypes. Preprocessing & dimension
More informationReach and grasp by people with tetraplegia using a neurally controlled robotic arm
Leigh R. Hochberg et al. Reach and grasp by people with tetraplegia using a neurally controlled robotic arm Nature, 17 May 2012 Paper overview Ilya Kuzovkin 11 April 2014, Tartu etc How it works?
More informationAnalysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach
University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen
More informationMachine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017
Machine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017 A.K.A. Artificial Intelligence Unsupervised learning! Cluster analysis Patterns, Clumps, and Joining
More informationGene Selection for Tumor Classification Using Microarray Gene Expression Data
Gene Selection for Tumor Classification Using Microarray Gene Expression Data K. Yendrapalli, R. Basnet, S. Mukkamala, A. H. Sung Department of Computer Science New Mexico Institute of Mining and Technology
More informationApplication of Artificial Neural Networks in Classification of Autism Diagnosis Based on Gene Expression Signatures
Application of Artificial Neural Networks in Classification of Autism Diagnosis Based on Gene Expression Signatures 1 2 3 4 5 Kathleen T Quach Department of Neuroscience University of California, San Diego
More informationColon cancer subtypes from gene expression data
Colon cancer subtypes from gene expression data Nathan Cunningham Giuseppe Di Benedetto Sherman Ip Leon Law Module 6: Applied Statistics 26th February 2016 Aim Replicate findings of Felipe De Sousa et
More informationPart [2.1]: Evaluation of Markers for Treatment Selection Linking Clinical and Statistical Goals
Part [2.1]: Evaluation of Markers for Treatment Selection Linking Clinical and Statistical Goals Patrick J. Heagerty Department of Biostatistics University of Washington 174 Biomarkers Session Outline
More informationOutlier Analysis. Lijun Zhang
Outlier Analysis Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Extreme Value Analysis Probabilistic Models Clustering for Outlier Detection Distance-Based Outlier Detection Density-Based
More informationEfficacy of the Extended Principal Orthogonal Decomposition Method on DNA Microarray Data in Cancer Detection
202 4th International onference on Bioinformatics and Biomedical Technology IPBEE vol.29 (202) (202) IASIT Press, Singapore Efficacy of the Extended Principal Orthogonal Decomposition on DA Microarray
More informationResponse to Mease and Wyner, Evidence Contrary to the Statistical View of Boosting, JMLR 9:1 26, 2008
Journal of Machine Learning Research 9 (2008) 59-64 Published 1/08 Response to Mease and Wyner, Evidence Contrary to the Statistical View of Boosting, JMLR 9:1 26, 2008 Jerome Friedman Trevor Hastie Robert
More informationUvA-DARE (Digital Academic Repository)
UvA-DARE (Digital Academic Repository) A classification model for the Leiden proteomics competition Hoefsloot, H.C.J.; Berkenbos-Smit, S.; Smilde, A.K. Published in: Statistical Applications in Genetics
More informationResearch Supervised clustering of genes Marcel Dettling and Peter Bühlmann
http://genomebiology.com/22/3/2/research/69. Research Supervised clustering of genes Marcel Dettling and Peter Bühlmann Address: Seminar für Statistik, Eidgenössische Technische Hochschule (ETH) Zürich,
More informationMOST: detecting cancer differential gene expression
Biostatistics (2008), 9, 3, pp. 411 418 doi:10.1093/biostatistics/kxm042 Advance Access publication on November 29, 2007 MOST: detecting cancer differential gene expression HENG LIAN Division of Mathematical
More informationApplications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis
DSC 4/5 Multivariate Statistical Methods Applications DSC 4/5 Multivariate Statistical Methods Discriminant Analysis Identify the group to which an object or case (e.g. person, firm, product) belongs:
More informationBIOINFORMATICS ORIGINAL PAPER
BIOINFORMATICS ORIGINAL PAPER Vol. 21 no. 9 2005, pages 1979 1986 doi:10.1093/bioinformatics/bti294 Gene expression Estimating misclassification error with small samples via bootstrap cross-validation
More informationFrom Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1
From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Contents Dedication... iii Acknowledgments... xi About This Book... xiii About the Author... xvii Chapter 1: Introduction...
More informationReview: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections
Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections New: Bias-variance decomposition, biasvariance tradeoff, overfitting, regularization, and feature selection Yi
More informationClassification of cancer profiles. ABDBM Ron Shamir
Classification of cancer profiles 1 Background: Cancer Classification Cancer classification is central to cancer treatment; Traditional cancer classification methods: location; morphology, cytogenesis;
More informationClass discovery in Gene Expression Data: Characterizing Splits by Support Vector Machines
Class discovery in Gene Expression Data: Characterizing Splits by Support Vector Machines Florian Markowetz and Anja von Heydebreck Max-Planck-Institute for Molecular Genetics Computational Molecular Biology
More informationmicrornas (mirna) and Biomarkers
micrornas (mirna) and Biomarkers Small RNAs Make Big Splash mirnas & Genome Function Biomarkers in Cancer Future Prospects Javed Khan M.D. National Cancer Institute EORTC-NCI-ASCO November 2007 The Human
More informationGray level cooccurrence histograms via learning vector quantization
Gray level cooccurrence histograms via learning vector quantization Timo Ojala, Matti Pietikäinen and Juha Kyllönen Machine Vision and Media Processing Group, Infotech Oulu and Department of Electrical
More informationComments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al.
Comments on Significance of candidate cancer genes as assessed by the CaMP score by Parmigiani et al. Holger Höfling Gad Getz Robert Tibshirani June 26, 2007 1 Introduction Identifying genes that are involved
More informationStatistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies
Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies Stanford Biostatistics Workshop Pierre Neuvial with Henrik Bengtsson and Terry Speed Department of Statistics, UC Berkeley
More informationTissue Classification Based on Gene Expression Data
Chapter 6 Tissue Classification Based on Gene Expression Data Many diseases result from complex interactions involving numerous genes. Previously, these gene interactions have been commonly studied separately.
More informationA measure of the impact of CV incompleteness on prediction error estimation with application to PCA and normalization
Hornung et al. BMC Medical Research Methodology () :95 DOI 10.16/s4-0-0088-9 RESEARCH ARTICLE Open Access A measure of the impact of CV incompleteness on prediction error estimation with application to
More informationCancer Gene Extraction Based on Stepwise Regression
Mathematical Computation Volume 5, 2016, PP.6-10 Cancer Gene Extraction Based on Stepwise Regression Jie Ni 1, Fan Wu 1, Meixiang Jin 1, Yixing Bai 1, Yunfei Guo 1 1. Mathematics Department, Yanbian University,
More informationClassification with microarray data
Classification with microarray data Aron Charles Eklund eklund@cbs.dtu.dk DNA Microarray Analysis - #27612 January 8, 2010 The rest of today Now: What is classification, and why do we do it? How to develop
More informationEfficient Classification of Cancer using Support Vector Machines and Modified Extreme Learning Machine based on Analysis of Variance Features
American Journal of Applied Sciences 8 (12): 1295-1301, 2011 ISSN 1546-9239 2011 Science Publications Efficient Classification of Cancer using Support Vector Machines and Modified Extreme Learning Machine
More informationMachine Learning to Inform Breast Cancer Post-Recovery Surveillance
Machine Learning to Inform Breast Cancer Post-Recovery Surveillance Final Project Report CS 229 Autumn 2017 Category: Life Sciences Maxwell Allman (mallman) Lin Fan (linfan) Jamie Kang (kangjh) 1 Introduction
More informationUNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016
UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016 Exam policy: This exam allows one one-page, two-sided cheat sheet; No other materials. Time: 80 minutes. Be sure to write your name and
More informationModelling Spatially Correlated Survival Data for Individuals with Multiple Cancers
Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers Dipak K. Dey, Ulysses Diva and Sudipto Banerjee Department of Statistics University of Connecticut, Storrs. March 16,
More informationBoosted PRIM with Application to Searching for Oncogenic Pathway of Lung Cancer
Boosted PRIM with Application to Searching for Oncogenic Pathway of Lung Cancer Pei Wang Department of Statistics Stanford University Stanford, CA 94305 wp57@stanford.edu Young Kim, Jonathan Pollack Department
More informationApplication of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets
Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets Chih-Lin Chi a, W. Nick Street b, William H. Wolberg c a Health Informatics Program, University of Iowa b
More informationAn Efficient Diseases Classifier based on Microarray Datasets using Clustering ANOVA Extreme Learning Machine (CAELM)
www.ijcsi.org 8 An Efficient Diseases Classifier based on Microarray Datasets using Clustering ANOVA Extreme Learning Machine (CAELM) Shamsan Aljamali 1, Zhang Zuping 2 and Long Jun 3 1 School of Information
More informationIntelligent Edge Detector Based on Multiple Edge Maps. M. Qasim, W.L. Woon, Z. Aung. Technical Report DNA # May 2012
Intelligent Edge Detector Based on Multiple Edge Maps M. Qasim, W.L. Woon, Z. Aung Technical Report DNA #2012-10 May 2012 Data & Network Analytics Research Group (DNA) Computing and Information Science
More informationSmall Group Presentations
Admin Assignment 1 due next Tuesday at 3pm in the Psychology course centre. Matrix Quiz during the first hour of next lecture. Assignment 2 due 13 May at 10am. I will upload and distribute these at the
More informationFormulating Emotion Perception as a Probabilistic Model with Application to Categorical Emotion Classification
Formulating Emotion Perception as a Probabilistic Model with Application to Categorical Emotion Classification Reza Lotfian and Carlos Busso Multimodal Signal Processing (MSP) lab The University of Texas
More informationImprovised K-Means for Improving Raw-Data in Diagnosis of Thyroid Disease
Improvised K-Means for Improving Raw-Data in Diagnosis of Thyroid Disease D.P.Gaikwad, Kunal Mahurkar Asst. Professor, Department of Computer Engineering, All India Shree Shivaji Memorial Society s College
More informationMULTI-MODAL FETAL ECG EXTRACTION USING MULTI-KERNEL GAUSSIAN PROCESSES. Bharathi Surisetti and Richard M. Dansereau
MULTI-MODAL FETAL ECG EXTRACTION USING MULTI-KERNEL GAUSSIAN PROCESSES Bharathi Surisetti and Richard M. Dansereau Carleton University, Department of Systems and Computer Engineering 25 Colonel By Dr.,
More informationL. Ziaei MS*, A. R. Mehri PhD**, M. Salehi PhD***
Received: 1/16/2004 Accepted: 8/1/2005 Original Article Application of Artificial Neural Networks in Cancer Classification and Diagnosis Prediction of a Subtype of Lymphoma Based on Gene Expression Profile
More informationA methodology for the analysis of medical data
Please cite this book chapter as: A. Tsanas, M.A. Little, P.E. McSharry, A methodology for the analysis of medical data, Handbook of Systems and Complexity in Health, Springer, New York, pp. 113-125, 2013
More informationRNA-seq. Differential analysis
RNA-seq Differential analysis Data transformations Count data transformations In order to test for differential expression, we operate on raw counts and use discrete distributions differential expression.
More informationInformation Processing During Transient Responses in the Crayfish Visual System
Information Processing During Transient Responses in the Crayfish Visual System Christopher J. Rozell, Don. H. Johnson and Raymon M. Glantz Department of Electrical & Computer Engineering Department of
More informationProfile Analysis. Intro and Assumptions Psy 524 Andrew Ainsworth
Profile Analysis Intro and Assumptions Psy 524 Andrew Ainsworth Profile Analysis Profile analysis is the repeated measures extension of MANOVA where a set of DVs are commensurate (on the same scale). Profile
More informationPrediction and Inference under Competing Risks in High Dimension - An EHR Demonstration Project for Prostate Cancer
Prediction and Inference under Competing Risks in High Dimension - An EHR Demonstration Project for Prostate Cancer Ronghui (Lily) Xu Division of Biostatistics and Bioinformatics Department of Family Medicine
More informationPharmacometric Modelling to Support Extrapolation in Regulatory Submissions
Pharmacometric Modelling to Support Extrapolation in Regulatory Submissions Kristin Karlsson, PhD Pharmacometric/Pharmacokinetic assessor Medical Products Agency, Uppsala,Sweden PSI One Day Extrapolation
More informationMulticlass microarray data classification based on confidence evaluation
Methodology Multiclass microarray data classification based on confidence evaluation H.L. Yu 1, S. Gao 1, B. Qin 1 and J. Zhao 2 1 School of Computer Science and Engineering, Jiangsu University of Science
More informationHierarchical Bayesian Modeling of Individual Differences in Texture Discrimination
Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination Timothy N. Rubin (trubin@uci.edu) Michael D. Lee (mdlee@uci.edu) Charles F. Chubb (cchubb@uci.edu) Department of Cognitive
More informationConstruction of robust prognostic predictors by using projective adaptive resonance theory as a gene filtering method
BIOINFORMATICS ORIGINAL PAPER Vol. 21 no. 2 2005, pages 179 186 doi:10.1093/bioinformatics/bth473 Construction of robust prognostic predictors by using projective adaptive resonance theory as a gene filtering
More informationRunning head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1. John M. Clark III. Pearson. Author Note
Running head: NESTED FACTOR ANALYTIC MODEL COMPARISON 1 Nested Factor Analytic Model Comparison as a Means to Detect Aberrant Response Patterns John M. Clark III Pearson Author Note John M. Clark III,
More informationContributions to Brain MRI Processing and Analysis
Contributions to Brain MRI Processing and Analysis Dissertation presented to the Department of Computer Science and Artificial Intelligence By María Teresa García Sebastián PhD Advisor: Prof. Manuel Graña
More informationClassifica4on. CSCI1950 Z Computa4onal Methods for Biology Lecture 18. Ben Raphael April 8, hip://cs.brown.edu/courses/csci1950 z/
CSCI1950 Z Computa4onal Methods for Biology Lecture 18 Ben Raphael April 8, 2009 hip://cs.brown.edu/courses/csci1950 z/ Binary classifica,on Given a set of examples (x i, y i ), where y i = + 1, from unknown
More informationMemorial Sloan-Kettering Cancer Center
Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center, Dept. of Epidemiology & Biostatistics Working Paper Series Year 2007 Paper 14 On Comparing the Clustering of Regression Models
More informationSection 4.1. Chapter 4. Classification into Groups: Discriminant Analysis. Introduction: Canonical Discriminant Analysis.
Chapter 4 Classification into Groups: Discriminant Analysis Section 4.1 Introduction: Canonical Discriminant Analysis Understand the goals of discriminant Identify similarities between discriminant analysis
More informationClassification of Microarray Gene Expression Data
Classification of Microarray Gene Expression Data Geoff McLachlan Department of Mathematics & Institute for Molecular Bioscience University of Queensland Institute for Molecular Bioscience, University
More information3. Fixed-sample Clinical Trial Design
3. Fixed-sample Clinical Trial Design 3. Fixed-sample clinical trial design 3.1 On choosing the sample size 3.2 Frequentist evaluation of a fixed-sample trial 3.3 Bayesian evaluation of a fixed-sample
More informationPrediction of Successful Memory Encoding from fmri Data
Prediction of Successful Memory Encoding from fmri Data S.K. Balci 1, M.R. Sabuncu 1, J. Yoo 2, S.S. Ghosh 3, S. Whitfield-Gabrieli 2, J.D.E. Gabrieli 2 and P. Golland 1 1 CSAIL, MIT, Cambridge, MA, USA
More informationPAIRED AND UNPAIRED COMPARISON AND CLUSTERING WITH GENE EXPRESSION DATA
Statistica Sinica 12(2002), 87-110 PAIRED AND UNPAIRED COMPARISON AND CLUSTERING WITH GENE EXPRESSION DATA Jenny Bryan 1, Katherine S. Pollard 2 and Mark J. van der Laan 2 1 University of British Columbia
More informationA Semi-supervised Approach to Perceived Age Prediction from Face Images
IEICE Transactions on Information and Systems, vol.e93-d, no.10, pp.2875 2878, 2010. 1 A Semi-supervised Approach to Perceived Age Prediction from Face Images Kazuya Ueki NEC Soft, Ltd., Japan Masashi
More informationBootstrapping Residuals to Estimate the Standard Error of Simple Linear Regression Coefficients
Bootstrapping Residuals to Estimate the Standard Error of Simple Linear Regression Coefficients Muhammad Hasan Sidiq Kurniawan 1) 1)* Department of Statistics, Universitas Islam Indonesia hasansidiq@uiiacid
More informationUnsupervised MRI Brain Tumor Detection Techniques with Morphological Operations
Unsupervised MRI Brain Tumor Detection Techniques with Morphological Operations Ritu Verma, Sujeet Tiwari, Naazish Rahim Abstract Tumor is a deformity in human body cells which, if not detected and treated,
More informationArtificial Neural Networks (Ref: Negnevitsky, M. Artificial Intelligence, Chapter 6)
Artificial Neural Networks (Ref: Negnevitsky, M. Artificial Intelligence, Chapter 6) BPNN in Practice Week 3 Lecture Notes page 1 of 1 The Hopfield Network In this network, it was designed on analogy of
More informationThe Effects of Autocorrelated Noise and Biased HRF in fmri Analysis Error Rates
The Effects of Autocorrelated Noise and Biased HRF in fmri Analysis Error Rates Ariana Anderson University of California, Los Angeles Departments of Psychiatry and Behavioral Sciences David Geffen School
More informationLearning in neural networks
http://ccnl.psy.unipd.it Learning in neural networks Marco Zorzi University of Padova M. Zorzi - European Diploma in Cognitive and Brain Sciences, Cognitive modeling", HWK 19-24/3/2006 1 Connectionist
More informationHypothesis testing: comparig means
Hypothesis testing: comparig means Prof. Giuseppe Verlato Unit of Epidemiology & Medical Statistics Department of Diagnostics & Public Health University of Verona COMPARING MEANS 1) Comparing sample mean
More informationKnowledge Discovery and Data Mining I
Ludwig-Maximilians-Universität München Lehrstuhl für Datenbanksysteme und Data Mining Prof. Dr. Thomas Seidl Knowledge Discovery and Data Mining I Winter Semester 2018/19 Introduction What is an outlier?
More informationSensory Cue Integration
Sensory Cue Integration Summary by Byoung-Hee Kim Computer Science and Engineering (CSE) http://bi.snu.ac.kr/ Presentation Guideline Quiz on the gist of the chapter (5 min) Presenters: prepare one main
More informationAn entropy-based improved k-top scoring pairs (TSP) method for classifying human cancers
African Journal of Biotechnology Vol. 11(45), pp. 10438-10445, 5 June, 2012 Available online at http://www.academicjournals.org/ajb DOI:10.5897/AJB11.1016 ISSN 1684 5315 2012 Academic Journals Full Length
More informationZainab M. AlQenaei. Dissertation Defense University of Colorado at Boulder Leeds School of Business Operations and Information Management Division
An Investigation of the Relationship between Consumer Mental Health Recovery Indicators and Clinicians Reports Using Multivariate Analyses of the Singular Value Decomposition of a Textual Corpus Zainab
More informationSubLasso:a feature selection and classification R package with a. fixed feature subset
SubLasso:a feature selection and classification R package with a fixed feature subset Youxi Luo,3,*, Qinghan Meng,2,*, Ruiquan Ge,2, Guoqin Mai, Jikui Liu, Fengfeng Zhou,#. Shenzhen Institutes of Advanced
More informationA contrast paradox in stereopsis, motion detection and vernier acuity
A contrast paradox in stereopsis, motion detection and vernier acuity S. B. Stevenson *, L. K. Cormack Vision Research 40, 2881-2884. (2000) * University of Houston College of Optometry, Houston TX 77204
More informationPCA Enhanced Kalman Filter for ECG Denoising
IOSR Journal of Electronics & Communication Engineering (IOSR-JECE) ISSN(e) : 2278-1684 ISSN(p) : 2320-334X, PP 06-13 www.iosrjournals.org PCA Enhanced Kalman Filter for ECG Denoising Febina Ikbal 1, Prof.M.Mathurakani
More information3. Model evaluation & selection
Foundations of Machine Learning CentraleSupélec Fall 2016 3. Model evaluation & selection Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr
More informationProblem 1) Match the terms to their definitions. Every term is used exactly once. (In the real midterm, there are fewer terms).
Problem 1) Match the terms to their definitions. Every term is used exactly once. (In the real midterm, there are fewer terms). 1. Bayesian Information Criterion 2. Cross-Validation 3. Robust 4. Imputation
More informationA Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer
A Strategy for Identifying Putative Causes of Gene Expression Variation in Human Cancer Hautaniemi, Sampsa; Ringnér, Markus; Kauraniemi, Päivikki; Kallioniemi, Anne; Edgren, Henrik; Yli-Harja, Olli; Astola,
More informationFUZZY C-MEANS AND ENTROPY BASED GENE SELECTION BY PRINCIPAL COMPONENT ANALYSIS IN CANCER CLASSIFICATION
FUZZY C-MEANS AND ENTROPY BASED GENE SELECTION BY PRINCIPAL COMPONENT ANALYSIS IN CANCER CLASSIFICATION SOMAYEH ABBASI, HAMID MAHMOODIAN Department of Electrical Engineering, Najafabad branch, Islamic
More informationUK Musculoskeletal Oncology: Something for All Ages. Lars Wagner, MD Pediatric Hematology/Oncology University of Kentucky
UK Musculoskeletal Oncology: Something for All Ages Lars Wagner, MD Pediatric Hematology/Oncology University of Kentucky Pediatric-Type Sarcomas of Bone and Soft Tissue The incidence of sarcoma continues
More informationModel-free machine learning methods for personalized breast cancer risk prediction -SWISS PROMPT
Model-free machine learning methods for personalized breast cancer risk prediction -SWISS PROMPT Chang Ming, 22.11.2017 University of Basel Swiss Public Health Conference 2017 Breast Cancer & personalized
More information(C) Jamalludin Ab Rahman
SPSS Note The GLM Multivariate procedure is based on the General Linear Model procedure, in which factors and covariates are assumed to have a linear relationship to the dependent variable. Factors. Categorical
More informationData analysis in microarray experiment
16 1 004 Chinese Bulletin of Life Sciences Vol. 16, No. 1 Feb., 004 1004-0374 (004) 01-0041-08 100005 Q33 A Data analysis in microarray experiment YANG Chang, FANG Fu-De * (National Laboratory of Medical
More informationFeasibility Study in Digital Screening of Inflammatory Breast Cancer Patients using Selfie Image
Feasibility Study in Digital Screening of Inflammatory Breast Cancer Patients using Selfie Image Reshma Rajan and Chang-hee Won CSNAP Lab, Temple University Technical Memo Abstract: Inflammatory breast
More informationRNA preparation from extracted paraffin cores:
Supplementary methods, Nielsen et al., A comparison of PAM50 intrinsic subtyping with immunohistochemistry and clinical prognostic factors in tamoxifen-treated estrogen receptor positive breast cancer.
More informationIntroduction to Computational Neuroscience
Introduction to Computational Neuroscience Lecture 5: Data analysis II Lesson Title 1 Introduction 2 Structure and Function of the NS 3 Windows to the Brain 4 Data analysis 5 Data analysis II 6 Single
More informationAnalyzing Gene Expression Data: Fuzzy Decision Tree Algorithm applied to the Classification of Cancer Data
Analyzing Gene Expression Data: Fuzzy Decision Tree Algorithm applied to the Classification of Cancer Data Simone A. Ludwig Department of Computer Science North Dakota State University Fargo, ND, USA simone.ludwig@ndsu.edu
More informationBrain Tumor Detection of MRI Image using Level Set Segmentation and Morphological Operations
Brain Tumor Detection of MRI Image using Level Set Segmentation and Morphological Operations Swati Dubey Lakhwinder Kaur Abstract In medical image investigation, one of the essential problems is segmentation
More informationGene expression correlates of clinical prostate cancer behavior
Gene expression correlates of clinical prostate cancer behavior Cancer Cell 2002 1: 203-209. Singh D, Febbo P, Ross K, Jackson D, Manola J, Ladd C, Tamayo P, Renshaw A, D Amico A, Richie J, Lander E, Loda
More informationIdentifying Thyroid Carcinoma Subtypes and Outcomes through Gene Expression Data Kun-Hsing Yu, Wei Wang, Chung-Yu Wang
Identifying Thyroid Carcinoma Subtypes and Outcomes through Gene Expression Data Kun-Hsing Yu, Wei Wang, Chung-Yu Wang Abstract: Unlike most cancers, thyroid cancer has an everincreasing incidence rate
More informationEARLY STAGE DIAGNOSIS OF LUNG CANCER USING CT-SCAN IMAGES BASED ON CELLULAR LEARNING AUTOMATE
EARLY STAGE DIAGNOSIS OF LUNG CANCER USING CT-SCAN IMAGES BASED ON CELLULAR LEARNING AUTOMATE SAKTHI NEELA.P.K Department of M.E (Medical electronics) Sengunthar College of engineering Namakkal, Tamilnadu,
More informationRepresentational similarity analysis
School of Psychology Representational similarity analysis Dr Ian Charest Representational similarity analysis representational dissimilarity matrices (RDMs) stimulus (e.g. images, sounds, other experimental
More information