Hidden Markov Mqdels.for Chromosome Identification

Similar documents
A Hierarchical Artificial Neural Network Model for Giemsa-Stained Human Chromosome Classification

Karyotype = a test to identify and evaluate the size, shape, and number of chromosomes in a sample of body cells.

Biologically-Inspired Human Motion Detection

Karyology. Preparation and study of karyotypes is part of Cytogenetics.

ECG Beat Recognition using Principal Components Analysis and Artificial Neural Network

10CS664: PATTERN RECOGNITION QUESTION BANK

Understanding the Human Karyotype Colleen Jackson Cook, Ph.D.

A Survey on Brain Tumor Detection Technique

Improved Intelligent Classification Technique Based On Support Vector Machines

PMR5406 Redes Neurais e Lógica Fuzzy. Aula 5 Alguns Exemplos

Assigning B cell Maturity in Pediatric Leukemia Gabi Fragiadakis 1, Jamie Irvine 2 1 Microbiology and Immunology, 2 Computer Science

CYTOGENETICS Dr. Mary Ann Perle

Effect of Sensor Fusion for Recognition of Emotional States Using Voice, Face Image and Thermal Image of Face

arxiv: v2 [cs.cv] 8 Mar 2018

Gender Based Emotion Recognition using Speech Signals: A Review

Question 1 Multiple Choice (8 marks)

Dealing with Missing Values in Neural Network-Based Diagnostic Systems

Katsunari Shibata and Tomohiko Kawano

Performance of Gaussian Mixture Models as a Classifier for Pathological Voice

Neural Network Diagnosis of Avascular Necrosis from Magnetic Resonance Images

Automated Brain Tumor Segmentation Using Region Growing Algorithm by Extracting Feature

Classification of benign and malignant masses in breast mammograms

Detection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit

Cytogenetics 101: Clinical Research and Molecular Genetic Technologies

MRI Image Processing Operations for Brain Tumor Detection

EECS 433 Statistical Pattern Recognition

TWO HANDED SIGN LANGUAGE RECOGNITION SYSTEM USING IMAGE PROCESSING

Chromosome pathology

REVIEW ON ARRHYTHMIA DETECTION USING SIGNAL PROCESSING

LIE DETECTION SYSTEM USING INPUT VOICE SIGNAL K.Meena 1, K.Veena 2 (Corresponding Author: K.Veena) 1 Associate Professor, 2 Research Scholar,

Artificial neural networks: application to electrical stimulation of the human nervous system

Sensory Cue Integration

A framework for the Recognition of Human Emotion using Soft Computing models

Recognition of sign language gestures using neural networks

Skin cancer reorganization and classification with deep neural network

Analysis of Speech Recognition Techniques for use in a Non-Speech Sound Recognition System

Computational Cognitive Neuroscience

Computer aided detection of clusters of microcalcifications on full field digital mammograms

Multiclass Classification of Cervical Cancer Tissues by Hidden Markov Model

Application of Tree Structures of Fuzzy Classifier to Diabetes Disease Diagnosis

A Novel Regression Based Model for Detecting Anemia Using Color Microscopic Blood Images

Deep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations

A HMM-based Pre-training Approach for Sequential Data

Chromosome Abnormalities

WAO9 P-32 August 1, 2008 Bank Characterization Report

Application of Artificial Neural Networks in Classification of Autism Diagnosis Based on Gene Expression Signatures

QUANTIFICATION OF PROGRESSION OF RETINAL NERVE FIBER LAYER ATROPHY IN FUNDUS PHOTOGRAPH

Speech recognition in noisy environments: A survey

Noise-Robust Speech Recognition Technologies in Mobile Environments

Application of distributed lighting control architecture in dementia-friendly smart homes

Computational Cognitive Science

Evaluating Classifiers for Disease Gene Discovery

IMAGE ANALYSIS SYSTEM FOR DETECTION OF RED BLOOD CELL DISORDERS USING ARTIFICIAL NEURAL NETWORK

ITERATIVELY TRAINING CLASSIFIERS FOR CIRCULATING TUMOR CELL DETECTION

Deep Learning-based Detection of Periodic Abnormal Waves in ECG Data

ECG Rhythm Analysis by Using Neuro-Genetic Algorithms

Artificial Neural Networks

Analysis of Microscopic Images of Blood Cells for Detection of Leukemia

CHAPTER-VII : SUMMARY AND CONCLUSIONS

Extraction and Identification of Tumor Regions from MRI using Zernike Moments and SVM

Human Genetics: Making Karyotypes Ch. 14

LUNG NODULE DETECTION SYSTEM

Classification and Statistical Analysis of Auditory FMRI Data Using Linear Discriminative Analysis and Quadratic Discriminative Analysis

Keywords Artificial Neural Networks (ANN), Echocardiogram, BPNN, RBFNN, Classification, survival Analysis.

CLASSIFICATION OF BRAIN TUMOUR IN MRI USING PROBABILISTIC NEURAL NETWORK

Why do we have a hippocampus? Short-term memory and consolidation

Automated identification of analyzable metaphase chromosomes depicted on microscopic digital images

Eretmapodites quinquevittatus Theobald 1

MS&E 226: Small Data

Abbott Cell-Dyn Reticulocyte Method Comparison and Reticulocyte Normal Reference Range Evaluation

Dharmesh A Sarvaiya 1, Prof. Mehul Barot 2

Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets

Canadian College of Medical Geneticists (CCMG) Cytogenetics Examination. May 4, 2010

ANN predicts locoregional control using molecular marker profiles of. Head and Neck squamous cell carcinoma

Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination

Splice Site Prediction Using Artificial Neural Networks

Searching for Temporal Patterns in AmI Sensor Data

BACKPROPOGATION NEURAL NETWORK FOR PREDICTION OF HEART DISEASE

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.

A Learning Method of Directly Optimizing Classifier Performance at Local Operating Range

An Enhanced Approach on ECG Data Analysis using Improvised Genetic Algorithm

Reporting cytogenetics Can it make sense? Daniel Weisdorf MD University of Minnesota

KARYOTYPE OF A SEI WHALE

Evaluation & Systems. Ling573 Systems & Applications April 9, 2015

CHROMOSOME. Chromosomes are act as factors which distinguished one species from another.

Lesson 6 Learning II Anders Lyhne Christensen, D6.05, INTRODUCTION TO AUTONOMOUS MOBILE ROBOTS

COMP9444 Neural Networks and Deep Learning 5. Convolutional Networks

Primary Level Classification of Brain Tumor using PCA and PNN

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

Primary Tumor detection with EEG Signals using Wavelet Transform and Neural Network

FEASIBILITY OF EMG-BASED CONTROL OF SHOULDER MUSCLE FNS VIA ARTIFICIAL NEURAL NETWORK

Performance of ART1 Network in the Detection of Breast Cancer

Automated Prediction of Thyroid Disease using ANN

SPEECH TO TEXT CONVERTER USING GAUSSIAN MIXTURE MODEL(GMM)

Brain Tumour Diagnostic Support Based on Medical Image Segmentation

Catherine T. Falk, 1 James M. Gilchrist, 2 Margaret A. Pericak-Vance, 3 and Marcy C. Speer 3. Summary. Introduction

Keywords Missing values, Medoids, Partitioning Around Medoids, Auto Associative Neural Network classifier, Pima Indian Diabetes dataset.

Proceedings of the 5th WSEAS International Conference on Telecommunications and Informatics, Istanbul, Turkey, May 27-29, 2006 (pp )

Use of artificial neural networks to estimate production variables of broilers breeders in the production phase

Transcription:

Hidden Markov Mqdels.for Chromosome Identification John M. Conroy Center for Computzizg Scneiices Instztute for Defense Analyses Bowae, Maryland Robert L. Becker, Jr. William Lefkowitz Kewi L. Christopher Rawatma1 B. Surana Timothy J. O Leary Armed Forces Instztute of Pathology Washzizgton, DC Dianne P. O Leary Computer Sczeizce Department and Instztute for Advanced Computer Studzes Unzverszty of Maryland College Park, MD Tamara G Kolda Computatzonal Sczence and Mathematzcs Research Department Sandza Natzoizal Laboratorzes Lavermore, CA Abstract In this talk we present a Hidden Markov Markov for automatic karyotyping. Previously, we demoizstrated that this method is robust in the presence of different types of metaphase spreads, truncation of chromosomes, and minor chromosome abnormalities,, and that it gives results superior to neural network on standard data sets.. In this work we evaluate it on a data set consisting of a mix of chromosomes obtained from blood, amniotic fluid and bone marrow specimens. The method is shown to be robust on this mixed set of data as well as giving far superior results than that obtained by iieural networks. Technical areas: Signal and image processing in medicine; software, systems in medicine. 0-7695-1004-3/01 $10.00 0 2001 EEE 413

414 1. Introduction. Automatic methods for chromosome karyotyping are of great interest in yielding a preliminary classification for G-banded chromosomes, making the final classification process much less tedious. The most popular automated techniques are based on neural networks (see, for example, [l, 2, 31). In previous work [4] some of the authors have proposed the use of hidden Markov models (HMMs) for automatic karyotyping, and we demonstrated the effectiveness of the method on standard data sets: Philadelphia, with chromosomes taken from chorionic villus, and Edinburgh and Copenhagen, with chromosomes taken from blood. We ran three tests of the methods, training on half of the samples in each data set and then evaluating based on performance on the other half of the data. In each case, the HMM method gave higher accuracy than the neural network. In this work, we continue our study of the HMM method, evaluating it on a data set collected at the Armed Forces Institute of Pathology, consisting of a mix of chromosomes obtained from blood, amniotic fluid and bone marrow specimens. 2. Data gathering. Metaphase spreads were obtained from blood, amniotic fluid and bone marrow specimens that were sent for routine cytogenetic evaluation and appropriately cultured. Slides were stained using a trypsin-giemsa technique [SI. Selected metaphases were then visualized using an Olympus Vanox microscope and digitized in gray scale using a Panasonic RS 170 digital camera. Although bent and overstained chromosomes were included in the image set, overlapping chromosomes were excluded. Following identification of each chromosome as autosome 1-22, X, or Y, axial densitometric traces of the chromosomes were obtained, giving sequences of gray levels. The data were randomly spit into two pieces,, each containing about 2500 sample chromosomes. The HMMs and the neural network were trained on the first set and then tested on the second. 3. Methods. Net maker Professional for Windows and Brainmaker Professional for Windows (California Scientific Software, Nevada City, California) were used to generate backpropagation neural networks with an input layer of 15, 30 or 43 nodes, a single hidden layer of 200 nodes, and an output layer of 24 nodes. The input to the neural net was a sequence of 15 gray-level values for each chromosome, or the gray levels augmented by 15 first and 15 second differences. The hidden Markov model (HMM) we used was identical to the one

presented in [4]. We highlight a few features of the model here and refer the reader to the journal article for details. For each chromosome type (the autosomes 1,2,..., 22 and the sex chromosomes X and Y) we determine a EIMM based on half of the available data. The model consists of hidden states whose number is determined by the median number of observations in the training data corresponding to that chromosome. Each state corresponds to the corresponding positions in the sequence of gray level values for an idealized chromosome of that type. Chromosomes to he karyotyped could he shorter or longer than the average length, or perhaps even truncated at either end, so the model is designed to allow states to be skipped or repeated in taking a walk though the chain. The observations, which are modelled as outputs of a probabilistic furiction of the hidden states, are the gray scale value, and possibly the first and second differences. The latter two give additional discriiniiiatioii power, because, they indicate how quickly the gray scale value changes at a position, and the curvature of the gray scale sequence. A Gaussian mixture motlel is usecl to model this three-tuple of observations and we compute a mixture model for each position in the template for each chromosome type. To classifi an unknown chromosome we score it with each of the 24 FIiUMs and then interpret these 24 scores as a feature vector for input to a linear discrimiriant analysis classifier. 475 Figure 1: Markov Model for I<aryotyping 4. Experimental Results. After training the neural network and the HL414, we ran three cxperiments: 1. Evaluate using the chromosomes in the test set. (Recall that the test

476 NN, no differences NN, 1 differences NN, 2 differences.91.81 HMM, no differences.80 HMM, 1 differences.91.93 HMM, 2 differences.96 I.92.93 Table 1: Comparing I-IMM and Neural Network sct was a random selection of half of the data and consisted of approximately 2500 chromosomes.) This experiment is designated by Normal in Table 1. 2. Truncate the last 10% of the data for each chromosome, simulating the identification of broken chromosomes or chromosomc records that were improperly cut. This experiment is designated by Truncated. 3. Take the gray level values in the middle 10% of the sequence for each chromosome and reverse their order, simulating an internal inversion of chromosomal material. This experiment is designated by Flipped. Table 1 gives the percent of the test chromosomes that were correctly identified by each method. For further confidence the data were repartitioned into approximately equal pieces so that a patients chromosomes would never bc split between the testing and training data. The HMM was run on the repartitioned data and the results agreed with the above within 2 digits of accuracy. 5. Conclusions. We have demonstrated that the HMM method is a robust method for automatic karyotyping, robust in the presence of different types of metaphase spreads, truncation of chromosomes, and minor chromosome abnormalities, and that it gives results superior to neural networks. References [l] Graham J, Errington P, Jennings A. A neural network chromosome classifier. J.Radiat.Res.(Tokyo.) 1992;33 Suppl:250-7.

[2] Sweeney WPJ, Musavi MT, Guidi JN. Classification of chromosomes using a probabilistic neural network. Cytometry 1994;16( 1):17-24. 477 [3] Errington PA, Graham J. Application of artificial neural networks to chromosome classification. Cytometry 1993; 14(6):627-39. [4] Conroy JM, Kolda TG, O Leary DP, O Leary TJ, Chromosome Identification Using Hidden Markov Modles: Comparison with Neural Network, Singular Value Decomposition, Principal Components Analysis, and Fisher Discriminant Analysis, Laboratory Investigation. 2000;80:1629-1641. [5] Gustashaw, KM. Chromosome Stains, in The AGT Cytogenetics Laboratory Manual, Barch MJ, Knutsen T, and Spurbeck JL, Eds. Lippincott-Raven, Philadelphia, 1997:259-324.