NAÏVE BAYESIAN CLASSIFIER FOR ACUTE LYMPHOCYTIC LEUKEMIA DETECTION

Similar documents
COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) DETECTION OF ACUTE LEUKEMIA USING WHITE BLOOD CELLS SEGMENTATION BASED ON BLOOD SAMPLES

Computerized Detection System for Acute Myelogenous Leukemia in Blood Microscopic Images

Available online at ScienceDirect. Procedia Computer Science 70 (2015 )

Keywords: Leukaemia, Image Segmentation, Clustering algorithms, White Blood Cells (WBC), Microscopic images.

A ROBUST MORPHOLOGICAL ANALYSIS OF NORMAL AND ABNORMAL LEUKEMIC CELLS POPULATIONS IN ACUTE LYMPHOBLASTIC LEUKEMIA

Blood Microscopic Image Segmentation & Acute Leukemia Detection Tejashri G. Patil *, V. B. Raskar E & TC Department & S.P. Pune, Maharashtra, India

Enhanced Detection of Lung Cancer using Hybrid Method of Image Segmentation

Analysis of Microscopic Images of Blood Cells for Detection of Leukemia

Brain Tumor segmentation and classification using Fcm and support vector machine

Primary Level Classification of Brain Tumor using PCA and PNN

Improved Intelligent Classification Technique Based On Support Vector Machines

Analysis & Classification of Acute Lymphoblastic Leukemia using KNN Algorithm

COMPARATIVE STUDY ON FEATURE EXTRACTION METHOD FOR BREAST CANCER CLASSIFICATION

Brain Tumour Detection of MR Image Using Naïve Beyer classifier and Support Vector Machine

Detection of Glaucoma and Diabetic Retinopathy from Fundus Images by Bloodvessel Segmentation

A New Approach for Detection and Classification of Diabetic Retinopathy Using PNN and SVM Classifiers

Detection of Leukemia in Human blood sample through Image Processing: A Review

AUTOMATED DETECTION AND CLASSIFICATION OF LEUKEMIA USING IMAGE PROCESSING AND MACHINE LEARNING

A REVIEW ON CLASSIFICATION OF BREAST CANCER DETECTION USING COMBINATION OF THE FEATURE EXTRACTION MODELS. Aeronautical Engineering. Hyderabad. India.

Classification of mammogram masses using selected texture, shape and margin features with multilayer perceptron classifier.

SVM-Kmeans: Support Vector Machine based on Kmeans Clustering for Breast Cancer Diagnosis

Detection of Abnormalities of Retina Due to Diabetic Retinopathy and Age Related Macular Degeneration Using SVM

Extraction of Texture Features using GLCM and Shape Features using Connected Regions

CLASSFICATION OF MALARIAL PARASITE AND ITS LIFE-CYCLE- STAGES IN BLOOD SMEAR

MEM BASED BRAIN IMAGE SEGMENTATION AND CLASSIFICATION USING SVM

I. INTRODUCTION III. OVERALL DESIGN

Detection and Classification of Leukaemia using Artificial Neural Network

Classification of Mammograms using Gray-level Co-occurrence Matrix and Support Vector Machine Classifier

A COMPARATIVE STUDY OF TECHNIQUES FOR LEUKAEMIA DETECTION

A Robust Feature Extraction and Selection Method for the Recognition of Lymphocytes versus Acute Lymphoblastic Leukemia

DETECTING DIABETES MELLITUS GRADIENT VECTOR FLOW SNAKE SEGMENTED TECHNIQUE

A Reliable Method for Brain Tumor Detection Using Cnn Technique

International Journal of Computer Sciences and Engineering. Review Paper Volume-5, Issue-12 E-ISSN:

PREDICTION OF BREAST CANCER USING STACKING ENSEMBLE APPROACH

Effect of Feedforward Back Propagation Neural Network for Breast Tumor Classification

Analysis of Classification Algorithms towards Breast Tissue Data Set

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017

Lung Cancer Detection using CT Scan Images

MACHINE LEARNING BASED APPROACHES FOR PREDICTION OF PARKINSON S DISEASE

Efficacy of the Extended Principal Orthogonal Decomposition Method on DNA Microarray Data in Cancer Detection

Cancer Cells Detection using OTSU Threshold Algorithm

LOCATING BRAIN TUMOUR AND EXTRACTING THE FEATURES FROM MRI IMAGES

MRI Image Processing Operations for Brain Tumor Detection

MR Image classification using adaboost for brain tumor type

Predicting Breast Cancer Survivability Rates

Early Detection of Lung Cancer

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016

CANCER DIAGNOSIS USING DATA MINING TECHNOLOGY

Brain Tumor Segmentation Based On a Various Classification Algorithm

An Edge-Device for Accurate Seizure Detection in the IoT

Keywords Image segmentation, Brain Tumor, MRI, Enhanced Darwinian Particle Swarm Optimization (EDPSO), Firefly.

Diagnosis of Breast Cancer Using Ensemble of Data Mining Classification Methods

A Pictorial Review and an Algorithm for the Determination of Sickle Cell Anemia

SPPS: STACHOSTIC PREDICTION PATTERN CLASSIFICATION SET BASED MINING TECHNIQUES FOR ECG SIGNAL ANALYSIS

ANN BASED IMAGE CLASSIFIER FOR PANCREATIC CANCER DETECTION

Lung Cancer Diagnosis from CT Images Using Fuzzy Inference System

Segmentation of Tumor Region from Brain Mri Images Using Fuzzy C-Means Clustering And Seeded Region Growing

BONE CANCER DETECTION USING ARTIFICIAL NEURAL NETWORK

A Review on Arrhythmia Detection Using ECG Signal

PMR5406 Redes Neurais e Lógica Fuzzy. Aula 5 Alguns Exemplos

Classification of Thyroid Nodules in Ultrasound Images using knn and Decision Tree

Application of distributed lighting control architecture in dementia-friendly smart homes

CLASSIFICATION OF BRAIN TUMOUR IN MRI USING PROBABILISTIC NEURAL NETWORK

Research Article. Automated grading of diabetic retinopathy stages in fundus images using SVM classifer

DETECTION OF LEUKEMIA USING IMAGE PROCESSING

CHAPTER 5 DECISION TREE APPROACH FOR BONE AGE ASSESSMENT

ECG Beat Recognition using Principal Components Analysis and Artificial Neural Network

INTRODUCTION TO MACHINE LEARNING. Decision tree learning

Analysis of EEG Signal for the Detection of Brain Abnormalities

Detection of Glaucoma using Cup-to-Disc Ratio and Blood Vessels Orientation

REVIEW OF METHODS FOR DIABETIC RETINOPATHY DETECTION AND SEVERITY CLASSIFICATION

Detection and Classification of Brain Tumor using BPN and PNN Artificial Neural Network Algorithms

Automatic Classification of Breast Masses for Diagnosis of Breast Cancer in Digital Mammograms using Neural Network

Intelligent Diabetic Retinopathy Diagnosis in Retinal Images

Predicting Non-Small Cell Lung Cancer Diagnosis and Prognosis by Fully Automated Microscopic Pathology Image Features

AUTOMATIC BRAIN TUMOR DETECTION AND CLASSIFICATION USING SVM CLASSIFIER

Kishan Patel, Rashmin B. Prajapati Department of Computer Engineering, S.V.I.T. Vasad, Gujarat, India

VOTING BASED AUTOMATIC EXUDATE DETECTION IN COLOR FUNDUS PHOTOGRAPHS

Papsmear Image based Detection of Cervical Cancer

Keywords Missing values, Medoids, Partitioning Around Medoids, Auto Associative Neural Network classifier, Pima Indian Diabetes dataset.

IMPROVED SELF-ORGANIZING MAPS BASED ON DISTANCE TRAVELLED BY NEURONS

Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes

Extraction and Identification of Tumor Regions from MRI using Zernike Moments and SVM

Mammogram Analysis: Tumor Classification

Machine Learning! Robert Stengel! Robotics and Intelligent Systems MAE 345,! Princeton University, 2017

Lung Cancer Detection by Using Artificial Neural Network and Fuzzy Clustering Methods

Mammographic Cancer Detection and Classification Using Bi Clustering and Supervised Classifier

Time-to-Recur Measurements in Breast Cancer Microscopic Disease Instances

Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets

Final Project Report. Detection of Cervical Cancer in Pap Smear Images

Detect the Stage Wise Lung Nodule for CT Images Using SVM

Evaluating Classifiers for Disease Gene Discovery

Classification of Cancer of the Lungs using ANN and SVM Algorithms

Automatic Detection and Classification of Skin Cancer

BraTS : Brain Tumor Segmentation Some Contemporary Approaches

Computer Assisted System for Features Determination of Lung Nodule from Chest X-ray Image

Classification of normal and abnormal images of lung cancer

Dharmesh A Sarvaiya 1, Prof. Mehul Barot 2

Differentiating Tumor and Edema in Brain Magnetic Resonance Images Using a Convolutional Neural Network

Transcription:

NAÏVE BAYESIAN CLASSIFIER FOR ACUTE LYMPHOCYTIC LEUKEMIA DETECTION Sriram Selvaraj 1 and Bommannaraja Kanakaraj 2 1 Department of Biomedical Engineering, P.S.N.A College of Engineering and Technology, Dindigul, Tamilnadu, India 2 K. P.R. Institute of Engineering and Technology, Coimbatore, India E-Mail: sriram@psnacet.edu.in ABSTRACT Leukemia is a malignant neoplasm of the blood and is one of the major causes of death throughout the world. Acute Lymphocytic Leukemia is the most common type of Leukemia and it generally affects children and adults above fifty years of age. Examination of Peripheral blood smear Image is one of the most widely used technique for Leukemia detection though it suffers from problems such as subjective interpretations, operator tiredness and efficiency. The objective of this work is to develop a method that can classify lymphocyte and lymphoblast nuclei from the ALL-IDB2 dataset. In the present paper, from a set of forty Images of Lymphocytes and Lymphoblasts, nucleus is segmented using K- means clustering method after which a set of features are extracted. Naive Bayesian classifier was used in this work for Lymphocyte classification which yielded 75% accuracy. Keywords: Leukemia, acute lymphocytic Leukemia, segmentation, K-means clustering, Naïve Bayesian classifier. 1. INTRODUCTION Human blood mainly consists of three main cells namely, Red Blood cells (RBC S), White Blood cells (WBC s) and Platelets. Examination of the number and shape of these cells will give us vital information about the well being of an individual [1]. Excess number or Lack of either Red Blood cells or White blood cells will indicate certain characteristic diseases such as Anemia or Leukemia. Leukemia is a state in the human body where the human body produces excess number of immature white blood cells. These excess White blood cells prevent the production of normal white blood cells and hamper their normal functioning. Leukemia can be broadly classified into Acute or chronic based on the quickness with which the disease develops and spreads [2]. It can be further classified into the following types based on the affected cells a) Lymphocytic Leukemia b) Myelogenous Leukemia. Leukemia results in suppression of hematopoiesis and thus resulting in anemia, thrombocytopenia and neutropenia. Diagnosis of Leukemia poses serious challenges because of the nonspecific nature of the signs and symptoms. Though there are advanced techniques available for diagnosis of Leukemia such as in situ hybridization (FISH), immunophenotyping, cytogenetic analysis and cytochemistry, they are found to be time consuming and costly [3].So the most simple and efficient method used to diagnose Leukemia is examination of Blood smear Image. But the Problem with this method is that it is subjective, tedious and susceptible to human errors. Over the years some methods have been devised to automate this process. 2. RELATED WORK Nucleus segmentation SubrajeetMohapatra etal.,[3] proposed a method in which segmentation is performed in two stages for segmenting the White Blood cell Nucleus using color based clustering. Initial segmentation is achieved by K- means clustering followed by nearest neighbour classification in L*a*b space. SubrajeetMohapatra etal. then used other methods such as fuzzy clustering technique [4] and kernel induced Rough C Means clustering [5] for nuclei segmentation in White Blood cells. The same authors employed a functional Link Neural architecture [6] to segment Cytoplasm, Nucleus and Background region in a lymphocyte cell. Abd.Halim etal., [7], devised a segmentation based on HIS color space to improve the Image quality Feature extraction Apart from features such a Area, Perimeter, Convex Area, Solidity, Major Axis Length, Orientation, Filled area and Eccentricity, Fabio Scotti[8]also extracted rectangularity and circularity of the cell for identification of Leukemia. In almost all their works related to Leukemia detection SubrajeetMohapatra et al.,[2,3,4,6]extracted the following features. a) Fractal Features Which included Hausdorff Dimension and Contour signature 6888

b) Shape Features such as area Perimeter, Compactness, Solidity, Eccentricity& Form factor c) Texture Features-Homogeneity, Energy, Correlation and Entropy. Classification Fabio scotti [8, 9] used the K-Nearest Neighbor classifier and considered various norms such as Euclidean, Cubic and Manhattan for classifying a normal lymphocyte and a lymphoblast to aid in Acute Leukemia detection. The same method was used by him to classify Leukocytes [9]. He also used Feed forward neural network and Linear Bayes normal classifier and Radial Basis Function to accomplish the same task and compared their performances. With the set of features extracted SubrajeetMohapatraet al., [2, 3, 4, 6] preferred to use Support Vector Machine (SVM) as their classifier for Acute Leukemia detection. SubrajeetMohapatraetal., [10] deployed a ensemble based classifier system which predicts by combining several diverse classifiers to classify Lymphocytes and Lymphoblasts to facilitate Acute Leukemia detection. In their ensemble they have used Naïve Bayesian, K-Nearest Neighbor, Multilayer Perceptron, Radial Basis function networks and Support Vector Machine (SVM) to get the desired output. 3. METHODS Figure-1 gives the flowchart of the various steps involved in Acute Leukemia detection used in this paper. A) Image acquisition Images of lymphocyte cells, both from normal subjects and people suffering from Acute Lymphocyte Leukemia are taken with permission from ALL-IDB, a public Image dataset of peripheral blood samples of normal individuals and Leukemia patients, maintained by Fabio scotti, Department of Information Technology, University of Milan, Italy. The Images from ALL-IDB2 have been used for thiswork.all-idb2 is a collection of normal and blast lymphocyte cells. All the Images in the dataset are in JPG format with 24 bit color depth, and a native resolution equal to 2592*1944 captured with a Power Shot G56 camera. The magnifications of the Images range from 300 to 500 B) Nucleus segmentation The K-means clustering method is used to segment the nucleus from the normal and abnormal lymphocyte cells got from ALL-IDB2 dataset. K-Means clustering is a semi supervised clustering method that is used to create K clusters from n observations. Segmentation is achieved in such a way that objects within a cluster are as close as possible and at the same time they are far away from objects in other clusters. C) Feature extraction If the input data or Image is too large it may have irrelevant or redundant data. The process of selecting only the relevant information from the input data to reduce its size and complexity is called as feature extraction. Features are extracted from the segmented nuclei of lymphocyte cells from the ALL-IDB2 dataset.in this work two types of features namely the shape features and the densitometric features are extracted from the Lymphocyte nuclei. a) Shape features *Area: Area is determined by calculating the total number of nonzero pixels within the Image region. *Perimeter: It is evaluated by measuring the distance between Successive boundary pixels. *Compactness: Compactness is calculated by the following formula: Compactness= (Perimeter) 2 /Area *Form Factor: This feature is used to measure the surface irregularities. Form factor is measured by the formula Form Factor = (4*Pi*Area)/ (Perimeter) 2 Figure-1. Flow chart of the proposed method. b) Densitometric features Apart from the above mentioned shape features the following densitoimetric features have also been extracted from the segmented lymphocyte nuclei. 6889

*Energy: This feature is used to measure the degree of uniformity of a given Image. *Entropy: It is used to measure the randomness of a given Image *Correlation: This feature gives an idea about the correlation between Pixel values and its neighbourhood. *Variance: Variance is used to find how each pixel varies from the neighbouring pixel and is used in classification of different regions. 5. RESULTS AND DISCUSSIONS The method Proposed in Figure-1 is implemented on dataset ALL-IDB2 for Automated Leukemia detection and the steps given there are followed sequentially. D) Classification Classification is the process of grouping an image into one or several distinct and exclusive classes. In this work Naïve Bayesian classification method is used for segregating a Lymphocyte Nucleus Image into a Normal Lymphocyte or an abnormal Lymphoblast because of its simplicity and reliability. Naïve Bayesian classifier belongs to a family of simple probabilistic classifier based on applying Baye s theorem with strong independence between the features. Naive Bayesian classifier assumes that the value of a feature which is independent of the value of any other feature. Naïve Bayesian classifier combines the Naïve Bayes Probability model with a decision rule. The decision rule is known as maximum aposteriori rule and it picks the most probable hypothesis. Bayes classifier is a function that assigns a class label y=c k for some K as given below: 4. PERFORMANCE ANALYSIS Performance of the Naïve Bayesian Classifier for Automated Leukemia detection is evaluated by calculating the performance metrics such as Accuracy, Sensitivity and specificity from the confusion matrix. In Binary classification method, Positive is considered as identified and Negative as rejected. So generally TP, TN, FP and FN can be defined as follows: TP (True Positive): Correctly Identified TN (True Negative): Incorrectly Identified FP (False Positive): Correctly rejected FN (False Negative): Incorrectly Rejected Performance measures can be calculated as follows: Accuracy= {(TP+TN)/(TP+FP+TN+FN)}*100% Sensitivity={TP/(TP+FN)}*100% Specificity={TN/ (TN+FP)}*100% Figure-2. Results of segmentation of nuclei by K-means clustering. d,e,f represent segmented nuclei from lymphocytes a,b,c and j,k,l represent segmented nuclei from lymphoblasts g,h and I respectively. A set of 40 Images are taken from ALL-IDB2 Dataset, out of which 20 are samples of normal lymphocytes and 20 are samples of cancer affected Lymphocytes called as Lymphoblasts. All the Image samples are subjected to Segmentation as per the method suggested in III (B).The segmented output of cell Nuclei Images of three normal Lymphocytes and three abnormal Lymphoblasts are given in Figure-2. After the nucleus of the Lymphocyte is segmented as described in III (B), a set of features are extracted from them so that they may be fed to a classifier. The set of features extracted from two sample nuclei, one from a lymphocyte and the other from a Lymphoblast are given in Table-1. 6890

Table-1. Feature values of a sample Lymphocyte and a Lymphoblast. Feature Lymphocyte Lymphoblast Area 4873 5536 Perimeter 603 1306 Compactness 74.6170 308.0989 Form factor 0.1683 0.0407 Energy 1.5203 1.5314 Entropy 3.2310 3.1383 Correlation -0.3158-0.3405 Variance 2.6846e+05 2.6717e+05 The features mentioned in thetable.1 are used as inputs to the classifier described in III (D) The Naïve Baye s classifier that we have used in this work for classification of Lymphocytes and Lymphoblasts in the datasetal-idb2 yielded 75% accuracy. Other Performance metrics of the classifier are given in the Table-2. Table-2. Performance values of Naïve Bayesian classifier used for Leukemia detection in ALL-IDB2. Performance metric Value Accuracy 75% Sensitivity 77.78% Specificity 72.72% 6. CONCLUSION AND FUTUTRE WORK In this work a Naïve Bayesian classifier has been proposed for classification of normal Lymphocytes and abnormal Lymphoblasts from ALL-IDB2 which yielded 75% accuracy. The results give us encouragement to use other modern classification techniques for the same dataset so that Acute Lymphocytic Leukemia can be detected more accurately. In future same methods can be applied to real time Blood samples Images which will help the clinical expert in Acute Leukemia Detection. ACKNOWLEDGEMENT The authors would like to thank Professor Fabio Scotti, Department of Information Technology, and University of Milan, Italy for granting them Permission to use the dataset ALL-IDB Prepared and maintained by him [11] which is an image dataset of Peripheral Blood samples of Normal individuals and Leukemic Patients. REFERENCES [1] Vincenzo Piuri and Fabio Scotti. 2004. Morphological Classification of Blood Leucocytes by Microscopic Images. In:Proc. IEEE International conference on computational intelligence for Measurement systems and Applications, Boston, MA.USA. pp. 103-108. [2] Fabio Scotti. 2005. Automatic Morphological Analysis for Acute Leukemia Identification in Peripheral Blood Microscopic Images.In:Proc of IEEE International conference on computational intelligence for Measurement systems and Applications, Giardini Naxos,Italy. pp. 96-101. [3] Fabio Scotti. 2006. Robust segmentation and Measurement Techniques of White Cells in Blood Microscopic Images. In: Proc. IEEE Instrumentation and Measurement Technology conference, IMTC, Sorrento, Italy. pp. 43-48. [4] SubrajeetMohapatra and DiptiPatra. 2010. Automated Leukemia Detection using Hausdorff Dimension in Blood Microscopic Images. In: Proc. IEEE international conference on Emerging Trends in Robotics and communication Technologies (INTERACT), Chennai, India. pp. 64-68. [5] SubrajeetMohapatra and DiptiPatra. 2010. Automated Cell Nucleus Segmentation and Acute Leukemia Detection in Blood Microscopic Images. In:Proc IEEE International Conference on Systems in Medicine and Biology.pp. 49-54, IIT Kharagpur,India. [6] SubrajeetMohapatra, SushantaShekarSamanta, DiptiPatra and SanghamitraSatpathi. Fuzzy Based Blood Image Segmentation for Automated Leukemia Detection. In:Proc in IEEE international conference on Devices and communication (ICDeCom),Mesra. pp. 1-5. [7] N.H.AbdHalim, M.Y.Mashor, A.S.Abdul Nasir, N.R.Mokhtar and H.Rosline. Nucleus segmentation Technique for Acute Leukemia. Published in IEEE 7 th International colloquiumon Signal Processing and its Applications, Penang. pp. 192-197. [8] SubrajeetMohapatra, DiptiPatra,SunilKumarandSanghamitraSatpathi. 2012. Kernel Induced Rough c-means clustering for Lymphocyte Image Segmentation. In: Proc. 4 th International conference on Intelligent Human Computer Interaction. pp. 1-6, Kharagpur. [9] SubrajeetMohapatra, DiptiPatra, Sunil Kumar and SanghamitraSatpathi. 2012. Lymphocyte Image Segmentation Using Functional Link Neural 6891

Architecture for Acute Leukemia Detection. Biomedical Engineering Letters. 2: 100-110. [10] SubrajeetMohapatra, DiptiPatra, and SanghamitraSatpathi. 2013. An ensemble classifier system for early diagnosis of acute Lymphoblastic Leukemia in blood microscopic images. Neural Computing and Applications. l24: 1887-1904. [11] RuggeroDonida, Labati, Vincenzo Piuriand Fabio scotti. 2011. ALL-IDB: The Acute Lymphoblastic Leukemia Image Database for Image Processing. Published in 18 th IEEE International conference on Image Processing (ICIP), Brussels. 6892