Akosa, Josephine Kelly, Shannon SAS Analytics Day


1 Application of Data Mining Techniques in Improving Breast Cancer Diagnosis
Akosa, Josephine; Kelly, Shannon. 2016 SAS Analytics Day

2 Facts and Figures about Breast Cancer

3 Methods of Diagnosing Breast Cancer
Surgical biopsy: Sensitivity is close to 100%, but the cost associated with this method is high.
Mammography: Sensitivity fluctuates between 68% and 79%, and reported sensitivity varies with the radiologist's experience. Limitations, including variation in patient age and breast density, prevent researchers from significantly improving the sensitivity of this method.

4 Methods of Diagnosing Breast Cancer (contd.)
Fine needle aspiration (FNA) with visual interpretation: Sensitivity varies between 65% and 95%. FNA biopsies are minimally invasive and can be completed within minutes. The limitations associated with mammography are less severe with FNA with visual interpretation. Computer decision aids can improve radiologists' ability to correctly diagnose the malignancy of breast tumors.

5 Data Description and Preparation
The study uses the Wisconsin Breast Cancer data, originally compiled by Dr. William H. Wolberg and available from the UCI Machine Learning Repository. The dataset contains 699 clinical case samples (65.52% benign and 34.48% malignant) assessing the nuclear features of fine needle aspirates taken from patients' breasts. There are 11 attributes per observation, including the ID and the binary target variable, which leaves nine input variables. The input variables are measured on a scale of 1 to 10, with 1 indicating a normal state and 10 indicating a highly abnormal state. There were 16 missing values; because they affect only a small percentage of the cases (2.3%), these cases were excluded from the analysis. The weight of evidence (WOE) approach was employed to convert the input variables, treated as categorical, into numerical values.
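The two preparation steps above (dropping incomplete cases, then WOE encoding) can be illustrated with a minimal Python sketch. It is not the SAS Enterprise Miner procedure used in the study. It assumes the convention WOE = ln(share of benign cases at a level / share of malignant cases at that level); sign conventions differ between tools. The `smoothing` parameter and the toy rows are hypothetical and only keep the example finite and small.

```python
import math
from collections import Counter

def woe_table(values, targets, smoothing=0.5):
    """Map each level of one input to its weight of evidence.

    Assumed convention: WOE = ln(share of benign / share of malignant).
    `smoothing` is a hypothetical adjustment so empty cells stay finite.
    """
    events = Counter(v for v, t in zip(values, targets) if t == 1)
    non_events = Counter(v for v, t in zip(values, targets) if t == 0)
    levels = sorted(set(values))
    total_events = sum(events.values()) + smoothing * len(levels)
    total_non_events = sum(non_events.values()) + smoothing * len(levels)
    table = {}
    for level in levels:
        p_event = (events[level] + smoothing) / total_events
        p_non_event = (non_events[level] + smoothing) / total_non_events
        table[level] = math.log(p_non_event / p_event)
    return table

# Toy rows (made up): (graded input 1-10, target), target 1 = malignant.
rows = [(1, 0), (1, 0), (1, 0), (2, 0), (2, 1), (10, 1), (10, 1), (None, 0)]
complete = [(v, t) for v, t in rows if v is not None]  # exclude missing cases
values = [v for v, _ in complete]
targets = [t for _, t in complete]

woe = woe_table(values, targets)
encoded = [woe[v] for v in values]  # numeric replacement for the input
```

Under this convention, levels dominated by benign cases get a positive WOE and levels dominated by malignant cases get a negative one.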

6 Data Description and Preparation
To ensure an honest assessment of the models, the data were partitioned into training (70%) and validation (30%) subsets. Prior probabilities were set to account for oversampling.
Table 1. Weight of evidence variable summary statistics
Statistics reported per variable: Mean, Standard Deviation, Minimum, Median, Maximum, Skewness, Kurtosis
Variable    Label
WOE_BC      Bland Chromatin
WOE_BN      Bare Nuclei
WOE_CT      Clump Thickness
WOE_MAdh    Marginal Adhesion
WOE_Mit     Mitoses
WOE_NN      Normal Nucleoli
WOE_SECS    Single Epithelial Cell Size
WOE_UCSh    Uniformity of Cell Shape
WOE_UCSz    Uniformity of Cell Size
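The 70/30 partition and the prior adjustment mentioned on this slide can be sketched in Python as follows. This is an illustration rather than the Enterprise Miner configuration: the stratified split, the function names, and the population rate passed as `true_prior` are all assumptions, since the slides do not state which priors were used. The adjustment is the standard rescaling of a posterior from the sample event rate to a different population event rate.

```python
import random

def stratified_split(labels, train_fraction=0.7, seed=0):
    """Return (train, validation) index lists that keep the class mix."""
    rng = random.Random(seed)
    train, valid = [], []
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        cut = round(train_fraction * len(idx))
        train.extend(idx[:cut])
        valid.extend(idx[cut:])
    return sorted(train), sorted(valid)

def adjust_for_priors(p_sample, sample_rate, true_prior):
    """Rescale a posterior estimated at event rate `sample_rate`
    to a population whose event rate is `true_prior`."""
    event = p_sample * true_prior / sample_rate
    non_event = (1 - p_sample) * (1 - true_prior) / (1 - sample_rate)
    return event / (event + non_event)

# Toy labels (made up): 60 benign (0) and 40 malignant (1).
labels = [0] * 60 + [1] * 40
train_idx, valid_idx = stratified_split(labels)

# A score of 0.5 from a sample with 40% events, rescaled to a
# hypothetical population with 10% events.
adjusted = adjust_for_priors(0.5, sample_rate=0.4, true_prior=0.1)
```

When the sample rate equals the population rate, the adjustment leaves the score unchanged, which is a quick sanity check on the formula.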

7 Methods
A variety of data mining techniques were considered for model building: support vector machines, gradient boosting, logistic regression, random forest, decision tree, and neural networks. All models were built in SAS Enterprise Miner.

8 Process Flow Chart

9 Model Comparison
Fit statistics: misclassification rate, KS statistic, Gini coefficient, ROC index, sensitivity, specificity.
Model                           Specificity
SVM (Linear)                    95.52%
Decision tree (3 branches)      94.78%
Autoneural (default)            97.01%
Random Forest via PLS           96.27%
Random Forest via regression    96.27%
Linear Logistic regression      96.27%
Autoneural via regression       97.01%
Random Forest via PC            96.27%
Decision tree via PC            96.27%
Boosting via PC                 96.27%
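For readers unfamiliar with the fit statistics named on this slide, the sketch below computes each of them from a scored validation set in plain Python. The scores, the 0.5 cutoff, and the function name are made up for illustration, and Enterprise Miner may define some statistics slightly differently (for example in how it bins scores for the KS statistic). The Gini coefficient is taken as 2 × ROC index − 1.

```python
def fit_statistics(scores, targets, cutoff=0.5):
    """Summarise a scored validation set; target 1 = malignant, 0 = benign."""
    pos = [s for s, t in zip(scores, targets) if t == 1]
    neg = [s for s, t in zip(scores, targets) if t == 0]
    true_pos = sum(s >= cutoff for s in pos)
    true_neg = sum(s < cutoff for s in neg)
    # ROC index: chance that a malignant case outscores a benign one
    # (ties count as half a win).
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    roc_index = wins / (len(pos) * len(neg))
    # KS: largest gap between the two classes' cumulative score distributions.
    ks = max(
        abs(sum(p <= c for p in pos) / len(pos)
            - sum(n <= c for n in neg) / len(neg))
        for c in set(scores)
    )
    return {
        "misclassification": 1 - (true_pos + true_neg) / len(scores),
        "ks": ks,
        "gini": 2 * roc_index - 1,
        "roc_index": roc_index,
        "sensitivity": true_pos / len(pos),
        "specificity": true_neg / len(neg),
    }

# Toy scores (made up): predicted probability of malignancy.
scores = [0.9, 0.8, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
targets = [1, 1, 1, 1, 0, 0, 0, 0]
stats = fit_statistics(scores, targets)
```

Sensitivity and specificity depend on the chosen cutoff, while the KS statistic, Gini coefficient, and ROC index summarise ranking quality across all cutoffs.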

10 Explaining the Best Model
For the selected gradient boosting model, the first 5 principal components (PC) were used in model building. These components account for 90.48% of the total variability in the data.
Table: principal components PC_1 to PC_8 (columns) by input variable (rows: WOE_UCSz, WOE_UCSh, WOE_SECS, WOE_BC, WOE_NN, WOE_BN, WOE_MAdh, WOE_CT, WOE_Mit).
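The statement that a number of leading components accounts for a given share of the variability comes from the cumulative proportion of the eigenvalues. A small Python sketch of that arithmetic follows. The eigenvalues are invented for illustration (they are not the study's values), and the 90% threshold and function name are assumptions.

```python
def components_for_variance(eigenvalues, threshold=0.90):
    """Return the smallest number of leading components whose
    cumulative share of total variance reaches `threshold`."""
    total = sum(eigenvalues)
    running = 0.0
    for count, value in enumerate(sorted(eigenvalues, reverse=True), start=1):
        running += value
        if running / total >= threshold:
            return count
    return len(eigenvalues)

# Hypothetical eigenvalues for nine standardised inputs (they sum to 9).
eigenvalues = [5.0, 1.5, 0.9, 0.6, 0.4, 0.3, 0.15, 0.1, 0.05]
n_components = components_for_variance(eigenvalues)
share = sum(eigenvalues[:n_components]) / sum(eigenvalues)
```

With these made-up eigenvalues, four components fall short of 90% and the fifth crosses it, which is the same kind of cutoff reasoning the slide reports.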

11 Summary of Findings
The gradient boosting model was the best model for diagnosing breast cancer using data from fine needle aspiration. Uniformity of cell shape and size, bare nuclei, and bland chromatin were identified as the best FNA characteristics with respect to breast cancer diagnosis. Outcome prediction can be further improved by refining the methods used to identify and measure the FNA characteristics. Finally, using this model would help decrease interpretation errors by radiologists.

12 Acknowledgement & Contact
We wish to express our sincere gratitude to Dr. Goutam Chakraborty, Department of Marketing and founder of the SAS and OSU Data Mining Certificate program, Oklahoma State University, for his support and guidance throughout this study.
CONTACT INFORMATION
Josephine Sarpong Akosa, 320-C Math Sciences (MSCS), Stillwater, OK

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017 RESEARCH ARTICLE Classification of Cancer Dataset in Data Mining Algorithms Using R Tool P.Dhivyapriya [1], Dr.S.Sivakumar [2] Research Scholar [1], Assistant professor [2] Department of Computer Science

More information

A Novel Iterative Linear Regression Perceptron Classifier for Breast Cancer Prediction

A Novel Iterative Linear Regression Perceptron Classifier for Breast Cancer Prediction A Novel Iterative Linear Regression Perceptron Classifier for Breast Cancer Prediction Samuel Giftson Durai Research Scholar, Dept. of CS Bishop Heber College Trichy-17, India S. Hari Ganesh, PhD Assistant

More information

ABSTRACT I. INTRODUCTION. Mohd Thousif Ahemad TSKC Faculty Nagarjuna Govt. College(A) Nalgonda, Telangana, India

ABSTRACT I. INTRODUCTION. Mohd Thousif Ahemad TSKC Faculty Nagarjuna Govt. College(A) Nalgonda, Telangana, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 1 ISSN : 2456-3307 Data Mining Techniques to Predict Cancer Diseases

More information

CANCER DIAGNOSIS USING NAIVE BAYES ALGORITHM

CANCER DIAGNOSIS USING NAIVE BAYES ALGORITHM CANCER DIAGNOSIS USING NAIVE BAYES ALGORITHM Rashmi M 1, Usha K Patil 2 Assistant Professor,Dept of Computer Science,GSSSIETW, Mysuru Abstract The paper Cancer Diagnosis Using Naive Bayes Algorithm deals

More information

COMPARISON OF DECISION TREE METHODS FOR BREAST CANCER DIAGNOSIS

COMPARISON OF DECISION TREE METHODS FOR BREAST CANCER DIAGNOSIS COMPARISON OF DECISION TREE METHODS FOR BREAST CANCER DIAGNOSIS Emina Alickovic, Abdulhamit Subasi International Burch University, Faculty of Engineering and Information Technologies Sarajevo, Bosnia and

More information

Classification of breast cancer using Wrapper and Naïve Bayes algorithms

Classification of breast cancer using Wrapper and Naïve Bayes algorithms Journal of Physics: Conference Series PAPER OPEN ACCESS Classification of breast cancer using Wrapper and Naïve Bayes algorithms To cite this article: I M D Maysanjaya et al 2018 J. Phys.: Conf. Ser. 1040

More information

Generalized additive model for disease risk prediction

Generalized additive model for disease risk prediction Generalized additive model for disease risk prediction Guodong Chen Chu Kochen Honors College, Zhejiang University Channing Division of Network Medicine, BWH & HMS Advised by: Prof. Yang-Yu Liu 1 Is it

More information

Diagnosis of Breast Cancer Using Ensemble of Data Mining Classification Methods

Diagnosis of Breast Cancer Using Ensemble of Data Mining Classification Methods International Journal of Bioinformatics and Biomedical Engineering Vol. 1, No. 3, 2015, pp. 318-322 http://www.aiscience.org/journal/ijbbe ISSN: 2381-7399 (Print); ISSN: 2381-7402 (Online) Diagnosis of

More information

Predicting Breast Cancer Survival Using Treatment and Patient Factors

Predicting Breast Cancer Survival Using Treatment and Patient Factors Predicting Breast Cancer Survival Using Treatment and Patient Factors William Chen wchen808@stanford.edu Henry Wang hwang9@stanford.edu 1. Introduction Breast cancer is the leading type of cancer in women

More information

Prediction of Malignant and Benign Tumor using Machine Learning

Prediction of Malignant and Benign Tumor using Machine Learning Prediction of Malignant and Benign Tumor using Machine Learning Ashish Shah Department of Computer Science and Engineering Manipal Institute of Technology, Manipal University, Manipal, Karnataka, India

More information

IMPROVED SELF-ORGANIZING MAPS BASED ON DISTANCE TRAVELLED BY NEURONS

IMPROVED SELF-ORGANIZING MAPS BASED ON DISTANCE TRAVELLED BY NEURONS IMPROVED SELF-ORGANIZING MAPS BASED ON DISTANCE TRAVELLED BY NEURONS 1 HICHAM OMARA, 2 MOHAMED LAZAAR, 3 YOUNESS TABII 1 Abdelmalak Essaadi University, Tetuan, Morocco. E-mail: 1 hichamomara@gmail.com,

More information

Breast Cancer Diagnosis using a Hybrid Genetic Algorithm for Feature Selection based on Mutual Information

Breast Cancer Diagnosis using a Hybrid Genetic Algorithm for Feature Selection based on Mutual Information Breast Cancer Diagnosis using a Hybrid Genetic Algorithm for Feature Selection based on Mutual Information Abeer Alzubaidi abeer.alzubaidi022014@my.ntu.ac.uk David Brown david.brown@ntu.ac.uk Abstract

More information

Are you in danger of stroke? An insight into the leading causes. ABSTRACT 1. INTRODUCTION

Are you in danger of stroke? An insight into the leading causes. ABSTRACT 1. INTRODUCTION Are you in danger of stroke? An insight into the leading causes. Anjali Bansal, Pallabi Deb, Musthan M., Dr. Goutam Chakraborty, Dr. Miriam McGaugh Oklahoma State University ABSTRACT Stroke aka cerebrovascular

More information

Rajiv Gandhi College of Engineering, Chandrapur

Rajiv Gandhi College of Engineering, Chandrapur Utilization of Data Mining Techniques for Analysis of Breast Cancer Dataset Using R Keerti Yeulkar 1, Dr. Rahila Sheikh 2 1 PG Student, 2 Head of Computer Science and Studies Rajiv Gandhi College of Engineering,

More information

Time-to-Recur Measurements in Breast Cancer Microscopic Disease Instances

Time-to-Recur Measurements in Breast Cancer Microscopic Disease Instances Time-to-Recur Measurements in Breast Cancer Microscopic Disease Instances Ioannis Anagnostopoulos 1, Ilias Maglogiannis 1, Christos Anagnostopoulos 2, Konstantinos Makris 3, Eleftherios Kayafas 3 and Vassili

More information

Donor Sentiment and Characteristic Analysis Using SAS Enterprise Miner and SAS Sentiment Analysis Studio

Donor Sentiment and Characteristic Analysis Using SAS Enterprise Miner and SAS Sentiment Analysis Studio Paper 3347 2015 Donor Sentiment and Characteristic Analysis Using SAS Enterprise Miner and SAS Sentiment Analysis Studio ABSTRACT Ramcharan Kakarla, Dr. Goutam Chakraborty; Oklahoma State University, Stillwater,

More information

Supersparse Linear Integer Models for Interpretable Prediction. Berk Ustun Stefano Tracà Cynthia Rudin INFORMS 2013

Supersparse Linear Integer Models for Interpretable Prediction. Berk Ustun Stefano Tracà Cynthia Rudin INFORMS 2013 Supersparse Linear Integer Models for Interpretable Prediction Berk Ustun Stefano Tracà Cynthia Rudin INFORMS 2013 CHADS 2 Scoring System Condition Points Congestive heart failure 1 Hypertension 1 Age

More information

An Improved Algorithm To Predict Recurrence Of Breast Cancer

An Improved Algorithm To Predict Recurrence Of Breast Cancer An Improved Algorithm To Predict Recurrence Of Breast Cancer Umang Agrawal 1, Ass. Prof. Ishan K Rajani 2 1 M.E Computer Engineer, Silver Oak College of Engineering & Technology, Gujarat, India. 2 Assistant

More information

Comparative Analysis of Artificial Neural Network and Support Vector Machine Classification for Breast Cancer Detection

Comparative Analysis of Artificial Neural Network and Support Vector Machine Classification for Breast Cancer Detection International Research Journal of Engineering and Technology (IRJET) e-issn: 2395-0056 Volume: 02 Issue: 09 Dec-2015 p-issn: 2395-0072 www.irjet.net Comparative Analysis of Artificial Neural Network and

More information

Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data

Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data Dhouha Grissa, Mélanie Pétéra, Marion Brandolini, Amedeo Napoli, Blandine Comte and Estelle Pujos-Guillot

More information

A Novel Prediction on Breast Cancer from the Basis of Association rules and Neural Network

A Novel Prediction on Breast Cancer from the Basis of Association rules and Neural Network Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 2, Issue. 4, April 2013,

More information

Performance of ART1 Network in the Detection of Breast Cancer

Performance of ART1 Network in the Detection of Breast Cancer 2012 2nd International Conference on Computer Design and Engineering (ICCDE 2012) IPCSIT vol. 49 (2012) (2012) IACSIT Press, Singapore DOI: 10.7763/IPCSIT.2012.V49.19 Performance of ART1 Network in the

More information

Predicting Breast Cancer using Novel Approach in Data Analytics

Predicting Breast Cancer using Novel Approach in Data Analytics Predicting Breast Cancer using Novel Approach in Data Analytics Ms. L. Sankari PG Scholar, Department of CSE Manakula Vinayagar Insitute of Technology Puducherry, India Mr. R. Rajbharath, Research Scholar,

More information

Mining Big Data: Breast Cancer Prediction using DT - SVM Hybrid Model

Mining Big Data: Breast Cancer Prediction using DT - SVM Hybrid Model Mining Big Data: Breast Cancer Prediction using DT - SVM Hybrid Model K.Sivakami, Assistant Professor, Department of Computer Application Nadar Saraswathi College of Arts & Science, Theni. Abstract - Breast

More information

CLASSIFICATION OF BREAST CANCER INTO BENIGN AND MALIGNANT USING SUPPORT VECTOR MACHINES

CLASSIFICATION OF BREAST CANCER INTO BENIGN AND MALIGNANT USING SUPPORT VECTOR MACHINES CLASSIFICATION OF BREAST CANCER INTO BENIGN AND MALIGNANT USING SUPPORT VECTOR MACHINES K.S.NS. Gopala Krishna 1, B.L.S. Suraj 2, M. Trupthi 3 1,2 Student, 3 Assistant Professor, Department of Information

More information

Fine Needle Aspirate of Breast Lesions Dataset

Fine Needle Aspirate of Breast Lesions Dataset Fine Needle Aspirate of Breast Lesions Dataset Dr Simon S Cross, Senior Lecturer, Department of Pathology, University of Sheffield Medical School, Beech Hill Road, Sheffield S10 2UL, UK, s.s.cross@sheffield.ac.uk

More information

Outlier detection in datasets with mixed-attributes

Outlier detection in datasets with mixed-attributes Vrije Universiteit Amsterdam Thesis Outlier detection in datasets with mixed-attributes Author: Milou Meltzer Supervisor: Johan ten Houten Evert Haasdijk A thesis submitted in fulfilment of the requirements

More information

An Enhanced Breast Cancer Diagnosis Scheme based on Two-Step-SVM Technique

An Enhanced Breast Cancer Diagnosis Scheme based on Two-Step-SVM Technique An Enhanced Breast Cancer Diagnosis Scheme based on Two-Step-SVM Technique Ahmed Hamza Osman Department of Information System, Faculty of Computing and Information Technology King Abdulaziz University

More information

PREDICTION OF BREAST CANCER USING STACKING ENSEMBLE APPROACH

PREDICTION OF BREAST CANCER USING STACKING ENSEMBLE APPROACH PREDICTION OF BREAST CANCER USING STACKING ENSEMBLE APPROACH 1 VALLURI RISHIKA, M.TECH COMPUTER SCENCE AND SYSTEMS ENGINEERING, ANDHRA UNIVERSITY 2 A. MARY SOWJANYA, Assistant Professor COMPUTER SCENCE

More information

Evaluation of Breast Specimens Removed by Needle Localization Technique

Evaluation of Breast Specimens Removed by Needle Localization Technique Evaluation of Breast Specimens Removed by Needle Localization Technique Specimen Handling: The breast specimen when received should be measured and grossly inspected for any orientation designated by the

More information

Fuzzy Analysis of Breast Cancer Disease using Fuzzy c-means and Pattern Recognition

Fuzzy Analysis of Breast Cancer Disease using Fuzzy c-means and Pattern Recognition SOUTHEAST EUROPE JOURNAL OF SOFT COMPUTING Available online at www.scjournal.com.ba Fuzzy Analysis of Breast Cancer Disease using Fuzzy c-means and Pattern Recognition Indira Muhic International University

More information

CURRENT METHODS IN IMAGE GUIDED BREAST BIOPSY

CURRENT METHODS IN IMAGE GUIDED BREAST BIOPSY CURRENT METHODS IN IMAGE GUIDED BREAST BIOPSY Stuart Silver April 24, 2004 OBJECTIVES Review development of current techniques Discuss stereotactic breast biopsy Discuss US guided breast biopsy 1 OBJECTIVES

More information

Australian Journal of Basic and Applied Sciences

Australian Journal of Basic and Applied Sciences AENSI Journals Australian Journal of Basic and Applied Sciences ISSN:1991-8178 Journal home page: www.ajbasweb.com Experimental Analysis Towards Realizing Breast Cancer Prognosis Using Diverse Machine

More information

Stage-Specific Predictive Models for Cancer Survivability

Stage-Specific Predictive Models for Cancer Survivability University of Wisconsin Milwaukee UWM Digital Commons Theses and Dissertations December 2016 Stage-Specific Predictive Models for Cancer Survivability Elham Sagheb Hossein Pour University of Wisconsin-Milwaukee

More information

Predicting Malignancy from Mammography Findings and Image Guided Core Biopsies

Predicting Malignancy from Mammography Findings and Image Guided Core Biopsies Predicting Malignancy from Mammography Findings and Image Guided Core Biopsies 2 nd Breast Cancer Workshop 2015 April 7 th 2015 Porto, Portugal Pedro Ferreira Nuno A. Fonseca Inês Dutra Ryan Woods Elizabeth

More information

SVM-Kmeans: Support Vector Machine based on Kmeans Clustering for Breast Cancer Diagnosis

SVM-Kmeans: Support Vector Machine based on Kmeans Clustering for Breast Cancer Diagnosis SVM-Kmeans: Support Vector Machine based on Kmeans Clustering for Breast Cancer Diagnosis Walaa Gad Faculty of Computers and Information Sciences Ain Shams University Cairo, Egypt Email: walaagad [AT]

More information

Mayuri Takore 1, Prof.R.R. Shelke 2 1 ME First Yr. (CSE), 2 Assistant Professor Computer Science & Engg, Department

Mayuri Takore 1, Prof.R.R. Shelke 2 1 ME First Yr. (CSE), 2 Assistant Professor Computer Science & Engg, Department Data Mining Techniques to Find Out Heart Diseases: An Overview Mayuri Takore 1, Prof.R.R. Shelke 2 1 ME First Yr. (CSE), 2 Assistant Professor Computer Science & Engg, Department H.V.P.M s COET, Amravati

More information

Testing Statistical Models to Improve Screening of Lung Cancer

Testing Statistical Models to Improve Screening of Lung Cancer Testing Statistical Models to Improve Screening of Lung Cancer 1 Elliot Burghardt: University of Iowa Daren Kuwaye: University of Hawai i at Mānoa Iowa Summer Institute in Biostatistics - University of

More information

Variable Features Selection for Classification of Medical Data using SVM

Variable Features Selection for Classification of Medical Data using SVM Variable Features Selection for Classification of Medical Data using SVM Monika Lamba USICT, GGSIPU, Delhi, India ABSTRACT: The parameters selection in support vector machines (SVM), with regards to accuracy

More information

Comparative Analysis of Machine Learning Algorithms for Chronic Kidney Disease Detection using Weka

Comparative Analysis of Machine Learning Algorithms for Chronic Kidney Disease Detection using Weka I J C T A, 10(8), 2017, pp. 59-67 International Science Press ISSN: 0974-5572 Comparative Analysis of Machine Learning Algorithms for Chronic Kidney Disease Detection using Weka Milandeep Arora* and Ajay

More information

Breast Cancer Diagnosis and Prognosis

Breast Cancer Diagnosis and Prognosis Breast Cancer Diagnosis and Prognosis Patrick Pantel Department of Computer Science University of Manitoba Winnipeg, Manitoba, Canada R3T 2N2 ppantel@cs.umanitoba.ca Abstract Breast cancer accounts for

More information

ORIGINAL ARTICLE Nuclear morphometry and texture analysis on cytological smears of thyroid neoplasms: a study of 50 cases

ORIGINAL ARTICLE Nuclear morphometry and texture analysis on cytological smears of thyroid neoplasms: a study of 50 cases Malaysian J Pathol 2017; 39(1) : 33 37 ORIGINAL ARTICLE Nuclear morphometry and texture analysis on cytological smears of thyroid neoplasms: a study of 50 cases Lopamudra DEKA MD, Shilpa GUPTA MD, Ruchika

More information

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you? WDHS Curriculum Map Probability and Statistics Time Interval/ Unit 1: Introduction to Statistics 1.1-1.3 2 weeks S-IC-1: Understand statistics as a process for making inferences about population parameters

More information

Palpable Breast Lesions Cytomorphological Analysis and Scoring System with Histopatholgical Correlation

Palpable Breast Lesions Cytomorphological Analysis and Scoring System with Histopatholgical Correlation IOSR Journal of Dental and Medical Sciences (IOSR-JDMS) e-issn: 2279-0853, p-issn: 2279-0861.Volume 15, Issue 10 Ver. III (October. 2016), PP 25-29 www.iosrjournals.org Palpable Breast Lesions Cytomorphological

More information

Multilayer Perceptron Neural Network Classification of Malignant Breast. Mass

Multilayer Perceptron Neural Network Classification of Malignant Breast. Mass Multilayer Perceptron Neural Network Classification of Malignant Breast Mass Joshua Henry 12/15/2017 henry7@wisc.edu Introduction Breast cancer is a very widespread problem; as such, it is likely that

More information

BreastScreen Victoria Annual Statistical Report

BreastScreen Victoria Annual Statistical Report BreastScreen Victoria Annual Statistical Report 005 Produced by: BreastScreen Victoria Coordination Unit Level, Pelham Street, Carlton South Victoria 05 PH 0 9660 6888 FX 0 966 88 EM info@breastscreen.org.au

More information

Automatic Classification of Breast Masses for Diagnosis of Breast Cancer in Digital Mammograms using Neural Network

Automatic Classification of Breast Masses for Diagnosis of Breast Cancer in Digital Mammograms using Neural Network IJSTE - International Journal of Science Technology & Engineering Volume 1 Issue 11 May 2015 ISSN (online): 2349-784X Automatic Classification of Breast Masses for Diagnosis of Breast Cancer in Digital

More information

Identifying Parkinson s Patients: A Functional Gradient Boosting Approach

Identifying Parkinson s Patients: A Functional Gradient Boosting Approach Identifying Parkinson s Patients: A Functional Gradient Boosting Approach Devendra Singh Dhami 1, Ameet Soni 2, David Page 3, and Sriraam Natarajan 1 1 Indiana University Bloomington 2 Swarthmore College

More information

Improved Hepatic Fibrosis Grading Using Point Shear Wave Elastography and Machine Learning

Improved Hepatic Fibrosis Grading Using Point Shear Wave Elastography and Machine Learning Improved Hepatic Fibrosis Grading Using Point Shear Wave Elastography and Machine Learning Presenter: Hersh Sagreiya 1, M.D. Authors: Alireza Akhbardeh 1, Ph.D., Isabelle Durot 1, M.D., Carlo Filice 2,

More information

Interpretable Models to Predict Breast Cancer

Interpretable Models to Predict Breast Cancer Interpretable Models to Predict Breast Cancer Pedro Ferreira, Inês Dutra, Rogerio Salvini, Elizabeth Burnside CRACS-INESC TEC, Porto, Portugal DCC-FC, Universidade do Porto, Porto, Portugal University

More information

Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets

Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets Chih-Lin Chi a, W. Nick Street b, William H. Wolberg c a Health Informatics Program, University of Iowa b

More information

International Journal of Advance Engineering and Research Development A THERORETICAL SURVEY ON BREAST CANCER PREDICTION USING DATA MINING TECHNIQUES

International Journal of Advance Engineering and Research Development A THERORETICAL SURVEY ON BREAST CANCER PREDICTION USING DATA MINING TECHNIQUES Scientific Journal of Impact Factor (SJIF): 4.14 e-issn: 2348-4470 p-issn: 2348-6406 International Journal of Advance Engineering and Research Development Volume 4, Issue 02 February -2018 A THERORETICAL

More information

Quality ID #263: Preoperative Diagnosis of Breast Cancer National Quality Strategy Domain: Effective Clinical Care

Quality ID #263: Preoperative Diagnosis of Breast Cancer National Quality Strategy Domain: Effective Clinical Care Quality ID #263: Preoperative Diagnosis of Breast Cancer National Quality Strategy Domain: Effective Clinical Care 2018 OPTIONS FOR INDIVIDUAL MEASURES: REGISTRY ONLY MEASURE TYPE: Process DESCRIPTION:

More information

Classification of mammogram masses using selected texture, shape and margin features with multilayer perceptron classifier.

Classification of mammogram masses using selected texture, shape and margin features with multilayer perceptron classifier. Biomedical Research 2016; Special Issue: S310-S313 ISSN 0970-938X www.biomedres.info Classification of mammogram masses using selected texture, shape and margin features with multilayer perceptron classifier.

More information

Cytological grading of breast carcinoma with histological correlation

Cytological grading of breast carcinoma with histological correlation Journal of BUON 10: 251-256, 2005 2005 Zerbinis Medical Publications. Printed in Greece ORIGINAL ARTICLE Cytological grading of breast carcinoma with histological correlation M. Jovicić-Milentijević 1,

More information

Building an Ensemble System for Diagnosing Masses in Mammograms

Building an Ensemble System for Diagnosing Masses in Mammograms Building an Ensemble System for Diagnosing Masses in Mammograms Yu Zhang, Noriko Tomuro, Jacob Furst, Daniela Stan Raicu College of Computing and Digital Media DePaul University, Chicago, IL 60604, USA

More information

QSAR studies of breast carcinoma using Artificial neural network, Bayesian classifier and Multiple linear regression

QSAR studies of breast carcinoma using Artificial neural network, Bayesian classifier and Multiple linear regression QSAR studies of breast carcinoma using Artificial neural network, Bayesian classifier and Multiple linear regression Guru Pratap Singh, Rajnish Kumar, Anju Sharma Amity Institute of Biotechnology, Amity

More information

Model-free machine learning methods for personalized breast cancer risk prediction -SWISS PROMPT

Model-free machine learning methods for personalized breast cancer risk prediction -SWISS PROMPT Model-free machine learning methods for personalized breast cancer risk prediction -SWISS PROMPT Chang Ming, 22.11.2017 University of Basel Swiss Public Health Conference 2017 Breast Cancer & personalized

More information

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training. Supplementary Figure 1 Behavioral training. a, Mazes used for behavioral training. Asterisks indicate reward location. Only some example mazes are shown (for example, right choice and not left choice maze

More information

Investigating the performance of a CAD x scheme for mammography in specific BIRADS categories

Investigating the performance of a CAD x scheme for mammography in specific BIRADS categories Investigating the performance of a CAD x scheme for mammography in specific BIRADS categories Andreadis I., Nikita K. Department of Electrical and Computer Engineering National Technical University of

More information

Artificial Intelligence in Breast Imaging

Artificial Intelligence in Breast Imaging Artificial Intelligence in Breast Imaging Manisha Bahl, MD, MPH Director of Breast Imaging Fellowship Program, Massachusetts General Hospital Assistant Professor of Radiology, Harvard Medical School Outline

More information

Personalized Colorectal Cancer Survivability Prediction with Machine Learning Methods*

Personalized Colorectal Cancer Survivability Prediction with Machine Learning Methods* Personalized Colorectal Cancer Survivability Prediction with Machine Learning Methods* 1 st Samuel Li Princeton University Princeton, NJ seli@princeton.edu 2 nd Talayeh Razzaghi New Mexico State University

More information

Predicting Breast Cancer Survivability Rates

Predicting Breast Cancer Survivability Rates Predicting Breast Cancer Survivability Rates For data collected from Saudi Arabia Registries Ghofran Othoum 1 and Wadee Al-Halabi 2 1 Computer Science, Effat University, Jeddah, Saudi Arabia 2 Computer

More information

Breast Cancer Diagnosis Based on K-Means and SVM

Breast Cancer Diagnosis Based on K-Means and SVM Breast Cancer Diagnosis Based on K-Means and SVM Mengyao Shi UNC STOR May 4, 2018 Mengyao Shi (UNC STOR) Breast Cancer Diagnosis Based on K-Means and SVM May 4, 2018 1 / 19 Background Cancer is a major

More information

Classification of Mammograms using Gray-level Co-occurrence Matrix and Support Vector Machine Classifier

Classification of Mammograms using Gray-level Co-occurrence Matrix and Support Vector Machine Classifier Classification of Mammograms using Gray-level Co-occurrence Matrix and Support Vector Machine Classifier P.Samyuktha,Vasavi College of engineering,cse dept. D.Sriharsha, IDD, Comp. Sc. & Engg., IIT (BHU),

More information

Analysis of Classification Algorithms towards Breast Tissue Data Set

Analysis of Classification Algorithms towards Breast Tissue Data Set Analysis of Classification Algorithms towards Breast Tissue Data Set I. Ravi Assistant Professor, Department of Computer Science, K.R. College of Arts and Science, Kovilpatti, Tamilnadu, India Abstract

More information

Malignant Tumor Detection Using Machine Learning through Scikit-learn

Malignant Tumor Detection Using Machine Learning through Scikit-learn Volume 119 No. 15 2018, 2863-2874 ISSN: 1314-3395 (on-line version) url: http://www.acadpubl.eu/hub/ http://www.acadpubl.eu/hub/ Malignant Tumor Detection Using Machine Learning through Scikit-learn Arushi

More information

Classıfıcatıon of Dıabetes Dısease Usıng Backpropagatıon and Radıal Basıs Functıon Network

Classıfıcatıon of Dıabetes Dısease Usıng Backpropagatıon and Radıal Basıs Functıon Network UTM Computing Proceedings Innovations in Computing Technology and Applications Volume 2 Year: 2017 ISBN: 978-967-0194-95-0 1 Classıfıcatıon of Dıabetes Dısease Usıng Backpropagatıon and Radıal Basıs Functıon

More information

DPPred: An Effective Prediction Framework with Concise Discriminative Patterns

DPPred: An Effective Prediction Framework with Concise Discriminative Patterns IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, MANUSCRIPT ID DPPred: An Effective Prediction Framework with Concise Discriminative Patterns Jingbo Shang, Meng Jiang, Wenzhu Tong, Jinfeng Xiao, Jian

More information

International Journal of Pharma and Bio Sciences A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS ABSTRACT

International Journal of Pharma and Bio Sciences A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS ABSTRACT Research Article Bioinformatics International Journal of Pharma and Bio Sciences ISSN 0975-6299 A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS D.UDHAYAKUMARAPANDIAN

More information

Evaluating Classifiers for Disease Gene Discovery

Evaluating Classifiers for Disease Gene Discovery Evaluating Classifiers for Disease Gene Discovery Kino Coursey Lon Turnbull khc0021@unt.edu lt0013@unt.edu Abstract Identification of genes involved in human hereditary disease is an important bioinfomatics

More information

BREAST CANCER EPIDEMIOLOGY MODEL:

Calibrating Simulations via Optimization. Michael C. Ferris, Geng Deng, Dennis G. Fryback, Vipat Kuruchittham, University of Wisconsin. University of Wisconsin Breast Cancer

Computer-Aided Diagnosis for Microcalcifications in Mammograms

Werapon Chiracharit, Department of Electronic and Telecommunication Engineering, King Mongkut's University of Technology Thonburi. BIE 690, November

Follicular Derived Thyroid Tumors

Jennifer L. Hunt, MD, MEd, Aubrey J. Hough Jr, MD, Endowed Professor of Pathology, Chair of Pathology and Laboratory Medicine, University of Arkansas for Medical Sciences

AMSER Case of the Month: November 2018

52-year-old female with an abnormal screening mammogram. Areeg Rehman, MS 4, Nova Southeastern University; Rebecca T. Sivarajah, MD, Penn State University College of

Brain Tumour Detection of MR Image Using Naïve Beyer classifier and Support Vector Machine

International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 2018 IJSRCSEIT, Volume 3, Issue 3, ISSN: 2456-3307

Class discovery in Gene Expression Data: Characterizing Splits by Support Vector Machines

Florian Markowetz and Anja von Heydebreck, Max-Planck-Institute for Molecular Genetics, Computational Molecular Biology

Copyright 2007 IEEE. Reprinted from 4th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, April 2007.

This material is posted here with permission of the IEEE. Such permission of the

Basic Biostatistics. Chapter 1. Content

Jamalludin Ab Rahman, MD, MPH, Department of Community Medicine, Kulliyyah of Medicine. Content: Basic premises: variables, level of measurements, probability distribution. Descriptive

NUCLEI SEGMENTATION OF MICROSCOPY IMAGES OF THYROID NODULES VIA ACTIVE CONTOURS AND K-MEANS CLUSTERING

1st International Conference on Experiments/Process/System Modelling/Simulation/Optimization (1st IC-EpsMsO), Athens, 6-9 July, 2005

Panel: Machine Learning in Surgery and Cancer

Professor Dimitris Bertsimas, SM '87, PhD '88, Boeing Leaders for Global Operations Professor of Management; Professor of Operations Research; Co-Director, Operations

Applying One-vs-One and One-vs-All Classifiers in k-nearest Neighbour Method and Support Vector Machines to an Otoneurological Multi-Class Problem

Oral Presentation at MIE 2011, 30th August 2011, Oslo. Kirsi

Cytyc Corporation - Case Presentation Archive - March 2002

FirstCyte Ductal Lavage. History: 68 Year Old Female. Gail Index: Unknown. Clinical History: Negative Mammogram in 1995; 6 yrs. later presents with bloody nipple discharge. Subsequent suspicious mammogram. Suspicious

Biomedical Research 2016; Special Issue: S148-S152 ISSN 0970-938X

www.biomedres.info. Prognostic classification tumor cells using an unsupervised model. R Sathya Bama Krishna 1*, M Aramudhan 2. 1 Department

Utilizing Posterior Probability for Race-composite Age Estimation

Early Applications to MORPH-II. Benjamin Yip, NSF-REU in Statistical Data Mining and Machine Learning for Computer Vision and Pattern Recognition

Statistics 202: Data Mining. © Jonathan Taylor. Final review. Based in part on slides from textbook, slides of Susan Holmes.

December 5, 2012. Overview, Before Midterm: General goals of data mining. Datatypes. Preprocessing & dimension

Almost any suspected tumor can be aspirated easily and safely. Some masses are more risky to aspirate including:

DOES THIS PATIENT HAVE CANCER? USING IN-HOUSE CYTOLOGY TO HELP YOU MAKE THIS DIAGNOSIS. Joyce Obradovich, DVM, Diplomate, ACVIM (Oncology), Animal Cancer & Imaging Center, Canton, Michigan. Almost every

AN EXPERT SYSTEM FOR THE DIAGNOSIS OF DIABETIC PATIENTS USING DEEP NEURAL NETWORKS AND RECURSIVE FEATURE ELIMINATION

International Journal of Civil Engineering and Technology (IJCIET), Volume 8, Issue 12, December 2017, pp. 633-641, Article ID: IJCIET_08_12_069. Available online at http://http://www.iaeme.com/ijciet/issues.asp?jtype=ijciet&vtype=8&itype=12

Predicting Breast Cancer Recurrence Using Machine Learning Techniques

Umesh D R, Department of Computer Science & Engineering, PESCE, Mandya, Karnataka, India; Dr. B Ramachandra, Department of Electrical and

Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes

J. Sujatha, Research Scholar, Vels University; Assistant Professor, Post Graduate

Mammogram Analysis: Tumor Classification

Literature Survey Report. Geethapriya Raghavan, geeragh@mail.utexas.edu. EE 381K - Multidimensional Digital Signal Processing, Spring 2005. Abstract: Breast cancer is

Assigning B cell Maturity in Pediatric Leukemia Gabi Fragiadakis 1, Jamie Irvine 2 1 Microbiology and Immunology, 2 Computer Science

Abstract: One method for analyzing pediatric B cell leukemia is to categorize

Analysis and Interpretation of Data Part 1

DATA ANALYSIS: PRELIMINARY STEPS. 1. Editing: Field Edit, Completeness, Legibility, Comprehensibility, Consistency, Uniformity, Central Office Edit. 2. Coding: Specifying

Atypical And Suspicious Categories In Fine Needle Aspiration Cytology Of The Breast

IOSR Journal of Dental and Medical Sciences (IOSR-JDMS), e-ISSN: 2279-0853, p-ISSN: 2279-0861. Volume 15, Issue 10 Ver. III (October 2016), PP 57-61. www.iosrjournals.org

MACHINE LEARNING BASED APPROACHES FOR PREDICTION OF PARKINSON'S DISEASE

Arvind Kumar Tiwari, GGS College of Modern Technology, SAS Nagar, Punjab, India. Abstract: The prediction of Parkinson's disease is

Detection of Neuromuscular Diseases Using Surface Electromyograms

Faculty of Electrical Engineering and Computer Science, University of Maribor (1); Department of Computer Science, University of Cyprus (2); The Cyprus Institute of Neurology and Genetics (3)

Breast Cancer Prevention and Early Detection using Different Processing Techniques

International Journal on Emerging Technologies (Special Issue on ICRIET-2016) 7(2): 92-96 (2016). ISSN No. (Print): 0975-8364, ISSN No. (Online): 2249-3255

Methods for Predicting Type 2 Diabetes

CS229 Final Project, December 2015. Duyun Chen 1, Yaxuan Yang 2, and Junrui Zhang 3. Abstract: Diabetes Mellitus type 2 (T2DM) is the most common form of diabetes [WHO

Performance Based Evaluation of Various Machine Learning Classification Techniques for Chronic Kidney Disease Diagnosis

Sahil Sharma, Department of Computer Science & IT, University Of Jammu, Jammu, India

Clustering with Genotype and Phenotype Data to Identify Subtypes of Autism

Project Category: Life Science. Rocky Aikens (raikens), Brianna Kozemzak (kozemzak). December 16, 2017. 1 Introduction and Motivation