TABLE OF CONTENTS CHAPTER NO. TITLE PAGE NO. ABSTRACT

Similar documents
Supplement materials:

A hybrid Model to Estimate Cirrhosis Using Laboratory Testsand Multilayer Perceptron (MLP) Neural Networks

Mayuri Takore 1, Prof.R.R. Shelke 2 1 ME First Yr. (CSE), 2 Assistant Professor Computer Science & Engg, Department

Modelling and Application of Logistic Regression and Artificial Neural Networks Models

Multi Parametric Approach Using Fuzzification On Heart Disease Analysis Upasana Juneja #1, Deepti #2 *

The effects of the underlying disease and serum albumin on GFR prediction using the Adaptive Neuro Fuzzy Inference System (ANFIS)

A prediction model for type 2 diabetes using adaptive neuro-fuzzy interface system.

Developing a Fuzzy Database System for Heart Disease Diagnosis

State Profile for FY 2018 for Dialysis Patients and Facilities - STATE SAMPLE

Analysis of Classification Algorithms towards Breast Tissue Data Set

A Fuzzy Expert System for Heart Disease Diagnosis

A Fuzzy Improved Neural based Soft Computing Approach for Pest Disease Prediction

Predicting Juvenile Diabetes from Clinical Test Results

Prediction of heart disease using k-nearest neighbor and particle swarm optimization.

COMPARATIVE STUDY OF EXISTING TECHNIQUES FOR DIAGNOSING VARIOUS THYROID AILMENTS

Identification and Classification of Coronary Artery Disease Patients using Neuro-Fuzzy Inference Systems

An Improved Algorithm To Predict Recurrence Of Breast Cancer

A Data Mining Technique for Prediction of Coronary Heart Disease Using Neuro-Fuzzy Integrated Approach Two Level

Application of Tree Structures of Fuzzy Classifier to Diabetes Disease Diagnosis

Predicting Breast Cancer Recurrence Using Machine Learning Techniques

Australian Journal of Basic and Applied Sciences

A Critical Study of Classification Algorithms for LungCancer Disease Detection and Diagnosis

Classıfıcatıon of Dıabetes Dısease Usıng Backpropagatıon and Radıal Basıs Functıon Network

Performance Analysis of Liver Disease Prediction Using Machine Learning Algorithms

Statistical Fact Sheet Populations

2008 Dialysis Facility Report

A new method of automatic recognition for tuberculosis disease diagnosis using support vector machines.

Classification of Liver disease using Multilayer Perceptron Neural Network

Performance Analysis of Classification Algorithms on a Novel Unified Clinical Decision Support Model for Predicting Coronary Heart Disease Risks

Supplementary Online Content

Study of Data Mining Algorithms in the Context of Performance Enhancement of Classification

A New Monotony Advanced Decision Tree Using Graft Algorithm to Predict the Diagnosis of Diabetes Mellitus

CHAPTER 2 LITERATURE REVIEW

INVESTIGATING ILPD FOR MOST SIGNIFICANT FEATURES

Heart Disease Prediction System Using Data Mining and Hybrid Intelligent Techniques: A Review

Application of distributed lighting control architecture in dementia-friendly smart homes

MACHINE LEARNING ON DIABETES MANAGEMENT: EMPLOYABILITY OF ADVANCED LOGISTIC REGRESSION AND PREDICTIVE ANALYSIS IN EARLY DETECTION OF DIABETES

Predictive Models for Healthcare Analytics

Chapter 4: Cardiovascular Disease in Patients With CKD

Rohit Miri Asst. Professor Department of Computer Science & Engineering Dr. C.V. Raman Institute of Science & Technology Bilaspur, India

Performance Analysis of Different Classification Methods in Data Mining for Diabetes Dataset Using WEKA Tool

USRDS UNITED STATES RENAL DATA SYSTEM

Prediction of Diabetes by using Artificial Neural Network

A Novel Prediction Approach for Myocardial Infarction Using Data Mining Techniques

ANALYSIS AND CLASSIFICATION OF EEG SIGNALS. A Dissertation Submitted by. Siuly. Doctor of Philosophy

Question 1 Multiple Choice (8 marks)

DUKECATHR Dataset Dictionary

INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET)

Diagnosis of Breast Cancer Using Ensemble of Data Mining Classification Methods

Survey on Breast Cancer Analysis using Machine Learning Techniques

Keywords Artificial Neural Networks (ANN), Echocardiogram, BPNN, RBFNN, Classification, survival Analysis.

Chapter 4: Cardiovascular Disease in Patients With CKD

EFFICIENT MULTIPLE HEART DISEASE DETECTION SYSTEM USING SELECTION AND COMBINATION TECHNIQUE IN CLASSIFIERS

An SVM-Fuzzy Expert System Design For Diabetes Risk Classification

Probabilistic Reasoning for Medical Decision Support. Omolola Ogunyemi, PhD Director, Center for Biomedical Informatics Charles Drew University

Table 1. Proposed Measures for Use in Establishing Quality Performance Standards that ACOs Must Meet for Shared Savings

Rajiv Gandhi College of Engineering, Chandrapur

Correlate gestational diabetes with juvenile diabetes using Memetic based Anytime TBCA

Comparative Analysis of Machine Learning Algorithms for Chronic Kidney Disease Detection using Weka

SCIENCE & TECHNOLOGY

PREDICTION OF HEART DISEASE USING HYBRID MODEL: A Computational Approach

TYPE II MI. KC ACDIS LOCAL CHAPTER March 8, 2016

2011 Dialysis Facility Report

DIABETIC RISK PREDICTION FOR WOMEN USING BOOTSTRAP AGGREGATION ON BACK-PROPAGATION NEURAL NETWORKS

Comparability of patient-reported health status: multi-country analysis of EQ-5D responses in patients with type 2 diabetes

Data-Mining-Based Coronary Heart Disease Risk Prediction Model Using Fuzzy Logic and Decision Tree

Meaningful Use Clinical Quality Measures for Eligible Professionals

2011 Dialysis Facility Report SAMPLE Dialysis Facility State: XX Network: 99 CCN: SAMPLE Dialysis Facility Report SAMPLE

Survey on Decision Support System For Heart Disease

Diabetes Patient s Risk through Soft Computing Model

Chapter 9: Cardiovascular Disease in Patients With ESRD

Adult Diabetes Clinician Guide NOVEMBER 2017

HEART DISEASE PREDICTION BY ANALYSING VARIOUS PARAMETERS USING FUZZY LOGIC

2010 Dialysis Facility Report

ISCHEMIC VASCULAR DISEASE (IVD) MEASURES GROUP OVERVIEW

AN EXPERT SYSTEM FOR THE DIAGNOSIS OF DIABETIC PATIENTS USING DEEP NEURAL NETWORKS AND RECURSIVE FEATURE ELIMINATION

Particle Swarm Optimization Supported Artificial Neural Network in Detection of Parkinson s Disease

Predicting Breast Cancer Survivability Rates

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1

Brain Tumor Segmentation Based On a Various Classification Algorithm

2016 Internal Medicine Preferred Specialty Measure Set

Predicting Heart Attack using Fuzzy C Means Clustering Algorithm

Neuroinformatics. Ilmari Kurki, Urs Köster, Jukka Perkiö, (Shohei Shimizu) Interdisciplinary and interdepartmental

A. Sheik Abdullah PG Scholar Department of Computer Science Engineering Kongu Engineering College ABSTRACT

Research Article. Automated grading of diabetic retinopathy stages in fundus images using SVM classifer

Clinical Quality Measures for Submission by Medicare or Medicaid EP/s for the 2011 and 2012 Payment Year

4. Which survey program does your facility use to get your program designated by the state?

Clinical Quality Measures

Viszards09: Bibsonomy

BLOOD GLUCOSE PREDICTION MODELS FOR PERSONALIZED DIABETES MANAGEMENT

Comparative analysis of data mining tools for lungs cancer patients

ABSTRACT. classification algorithm, so rules can be extracted from the decision tree models.

ABSTRACT I. INTRODUCTION II. HEART DISEASE

Design of Multi-Class Classifier for Prediction of Diabetes using Linear Support Vector Machine

Data Mining Application in Diabetes Diagnosis using Biomedical Records of Pathological Attribute

An Edge-Device for Accurate Seizure Detection in the IoT

BACKPROPOGATION NEURAL NETWORK FOR PREDICTION OF HEART DISEASE

Supplemental Table 1. Standardized Serum Creatinine Measurements. Supplemental Table 3. Sensitivity Analyses with Additional Mortality Outcomes.

Patient characteristics Intervention Comparison Length of followup

Procedia Computer Science

Transcription:

vii TABLE OF CONTENTS CHAPTER NO. TITLE PAGE NO. ABSTRACT LIST OF TABLES LIST OF FIGURES LIST OF SYMBOLS AND ABBREVIATIONS iii xi xii xiii 1 INTRODUCTION 1 1.1 CLINICAL DATA MINING 1 1.2 OBJECTIVES OF THE RESEARCH 5 1.3 LITERATURE REVIEW 6 1.3.1 Research Works Based on Data Mining Techniques 7 1.3.2 Research Works on Data Mining Using Clinical Datasets 11 1.3.3 Research Works on Heart Disease Prognosis, Diagnosis and Risk Prediction 17 1.3.4 Research Works on Diagnosis and Prediction of Hepatitis 22 1.3.5 Research Works on Correlation among Hepatitis, Heart Disease, Diabetes and Anaemia 26 1.4 ORGANISATION OF THE THESIS 31

viii CHAPTER NO. TITLE PAGE NO. 2 INTELLIGENT PREDICTIVE MODEL FOR KNOWLEDGE DISCOVERY FROM CLINICAL DATASETS 33 2.1 PRE-PROCESSING 33 2.2 PRE-MINING SUBSYSTEM 37 2.3 MINING SUBSYSTEM 38 2.4 EVALUATING SUBSYSTEM 40 2.5 KNOWLEDGE BASE 41 2.6 INFERENCE AND FORECASTING SUBSYSTEM 41 2.7 EVALUATION OF THE MODEL 42 3 NEURO FUZZY APPROACH FOR PREDICTING THE SURVIVAL OF HEPATITIS 43 3.1 HEPATITIS DATA 44 3.2 PRE-MINING SUBSYSTEM 45 3.2.1 Principal Component Analysis Technique 46 3.2.2 Fuzzy C-Means Clustering Technique 47 3.3 NEURO FUZZY CLASSIFIER 49 3.4 INFERENCE AND FORECASTING SUBSYSTEM 51 3.5 EXPERIMENTAL RESULTS 52 4 COMPARATIVE WORK FOR DISCOVERING RULES FROM HEPATITIS DATASET 55 4.1 HEPATITIS DATASET 55 4.2 PRE-MINING SUBYSTEM 57

ix CHAPTER NO. TITLE PAGE NO. 4.3 MINING SUBSYSTEM 58 4.3.1 Association Rule Mining 58 4.3.2 Neural Network 59 4.3.3 Decision Tree 62 4.4 RULE VALIDATIONSUBSYSTEM 63 4.5 INFERENCE AND FORECASTING 63 4.6 EXPERIMENTAL RESULTS 64 5 STATISTICAL APPROACH FOR PREDICTING THE PRESENCE OF HEART DISEASE 68 5.1 HEART DISEASE DATASET 69 5.2 PRE-MINING SUBSYSTEM 70 5.3 MINING SUBSYSTEM 71 5.3.1 Contingency Table Generation 72 5.3.2 Rule Generation 72 5.4 VALIDATION SUBSYSTEM 74 5.5 INFERENCE AND FORECASTING SUBSYSTEM 74 5.5.1 Classification 74 5.5.2 Weight of Evidence 76 5.5.3 Confidence Estimation 78 5.6 EXPERIMENTAL RESULTS 79 6 FUZZY NEURO-GENETIC APPROACH FOR PREDICTING THE SEVERITY OF HEART DISEASE 81 6.1 HEART DISEASE DATA 82 6.2 PRE-MINING SUBSYSTEM 84

x CHAPTER NO. TITLE PAGE NO. 6.3 MINING SUBSYSTEM 86 6.3.1 Training 86 6.3.2 Rule Selection 89 6.4 VALIDATION SUBSYSTEM 90 6.5 KNOWLEDGE BASE 90 6.6 INFERENCE AND FORECASTING SUBSYSTEM 91 6.7 EXPERIMENTAL RESULTS 91 7 CONCLUSIONS AND FUTURE WORKS 96 7.1 CONCLUSION 96 7.2 FUTURE WORK 99 REFERENCES 101 LIST OF PUBLICATIONS 112 VITAE 113

xi LIST OF TABLES TABLE NO. TITLE PAGE NO. 3.1 Dataset Description (Hepatitis) 45 3.2 Contingency Table for best run 52 3.3 Contingency Table for average run 53 3.4 Contingency Table for worst run 53 3.5 Performance Measures 53 4.1 Illustrative Time-Series Hepatitis Data 57 4.2 Attributes and their variations over time 58 4.3 Number of rules generated 64 4.4 Confusion Matrix (Neural Network) 66 4.5 Confusion Matrix (Decision Tree) 66 4.6 Performance Measure of Intelligent Rule Miner 67 5.1 Hungarian Dataset Description 70 5.2 Contingency table for sex and chest pain type 72 5.3 Heart Disease Data 79 5.4 Contingency Table (Bayesian Classifier) 80 5.5 Performance Measures 80 6.1 Description of Heart Disease Database 83 6.2 Explicatory Rules 93 6.3 Contingency Table (Heart Disease) 94 6.4 Comparison of Classification Accuracy for Cleveland heart data 94 7.1 Comparison of Classification Accuracy for Hepatitis Data 97 7.2 Comparison of Classification Accuracy for Heart Disease Data 98

xii LIST OF FIGURES FIGURE NO. TITLE PAGE NO. 2.1 Model for Knowledge Discovery 34 3.1 Model Tailored Using Neuro-Fuzzy Inferencing Technique for Predicting Survival of Hepatitis 44 3.2 Neuro-Fuzzy Classifier 50 4.1 Model Tailored Using Association Rule Mining, Neural Network and Decision Tree to Predict Hepatitis 56 4.2 Network Architecture 59 4.3 Decision Tree 62 4.4 Histogram 65 4.5 Error Rate for Training 65 5.1 Model Tailored Using Statistical Classifier to Predict Heart Disease 69 5.2 Explicatory Rules 79 6.1 Model Tailored Using Fuzzy Neuro-Genetic Technique for Predicting the Severity of Heart Disease 82 6.2 Neural Network 89 6.3 Run Time Analysis 92

xiii LIST OF SYMBOLS AND ABBREVIATIONS ANFIS ATP ALB ALK AMP ANN BMI CVD CTM CHE CANFIS CHF CSFNN CABG CAD CHD DAC ECG ESRD EPO FSS FACO FCM FL FNN - Adaptive Neuro Fuzzy Inference System - Adult Treatment Panel - Albumin - Alkaline - Anemia Management Protocol - Artificial Neural Networks - Body Mass Index - Cardiovascular Diseases - Central Tendency Measure - Cholinesterase - Co-Active Neuro-Fuzzy Inference System - Congestive Heart Failure - Conic Section Function Neural Network - Coronary Artery Bypass Graft Surgery - Coronary Artery Disease - Coronary Heart Disease - Direct Adaptive Controller - Electro Cardio Graph - End Stage Renal Disease - Erythropoeitin - Feature Subset Selection - Fuzzy based Ant Colony Algorithm - Fuzzy C-Means Clustering - Fuzzy Logic - Fuzzy Neural Network

xiv FRBCS - Fuzzy Rule Based Classifier System GRNN - Generalized Regression Neural Network GA - Genetic Algorithms GOT - Glutamic-Oxaloacetic Transminase GPT - Glutamic-Pyruvic Transminase HGB - Hemoglobin HBV - Hepatitis B Virus HCV - Hepatitis C Virus HDV - Hepatitis D Virus HEMR - Hepatitis Electronic Medical Record System HDL-C - High Density Lipoprotein Cholesterol HOMA-IR - Homeostasis Model Assessment of Insulin Resistance HIV - Human Immuodeficiency Virus IHDPS - Intelligent Heart Disease Prediction System LEM - Learning From Examples LVQ - Learning Vector Quantization LDL - Low Density Lipoprotein UCS - Michigan-style Learning Classifier System MICD - Minimum Inter Class Distance Classifier MPC - Model Predictive Controller MLP - Multi Layer Perceptron MI - Myocardial Infarction NLCS - Neural Based Learning Classifier System NeC4.5 - Neural Ensemble based C4.5 NN - Neural Networks PD - Pattern Discovery PCI - Percutaneous Coronary Intervention PTDM - Post Transplant Diabetes Mellitus PCA - Principle Component Analysis

xv PNN TP RBF RFNN RNA SGOT SGPT SNP SARSA SRNN SQL SVM TTT T-BIL T-CHO TRF TSAT UCI WHO ZTT - Probabilistic Neural Network - Protein Total - Radial Basis Function - Recurrent Fuzzy Neural Network - Ribo-Nucleic Acid - Serum Glutamic-Ocaloacetic Transminase - Serum Glutamic-Pyruvic Transminase - Single Nucleotide Polymorphism - State-Action-Reward-State-Action - State-space Recurrent Neural Networks - Structured Query Language - Support Vector Machine - Thymol Turbidity Test - Total Bilirubin - Total Cholesterol - Total Risk Factor - Transferin Saturation - University of California, Irvine - World Health Organization - Zinc Sulphate Turbidity Test