Chemometrics for Analysis of NIR Spectra on Pharmaceutical Oral Dosages

Size: px
Start display at page:

Download "Chemometrics for Analysis of NIR Spectra on Pharmaceutical Oral Dosages"

Transcription

1 Chemometrics for Analysis of NIR Spectra on Pharmaceutical Oral Dosages William Welsh, Sastry Isukapalli, Rodolfo Romañach, Bozena Kohn-Michniak, Alberto Cuitino, Fernando Muzzio NATIONAL SCIENCE FOUNDATION A I R ACCELERATING INNOCATION RESEARCH PROGRAM NON-DESTRUCTIVE CHARACTERIZATION OF PHARMACEUTICAL PRODUCTS NSF ENGINEERING RESEARCH CENTER FOR STRUCTURE ORGANIC PARTICULATE SYSTEMS 1

2 Destructive vs Non-Destructive Testing Tablets the most common drug delivery vehicle Dissolution tests: Key requirement for the development, registration, approval, and quality control of these tablets. Disadvantages of dissolution: Destructive, time consuming, expensive, and tedious. Need a fast, non-destructive and easy technique for tablet characterization. Near IR (NIR) spectroscopy serves this purpose. 2

3 Diffuse NIR Spectrometry Detector for Reflectance NIR spectra tablet Detector for Transmission Use Chemometrics to Correlate NIR spectral features to sample properties 3

4 NIR Dataset Reflectance and Transmission Spectra 47 samples: API (acetaminophen); lactose; MgStearate (1%) Two dependent variables: %API (-30%), and Compaction Force (7 - kn) Output t data: Reflectance (R), Transmittance (T); Pooled R & T data, 1st and 2nd derivatives Chemometric Models Correlate %API and CF with NIR spectral data Standard Models: HCA, Regression Trees (CART), PLS Approaches for improved predictive ability: LASSO Regression, Ridge regression, Elastic Nets, Bayesian models 4

5 Chemometrics Two General Approaches Unsupervised Principal Component Analysis (PCA) Hierarchical Cluster Analysis (HCA) Supervised Partial Least Squares (PLS) regression Classification and Regression Tree (CART) Support Vector Machine (SVM) Artificial Neural Network (ANN) LASSO, Ridge Regression, and Elastic Nets regression clustering group 1 group 2 classification inactive drug active drug Predict ted Value Actual Value 5

6 Chemometrics Many Uses Many Common Methods HCA PCA Data Exploration & Clustering knn SVR Classification & Discriminationi i? Cooman s plot ANN CLASS 1 OUTLIERS Quantitative Prediction & Correlation CLASS 1&2 CLASS 2

7 Clustering with Pooled Reflectance and Transmission Data SNV: clustering with 12 clusters [case snv-r-t] 1 Compac ction Force (kn) % Active Ingredient Unsupervised clustering Distinct clusters for low CF cases and for low %API cases Single sub-cluster for high CF-%API cases 7

8 Hierarchical Cluster Analysis Iterative agglomeration of clusters through distance similarity measures To estimate control variables from experimental conditions (CF, %API) Clustering of samples based on their spectra only moderately correlated to CF-%API groupings g Distance SF-1-17 SF SF-1-17 SF-1-17 SF-16-1 SF SF SF SF--17 SF-16-1 SF-24- SF-16-1 SF-1-17 SF SF--16 SF--17 SF SF SF--17 SF-30- SF-2- SF--17 SF SF-26- SF SF-24- SF SF--16 SF SF SF SF SF SF-26- SF SF SF SF SF SF--17 SF-- SF SF SF SF--12 SF SF SF--12 SF SF-1-16 SF SF-2-12 SF SF-2-13 SF SF SF-12- SF-1-11 SF--11 SF SF-2-12 SF SF SF--13 SF-12-9 SF-- SF-- SF SF SF-- SF-- SF-22- SF-1- SF-16- SF-- SF-16- SF Cluster Nodes

9 LASSO, Ridge, and Elastic Net Regressions Ordinary least squares regression models Y ƒ(x i ) tend to overfit the data, leading to poor predictive ability. The problem is over-determined: many more variables (spectral data) than solutions (property values; samples). A process called Regularization can be introduced to prevent overfitting and to provide models that are predictive (low bias) & robust (low variance). Examples of this approach are LASSO (Least Absolute Shrinkage and Selection Operator) Ridge regression Elastic Nets 9

10 LASSO, Ridge, and Elastic Net Regressions Methods like LASSO penalize over-complex models, thereby leading to models with fewer terms (Occam s Razor). Occam s Razor: All other things being equal, simpler solutions are preferred over complex ones. Simpler models discern hidden structure, and may thus have better predictive performance. LASSO models are more easily interpretable; fewer variables.

11 CART Regression Trees % Active Ingred dient (predicted by CART) pr redicted Compactio on Force (kn) (pre edicted by CART predicte ed Transmission Data only Regression Tree Modeling [case: snv-t] % Active Ingredient % Active Ingredient (actual) Reflectance Data only Regression Tree Modeling [case: snv-r] % Active Ingredient % Active Ingredient (actual) 30 All Data Pooled (including derivatives) Regression Tree Modeling [case: all-together] 35 % Active Ingredient % Active Ingredient (actual) actual actual actual Regression Tree Modeling [case: snv-t] Compaction Force 5 5 Compaction Force (kn) (actual) Regression Tree Modeling [case: snv-r] Compaction Force 5 5 Compaction Force (kn) (actual) Regression Tree Modeling [case: all-together] 5 5 Compaction Force (kn) (actual)

12 LASSO Regression 30 Predicted % Active Ingre edient predict ted All Data Pooled Transmission Data only Reflectance Data only (including derivatives) % A.I. LASSO Regression [Baseline T] edient Predicted % Active Ingre 30 LASSO Regression [SNV R] edient Predicted % Active Ingre 30 LASSO Regression [B/SNV/G1/G2: R + T combined] 1 16 Predicted Compac ction Force (kn) predic cted 12 Actual % Active Ingredient Compaction Force LASSO Regression [Baseline T] Actual % Active Ingredient LASSO Regression [SNV R] 30 Actual % Active Ingredient actual actual actual ction Force (kn) Predicted Compac paction Force (kn) Predicted Comp LASSO Regression [B/SNV/G1/G2: R + T combined] Actual Compaction Force (kn) Actual Compaction Force (kn) Actual Compaction Force (kn)

13 1 Actual [cross-validation] LASSO Regression [B/SNV/G1/G2: R only] Predicted oactual o predicted R only Reflectance Data LASSO Regression e paction (kn) Force Compacti % Active Ingredient 1 Compaction Forc ce (kn) on Force 16 Compactio 12 %API LASSO Regression [B/SNV/G1/G2: T only] Actual [cross-validation] Predicted oactual o predicted Tonly Transmission Data Simultaneous prediction of %API and Compaction Force Improvement when both reflectance and transmission data are used rce (kn) n Force Compaction For Co LASSO Regression [B/SNV/G1/G2: R + T combined] Actual [cross-validation] Predicted oactual o predicted Pooled Data R and T %API % Active Ingredient %API % Active Ingredient

14 Concluding Remarks Novel chemometrics methods can build relationships between non-destructive test data and biorelevant properties of tablets, including dissolution Pooling reflectance & transmission data advantageous LASSO regression delivers substantial model improvements, and identifies subset of information-rich variables (NIR features) Methods used in the analysis are easily scalable Thousands of dimensions/millions of rows Open source tool kits

15 Acknowledgments People Rutgers: Bozena Michniak-Kohn, Alberto Cuitino, Fernando Muzzio UPR-Mayaguez: Rodolfo J. Romañach Team at Rutgers ERC-Structured Organic Particulate Systems Snowdon: Sastry Isukapalli Funding: NSF-AIR

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018 Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this

More information

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014 UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014 Exam policy: This exam allows two one-page, two-sided cheat sheets (i.e. 4 sides); No other materials. Time: 2 hours. Be sure to write

More information

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS) Chapter : Advanced Remedial Measures Weighted Least Squares (WLS) When the error variance appears nonconstant, a transformation (of Y and/or X) is a quick remedy. But it may not solve the problem, or it

More information

Testing Statistical Models to Improve Screening of Lung Cancer

Testing Statistical Models to Improve Screening of Lung Cancer Testing Statistical Models to Improve Screening of Lung Cancer 1 Elliot Burghardt: University of Iowa Daren Kuwaye: University of Hawai i at Mānoa Iowa Summer Institute in Biostatistics - University of

More information

J2.6 Imputation of missing data with nonlinear relationships

J2.6 Imputation of missing data with nonlinear relationships Sixth Conference on Artificial Intelligence Applications to Environmental Science 88th AMS Annual Meeting, New Orleans, LA 20-24 January 2008 J2.6 Imputation of missing with nonlinear relationships Michael

More information

10CS664: PATTERN RECOGNITION QUESTION BANK

10CS664: PATTERN RECOGNITION QUESTION BANK 10CS664: PATTERN RECOGNITION QUESTION BANK Assignments would be handed out in class as well as posted on the class blog for the course. Please solve the problems in the exercises of the prescribed text

More information

Deep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations

Deep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations Deep Learning Analytics for Predicting Prognosis of Acute Myeloid Leukemia with Cytogenetics, Age, and Mutations Andy Nguyen, M.D., M.S. Medical Director, Hematopathology, Hematology and Coagulation Laboratory,

More information

Predicting Breast Cancer Survival Using Treatment and Patient Factors

Predicting Breast Cancer Survival Using Treatment and Patient Factors Predicting Breast Cancer Survival Using Treatment and Patient Factors William Chen wchen808@stanford.edu Henry Wang hwang9@stanford.edu 1. Introduction Breast cancer is the leading type of cancer in women

More information

Index. E Eftekbar, B., 152, 164 Eigenvectors, 6, 171 Elastic net regression, 6 discretization, 28 regularization, 42, 44, 46 Exponential modeling, 135

Index. E Eftekbar, B., 152, 164 Eigenvectors, 6, 171 Elastic net regression, 6 discretization, 28 regularization, 42, 44, 46 Exponential modeling, 135 A Abrahamowicz, M., 100 Akaike information criterion (AIC), 141 Analysis of covariance (ANCOVA), 2 4. See also Canonical regression Analysis of variance (ANOVA) model, 2 4, 255 canonical regression (see

More information

Machine Learning to Inform Breast Cancer Post-Recovery Surveillance

Machine Learning to Inform Breast Cancer Post-Recovery Surveillance Machine Learning to Inform Breast Cancer Post-Recovery Surveillance Final Project Report CS 229 Autumn 2017 Category: Life Sciences Maxwell Allman (mallman) Lin Fan (linfan) Jamie Kang (kangjh) 1 Introduction

More information

Gene Selection for Tumor Classification Using Microarray Gene Expression Data

Gene Selection for Tumor Classification Using Microarray Gene Expression Data Gene Selection for Tumor Classification Using Microarray Gene Expression Data K. Yendrapalli, R. Basnet, S. Mukkamala, A. H. Sung Department of Computer Science New Mexico Institute of Mining and Technology

More information

CSE 258 Lecture 2. Web Mining and Recommender Systems. Supervised learning Regression

CSE 258 Lecture 2. Web Mining and Recommender Systems. Supervised learning Regression CSE 258 Lecture 2 Web Mining and Recommender Systems Supervised learning Regression Supervised versus unsupervised learning Learning approaches attempt to model data in order to solve a problem Unsupervised

More information

EECS 433 Statistical Pattern Recognition

EECS 433 Statistical Pattern Recognition EECS 433 Statistical Pattern Recognition Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208 http://www.eecs.northwestern.edu/~yingwu 1 / 19 Outline What is Pattern

More information

What is Regularization? Example by Sean Owen

What is Regularization? Example by Sean Owen What is Regularization? Example by Sean Owen What is Regularization? Name3 Species Size Threat Bo snake small friendly Miley dog small friendly Fifi cat small enemy Muffy cat small friendly Rufus dog large

More information

An Improved Algorithm To Predict Recurrence Of Breast Cancer

An Improved Algorithm To Predict Recurrence Of Breast Cancer An Improved Algorithm To Predict Recurrence Of Breast Cancer Umang Agrawal 1, Ass. Prof. Ishan K Rajani 2 1 M.E Computer Engineer, Silver Oak College of Engineering & Technology, Gujarat, India. 2 Assistant

More information

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training. Supplementary Figure 1 Behavioral training. a, Mazes used for behavioral training. Asterisks indicate reward location. Only some example mazes are shown (for example, right choice and not left choice maze

More information

DIABETIC RISK PREDICTION FOR WOMEN USING BOOTSTRAP AGGREGATION ON BACK-PROPAGATION NEURAL NETWORKS

DIABETIC RISK PREDICTION FOR WOMEN USING BOOTSTRAP AGGREGATION ON BACK-PROPAGATION NEURAL NETWORKS International Journal of Computer Engineering & Technology (IJCET) Volume 9, Issue 4, July-Aug 2018, pp. 196-201, Article IJCET_09_04_021 Available online at http://www.iaeme.com/ijcet/issues.asp?jtype=ijcet&vtype=9&itype=4

More information

Identifying Thyroid Carcinoma Subtypes and Outcomes through Gene Expression Data Kun-Hsing Yu, Wei Wang, Chung-Yu Wang

Identifying Thyroid Carcinoma Subtypes and Outcomes through Gene Expression Data Kun-Hsing Yu, Wei Wang, Chung-Yu Wang Identifying Thyroid Carcinoma Subtypes and Outcomes through Gene Expression Data Kun-Hsing Yu, Wei Wang, Chung-Yu Wang Abstract: Unlike most cancers, thyroid cancer has an everincreasing incidence rate

More information

Research of Determination Method of Starch and Protein Content in Buckwheat by Mid-Infrared Spectroscopy

Research of Determination Method of Starch and Protein Content in Buckwheat by Mid-Infrared Spectroscopy Research of Determination Method of Starch and Protein Content in Buckwheat by Mid-Infrared Spectroscopy Fenghua Wang 1,*, Ju Yang 1, Hailong Zhu 2, and Zhiyong Xi 1 1 Faculty of Modern Agricultural Engineering,Kunming

More information

Radiotherapy Outcomes

Radiotherapy Outcomes in partnership with Outcomes Models with Machine Learning Sarah Gulliford PhD Division of Radiotherapy & Imaging sarahg@icr.ac.uk AAPM 31 st July 2017 Making the discoveries that defeat cancer Radiotherapy

More information

Artificial Neural Networks and Near Infrared Spectroscopy - A case study on protein content in whole wheat grain

Artificial Neural Networks and Near Infrared Spectroscopy - A case study on protein content in whole wheat grain A White Paper from FOSS Artificial Neural Networks and Near Infrared Spectroscopy - A case study on protein content in whole wheat grain By Lars Nørgaard*, Martin Lagerholm and Mark Westerhaus, FOSS *corresponding

More information

New Procedures for Identifying High Rates of Pesticide Use

New Procedures for Identifying High Rates of Pesticide Use New Procedures for Identifying High Rates of Pesticide Use Larry Wilhoit Department of Pesticide Regulation February 14, 2011 1 Detecting Outliers in Rates of Use 2 Current Outlier Criteria 3 New Outlier

More information

Identification of Tissue Independent Cancer Driver Genes

Identification of Tissue Independent Cancer Driver Genes Identification of Tissue Independent Cancer Driver Genes Alexandros Manolakos, Idoia Ochoa, Kartik Venkat Supervisor: Olivier Gevaert Abstract Identification of genomic patterns in tumors is an important

More information

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen

More information

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Week 9 Hour 3 Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Stat 302 Notes. Week 9, Hour 3, Page 1 / 39 Stepwise Now that we've introduced interactions,

More information

Genetic and Environmental Info in goat milk FTIR spectra

Genetic and Environmental Info in goat milk FTIR spectra Genetic and Environmental Info in goat milk FTIR spectra B. Dagnachew and T. Ådnøy Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences EAAP 2011 29 th August, Stavanger,

More information

4. Model evaluation & selection

4. Model evaluation & selection Foundations of Machine Learning CentraleSupélec Fall 2017 4. Model evaluation & selection Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr

More information

ECG Beat Recognition using Principal Components Analysis and Artificial Neural Network

ECG Beat Recognition using Principal Components Analysis and Artificial Neural Network International Journal of Electronics Engineering, 3 (1), 2011, pp. 55 58 ECG Beat Recognition using Principal Components Analysis and Artificial Neural Network Amitabh Sharma 1, and Tanushree Sharma 2

More information

Applying Machine Learning Methods in Medical Research Studies

Applying Machine Learning Methods in Medical Research Studies Applying Machine Learning Methods in Medical Research Studies Daniel Stahl Department of Biostatistics and Health Informatics Psychiatry, Psychology & Neuroscience (IoPPN), King s College London daniel.r.stahl@kcl.ac.uk

More information

Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data

Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data Dhouha Grissa, Mélanie Pétéra, Marion Brandolini, Amedeo Napoli, Blandine Comte and Estelle Pujos-Guillot

More information

Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California

Computer Age Statistical Inference. Algorithms, Evidence, and Data Science. BRADLEY EFRON Stanford University, California Computer Age Statistical Inference Algorithms, Evidence, and Data Science BRADLEY EFRON Stanford University, California TREVOR HASTIE Stanford University, California ggf CAMBRIDGE UNIVERSITY PRESS Preface

More information

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections New: Bias-variance decomposition, biasvariance tradeoff, overfitting, regularization, and feature selection Yi

More information

Supersparse Linear Integer Models for Interpretable Prediction. Berk Ustun Stefano Tracà Cynthia Rudin INFORMS 2013

Supersparse Linear Integer Models for Interpretable Prediction. Berk Ustun Stefano Tracà Cynthia Rudin INFORMS 2013 Supersparse Linear Integer Models for Interpretable Prediction Berk Ustun Stefano Tracà Cynthia Rudin INFORMS 2013 CHADS 2 Scoring System Condition Points Congestive heart failure 1 Hypertension 1 Age

More information

Panel: Machine Learning in Surgery and Cancer

Panel: Machine Learning in Surgery and Cancer Panel: Machine Learning in Surgery and Cancer Professor Dimitris Bertsimas, SM 87, PhD 88, Boeing Leaders for Global Operations Professor of Management; Professor of Operations Research; Co-Director, Operations

More information

Keywords Missing values, Medoids, Partitioning Around Medoids, Auto Associative Neural Network classifier, Pima Indian Diabetes dataset.

Keywords Missing values, Medoids, Partitioning Around Medoids, Auto Associative Neural Network classifier, Pima Indian Diabetes dataset. Volume 7, Issue 3, March 2017 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Medoid Based Approach

More information

Urinary metabolic profiling in inflammatory bowel disease. Dr Horace Williams Clinical Research Fellow Imperial College London

Urinary metabolic profiling in inflammatory bowel disease. Dr Horace Williams Clinical Research Fellow Imperial College London Urinary metabolic profiling in inflammatory bowel disease Dr Horace Williams Clinical Research Fellow Imperial College London Background: Metabolic profiling Metabolic profiling or metabonomics describes

More information

3. Model evaluation & selection

3. Model evaluation & selection Foundations of Machine Learning CentraleSupélec Fall 2016 3. Model evaluation & selection Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr

More information

Detection and Classification of Lung Cancer Using Artificial Neural Network

Detection and Classification of Lung Cancer Using Artificial Neural Network Detection and Classification of Lung Cancer Using Artificial Neural Network Almas Pathan 1, Bairu.K.saptalkar 2 1,2 Department of Electronics and Communication Engineering, SDMCET, Dharwad, India 1 almaseng@yahoo.co.in,

More information

Analysis of Classification Algorithms towards Breast Tissue Data Set

Analysis of Classification Algorithms towards Breast Tissue Data Set Analysis of Classification Algorithms towards Breast Tissue Data Set I. Ravi Assistant Professor, Department of Computer Science, K.R. College of Arts and Science, Kovilpatti, Tamilnadu, India Abstract

More information

Distribution Assertive Regression Kumarjit Pathak a*, Jitin Kapila b*, Aasheesh Barvey c, Nikit Gawande d

Distribution Assertive Regression Kumarjit Pathak a*, Jitin Kapila b*, Aasheesh Barvey c, Nikit Gawande d 1 Distribution Assertive Regression Kumarjit Pathak a*, Jitin Kapila b*, Aasheesh Barvey c, Nikit Gawande d a Data Scientist professional, Harman, Whitefield, Bangalore,mail:Kumarjit.pathak@outlook.com

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

Prediction of CP concentration and rumen degradability by Fourier Transform Infrared Spectroscopy (FTIR)

Prediction of CP concentration and rumen degradability by Fourier Transform Infrared Spectroscopy (FTIR) IBERS Prediction of CP concentration and rumen degradability by Fourier Transform Infrared Spectroscopy (FTIR) Belanche A., G.G. Allison, C.J. Newbold, M.R. Weisbjerg and J.M. Moorby EAAP meeting, 27 August,

More information

A Novel Iterative Linear Regression Perceptron Classifier for Breast Cancer Prediction

A Novel Iterative Linear Regression Perceptron Classifier for Breast Cancer Prediction A Novel Iterative Linear Regression Perceptron Classifier for Breast Cancer Prediction Samuel Giftson Durai Research Scholar, Dept. of CS Bishop Heber College Trichy-17, India S. Hari Ganesh, PhD Assistant

More information

Automated Estimation of mts Score in Hand Joint X-Ray Image Using Machine Learning

Automated Estimation of mts Score in Hand Joint X-Ray Image Using Machine Learning Automated Estimation of mts Score in Hand Joint X-Ray Image Using Machine Learning Shweta Khairnar, Sharvari Khairnar 1 Graduate student, Pace University, New York, United States 2 Student, Computer Engineering,

More information

Mayuri Takore 1, Prof.R.R. Shelke 2 1 ME First Yr. (CSE), 2 Assistant Professor Computer Science & Engg, Department

Mayuri Takore 1, Prof.R.R. Shelke 2 1 ME First Yr. (CSE), 2 Assistant Professor Computer Science & Engg, Department Data Mining Techniques to Find Out Heart Diseases: An Overview Mayuri Takore 1, Prof.R.R. Shelke 2 1 ME First Yr. (CSE), 2 Assistant Professor Computer Science & Engg, Department H.V.P.M s COET, Amravati

More information

SUPPLEMENTAL MATERIAL

SUPPLEMENTAL MATERIAL 1 SUPPLEMENTAL MATERIAL Response time and signal detection time distributions SM Fig. 1. Correct response time (thick solid green curve) and error response time densities (dashed red curve), averaged across

More information

BREAST CANCER EPIDEMIOLOGY MODEL:

BREAST CANCER EPIDEMIOLOGY MODEL: BREAST CANCER EPIDEMIOLOGY MODEL: Calibrating Simulations via Optimization Michael C. Ferris, Geng Deng, Dennis G. Fryback, Vipat Kuruchittham University of Wisconsin 1 University of Wisconsin Breast Cancer

More information

Tissue Classification Based on Gene Expression Data

Tissue Classification Based on Gene Expression Data Chapter 6 Tissue Classification Based on Gene Expression Data Many diseases result from complex interactions involving numerous genes. Previously, these gene interactions have been commonly studied separately.

More information

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016 UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016 Exam policy: This exam allows one one-page, two-sided cheat sheet; No other materials. Time: 80 minutes. Be sure to write your name and

More information

Predicting Kidney Cancer Survival from Genomic Data

Predicting Kidney Cancer Survival from Genomic Data Predicting Kidney Cancer Survival from Genomic Data Christopher Sauer, Rishi Bedi, Duc Nguyen, Benedikt Bünz Abstract Cancers are on par with heart disease as the leading cause for mortality in the United

More information

Gender Based Emotion Recognition using Speech Signals: A Review

Gender Based Emotion Recognition using Speech Signals: A Review 50 Gender Based Emotion Recognition using Speech Signals: A Review Parvinder Kaur 1, Mandeep Kaur 2 1 Department of Electronics and Communication Engineering, Punjabi University, Patiala, India 2 Department

More information

BACKPROPOGATION NEURAL NETWORK FOR PREDICTION OF HEART DISEASE

BACKPROPOGATION NEURAL NETWORK FOR PREDICTION OF HEART DISEASE BACKPROPOGATION NEURAL NETWORK FOR PREDICTION OF HEART DISEASE NABEEL AL-MILLI Financial and Business Administration and Computer Science Department Zarqa University College Al-Balqa' Applied University

More information

Brain Tumor segmentation and classification using Fcm and support vector machine

Brain Tumor segmentation and classification using Fcm and support vector machine Brain Tumor segmentation and classification using Fcm and support vector machine Gaurav Gupta 1, Vinay singh 2 1 PG student,m.tech Electronics and Communication,Department of Electronics, Galgotia College

More information

International Journal of Pharma and Bio Sciences A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS ABSTRACT

International Journal of Pharma and Bio Sciences A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS ABSTRACT Research Article Bioinformatics International Journal of Pharma and Bio Sciences ISSN 0975-6299 A NOVEL SUBSET SELECTION FOR CLASSIFICATION OF DIABETES DATASET BY ITERATIVE METHODS D.UDHAYAKUMARAPANDIAN

More information

Rating prediction on Amazon Fine Foods Reviews

Rating prediction on Amazon Fine Foods Reviews Rating prediction on Amazon Fine Foods Reviews Chen Zheng University of California,San Diego chz022@ucsd.edu Ye Zhang University of California,San Diego yez033@ucsd.edu Yikun Huang University of California,San

More information

Aspects of Statistical Modelling & Data Analysis in Gene Expression Genomics. Mike West Duke University

Aspects of Statistical Modelling & Data Analysis in Gene Expression Genomics. Mike West Duke University Aspects of Statistical Modelling & Data Analysis in Gene Expression Genomics Mike West Duke University Papers, software, many links: www.isds.duke.edu/~mw ABS04 web site: Lecture slides, stats notes, papers,

More information

PMR5406 Redes Neurais e Lógica Fuzzy. Aula 5 Alguns Exemplos

PMR5406 Redes Neurais e Lógica Fuzzy. Aula 5 Alguns Exemplos PMR5406 Redes Neurais e Lógica Fuzzy Aula 5 Alguns Exemplos APPLICATIONS Two examples of real life applications of neural networks for pattern classification: RBF networks for face recognition FF networks

More information

RISK PREDICTION MODEL: PENALIZED REGRESSIONS

RISK PREDICTION MODEL: PENALIZED REGRESSIONS RISK PREDICTION MODEL: PENALIZED REGRESSIONS Inspired from: How to develop a more accurate risk prediction model when there are few events Menelaos Pavlou, Gareth Ambler, Shaun R Seaman, Oliver Guttmann,

More information

Investigating Links Between the Immune System and the Brain from Medical Claims and Laboratory Tests

Investigating Links Between the Immune System and the Brain from Medical Claims and Laboratory Tests Investigating Links Between the Immune System and the Brain from Medical Claims and Laboratory Tests Guhan Venkataraman Department of Biomedical Informatics guhan@stanford.edu Tymor Hamamsy Department

More information

Classıfıcatıon of Dıabetes Dısease Usıng Backpropagatıon and Radıal Basıs Functıon Network

Classıfıcatıon of Dıabetes Dısease Usıng Backpropagatıon and Radıal Basıs Functıon Network UTM Computing Proceedings Innovations in Computing Technology and Applications Volume 2 Year: 2017 ISBN: 978-967-0194-95-0 1 Classıfıcatıon of Dıabetes Dısease Usıng Backpropagatıon and Radıal Basıs Functıon

More information

Variable selection should be blinded to the outcome

Variable selection should be blinded to the outcome Variable selection should be blinded to the outcome Tamás Ferenci Manuscript type: Letter to the Editor Title: Variable selection should be blinded to the outcome Author List: Tamás Ferenci * (Physiological

More information

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil

CSE Introduction to High-Perfomance Deep Learning ImageNet & VGG. Jihyung Kil CSE 5194.01 - Introduction to High-Perfomance Deep Learning ImageNet & VGG Jihyung Kil ImageNet Classification with Deep Convolutional Neural Networks Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton,

More information

Modeling Sentiment with Ridge Regression

Modeling Sentiment with Ridge Regression Modeling Sentiment with Ridge Regression Luke Segars 2/20/2012 The goal of this project was to generate a linear sentiment model for classifying Amazon book reviews according to their star rank. More generally,

More information

Research of Determination Method of Starch and Protein Content in Buckwheat by Mid-infrared Spectroscopy

Research of Determination Method of Starch and Protein Content in Buckwheat by Mid-infrared Spectroscopy Research of Determination Method of Starch and Protein Content in Buckwheat by Mid-infrared Spectroscopy Fenghua Wang 1,1, Ju Yang 1, Hailong Zhu 2, Zhiyong Xi 1 1 Faculty of Modern Agricultural Engineering,Kunming

More information

Deep Networks and Beyond. Alan Yuille Bloomberg Distinguished Professor Depts. Cognitive Science and Computer Science Johns Hopkins University

Deep Networks and Beyond. Alan Yuille Bloomberg Distinguished Professor Depts. Cognitive Science and Computer Science Johns Hopkins University Deep Networks and Beyond Alan Yuille Bloomberg Distinguished Professor Depts. Cognitive Science and Computer Science Johns Hopkins University Artificial Intelligence versus Human Intelligence Understanding

More information

Colon cancer subtypes from gene expression data

Colon cancer subtypes from gene expression data Colon cancer subtypes from gene expression data Nathan Cunningham Giuseppe Di Benedetto Sherman Ip Leon Law Module 6: Applied Statistics 26th February 2016 Aim Replicate findings of Felipe De Sousa et

More information

SUPPLEMENTARY APPENDIX

SUPPLEMENTARY APPENDIX SUPPLEMENTARY APPENDIX 1) Supplemental Figure 1. Histopathologic Characteristics of the Tumors in the Discovery Cohort 2) Supplemental Figure 2. Incorporation of Normal Epidermal Melanocytic Signature

More information

BLOOD GLUCOSE PREDICTION MODELS FOR PERSONALIZED DIABETES MANAGEMENT

BLOOD GLUCOSE PREDICTION MODELS FOR PERSONALIZED DIABETES MANAGEMENT BLOOD GLUCOSE PREDICTION MODELS FOR PERSONALIZED DIABETES MANAGEMENT A Thesis Submitted to the Graduate Faculty of the North Dakota State University of Agriculture and Applied Science By Warnakulasuriya

More information

Brain Tumour Detection of MR Image Using Naïve Beyer classifier and Support Vector Machine

Brain Tumour Detection of MR Image Using Naïve Beyer classifier and Support Vector Machine International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Brain Tumour Detection of MR Image Using Naïve

More information

EVALUATION OF EFFERVESCENT FLOATING TABLETS. 6.7 Mathematical model fitting of obtained drug release data

EVALUATION OF EFFERVESCENT FLOATING TABLETS. 6.7 Mathematical model fitting of obtained drug release data EVALUATION OF EFFERVESCENT FLOATING TABLETS 6.1 Technological characteristics of floating tablets 6.2 Fourier transform infrared spectroscopy (FT-IR) 6.3 Differential scanning calorimetry (DSC) 6.4 In

More information

Leveraging Pharmacy Medical Records To Predict Diabetes Using A Random Forest & Artificial Neural Network

Leveraging Pharmacy Medical Records To Predict Diabetes Using A Random Forest & Artificial Neural Network Leveraging Pharmacy Medical Records To Predict Diabetes Using A Random Forest & Artificial Neural Network Stephen Lavery 1 and Jeremy Debattista 2 1 National College of Ireland, Dublin, Ireland, laverys@tcd.ie

More information

PREDICTION OF BREAST CANCER USING STACKING ENSEMBLE APPROACH

PREDICTION OF BREAST CANCER USING STACKING ENSEMBLE APPROACH PREDICTION OF BREAST CANCER USING STACKING ENSEMBLE APPROACH 1 VALLURI RISHIKA, M.TECH COMPUTER SCENCE AND SYSTEMS ENGINEERING, ANDHRA UNIVERSITY 2 A. MARY SOWJANYA, Assistant Professor COMPUTER SCENCE

More information

Big Image-Omics Data Analytics for Clinical Outcome Prediction

Big Image-Omics Data Analytics for Clinical Outcome Prediction Big Image-Omics Data Analytics for Clinical Outcome Prediction Junzhou Huang, Ph.D. Associate Professor Dept. Computer Science & Engineering University of Texas at Arlington Dept. CSE, UT Arlington Scalable

More information

MRI Image Processing Operations for Brain Tumor Detection

MRI Image Processing Operations for Brain Tumor Detection MRI Image Processing Operations for Brain Tumor Detection Prof. M.M. Bulhe 1, Shubhashini Pathak 2, Karan Parekh 3, Abhishek Jha 4 1Assistant Professor, Dept. of Electronics and Telecommunications Engineering,

More information

Classification of EEG signals in an Object Recognition task

Classification of EEG signals in an Object Recognition task Classification of EEG signals in an Object Recognition task Iacob D. Rus, Paul Marc, Mihaela Dinsoreanu, Rodica Potolea Technical University of Cluj-Napoca Cluj-Napoca, Romania 1 rus_iacob23@yahoo.com,

More information

A REVIEW ON CLASSIFICATION OF BREAST CANCER DETECTION USING COMBINATION OF THE FEATURE EXTRACTION MODELS. Aeronautical Engineering. Hyderabad. India.

A REVIEW ON CLASSIFICATION OF BREAST CANCER DETECTION USING COMBINATION OF THE FEATURE EXTRACTION MODELS. Aeronautical Engineering. Hyderabad. India. Volume 116 No. 21 2017, 203-208 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu A REVIEW ON CLASSIFICATION OF BREAST CANCER DETECTION USING COMBINATION OF

More information

Analytical Developments for Identification and Authentication of Botanicals

Analytical Developments for Identification and Authentication of Botanicals Analytical Developments for Identification and Authentication of Botanicals James Harnly Food Composition and Methods Development Lab Beltsville Human Nutrition Research Center Agricultural Research Service

More information

Neuroinformatics. Ilmari Kurki, Urs Köster, Jukka Perkiö, (Shohei Shimizu) Interdisciplinary and interdepartmental

Neuroinformatics. Ilmari Kurki, Urs Köster, Jukka Perkiö, (Shohei Shimizu) Interdisciplinary and interdepartmental Neuroinformatics Aapo Hyvärinen, still Academy Research Fellow for a while Post-docs: Patrik Hoyer and Jarmo Hurri + possibly international post-docs PhD students Ilmari Kurki, Urs Köster, Jukka Perkiö,

More information

Analysis of TB prevalence surveys

Analysis of TB prevalence surveys Workshop and training course on TB prevalence surveys with a focus on field operations Analysis of TB prevalence surveys Day 8 Thursday, 4 August 2011 Phnom Penh Babis Sismanidis with acknowledgements

More information

Detection and Recognition of Sign Language Protocol using Motion Sensing Device

Detection and Recognition of Sign Language Protocol using Motion Sensing Device Detection and Recognition of Sign Language Protocol using Motion Sensing Device Rita Tse ritatse@ipm.edu.mo AoXuan Li P130851@ipm.edu.mo Zachary Chui MPI-QMUL Information Systems Research Centre zacharychui@gmail.com

More information

Artificial Neural Networks (Ref: Negnevitsky, M. Artificial Intelligence, Chapter 6)

Artificial Neural Networks (Ref: Negnevitsky, M. Artificial Intelligence, Chapter 6) Artificial Neural Networks (Ref: Negnevitsky, M. Artificial Intelligence, Chapter 6) BPNN in Practice Week 3 Lecture Notes page 1 of 1 The Hopfield Network In this network, it was designed on analogy of

More information

Yeast Cells Classification Machine Learning Approach to Discriminate Saccharomyces cerevisiae Yeast Cells Using Sophisticated Image Features.

Yeast Cells Classification Machine Learning Approach to Discriminate Saccharomyces cerevisiae Yeast Cells Using Sophisticated Image Features. Yeast Cells Classification Machine Learning Approach to Discriminate Saccharomyces cerevisiae Yeast Cells Using Sophisticated Image Features. Mohamed Tleis Supervisor: Fons J. Verbeek Leiden University

More information

Data complexity measures for analyzing the effect of SMOTE over microarrays

Data complexity measures for analyzing the effect of SMOTE over microarrays ESANN 216 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. Bruges (Belgium), 27-29 April 216, i6doc.com publ., ISBN 978-2878727-8. Data complexity

More information

Research of the Measurement on Palmitic Acid in Edible Oils by Near-Infrared Spectroscopy

Research of the Measurement on Palmitic Acid in Edible Oils by Near-Infrared Spectroscopy Research of the Measurement on Palmitic Acid in Edible Oils by Near-Infrared Spectroscopy Hui Li 1, Jingzhu Wu 1*, Cuiling Liu 1, 1 College of Computer & Information Engineering, Beijing Technology and

More information

OUTLIER DETECTION : A REVIEW

OUTLIER DETECTION : A REVIEW International Journal of Advances Outlier in Embedeed Detection System : A Review Research January-June 2011, Volume 1, Number 1, pp. 55 71 OUTLIER DETECTION : A REVIEW K. Subramanian 1, and E. Ramraj

More information

For more information, please contact: or +1 (302)

For more information, please contact: or +1 (302) Introduction Quantitative Prediction of Tobacco Components using Near-Infrared Diffuse Reflectance Spectroscopy Kristen Frano Katherine Bakeev B&W Tek, Newark, DE Chemical analysis is an extremely important

More information

AppNote 10/2002. Detection of Spoilage Markers in Food Products using a Mass-Spectrometry Based Chemical Sensor KEYWORDS ABSTRACT

AppNote 10/2002. Detection of Spoilage Markers in Food Products using a Mass-Spectrometry Based Chemical Sensor KEYWORDS ABSTRACT AppNote 10/2002 Detection of Spoilage Markers in Food Products using a Mass-Spectrometry Based Chemical Sensor Vanessa R. Kinton Gerstel, Inc., 701 Digital Drive, Suite J, Linthicum, MD 21090, USA Kevin

More information

Data mining for Obstructive Sleep Apnea Detection. 18 October 2017 Konstantinos Nikolaidis

Data mining for Obstructive Sleep Apnea Detection. 18 October 2017 Konstantinos Nikolaidis Data mining for Obstructive Sleep Apnea Detection 18 October 2017 Konstantinos Nikolaidis Introduction: What is Obstructive Sleep Apnea? Obstructive Sleep Apnea (OSA) is a relatively common sleep disorder

More information

Large-Scale Statistical Modelling via Machine Learning Classifiers

Large-Scale Statistical Modelling via Machine Learning Classifiers J. Stat. Appl. Pro. 2, No. 3, 203-222 (2013) 203 Journal of Statistics Applications & Probability An International Journal http://dx.doi.org/10.12785/jsap/020303 Large-Scale Statistical Modelling via Machine

More information

International Journal of Pure and Applied Mathematics

International Journal of Pure and Applied Mathematics Volume 119 No. 12 2018, 12505-12513 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Analysis of Cancer Classification of Gene Expression Data A Scientometric Review 1 Joseph M. De Guia,

More information

Article from. Forecasting and Futurism. Month Year July 2015 Issue Number 11

Article from. Forecasting and Futurism. Month Year July 2015 Issue Number 11 Article from Forecasting and Futurism Month Year July 2015 Issue Number 11 Calibrating Risk Score Model with Partial Credibility By Shea Parkes and Brad Armstrong Risk adjustment models are commonly used

More information

Error Detection based on neural signals

Error Detection based on neural signals Error Detection based on neural signals Nir Even- Chen and Igor Berman, Electrical Engineering, Stanford Introduction Brain computer interface (BCI) is a direct communication pathway between the brain

More information

Development of Soft-Computing techniques capable of diagnosing Alzheimer s Disease in its pre-clinical stage combining MRI and FDG-PET images.

Development of Soft-Computing techniques capable of diagnosing Alzheimer s Disease in its pre-clinical stage combining MRI and FDG-PET images. Development of Soft-Computing techniques capable of diagnosing Alzheimer s Disease in its pre-clinical stage combining MRI and FDG-PET images. Olga Valenzuela, Francisco Ortuño, Belen San-Roman, Victor

More information

Generalized additive model for disease risk prediction

Generalized additive model for disease risk prediction Generalized additive model for disease risk prediction Guodong Chen Chu Kochen Honors College, Zhejiang University Channing Division of Network Medicine, BWH & HMS Advised by: Prof. Yang-Yu Liu 1 Is it

More information

L. Ziaei MS*, A. R. Mehri PhD**, M. Salehi PhD***

L. Ziaei MS*, A. R. Mehri PhD**, M. Salehi PhD*** Received: 1/16/2004 Accepted: 8/1/2005 Original Article Application of Artificial Neural Networks in Cancer Classification and Diagnosis Prediction of a Subtype of Lymphoma Based on Gene Expression Profile

More information

arxiv: v1 [q-bio.nc] 12 Jun 2014

arxiv: v1 [q-bio.nc] 12 Jun 2014 1 arxiv:1406.3284v1 [q-bio.nc] 12 Jun 2014 Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition Charles F. Cadieu 1,, Ha Hong 1,2, Daniel L. K. Yamins 1,

More information

Analysis of Hoge Religious Motivation Scale by Means of Combined HAC and PCA Methods

Analysis of Hoge Religious Motivation Scale by Means of Combined HAC and PCA Methods Analysis of Hoge Religious Motivation Scale by Means of Combined HAC and PCA Methods Ana Štambuk Department of Social Work, Faculty of Law, University of Zagreb, Nazorova 5, HR- Zagreb, Croatia E-mail:

More information

Figure 1: MRI Scanning [2]

Figure 1: MRI Scanning [2] A Deep Belief Network Based Brain Tumor Detection in MRI Images Thahseen P 1, Anish Kumar B 2 1 MEA Engineering College, State Highway 39, Nellikunnu-Vengoor, Perinthalmanna, Malappuram, Kerala 2 Assistant

More information

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 1.852

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 1.852 IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY Performance Analysis of Brain MRI Using Multiple Method Shroti Paliwal *, Prof. Sanjay Chouhan * Department of Electronics & Communication

More information