Biomedical Machine Learning

Size: px
Start display at page:

Download "Biomedical Machine Learning"

Transcription

1 Marco F. Ramoni, PhD Children s Hospital Informatics Program Harvard Medical School Dental Informatics and Dental Research Making The Connection NIH, Bethesda, MD June 12-13, 2003

2 Machine Learning Artificial Intelligence: Mechanical simulation/emulation of functions considered proper of human beings: Reasoning: Derive conclusions and represent knowledge; Vision: Processing and recognition of images; Natural Language: Ability to parse natural language; Machine Learning: Automated learning from data. Machine Learning: Became popular with Bioinformatics; The less I know, the more I learn: supervised vs unsupervised; Know the structure of your domain and want to learn; You know the structure and you don t know what to learn; You don t know the structure and you know what to learn; You don t know the structure and you don t know what to learn.

3 Rev. Thomas Bayes Name: Thomas Bayes. Birth: 1702, London, UK. Death: 1761, Tunbridge, Kent. Buried: Bunhill Fields, London. Job title: Presbyterian minister. Publications: 3 (1 posthumous). Claim to fame: Essay Towards Solving a Problem in the Doctrine of Chances, Phil Trans Royal Soc, 1763, by Richard Price. Trivia: Unrelated to Bayesians.

4 Bayesian Phenotype Bayesian: The opposite of Frequentist. Frequentist: Every (at least most) statistician you know. Population: varieties of Bayesians (Good, 1971). Profile: Hates the cookbook style of classical statistics. Tools: Total Probability Theorem and Bayes Theorem. Main Beliefs: Inference is about decisions. Inference is conditional on data (small samples); Inference is updating prior knowledge (even when it is knowledge of not knowing anything); Everything (also an hypothesis) has a probability.

5 Bayesian Networks Qualitative: A dependency graph made by: Node: a variable X, with a set of states {x 1,,x n }. Arc: a dependency of a variable X on its parents Π. Quantitative: The distributions of a variable X combination of states π i of its parents Π. given each A p(a) p(a) Y 0.3 Y 0.3 O 0.7 O 0.7 E p(e) p(e) L 0.8 L 0.8 H 0.2 H 0.2 A E A=Age Age; ; E=Education Education; ; I=Income Income I A E I I p(i A,E) p(i A,E) Y L L 0.9 Y L L 0.9 Y L H 0.1 Y L H 0.1 Y H L 0.5 Y H L 0.5 Y H H 0.5 Y H H 0.5 O L L 0.7 O L L 0.7 O L H 0.3 O L H 0.3 O H L 0.2 O H L 0.2 O H H 0.8 O H H 0.8

6 Learning Bayesian Networks Bayesian networks were developed to encode experts knowledge. Late 80s, people realize that the statistical nature of Bayesian networks allows us to learn them from data. In principle, the process of learning a Bayesian network structure involves: Search strategy to explore the possible structures; Scoring metric to select a structure. Courtesy Enrico Motta

7 Supervised Classification Components: Dataset of features and a training signal. Features: Gene expression levels in different classes. Training signal: The class label. Task: Create model associating feature profiles to class. Drawback: Intractable (combinatorial explosion). Other methods: Neural networks, support vector machines, etc. C G1 G2 G2 G4 G5 G6

8 Naïve Bayes Classifiers Assumption: A simple classifier assumes that the features (genes, symptoms) are conditionally independent given the class. Classifiers: These models are called Naïve Bayes Classifiers. Method: Estimate, for each feature (gene/symptom), the conditional probability distribution of the feature given each class. Toward Unsupervision: Select best predictors (feature selection). C G1 G2 G3 G4 G5 G5

9 Supervised vs Unsupervised Elements: Features (symptoms genes) and a training signal (disease, class). Question: Which function best maps features to class? Goal: Find a good predictive system of class (a system that takes patient symptoms and return a diagnosis). Task: Estimation (except for feature selection, the task of finding the best predictors). Elements: Features (symptoms genes) but no training signal. Question: Which features behave in a related (similar) way across conditions? Goal: Understand interaction (e.g. how genes/symptoms behave similarly under some experimental conditions). Task: Model selection.

10 Feature Selection Task: Identify those genes that best predict the class: Advantage I: Typically increases predictive accuracy. Advantage II: More compact representation. Advantage III: Provide insights into the process. Type of Task: Model selection. Rationale: Since we cannot try all combinations, most different features should be the best at discriminating. C G1 G2 G3 G4 G5 G5 G6

11 Comparative Experiments Healthy cell Tumor cell Sample 1 Sample 2 Sample 3 Sample 4 Samples k=1,,n i genes g=1,,g Gene.DescS1 S2 S3 S4 S5 AFFX-BioC hum_alu_a AFFX-DapX AFFX-LysX AFFX-HUM AFFX-HUM AFFX-HUM AFFX-HUM AFFX-HUM AFFX-HUM AFFX-HUM AFFX-HUM AFFX-HSA y gik

12 Bayesian Differential Analysis Bayesian solution: Compute the probability that the ratio between the expression of a gene in condition A and B is higher than 1. Reproducibility: 8 groups of 4 data sets each, of different sample sizes, selected from 102 samples of normal vs prostate cancer. with P. Sebastiani, BU

13 Slippery Class Survival Class in Ovarian Cancer: Task: Identify patient subclass with favorable prognosis; BADGE: Bayesian Analysis of Differential Gene Expression. Overall Recurrence Survival (DSF) p < with Cannistra and Spentzos, BIDMC

14 Clustering Condition: The class is so slippery that we don t have a shred of information about it. Goal: Create a classification. Method: Group samples in clusters according to their similarity (eg correlation). Example: Find a classification for 20,000 rock samples from Mars. Example II: Oral cancer (NIDCR): Identify phenotypes using their genomic profile. C? G1 G2 G3 G4 G5 G5

15 Bayesian Clustering Problem: How do we decide that N patients/genes are sufficiently similar become a cluster on their own? Similarity: Profiles are similar when they are generated by the same stochastic process. Example: EKGs are similar but not identical series generated by the a set of physiological process. Clusters: Cluster profiles on the basis of their similarity is to group profiles generated by the same process. Bayesian solution: The most probable set of generating processes responsible for the observed profiles. Strategy: Compute posterior probability p(m D) of each clustering model given the data and take the highest.

16 Modeling Dynamics Time: Control happens along time. Example: Development of normal/cleft palate. Correlation: Assumes data to be independent: same score on permuted data. The past dictates the future. Information: Is information enough to discriminate? Method: Cluster Analysis of Gene Expression Dynamics (CAGED) Correlation=0.57 Correlation=0.72

17 The Clustering Model We need to merge these separated models in a single global model of the processes. We add a variable indexing the series and we build a single conditional probability distribution. C X y y = Xβ + ε

18 CAGED We use a dataset of 512 genes collected to study the response to fibroblast to serum (Iyer et al. 1999). Back in 1999, 238 out of 517 genes were unknown. We relabeled the genes according to the current state of the art and less than 20 are left unknown. There are 19 repeated genes in the dataset: Original model puts 4 of these in different clusters; We put 1 of these in two different clusters. Ramoni Sebastiani Kohane, PNAS (2002)

19 Decoding Control Clustering: Group samples but tells us nothing about the control mechanisms: it provides boxes, not chains of command. Class: It s so slippery that it does not even exist. Dependencies: Bayesian networks are built to represent control: a set of links means that a variable depends on a set of variables. Learning: Automated learning a Bayesian network structure: Search strategy to explore the possible structures; Scoring metric to select a structure. Challenges: Limitations of current methods: Distributions: Only discrete and normal distributions; Dependencies: Only linear (for normal distribution). Solution: Generalized Gamma Networks (GGNs). with P. Sebastiani, BU

20 Structure Learning SNPs: Single nucleotide polymorphisms - change of a single base in the DNA - work as markers of genomic regions. Diabetes network: 13 genes and one phenotype (IDDM) in triads.... ATGCGATACTCGACTCGA ATGCGATACGCGACTCGA... Data by J Hirshorn, WI and CH

21 Finding The Right SNPs Background: Oral mucositis (OM) is a a dose-limiting, toxicity experienced by cancer patients receiving radiotherapy or chemotherapy. Objective: Identify a set of sentinel polymorphisms able to predict individual risk of OM in hematopoietic stem cell transplant (HSCT) recipients as result of antineoplastic therapy. Subjects: Allo (70) and auto HSCT (50) recipients and donors (70). Candidate Genes: Identify a set of genes and regulatory regions. Candidate SNPs: Identify coding SNPs in candidate genes and SNPs in regulatory regions of interest. Challenge: It is likely to be a complex trait. with S Sonis, BWH and HSDM

22 Differential Analysis Example: Take the genes differentially expressed in the prostate cancer database and learn the structure of each condition (normal vs tumor) : collagene is independent of all other nodes, and becomes a child of 914 (oncogene with transcription regulation functions). Normal Samples Tumor Samples

23 Phenomic Landscapes with D Shen, Harvard/MIT HST

24 BADGE Bayesian Analysis of Differential Gene Expression Take Home Toys CAGED Cluster Analysis of Gene Expression Dynamics Bayesware Discoverer A program to learn Bayesian networks from data Contact: Marco Ramoni, Children s Hospital Informatics program marco_ramoni@harvard.edu

UNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER

UNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER UNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER 2016 DELIVERING VALUE WITH DATA SCIENCE BAYES APPROACH - MAKING DATA WORK HARDER The Ipsos MORI Data Science team increasingly

More information

Supplementary notes for lecture 8: Computational modeling of cognitive development

Supplementary notes for lecture 8: Computational modeling of cognitive development Supplementary notes for lecture 8: Computational modeling of cognitive development Slide 1 Why computational modeling is important for studying cognitive development. Let s think about how to study the

More information

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018 Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017 RESEARCH ARTICLE Classification of Cancer Dataset in Data Mining Algorithms Using R Tool P.Dhivyapriya [1], Dr.S.Sivakumar [2] Research Scholar [1], Assistant professor [2] Department of Computer Science

More information

The Development and Application of Bayesian Networks Used in Data Mining Under Big Data

The Development and Application of Bayesian Networks Used in Data Mining Under Big Data 2017 International Conference on Arts and Design, Education and Social Sciences (ADESS 2017) ISBN: 978-1-60595-511-7 The Development and Application of Bayesian Networks Used in Data Mining Under Big Data

More information

Lecture 9 Internal Validity

Lecture 9 Internal Validity Lecture 9 Internal Validity Objectives Internal Validity Threats to Internal Validity Causality Bayesian Networks Internal validity The extent to which the hypothesized relationship between 2 or more variables

More information

SNPrints: Defining SNP signatures for prediction of onset in complex diseases

SNPrints: Defining SNP signatures for prediction of onset in complex diseases SNPrints: Defining SNP signatures for prediction of onset in complex diseases Linda Liu, Biomedical Informatics, Stanford University Daniel Newburger, Biomedical Informatics, Stanford University Grace

More information

Introduction to Computational Neuroscience

Introduction to Computational Neuroscience Introduction to Computational Neuroscience Lecture 5: Data analysis II Lesson Title 1 Introduction 2 Structure and Function of the NS 3 Windows to the Brain 4 Data analysis 5 Data analysis II 6 Single

More information

A Comparison of Collaborative Filtering Methods for Medication Reconciliation

A Comparison of Collaborative Filtering Methods for Medication Reconciliation A Comparison of Collaborative Filtering Methods for Medication Reconciliation Huanian Zheng, Rema Padman, Daniel B. Neill The H. John Heinz III College, Carnegie Mellon University, Pittsburgh, PA, 15213,

More information

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China

A Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China A Vision-based Affective Computing System Jieyu Zhao Ningbo University, China Outline Affective Computing A Dynamic 3D Morphable Model Facial Expression Recognition Probabilistic Graphical Models Some

More information

Contents. Just Classifier? Rules. Rules: example. Classification Rule Generation for Bioinformatics. Rule Extraction from a trained network

Contents. Just Classifier? Rules. Rules: example. Classification Rule Generation for Bioinformatics. Rule Extraction from a trained network Contents Classification Rule Generation for Bioinformatics Hyeoncheol Kim Rule Extraction from Neural Networks Algorithm Ex] Promoter Domain Hybrid Model of Knowledge and Learning Knowledge refinement

More information

Vision: Over Ov view Alan Yuille

Vision: Over Ov view Alan Yuille Vision: Overview Alan Yuille Why is Vision Hard? Complexity and Ambiguity of Images. Range of Vision Tasks. More 10x10 images 256^100 = 6.7 x 10 ^240 than the total number of images seen by all humans

More information

A HMM-based Pre-training Approach for Sequential Data

A HMM-based Pre-training Approach for Sequential Data A HMM-based Pre-training Approach for Sequential Data Luca Pasa 1, Alberto Testolin 2, Alessandro Sperduti 1 1- Department of Mathematics 2- Department of Developmental Psychology and Socialisation University

More information

EECS 433 Statistical Pattern Recognition

EECS 433 Statistical Pattern Recognition EECS 433 Statistical Pattern Recognition Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208 http://www.eecs.northwestern.edu/~yingwu 1 / 19 Outline What is Pattern

More information

Statistical Analysis Using Machine Learning Approach for Multiple Imputation of Missing Data

Statistical Analysis Using Machine Learning Approach for Multiple Imputation of Missing Data Statistical Analysis Using Machine Learning Approach for Multiple Imputation of Missing Data S. Kanchana 1 1 Assistant Professor, Faculty of Science and Humanities SRM Institute of Science & Technology,

More information

Application of Bayesian Network Model for Enterprise Risk Management of Expressway Management Corporation

Application of Bayesian Network Model for Enterprise Risk Management of Expressway Management Corporation 2011 International Conference on Innovation, Management and Service IPEDR vol.14(2011) (2011) IACSIT Press, Singapore Application of Bayesian Network Model for Enterprise Risk Management of Expressway

More information

Predicting Breast Cancer Survivability Rates

Predicting Breast Cancer Survivability Rates Predicting Breast Cancer Survivability Rates For data collected from Saudi Arabia Registries Ghofran Othoum 1 and Wadee Al-Halabi 2 1 Computer Science, Effat University, Jeddah, Saudi Arabia 2 Computer

More information

10CS664: PATTERN RECOGNITION QUESTION BANK

10CS664: PATTERN RECOGNITION QUESTION BANK 10CS664: PATTERN RECOGNITION QUESTION BANK Assignments would be handed out in class as well as posted on the class blog for the course. Please solve the problems in the exercises of the prescribed text

More information

Using Bayesian Networks to Analyze Expression Data. Xu Siwei, s Muhammad Ali Faisal, s Tejal Joshi, s

Using Bayesian Networks to Analyze Expression Data. Xu Siwei, s Muhammad Ali Faisal, s Tejal Joshi, s Using Bayesian Networks to Analyze Expression Data Xu Siwei, s0789023 Muhammad Ali Faisal, s0677834 Tejal Joshi, s0677858 Outline Introduction Bayesian Networks Equivalence Classes Applying to Expression

More information

Comparison of machine learning models for the prediction of live birth following IVF treatment: an analysis of 463,669 cycles from a national

Comparison of machine learning models for the prediction of live birth following IVF treatment: an analysis of 463,669 cycles from a national Comparison of machine learning models for the prediction of live birth following IVF treatment: an analysis of 463,669 cycles from a national database Session title: Session 53: Machine learning and artificial

More information

MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1. Lecture 27: Systems Biology and Bayesian Networks

MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1. Lecture 27: Systems Biology and Bayesian Networks MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1 Lecture 27: Systems Biology and Bayesian Networks Systems Biology and Regulatory Networks o Definitions o Network motifs o Examples

More information

Quantitative Biology Lecture 1 (Introduction + Probability)

Quantitative Biology Lecture 1 (Introduction + Probability) 21 st Sep 2015 Quantitative Biology Lecture 1 (Introduction + Probability) Gurinder Singh Mickey Atwal Center for Quantitative Biology Why Quantitative Biology? (1. Models) Galileo Galilei The Book of

More information

How to use the Lafayette ESS Report to obtain a probability of deception or truth-telling

How to use the Lafayette ESS Report to obtain a probability of deception or truth-telling Lafayette Tech Talk: How to Use the Lafayette ESS Report to Obtain a Bayesian Conditional Probability of Deception or Truth-telling Raymond Nelson The Lafayette ESS Report is a useful tool for field polygraph

More information

ERA: Architectures for Inference

ERA: Architectures for Inference ERA: Architectures for Inference Dan Hammerstrom Electrical And Computer Engineering 7/28/09 1 Intelligent Computing In spite of the transistor bounty of Moore s law, there is a large class of problems

More information

Identifying the Zygosity Status of Twins Using Bayes Network and Estimation- Maximization Methodology

Identifying the Zygosity Status of Twins Using Bayes Network and Estimation- Maximization Methodology Identifying the Zygosity Status of Twins Using Bayes Network and Estimation- Maximization Methodology Yicun Ni (ID#: 9064804041), Jin Ruan (ID#: 9070059457), Ying Zhang (ID#: 9070063723) Abstract As the

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets

Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets Chih-Lin Chi a, W. Nick Street b, William H. Wolberg c a Health Informatics Program, University of Iowa b

More information

CS 4365: Artificial Intelligence Recap. Vibhav Gogate

CS 4365: Artificial Intelligence Recap. Vibhav Gogate CS 4365: Artificial Intelligence Recap Vibhav Gogate Exam Topics Search BFS, DFS, UCS, A* (tree and graph) Completeness and Optimality Heuristics: admissibility and consistency CSPs Constraint graphs,

More information

Introduction. Patrick Breheny. January 10. The meaning of probability The Bayesian approach Preview of MCMC methods

Introduction. Patrick Breheny. January 10. The meaning of probability The Bayesian approach Preview of MCMC methods Introduction Patrick Breheny January 10 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/25 Introductory example: Jane s twins Suppose you have a friend named Jane who is pregnant with twins

More information

Bayesian Networks Representation. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University

Bayesian Networks Representation. Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University Bayesian Networks Representation Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University March 16 th, 2005 Handwriting recognition Character recognition, e.g., kernel SVMs r r r a c r rr

More information

An assistive application identifying emotional state and executing a methodical healing process for depressive individuals.

An assistive application identifying emotional state and executing a methodical healing process for depressive individuals. An assistive application identifying emotional state and executing a methodical healing process for depressive individuals. Bandara G.M.M.B.O bhanukab@gmail.com Godawita B.M.D.T tharu9363@gmail.com Gunathilaka

More information

Evaluating Classifiers for Disease Gene Discovery

Evaluating Classifiers for Disease Gene Discovery Evaluating Classifiers for Disease Gene Discovery Kino Coursey Lon Turnbull khc0021@unt.edu lt0013@unt.edu Abstract Identification of genes involved in human hereditary disease is an important bioinfomatics

More information

Machine-Learning on Prediction of Inherited Genomic Susceptibility for 20 Major Cancers

Machine-Learning on Prediction of Inherited Genomic Susceptibility for 20 Major Cancers Machine-Learning on Prediction of Inherited Genomic Susceptibility for 20 Major Cancers Sung-Hou Kim University of California Berkeley, CA Global Bio Conference 2017 MFDS, Seoul, Korea June 28, 2017 Cancer

More information

Bayesian Concepts in Software Testing: An Initial Review

Bayesian Concepts in Software Testing: An Initial Review Bayesian Concepts in Software Testing: An Initial Review Daniel Rodríguez Universidad de Alcalá Javier Dolado U. País Vasco/Euskal Herriko Unibertsitatea Javier Tuya Universidad de Oviedo PRESI TIN2013-46928-C3-1-R,

More information

Outline. What s inside this paper? My expectation. Software Defect Prediction. Traditional Method. What s inside this paper?

Outline. What s inside this paper? My expectation. Software Defect Prediction. Traditional Method. What s inside this paper? Outline A Critique of Software Defect Prediction Models Norman E. Fenton Dongfeng Zhu What s inside this paper? What kind of new technique was developed in this paper? Research area of this technique?

More information

You must answer question 1.

You must answer question 1. Research Methods and Statistics Specialty Area Exam October 28, 2015 Part I: Statistics Committee: Richard Williams (Chair), Elizabeth McClintock, Sarah Mustillo You must answer question 1. 1. Suppose

More information

Computational Cognitive Neuroscience

Computational Cognitive Neuroscience Computational Cognitive Neuroscience Computational Cognitive Neuroscience Computational Cognitive Neuroscience *Computer vision, *Pattern recognition, *Classification, *Picking the relevant information

More information

Contributions to Brain MRI Processing and Analysis

Contributions to Brain MRI Processing and Analysis Contributions to Brain MRI Processing and Analysis Dissertation presented to the Department of Computer Science and Artificial Intelligence By María Teresa García Sebastián PhD Advisor: Prof. Manuel Graña

More information

Introduction to Computational Neuroscience

Introduction to Computational Neuroscience Introduction to Computational Neuroscience Lecture 11: Attention & Decision making Lesson Title 1 Introduction 2 Structure and Function of the NS 3 Windows to the Brain 4 Data analysis 5 Data analysis

More information

An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns

An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns An Artificial Neural Network Architecture Based on Context Transformations in Cortical Minicolumns 1. Introduction Vasily Morzhakov, Alexey Redozubov morzhakovva@gmail.com, galdrd@gmail.com Abstract Cortical

More information

Lecture 3: Bayesian Networks 1

Lecture 3: Bayesian Networks 1 Lecture 3: Bayesian Networks 1 Jingpeng Li 1 Content Reminder from previous lecture: Bayes theorem Bayesian networks Why are they currently interesting? Detailed example from medical diagnostics Bayesian

More information

EEL-5840 Elements of {Artificial} Machine Intelligence

EEL-5840 Elements of {Artificial} Machine Intelligence Menu Introduction Syllabus Grading: Last 2 Yrs Class Average 3.55; {3.7 Fall 2012 w/24 students & 3.45 Fall 2013} General Comments Copyright Dr. A. Antonio Arroyo Page 2 vs. Artificial Intelligence? DEF:

More information

A probabilistic method for food web modeling

A probabilistic method for food web modeling A probabilistic method for food web modeling Bayesian Networks methodology, challenges, and possibilities Anna Åkesson, Linköping University, Sweden 2 nd international symposium on Ecological Networks,

More information

Neurons and neural networks II. Hopfield network

Neurons and neural networks II. Hopfield network Neurons and neural networks II. Hopfield network 1 Perceptron recap key ingredient: adaptivity of the system unsupervised vs supervised learning architecture for discrimination: single neuron perceptron

More information

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India 20th International Congress on Modelling and Simulation, Adelaide, Australia, 1 6 December 2013 www.mssanz.org.au/modsim2013 Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision

More information

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach)

Single SNP/Gene Analysis. Typical Results of GWAS Analysis (Single SNP Approach) Typical Results of GWAS Analysis (Single SNP Approach) High-Throughput Sequencing Course Gene-Set Analysis Biostatistics and Bioinformatics Summer 28 Section Introduction What is Gene Set Analysis? Many names for gene set analysis: Pathway analysis Gene set

More information

Sequential Decision Making

Sequential Decision Making Sequential Decision Making Sequential decisions Many (most) real world problems cannot be solved with a single action. Need a longer horizon Ex: Sequential decision problems We start at START and want

More information

CSE 258 Lecture 1.5. Web Mining and Recommender Systems. Supervised learning Regression

CSE 258 Lecture 1.5. Web Mining and Recommender Systems. Supervised learning Regression CSE 258 Lecture 1.5 Web Mining and Recommender Systems Supervised learning Regression What is supervised learning? Supervised learning is the process of trying to infer from labeled data the underlying

More information

Agents and Environments

Agents and Environments Artificial Intelligence Programming s and s Chris Brooks 3-2: Overview What makes an agent? Defining an environment Types of agent programs 3-3: Overview What makes an agent? Defining an environment Types

More information

The Century of Bayes

The Century of Bayes The Century of Bayes Joseph J. Retzer Ph.D., Maritz Research The Bayesian `machine together with MCMC is arguably the most powerful mechanism ever created for processing data and knowledge Berger, 2001

More information

Predicting Kidney Cancer Survival from Genomic Data

Predicting Kidney Cancer Survival from Genomic Data Predicting Kidney Cancer Survival from Genomic Data Christopher Sauer, Rishi Bedi, Duc Nguyen, Benedikt Bünz Abstract Cancers are on par with heart disease as the leading cause for mortality in the United

More information

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models White Paper 23-12 Estimating Complex Phenotype Prevalence Using Predictive Models Authors: Nicholas A. Furlotte Aaron Kleinman Robin Smith David Hinds Created: September 25 th, 2015 September 25th, 2015

More information

An Efficient Attribute Ordering Optimization in Bayesian Networks for Prognostic Modeling of the Metabolic Syndrome

An Efficient Attribute Ordering Optimization in Bayesian Networks for Prognostic Modeling of the Metabolic Syndrome An Efficient Attribute Ordering Optimization in Bayesian Networks for Prognostic Modeling of the Metabolic Syndrome Han-Saem Park and Sung-Bae Cho Department of Computer Science, Yonsei University 134

More information

A Survey on Prediction of Diabetes Using Data Mining Technique

A Survey on Prediction of Diabetes Using Data Mining Technique A Survey on Prediction of Diabetes Using Data Mining Technique K.Priyadarshini 1, Dr.I.Lakshmi 2 PG.Scholar, Department of Computer Science, Stella Maris College, Teynampet, Chennai, Tamil Nadu, India

More information

Bayesian (Belief) Network Models,

Bayesian (Belief) Network Models, Bayesian (Belief) Network Models, 2/10/03 & 2/12/03 Outline of This Lecture 1. Overview of the model 2. Bayes Probability and Rules of Inference Conditional Probabilities Priors and posteriors Joint distributions

More information

Vision as Bayesian inference: analysis by synthesis?

Vision as Bayesian inference: analysis by synthesis? Vision as Bayesian inference: analysis by synthesis? Schwarz Andreas, Wiesner Thomas 1 / 70 Outline Introduction Motivation Problem Description Bayesian Formulation Generative Models Letters, Text Faces

More information

Using Probabilistic Methods to Optimize Data Entry in Accrual of Patients to Clinical Trials

Using Probabilistic Methods to Optimize Data Entry in Accrual of Patients to Clinical Trials Using Probabilistic Methods to Optimize Data Entry in Accrual of Patients to Clinical Trials Bhavesh D. Goswami 1, Lawrence O. Hall 1, Dmitry B. Goldgof 1, Eugene Fink 2, Jeffrey P. Krischer 1 bgoswami@csee.usf.edu,

More information

TEACHING YOUNG GROWNUPS HOW TO USE BAYESIAN NETWORKS.

TEACHING YOUNG GROWNUPS HOW TO USE BAYESIAN NETWORKS. TEACHING YOUNG GROWNUPS HOW TO USE BAYESIAN NETWORKS Stefan Krauss 1, Georg Bruckmaier 1 and Laura Martignon 2 1 Institute of Mathematics and Mathematics Education, University of Regensburg, Germany 2

More information

CS343: Artificial Intelligence

CS343: Artificial Intelligence CS343: Artificial Intelligence Introduction: Part 2 Prof. Scott Niekum University of Texas at Austin [Based on slides created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All materials

More information

Signal Detection Theory and Bayesian Modeling

Signal Detection Theory and Bayesian Modeling Signal Detection Theory and Bayesian Modeling COGS 202: Computational Modeling of Cognition Omar Shanta, Shuai Tang, Gautam Reddy, Reina Mizrahi, Mehul Shah Detection Theory and Psychophysics: A Review

More information

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16 PGAR: ASD Candidate Gene Prioritization System Using Expression Patterns Steven Cogill and Liangjiang Wang Department of Genetics and

More information

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you? WDHS Curriculum Map Probability and Statistics Time Interval/ Unit 1: Introduction to Statistics 1.1-1.3 2 weeks S-IC-1: Understand statistics as a process for making inferences about population parameters

More information

VL Network Analysis ( ) SS2016 Week 3

VL Network Analysis ( ) SS2016 Week 3 VL Network Analysis (19401701) SS2016 Week 3 Based on slides by J Ruan (U Texas) Tim Conrad AG Medical Bioinformatics Institut für Mathematik & Informatik, Freie Universität Berlin 1 Motivation 2 Lecture

More information

Distinguishing epidemiological dependent from treatment (resistance) dependent HIV mutations: Problem Statement

Distinguishing epidemiological dependent from treatment (resistance) dependent HIV mutations: Problem Statement Distinguishing epidemiological dependent from treatment (resistance) dependent HIV mutations: Problem Statement Leander Schietgat 1, Kristof Theys 2, Jan Ramon 1, Hendrik Blockeel 1, and Anne-Mieke Vandamme

More information

Gene Selection for Tumor Classification Using Microarray Gene Expression Data

Gene Selection for Tumor Classification Using Microarray Gene Expression Data Gene Selection for Tumor Classification Using Microarray Gene Expression Data K. Yendrapalli, R. Basnet, S. Mukkamala, A. H. Sung Department of Computer Science New Mexico Institute of Mining and Technology

More information

The Human Behaviour-Change Project

The Human Behaviour-Change Project The Human Behaviour-Change Project Participating organisations A Collaborative Award funded by the www.humanbehaviourchange.org @HBCProject This evening Opening remarks from the chair Mary de Silva, The

More information

Introduction to Bayesian Analysis 1

Introduction to Bayesian Analysis 1 Biostats VHM 801/802 Courses Fall 2005, Atlantic Veterinary College, PEI Henrik Stryhn Introduction to Bayesian Analysis 1 Little known outside the statistical science, there exist two different approaches

More information

Bayesian and Frequentist Approaches

Bayesian and Frequentist Approaches Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law

More information

Evolutionary Programming

Evolutionary Programming Evolutionary Programming Searching Problem Spaces William Power April 24, 2016 1 Evolutionary Programming Can we solve problems by mi:micing the evolutionary process? Evolutionary programming is a methodology

More information

Stage-Specific Predictive Models for Cancer Survivability

Stage-Specific Predictive Models for Cancer Survivability University of Wisconsin Milwaukee UWM Digital Commons Theses and Dissertations December 2016 Stage-Specific Predictive Models for Cancer Survivability Elham Sagheb Hossein Pour University of Wisconsin-Milwaukee

More information

Overview. What is an agent?

Overview. What is an agent? Artificial Intelligence Programming s and s Chris Brooks Overview What makes an agent? Defining an environment Overview What makes an agent? Defining an environment Department of Computer Science University

More information

Genomics Research. May 31, Malvika Pillai

Genomics Research. May 31, Malvika Pillai Genomics Research May 31, 2018 Malvika Pillai Outline for Research Discussion Why Informatics Read journal articles! Example Paper Presentation Research Pipeline How To Read A Paper If I m aiming to just

More information

AClass: A Simple, Online Probabilistic Classifier. Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL

AClass: A Simple, Online Probabilistic Classifier. Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL AClass: A Simple, Online Probabilistic Classifier Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL AClass: A Simple, Online Probabilistic Classifier or How I learned to stop worrying

More information

Intelligent System Project

Intelligent System Project Intelligent System Project (Part VI INTELLIGENT METHODS & MODELS) Miquel Sànchez i Marrè miquel@cs.upc.edu http://www.cs.upc.edu/~miquel Course 2018/2019 https://kemlg.upc.edu PART 6 INTELLIGENT METHODS

More information

Automatic Pathology Software for Diagnosis of Non-Alcoholic Fatty Liver Disease

Automatic Pathology Software for Diagnosis of Non-Alcoholic Fatty Liver Disease Automatic Pathology Software for Diagnosis of Non-Alcoholic Fatty Liver Disease (OTT ID 1236) Inventors: Joseph Bockhorst and Scott Vanderbeck, Department of Computer Science and Electrical Engineering,

More information

Does Machine Learning. In a Learning Health System?

Does Machine Learning. In a Learning Health System? Does Machine Learning Have a Place In a Learning Health System? Grand Rounds: Rethinking Clinical Research Friday, December 15, 2017 Michael J. Pencina, PhD Professor of Biostatistics and Bioinformatics,

More information

Neuroinformatics. Ilmari Kurki, Urs Köster, Jukka Perkiö, (Shohei Shimizu) Interdisciplinary and interdepartmental

Neuroinformatics. Ilmari Kurki, Urs Köster, Jukka Perkiö, (Shohei Shimizu) Interdisciplinary and interdepartmental Neuroinformatics Aapo Hyvärinen, still Academy Research Fellow for a while Post-docs: Patrik Hoyer and Jarmo Hurri + possibly international post-docs PhD students Ilmari Kurki, Urs Köster, Jukka Perkiö,

More information

Accessing and Using ENCODE Data Dr. Peggy J. Farnham

Accessing and Using ENCODE Data Dr. Peggy J. Farnham 1 William M Keck Professor of Biochemistry Keck School of Medicine University of Southern California How many human genes are encoded in our 3x10 9 bp? C. elegans (worm) 959 cells and 1x10 8 bp 20,000

More information

PART - A 1. Define Artificial Intelligence formulated by Haugeland. The exciting new effort to make computers think machines with minds in the full and literal sense. 2. Define Artificial Intelligence

More information

Bayesian Prediction Tree Models

Bayesian Prediction Tree Models Bayesian Prediction Tree Models Statistical Prediction Tree Modelling for Clinico-Genomics Clinical gene expression data - expression signatures, profiling Tree models for predictive sub-typing Combining

More information

Artificial Intelligence Approach for Disease Diagnosis and Treatment

Artificial Intelligence Approach for Disease Diagnosis and Treatment Artificial Intelligence Approach for Disease Diagnosis and Treatment S. Vaishnavi PG Scholar, ME- Department of CSE, PSR Engineering College, Sivakasi, TamilNadu, India. ABSTRACT: Generally, Data mining

More information

A Fuzzy Improved Neural based Soft Computing Approach for Pest Disease Prediction

A Fuzzy Improved Neural based Soft Computing Approach for Pest Disease Prediction International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 13 (2014), pp. 1335-1341 International Research Publications House http://www. irphouse.com A Fuzzy Improved

More information

How to use prior knowledge and still give new data a chance?

How to use prior knowledge and still give new data a chance? How to use prior knowledge and still give new data a chance? Kristina Weber1, Rob Hemmings2, Armin Koch 19.12.18 1 now with Roche, 1 2 MHRA, London, UK Part I Background Extrapolation and Bayesian methods:

More information

A Brief Introduction to Bayesian Statistics

A Brief Introduction to Bayesian Statistics A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon

More information

Image Captioning using Reinforcement Learning. Presentation by: Samarth Gupta

Image Captioning using Reinforcement Learning. Presentation by: Samarth Gupta Image Captioning using Reinforcement Learning Presentation by: Samarth Gupta 1 Introduction Summary Supervised Models Image captioning as RL problem Actor Critic Architecture Policy Gradient architecture

More information

ABSTRACT I. INTRODUCTION. Mohd Thousif Ahemad TSKC Faculty Nagarjuna Govt. College(A) Nalgonda, Telangana, India

ABSTRACT I. INTRODUCTION. Mohd Thousif Ahemad TSKC Faculty Nagarjuna Govt. College(A) Nalgonda, Telangana, India International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 1 ISSN : 2456-3307 Data Mining Techniques to Predict Cancer Diseases

More information

TABLE OF CONTENTS CHAPTER NO. TITLE PAGE NO. ABSTRACT

TABLE OF CONTENTS CHAPTER NO. TITLE PAGE NO. ABSTRACT vii TABLE OF CONTENTS CHAPTER NO. TITLE PAGE NO. ABSTRACT LIST OF TABLES LIST OF FIGURES LIST OF SYMBOLS AND ABBREVIATIONS iii xi xii xiii 1 INTRODUCTION 1 1.1 CLINICAL DATA MINING 1 1.2 OBJECTIVES OF

More information

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014 UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014 Exam policy: This exam allows two one-page, two-sided cheat sheets (i.e. 4 sides); No other materials. Time: 2 hours. Be sure to write

More information

Model calibration and Bayesian methods for probabilistic projections

Model calibration and Bayesian methods for probabilistic projections ETH Zurich Reto Knutti Model calibration and Bayesian methods for probabilistic projections Reto Knutti, IAC ETH Toy model Model: obs = linear trend + noise(variance, spectrum) 1) Short term predictability,

More information

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq

Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Computational Analysis of UHT Sequences Histone modifications, CAGE, RNA-Seq Philipp Bucher Wednesday January 21, 2009 SIB graduate school course EPFL, Lausanne ChIP-seq against histone variants: Biological

More information

School of Population and Public Health SPPH 503 Epidemiologic methods II January to April 2019

School of Population and Public Health SPPH 503 Epidemiologic methods II January to April 2019 School of Population and Public Health SPPH 503 Epidemiologic methods II January to April 2019 Time: Tuesday, 1330 1630 Location: School of Population and Public Health, UBC Course description Students

More information

Rethinking Cognitive Architecture!

Rethinking Cognitive Architecture! Rethinking Cognitive Architecture! Reconciling Uniformity and Diversity via Graphical Models! Paul Rosenbloom!!! 1/25/2010! Department of Computer Science &! Institute for Creative Technologies! The projects

More information

5/20/2014. Leaving Andy Clark's safe shores: Scaling Predictive Processing to higher cognition. ANC sysmposium 19 May 2014

5/20/2014. Leaving Andy Clark's safe shores: Scaling Predictive Processing to higher cognition. ANC sysmposium 19 May 2014 ANC sysmposium 19 May 2014 Lorentz workshop on HPI Leaving Andy Clark's safe shores: Scaling Predictive Processing to higher cognition Johan Kwisthout, Maria Otworowska, Harold Bekkering, Iris van Rooij

More information

Artificial Intelligence Programming Probability

Artificial Intelligence Programming Probability Artificial Intelligence Programming Probability Chris Brooks Department of Computer Science University of San Francisco Department of Computer Science University of San Francisco p.1/25 17-0: Uncertainty

More information

Cognitive Modeling. Lecture 12: Bayesian Inference. Sharon Goldwater. School of Informatics University of Edinburgh

Cognitive Modeling. Lecture 12: Bayesian Inference. Sharon Goldwater. School of Informatics University of Edinburgh Cognitive Modeling Lecture 12: Bayesian Inference Sharon Goldwater School of Informatics University of Edinburgh sgwater@inf.ed.ac.uk February 18, 20 Sharon Goldwater Cognitive Modeling 1 1 Prediction

More information

The Human Side of Science: I ll Take That Bet! Balancing Risk and Benefit. Uncertainty, Risk and Probability: Fundamental Definitions and Concepts

The Human Side of Science: I ll Take That Bet! Balancing Risk and Benefit. Uncertainty, Risk and Probability: Fundamental Definitions and Concepts The Human Side of Science: I ll Take That Bet! Balancing Risk and Benefit Uncertainty, Risk and Probability: Fundamental Definitions and Concepts What Is Uncertainty? A state of having limited knowledge

More information

Sensory specific satiation: using Bayesian networks to combine data from related studies

Sensory specific satiation: using Bayesian networks to combine data from related studies Sensory specific satiation: using Bayesian networks to combine data from related studies Van-Anh Phan, PhD student 10 th Sensometrics Rotterdam, July 2010 Outline of the presentation I. Bayesian networks

More information

Why did the network make this prediction?

Why did the network make this prediction? Why did the network make this prediction? Ankur Taly (Google Inc.) Joint work with Mukund Sundararajan and Qiqi Yan Some Deep Learning Successes Source: https://www.cbsinsights.com Deep Neural Networks

More information

How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection

How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection How to Create Better Performing Bayesian Networks: A Heuristic Approach for Variable Selection Esma Nur Cinicioglu * and Gülseren Büyükuğur Istanbul University, School of Business, Quantitative Methods

More information

Solving Problems of Clustering and Classification of Cancer Diseases Based on DNA Methylation Data 1,2

Solving Problems of Clustering and Classification of Cancer Diseases Based on DNA Methylation Data 1,2 APPLIED PROBLEMS Solving Problems of Clustering and Classification of Cancer Diseases Based on DNA Methylation Data 1,2 A. N. Polovinkin a, I. B. Krylov a, P. N. Druzhkov a, M. V. Ivanchenko a, I. B. Meyerov

More information