Analysis of acgh data: statistical models and computational challenges
|
|
- Lewis Summers
- 5 years ago
- Views:
Transcription
1 : statistical models and computational challenges Ramón Díaz-Uriarte Díaz-Uriarte, R. acgh analysis: models and computation / 38
2 Outline 1 Introduction Alternative approaches What we really want RJaCGH There is math (just so you believe us) RJaCGH: typical output RJaCGH: performance 2 Statistical methods need software Introduction ADaCGH Can we make it fast? MPI et al. Díaz-Uriarte, R. acgh analysis: models and computation / 38
3 1 Introduction Alternative approaches What we really want RJaCGH There is math (just so you believe us) RJaCGH: typical output RJaCGH: performance 2 Statistical methods need software Introduction ADaCGH Can we make it fast? MPI et al. Díaz-Uriarte, R. acgh analysis: models and computation / 38
4 Introduction Log 2 (T/ R) Barrett et al., 2004 Calling gains and losses: hypothesis testing Olshen, 2005 Hupe & Barillot, 2005 Arrays: but location in chromosome matters Inferring number of copy gains/losses: estimation Chromosome Díaz-Uriarte, R. acgh analysis: models and computation / 38
5 Alternative approaches Available methods Hypothesis testing based Circular Binary Segmentation CGHExplorer acghsmooth SWArray CLAC Wavelet-based smoothing Copy number estimation Hidden markov models Quantile smoothing GLAD (adaptive nonparametric kernel smoothing) Gaussian mixtures Bayes regression CGHMIX Díaz-Uriarte, R. acgh analysis: models and computation / 38
6 What we really want Is this status of affairs OK? What is it we really want? Díaz-Uriarte, R. acgh analysis: models and computation / 38
7 What we really want What is it we really want? 1. Probabilities of alteration. Díaz-Uriarte, R. acgh analysis: models and computation / 38
8 What we really want What is it we really want? 1. Probabilities of alteration. The direct answer (to is this gene/region gained/lost? No, p-values and smoothed means are not a direct answer.) Usable in contexts from clinical to basic research: modify your thresholds as needed. Probabilities: of genes, of regions, of regions across subjects, etc. (No, not easy, but doable). Probabilities: incorporate uncertainty. Díaz-Uriarte, R. acgh analysis: models and computation / 38
9 What we really want What is it we really want? 2. Account for distance between probes. Díaz-Uriarte, R. acgh analysis: models and computation / 38
10 What we really want What is it we really want? 2. Account for distance between probes. Most platforms widely variable distance between probes. The larger the distance, the more likely a change. The larger the distance, the less information a probe provides about state of nearby probes. Use distance so that the information that consecutive probes provide is adequately accounted for. Díaz-Uriarte, R. acgh analysis: models and computation / 38
11 What we really want What is it we really want? 3. Genome-wide and chromosome-wide analysis. Díaz-Uriarte, R. acgh analysis: models and computation / 38
12 What we really want What is it we really want? 3. Genome-wide and chromosome-wide analysis. Chromosome-wide: alterations wrt chromosome s mean ploidy. Genome-wide: only way to detect whole chromosome gains/loses. Díaz-Uriarte, R. acgh analysis: models and computation / 38
13 What we really want Do available methods give that to us? No Díaz-Uriarte, R. acgh analysis: models and computation / 38
14 What we really want Do available methods give that to us? No Most don t provide probabilities. Most don t use distance between probes. Many ad-hoc, with parameters without intuitive meaning. No single methods fulfills the above three. Díaz-Uriarte, R. acgh analysis: models and computation / 38
15 What we really want... thus we will have to develop a new method Díaz-Uriarte, R. acgh analysis: models and computation / 38
16 RJaCGH Model: characteristics Finite number of different copy gains / losses, number of gains/losses not measured directly, state of every gen related to the state of its neighbours: Hidden Markov Model (HMM) with Gaussian emissions. Influence larger the closer the genes are: non-homogeneous HMM with Gaussian distributions Díaz-Uriarte, R. acgh analysis: models and computation / 38
17 RJaCGH Model: estimation Well known machinery for homogeneous HMM with known number of states Díaz-Uriarte, R. acgh analysis: models and computation / 38
18 RJaCGH Model: estimation Well known machinery for homogeneous HMM with known number of states We have: non-homogeneous HMM with unknown number of states and want probabilistic statements about likely state and flexible specification of model: Bayesian inference through Markov Chain Monte Carlo Díaz-Uriarte, R. acgh analysis: models and computation / 38
19 RJaCGH Model: estimation Well known machinery for homogeneous HMM with known number of states We have: non-homogeneous HMM with unknown number of states and want probabilistic statements about likely state and flexible specification of model: Bayesian inference through Markov Chain Monte Carlo Number of states not known in advance number of parameters different for different models need to jump between models during MCMC for automatic selection and probab. asses. of number of states: reversible jump MCMC Díaz-Uriarte, R. acgh analysis: models and computation / 38
20 RJaCGH Model: estimation Well known machinery for homogeneous HMM with known number of states We have: non-homogeneous HMM with unknown number of states and want probabilistic statements about likely state and flexible specification of model: Bayesian inference through Markov Chain Monte Carlo Number of states not known in advance number of parameters different for different models need to jump between models during MCMC for automatic selection and probab. asses. of number of states: reversible jump MCMC Model uncertainty must be taken into account: models averaged using Bayesian Model Averaging (improved mean square error too!). Díaz-Uriarte, R. acgh analysis: models and computation / 38
21 There is math (just so you believe us) Statistical model Díaz-Uriarte, R. acgh analysis: models and computation / 38
22 There is math (just so you believe us) Bayesian model Díaz-Uriarte, R. acgh analysis: models and computation / 38
23 There is math (just so you believe us) Reversible Jump Díaz-Uriarte, R. acgh analysis: models and computation / 38
24 RJaCGH: typical output Díaz-Uriarte, R. acgh analysis: models and computation / 38
25 RJaCGH: typical output Díaz-Uriarte, R. acgh analysis: models and computation / 38
26 RJaCGH: performance Does it really work better? Compare against best-performing methods (two reviews tell us which are best ). Data NOT simulated under our model: Willenbrock and Fridlyand. Díaz-Uriarte, R. acgh analysis: models and computation / 38
27 RJaCGH: performance Adding variable distances to the data Díaz-Uriarte, R. acgh analysis: models and computation / 38
28 RJaCGH: performance Adding variable distances to the data Location of gaps: Uniform (100) Length of gaps: Poisson. Increasing lambda of Poisson (recall: mean = variance = λ) increases variance of inter-probe distance and % of gaps. Díaz-Uriarte, R. acgh analysis: models and computation / 38
29 RJaCGH: performance RJaCGH: and how does it do? Correct classification rate ACE BIOHMM DNACopy + Merge levels HMM RJaCGH 0 % 10 % 25 % 50 % 65 % Proportion of genes missing Díaz-Uriarte, R. acgh analysis: models and computation / 38
30 RJaCGH: performance RJaCGH: and how does it do? (II) 0 % 10 % 25 % 50 % 65 % Std.Dev.Sample : (0.18,0.19] Std.Dev.Sample : (0.19,0.2] ACE BIOHMM DNA Copy + Merge Levels HMM RJaCGH 0.96 Std.Dev.Sample : (0.14,0.15] Std.Dev.Sample : (0.15,0.16] Std.Dev.Sample : (0.16,0.17] Std.Dev.Sample : (0.17,0.18] Correct classification Std.Dev.Sample : (0.09,0.11] Std.Dev.Sample : (0.11,0.12] Std.Dev.Sample : (0.12,0.13] Std.Dev.Sample : (0.13,0.14] % 10 % 25 % 50 % 65 % Proportion of missing genes 0 % 10 % 25 % 50 % 65 % Díaz-Uriarte, R. acgh analysis: models and computation / 38
31 RJaCGH: performance acgh analyses: what next? Minimal common regions (joint product of marginals!!!) Computation: many MCMCs... Díaz-Uriarte, R. acgh analysis: models and computation / 38
32 Statistical methods need software 1 Introduction Alternative approaches What we really want RJaCGH There is math (just so you believe us) RJaCGH: typical output RJaCGH: performance 2 Statistical methods need software Introduction ADaCGH Can we make it fast? MPI et al. Díaz-Uriarte, R. acgh analysis: models and computation / 38
33 Statistical methods need software Introduction Statistical methods need software (...) a reference implementation, some code which is warranted to give the authors intended answers in a moderately-sized problem. It need not be efficient, but it should be available to anyone and everyone. Brian D. Ripley, RSS 2002 Plenary Lecture. (...) publishing figures or results without the complete software environment could be compared to a mathematician publishing an announcement of a mathematical theorem without giving the proof. Jonatahn B. Buckeit and David L. Donoho. Wavelab and Reproducible Research. Díaz-Uriarte, R. acgh analysis: models and computation / 38
34 Statistical methods need software Introduction Need not be efficient? Or who is our audience? Díaz-Uriarte, R. acgh analysis: models and computation / 38
35 Statistical methods need software Introduction Need not be efficient? Or who is our audience? Statisticians, bioinformaticians Wet-lab researchers Díaz-Uriarte, R. acgh analysis: models and computation / 38
36 Statistical methods need software Introduction For wet-lab researchers Implementing a user friendly tool for a Bayesian-based approach not easy Díaz-Uriarte, R. acgh analysis: models and computation / 38
37 Statistical methods need software Introduction For wet-lab researchers Implementing a user friendly tool for a Bayesian-based approach not easy Will the user get tired of waiting (faster, faster) All the extra stuff (convergence, restarting chains, etc) Díaz-Uriarte, R. acgh analysis: models and computation / 38
38 Statistical methods need software Introduction For wet-lab researchers Implementing a user friendly tool for a Bayesian-based approach not easy Will the user get tired of waiting (faster, faster) All the extra stuff (convergence, restarting chains, etc) Working on it. In the meantime... Díaz-Uriarte, R. acgh analysis: models and computation / 38
39 Statistical methods need software ADaCGH A simpler problem ADaCGH: A web-based tool for the analysis of acgh data. Common interface Implements all decently performing methods: CBS, HMM, BioHMM, GLAD, cghseg, ACE, PSW, Wavelet-based smoothing. Dynamic graphics and links to additional functional information. And it is FAST. Díaz-Uriarte, R. acgh analysis: models and computation / 38
40 Statistical methods need software Can we make it fast? How fast is fast? (I) Small data set (2271 genes) HMM GLAD CBS Sequential code User wall time (seconds) Parallelized code Number of arrays (samples) Díaz-Uriarte, R. acgh analysis: models and computation / 38
41 Statistical methods need software Can we make it fast? How fast is fast? (II) Medium data set (10,000 genes) HMM GLAD CBS Sequential code User wall time (seconds) Parallelized code Number of arrays (samples) Díaz-Uriarte, R. acgh analysis: models and computation / 38
42 Statistical methods need software Can we make it fast? How many users? HMM Small data set (2271 genes, 10 arrays) Wavelets User wall time (seconds) CBS GLAD Number of simultaneous users Díaz-Uriarte, R. acgh analysis: models and computation / 38
43 Statistical methods need software MPI et al. Underneath Combine MPI with web-service load-balancing plus systematic rotation of MPI master and slave nodes. Díaz-Uriarte, R. acgh analysis: models and computation / 38
44 Statistical methods need software MPI et al. Díaz-Uriarte, R. acgh analysis: models and computation / 38
45 Statistical methods need software MPI et al. What can we parallelize? Díaz-Uriarte, R. acgh analysis: models and computation / 38
46 Statistical methods need software MPI et al. What can we parallelize? Segmentation (heavy number crunching) and figures... Díaz-Uriarte, R. acgh analysis: models and computation / 38
47 Statistical methods need software MPI et al. What can we parallelize? Segmentation (heavy number crunching) and figures... over arrays, or chromosomes, or chromosomes by arrays. Díaz-Uriarte, R. acgh analysis: models and computation / 38
48 Statistical methods need software MPI et al. What can we parallelize? Segmentation (heavy number crunching) and figures... over arrays, or chromosomes, or chromosomes by arrays.... for everything else, there is load-balancing. Díaz-Uriarte, R. acgh analysis: models and computation / 38
49 Statistical methods need software MPI et al. Is this the way to go? (I) Gains are spectacular. Díaz-Uriarte, R. acgh analysis: models and computation / 38
50 Statistical methods need software MPI et al. Is this the way to go? (I) Gains are spectacular. Strange (Striking?): we seem to be the only ones to use, systematically, parallelization for bioinfo/biostats web-based applications. Díaz-Uriarte, R. acgh analysis: models and computation / 38
51 Statistical methods need software MPI et al. Medium data set (10,000 genes) HMM GLAD CBS Sequential code User wall time (seconds) Parallelized code Number of arrays (samples) Díaz-Uriarte, R. acgh analysis: models and computation / 38
52 Statistical methods need software MPI et al. Is this the way to go? (II) Hard to tell. Díaz-Uriarte, R. acgh analysis: models and computation / 38
53 Statistical methods need software MPI et al. Is this the way to go? (II) Hard to tell. (Automatically) managing, monitoring, sanitizing MPI and R + MPI a pain Díaz-Uriarte, R. acgh analysis: models and computation / 38
54 Statistical methods need software MPI et al. Is this the way to go? (II) Hard to tell. (Automatically) managing, monitoring, sanitizing MPI and R + MPI a pain MPI is fragile (use a fault-tolerant PVM instead?) Díaz-Uriarte, R. acgh analysis: models and computation / 38
55 Statistical methods need software MPI et al. Is this the way to go? (II) Hard to tell. (Automatically) managing, monitoring, sanitizing MPI and R + MPI a pain MPI is fragile (use a fault-tolerant PVM instead?) I wish R were Erlang... Díaz-Uriarte, R. acgh analysis: models and computation / 38
56 Statistical methods need software MPI et al. Is this the way to go? (II) Hard to tell. (Automatically) managing, monitoring, sanitizing MPI and R + MPI a pain MPI is fragile (use a fault-tolerant PVM instead?) I wish R were Erlang... However it is every day more evident that The Free Lunch Is Over: A Fundamental Turn Toward Concurrency in Software : H. Sutter. Díaz-Uriarte, R. acgh analysis: models and computation / 38
57 Statistical methods need software MPI et al. Acknowledgements Oscar Rueda: for the work on RJaCGH. Funding from Fundación de Investigación Médica Mutua Madrileña and Project TIC C02-02 of the Spanish Ministry of Education and Science Ramón y Cajal Programme of the Spanish Ministry of Education and Science CNIO users for interesting problems and tool testing users and developers for a vibrant statistical computing community and amazing platform Díaz-Uriarte, R. acgh analysis: models and computation / 38
Risk-prediction modelling in cancer with multiple genomic data sets: a Bayesian variable selection approach
Risk-prediction modelling in cancer with multiple genomic data sets: a Bayesian variable selection approach Manuela Zucknick Division of Biostatistics, German Cancer Research Center Biometry Workshop,
More informationDetection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit
APPLICATION NOTE Ion PGM System Detection of aneuploidy in a single cell using the Ion ReproSeq PGS View Kit Key findings The Ion PGM System, in concert with the Ion ReproSeq PGS View Kit and Ion Reporter
More informationBayesian Inference Bayes Laplace
Bayesian Inference Bayes Laplace Course objective The aim of this course is to introduce the modern approach to Bayesian statistics, emphasizing the computational aspects and the differences between the
More informationBayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm
Journal of Social and Development Sciences Vol. 4, No. 4, pp. 93-97, Apr 203 (ISSN 222-52) Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Henry De-Graft Acquah University
More informationBiostatistical modelling in genomics for clinical cancer studies
This work was supported by Entente Cordiale Cancer Research Bursaries Biostatistical modelling in genomics for clinical cancer studies Philippe Broët JE 2492 Faculté de Médecine Paris-Sud In collaboration
More informationApplied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business
Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A
More informationHarvard University. A Pseudolikelihood Approach for Simultaneous Analysis of Array Comparative Genomic Hybridizations (acgh)
Harvard University Harvard University Biostatistics Working Paper Series Year 2005 Paper 30 A Pseudolikelihood Approach for Simultaneous Analysis of Array Comparative Genomic Hybridizations (acgh) David
More informationIE 5203 Decision Analysis Lab I Probabilistic Modeling, Inference and Decision Making with Netica
IE 5203 Decision Analysis Lab I Probabilistic Modeling, Inference and Decision Making with Netica Overview of Netica Software Netica Application is a comprehensive tool for working with Bayesian networks
More informationA Vision-based Affective Computing System. Jieyu Zhao Ningbo University, China
A Vision-based Affective Computing System Jieyu Zhao Ningbo University, China Outline Affective Computing A Dynamic 3D Morphable Model Facial Expression Recognition Probabilistic Graphical Models Some
More informationIndividual Differences in Attention During Category Learning
Individual Differences in Attention During Category Learning Michael D. Lee (mdlee@uci.edu) Department of Cognitive Sciences, 35 Social Sciences Plaza A University of California, Irvine, CA 92697-5 USA
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector
More information2014 Modern Modeling Methods (M 3 ) Conference, UCONN
2014 Modern Modeling (M 3 ) Conference, UCONN Comparative study of two calibration methods for micro-simulation models Department of Biostatistics Center for Statistical Sciences Brown University School
More informationIntroduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018
Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this
More informationSearch e Fall /18/15
Sample Efficient Policy Click to edit Master title style Search Click to edit Emma Master Brunskill subtitle style 15-889e Fall 2015 11 Sample Efficient RL Objectives Probably Approximately Correct Minimizing
More informationBayesian Nonparametric Methods for Precision Medicine
Bayesian Nonparametric Methods for Precision Medicine Brian Reich, NC State Collaborators: Qian Guan (NCSU), Eric Laber (NCSU) and Dipankar Bandyopadhyay (VCU) University of Illinois at Urbana-Champaign
More informationTRIPODS Workshop: Models & Machine Learning for Causal I. & Decision Making
TRIPODS Workshop: Models & Machine Learning for Causal Inference & Decision Making in Medical Decision Making : and Predictive Accuracy text Stavroula Chrysanthopoulou, PhD Department of Biostatistics
More informationIntroduction to Bayesian Analysis 1
Biostats VHM 801/802 Courses Fall 2005, Atlantic Veterinary College, PEI Henrik Stryhn Introduction to Bayesian Analysis 1 Little known outside the statistical science, there exist two different approaches
More informationMethod Comparison for Interrater Reliability of an Image Processing Technique in Epilepsy Subjects
22nd International Congress on Modelling and Simulation, Hobart, Tasmania, Australia, 3 to 8 December 2017 mssanz.org.au/modsim2017 Method Comparison for Interrater Reliability of an Image Processing Technique
More informationPractical Bayesian Optimization of Machine Learning Algorithms. Jasper Snoek, Ryan Adams, Hugo LaRochelle NIPS 2012
Practical Bayesian Optimization of Machine Learning Algorithms Jasper Snoek, Ryan Adams, Hugo LaRochelle NIPS 2012 ... (Gaussian Processes) are inadequate for doing speech and vision. I still think they're
More informationGenome-wide copy-number calling (CNAs not CNVs!) Dr Geoff Macintyre
Genome-wide copy-number calling (CNAs not CNVs!) Dr Geoff Macintyre Structural variation (SVs) Copy-number variations C Deletion A B C Balanced rearrangements A B A B C B A C Duplication Inversion Causes
More informationBayesian hierarchical modelling
Bayesian hierarchical modelling Matthew Schofield Department of Mathematics and Statistics, University of Otago Bayesian hierarchical modelling Slide 1 What is a statistical model? A statistical model:
More informationHuman Cancer Genome Project. Bioinformatics/Genomics of Cancer:
Bioinformatics/Genomics of Cancer: Professor of Computer Science, Mathematics and Cell Biology Courant Institute, NYU School of Medicine, Tata Institute of Fundamental Research, and Mt. Sinai School of
More informationProbabilistic Modeling to Support and Facilitate Decision Making in Early Drug Development
Probabilistic Modeling to Support and Facilitate Decision Making in Early Drug Development Huybert Groenendaal, PhD, MBA Francisco Zagmutt, DVM, MPVM EpiX Analytics www.epixanalytics.com EpiX Analytics
More informationVision as Bayesian inference: analysis by synthesis?
Vision as Bayesian inference: analysis by synthesis? Schwarz Andreas, Wiesner Thomas 1 / 70 Outline Introduction Motivation Problem Description Bayesian Formulation Generative Models Letters, Text Faces
More informationUnderstanding DNA Copy Number Data
Understanding DNA Copy Number Data Adam B. Olshen Department of Epidemiology and Biostatistics Helen Diller Family Comprehensive Cancer Center University of California, San Francisco http://cc.ucsf.edu/people/olshena_adam.php
More informationApplications with Bayesian Approach
Applications with Bayesian Approach Feng Li feng.li@cufe.edu.cn School of Statistics and Mathematics Central University of Finance and Economics Outline 1 Missing Data in Longitudinal Studies 2 FMRI Analysis
More informationEECS 433 Statistical Pattern Recognition
EECS 433 Statistical Pattern Recognition Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208 http://www.eecs.northwestern.edu/~yingwu 1 / 19 Outline What is Pattern
More informationAdvanced Bayesian Models for the Social Sciences
Advanced Bayesian Models for the Social Sciences Jeff Harden Department of Political Science, University of Colorado Boulder jeffrey.harden@colorado.edu Daniel Stegmueller Department of Government, University
More informationBayesian and Frequentist Approaches
Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law
More informationBayesian Random SegmentationModels to Identify Shared Copy Number Aberrations for Array CGH Data
From the SelectedWorks of Veera Baladandayuthapani 2 Bayesian Random SegmentationModels to Identify Shared Copy Number Aberrations for Array CGH Data Veera Baladandayuthapani Available at: https://works.bepress.com/veera/3/
More informationAClass: A Simple, Online Probabilistic Classifier. Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL
AClass: A Simple, Online Probabilistic Classifier Vikash K. Mansinghka Computational Cognitive Science Group MIT BCS/CSAIL AClass: A Simple, Online Probabilistic Classifier or How I learned to stop worrying
More informationAdvanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill)
Advanced Bayesian Models for the Social Sciences Instructors: Week 1&2: Skyler J. Cranmer Department of Political Science University of North Carolina, Chapel Hill skyler@unc.edu Week 3&4: Daniel Stegmueller
More informationExploring the Influence of Particle Filter Parameters on Order Effects in Causal Learning
Exploring the Influence of Particle Filter Parameters on Order Effects in Causal Learning Joshua T. Abbott (joshua.abbott@berkeley.edu) Thomas L. Griffiths (tom griffiths@berkeley.edu) Department of Psychology,
More informationLearning from data when all models are wrong
Learning from data when all models are wrong Peter Grünwald CWI / Leiden Menu Two Pictures 1. Introduction 2. Learning when Models are Seriously Wrong Joint work with John Langford, Tim van Erven, Steven
More informationFor general queries, contact
Much of the work in Bayesian econometrics has focused on showing the value of Bayesian methods for parametric models (see, for example, Geweke (2005), Koop (2003), Li and Tobias (2011), and Rossi, Allenby,
More informationBayesian (Belief) Network Models,
Bayesian (Belief) Network Models, 2/10/03 & 2/12/03 Outline of This Lecture 1. Overview of the model 2. Bayes Probability and Rules of Inference Conditional Probabilities Priors and posteriors Joint distributions
More informationBayesian Joint Modelling of Benefit and Risk in Drug Development
Bayesian Joint Modelling of Benefit and Risk in Drug Development EFSPI/PSDM Safety Statistics Meeting Leiden 2017 Disclosure is an employee and shareholder of GSK Data presented is based on human research
More informationOutline. What s inside this paper? My expectation. Software Defect Prediction. Traditional Method. What s inside this paper?
Outline A Critique of Software Defect Prediction Models Norman E. Fenton Dongfeng Zhu What s inside this paper? What kind of new technique was developed in this paper? Research area of this technique?
More informationAn Ideal Observer Model of Visual Short-Term Memory Predicts Human Capacity Precision Tradeoffs
An Ideal Observer Model of Visual Short-Term Memory Predicts Human Capacity Precision Tradeoffs Chris R. Sims Robert A. Jacobs David C. Knill (csims@cvs.rochester.edu) (robbie@bcs.rochester.edu) (knill@cvs.rochester.edu)
More informationST440/550: Applied Bayesian Statistics. (10) Frequentist Properties of Bayesian Methods
(10) Frequentist Properties of Bayesian Methods Calibrated Bayes So far we have discussed Bayesian methods as being separate from the frequentist approach However, in many cases methods with frequentist
More informationTEACHING BAYESIAN METHODS FOR EXPERIMENTAL DATA ANALYSIS
TEACHING BAYESIAN METHODS FOR EXPERIMENTAL DATA ANALYSIS Bruno Lecoutre, C.N.R.S. et Université de Rouen Mathématiques, France The innumerable articles denouncing the deficiencies of significance testing
More informationVisual Design. Simplicity, Gestalt Principles, Organization/Structure
Visual Design Simplicity, Gestalt Principles, Organization/Structure Many examples are from Universal Principles of Design, Lidwell, Holden, and Butler 1 Why discuss visual design? You need to present
More informationAspects of Statistical Modelling & Data Analysis in Gene Expression Genomics. Mike West Duke University
Aspects of Statistical Modelling & Data Analysis in Gene Expression Genomics Mike West Duke University Papers, software, many links: www.isds.duke.edu/~mw ABS04 web site: Lecture slides, stats notes, papers,
More informationStepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality
Week 9 Hour 3 Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Stat 302 Notes. Week 9, Hour 3, Page 1 / 39 Stepwise Now that we've introduced interactions,
More informationComparison of segmentation methods in cancer samples
fig/logolille2. Comparison of segmentation methods in cancer samples Morgane Pierre-Jean, Guillem Rigaill, Pierre Neuvial Laboratoire Statistique et Génome Université d Évry Val d Éssonne UMR CNRS 8071
More informationList of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition
List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing
More informationGender Based Emotion Recognition using Speech Signals: A Review
50 Gender Based Emotion Recognition using Speech Signals: A Review Parvinder Kaur 1, Mandeep Kaur 2 1 Department of Electronics and Communication Engineering, Punjabi University, Patiala, India 2 Department
More informationCombining Risks from Several Tumors using Markov chain Monte Carlo (MCMC)
Combining Risks from Several Tumors using Markov chain Monte Carlo (MCMC) Leonid Kopylev & Chao Chen NCEA/ORD/USEPA The views expressed in this presentation are those of the authors and do not necessarily
More informationOrdinal Data Modeling
Valen E. Johnson James H. Albert Ordinal Data Modeling With 73 illustrations I ". Springer Contents Preface v 1 Review of Classical and Bayesian Inference 1 1.1 Learning about a binomial proportion 1 1.1.1
More informationSystematic Analysis for Identification of Genes Impacting Cancers
Systematic Analysis for Identification of Genes Impacting Cancers Arpita Singhal Stanford University Saint Francis High School ABSTRACT Currently, vast amounts of molecular information involving genomic
More informationPSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science. Homework 5
PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science Homework 5 Due: 21 Dec 2016 (late homeworks penalized 10% per day) See the course web site for submission details.
More informationUnderstandable Statistics
Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement
More informationCombining Risks from Several Tumors Using Markov Chain Monte Carlo
University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln U.S. Environmental Protection Agency Papers U.S. Environmental Protection Agency 2009 Combining Risks from Several Tumors
More informationUSING STATCRUNCH TO CONSTRUCT CONFIDENCE INTERVALS and CALCULATE SAMPLE SIZE
USING STATCRUNCH TO CONSTRUCT CONFIDENCE INTERVALS and CALCULATE SAMPLE SIZE Using StatCrunch for confidence intervals (CI s) is super easy. As you can see in the assignments, I cover 9.2 before 9.1 because
More informationIdentification of regions with common copy-number variations using SNP array
Identification of regions with common copy-number variations using SNP array Agus Salim Epidemiology and Public Health National University of Singapore Copy Number Variation (CNV) Copy number alteration
More informationHierarchical Bayesian Modeling of Individual Differences in Texture Discrimination
Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination Timothy N. Rubin (trubin@uci.edu) Michael D. Lee (mdlee@uci.edu) Charles F. Chubb (cchubb@uci.edu) Department of Cognitive
More informationCNV PCA Search Tutorial
CNV PCA Search Tutorial Release 8.1 Golden Helix, Inc. March 18, 2014 Contents 1. Data Preparation 2 A. Join Log Ratio Data with Phenotype Information.............................. 2 B. Activate only
More informationInference Methods for First Few Hundred Studies
Inference Methods for First Few Hundred Studies James Nicholas Walker Thesis submitted for the degree of Master of Philosophy in Applied Mathematics and Statistics at The University of Adelaide (Faculty
More informationGeorgetown University ECON-616, Fall Macroeconometrics. URL: Office Hours: by appointment
Georgetown University ECON-616, Fall 2016 Macroeconometrics Instructor: Ed Herbst E-mail: ed.herbst@gmail.com URL: http://edherbst.net/ Office Hours: by appointment Scheduled Class Time and Organization:
More informationA Brief Introduction to Bayesian Statistics
A Brief Introduction to Statistics David Kaplan Department of Educational Psychology Methods for Social Policy Research and, Washington, DC 2017 1 / 37 The Reverend Thomas Bayes, 1701 1761 2 / 37 Pierre-Simon
More informationSignal Detection Theory and Bayesian Modeling
Signal Detection Theory and Bayesian Modeling COGS 202: Computational Modeling of Cognition Omar Shanta, Shuai Tang, Gautam Reddy, Reina Mizrahi, Mehul Shah Detection Theory and Psychophysics: A Review
More informationSmall Sample Bayesian Factor Analysis. PhUSE 2014 Paper SP03 Dirk Heerwegh
Small Sample Bayesian Factor Analysis PhUSE 2014 Paper SP03 Dirk Heerwegh Overview Factor analysis Maximum likelihood Bayes Simulation Studies Design Results Conclusions Factor Analysis (FA) Explain correlation
More informationIntroductory Statistical Inference with the Likelihood Function
Introductory Statistical Inference with the Likelihood Function Charles A. Rohde Introductory Statistical Inference with the Likelihood Function 123 Charles A. Rohde Bloomberg School of Health Johns Hopkins
More informationCARISMA-LMS Workshop on Statistics for Risk Analysis
Department of Mathematics CARISMA-LMS Workshop on Statistics for Risk Analysis Thursday 28 th May 2015 Location: Department of Mathematics, John Crank Building, Room JNCK128 (Campus map can be found at
More informationHow do we combine two treatment arm trials with multiple arms trials in IPD metaanalysis? An Illustration with College Drinking Interventions
1/29 How do we combine two treatment arm trials with multiple arms trials in IPD metaanalysis? An Illustration with College Drinking Interventions David Huh, PhD 1, Eun-Young Mun, PhD 2, & David C. Atkins,
More informationBayesian Random Segmentation Models to Identify Shared Copy Number Aberrations for Array CGH Data
Supplementary materials for this article are available online. Please click the JASA link at http://pubs.amstat.org. Bayesian Random Segmentation Models to Identify Shared Copy Number Aberrations for Array
More informationMeasuring the User Experience
Measuring the User Experience Collecting, Analyzing, and Presenting Usability Metrics Chapter 2 Background Tom Tullis and Bill Albert Morgan Kaufmann, 2008 ISBN 978-0123735584 Introduction Purpose Provide
More informationKelvin Chan Feb 10, 2015
Underestimation of Variance of Predicted Mean Health Utilities Derived from Multi- Attribute Utility Instruments: The Use of Multiple Imputation as a Potential Solution. Kelvin Chan Feb 10, 2015 Outline
More informationBayesian methods in health economics
Bayesian methods in health economics Gianluca Baio University College London Department of Statistical Science g.baio@ucl.ac.uk Seminar Series of the Master in Advanced Artificial Intelligence Madrid,
More informationBOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT REGRESSION MODELS
BOOTSTRAPPING CONFIDENCE LEVELS FOR HYPOTHESES ABOUT REGRESSION MODELS 17 December 2009 Michael Wood University of Portsmouth Business School SBS Department, Richmond Building Portland Street, Portsmouth
More informationUNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER
UNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER 2016 DELIVERING VALUE WITH DATA SCIENCE BAYES APPROACH - MAKING DATA WORK HARDER The Ipsos MORI Data Science team increasingly
More informationFEELING AS INTRINSIC REWARD
FEELING AS INTRINSIC REWARD Bob Marinier John Laird 27 th Soar Workshop May 24, 2007 OVERVIEW What are feelings good for? Intuitively, feelings should serve as a reward signal that can be used by reinforcement
More informationBayesians methods in system identification: equivalences, differences, and misunderstandings
Bayesians methods in system identification: equivalences, differences, and misunderstandings Johan Schoukens and Carl Edward Rasmussen ERNSI 217 Workshop on System Identification Lyon, September 24-27,
More informationStructural Equation Modeling (SEM)
Structural Equation Modeling (SEM) Today s topics The Big Picture of SEM What to do (and what NOT to do) when SEM breaks for you Single indicator (ASU) models Parceling indicators Using single factor scores
More informationThe Century of Bayes
The Century of Bayes Joseph J. Retzer Ph.D., Maritz Research The Bayesian `machine together with MCMC is arguably the most powerful mechanism ever created for processing data and knowledge Berger, 2001
More informationStatistical methods for reliably updating meta-analyses
Statistical methods for reliably updating meta-analyses Mark Simmonds University of York, UK With: Julian Elliott, Joanne McKenzie, Georgia Salanti, Adriani Nikolakopoulou, Julian Higgins Updating meta-analyses
More informationRussian Journal of Agricultural and Socio-Economic Sciences, 3(15)
ON THE COMPARISON OF BAYESIAN INFORMATION CRITERION AND DRAPER S INFORMATION CRITERION IN SELECTION OF AN ASYMMETRIC PRICE RELATIONSHIP: BOOTSTRAP SIMULATION RESULTS Henry de-graft Acquah, Senior Lecturer
More informationGenome-Wide Localization of Protein-DNA Binding and Histone Modification by a Bayesian Change-Point Method with ChIP-seq Data
Genome-Wide Localization of Protein-DNA Binding and Histone Modification by a Bayesian Change-Point Method with ChIP-seq Data Haipeng Xing, Yifan Mo, Will Liao, Michael Q. Zhang Clayton Davis and Geoffrey
More informationCISC453 Winter Probabilistic Reasoning Part B: AIMA3e Ch
CISC453 Winter 2010 Probabilistic Reasoning Part B: AIMA3e Ch 14.5-14.8 Overview 2 a roundup of approaches from AIMA3e 14.5-14.8 14.5 a survey of approximate methods alternatives to the direct computing
More informationCSE 258 Lecture 2. Web Mining and Recommender Systems. Supervised learning Regression
CSE 258 Lecture 2 Web Mining and Recommender Systems Supervised learning Regression Supervised versus unsupervised learning Learning approaches attempt to model data in order to solve a problem Unsupervised
More informationStatistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies
Statistical Analysis of Single Nucleotide Polymorphism Microarrays in Cancer Studies Stanford Biostatistics Workshop Pierre Neuvial with Henrik Bengtsson and Terry Speed Department of Statistics, UC Berkeley
More information10CS664: PATTERN RECOGNITION QUESTION BANK
10CS664: PATTERN RECOGNITION QUESTION BANK Assignments would be handed out in class as well as posted on the class blog for the course. Please solve the problems in the exercises of the prescribed text
More informationBangor University Laboratory Exercise 1, June 2008
Laboratory Exercise, June 2008 Classroom Exercise A forest land owner measures the outside bark diameters at.30 m above ground (called diameter at breast height or dbh) and total tree height from ground
More informationFDA-iRISK 4.0 A Comparative Risk Assessment Tool J u l y 6,
Welcome to the Webinar FDA-iRISK 4.0 A Comparative Risk Assessment Tool J u l y 6, 2 0 1 7 Today s Speakers Dr. Sherri Dennis, FDA Jane Van Doren, FDA Yuhuan Chen, FDA Greg Paoli, RSI Acknowledgements:
More informationModeling Change in Recognition Bias with the Progression of Alzheimer s
Modeling Change in Recognition Bias with the Progression of Alzheimer s James P. Pooley (jpooley@uci.edu) Michael D. Lee (mdlee@uci.edu) Department of Cognitive Sciences, 3151 Social Science Plaza University
More informationReview: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections
Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections New: Bias-variance decomposition, biasvariance tradeoff, overfitting, regularization, and feature selection Yi
More informationYou must answer question 1.
Research Methods and Statistics Specialty Area Exam October 28, 2015 Part I: Statistics Committee: Richard Williams (Chair), Elizabeth McClintock, Sarah Mustillo You must answer question 1. 1. Suppose
More informationRethinking Cognitive Architecture!
Rethinking Cognitive Architecture! Reconciling Uniformity and Diversity via Graphical Models! Paul Rosenbloom!!! 1/25/2010! Department of Computer Science &! Institute for Creative Technologies! The projects
More informationModel calibration and Bayesian methods for probabilistic projections
ETH Zurich Reto Knutti Model calibration and Bayesian methods for probabilistic projections Reto Knutti, IAC ETH Toy model Model: obs = linear trend + noise(variance, spectrum) 1) Short term predictability,
More informationBAYESIAN HYPOTHESIS TESTING WITH SPSS AMOS
Sara Garofalo Department of Psychiatry, University of Cambridge BAYESIAN HYPOTHESIS TESTING WITH SPSS AMOS Overview Bayesian VS classical (NHST or Frequentist) statistical approaches Theoretical issues
More informationExperimental Design for Immunologists
Experimental Design for Immunologists Hulin Wu, Ph.D., Dean s Professor Department of Biostatistics & Computational Biology Co-Director: Center for Biodefense Immune Modeling School of Medicine and Dentistry
More informationBlending Psychometrics with Bayesian Inference Networks: Measuring Hundreds of Latent Variables Simultaneously
Blending Psychometrics with Bayesian Inference Networks: Measuring Hundreds of Latent Variables Simultaneously Jonathan Templin Department of Educational Psychology Achievement and Assessment Institute
More informationInternational Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 1, Jan Feb 2017
RESEARCH ARTICLE Classification of Cancer Dataset in Data Mining Algorithms Using R Tool P.Dhivyapriya [1], Dr.S.Sivakumar [2] Research Scholar [1], Assistant professor [2] Department of Computer Science
More informationExploiting Similarity to Optimize Recommendations from User Feedback
1 Exploiting Similarity to Optimize Recommendations from User Feedback Hasta Vanchinathan Andreas Krause (Learning and Adaptive Systems Group, D-INF,ETHZ ) Collaborators: Isidor Nikolic (Microsoft, Zurich),
More information5/20/2014. Leaving Andy Clark's safe shores: Scaling Predictive Processing to higher cognition. ANC sysmposium 19 May 2014
ANC sysmposium 19 May 2014 Lorentz workshop on HPI Leaving Andy Clark's safe shores: Scaling Predictive Processing to higher cognition Johan Kwisthout, Maria Otworowska, Harold Bekkering, Iris van Rooij
More informationP. RICHARD HAHN. Research areas. Employment. Education. Research papers
P. RICHARD HAHN Arizona State University email: prhahn@asu.edu Research areas Bayesian methods, causal inference, foundations of statistics, nonlinear regression, Monte Carlo methods, applications to social
More informationA Decision-Theoretic Approach to Evaluating Posterior Probabilities of Mental Models
A Decision-Theoretic Approach to Evaluating Posterior Probabilities of Mental Models Jonathan Y. Ito and David V. Pynadath and Stacy C. Marsella Information Sciences Institute, University of Southern California
More informationStatistical Applications in Genetics and Molecular Biology
Statistical Applications in Genetics and Molecular Biology Volume 10, Issue 1 2011 Article 52 Modeling Read Counts for CNV Detection in Exome Sequencing Data Michael I. Love, Max Planck Institute for Molecular
More informationHow to use the Lafayette ESS Report to obtain a probability of deception or truth-telling
Lafayette Tech Talk: How to Use the Lafayette ESS Report to Obtain a Bayesian Conditional Probability of Deception or Truth-telling Raymond Nelson The Lafayette ESS Report is a useful tool for field polygraph
More information