Bayesian hierarchical modelling

Size: px
Start display at page:

Download "Bayesian hierarchical modelling"

Transcription

1 Bayesian hierarchical modelling Matthew Schofield Department of Mathematics and Statistics, University of Otago Bayesian hierarchical modelling Slide 1

2 What is a statistical model? A statistical model: a data generating process f(y θ) The model can be used to simulate data Fixing parameters θ and considering realizations y The model can be used for statistical inference Observe data y Estimate parameters ˆθ Infer to the population of interest Bayesian hierarchical modelling Slide 2

3 What is a hierarchical model? The parameters θ are described by a probability model f(θ ψ) θ is considered a random variable Special cases include: Mixed models Latent variable models Missing data models Various forms of overdispersion Penalized regression... Bayesian hierarchical modelling Slide 3

4 What is Bayesian statistics? Alternate approach for statistical inference Probability is used to express uncertainty Update our knowledge (with data): f(θ) }{{} Prior distribution f(y θ) }{{} Collect data f(θ y) }{{} Posterior distribution Posterior distribution f(θ y) reflects our updated knowledge Used for inference Often the prior is chosen to reflect ignorance Reference or default prior Bayesian hierarchical modelling Slide 4

5 Bayesian hierarchical modelling Combine the previous two slides Use Bayesian statistics for inference from hierarchical models The two are often combined Hierarchical modelling is natural within a Bayesian context Relatively simple to specify and fit hierarchical models Bayesian hierarchical modelling Slide 5

6 Example I: muscle fibres Observe fibre level data across a muscle cross-section Binary: slow-twitch or fast-twitch fibre Bayesian hierarchical modelling Slide 6

7 Example I: muscle fibres Observe fibre level data across a muscle cross-section Binary: slow-twitch or fast-twitch fibre Bayesian hierarchical modelling Slide 6

8 Example I: muscle fibres Fibres are grouped within fascicles Multiple fascicles make up a muscle Goal: understand how fibre composition depends on location Conjecture: function declines near fascicle and muscle edge Model occurs at two levels: Fibre level Parameters describing how fibres vary within fascicle Fascicle level Model fibre level parameters based on fasicle location Complexity: allow for additional spatial covariation Bayesian hierarchical modelling Slide 6

9 Example II: genetic mapping SNP data from high-throughput sequencing Full-sibling family population Outcrossing of two individuals Output: a genetic map Locating the (SNP) markers on the genome Estimating the genetic distance between markers Bayesian hierarchical modelling Slide 7

10 Example II: genetic mapping Statistical model includes: Parameter that account for genotyping error Nuisance parameter Collection of parameters that describe crossover Functions of these parameters determine genetic distance One parameter for each marker data hungry Consider as a realization from hierarchical model Borrow strength and improve estimation? Other advantages: prior specification Potential for model extension describing relationship Consider map uncertainty Bayesian hierarchical modelling Slide 7

11 Example III: animal abundance (a cautionary tale) Avoid tagging animals (difficult) Use repeated counts to estimate abundance Assume the distribution (binomial) is the same each visit Both index (N) and probability (p) are unknown If we have 2 replicates both parameters can be estimated Properties have been long studied Peter Hall (1992): On the Erratic Behavior of Estimators of N in the Binomial N, p distribution Use repeated trials (across space) Consider abundances (N s) as realization from hierarchical model Borrow strength and improve estimation? Bayesian hierarchical modelling Slide 8

12 Other examples Climate reconstruction Missing data in earthquake records Density dependence from mark-recapture data... Bayesian hierarchical modelling Slide 9

13 Some advantages Model latent variables Describe a model for a hidden or partially observed process Separate data collection (nuisance) and process modelling Specify a complex marginal model for the data A series of simple conditional models Return to this point later Improved estimation Specifying hierarchical models can improve estimation Broadly applicable Ideas go back to work by James and Stein Look at some simulation results Bayesian hierarchical modelling Slide 10

14 Simulation: ANOVA type model Five groups, each with 10 observations Variance is known: 1 Look at two scenarios: 1. Five means are similar: µ = (0, 0.1, 0.1, 0.2, 0.2) 2. Five means are unrelated: µ = (0, 100, 100, 200, 200) Look at the mean square error of µ s: Standard ANOVA model y ij iid N(µ j, 1) Hierarchical model y ij iid N(µ j, 1) µ j iid N(α, κ 2 ) Bayesian hierarchical modelling Slide 11

15 Simulation: similar values of µ Difference in squared errors (+ve: hier model preferred) Hierarchical model lower MSE than standard ANOVA model Bayesian hierarchical modelling Slide 12

16 Simulation: unrelated values of µ Difference in squared errors (+ve: hier model preferred) When µ j s are unrelated hierarchical model has done no harm Return to this later Bayesian hierarchical modelling Slide 13

17 Relatively straightforward to fit The model above is straightforward to specify and fit in freely available software, e.g. JAGS. model{ for(j in 1:G){ for(i in 1:n[j]){ y[i,j] ~ dnorm(mu[j],1) } mu[j] ~ dnorm(alpha,tau) } ### Prior distributions -- their specification is for another talk tau <- 1/kappa^2 kappa ~ dt(0,0.04,3)t(0,) alpha ~ dnorm(0,0.0001) } Bayesian hierarchical modelling Slide 14

18 Relatively straightforward to fit When fitting hierarchical models using MCMC Computational issues can and do arise Generally easier than finding MLEs Extending the hierarchical is relatively easy E.g. we could allow variance of y to be: Unknown Vary by group Hierarchical distribution Bayesian hierarchical modelling Slide 15

19 Model extensions: JAGS model{ for(j in 1:G){ } for(i in 1:n[j]){ } y[i,j] ~ dnorm(mu[j],tauy[j]) mu[j] ~ dnorm(alpha[1],tau[1]) tauy[j] <- 1/sdy[j] sdy[j] ~ dlnorm(alpha[2],tau[2]) ### Prior distributions for(h in 1:2){ } tau[h] <- 1/kappa[h]^2 kappa[h] ~ dt(0,0.04,3)t(0,) alpha[h] ~ dnorm(0,0.0001) } Bayesian hierarchical modelling Slide 16

20 What are we doing? Specifying a marginal model We specify the model conditionally f(y θ)f(θ ψ) The model is fitted marginally f(y ψ) = f(y θ)f(θ ψ)dθ MCMC perform (numerical) integration for us With simple conditional models Results in complex marginal models Some care is required Bayesian hierarchical modelling Slide 17

21 Marginal and conditional models Many common distributions are marginal hierarchical models Negative binomial: Conditional: y P ois(θ) θ Gamma(α, β) Marginal: y NB(y; α, β) t distribution Conditional: y N(µ, θ) Marginal: y t ν (µ, σ 2 ) θ Gamma 1 ( ν 2, ν 2 σ2) Beta-binomial Mixture models Probit regression Bayesian hierarchical modelling Slide 18

22 What are we doing? Partial pooling Consider the ANOVA model Two choices: 1. Means are different (no pooling) 2. Means are the same (complete pooling) Hierarchical modeling gives an intermediate option Means are different but related (partial pooling) Bayesian hierarchical modelling Slide 19

23 What are we doing? Partial pooling Batting average JS MLE Bayesian hierarchical modelling Slide 19

24 What are we doing? Biased estimation Consider the ANOVA model Gauss-Markov theorem: BLUE Hierarchal model is introducing bias Simulation 1: E[ µ5 ] 0.1 with µ 5 = 0.2 Increased bias is associated with decreased variance Simulation 1: Var( µ5 ) 0.05 compared to Var( µ 5 ) 0.1 Introduce bias to improve (decrease) the mean square error Goes back to work by James, Stein, Efron, Morris,... Bayesian hierarchical modelling Slide 20

25 Example 1 Goal: was to describe spatial distribution of fibres on muscle Probit regression at the fibre level (for each fascicle) Predictor is the distance from edge of fascicle Spatial model on error structure Both intercept and slope are modelled at the fascicle level Intercept describes relative abundance of fast/slow twitch fibres Modelled as function of distance from muscle edge Slope tells us about amount fast/slow twitch fibres change within fascicle as a function of distance Modelled as function of distance from muscle edge Common spatial process at fascicle level Bayesian hierarchical modelling Slide 21

26 Example β β Bayesian hierarchical modelling Slide 22

27 Example 1 Distance explained the spatial variability of type at fibre level Distance only partially explained the variability of the parameters in the fascicle level model Considerable spatial clustering after accounting for distance Distance from edge appears to be an important predictor within fascicles Assess the importance at multiple levels within the muscle Future: embed this within a larger hierarchical model to assess demographic changes in muscle composition Bayesian hierarchical modelling Slide 23

28 Example 2 Work in progress showing considerable promise Bayesian hierarchical modelling Slide 24

29 Cautions and limitations Hierarchical modelling has the potential for abuse Replace data with model assumption Several examples Example 3 (N-mixture model) Latent class analysis for diagnostic testing Estimating abundance from occupancy data Including heterogeneity in mark-recapture models Factor analysis... Simplicity of model fitting can lead to pushing the boundaries. Bayesian hierarchical modelling Slide 25

30 A continuum of models M 1 M 5 M 3 M 4 M 2 Data model estimable without hierarchy Data model overspecified Stage 2: partial pooling Examples 1 and 2 Estimable with hierarchy As we move from left to right: Sensitivity to hierarchical model increases Increasing reliance on specification of hierarchical model Bayesian hierarchical modelling Slide 26

31 Model checking Model fitting is done marginally. Suggests we need to assess fit marginally RHS of continuum: hierarchical model essential Important part of model adequacy Should we check model fit conditionally? Trade-off between data and process models How do we assess fit? Bayesian hierarchical modelling Slide 27

32 What do the latent variables represent? When the latent variable is a first moment Assess directly against data We may need to pool If the latent variable is not a first moment Cannot directly assess variables against data Estimation can be sensitive to minor changes in data Process variables need not reflect any physical quantity RHS of continuum & not related to a moment Good marginal fit with latent variable not reflecting reality Bayesian hierarchical modelling Slide 28

33 Example 3 Challenges to fitting model to one site Erratic behaviour of standard estimators Sensitive to model assumptions (cf Poisson) Marginal model is multivariate Poisson Mean = λp = µ Variance = λp = µ Correlation = p Latent abundances do not relate to first moment Good information regarding µ Information about λ (and the site specific N s) Depends on p (estimated from second moment) Ratio Bayesian hierarchical modelling Slide 29

34 Summary and discussion Hierarchical models have considerable appeal Degree of flexibility in model specification Separate data model and process models Hierarchical models can offer improved estimation Partial pooling Borrowing strength Regularization Bayesian approach offers advantages Hierarchical modelling cannot absolve all statistical sins Potential for a poor model to attain a veneer of respectability Need improved understanding of model adequacy Bayesian hierarchical modelling Slide 30

35 Acknowledgements Collaborators on various examples Tilman Davies, Phil Sheard, Jon Cornwall Timothy Bilton et al. Richard Barker, Bill Link, John Sauer Bayesian hierarchical modelling Slide 31

Ordinal Data Modeling

Ordinal Data Modeling Valen E. Johnson James H. Albert Ordinal Data Modeling With 73 illustrations I ". Springer Contents Preface v 1 Review of Classical and Bayesian Inference 1 1.1 Learning about a binomial proportion 1 1.1.1

More information

ST440/550: Applied Bayesian Statistics. (10) Frequentist Properties of Bayesian Methods

ST440/550: Applied Bayesian Statistics. (10) Frequentist Properties of Bayesian Methods (10) Frequentist Properties of Bayesian Methods Calibrated Bayes So far we have discussed Bayesian methods as being separate from the frequentist approach However, in many cases methods with frequentist

More information

Biostatistical modelling in genomics for clinical cancer studies

Biostatistical modelling in genomics for clinical cancer studies This work was supported by Entente Cordiale Cancer Research Bursaries Biostatistical modelling in genomics for clinical cancer studies Philippe Broët JE 2492 Faculté de Médecine Paris-Sud In collaboration

More information

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data Missing data Patrick Breheny April 3 Patrick Breheny BST 71: Bayesian Modeling in Biostatistics 1/39 Our final topic for the semester is missing data Missing data is very common in practice, and can occur

More information

An Introduction to Bayesian Statistics

An Introduction to Bayesian Statistics An Introduction to Bayesian Statistics Robert Weiss Department of Biostatistics UCLA Fielding School of Public Health robweiss@ucla.edu Sept 2015 Robert Weiss (UCLA) An Introduction to Bayesian Statistics

More information

Historical controls in clinical trials: the meta-analytic predictive approach applied to over-dispersed count data

Historical controls in clinical trials: the meta-analytic predictive approach applied to over-dispersed count data Historical controls in clinical trials: the meta-analytic predictive approach applied to over-dispersed count data Sandro Gsteiger, Beat Neuenschwander, and Heinz Schmidli Novartis Pharma AG Bayes Pharma,

More information

A Bayesian Measurement Model of Political Support for Endorsement Experiments, with Application to the Militant Groups in Pakistan

A Bayesian Measurement Model of Political Support for Endorsement Experiments, with Application to the Militant Groups in Pakistan A Bayesian Measurement Model of Political Support for Endorsement Experiments, with Application to the Militant Groups in Pakistan Kosuke Imai Princeton University Joint work with Will Bullock and Jacob

More information

County-Level Small Area Estimation using the National Health Interview Survey (NHIS) and the Behavioral Risk Factor Surveillance System (BRFSS)

County-Level Small Area Estimation using the National Health Interview Survey (NHIS) and the Behavioral Risk Factor Surveillance System (BRFSS) County-Level Small Area Estimation using the National Health Interview Survey (NHIS) and the Behavioral Risk Factor Surveillance System (BRFSS) Van L. Parsons, Nathaniel Schenker Office of Research and

More information

Data Analysis Using Regression and Multilevel/Hierarchical Models

Data Analysis Using Regression and Multilevel/Hierarchical Models Data Analysis Using Regression and Multilevel/Hierarchical Models ANDREW GELMAN Columbia University JENNIFER HILL Columbia University CAMBRIDGE UNIVERSITY PRESS Contents List of examples V a 9 e xv " Preface

More information

On Test Scores (Part 2) How to Properly Use Test Scores in Secondary Analyses. Structural Equation Modeling Lecture #12 April 29, 2015

On Test Scores (Part 2) How to Properly Use Test Scores in Secondary Analyses. Structural Equation Modeling Lecture #12 April 29, 2015 On Test Scores (Part 2) How to Properly Use Test Scores in Secondary Analyses Structural Equation Modeling Lecture #12 April 29, 2015 PRE 906, SEM: On Test Scores #2--The Proper Use of Scores Today s Class:

More information

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1 Welch et al. BMC Medical Research Methodology (2018) 18:89 https://doi.org/10.1186/s12874-018-0548-0 RESEARCH ARTICLE Open Access Does pattern mixture modelling reduce bias due to informative attrition

More information

Ecological Statistics

Ecological Statistics A Primer of Ecological Statistics Second Edition Nicholas J. Gotelli University of Vermont Aaron M. Ellison Harvard Forest Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Brief Contents

More information

GENERALIZED ESTIMATING EQUATIONS FOR LONGITUDINAL DATA. Anti-Epileptic Drug Trial Timeline. Exploratory Data Analysis. Exploratory Data Analysis

GENERALIZED ESTIMATING EQUATIONS FOR LONGITUDINAL DATA. Anti-Epileptic Drug Trial Timeline. Exploratory Data Analysis. Exploratory Data Analysis GENERALIZED ESTIMATING EQUATIONS FOR LONGITUDINAL DATA 1 Example: Clinical Trial of an Anti-Epileptic Drug 59 epileptic patients randomized to progabide or placebo (Leppik et al., 1987) (Described in Fitzmaurice

More information

Small-area estimation of mental illness prevalence for schools

Small-area estimation of mental illness prevalence for schools Small-area estimation of mental illness prevalence for schools Fan Li 1 Alan Zaslavsky 2 1 Department of Statistical Science Duke University 2 Department of Health Care Policy Harvard Medical School March

More information

Bayesian Hierarchical Models for Fitting Dose-Response Relationships

Bayesian Hierarchical Models for Fitting Dose-Response Relationships Bayesian Hierarchical Models for Fitting Dose-Response Relationships Ketra A. Schmitt Battelle Memorial Institute Mitchell J. Small and Kan Shao Carnegie Mellon University Dose Response Estimates using

More information

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Lec 02: Estimation & Hypothesis Testing in Animal Ecology Lec 02: Estimation & Hypothesis Testing in Animal Ecology Parameter Estimation from Samples Samples We typically observe systems incompletely, i.e., we sample according to a designed protocol. We then

More information

T-Statistic-based Up&Down Design for Dose-Finding Competes Favorably with Bayesian 4-parameter Logistic Design

T-Statistic-based Up&Down Design for Dose-Finding Competes Favorably with Bayesian 4-parameter Logistic Design T-Statistic-based Up&Down Design for Dose-Finding Competes Favorably with Bayesian 4-parameter Logistic Design James A. Bolognese, Cytel Nitin Patel, Cytel Yevgen Tymofyeyef, Merck Inna Perevozskaya, Wyeth

More information

Bayesian Joint Modelling of Benefit and Risk in Drug Development

Bayesian Joint Modelling of Benefit and Risk in Drug Development Bayesian Joint Modelling of Benefit and Risk in Drug Development EFSPI/PSDM Safety Statistics Meeting Leiden 2017 Disclosure is an employee and shareholder of GSK Data presented is based on human research

More information

Meta-analysis of two studies in the presence of heterogeneity with applications in rare diseases

Meta-analysis of two studies in the presence of heterogeneity with applications in rare diseases Meta-analysis of two studies in the presence of heterogeneity with applications in rare diseases Christian Röver 1, Tim Friede 1, Simon Wandel 2 and Beat Neuenschwander 2 1 Department of Medical Statistics,

More information

A Case Study: Two-sample categorical data

A Case Study: Two-sample categorical data A Case Study: Two-sample categorical data Patrick Breheny January 31 Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/43 Introduction Model specification Continuous vs. mixture priors Choice

More information

Improving ecological inference using individual-level data

Improving ecological inference using individual-level data Improving ecological inference using individual-level data Christopher Jackson, Nicky Best and Sylvia Richardson Department of Epidemiology and Public Health, Imperial College School of Medicine, London,

More information

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Journal of Social and Development Sciences Vol. 4, No. 4, pp. 93-97, Apr 203 (ISSN 222-52) Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Henry De-Graft Acquah University

More information

Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy

Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy Chapter 21 Multilevel Propensity Score Methods for Estimating Causal Effects: A Latent Class Modeling Strategy Jee-Seon Kim and Peter M. Steiner Abstract Despite their appeal, randomized experiments cannot

More information

Bayesian versus maximum likelihood estimation of treatment effects in bivariate probit instrumental variable models

Bayesian versus maximum likelihood estimation of treatment effects in bivariate probit instrumental variable models Bayesian versus maximum likelihood estimation of treatment effects in bivariate probit instrumental variable models Florian M. Hollenbach Department of Political Science Texas A&M University Jacob M. Montgomery

More information

Inference About Magnitudes of Effects

Inference About Magnitudes of Effects invited commentary International Journal of Sports Physiology and Performance, 2008, 3, 547-557 2008 Human Kinetics, Inc. Inference About Magnitudes of Effects Richard J. Barker and Matthew R. Schofield

More information

Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination

Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination Hierarchical Bayesian Modeling of Individual Differences in Texture Discrimination Timothy N. Rubin (trubin@uci.edu) Michael D. Lee (mdlee@uci.edu) Charles F. Chubb (cchubb@uci.edu) Department of Cognitive

More information

Bayesian and Frequentist Approaches

Bayesian and Frequentist Approaches Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law

More information

Treatment effect estimates adjusted for small-study effects via a limit meta-analysis

Treatment effect estimates adjusted for small-study effects via a limit meta-analysis Treatment effect estimates adjusted for small-study effects via a limit meta-analysis Gerta Rücker 1, James Carpenter 12, Guido Schwarzer 1 1 Institute of Medical Biometry and Medical Informatics, University

More information

Bayesian Models for Combining Data Across Subjects and Studies in Predictive fmri Data Analysis

Bayesian Models for Combining Data Across Subjects and Studies in Predictive fmri Data Analysis Bayesian Models for Combining Data Across Subjects and Studies in Predictive fmri Data Analysis Thesis Proposal Indrayana Rustandi April 3, 2007 Outline Motivation and Thesis Preliminary results: Hierarchical

More information

Statistical Tolerance Regions: Theory, Applications and Computation

Statistical Tolerance Regions: Theory, Applications and Computation Statistical Tolerance Regions: Theory, Applications and Computation K. KRISHNAMOORTHY University of Louisiana at Lafayette THOMAS MATHEW University of Maryland Baltimore County Contents List of Tables

More information

Bayesian Methodology to Estimate and Update SPF Parameters under Limited Data Conditions: A Sensitivity Analysis

Bayesian Methodology to Estimate and Update SPF Parameters under Limited Data Conditions: A Sensitivity Analysis Bayesian Methodology to Estimate and Update SPF Parameters under Limited Data Conditions: A Sensitivity Analysis Shahram Heydari (Corresponding Author) Research Assistant Department of Civil and Environmental

More information

Analysis of left-censored multiplex immunoassay data: A unified approach

Analysis of left-censored multiplex immunoassay data: A unified approach 1 / 41 Analysis of left-censored multiplex immunoassay data: A unified approach Elizabeth G. Hill Medical University of South Carolina Elizabeth H. Slate Florida State University FSU Department of Statistics

More information

Combining Risks from Several Tumors Using Markov Chain Monte Carlo

Combining Risks from Several Tumors Using Markov Chain Monte Carlo University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln U.S. Environmental Protection Agency Papers U.S. Environmental Protection Agency 2009 Combining Risks from Several Tumors

More information

Some alternatives for Inhomogeneous Poisson Point Processes for presence only data

Some alternatives for Inhomogeneous Poisson Point Processes for presence only data Some alternatives for Inhomogeneous Poisson Point Processes for presence only data Hassan Doosti Macquarie University hassan.doosti@mq.edu.au July 6, 2017 Hassan Doosti (MQU) Inhomogeneous Spatial Point

More information

Mediation Analysis With Principal Stratification

Mediation Analysis With Principal Stratification University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 3-30-009 Mediation Analysis With Principal Stratification Robert Gallop Dylan S. Small University of Pennsylvania

More information

A re-randomisation design for clinical trials

A re-randomisation design for clinical trials Kahan et al. BMC Medical Research Methodology (2015) 15:96 DOI 10.1186/s12874-015-0082-2 RESEARCH ARTICLE Open Access A re-randomisation design for clinical trials Brennan C Kahan 1*, Andrew B Forbes 2,

More information

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Sylvia Richardson 1 sylvia.richardson@imperial.co.uk Joint work with: Alexina Mason 1, Lawrence

More information

Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data

Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data Appl. Statist. (2018) 67, Part 1, pp. 145 163 Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data Qiuju Li and Li Su Medical Research Council

More information

Dichotomizing partial compliance and increased participant burden in factorial designs: the performance of four noncompliance methods

Dichotomizing partial compliance and increased participant burden in factorial designs: the performance of four noncompliance methods Merrill and McClure Trials (2015) 16:523 DOI 1186/s13063-015-1044-z TRIALS RESEARCH Open Access Dichotomizing partial compliance and increased participant burden in factorial designs: the performance of

More information

Practical Bayesian Design and Analysis for Drug and Device Clinical Trials

Practical Bayesian Design and Analysis for Drug and Device Clinical Trials Practical Bayesian Design and Analysis for Drug and Device Clinical Trials p. 1/2 Practical Bayesian Design and Analysis for Drug and Device Clinical Trials Brian P. Hobbs Plan B Advisor: Bradley P. Carlin

More information

How many people do you know?: Efficiently estimating personal network size

How many people do you know?: Efficiently estimating personal network size How many people do you know?: Efficiently estimating personal network size Tian Zheng Department of Statistics Columbia University April 22nd, 2009 1 / 34 Acknowledgements Collaborators Tyler McCormick

More information

Sample size calculation for a stepped wedge trial

Sample size calculation for a stepped wedge trial Baio et al. Trials (2015) 16:354 DOI 10.1186/s13063-015-0840-9 TRIALS RESEARCH Sample size calculation for a stepped wedge trial Open Access Gianluca Baio 1*,AndrewCopas 2, Gareth Ambler 1, James Hargreaves

More information

Introduction to Bayesian Analysis 1

Introduction to Bayesian Analysis 1 Biostats VHM 801/802 Courses Fall 2005, Atlantic Veterinary College, PEI Henrik Stryhn Introduction to Bayesian Analysis 1 Little known outside the statistical science, there exist two different approaches

More information

Joint Spatio-Temporal Modeling of Low Incidence Cancers Sharing Common Risk Factors

Joint Spatio-Temporal Modeling of Low Incidence Cancers Sharing Common Risk Factors Journal of Data Science 6(2008), 105-123 Joint Spatio-Temporal Modeling of Low Incidence Cancers Sharing Common Risk Factors Jacob J. Oleson 1,BrianJ.Smith 1 and Hoon Kim 2 1 The University of Iowa and

More information

Methods for meta-analysis of individual participant data from Mendelian randomization studies with binary outcomes

Methods for meta-analysis of individual participant data from Mendelian randomization studies with binary outcomes Methods for meta-analysis of individual participant data from Mendelian randomization studies with binary outcomes Stephen Burgess Simon G. Thompson CRP CHD Genetics Collaboration May 24, 2012 Abstract

More information

ISIR: Independent Sliced Inverse Regression

ISIR: Independent Sliced Inverse Regression ISIR: Independent Sliced Inverse Regression Kevin B. Li Beijing Jiaotong University Abstract In this paper we consider a semiparametric regression model involving a p-dimensional explanatory variable x

More information

Bayesian growth mixture models to distinguish hemoglobin value trajectories in blood donors

Bayesian growth mixture models to distinguish hemoglobin value trajectories in blood donors Bayesian growth mixture models to distinguish hemoglobin value trajectories in blood donors Kazem Nasserinejad 1 Joost van Rosmalen 1 Mireille Baart 2 Katja van den Hurk 2 Dimitris Rizopoulos 1 Emmanuel

More information

Bayesian approaches to handling missing data: Practical Exercises

Bayesian approaches to handling missing data: Practical Exercises Bayesian approaches to handling missing data: Practical Exercises 1 Practical A Thanks to James Carpenter and Jonathan Bartlett who developed the exercise on which this practical is based (funded by ESRC).

More information

Using mixture priors for robust inference: application in Bayesian dose escalation trials

Using mixture priors for robust inference: application in Bayesian dose escalation trials Using mixture priors for robust inference: application in Bayesian dose escalation trials Astrid Jullion, Beat Neuenschwander, Daniel Lorand BAYES2014, London, 11 June 2014 Agenda Dose escalation in oncology

More information

Understandable Statistics

Understandable Statistics Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement

More information

Modeling unobserved heterogeneity in Stata

Modeling unobserved heterogeneity in Stata Modeling unobserved heterogeneity in Stata Rafal Raciborski StataCorp LLC November 27, 2017 Rafal Raciborski (StataCorp) Modeling unobserved heterogeneity November 27, 2017 1 / 59 Plan of the talk Concepts

More information

Abstract. Introduction A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION

Abstract. Introduction A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION Fong Wang, Genentech Inc. Mary Lange, Immunex Corp. Abstract Many longitudinal studies and clinical trials are

More information

Bayesian random-effects meta-analysis made simple

Bayesian random-effects meta-analysis made simple Bayesian random-effects meta-analysis made simple Christian Röver 1, Beat Neuenschwander 2, Simon Wandel 2, Tim Friede 1 1 Department of Medical Statistics, University Medical Center Göttingen, Göttingen,

More information

WinBUGS : part 1. Bruno Boulanger Jonathan Jaeger Astrid Jullion Philippe Lambert. Gabriele, living with rheumatoid arthritis

WinBUGS : part 1. Bruno Boulanger Jonathan Jaeger Astrid Jullion Philippe Lambert. Gabriele, living with rheumatoid arthritis WinBUGS : part 1 Bruno Boulanger Jonathan Jaeger Astrid Jullion Philippe Lambert Gabriele, living with rheumatoid arthritis Agenda 2 Introduction to WinBUGS Exercice 1 : Normal with unknown mean and variance

More information

Genome-Wide Localization of Protein-DNA Binding and Histone Modification by a Bayesian Change-Point Method with ChIP-seq Data

Genome-Wide Localization of Protein-DNA Binding and Histone Modification by a Bayesian Change-Point Method with ChIP-seq Data Genome-Wide Localization of Protein-DNA Binding and Histone Modification by a Bayesian Change-Point Method with ChIP-seq Data Haipeng Xing, Yifan Mo, Will Liao, Michael Q. Zhang Clayton Davis and Geoffrey

More information

Bayesian Methods for Medical Test Accuracy. Broemeling & Associates Inc., 1023 Fox Ridge Road, Medical Lake, WA 99022, USA;

Bayesian Methods for Medical Test Accuracy. Broemeling & Associates Inc., 1023 Fox Ridge Road, Medical Lake, WA 99022, USA; Diagnostics 2011, 1, 1-35; doi:10.3390/diagnostics1010001 OPEN ACCESS diagnostics ISSN 2075-4418 www.mdpi.com/journal/diagnostics/ Review Bayesian Methods for Medical Test Accuracy Lyle D. Broemeling Broemeling

More information

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill)

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill) Advanced Bayesian Models for the Social Sciences Instructors: Week 1&2: Skyler J. Cranmer Department of Political Science University of North Carolina, Chapel Hill skyler@unc.edu Week 3&4: Daniel Stegmueller

More information

Advanced Bayesian Models for the Social Sciences

Advanced Bayesian Models for the Social Sciences Advanced Bayesian Models for the Social Sciences Jeff Harden Department of Political Science, University of Colorado Boulder jeffrey.harden@colorado.edu Daniel Stegmueller Department of Government, University

More information

Fundamental Clinical Trial Design

Fundamental Clinical Trial Design Design, Monitoring, and Analysis of Clinical Trials Session 1 Overview and Introduction Overview Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics, University of Washington February 17-19, 2003

More information

Improving ecological inference using individual-level data

Improving ecological inference using individual-level data STATISTICS IN MEDICINE Statist. Med. 2006; 25:2136 2159 Published online 11 October 2005 in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/sim.2370 Improving ecological inference using individual-level

More information

The Effects of Autocorrelated Noise and Biased HRF in fmri Analysis Error Rates

The Effects of Autocorrelated Noise and Biased HRF in fmri Analysis Error Rates The Effects of Autocorrelated Noise and Biased HRF in fmri Analysis Error Rates Ariana Anderson University of California, Los Angeles Departments of Psychiatry and Behavioral Sciences David Geffen School

More information

Meta-analysis of few small studies in small populations and rare diseases

Meta-analysis of few small studies in small populations and rare diseases Meta-analysis of few small studies in small populations and rare diseases Christian Röver 1, Beat Neuenschwander 2, Simon Wandel 2, Tim Friede 1 1 Department of Medical Statistics, University Medical Center

More information

STATISTICAL METHODS FOR THE EVALUATION OF A CANCER SCREENING PROGRAM

STATISTICAL METHODS FOR THE EVALUATION OF A CANCER SCREENING PROGRAM STATISTICAL METHODS FOR THE EVALUATION OF A CANCER SCREENING PROGRAM STATISTICAL METHODS FOR THE EVALUATION OF A CANCER SCREENING PROGRAM BY HUAN JIANG, M.Sc. a thesis submitted to the department of Clinical

More information

Analysis of Hearing Loss Data using Correlated Data Analysis Techniques

Analysis of Hearing Loss Data using Correlated Data Analysis Techniques Analysis of Hearing Loss Data using Correlated Data Analysis Techniques Ruth Penman and Gillian Heller, Department of Statistics, Macquarie University, Sydney, Australia. Correspondence: Ruth Penman, Department

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

Meta-analysis using individual participant data: one-stage and two-stage approaches, and why they may differ

Meta-analysis using individual participant data: one-stage and two-stage approaches, and why they may differ Tutorial in Biostatistics Received: 11 March 2016, Accepted: 13 September 2016 Published online 16 October 2016 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.7141 Meta-analysis using

More information

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc. Chapter 23 Inference About Means Copyright 2010 Pearson Education, Inc. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it d be nice to be able

More information

An application of a pattern-mixture model with multiple imputation for the analysis of longitudinal trials with protocol deviations

An application of a pattern-mixture model with multiple imputation for the analysis of longitudinal trials with protocol deviations Iddrisu and Gumedze BMC Medical Research Methodology (2019) 19:10 https://doi.org/10.1186/s12874-018-0639-y RESEARCH ARTICLE Open Access An application of a pattern-mixture model with multiple imputation

More information

16:35 17:20 Alexander Luedtke (Fred Hutchinson Cancer Research Center)

16:35 17:20 Alexander Luedtke (Fred Hutchinson Cancer Research Center) Conference on Causal Inference in Longitudinal Studies September 21-23, 2017 Columbia University Thursday, September 21, 2017: tutorial 14:15 15:00 Miguel Hernan (Harvard University) 15:00 15:45 Miguel

More information

The matching effect of intra-class correlation (ICC) on the estimation of contextual effect: A Bayesian approach of multilevel modeling

The matching effect of intra-class correlation (ICC) on the estimation of contextual effect: A Bayesian approach of multilevel modeling MODERN MODELING METHODS 2016, 2016/05/23-26 University of Connecticut, Storrs CT, USA The matching effect of intra-class correlation (ICC) on the estimation of contextual effect: A Bayesian approach of

More information

Bayesian meta-analysis of Papanicolaou smear accuracy

Bayesian meta-analysis of Papanicolaou smear accuracy Gynecologic Oncology 107 (2007) S133 S137 www.elsevier.com/locate/ygyno Bayesian meta-analysis of Papanicolaou smear accuracy Xiuyu Cong a, Dennis D. Cox b, Scott B. Cantor c, a Biometrics and Data Management,

More information

Case Studies in Bayesian Augmented Control Design. Nathan Enas Ji Lin Eli Lilly and Company

Case Studies in Bayesian Augmented Control Design. Nathan Enas Ji Lin Eli Lilly and Company Case Studies in Bayesian Augmented Control Design Nathan Enas Ji Lin Eli Lilly and Company Outline Drivers for innovation in Phase II designs Case Study #1 Pancreatic cancer Study design Analysis Learning

More information

Generation times in epidemic models

Generation times in epidemic models Generation times in epidemic models Gianpaolo Scalia Tomba Dept Mathematics, Univ of Rome "Tor Vergata", Italy in collaboration with Åke Svensson, Dept Mathematics, Stockholm University, Sweden Tommi Asikainen

More information

Bayesian Mediation Analysis

Bayesian Mediation Analysis Psychological Methods 2009, Vol. 14, No. 4, 301 322 2009 American Psychological Association 1082-989X/09/$12.00 DOI: 10.1037/a0016972 Bayesian Mediation Analysis Ying Yuan The University of Texas M. D.

More information

Risk-prediction modelling in cancer with multiple genomic data sets: a Bayesian variable selection approach

Risk-prediction modelling in cancer with multiple genomic data sets: a Bayesian variable selection approach Risk-prediction modelling in cancer with multiple genomic data sets: a Bayesian variable selection approach Manuela Zucknick Division of Biostatistics, German Cancer Research Center Biometry Workshop,

More information

The Late Pretest Problem in Randomized Control Trials of Education Interventions

The Late Pretest Problem in Randomized Control Trials of Education Interventions The Late Pretest Problem in Randomized Control Trials of Education Interventions Peter Z. Schochet ACF Methods Conference, September 2012 In Journal of Educational and Behavioral Statistics, August 2010,

More information

Introduction. We can make a prediction about Y i based on X i by setting a threshold value T, and predicting Y i = 1 when X i > T.

Introduction. We can make a prediction about Y i based on X i by setting a threshold value T, and predicting Y i = 1 when X i > T. Diagnostic Tests 1 Introduction Suppose we have a quantitative measurement X i on experimental or observed units i = 1,..., n, and a characteristic Y i = 0 or Y i = 1 (e.g. case/control status). The measurement

More information

Data Analysis in Practice-Based Research. Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine

Data Analysis in Practice-Based Research. Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine Data Analysis in Practice-Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine Multilevel Data Statistical analyses that fail to recognize

More information

Statistical Models for Censored Point Processes with Cure Rates

Statistical Models for Censored Point Processes with Cure Rates Statistical Models for Censored Point Processes with Cure Rates Jennifer Rogers MSD Seminar 2 November 2011 Outline Background and MESS Epilepsy MESS Exploratory Analysis Summary Statistics and Kaplan-Meier

More information

Dynamic borrowing of historical data: Performance and comparison of existing methods based on a case study

Dynamic borrowing of historical data: Performance and comparison of existing methods based on a case study Introduction Methods Simulations Discussion Dynamic borrowing of historical data: Performance and comparison of existing methods based on a case study D. Dejardin 1, P. Delmar 1, K. Patel 1, C. Warne 1,

More information

Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models

Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models Bayesian Analysis of Between-Group Differences in Variance Components in Hierarchical Generalized Linear Models Brady T. West Michigan Program in Survey Methodology, Institute for Social Research, 46 Thompson

More information

Use of GEEs in STATA

Use of GEEs in STATA Use of GEEs in STATA 1. When generalised estimating equations are used and example 2. Stata commands and options for GEEs 3. Results from Stata (and SAS!) 4. Another use of GEEs Use of GEEs GEEs are one

More information

Bayesian Joint Modelling of Longitudinal and Survival Data of HIV/AIDS Patients: A Case Study at Bale Robe General Hospital, Ethiopia

Bayesian Joint Modelling of Longitudinal and Survival Data of HIV/AIDS Patients: A Case Study at Bale Robe General Hospital, Ethiopia American Journal of Theoretical and Applied Statistics 2017; 6(4): 182-190 http://www.sciencepublishinggroup.com/j/ajtas doi: 10.11648/j.ajtas.20170604.13 ISSN: 2326-8999 (Print); ISSN: 2326-9006 (Online)

More information

Bayesian Nonparametric Methods for Precision Medicine

Bayesian Nonparametric Methods for Precision Medicine Bayesian Nonparametric Methods for Precision Medicine Brian Reich, NC State Collaborators: Qian Guan (NCSU), Eric Laber (NCSU) and Dipankar Bandyopadhyay (VCU) University of Illinois at Urbana-Champaign

More information

Type and quantity of data needed for an early estimate of transmissibility when an infectious disease emerges

Type and quantity of data needed for an early estimate of transmissibility when an infectious disease emerges Research articles Type and quantity of data needed for an early estimate of transmissibility when an infectious disease emerges N G Becker (Niels.Becker@anu.edu.au) 1, D Wang 1, M Clements 1 1. National

More information

Selection and estimation in exploratory subgroup analyses a proposal

Selection and estimation in exploratory subgroup analyses a proposal Selection and estimation in exploratory subgroup analyses a proposal Gerd Rosenkranz, Novartis Pharma AG, Basel, Switzerland EMA Workshop, London, 07-Nov-2014 Purpose of this presentation Proposal for

More information

On Regression Analysis Using Bivariate Extreme Ranked Set Sampling

On Regression Analysis Using Bivariate Extreme Ranked Set Sampling On Regression Analysis Using Bivariate Extreme Ranked Set Sampling Atsu S. S. Dorvlo atsu@squ.edu.om Walid Abu-Dayyeh walidan@squ.edu.om Obaid Alsaidy obaidalsaidy@gmail.com Abstract- Many forms of ranked

More information

Bayesian Latent Subgroup Design for Basket Trials

Bayesian Latent Subgroup Design for Basket Trials Bayesian Latent Subgroup Design for Basket Trials Yiyi Chu Department of Biostatistics The University of Texas School of Public Health July 30, 2017 Outline Introduction Bayesian latent subgroup (BLAST)

More information

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method Biost 590: Statistical Consulting Statistical Classification of Scientific Studies; Approach to Consulting Lecture Outline Statistical Classification of Scientific Studies Statistical Tasks Approach to

More information

Measurement Error in Nonlinear Models

Measurement Error in Nonlinear Models Measurement Error in Nonlinear Models R.J. CARROLL Professor of Statistics Texas A&M University, USA D. RUPPERT Professor of Operations Research and Industrial Engineering Cornell University, USA and L.A.

More information

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling STATISTICS IN MEDICINE Statist. Med. 2009; 28:3363 3385 Published online 3 September 2009 in Wiley InterScience (www.interscience.wiley.com).3721 Estimating drug effects in the presence of placebo response:

More information

Design for Targeted Therapies: Statistical Considerations

Design for Targeted Therapies: Statistical Considerations Design for Targeted Therapies: Statistical Considerations J. Jack Lee, Ph.D. Department of Biostatistics University of Texas M. D. Anderson Cancer Center Outline Premise General Review of Statistical Designs

More information

Small Sample Bayesian Factor Analysis. PhUSE 2014 Paper SP03 Dirk Heerwegh

Small Sample Bayesian Factor Analysis. PhUSE 2014 Paper SP03 Dirk Heerwegh Small Sample Bayesian Factor Analysis PhUSE 2014 Paper SP03 Dirk Heerwegh Overview Factor analysis Maximum likelihood Bayes Simulation Studies Design Results Conclusions Factor Analysis (FA) Explain correlation

More information

In this module I provide a few illustrations of options within lavaan for handling various situations.

In this module I provide a few illustrations of options within lavaan for handling various situations. In this module I provide a few illustrations of options within lavaan for handling various situations. An appropriate citation for this material is Yves Rosseel (2012). lavaan: An R Package for Structural

More information

Multivariate Multilevel Models

Multivariate Multilevel Models Multivariate Multilevel Models Getachew A. Dagne George W. Howe C. Hendricks Brown Funded by NIMH/NIDA 11/20/2014 (ISSG Seminar) 1 Outline What is Behavioral Social Interaction? Importance of studying

More information

Analysis of acgh data: statistical models and computational challenges

Analysis of acgh data: statistical models and computational challenges : statistical models and computational challenges Ramón Díaz-Uriarte 2007-02-13 Díaz-Uriarte, R. acgh analysis: models and computation 2007-02-13 1 / 38 Outline 1 Introduction Alternative approaches What

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Bayesian and Classical Approaches to Inference and Model Averaging

Bayesian and Classical Approaches to Inference and Model Averaging Bayesian and Classical Approaches to Inference and Model Averaging Course Tutors Gernot Doppelhofer NHH Melvyn Weeks University of Cambridge Location Norges Bank Oslo Date 5-8 May 2008 The Course The course

More information

Estimating Heterogeneous Choice Models with Stata

Estimating Heterogeneous Choice Models with Stata Estimating Heterogeneous Choice Models with Stata Richard Williams Notre Dame Sociology rwilliam@nd.edu West Coast Stata Users Group Meetings October 25, 2007 Overview When a binary or ordinal regression

More information