Dylan Small Department of Statistics, Wharton School, University of Pennsylvania. Based on joint work with Paul Rosenbaum

Size: px
Start display at page:

Download "Dylan Small Department of Statistics, Wharton School, University of Pennsylvania. Based on joint work with Paul Rosenbaum"

Transcription

1 Instrumental variables and their sensitivity to unobserved biases Dylan Small Department of Statistics, Wharton School, University of Pennsylvania Based on joint work with Paul Rosenbaum

2 Overview Instrumental Variable (IV) Method: Method of controlling for unmeasured confounding Example: Effect of World War II military service on future earnings. Sensitivity to unobserved biases for IV method. How strength of IVs affects sensitivity to unobserved biases: Implications for designing studies with IVs.

3 Example: WWII Veteran Status and Earnings Does military service raise or lower earnings? Angrist and Krueger (1994) studied this in context of WWII military service and 1980 earnings (using 5% public use sample of US Census). Lower earnings? Military service in WWII interrupts education or career. Higher earnings? Labor market might favor veterans, GI Bill increases education.

4 This is association not causation: WWII Vets might not be comparable to Non-Vets in terms of health, criminal behavior WWII Vets (76% of men) earned on average $4500 more in 1980 than Non-Vets.

5 Addressing Confounding Confounding Variable: Variable that is (i) not comparable between treatment and control groups. (ii) affects outcome. e.g., health, criminal behavior. If all confounders measured, they can be adjusted for by regression, propensity scores, matching methods But health, criminal behavior not measured in the Census

6 Unmeasured Confounding Graph is conditional on measured confounders (race, education up to 8 years, location of birth) Earnings Veteran Status Unobserved Confounders (Health, criminal behavior, etc.

7 Instrumental Variables Strategy Y=Outcome W=Treatment Z=IV Graph is conditional on measured confounders (race, education up to 8 years, location of birth) W: Veteran Status Y:Earnings Z: Year of Birth Unobserved Confounders (Health etc.) Extract variation in W from Z that is free of unobserved confounders and use this variation to estimate the causal effect of W on Y. Key IV Assumptions: (1) Z independent of unobserved variables; (2) Z does not have direct effect on outcome.

8

9 IV Applications in Health Research Outcome (Y ) Treatment (W ) IV ( Z ) Reference Mortality More intensive vs. less intensive treatment for heart attack patients Distance lived from cardiac care center Mortality Mortality Conventional vs. atypical antipsychotics Premature baby delivered at high level NICU vs. local hospital Prescribing physician s preference Mother s differential distance between high level NICU and local hospital McLellan, McNeil and Newhouse (1994) Wang, Schneeweiss et al. (2005) Lorch, Baiocchi, Ahlberg and Small (2012) Birth weight Maternal smoking State cigarette taxes Evans and Ringel (1999) Birth weight Maternal smoking Random assignment of free smoker s counseling Permutt and Hebel (1989) Heart attack HDL cholesterol Genes that affect HDL Voight et al. (2012)

10 Prototype IV Design: Matched Pair Encouragement Design Consider a matched pair design in which there are I matched pairs (say matched for measured confounders) and one unit j in each pair i is encouraged to receive treatment ( Z ij = 1) and the other unit j is not encouraged to receive treatment ( Z = 0). ij ' In this context, the encouragement variable Z is said to be a valid instrumental variable (IV) if Z is effectively randomly assigned: 1 1 PZ ( i1 = 1, Zi2 = 0) =, PZ ( i1 = 0, Zi2 = 1) = 2 2 (i.e., Z is not related to any unmeasured confounders). Inference can be based on two stage least squares or permutation inference. 95% CI for effect of military service on earnings using 1926 vs as IV: (-$1445, -$500)

11 A picture of the IV argument -- We created matched triples: men matched on quarter of birth, race, age, education up to 8 years and location of birth. -- This figure provides reason to doubt military service increases earnings by $ From 1924 to 1926, the proportion of veterans stayed about constant and the earnings stayed about the same. From 1926 to 1928, the proportion of veterans decreased by 50% but earnings increased, suggesting military service decreases earnings.

12 Sensitivity Analysis IV method assumes that the IV (encouragement) is effectively randomly assigned: 1 1 PZ ( i1 = 1, Zi2 = 0) =, PZ ( i1 = 0, Zi2 = 1) = 2 2 There is often concern about whether this is true. In WWII Study, there are gradual long term trends in apprenticeship, education, employment and nutrition that might bias comparisons of workers born two years apart. A sensitivity analysis asks how departures from random assignment of the IV of various magnitudes might alter a study s conclusion.

13 Model for Sensitivity Analysis For subject ij, let π ij denote the probability that ij is encouraged, π = PZ ( = 1). ij ij Suppose that two subjects ij and ik may differ in their odds of being encouraged by at most a factor of Γ 1 because they differ in terms of an unobserved covariate, uij uik, 1 πij (1 πik ) Γ i, jk,. Γ π (1 π ) ik ij If Γ= 1, IV is randomly assigned. If Γ> 1, then distribution of treatment assignments is unknown but magnitude of departure from random assignment controlled by Γ.

14 Sensitivity Analysis for WWII Study Upper Bound on One-Sided Significance Level for 1926 vs IV Γ H : β 0 H : β 4, β = causal effect of military service on earnings

15 Strength of IV We d like our study to be as insensitive to bias as possible, i.e., finding is significant for as large a Γ as possible. How does strength of IV affect sensitivity to bias. An IV is strong if encouragement has a strong effect on treatment received; An IV is weak if encouragement has only a weak effect on treatment received. Study Strong IV Weak IV World War II Study 1926 vs vs Maternal Smoking Study Random assignment of State cigarette taxes free counseling Effects of Weak IVs 1. Increased Variance 2. Increased Sensitivity to Bias

16

17 Effect of Weak IVs I: Increased Variance Y W X Z X Unobserved Variables If Z is a weak IV, then the variance of the IV estimate will be higher because less variation in W from Z can be extracted. 95% CI for effect of military service using 1926 vs IV: (-$1,445, -$500). 95% CI for effect of military service using 1924 vs IV: (-$10,130, $10,750)

18 Effect of Weak IVs II: Increased Sensitivity to Bias Power of a Sensitivity Analysis (Rosenbaum, 2004) 1 Suppose Z were in fact a valid IV so that PZ ( i1 = 1, Zi2 = 0) = PZ ( i1 = 0, Zi2 = 1) =, 2 but we didn t know this and wanted to allow for some sensitivity to bias measured by Γ Suppose also that β β0 (true causal effect minus null hypothesis causal effect) was large, so that H0 : β = β0 was substantially in error. We would like to be able to reject H0 : β = β0 when the bias could be up to some Γ (e.g., Γ =1.5). Power of a sensitivity analysis at Γ : Probability that we will reject H0 : β = β0 for Γ assuming that Z is a valid IV and a given value of β β0.

19 Effect size: ( β β0)/ σ = 1 Number of pairs I Strength of IV: Γ , ,000 lim P(Treat IV=1)- I P(Treat IV=0) When the IV is valid ( Γ= 1), the power is of course greater for stronger IVs but there is good power for all cases with sample size of 10,000 pairs. Valid but weak IVs eventually get it right. But when 1 Γ>, the power can tend to 1 or 0 depending on the strength of the IV. Weak IVs are quite sensitive to small biases.

20 Practical Consequences 1. Weak IVs that might have small bias are dangerous to use. Weak IVs are sensitive to quite small biases ( Γ> 1 yet Γ close to 1), even when the effect size ( β β0)/ σ is quite large. Unless one is confident that a weak IV is perfectly valid ( Γ= 1), its extreme sensitivity to small biases is likely to limit its usefulness to the study of enormous effects, ( β β0)/ σ >> Strong IVs that might be moderately biased are useful. A strong IV may provide useful information even if moderate biases are plausible. Consider two studies, a small study with a strong IV and a large study with a weak IV, which would have the same power if both IVs are unbiased. When there is concern that the IVs might be biased, the small study with a strong IV has considerable advantages.

21 Potential IVs in Health Outcomes Research Potential IV Differential Distance to Nearest Provider of Treatment A vs. Treatment B Geographic or Hospital Preference for Treatment A vs. B Physician Preference for Treatment A vs. B Calendar Time (one treatment may become more common over time) Genetic Variants Timing of Admission to Hospital Insurance Plan Coverage for Treatment A vs. B Randomized Encouragement at Point of Care for Treatment A vs. B When No Clear Cut Choice Strength Weak or Strong Weak or Strong Weak or Strong Weak or Strong Usually Weak Weak or Strong Weak or Strong Potentially Strong Reference for this talk: Small, D. and Rosenbaum, P. (2008). War and wages: the strength of instrumental variables and their sensitivity to unobserved biases. Journal of the American Statisical Association,

in Medicine Tutorial in Biostatistics: Instrumental Variable Methods for Causal Inference

in Medicine Tutorial in Biostatistics: Instrumental Variable Methods for Causal Inference Received XXXX Statistics (www.interscience.wiley.com) DOI: 10.1002/sim.0000 Research Article in Medicine Tutorial in Biostatistics: Instrumental Variable Methods for Causal Inference Michael Baiocchi a,jing

More information

Can you guarantee that the results from your observational study are unaffected by unmeasured confounding? H.Hosseini

Can you guarantee that the results from your observational study are unaffected by unmeasured confounding? H.Hosseini Instrumental Variable (Instrument) applications in Epidemiology: An Introduction Hamed Hosseini Can you guarantee that the results from your observational study are unaffected by unmeasured confounding?

More information

Introduction to Observational Studies. Jane Pinelis

Introduction to Observational Studies. Jane Pinelis Introduction to Observational Studies Jane Pinelis 22 March 2018 Outline Motivating example Observational studies vs. randomized experiments Observational studies: basics Some adjustment strategies Matching

More information

Simple Sensitivity Analyses for Matched Samples Thomas E. Love, Ph.D. ASA Course Atlanta Georgia https://goo.

Simple Sensitivity Analyses for Matched Samples Thomas E. Love, Ph.D. ASA Course Atlanta Georgia https://goo. Goal of a Formal Sensitivity Analysis To replace a general qualitative statement that applies in all observational studies the association we observe between treatment and outcome does not imply causation

More information

Welcome to this third module in a three-part series focused on epidemiologic measures of association and impact.

Welcome to this third module in a three-part series focused on epidemiologic measures of association and impact. Welcome to this third module in a three-part series focused on epidemiologic measures of association and impact. 1 This three-part series focuses on the estimation of the association between exposures

More information

J. Y. Hsu S. A. Lorch Center for Outcomes Research, The Children s Hospital of Philadelphia, Philadelphia, PA, USA

J. Y. Hsu S. A. Lorch Center for Outcomes Research, The Children s Hospital of Philadelphia, Philadelphia, PA, USA Health Serv Outcomes Res Method (2012) 12:119 140 DOI 10.1007/s10742-012-0095-9 Perils and prospects of using aggregate area level socioeconomic information as a proxy for individual level socioeconomic

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH PROPENSITY SCORE Confounding Definition: A situation in which the effect or association between an exposure (a predictor or risk factor) and

More information

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Sylvia Richardson 1 sylvia.richardson@imperial.co.uk Joint work with: Alexina Mason 1, Lawrence

More information

Instrumental Variables I (cont.)

Instrumental Variables I (cont.) Review Instrumental Variables Observational Studies Cross Sectional Regressions Omitted Variables, Reverse causation Randomized Control Trials Difference in Difference Time invariant omitted variables

More information

Sensitivity Analysis in Observational Research: Introducing the E-value

Sensitivity Analysis in Observational Research: Introducing the E-value Sensitivity Analysis in Observational Research: Introducing the E-value Tyler J. VanderWeele Harvard T.H. Chan School of Public Health Departments of Epidemiology and Biostatistics 1 Plan of Presentation

More information

EMPIRICAL STRATEGIES IN LABOUR ECONOMICS

EMPIRICAL STRATEGIES IN LABOUR ECONOMICS EMPIRICAL STRATEGIES IN LABOUR ECONOMICS University of Minho J. Angrist NIPE Summer School June 2009 This course covers core econometric ideas and widely used empirical modeling strategies. The main theoretical

More information

Lecture II: Difference in Difference. Causality is difficult to Show from cross

Lecture II: Difference in Difference. Causality is difficult to Show from cross Review Lecture II: Regression Discontinuity and Difference in Difference From Lecture I Causality is difficult to Show from cross sectional observational studies What caused what? X caused Y, Y caused

More information

Welcome to this series focused on sources of bias in epidemiologic studies. In this first module, I will provide a general overview of bias.

Welcome to this series focused on sources of bias in epidemiologic studies. In this first module, I will provide a general overview of bias. Welcome to this series focused on sources of bias in epidemiologic studies. In this first module, I will provide a general overview of bias. In the second module, we will focus on selection bias and in

More information

Lecture II: Difference in Difference and Regression Discontinuity

Lecture II: Difference in Difference and Regression Discontinuity Review Lecture II: Difference in Difference and Regression Discontinuity it From Lecture I Causality is difficult to Show from cross sectional observational studies What caused what? X caused Y, Y caused

More information

Establishing Causality Convincingly: Some Neat Tricks

Establishing Causality Convincingly: Some Neat Tricks Establishing Causality Convincingly: Some Neat Tricks Establishing Causality In the last set of notes, I discussed how causality can be difficult to establish in a straightforward OLS context If assumptions

More information

Causal Association : Cause To Effect. Dr. Akhilesh Bhargava MD, DHA, PGDHRM Prof. Community Medicine & Director-SIHFW, Jaipur

Causal Association : Cause To Effect. Dr. Akhilesh Bhargava MD, DHA, PGDHRM Prof. Community Medicine & Director-SIHFW, Jaipur Causal Association : Cause To Effect Dr. MD, DHA, PGDHRM Prof. Community Medicine & Director-SIHFW, Jaipur Measure of Association- Concepts If more disease occurs in a group that smokes compared to the

More information

How should the propensity score be estimated when some confounders are partially observed?

How should the propensity score be estimated when some confounders are partially observed? How should the propensity score be estimated when some confounders are partially observed? Clémence Leyrat 1, James Carpenter 1,2, Elizabeth Williamson 1,3, Helen Blake 1 1 Department of Medical statistics,

More information

Instrumental Variables Estimation: An Introduction

Instrumental Variables Estimation: An Introduction Instrumental Variables Estimation: An Introduction Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA The Problem The Problem Suppose you wish to

More information

Methods for Addressing Selection Bias in Observational Studies

Methods for Addressing Selection Bias in Observational Studies Methods for Addressing Selection Bias in Observational Studies Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA What is Selection Bias? In the regression

More information

Confounding by indication developments in matching, and instrumental variable methods. Richard Grieve London School of Hygiene and Tropical Medicine

Confounding by indication developments in matching, and instrumental variable methods. Richard Grieve London School of Hygiene and Tropical Medicine Confounding by indication developments in matching, and instrumental variable methods Richard Grieve London School of Hygiene and Tropical Medicine 1 Outline 1. Causal inference and confounding 2. Genetic

More information

Challenges of Observational and Retrospective Studies

Challenges of Observational and Retrospective Studies Challenges of Observational and Retrospective Studies Kyoungmi Kim, Ph.D. March 8, 2017 This seminar is jointly supported by the following NIH-funded centers: Background There are several methods in which

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

Propensity Score Methods to Adjust for Bias in Observational Data SAS HEALTH USERS GROUP APRIL 6, 2018

Propensity Score Methods to Adjust for Bias in Observational Data SAS HEALTH USERS GROUP APRIL 6, 2018 Propensity Score Methods to Adjust for Bias in Observational Data SAS HEALTH USERS GROUP APRIL 6, 2018 Institute Institute for Clinical for Clinical Evaluative Evaluative Sciences Sciences Overview 1.

More information

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research 2012 CCPRC Meeting Methodology Presession Workshop October 23, 2012, 2:00-5:00 p.m. Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy

More information

Measuring Impact. Program and Policy Evaluation with Observational Data. Daniel L. Millimet. Southern Methodist University.

Measuring Impact. Program and Policy Evaluation with Observational Data. Daniel L. Millimet. Southern Methodist University. Measuring mpact Program and Policy Evaluation with Observational Data Daniel L. Millimet Southern Methodist University 23 May 2013 DL Millimet (SMU) Observational Data May 2013 1 / 23 ntroduction Measuring

More information

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data Missing data Patrick Breheny April 3 Patrick Breheny BST 71: Bayesian Modeling in Biostatistics 1/39 Our final topic for the semester is missing data Missing data is very common in practice, and can occur

More information

Reflection Questions for Math 58B

Reflection Questions for Math 58B Reflection Questions for Math 58B Johanna Hardin Spring 2017 Chapter 1, Section 1 binomial probabilities 1. What is a p-value? 2. What is the difference between a one- and two-sided hypothesis? 3. What

More information

Econometric Game 2012: infants birthweight?

Econometric Game 2012: infants birthweight? Econometric Game 2012: How does maternal smoking during pregnancy affect infants birthweight? Case A April 18, 2012 1 Introduction Low birthweight is associated with adverse health related and economic

More information

PubH 7405: REGRESSION ANALYSIS. Propensity Score

PubH 7405: REGRESSION ANALYSIS. Propensity Score PubH 7405: REGRESSION ANALYSIS Propensity Score INTRODUCTION: There is a growing interest in using observational (or nonrandomized) studies to estimate the effects of treatments on outcomes. In observational

More information

Class 1: Introduction, Causality, Self-selection Bias, Regression

Class 1: Introduction, Causality, Self-selection Bias, Regression Class 1: Introduction, Causality, Self-selection Bias, Regression Ricardo A Pasquini April 2011 Ricardo A Pasquini () April 2011 1 / 23 Introduction I Angrist s what should be the FAQs of a researcher:

More information

Mendelian Randomization

Mendelian Randomization Mendelian Randomization Drawback with observational studies Risk factor X Y Outcome Risk factor X? Y Outcome C (Unobserved) Confounders The power of genetics Intermediate phenotype (risk factor) Genetic

More information

Estimating Direct Effects of New HIV Prevention Methods. Focus: the MIRA Trial

Estimating Direct Effects of New HIV Prevention Methods. Focus: the MIRA Trial Estimating Direct Effects of New HIV Prevention Methods. Focus: the MIRA Trial Michael Rosenblum UCSF: Helen Cheng, Nancy Padian, Steve Shiboski, Ariane van der Straten Berkeley: Nicholas P. Jewell, Mark

More information

Brief introduction to instrumental variables. IV Workshop, Bristol, Miguel A. Hernán Department of Epidemiology Harvard School of Public Health

Brief introduction to instrumental variables. IV Workshop, Bristol, Miguel A. Hernán Department of Epidemiology Harvard School of Public Health Brief introduction to instrumental variables IV Workshop, Bristol, 2008 Miguel A. Hernán Department of Epidemiology Harvard School of Public Health Goal: To consistently estimate the average causal effect

More information

A NON-TECHNICAL INTRODUCTION TO REGRESSIONS. David Romer. University of California, Berkeley. January Copyright 2018 by David Romer

A NON-TECHNICAL INTRODUCTION TO REGRESSIONS. David Romer. University of California, Berkeley. January Copyright 2018 by David Romer A NON-TECHNICAL INTRODUCTION TO REGRESSIONS David Romer University of California, Berkeley January 2018 Copyright 2018 by David Romer CONTENTS Preface ii I Introduction 1 II Ordinary Least Squares Regression

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture 02 Experimental Design Strategy Welcome back to the course on Biostatistics

More information

I. Identifying the question Define Research Hypothesis and Questions

I. Identifying the question Define Research Hypothesis and Questions Term Paper I. Identifying the question What is the question? (What are my hypotheses?) Is it possible to answer the question with statistics? Is the data obtainable? (birth weight, socio economic, drugs,

More information

The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth

The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth 1 The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth Madeleine Benjamin, MA Policy Research, Economics and

More information

Simulation study of instrumental variable approaches with an application to a study of the antidiabetic effect of bezafibrate

Simulation study of instrumental variable approaches with an application to a study of the antidiabetic effect of bezafibrate pharmacoepidemiology and drug safety 2012; 21(S2): 114 120 Published online in Wiley Online Library (wileyonlinelibrary.com).3252 ORIGINAL REPORT Simulation study of instrumental variable approaches with

More information

Estimating treatment effects with observational data: A new approach using hospital-level variation in treatment intensity

Estimating treatment effects with observational data: A new approach using hospital-level variation in treatment intensity Preliminary and incomplete Do not quote Estimating treatment effects with observational data: A new approach using hospital-level variation in treatment intensity Mark McClellan Stanford University and

More information

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods Dean Eckles Department of Communication Stanford University dean@deaneckles.com Abstract

More information

Advanced IPD meta-analysis methods for observational studies

Advanced IPD meta-analysis methods for observational studies Advanced IPD meta-analysis methods for observational studies Simon Thompson University of Cambridge, UK Part 4 IBC Victoria, July 2016 1 Outline of talk Usual measures of association (e.g. hazard ratios)

More information

NORTH SOUTH UNIVERSITY TUTORIAL 2

NORTH SOUTH UNIVERSITY TUTORIAL 2 NORTH SOUTH UNIVERSITY TUTORIAL 2 AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 Correlation Analysis INTRODUCTION In correlation analysis, we estimate

More information

The Late Pretest Problem in Randomized Control Trials of Education Interventions

The Late Pretest Problem in Randomized Control Trials of Education Interventions The Late Pretest Problem in Randomized Control Trials of Education Interventions Peter Z. Schochet ACF Methods Conference, September 2012 In Journal of Educational and Behavioral Statistics, August 2010,

More information

Regression Discontinuity Designs: An Approach to Causal Inference Using Observational Data

Regression Discontinuity Designs: An Approach to Causal Inference Using Observational Data Regression Discontinuity Designs: An Approach to Causal Inference Using Observational Data Aidan O Keeffe Department of Statistical Science University College London 18th September 2014 Aidan O Keeffe

More information

University of Michigan School of Public Health

University of Michigan School of Public Health University of Michigan School of Public Health The University of Michigan Department of Biostatistics Working Paper Series Year 2016 Paper 119 Strengthening Instrumental Variables through Weighting Douglas

More information

A Bayesian Perspective on Unmeasured Confounding in Large Administrative Databases

A Bayesian Perspective on Unmeasured Confounding in Large Administrative Databases A Bayesian Perspective on Unmeasured Confounding in Large Administrative Databases Lawrence McCandless lmccandl@sfu.ca Faculty of Health Sciences, Simon Fraser University, Vancouver Canada Summer 2014

More information

Bayesian methods for combining multiple Individual and Aggregate data Sources in observational studies

Bayesian methods for combining multiple Individual and Aggregate data Sources in observational studies Bayesian methods for combining multiple Individual and Aggregate data Sources in observational studies Sara Geneletti Department of Epidemiology and Public Health Imperial College, London s.geneletti@imperial.ac.uk

More information

The Dynamic Effects of Obesity on the Wages of Young Workers

The Dynamic Effects of Obesity on the Wages of Young Workers The Dynamic Effects of Obesity on the Wages of Young Workers Joshua C. Pinkston University of Louisville June, 2015 Contributions 1. Focus on more recent cohort, NLSY97. Obesity

More information

(b) empirical power. IV: blinded IV: unblinded Regr: blinded Regr: unblinded α. empirical power

(b) empirical power. IV: blinded IV: unblinded Regr: blinded Regr: unblinded α. empirical power Supplementary Information for: Using instrumental variables to disentangle treatment and placebo effects in blinded and unblinded randomized clinical trials influenced by unmeasured confounders by Elias

More information

Collecting Data Example: Does aspirin prevent heart attacks?

Collecting Data Example: Does aspirin prevent heart attacks? Collecting Data In an experiment, the researcher controls or manipulates the environment of the individuals. The intent of most experiments is to study the effect of changes in the explanatory variable

More information

The Limits of Inference Without Theory

The Limits of Inference Without Theory The Limits of Inference Without Theory Kenneth I. Wolpin University of Pennsylvania Koopmans Memorial Lecture (2) Cowles Foundation Yale University November 3, 2010 Introduction Fuller utilization of the

More information

Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of

Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of Objective: To describe a new approach to neighborhood effects studies based on residential mobility and demonstrate this approach in the context of neighborhood deprivation and preterm birth. Key Points:

More information

Can higher cigarette taxes improve birth outcomes?

Can higher cigarette taxes improve birth outcomes? Journal of Public Economics 72 (1999) 135 154 Can higher cigarette taxes improve birth outcomes? William N. Evans *, Jeanne S. Ringel a, b a Department of Economics, University of Maryland, Project HOPE

More information

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography

More information

Causal Inference for Medical Decision Making

Causal Inference for Medical Decision Making Causal Inference for Medical Decision Making OUTLINE Sharon-Lise Normand Harvard Medical School Harvard T.H. Chan School of Public Health April 2018 Regulatory Science What is Evidence? Assumptions Concluding

More information

Gender and Generational Effects of Family Planning and Health Interventions: Learning from a Quasi- Social Experiment in Matlab,

Gender and Generational Effects of Family Planning and Health Interventions: Learning from a Quasi- Social Experiment in Matlab, Gender and Generational Effects of Family Planning and Health Interventions: Learning from a Quasi- Social Experiment in Matlab, 1977-1996 T. Paul Schultz* * I gratefully acknowledge research support from

More information

1. Introduction Consider a government contemplating the implementation of a training (or other social assistance) program. The decision to implement t

1. Introduction Consider a government contemplating the implementation of a training (or other social assistance) program. The decision to implement t 1. Introduction Consider a government contemplating the implementation of a training (or other social assistance) program. The decision to implement the program depends on the assessment of its likely

More information

Instrumental Variables. Application and Limitations

Instrumental Variables. Application and Limitations ORIGINAL ARTICL Application and Limitations dwin P. Martens,* Wiebe R. Pestman, Anthonius de Boer,* Svetlana V. Belitser,* and Olaf H. Klungel* Abstract: To correct for confounding, the method of instrumental

More information

10. Introduction to Multivariate Relationships

10. Introduction to Multivariate Relationships 10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory variables have an influence on any particular

More information

TEXT: Freedman-Pisani-Purves, Statistics, 3rd edition, W.W. Norton & Co. Bring book to class, WHICH WILL BE HELD IN 344 EVANS, Tu Th

TEXT: Freedman-Pisani-Purves, Statistics, 3rd edition, W.W. Norton & Co. Bring book to class, WHICH WILL BE HELD IN 344 EVANS, Tu Th Notes for first lecture Statistics 131 DA Freedman TEXT: Freedman-Pisani-Purves, Statistics, 3rd edition, W.W. Norton & Co. Read chapters 1 and 2. Work exercises! Bring book to class, WHICH WILL BE HELD

More information

Stronger Instruments and Refined Covariate Balance in an Observational Study of the Effectiveness of Prompt Admission to the ICU in the UK

Stronger Instruments and Refined Covariate Balance in an Observational Study of the Effectiveness of Prompt Admission to the ICU in the UK Stronger Instruments and Refined Covariate Balance in an Observational Study of the Effectiveness of Prompt Admission to the ICU in the UK Luke Keele Steve Harris Samuel D. Pimentel Richard Grieve August

More information

3.4 What are some cautions in analyzing association?

3.4 What are some cautions in analyzing association? 3.4 What are some cautions in analyzing association? Objectives Extrapolation Outliers and Influential Observations Correlation does not imply causation Lurking variables and confounding Simpson s Paradox

More information

CASE STUDY 2: VOCATIONAL TRAINING FOR DISADVANTAGED YOUTH

CASE STUDY 2: VOCATIONAL TRAINING FOR DISADVANTAGED YOUTH CASE STUDY 2: VOCATIONAL TRAINING FOR DISADVANTAGED YOUTH Why Randomize? This case study is based on Training Disadvantaged Youth in Latin America: Evidence from a Randomized Trial by Orazio Attanasio,

More information

Case A, Wednesday. April 18, 2012

Case A, Wednesday. April 18, 2012 Case A, Wednesday. April 18, 2012 1 Introduction Adverse birth outcomes have large costs with respect to direct medical costs aswell as long-term developmental consequences. Maternal health behaviors at

More information

Chapter 8 Statistical Principles of Design. Fall 2010

Chapter 8 Statistical Principles of Design. Fall 2010 Chapter 8 Statistical Principles of Design Fall 2010 Experimental Design Many interesting questions in biology involve relationships between response variables and one or more explanatory variables. Biology

More information

Challenges in design and analysis of large register-based epidemiological studies

Challenges in design and analysis of large register-based epidemiological studies FMS/DSBS autumn meeting 2014 Challenges in design and analysis of large register-based epidemiological studies Caroline Weibull & Anna Johansson Department of Medical Epidemiology and Biostatistics (MEB)

More information

Propensity Score Analysis Shenyang Guo, Ph.D.

Propensity Score Analysis Shenyang Guo, Ph.D. Propensity Score Analysis Shenyang Guo, Ph.D. Upcoming Seminar: April 7-8, 2017, Philadelphia, Pennsylvania Propensity Score Analysis 1. Overview 1.1 Observational studies and challenges 1.2 Why and when

More information

Empirical Tools of Public Finance. 131 Undergraduate Public Economics Emmanuel Saez UC Berkeley

Empirical Tools of Public Finance. 131 Undergraduate Public Economics Emmanuel Saez UC Berkeley Empirical Tools of Public Finance 131 Undergraduate Public Economics Emmanuel Saez UC Berkeley 1 DEFINITIONS Empirical public finance: The use of data and statistical methods to measure the impact of government

More information

Applied Quantitative Methods II

Applied Quantitative Methods II Applied Quantitative Methods II Lecture 7: Endogeneity and IVs Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 7 VŠE, SS 2016/17 1 / 36 Outline 1 OLS and the treatment effect 2 OLS and endogeneity 3 Dealing

More information

Disparity Data Fact Sheet General Information

Disparity Data Fact Sheet General Information Disparity Data Fact Sheet General Information Tobacco use is a well-recognized risk factor for many cancers, respiratory illnesses and cardiovascular diseases within Michigan. rates have continued to decline

More information

Statistical methods for assessing treatment effects for observational studies.

Statistical methods for assessing treatment effects for observational studies. University of Louisville ThinkIR: The University of Louisville's Institutional Repository Electronic Theses and Dissertations 5-2014 Statistical methods for assessing treatment effects for observational

More information

Matched Cohort designs.

Matched Cohort designs. Matched Cohort designs. Stefan Franzén PhD Lund 2016 10 13 Registercentrum Västra Götaland Purpose: improved health care 25+ 30+ 70+ Registries Employed Papers Statistics IT Project management Registry

More information

Confidence Intervals On Subsets May Be Misleading

Confidence Intervals On Subsets May Be Misleading Journal of Modern Applied Statistical Methods Volume 3 Issue 2 Article 2 11-1-2004 Confidence Intervals On Subsets May Be Misleading Juliet Popper Shaffer University of California, Berkeley, shaffer@stat.berkeley.edu

More information

The role of self-reporting bias in health, mental health and labor force participation: a descriptive analysis

The role of self-reporting bias in health, mental health and labor force participation: a descriptive analysis Empir Econ DOI 10.1007/s00181-010-0434-z The role of self-reporting bias in health, mental health and labor force participation: a descriptive analysis Justin Leroux John A. Rizzo Robin Sickles Received:

More information

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Lec 02: Estimation & Hypothesis Testing in Animal Ecology Lec 02: Estimation & Hypothesis Testing in Animal Ecology Parameter Estimation from Samples Samples We typically observe systems incompletely, i.e., we sample according to a designed protocol. We then

More information

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013 Evidence-Based Medicine Journal Club A Primer in Statistics, Study Design, and Epidemiology August, 2013 Rationale for EBM Conscientious, explicit, and judicious use Beyond clinical experience and physiologic

More information

NBER WORKING PAPER SERIES HOW WAS THE WEEKEND? HOW THE SOCIAL CONTEXT UNDERLIES WEEKEND EFFECTS IN HAPPINESS AND OTHER EMOTIONS FOR US WORKERS

NBER WORKING PAPER SERIES HOW WAS THE WEEKEND? HOW THE SOCIAL CONTEXT UNDERLIES WEEKEND EFFECTS IN HAPPINESS AND OTHER EMOTIONS FOR US WORKERS NBER WORKING PAPER SERIES HOW WAS THE WEEKEND? HOW THE SOCIAL CONTEXT UNDERLIES WEEKEND EFFECTS IN HAPPINESS AND OTHER EMOTIONS FOR US WORKERS John F. Helliwell Shun Wang Working Paper 21374 http://www.nber.org/papers/w21374

More information

Mediation Analysis With Principal Stratification

Mediation Analysis With Principal Stratification University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 3-30-009 Mediation Analysis With Principal Stratification Robert Gallop Dylan S. Small University of Pennsylvania

More information

Two-sample Categorical data: Measuring association

Two-sample Categorical data: Measuring association Two-sample Categorical data: Measuring association Patrick Breheny October 27 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 40 Introduction Study designs leading to contingency

More information

From Description to Causation

From Description to Causation From Description to Causation Department of Government London School of Economics and Political Science 1 Correlation vs. Causation 2 Over-Time Changes 3 Getting Systematic about Causality What makes something

More information

Supplement 2. Use of Directed Acyclic Graphs (DAGs)

Supplement 2. Use of Directed Acyclic Graphs (DAGs) Supplement 2. Use of Directed Acyclic Graphs (DAGs) Abstract This supplement describes how counterfactual theory is used to define causal effects and the conditions in which observed data can be used to

More information

Contingency Tables Summer 2017 Summer Institutes 187

Contingency Tables Summer 2017 Summer Institutes 187 Contingency Tables 87 Overview ) Types of Variables ) Comparing () Categorical Variables Contingency (two-way) tables Tests 3) x Tables Sampling designs Testing for association Estimation of effects Paired

More information

Recent advances in non-experimental comparison group designs

Recent advances in non-experimental comparison group designs Recent advances in non-experimental comparison group designs Elizabeth Stuart Johns Hopkins Bloomberg School of Public Health Department of Mental Health Department of Biostatistics Department of Health

More information

Section The Question of Causation

Section The Question of Causation Section 2.5 - The Question of Causation Statistics 104 Autumn 2004 Copyright c 2004 by Mark E. Irwin Causation Does smoking cause cancer? Did chemical weapons exposure cause health problems in Gulf War

More information

Complier Average Causal Effect (CACE)

Complier Average Causal Effect (CACE) Complier Average Causal Effect (CACE) Booil Jo Stanford University Methodological Advancement Meeting Innovative Directions in Estimating Impact Office of Planning, Research & Evaluation Administration

More information

Marno Verbeek Erasmus University, the Netherlands. Cons. Pros

Marno Verbeek Erasmus University, the Netherlands. Cons. Pros Marno Verbeek Erasmus University, the Netherlands Using linear regression to establish empirical relationships Linear regression is a powerful tool for estimating the relationship between one variable

More information

Problem Set 5 ECN 140 Econometrics Professor Oscar Jorda. DUE: June 6, Name

Problem Set 5 ECN 140 Econometrics Professor Oscar Jorda. DUE: June 6, Name Problem Set 5 ECN 140 Econometrics Professor Oscar Jorda DUE: June 6, 2006 Name 1) Earnings functions, whereby the log of earnings is regressed on years of education, years of on-the-job training, and

More information

Common Statistical Issues in Biomedical Research

Common Statistical Issues in Biomedical Research Common Statistical Issues in Biomedical Research Howard Cabral, Ph.D., M.P.H. Boston University CTSI Boston University School of Public Health Department of Biostatistics May 15, 2013 1 Overview of Basic

More information

t-test for r Copyright 2000 Tom Malloy. All rights reserved

t-test for r Copyright 2000 Tom Malloy. All rights reserved t-test for r Copyright 2000 Tom Malloy. All rights reserved This is the text of the in-class lecture which accompanied the Authorware visual graphics on this topic. You may print this text out and use

More information

Student Performance Q&A:

Student Performance Q&A: Student Performance Q&A: 2009 AP Statistics Free-Response Questions The following comments on the 2009 free-response questions for AP Statistics were written by the Chief Reader, Christine Franklin of

More information

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University Biases in clinical research Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University Learning objectives Describe the threats to causal inferences in clinical studies Understand the role of

More information

ICPSR Causal Inference in the Social Sciences. Course Syllabus

ICPSR Causal Inference in the Social Sciences. Course Syllabus ICPSR 2012 Causal Inference in the Social Sciences Course Syllabus Instructors: Dominik Hangartner London School of Economics Marco Steenbergen University of Zurich Teaching Fellow: Ben Wilson London School

More information

Bios 6648: Design & conduct of clinical research

Bios 6648: Design & conduct of clinical research Bios 6648: Design & conduct of clinical research Section 1 - Specifying the study setting and objectives 1. Specifying the study setting and objectives Where will we end up?: (a) The treatment indication

More information

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University Biases in clinical research Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University Learning objectives Describe the threats to causal inferences in clinical studies Understand the role of

More information

Research Designs. Inferential Statistics. Two Samples from Two Distinct Populations. Sampling Error (Figure 11.2) Sampling Error

Research Designs. Inferential Statistics. Two Samples from Two Distinct Populations. Sampling Error (Figure 11.2) Sampling Error Research Designs Inferential Statistics Note: Bring Green Folder of Readings Chapter Eleven What are Inferential Statistics? Refer to certain procedures that allow researchers to make inferences about

More information

Structural vs. Atheoretic Approaches to Econometrics

Structural vs. Atheoretic Approaches to Econometrics Structural vs. Atheoretic Approaches to Econometrics by Michael P. Keane Yale University First Draft, September 16, 2005 Revised October 22, 2005 Revised, March 22, 2006 Revised, May 1, 2006 Prepared as

More information

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics What is Multilevel Modelling Vs Fixed Effects Will Cook Social Statistics Intro Multilevel models are commonly employed in the social sciences with data that is hierarchically structured Estimated effects

More information

Chapter 6. Experiments in the Real World. Chapter 6 1

Chapter 6. Experiments in the Real World. Chapter 6 1 Chapter 6 Experiments in the Real World Chapter 6 1 Thought Question 1 Suppose you are interested in determining if drinking a glass of red wine each day helps prevent heartburn. You recruit 40 adults

More information

ECON Microeconomics III

ECON Microeconomics III ECON 7130 - Microeconomics III Spring 2016 Notes for Lecture #5 Today: Difference-in-Differences (DD) Estimators Difference-in-Difference-in-Differences (DDD) Estimators (Triple Difference) Difference-in-Difference

More information

Confounding and Bias

Confounding and Bias 28 th International Conference on Pharmacoepidemiology and Therapeutic Risk Management Barcelona, Spain August 22, 2012 Confounding and Bias Tobias Gerhard, PhD Assistant Professor, Ernest Mario School

More information