For Objec*ve Causal Inference, Design Trumps Analysis. Donald B. Rubin Department of Sta*s*cs Harvard University 16 March 2012

Size: px
Start display at page:

Download "For Objec*ve Causal Inference, Design Trumps Analysis. Donald B. Rubin Department of Sta*s*cs Harvard University 16 March 2012"

Transcription

1 For Objec*ve Causal Inference, Design Trumps Analysis Donald B. Rubin Department of Sta*s*cs Harvard University 16 March

2 Prologue to Objec*ve Causal Inference in Observa*onal Studies My Introduc*on Physics Wheeler 1961 Experimental Design Cochran 1968 Clear Separa*on Between Science = object of inference: DEFINE QUESTION FIRST What is done to learn about the science Intervene to measure aspects at a point in *me Same nota*on/representa*on of science no maxer how we try to learn about or measure Missing data always exist Cannot go back in *me 2

3 Poten*al Outcomes Approach to Causal Inference Simplest SeZng Units 1... N Y(1) Y(0) 3

4 Poten*al Outcomes Approach to Causal Inference Simplest SeZng Fundamental problem of causal inference For each i, only Y i (1) or Y i (0) can be observed Y(1) Y(0) T 1? 1.? 1 Units.? 1.? 0.? 0 N? 0 Random assignment of ac*ve versus control representa*ve sample of Y i (1) will be compared to representa*ve sample of Y i (0) 4

5 Poten*al Outcomes Approach to Causal Inference Simplest SeZng with Covariates Units 1... N X Y(1) Y(0) Same as before, except includes pretreatment covariates, e.g., age, sex, background educa*on Randomiza*on s*ll works for females 5

6 Randomized Trials That Are Designed Oden Using Covariates Randomized blocks (e.g., males, females) Forces balance on blocking variables Probability of treatment versus control can depend on covariates values, some*mes in complicated ways This is the template for the design and analysis of nonrandomized (e.g., observa*onal) data Assignment- based approaches use assignment mechanism for inference Fisherian & Neymanian Predic*ve approach predicts from observed values Bayesian posterior predic*ve (Rubin, 1978) 6

7 Design Observa-onal Studies to Approximate Randomized Trials 1. Hide outcome data un*l the design phase is complete 2. Think very carefully about decision makers and the key covariates that were used to make treatment decisions 3. If key covariates are not observed or very noisy, usually best to give up and seek bexer data source 4. Find subgroups (subclasses or matched pairs) in which the treatment and control groups have balance essen*ally the same distribu*on of observed covariates Not always possible to achieve balance Inferences are limited to subgroups where balance is achieved 5. Protocol specified analysis #1 - #5 combine to create an objec*ve design that approximates a randomized trial in each subclass that is balanced with respect to observed covariates 7

8 Illustra*ve Example with One Key Covariate (Cochran, 1968) Popula*on: Male smokers in U.S. Treatment = cigar/pipe smoking Control = cigarexe smoking Outcome = death rate/1000 person years Decision maker is the individual male smoker Reason for a smoking male to choose cigarexes versus cigar/pipe? Age is a key covariate for selec*on of smoking type for males 8

9 Subclassifica*on to Balance Age To achieve balance on age, compare: young cigar/pipe smokers with young cigarexe smokers old cigar/pipe smokers with old cigarexe smokers Or bexer, compare: Young, middle aged, old Even more age subclasses Design phase, no outcome data, objec*ve: Approximates a randomized trial within subclasses Now look at outcome data Reference: Rubin DB. The Design Versus the Analysis of Observa*onal Studies for Causal Effects: Parallels With The Design of Randomized Trials. Sta*s*cs in Medicine

10 Comparison of Mortality Rates for Two Variable Smoking Groups in U.S. Mortality Rates per 1000 person- years, % Adjusted Mortality Rates using subclasses, % CigareXe Smokers Cigar/Pipe Smokers age subclasses age subclasses age subclasses Source: Cochran WG. The effec*veness of adjustment of subclassifica*on in removing bias in observa*onal studies. Biometrics 1968; 24: Note: 20 four- level covariates over million million subclasses 10

11 Propensity Score Methods Rosenbaum and Rubin. The Central Role of the Propensity Score in Observa*onal Studies. Biometrika Observa*onal study analogue of randomiza*on The propensity score is the probability of treatment versus control as a func*on of observed covariates Model the reasons for treatment versus control at the level of the decision makers For example, logis*c regression model to predict cigarexe versus cigar/pipe smoking with age, educa*on, income, etc. as predictors Then subclassify (or match) on the propensity score as if it were the only covariate, e.g., 5-10 subclasses If correctly done, this creates balance within each subclass on ALL covariates used in es*ma*ng the propensity score Using diagnos*cs to assess and to document balance is cri*cal 11

12 Example: GAO Study of Breast Conserva*on versus Mastectomy Six large and expensive randomized clinical trials had been completed showing lixle difference for the type of women randomized in the trials and par*cipa*ng clinics Ques*on: Same results in general prac*ce? Observa*onal data available SEER Database: covariates, treatments, post- surgery outcomes Design phase Hide outcomes Balance covariates between treatment and control Reasons for mastectomy versus breast conserva*on Age, marital status, region of country, urbaniza*on, race, size of tumor, etc. Reference: Rubin DB. Es*mated Causal Effects from Large Datasets Using Propensity Scores. Annals of Internal Medicine 1997; 127, 8(II):

13 Es*mated 5- year Survival Rates for Node- nega*ve Pa*ents in Six Randomized Clinical Trials Study Women Breast Conserva*on (BC) Es*mated Survival Rate for Women Es*mated Causal Effect Mastectomy (Mas) BC Mas BC Mas n n % % % US- NCI Milanese French Danish EORTC US- NSABP Single- center trial; Mul*center trial Reference: Rubin DB. Es*mated Causal Effects from Large Datasets Using Propensity Scores. Annals of Internal Medicine 1997; 127, 8(II):

14 Propensity Score Analysis Approach Es*mate propensity scores Then subclassify (or match) on propensity score as if the only covariate, e.g., 5-10 subclasses Why does this work? Creates balance in each subclass on ALL covariates used in es*ma*ng the propensity score This balance will be achieved in large samples just like the balance that will be achieved in a large randomized clinical trial 14

15 Propensity Score Subclass Es*mated 5- year Survival Rates for Node- Nega*ve Pa*ents in the SEER Database within Each of Five Propensity Score Subclasses Women Breast Conserva*on (BC) Es*mated Survival Rate for Women Es*mated Causal Effect Mastectom y (Mas) BC Mas BC Mas n n % % % Averages Across Five Subclasses Reference: Rubin DB. Es*mated Causal Effects from Large Datasets Using Propensity Scores. Annals of Internal Medicine 1997; 127, 8(II):

16 Diagnos*cs for Accessing Balance Assessing balance simpler in large samples, just as with randomized experiments To illustrate diagnos*cs, use a marke*ng applica*on that involved a weight loss drug Units = doctors Treatment = sales rep visits doctor to discuss Control = no visit Decision- makers = sales reps Key covariates = prior Rxs, medical specialty, years in prac*ce, size of prac*ce, etc. 16

17 Histograms for background variable: Prior Rx Score (0-100) at Baseline Source: Rubin DB and Waterman RP. Es*ma*ng Causal Effects of Marke*ng Interven*ons Using Propensity Score Methodology. Sta*s*cal Science 2006; 21(2):

18 Histograms for background variable: Specialty Source: Rubin DB and Waterman RP. Es*ma*ng Causal Effects of Marke*ng Interven*ons Using Propensity Score Methodology. Sta*s*cal Science 2006; 21(2):

19 Histograms for summarized background variables: Linear Propensity Score Source: Rubin DB and Waterman RP. Es*ma*ng Causal Effects of Marke*ng Interven*ons Using Propensity Score Methodology. Sta*s*cal Science 2006; 21(2):

20 Histograms for a variable in a subclass of propensity scores: Prior Rx Score Source: Rubin DB and Waterman RP. Es*ma*ng Causal Effects of Marke*ng Interven*ons Using Propensity Score Methodology. Sta*s*cal Science 2006; 21(2):

21 Histograms for a variable in a subclass of propensity scores: Specialty Source: Rubin DB and Waterman RP. Es*ma*ng Causal Effects of Marke*ng Interven*ons Using Propensity Score Methodology. Sta*s*cal Science 2006; 21(2):

22 Marke*ng Example: Achieved Balance Within each narrow subclass of propensity scores, the treatment and control groups will be as balanced as if randomly divided Claim: This holds for all subclasses in which there are both treated and control subjects, and holds for all covariates that were used to es*mate the propensity score Works best when the propensity score subclasses have large sample sizes and are rela*vely narrow Five to ten propensity score subclasses oden fully adequate to balance all covariates No outcome data used in the design stage 22

23 * * 23 23

24 24 24

25 25

26 Simple Noncompliance, Instrumental Variables, and Bayesian Generaliza*ons Template for other observa*onal studies involves more complex randomized experiment Illustrate with completely randomized experiment with noncompliance with assigned treatment Return later to combined analysis with observa*onal study design

27 Sommer and Zeger Vitamin A Data Row True Compliance Type Treatment Assignment Treatment Received Y obs Number of Children 1? ? N N C C Reference: Sommer and Zeger (1991). On Estimating Efficacy from Clinical Trials. Statistics in Medicine.

28 Results of Three Standard MoM Analyses Method Es-mate Calcula-on Row Comparison ITT , 4, 5, & 6 vs. 1 & 2 As- treated & 6 vs. 1, 2, 3, &4 Per protocol & 6 vs. 1 & 2 Reference: Sommer and Zeger (1991). On Estimating Efficacy from Clinical Trials. Statistics in Medicine.

29 MoM CACE Analysis ACE = p N. NACE + p C. CACE = 0.2. NACE CACE = 0.8. CACE CACE = /0.8 =

30 Bayesian Analysis of Sommer & Zeger Data Imbens G.W. and Rubin D.B. (1997) Bayesian Inference for Causal Effects in Randomized Experiments with Noncompliance. Annals of Statistics 25(1):

31 Bayesian Analysis of Sommer & Zeger Data, Marginal Posterior Distribu*ons with and without Exclusion Restric*on Es*mand Exclusion restric*on Mean Standard devia*on Median 5 th percen*le 95 th percen*le CACE No ITT Y (n) No CACE Yes Imbens G.W. and Rubin D.B. (1997) Bayesian Inference for Causal Effects in Randomized Experiments with Noncompliance. Annals of Statistics 25(1):

32 Bayesian Analysis of Sommer & Zeger Data Imbens G.W. and Rubin D.B. (1997) Bayesian Inference for Causal Effects in Randomized Experiments with Noncompliance. Annals of Statistics 25(1):

33 Bayesian Analysis of Sommer & Zeger Data Imbens G.W. and Rubin D.B. (1997) Bayesian Inference for Causal Effects in Randomized Experiments with Noncompliance. Annals of Statistics 25(1):

34 Bayesian Analysis of Sommer & Zeger Data Imbens G.W. and Rubin D.B. (1997) Bayesian Inference for Causal Effects in Randomized Experiments with Noncompliance. Annals of Statistics 25(1):

35 Hypothe*cal Example Illustra*ng Frequen*st Superiority of Bayes over IVE (MoM) and MLE, Popula*on Parameters with Exclusion Restric*ons and Monotonicity T P(C i = t π) D i (0) D i (1) Y i C i = t, Z i = 0, π Y i C i = t, Z i = 0, π c N(0.1, 0.16) N(0.9, 0.49) n N(1.0, 0.25) N(1.0, 0.25) a N(0.0, 0.36) N(0.0, 0.36) Imbens G.W. and Rubin D.B. (1997) Bayesian Inference for Causal Effects in Randomized Experiments with Noncompliance. Annals of Statistics 25(1):

36 Hypothe*cal Example Illustra*ng Frequen*st Superiority of Bayes over IVE (MoM) and MLE, One Sample Imbens G.W. and Rubin D.B. (1997) Bayesian Inference for Causal Effects in Randomized Experiments with Noncompliance. Annals of Statistics 25(1):

37 Hypothe*cal Example Illustra*ng Frequen*st Superiority of Bayes over IVE (MoM) and MLE, Frequen*st Evalua*on under Monotonicity and Exclusion Restric*ons Es*mator Posterior mean Posterior median Mean bias Median bias Root mean squared error Median absolute error Coverage rate 90% interval Median width MLE IVE Imbens G.W. and Rubin D.B. (1997) Bayesian Inference for Causal Effects in Randomized Experiments with Noncompliance. Annals of Statistics 25(1):

38

39 Jin and Rubin (JASA, 2008) - JR JR reanalyzed data from EF using principal stra*fica*on (Frangakis & Rubin, Biometrics, 2002) Randomized Treatment (ac*ve drug) versus Control (placebo) and meaured: Compliance (pill count) - intermediate outcome Cholesterol reduc*on - primary outcome Cri*cal axributes of Analysis Here: Principal stra*fica*on and the various problems that it can address Formula*on using hypothe*cal experiment Bayesian method of analysis, which easily incorporates scien*fic understanding 39

40 The EF Data 164 men were randomized to the treatment group and assigned the drug, Z i = T 171 men were randomized to the control group and assigned placebo, Z i = C For each pa*ent, cholesterol levels were measured before and ader taking the drug or placebo The outcome variable, Y i (T) or Y i (C), was the decrease in cholesterol level: the only variable used by EF or JR, besides treatment assigned and dose taken 40

41 Complica*ons with the EF Data Par*al and Extended Noncompliance Most pa*ents in the treatment group took only a propor*on of the assigned drug: D i (T) [0,1] Most pa*ents in the control group took only a propor*on of the assigned placebo: d i (C) [0,1] By design, D i (C ) = 0 and d i (T) = 0 Thanks to Brad Efron for sharing data 41

42 Rela*onship Between Observed Cholesterol Reduc*on and Observed Compliance Figures from Efron and Feldman,

43 How to es*mate dose- response? Observed dose- response in both arms Somehow, subtract Y(C) versus d(c) plot from Y(T) versus D(T) plot EF axempted this, but wrixen discussion (including by DBR) of ar*cle indicated debatable success Objec*ve in JR was to do this subtrac*on correctly under explicit assump*ons Here, highlight principal stra*fica*on, hypothe*cal experiment for dose- response, and Bayesian approach to analysis 43

44 Standard Assump*ons Stable Unit Treatment Value Assump*on (SUTVA): One pa*ent s treatment assignment does not affect other pa*ents poten*al outcomes; For each pa*ent, no hidden versions of treatment and no hidden versions of control Ignorable Treatment Assignment of T versus C True for randomized experiment These are accepted by both EF and JR. 44

45 Histograms of Observed Compliance Figures from Jin and Rubin,

46 Q- Q Plot of Observed Drug and Observed Placebo Compliance Figure from Jin and Rubin,

47 Possible Assump*ons at the Individual Level Perfect Blind: D i (T) = d i (C); obviously wrong Equipercen*le Equatable Compliances Align percen*les of D i (T) and d i (C), as in Q- Q So both known for all men: D i (T) = F(d i (C)) EF assume this, which is true in expecta*on Side- Effect Monotonicity Nega*ve: D i (T) < d i (C) Posi*ve: D i (T) > d i (C) JR assume nega*ve side effects; plausible 47

48 Meaning of d i and D i d i : compliance to placebo indicates pa*ent i s psychological compliance status, a covariate that is missing for men assigned drug D i : compliance to drug reflects both pa*ents i s psychological compliance status and his tolerance to nega*ve side effects of the drug, etc. But D i hints at possibility of es*ma*ng dose- response Similar comments in EF, but JR allow D i (T) F(d i (C)) 48

49 Es*ma*ng a Dose- Response Rela*onship within the Rubin Causal Model (Holland, 1986) To es*mate a dose- response rela*onship Need a hypothe*cal experiment where different doses of drug are randomly assigned and enforced Principal stra*fica*on framework (Frangakis and Rubin, 2002) vast generaliza*on of IVE The intermediate outcome d i (C) is unaffected by treatment assignment Therefore is a par*ally observed covariate For each stratum of pa*ents with the same d i (C), the assignment of dose is stochas*c and latent ignorable (Frangakis and Rubin, 1999) 49

50 Specific Hypothe*cal Experiment Measure d i* = baseline compliance for each pa*ent when assigned full placebo dose Randomly divide pa*ents into Treatment and Control In treatment group, stochas*cally assign dose Z Di < d i * according to a Beta random variable In control group, assign full placebo and measure d i We no*ce d i (C) = d i * in the control group, then toss d i * in the control group and in the treatment group Thus, nonignorable assignment of Z Di, but latent ignorable given d i * Also, forget the rule for the assignment of Z Di 50

51 Principal Stra*fica*on Framework for Dose- Response with d i (C) Defining Strata and Z Di (T) Defining Dose 51

52 Treatment Assignment Mechanism Nota*on θ = All parameters Assume d i (C) = d i * for everyone, denoted d i Actual Randomiza*on of Z=T versus Z=C [Z d i, Y i (C ), {Y i (Z Di )}, θ] = [Z θ] ~ constant Hypothe*cal Randomiza*on of Dose Z D given Z=T [Z Di d i,y i (C ),{Y i (Z Di )},Z=T,θ] = [Z Di d i, Z=T, θ] ~ d i. Beta(α 1, α 2 ) Latently ignorable given par*ally observed variable d i 52

53 JR s Parametric Model Given θ Covariate Distribu*on [d i θ] ~ Beta(α 3, α 4 ) Poten*al Outcomes Joint Distribu*on Close to EF s [Y i (C) d i, θ] ~ N(β 0 + β d i, σ C2 ) [Y i (Z Di ) Y i (C), d i, θ] ~ N[Y i (C) + γ 1 Z Di + γ 2 Z Di2 + γ 3 Z Di d i,σ 2 T. C ], Mutually condi*onally independent across the Z Di, - - plausible? And γ 1 >0, γ 2 >0, γ 1 + γ 3 >0 When Z Di = 0, expecta*on of Y i (Z Di ) - Y i (C) = 0 Dose- response is monotonely increasing for this range of doses 53

54 Prior Distribu*on on θ Prior distribu*on on parameters of Betas is specified by adding six fake observa*ons with both Z Di and d i observed on the equal percen*le equa*ng line and nothing else observed These are the minimum, 25 th percen*le, median, 75 th percen*le, maximum Purpose of these observa*ons is simply to stabilize computa*on and has lixle influence on inference (n fake =6 versus n real >300) and are accurate in expecta*on because of the randomiza*on Prior distribu*on on the rest of θ is independent and is the standard noninforma*ve prior propor*onal to 1/(σ C σ T. C ) 54

55 Figures from Jin and Rubin,

56 JR s Computa*on Missing data problem, which is addressed using MCMC to draw Bayesian inferences Parameters are θ Key missing data are d i for those assigned treatment and Z Di for those assigned control Given θ, draw key missing data; given key missing data, draw θ; iterate un*l approximate convergence Vast number of such draws approximates posterior distribu*on of dose- response as a func*on of principal strata defined by d i (C) 56

57 Diagnos*c Checks for JR s Model One Posterior Draw of Key Missing Data Figures from Jin and Rubin,

58 Dose- Response Results for Principal Strata Maximum d, 75 th d, median d, 25 th d Figures from Jin and Rubin,

59 Discussion of the Dose Response Conclusions Under EF s assump*ons, dose- response at each d i (C ) is a point because D i (T) = F(d i (C)) JR s dose- response results are causal under debatable assump*on Is Nature s randomiza*on of dose given placebo compliance (i.e., the crucial latent ignorability assump*on) plausible? Or do we need to condi*on further on background medical characteris*cs related to possible side effects of the drug? Such sensi*vity analysis is future work 59

60 60

61

62 Are Job- Training Programs Effec*ve? Donald B. Rubin Harvard University Presentation based on joint work with Fabrizia Mealli, Paolo Frumento, and Barbara Pacini. 62

63 The Na*onal Job Corps Study A randomized study to evaluate the effects of a training program on employment and wages Randomiza*on assures fair comparison, in expecta*on, between treatment groups Sampled youths (n=15,386) were assigned randomly to a job training program group or a control group Only those assigned to the job training program group were able to enroll in Job Corps Post- treatment complica*ons Noncompliance (only 73% axended the offered training) Trunca*on of wages for the unemployed Missing outcomes due to nonresponse 63

64 Poten*al Outcomes Approach to Causal Inference Simplest SeZng Units 1... N Y(1) Y(0) 64

65 Poten*al Outcomes Approach to Causal Inference Simplest SeZng Fundamental problem of causal inference For each i, only Y i (1) or Y i (0) can be observed Y(1) Y(0) T 1? 1.? 1 Units.? 1.? 0.? 0 N? 0 Random assignment of ac*ve versus control representa*ve sample of Y i (1) will be compared to representa*ve sample of Y i (0) 65

66 Poten*al Outcomes Approach to Causal Inference Simplest SeZng with Covariates Units 1... N X Y(1) Y(0) Same as before, except includes pretreatment covariates, e.g., age, sex, background educa*on Randomiza*on s*ll works for females 66

67 Poten*al Outcomes Approach to Causal Inference Simple Noncompliance with Ac*ve Treatment Units D(1) D(0) Y(1) Y(0) N 0 0 compliers noncompliers Randomiza*on s*ll works for compliers 67

68 Poten*al Outcomes Approach to Causal Inference Simple Noncompliance with Ac*ve Treatment: Observed Data D(1) D(0) Y(1) Y(0) T 1 1 0? 1 complier status observed. 1 0? 1 Units.? 0? 0.? 0? 0 complier status missing. 0 0? 1 noncomplier status observed N? 0? 0 noncomplier status missing Compliers For individuals assigned treatment (T=1), D(1)=1 & D(0)=0 For individuals assigned control (T=0), D(1)=? because true compliance under treatment is unknown & D(0)=0 Noncompliers For individuals assigned treatment (T=1), D(1)=0 & D(0)=0 For individuals assigned control (T=0), D(1)=? because true compliance under treatment is unknown & D(0)=0 Randomiza-on s-ll works for compliers 68

69 Key Idea: Principal Stra*fica*on (Frangakis and Rubin, 2002) Stra*fy on values of post- treatment intermediate outcome Convert D i (1), D i (0) into stra*fica*on variable True complier c if D i (1)=1 Noncomplier n if D i (1)=0 Idea works more generally 69

70 Intermediate Outcome - Employment Employed (yes, no) at a given *me post- treatment is an important outcome, but is also needed to define principal strata for final outcomes, Y, describing axributes of possible employment, such as wages, re*rement plan benefits, etc., which are not well- defined if unemployed Principal strata are defined by employment status EE = employed whether assigned to training or not EU = employed if trained, unemployed if not trained UE = unemployed if trained, employed if not trained UU = unemployed whether assigned to training or not Causal effects of training on Y only well- defined for EE UE empty? Reserva*on wage issue 70

71 Causal Effects of Training within Principal Strata Principal strata are defined by compliance with assignment to job training and by employment status c&ee, c&eu, c&ue, c&uu n&ee, n&eu, n&ue, n&uu By assump*on (exclusion restric*on on employment), we rule out n&eu and n&ue If assignment does not affect entry into training, assignment cannot affect employment status Also assume exclusion for axributes of employment, Y Causal effects of T on Y are only well- defined for c&ee and n&ee principal strata (no effect on Y in n&ee by exclusion restric*on) 71

72 Not Done Yet with Needed Principal Strata Indicators for response to survey items asking about employment status and wages, etc. R(1) and R(0), each indica*ng respondent or not Do not make exclusion restric*on here e.g., males could have R i (1) respond if assigned training, but R i (0) not respond if assigned control But do assume missing at random (MAR) A nuisance, not of scien*fic interest 72

73 Causal Effects Assignment to be trained on being job- trained Pr(c) = propor*on compliers Assignment to be trained on being employed Pr(c&EU) Pr(c&UE) Assignment to be trained on being employed for compliers [Pr(c&EU) Pr(c&UE)]/Pr(c) Rela*ve sizes of principal strata c&ee, c&eu, c&ue, c&uu, n&ee, n&uu Distribu*ons of X within principal strata 73

74 Causal Effects on Wages For the always employed Ave[Y i (1) - Y i (0) c&ee or n&ee] For the always employed compliers Ave[Y i (1) - Y i (0) c&ee] By exclusion, for the always employed noncompliers Ave[Y i (1) Y i (0) n&ee] = 0 74

75 Method of Analysis Direct likelihood at each of three post- treatment points in *me Search for parsimonious model to help guide policy Needs scien*fic judgement 75

76 Es*mated Means of Covariates within Principal Strata Week 52 Principal Stratum c&ee c&eu c&ue c&uu n&ee n&uu Percent in Stratum Female Age at baseline White With a Partner Has children Educa*on Ever arrested Mother s educa*on Father s educa*on Household income > $ Person income > $ Have job Had job, previous year Months in Job, previous year Earnings, previous year

77 Es*mated Means of Covariates within Principal Strata Week 130 Principal Stratum c&ee c&eu c&ue c&uu n&ee n&uu Percent in Stratum Female Age at baseline White With a Partner Has children Educa*on Ever arrested Mother s educa*on Father s educa*on Household income > $ Person income > $ Have job Had job, previous year Months in Job, previous year Earnings, previous year

78 Es*mated Means of Covariates within Principal Strata Week 208 Principal Stratum c&ee c&eu c&ue c&uu n&ee n&uu Percent in Stratum Female Age at baseline White With a Partner Has children Educa*on Ever arrested Mother s educa*on Father s educa*on Household income > $ Person income > $ Have job Had job, previous year Months in Job, previous year Earnings, previous year

79 Percent within Principal Strata by Time Period Principal Stratum c&ee c&eu c&ue c&uu n&ee n&uu Week Week Week For compliers, % EE increases in *me, and % UU decreases For noncompliers, EE remains fairly stable Causal effect of training slightly increases in *me, i.e., the difference between propor*ons in c&eu and c&ue appears to increase in *me Economists lock- in effect during the period of training 79

80 Es*mated Average Hourly Wages for Those Employed in Dollars within Principal Strata by Time Period Principal Stratum c&ee(1) c&ee(0) c&eu(1) c&ue(0) n&ee Week Week Week Es*mated causal effect on wages for always employed compliers is approximately 0.2 for all *me periods Always employed compliers, whether trained or not, have the lower hourly wages than the some*mes employed (c&eu or c&ue) or n&ee Wages tend to increase in *me 80

81 Final Conclusions for This Job Training Program In long run, for compliers, minor posi*ve effect on employment status For always employed compliers, minor posi*ve effect on wages at all *me periods Background characteris*cs of individuals differ across principal strata Suggests need for more targeted programs Even if evalua*on is based on randomized experiment, difficult to analyze correctly 81

82 Observed Values Covariates, X i Treatment received, T i Observed outcome: Cri*cal to understand the assignment mechanism Reasons for T i = 1 versus T i = 0 82

Are Job- Training Programs Effec4ve?

Are Job- Training Programs Effec4ve? Are Job- Training Programs Effec4ve? Donald B. Rubin Harvard University Presentation based on joint work with Fabrizia Mealli, Paolo Frumento, and Barbara Pacini. 1 The Na4onal Job Corps Study A randomized

More information

Casual Methods in the Service of Good Epidemiological Practice: A Roadmap

Casual Methods in the Service of Good Epidemiological Practice: A Roadmap University of California, Berkeley From the SelectedWorks of Maya Petersen 2013 Casual Methods in the Service of Good Epidemiological Practice: A Roadmap Maya Petersen, University of California, Berkeley

More information

Complier Average Causal Effect (CACE)

Complier Average Causal Effect (CACE) Complier Average Causal Effect (CACE) Booil Jo Stanford University Methodological Advancement Meeting Innovative Directions in Estimating Impact Office of Planning, Research & Evaluation Administration

More information

Propensity Score Methods for Longitudinal Data Analyses: General background, ra>onale and illustra>ons*

Propensity Score Methods for Longitudinal Data Analyses: General background, ra>onale and illustra>ons* Propensity Score Methods for Longitudinal Data Analyses: General background, ra>onale and illustra>ons* Bob Pruzek, University at Albany SUNY Summary Propensity Score Analysis (PSA) was introduced by Rosenbaum

More information

FOR OBJECTIVE CAUSAL INFERENCE, DESIGN TRUMPS ANALYSIS 1. BY DONALD B. RUBIN Harvard University

FOR OBJECTIVE CAUSAL INFERENCE, DESIGN TRUMPS ANALYSIS 1. BY DONALD B. RUBIN Harvard University The Annals of Applied Statistics 2008, Vol. 2, No. 3, 808 840 DOI: 10.1214/08-AOAS187 Institute of Mathematical Statistics, 2008 FOR OBJECTIVE CAUSAL INFERENCE, DESIGN TRUMPS ANALYSIS 1 BY DONALD B. RUBIN

More information

Propensity Score. Overview:

Propensity Score. Overview: Propensity Score Overview: What do we use a propensity score for? How do we construct the propensity score? How do we implement propensity score es

More information

Sta$s$cs is Easy. Dennis Shasha From a book co- wri7en with Manda Wilson

Sta$s$cs is Easy. Dennis Shasha From a book co- wri7en with Manda Wilson Sta$s$cs is Easy Dennis Shasha From a book co- wri7en with Manda Wilson Is the Coin Fair? You toss a coin 17 $mes and it comes up heads 15 out of 17 $mes. How likely is it that coin is fair? Could look

More information

Understanding. Design & Sta/s/cs. in Clinical Trials

Understanding. Design & Sta/s/cs. in Clinical Trials Understanding Design & Sta/s/cs in Clinical Trials 1 Why bother? 2 Experience- Based Medicine? Benjamin Rush Father of American Psychiatry 3 Experience- Based Medicine? Supported blood- lejng as treatment

More information

Aggressive Medical Management with or without Angioplasty and Sten8ng for Symptoma8c Intracranial Atherosclero8c Stenosis: Long Term Results

Aggressive Medical Management with or without Angioplasty and Sten8ng for Symptoma8c Intracranial Atherosclero8c Stenosis: Long Term Results Aggressive Medical Management with or without Angioplasty and Sten8ng for Symptoma8c Intracranial Atherosclero8c Stenosis: Long Term Results Disclosures Funding by NINDS U01 NS058728 Boston Scien8fic provided

More information

Welcome! Pragmatic Clinical Studies. David Hickam, MD, MPH Program Director Clinical Effectiveness Research. David Hickam, MD, MPH

Welcome! Pragmatic Clinical Studies. David Hickam, MD, MPH Program Director Clinical Effectiveness Research. David Hickam, MD, MPH Pragmatic Clinical Studies David Hickam, MD, MPH Program Director Clinical Effec2veness Research June 23, 2015 Welcome! David Hickam, MD, MPH Program Director Clinical Effectiveness Research 2 In this

More information

Biosta's'cs Board Review. Parul Chaudhri, DO Family Medicine Faculty Development Fellow, UPMC St Margaret March 5, 2016

Biosta's'cs Board Review. Parul Chaudhri, DO Family Medicine Faculty Development Fellow, UPMC St Margaret March 5, 2016 Biosta's'cs Board Review Parul Chaudhri, DO Family Medicine Faculty Development Fellow, UPMC St Margaret March 5, 2016 Review key biosta's'cs concepts Understand 2 X 2 tables Objec'ves By the end of this

More information

Learning Objec1ves. Study Design Considera1ons in Clinical Pharmacy

Learning Objec1ves. Study Design Considera1ons in Clinical Pharmacy 9/28/15 Study Design Considera1ons in Clinical Pharmacy Ludmila Bakhireva, MD, PhD, MPH Pree Sarangarm, PharmD, BCPS Learning Objec1ves Describe the features, advantages and disadvantages of the observa1onal

More information

Pragma&c Clinical Trials

Pragma&c Clinical Trials Pragma&c Clinical Trials Susanne May, PhD Associate Professor Department of Biosta&s&cs, School of Public Health, University of Washington Preliminaries PLEASE ask ques&ons at any &me REALLY good clinical

More information

Learning Objec1ves. Study Design Strategies. Cohort Studies 9/28/15

Learning Objec1ves. Study Design Strategies. Cohort Studies 9/28/15 9/28/15 Learning Objec1ves Describe the features, advantages and disadvantages of the observa1onal study designs Explain why the overall study design is important when evalua1ng studies & applying their

More information

Development and Applica0on of Real- Time Clinical Predic0ve Models

Development and Applica0on of Real- Time Clinical Predic0ve Models Development and Applica0on of Real- Time Clinical Predic0ve Models Ruben Amarasingham, MD, MBA Associate Professor, UT Southwestern Medical Center AHRQ- funded R24 UT Southwestern Center for Pa?ent- Centered

More information

Discordant MIC Analysis: Tes5ng for Superiority within a Non- inferiority Trial

Discordant MIC Analysis: Tes5ng for Superiority within a Non- inferiority Trial Discordant MIC Analysis: Tes5ng for Superiority within a Non- inferiority Trial Dean Follmann, Erica BriDain, and John Powers Na5onal Ins5tute of Allergy and Infec5ous Diseases November 19, 2014 1 Current

More information

Applica(on of Causal Inference Methods to Improve Treatment of HIV in Resource Limited Se?ngs

Applica(on of Causal Inference Methods to Improve Treatment of HIV in Resource Limited Se?ngs Applica(on of Causal Inference Methods to Improve Treatment of HIV in Resource Limited Se?ngs Maya Petersen works.bepress.com/maya_petersen Divisions of Biosta,s,cs and Epidemiology, University of California,

More information

Randomized Experiments with Noncompliance. David Madigan

Randomized Experiments with Noncompliance. David Madigan Randomized Experiments with Noncompliance David Madigan Introduction Noncompliance is an important problem in randomized experiments involving humans Includes e.g. switching subjects to standard therapy

More information

3/31/2015. Designing Clinical Research Studies: So You Want to Be an

3/31/2015. Designing Clinical Research Studies: So You Want to Be an Designing Clinical Research Studies: So You Want to Be an Inves@gator Andrea Bonny, MD Ellen Lançon Connor, MD On behalf Of The NASPAG Research CommiPee Objec@ves Learn to design a clinical research project

More information

PubH 7405: REGRESSION ANALYSIS. Propensity Score

PubH 7405: REGRESSION ANALYSIS. Propensity Score PubH 7405: REGRESSION ANALYSIS Propensity Score INTRODUCTION: There is a growing interest in using observational (or nonrandomized) studies to estimate the effects of treatments on outcomes. In observational

More information

The Mortality Effects of Re3rement: Evidence from Social Security Eligibility at Age 62

The Mortality Effects of Re3rement: Evidence from Social Security Eligibility at Age 62 The Mortality Effects of Re3rement: Evidence from Social Security Eligibility at Age 62 Maria D. Fitzpatrick Cornell University & NBER Timothy J. Moore George Washington University & NBER Funded by grants

More information

Public Health / Public Works Introduc6on, reitera6on + Lessons 1-2. Pioneer Winter IDH 3034, IDH 4007

Public Health / Public Works Introduc6on, reitera6on + Lessons 1-2. Pioneer Winter IDH 3034, IDH 4007 Public Health / Public Works Introduc6on, reitera6on + Lessons 1-2 Pioneer Winter IDH 3034, IDH 4007 What s the purpose of this class? Learning Objec6ves Understand the principles of public health, epidemiology,

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH PROPENSITY SCORE Confounding Definition: A situation in which the effect or association between an exposure (a predictor or risk factor) and

More information

Statistical Techniques for Analyzing Data From Prevention Trials: Treatment of No-Shows Using Rubin's Causal Model

Statistical Techniques for Analyzing Data From Prevention Trials: Treatment of No-Shows Using Rubin's Causal Model Psychological Methods 1998, Vol. 3, No. 2,147-159 Copyright 1998 by the American Psychological Association, Inc. 1082-989X/98/$3.00 Statistical Techniques for Analyzing Data From Prevention Trials: Treatment

More information

Mediation Analysis With Principal Stratification

Mediation Analysis With Principal Stratification University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 3-30-009 Mediation Analysis With Principal Stratification Robert Gallop Dylan S. Small University of Pennsylvania

More information

In this module we will cover Correla4on and Validity.

In this module we will cover Correla4on and Validity. In this module we will cover Correla4on and Validity. A correla4on coefficient is a sta4s4c that is o:en used as an es4mate of measurement, such as validity and reliability. You will learn the strength

More information

A Transla)onal Framework for Methodological Rigor to Improve Pa)ent Centered Outcomes in End of Life Cancer Research

A Transla)onal Framework for Methodological Rigor to Improve Pa)ent Centered Outcomes in End of Life Cancer Research A Transla)onal Framework for Methodological Rigor to Improve Pa)ent Centered Outcomes in End of Life Cancer Research Francesca Dominici, PhD Senior Associate Dean for Research Professor of Biostatistics

More information

The Brief Cogni-ve Assessment Tool (BCAT): A New Test Emphasizing Contextual Memory and Execu<ve Func<ons

The Brief Cogni-ve Assessment Tool (BCAT): A New Test Emphasizing Contextual Memory and Execu<ve Func<ons The Brief Cogni-ve Assessment Tool (BCAT): A New Test Emphasizing Contextual Memory and Execu

More information

Dichotomizing partial compliance and increased participant burden in factorial designs: the performance of four noncompliance methods

Dichotomizing partial compliance and increased participant burden in factorial designs: the performance of four noncompliance methods Merrill and McClure Trials (2015) 16:523 DOI 1186/s13063-015-1044-z TRIALS RESEARCH Open Access Dichotomizing partial compliance and increased participant burden in factorial designs: the performance of

More information

Key Statistical Considerations. Dr Delva Shamley

Key Statistical Considerations. Dr Delva Shamley Key Statistical Considerations Dr Delva Shamley WHAT STATISTICAL INFORMATION DO YOU NEED FOR A RESEARCH PROJECT? 1. Sample Size 2. Details of the type of data you will get from your outcome measures 3.

More information

Causal Validity Considerations for Including High Quality Non-Experimental Evidence in Systematic Reviews

Causal Validity Considerations for Including High Quality Non-Experimental Evidence in Systematic Reviews Non-Experimental Evidence in Systematic Reviews OPRE REPORT #2018-63 DEKE, MATHEMATICA POLICY RESEARCH JUNE 2018 OVERVIEW Federally funded systematic reviews of research evidence play a central role in

More information

Compliance Mixture Modelling with a Zero Effect Complier Class and Missing. Data.

Compliance Mixture Modelling with a Zero Effect Complier Class and Missing. Data. Biometrics 000, 000 000 DOI: 000 000 0000 Compliance Mixture Modelling with a Zero Effect Complier Class and Missing Data. Michael E. Sobel 1, Bengt Muthén 2 1 Department of Statistics, Columbia University

More information

Missing Data and Imputation

Missing Data and Imputation Missing Data and Imputation Barnali Das NAACCR Webinar May 2016 Outline Basic concepts Missing data mechanisms Methods used to handle missing data 1 What are missing data? General term: data we intended

More information

A large database study in the general population in England

A large database study in the general population in England Time-dependent markers of comorbidity severity and change are associated with increased risk of mortality in heart failure: A large database study in the general population in England Claire A. Rushton,

More information

Designs in Partially Controlled Studies: Messages from a Review

Designs in Partially Controlled Studies: Messages from a Review Johns Hopkins University, Dept. of Biostatistics Working Papers 2-1-2005 Designs in Partially Controlled Studies: Messages from a Review Fan Li Johns Hopkins Bloomberg School of Public Health, Department

More information

Propensity Score Analysis Shenyang Guo, Ph.D.

Propensity Score Analysis Shenyang Guo, Ph.D. Propensity Score Analysis Shenyang Guo, Ph.D. Upcoming Seminar: April 7-8, 2017, Philadelphia, Pennsylvania Propensity Score Analysis 1. Overview 1.1 Observational studies and challenges 1.2 Why and when

More information

Methods for treating bias in ISTAT mixed mode social surveys

Methods for treating bias in ISTAT mixed mode social surveys Methods for treating bias in ISTAT mixed mode social surveys C. De Vitiis, A. Guandalini, F. Inglese and M.D. Terribili ITACOSM 2017 Bologna, 16th June 2017 Summary 1. The Mixed Mode in ISTAT social surveys

More information

Bayesian and Frequentist Approaches

Bayesian and Frequentist Approaches Bayesian and Frequentist Approaches G. Jogesh Babu Penn State University http://sites.stat.psu.edu/ babu http://astrostatistics.psu.edu All models are wrong But some are useful George E. P. Box (son-in-law

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

Tackling Tobacco Through Re-engineered Primary Care

Tackling Tobacco Through Re-engineered Primary Care 2/27/18 Tackling Tobacco Through Re-engineered Primary Care Daren Wu, M.D. Chief Medical Officer Learning Objec>ves Understand the key stumbling blocks that can interfere with tobacco screening and treatment,

More information

PEDIATRIC CT SCAN WHERE ARE WE, DO WE REALLY KNOW THE RISKS AND WHICH WAY TO FOLLOW?

PEDIATRIC CT SCAN WHERE ARE WE, DO WE REALLY KNOW THE RISKS AND WHICH WAY TO FOLLOW? PEDIATRIC CT SCAN WHERE ARE WE, DO WE REALLY KNOW THE RISKS AND WHICH WAY TO FOLLOW? Carla Conceição, MD Neuroradiology Department Hospital D. Estefânia, CHLC Hospital da Luz PEDIATRIC CT SCAN WHERE ARE

More information

Economic outcomes: Method for implementa5on

Economic outcomes: Method for implementa5on Economic outcomes: Method for implementa5on Philippe Beutels Centre for Health Economics Research & Modelling Infec

More information

Debate Regarding Oseltamivir Use for Seasonal and Pandemic Influenza

Debate Regarding Oseltamivir Use for Seasonal and Pandemic Influenza Debate Regarding Oseltamivir Use for Seasonal and Pandemic Influenza Aeron Hurt WHO Collabora@ng Centre for Reference and Research on Influenza, Melbourne, Australia www.influenzacentre.org NA inhibitor

More information

Bayesian inference for causal mechanisms with application to a randomized study for postoperative pain control

Bayesian inference for causal mechanisms with application to a randomized study for postoperative pain control Biostatistics (2017) 00, 00, pp. 1 13 doi:10.1093/biostatistics/kxx010 Bayesian inference for causal mechanisms with application to a randomized study for postoperative pain control MICHELA BACCINI Dipartimento

More information

Evalua5ng Health Claims in Alt-Med

Evalua5ng Health Claims in Alt-Med Chapter 10 Evalua5ng Health Claims in Alt-Med There are in fact two things, science and opinion; the former begets knowledge, the laier ignorance. Hippocrates of Kos A Big Chunk of Change In the U.S. approximately

More information

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling STATISTICS IN MEDICINE Statist. Med. 2009; 28:3363 3385 Published online 3 September 2009 in Wiley InterScience (www.interscience.wiley.com).3721 Estimating drug effects in the presence of placebo response:

More information

G , G , G MHRN

G , G , G MHRN Estimation of treatment-effects from randomized controlled trials in the presence non-compliance with randomized treatment allocation Graham Dunn University of Manchester Research funded by: MRC Methodology

More information

Classifica4on. CSCI1950 Z Computa4onal Methods for Biology Lecture 18. Ben Raphael April 8, hip://cs.brown.edu/courses/csci1950 z/

Classifica4on. CSCI1950 Z Computa4onal Methods for Biology Lecture 18. Ben Raphael April 8, hip://cs.brown.edu/courses/csci1950 z/ CSCI1950 Z Computa4onal Methods for Biology Lecture 18 Ben Raphael April 8, 2009 hip://cs.brown.edu/courses/csci1950 z/ Binary classifica,on Given a set of examples (x i, y i ), where y i = + 1, from unknown

More information

Signal detec,on methods for vaccine spontaneous report data. Lionel Van Holle (GSK Vaccines) ISOP 2015, Praha

Signal detec,on methods for vaccine spontaneous report data. Lionel Van Holle (GSK Vaccines) ISOP 2015, Praha Signal detec,on methods for vaccine spontaneous report data Lionel Van Holle (GSK Vaccines) ISOP 2015, Praha Disclosure LVH is a full,me employee of GSK Vaccines and owns restricted shares in the company

More information

Overview of Perspectives on Causal Inference: Campbell and Rubin. Stephen G. West Arizona State University Freie Universität Berlin, Germany

Overview of Perspectives on Causal Inference: Campbell and Rubin. Stephen G. West Arizona State University Freie Universität Berlin, Germany Overview of Perspectives on Causal Inference: Campbell and Rubin Stephen G. West Arizona State University Freie Universität Berlin, Germany 1 Randomized Experiment (RE) Sir Ronald Fisher E(X Treatment

More information

Blue Cross Blue Shield of Michigan Building a Statewide PCMH Program: Design, Evalua>on Methods, and Results

Blue Cross Blue Shield of Michigan Building a Statewide PCMH Program: Design, Evalua>on Methods, and Results Blue Cross Blue Shield of Michigan Building a Statewide PCMH Program: Design, Evalua>on Methods, and Results Margaret Mason, MHSA Michael Paus6an, PhD, MS Amanda Markovitz, MPH 1 Overview of BCBSM Serving

More information

Carrying out an Empirical Project

Carrying out an Empirical Project Carrying out an Empirical Project Empirical Analysis & Style Hint Special program: Pre-training 1 Carrying out an Empirical Project 1. Posing a Question 2. Literature Review 3. Data Collection 4. Econometric

More information

Resolving the PSA testing controversy. Professor Villis Marshall AC Professor Bruce Armstrong AM Professor Mark Frydenberg

Resolving the PSA testing controversy. Professor Villis Marshall AC Professor Bruce Armstrong AM Professor Mark Frydenberg Resolving the PSA testing controversy Professor Villis Marshall AC Professor Bruce Armstrong AM Professor Mark Frydenberg Professor Villis Marshall AC Introduc)on Guidelines aim to inform tes)ng for the

More information

Clinical Research Project Design and Guidelines: Choosing a Research Ques8on

Clinical Research Project Design and Guidelines: Choosing a Research Ques8on Clinical Research Project Design and Guidelines: Choosing a Research Ques8on Odunayo M. Oluwatosin, FMCS (Nig) Department of Plas?c Surgery University College Hospital Ibadan 1 Learning outcome: by the

More information

For each of the following cases, describe the population, sample, population parameters, and sample statistics.

For each of the following cases, describe the population, sample, population parameters, and sample statistics. Chapter 5: Statistical Reasoning Section 5A Fundamentals of Statistics Statistics is the science of collecting, organizing and interpreting data Statistics is the data that describe or summarize something

More information

Introduction to Observational Studies. Jane Pinelis

Introduction to Observational Studies. Jane Pinelis Introduction to Observational Studies Jane Pinelis 22 March 2018 Outline Motivating example Observational studies vs. randomized experiments Observational studies: basics Some adjustment strategies Matching

More information

CDC's Tips From Former Smokers (Tips) Campaign and Its Impact on Quitlines

CDC's Tips From Former Smokers (Tips) Campaign and Its Impact on Quitlines CDC's Tips From Former Smokers (Tips) Campaign and Its Impact on Quitlines Lei Zhang and Ann Malarcher Centers for Disease Control and Preven8on OSH Evalua6on Webinar March 31, 2015 The findings and conclusions

More information

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3 Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3 Analysis of Vaccine Effects on Post-Infection Endpoints p.1/40 Data Collected in Phase IIb/III Vaccine Trial Longitudinal

More information

Forward Looking Statements

Forward Looking Statements March 12, 2015 1 Forward Looking Statements This presenta5on contains forward- looking statements within the meaning of the Private Securi5es Li5ga5on Reform Act of 1995 that involve substan5al risks and

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Fundamental Clinical Trial Design

Fundamental Clinical Trial Design Design, Monitoring, and Analysis of Clinical Trials Session 1 Overview and Introduction Overview Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics, University of Washington February 17-19, 2003

More information

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data Missing data Patrick Breheny April 3 Patrick Breheny BST 71: Bayesian Modeling in Biostatistics 1/39 Our final topic for the semester is missing data Missing data is very common in practice, and can occur

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Rajesh Mangrulkar, M.D., 2013 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Non-commercial Share Alike 3.0 License: http://creativecommons.org/licenses/by-nc-sa/3.0/

More information

Confounding by indication developments in matching, and instrumental variable methods. Richard Grieve London School of Hygiene and Tropical Medicine

Confounding by indication developments in matching, and instrumental variable methods. Richard Grieve London School of Hygiene and Tropical Medicine Confounding by indication developments in matching, and instrumental variable methods Richard Grieve London School of Hygiene and Tropical Medicine 1 Outline 1. Causal inference and confounding 2. Genetic

More information

PREDICTING PRINCIPAL STRATUM MEMBERSHIP IN RANDOMIZED ENVIRONMENTAL INTERVENTIONS

PREDICTING PRINCIPAL STRATUM MEMBERSHIP IN RANDOMIZED ENVIRONMENTAL INTERVENTIONS PREDICTING PRINCIPAL STRATUM MEMBERSHIP IN RANDOMIZED ENVIRONMENTAL INTERVENTIONS by Katherine Freeland A thesis submitted to Johns Hopkins University in conformity with the requirements for the degree

More information

Addressing the Challenge of Medica3on Adherence. Jan Berger MD, MJ President and CEO Health Intelligence Partners

Addressing the Challenge of Medica3on Adherence. Jan Berger MD, MJ President and CEO Health Intelligence Partners Addressing the Challenge of Medica3on Adherence Jan Berger MD, MJ President and CEO Health Intelligence Partners Drugs Have Become the Mainstay of Healthcare Treatment America s Other Drug Problem Non-adherence

More information

Study Designs in HIV Research

Study Designs in HIV Research Study Designs in HIV Research Colette Smith (based on slides from Fiona Lampe) UK CAB meeting Thursday 17th August 2017 Main types of research studies Cross-sectional Observational Ecological Cohort Case-control

More information

Class 1: Introduction, Causality, Self-selection Bias, Regression

Class 1: Introduction, Causality, Self-selection Bias, Regression Class 1: Introduction, Causality, Self-selection Bias, Regression Ricardo A Pasquini April 2011 Ricardo A Pasquini () April 2011 1 / 23 Introduction I Angrist s what should be the FAQs of a researcher:

More information

Dylan Small Department of Statistics, Wharton School, University of Pennsylvania. Based on joint work with Paul Rosenbaum

Dylan Small Department of Statistics, Wharton School, University of Pennsylvania. Based on joint work with Paul Rosenbaum Instrumental variables and their sensitivity to unobserved biases Dylan Small Department of Statistics, Wharton School, University of Pennsylvania Based on joint work with Paul Rosenbaum Overview Instrumental

More information

Local EBQ Course - PERU Epidemiology, Biostatistics & Qualitative Research Methods

Local EBQ Course - PERU Epidemiology, Biostatistics & Qualitative Research Methods , Biostatistics & 20 places available Iquitos, Peru Apply now for this 3-week course! Applica7on procedure and criteria for academic eligibility and funding can be found on www.uantwerp.be/ebq-peru Deadline

More information

THE RARE DISEASES CLINICAL RESEARCH NETWORK AS A NESTED CULTURAL COMMONS

THE RARE DISEASES CLINICAL RESEARCH NETWORK AS A NESTED CULTURAL COMMONS THE RARE DISEASES CLINICAL RESEARCH NETWORK AS A NESTED CULTURAL COMMONS Bre5 Frischmann, Benjamin N. Cardozo School of Law Katherine J. Strandburg, NYU School of Law Can Cui, NYU School of Law Case study

More information

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method

Lecture Outline. Biost 590: Statistical Consulting. Stages of Scientific Studies. Scientific Method Biost 590: Statistical Consulting Statistical Classification of Scientific Studies; Approach to Consulting Lecture Outline Statistical Classification of Scientific Studies Statistical Tasks Approach to

More information

Estimating average treatment effects from observational data using teffects

Estimating average treatment effects from observational data using teffects Estimating average treatment effects from observational data using teffects David M. Drukker Director of Econometrics Stata 2013 Nordic and Baltic Stata Users Group meeting Karolinska Institutet September

More information

Today. HW6 ques.ons? Next reading presenta.on: Friday (R25) Sta.s.cal methods

Today. HW6 ques.ons? Next reading presenta.on: Friday (R25) Sta.s.cal methods Today HW6 ques.ons? Next reading presenta.on: Friday (R25) Sta.s.cal methods Inferen.al sta.s.cs Inferen.al sta.s.cs: Sta.s.cal tests you apply to quan.ta.ve data to determine the likelihood that the results

More information

Propensity Score Methods for Causal Inference with the PSMATCH Procedure

Propensity Score Methods for Causal Inference with the PSMATCH Procedure Paper SAS332-2017 Propensity Score Methods for Causal Inference with the PSMATCH Procedure Yang Yuan, Yiu-Fai Yung, and Maura Stokes, SAS Institute Inc. Abstract In a randomized study, subjects are randomly

More information

Clinical trials with incomplete daily diary data

Clinical trials with incomplete daily diary data Clinical trials with incomplete daily diary data N. Thomas 1, O. Harel 2, and R. Little 3 1 Pfizer Inc 2 University of Connecticut 3 University of Michigan BASS, 2015 Thomas, Harel, Little (Pfizer) Clinical

More information

Results Members Survey 2012

Results Members Survey 2012 Results Members Survey 2012 Phil Sell October, 2012 Members Survey 2012 Survey open September 26 th to October 7 th Separate survey for non member conference par:cipants 199 members ( sample size 563)

More information

Instrumental Variables I (cont.)

Instrumental Variables I (cont.) Review Instrumental Variables Observational Studies Cross Sectional Regressions Omitted Variables, Reverse causation Randomized Control Trials Difference in Difference Time invariant omitted variables

More information

Oncology Care Model Overview

Oncology Care Model Overview Oncology Care Model Overview Centers for Medicare & Medicaid Services Innova3on Center (CMMI) September 2017 Innova3on at CMS Center for Medicare & Medicaid Innova3on (Innova3on Center) Established by

More information

Nonresponse Rates and Nonresponse Bias In Household Surveys

Nonresponse Rates and Nonresponse Bias In Household Surveys Nonresponse Rates and Nonresponse Bias In Household Surveys Robert M. Groves University of Michigan and Joint Program in Survey Methodology Funding from the Methodology, Measurement, and Statistics Program

More information

Process of Science and hypothesis tes2ng in Behavioral Ecology

Process of Science and hypothesis tes2ng in Behavioral Ecology Process of Science and hypothesis tes2ng in Behavioral Ecology Goal: understand the way that scien2fic hypotheses and methodologies are used to gain knowledge. What separates science from non- science?

More information

Quan%ta%ve Tools for Benefit/Risk Op%miza%on in Medical Imaging MIRD CommiAee Pamphlet

Quan%ta%ve Tools for Benefit/Risk Op%miza%on in Medical Imaging MIRD CommiAee Pamphlet Quan%ta%ve Tools for Benefit/Risk Op%miza%on in Medical Imaging MIRD CommiAee Pamphlet Wesley Bolch, PhD, PE, DABHP, FHPS, FAAPM Medical Physics Program Department of Biomedical Engineering University

More information

Propensity scores: what, why and why not?

Propensity scores: what, why and why not? Propensity scores: what, why and why not? Rhian Daniel, Cardiff University @statnav Joint workshop S3RI & Wessex Institute University of Southampton, 22nd March 2018 Rhian Daniel @statnav/propensity scores:

More information

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology Sylvia Richardson 1 sylvia.richardson@imperial.co.uk Joint work with: Alexina Mason 1, Lawrence

More information

Posi%ve Psychotherapy for Youth at Clinical High- Risk for Psychosis

Posi%ve Psychotherapy for Youth at Clinical High- Risk for Psychosis Posi%ve Psychotherapy for Youth at Clinical High- Risk for Psychosis Lauren Drvaric, MSc. Psych., PhD. Candidate (co- inves%gator) Doctoral Research Trainee, Complex Mental Illness, Centre for Addic%on

More information

Special Literature Review: 2014 Awards for Outstanding Research Articles in Biosurveillance Finalists

Special Literature Review: 2014 Awards for Outstanding Research Articles in Biosurveillance Finalists Welcome to the Webinar! Special Literature Review: 2014 Awards for Outstanding Research Articles in Biosurveillance Finalists Presenters: Sylvia Halasz, PhD, MSc, YP.com Judy Akkina, MPH, PhD, USDA, APHIS

More information

Small-area estimation of mental illness prevalence for schools

Small-area estimation of mental illness prevalence for schools Small-area estimation of mental illness prevalence for schools Fan Li 1 Alan Zaslavsky 2 1 Department of Statistical Science Duke University 2 Department of Health Care Policy Harvard Medical School March

More information

Methods for Addressing Selection Bias in Observational Studies

Methods for Addressing Selection Bias in Observational Studies Methods for Addressing Selection Bias in Observational Studies Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA What is Selection Bias? In the regression

More information

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY Lingqi Tang 1, Thomas R. Belin 2, and Juwon Song 2 1 Center for Health Services Research,

More information

Dr. Alessio Signori Longitudinal trajectories of EDSS in primary progressive MS pa:ents A latent class approach

Dr. Alessio Signori Longitudinal trajectories of EDSS in primary progressive MS pa:ents A latent class approach Dr. Alessio Signori Longitudinal trajectories of EDSS in primary progressive MS pa:ents A latent class approach Department of Health Sciences Section of Biostatistics University of Genoa, Italy Mul%ple

More information

Introduction; Study design

Introduction; Study design ; Study design Patrick Breheny January 12 Patrick Breheny STA 580: Biostatistics I 1/43 What is statistics? What is biostatistics, and why is it important? The statistical framework Statistics is the science

More information

Bayesian Nonparametric Methods for Precision Medicine

Bayesian Nonparametric Methods for Precision Medicine Bayesian Nonparametric Methods for Precision Medicine Brian Reich, NC State Collaborators: Qian Guan (NCSU), Eric Laber (NCSU) and Dipankar Bandyopadhyay (VCU) University of Illinois at Urbana-Champaign

More information

Tuberculosis: What's new in diagnos6cs and management?

Tuberculosis: What's new in diagnos6cs and management? Tuberculosis: What's new in diagnos6cs and management? Colin Menezes, Department of Internal Medicine, Chris Hani Baragwanath Academic Hospital, University of the Witwatersrand. Objec6ves of this talk:

More information

Causal Inference in Randomized Experiments With Mediational Processes

Causal Inference in Randomized Experiments With Mediational Processes Psychological Methods 2008, Vol. 13, No. 4, 314 336 Copyright 2008 by the American Psychological Association 1082-989X/08/$12.00 DOI: 10.1037/a0014207 Causal Inference in Randomized Experiments With Mediational

More information

EPI 200C Final, June 4 th, 2009 This exam includes 24 questions.

EPI 200C Final, June 4 th, 2009 This exam includes 24 questions. Greenland/Arah, Epi 200C Sp 2000 1 of 6 EPI 200C Final, June 4 th, 2009 This exam includes 24 questions. INSTRUCTIONS: Write all answers on the answer sheets supplied; PRINT YOUR NAME and STUDENT ID NUMBER

More information

F. Bochicchio The approach of the INRAP for preven8on and mi8ga8on and the new Direc8ve on BSS. Radon Protec8on Conf. Dresden, 2 3 Dec.

F. Bochicchio The approach of the INRAP for preven8on and mi8ga8on and the new Direc8ve on BSS. Radon Protec8on Conf. Dresden, 2 3 Dec. Radon protec8on conference New challenges for the European construc8on and ven8la8on branches of Europe Dresden (Sachsen, Germany) December 2 nd 3 rd 2013 The approach of the Italian Na1onal Radon Ac1on

More information

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1 Welch et al. BMC Medical Research Methodology (2018) 18:89 https://doi.org/10.1186/s12874-018-0548-0 RESEARCH ARTICLE Open Access Does pattern mixture modelling reduce bias due to informative attrition

More information

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions

Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions Bayesian Confidence Intervals for Means and Variances of Lognormal and Bivariate Lognormal Distributions J. Harvey a,b, & A.J. van der Merwe b a Centre for Statistical Consultation Department of Statistics

More information

UN Handbook Ch. 7 'Managing sources of non-sampling error': recommendations on response rates

UN Handbook Ch. 7 'Managing sources of non-sampling error': recommendations on response rates JOINT EU/OECD WORKSHOP ON RECENT DEVELOPMENTS IN BUSINESS AND CONSUMER SURVEYS Methodological session II: Task Force & UN Handbook on conduct of surveys response rates, weighting and accuracy UN Handbook

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information