Propensity Score. Overview:

Size: px
Start display at page:

Download "Propensity Score. Overview:"

Transcription

1 Propensity Score Overview: What do we use a propensity score for? How do we construct the propensity score? How do we implement propensity score es<ma<on in STATA?

2 Joke (kind of ) Two heart surgeons (Jack and Jill) walk into a bar. Jack: I just finished my 100 th heart surgery! Jill: I finished my 100th heart surgery last week. Which probably means I m a beper heart surgeon. How many of your pa<ents died within 3 months of surgery? I ve only had 10 die. Jack: Five. So I m probably the beper surgeon. Jill: Or maybe mine are older and have a higher risk than your pa<ents. There may be differences in the pa<ents characteris<cs between Jack and Jill We want to show the difference due to treatment (Jill) We want to compare apples to apples not apples to oranges

3 Purpose of propensity scores It can produce apples- to- apples comparisons when treatment is non- random (non- ignorable treatment assignment) Provides a way to summarize covariate informa<on about treatment selec<on into a single number (scalar) Can be used to adjust for differences via study design, or matching, or during es<ma<on of the treatment effect (e.g., subclassifica.on or regression)

4 Propensity score es<ma<on Some caveats This is only relevant for selec<on on observables If you cannot write down a condi<oning strategy such that condi<oning on X will sa<sfy the backdoor criterion, then this is not the research design you choose You need to iden<fy the confounders, X, that will block all back doors based on economic theory and you will need data on them

5 BePer example: a case in which the propensity score is useful for causal inference Suppose that we are interested in whether a scholarship program caused children in to spend more years in high school (9-12). Suppose every 8 th grade graduate is eligible for this program You have data on every child, including test scores, family income, age, gender, etc. Scholarships are awarded based on some combina<on of test scores, family income, gender, etc., but you don t know the exact formula.

6 Mo<va<on (cont.) Ignorable treatment assignment: Scholarships are assigned to students randomly, independent of how a student is expected to perform in high school Calculate ATE by es<ma<ng simple difference in mean outcomes: 1 N (Y D =1) But what if ignorability is violated? For instance, assume you know that children with higher test scores are more likely to get the scholarship (posi<ve selec<on), but you don t know how important this and other factors are, you just know that the decision is based on informa<on you have (X) and some randomness. What can you do with this informa<on? 1 N (Y D = 0)

7 Mo<va<on (cont.) In principle, you could es<mate it using OLS controlling for X: Where X is a matrix of covariates that you think affect the probability of receiving a scholarship. OLS consistently es<mates the condi<onal mean, but if probability of gecng a scholarship is not a linear func<on of X, this condi<onal mean es<mate may not be informa<ve. Usually, we won t know how the selec<on depended on X, only that it did. For instance, they may use discrete cutoffs rather than a linear func<on

8 Mo<va<on (cont.) Suppose your variables are not con<nuous, but they are categories (somewhat arbitrarily). E.g. family income above or below $50 per week, scores above or below the mean, sex, age, etc. Now, you could put in dummy variables for each category and interac<on between all dummies. This would dis<nguish every group formed by the categories. Or you could run separate regressions for each group This is more flexible since it allows the effect of the scholarship to differ by group. These methods are in principle correct, but they are only feasible if you have a lot of data and few categories.

9 Construc<ng the Propensity Score Es<ma<on of average treatment effects based on propensity score es<ma<on can handle sparseness and ignorance about the func<onal form associated with treatment assignment. You will first need to have a selec<on into the treatment (in our case the scholarship) that is based on observables, or selec<on on observables. The following gives a brief overview of how the propensity score is constructed. In prac<ce, you can download a canned Stata command that will do all of this for you.

10 Defini<on and General Idea Defini:on: The propensity score is the condi<onal probability of being assigned to the treatment group (e.g., 9-12 grade scholarship), condi<onal on the par<cular covariates (X). Pr(D=1 X) is some marginal probability (e.g., 55%) The idea is to compare units who, based solely on their observables, had very similar probabili<es of being placed into treatment If condi<onal on X, two units have a similar probability of treatment, then we say they have similar propensity scores We then think that all the difference in the outcome variable is due to the treatment. If we compare a unit in the treatment group to a control group unit with two similar propensity scores, then condi<onal on the propensity score, all remaining varia<on between these two is randomness if selec<on on observables

11 First stage Es<ma<on using this method is a two- stage procedure First stage: es<mates the propensity score Second stage: calculate the average causal effect of interest by averaging differences in outcomes over units with similar propensity scores First stage: es:mate the propensity score: First, es<mate the following equa<on with binary treatment (D) on the LHS, and covariates (X) that determine selec<on into treatment on RHS using logit or probit model: Prob(D =1 X) = X +! Second, using es<mated coefficients, calculate the predicted LHS ˆ i = ˆX i The propensity score is just the predicted condi<onal probability of treatment (using es<mated coefficients on X) for each unit

12 Algorithm 1) Sort your data by the propensity score and divide it into blocks (groups) of observa<ons with similar propensity sores. 2) Within each block, test (using a t- test), whether the means of the covariates are equal in the treatment and control group. If so à stop, you re done with the first stage 3) If a par<cular block has one or more unbalanced covariates, divide that block into finer blocks and re- evaluate 4) If a par<cular covariate is unbalanced for mul<ple blocks, modify the ini<al logit or probit equa<on by including higher order terms and/or interac<ons with that covariate and start again.

13 Second Stage In the second stage, we look at the effect of treatment on the outcome (in our example of gecng the scholarship on years of schooling), using the propensity score. Once you have determined your propensity score with the procedure above, there are several ways to use it. I ll present two of them (canned version in Stata for both): Stra<fying on the propensity score Divide the data into blocks based on the propensity score (blocks are determined with the algorithm). Run the second stage regression within each block. Calculate the weighted mean of the within- block es<mates to get the average treatment effect. Matching on the propensity score Match each treatment observa<on with one or more control observa<ons, based on similar propensity scores. You then include a dummy for each matched group, which controls for everything that is common within that group.

14 Balancing within blocks 1. Sort the data by the propensity score 2. Divide the data into groups called blocks that have similar propensity scores (e.g., to 0.10, 0.10 to 0.20, etc.) 3. For each block, test whether the means of the covariates are equal for treatment and control using a t- test a. If they are, you are done with the first stage 4. If a par<cular block has one or more unbalanced covariates (X), divide that block into finer blocks and re- evaluate 5. If a par<cular covariate is unbalanced for mul<ple blocks, modify the ini<al logit or probit equa<on by including higher order terms and/or interac<ons with that covariate and start again

15 Implementa<on in STATA Mul<ple methods for es<ma<ng the propensity score Download psmatch2 from ssc ssc install psmatch2, replace First stage: pscore treat X 1 X 2 X 3, pscore(scorename) Second stage: apr (for matching) or aps (for stra<fying): a4r outcome treat, pscore(scorename)

16 General Remarks The propensity score approach becomes more appropriate the more we have randomness determining who gets treatment (closer to randomized experiment). The propensity score doesn t work very well if almost everyone with a high propensity score gets treatment and almost everyone with a low score doesn t: we need to be able to compare people with similar propensi<es who did and did not get treatment. The propensity score approach doesn t correct for unobservable variables that affect whether observa<ons receive treatment.

17 NSW example Comparison of propensity score matching with experimental results

18 NSW program During the mid- 1970s, Manpower Demonstra<on Research Corpora<on (MDRC) operated the Na<onal Supported Work Demonstra<on (NSW) NSW was a temporary employment program designed to help disadvantaged workers lacking basic job skills move into the labor market by giving them work experience and counseling in a sheltered environment Unlike other federally sponsored employment and training programs, though, the NSW program assigned qualified applicants to training posi<ons randomly Treatment group: received all the benefits of the NSW program Control group: let to fend for themselves NSW admiped into the program AFDC women, ex- drug addicts, ex- criminal offenders, and high school dropouts of both sexes

19 NSW Program Treatment group members were: guaranteed a job for 9-18 months depending on the target group and site divided into crews of 3-5 par<cipants who worked together and met frequently with an NSW counselor to discuss grievances and performance paid for their work Wage schedule offered the trainees lower wage rates than they would ve received on a regular job, but allowed their earnings to increase for sa<sfactory performance and apendance Ater their term expired, they were forced to find regular employment The type of work varied within sites gas sta<on apendant, working at a printer shop and males and females were frequently performing different kinds of work This was why the program costs varied across sites and target groups The program cost $9,100 per AFDC par<cipant and approximately $6,800 for other target groups trainees in 1982 dollars (US)

20 NSW Program MDRC collected earnings and demographic informa<on from both treatment and control at baseline and every 9 months thereater Conducted up to 4 post- baseline interviews

21 LaLonde (1986) study LaLonde, Robert J. (1986). Evalua<ng the Econometric Evalua<ons of Training Programs with Experimental Data. American Economic Review. 76(4): LaLonde s ideas: Outcome variable: Annual earnings in 1978 Get unbiased es<mate of the job training program s effects using randomized control group Compare that with what you get by selec<ng a control group from the en<re popula<on that looks like the treatment group using various causal inference methods

22 Need for a control group The fundamental problem of causal inference is causality is defined as the difference between two poten<al outcomes states, but for each individual, we only observe one of these. We are missing data on each trainees counterfactual what they would ve earned had they not been in the NSW experiment

23 Choice of a control group Best op<on: Randomize so that independence is sa<sfied Control group and treatment group are different only by random chance Eliminates bias due to baseline differences between the two groups and the heterogeneous treatment effects bias Oten<mes these kinds of randomized controls aren t available so labor economists would instead sample from various datasets to create (non- experimental) control groups So LaLonde sampled a non- experimental control group from two surveys: the Current Popula<on Survey (CPS) and the Panel Study of Income Dynamics (PSID) Sampled the en<re working popula<on Sampled those not working in 1976 Sampled those not working in 1975 or 1976

24 Similarity of treatment and control groups Treatment and control groups need to be similar. But in what way should they be similar? Most importantly, they need to be similar with regards to income pre- treatment since income is what we ll be examining post- treatment. So what did LaLonde find? First column is treatment group earnings in 1978 Second column is randomized control group Everything else are the non- random control groups

25 TABLE 2-ANNUAL EARNINGS OF NSW TREATMENTS, CONTROLS, AND EIGHT CANDIDATE COMPARISON GROUPS FROM THE PSID AND THE CPS-SSA Comparison Groupa,b Treat- CPS- CPS- CPS- CPS- Year ments Controls PSID-1 PSID-2 PSID-3 PSID-4 SSA-1 SSA-2 SSA-3 SSA $895 $877 7,303 2, ,654 7,788 3,748 4,575 2, (81) $1,794 (90) $646 (317) 7,442 (286) 2,697 (189) 665 (428) 6,770 (63) 8,547 (250) 4,774 (135) 3,800 (333) 2,036 (99) (63) (327) (317) (157) (463) (65) (302) (128) (337) 1977 $6,143 $1,518 7,983 3, ,213 8,562 4,851 5,277 2,844 (140) (112) (335) (376) (229) (484) (68) (317) (153) (450) 1978 $4,526 $2,885 8,146 3,636 1,631 7,564 8,518 5,343 5,665 3,700 (270) (244) (339) (421) (381) (480) (72) (365) (166) (593) 1979 $4,670 $3,819 8,016 3,569 1,602 7,482 8,023 5,343 5,782 3,733 (226) (208) (334) (381) (334) (462) (73) (371) (170) (543) Number of Observations , , a The Comparison Groups are defined as follows: PSID-1: All female household heads continuously from 1975 through 1979, who were between 20 and 55-years-old and did not classify themselves as retired in 1975; PSID-2: Selects from the PSID-1 group all women who received AFDC in 1975; PSID-3: Selects from the PSID-2 all women who were not working when surveyed in 1976; PSID-4: Selects from the PSID-1 group all women with children, none of whom are less than 5-years-old; CPS-SSA -1: All females from Westat CPS-SSA sample; CPS-SSA -2: Selects from CPS-SSA-1 all females who received AFDC in 1975; CPS-SSA-3: Selects from CPS-SSA-1 all females who were not working in the spring of 1976; CPS-SSA -4: Selects from CPS-SSA-2 all females who were not working in the spring of ball earnings are expressed in 1982 dollars. The numbers in parentheses are the standard errors. For the NSW treatments and controls, the number of observations refer only to 1975 and In the other years there are fewer observations, especially in At the time of the resurvey in 1979, treatments had been out of Supported Work for an average of 20 months.

26 TABLE 3-ANNUAL EARNINGS OF NSW MALE TREATMENTS, CONTROLS, AND SIX CANDIDATE COMPARISON GROUPS FROM THE PSID AND CPS-SSA Comparison Groupa,b Year Treatments Controls PSID-1 PSID-2 PSID-3 CPS-SSA-1 CPS-SSA-2 CPS-SSA $3,066 $3,027 19,056a 7,569 2,611 13,650 7,387 2,729 (283) (252) (272) (568) (492) (73) (206) (197) 1976 $4,035 $2,121 20,267 6,152 3,191 14,579 6,390 3,863 (215) (163) (296) (601) (609) (75) (187) (267) 1977 $6,335 $3,403 20,898 7,985 3,981 15,046 9,305 6,399 (376) (228) (296) (621) (594) (76) (225) (398) 1978 $5,976 $5,090 21,542 9,996 5,279 14,846 10,071 7,277 (402) (227) (311) (703) (686) (76) (241) (431) Number of Observations , ,992 1, athe Comparison Groups are defined as follows: PSID-1: All male household heads continuously from 1975 through 1978, who were less than 55-years-old and did not classify themselves as retired in 1975; PSID-2: Selects from the PSID-1 group all men who were not working when surveyed in the spring of 1976; PSID-3: Selects from the PSID-1 group all men who were not working when surveyed in either spring of 1975 or 1976; CPS-SSA-1: All males based on Westat's criteria, except those over 55-years-old; CPS-SSA-2: Selects from CPS-SSA-1 all males who were not working when surveyed in March 1976; CPS-SSA-3: Selects from the CPS-SSA-1 unemployed males in 1976 whose income in 1975 was below the poverty level. ball earnings are expressed in 1982 dollars. The numbers in parentheses are the standard errors. The number of observations refer only to 1975 and In the other years there are fewer observations. The sample of treatments is smaller than the sample of controls because treatments still in Supported Work as of January 1978 are excluded from the sample, and in the young high school target group there were by design more controls than treatments.

27 TABLE 4-EARNINGS COMPARISONS AND ESTIMATED TRAINING EFFECTS FOR THE NSW AFDC PARTICIPANTS USING COMPARISON GROUPS FROM THE PSID AND THE CPS-SSAa b Difference in NSW Treatment Earnings Differences: Unrestricted Less Comparison Group Difference in Difference in Controlling for Earnings Earnings Differences: All Observed Earnings Growth Quasi Difference Variables and Comparison Pre-Training Post-Training Treatments Less in Earnings Pre-Training Earnings Year, 1975 Year, 1979 Comparisons Growth Earnings Name of Growth Unad- Ad- Unad- Ad- Without With Unad- Ad- Without With Comparison justed justedc justed justedc Age Age justed justedc AFDC AFDC Groupd (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) 2, Controls (220) (122) (122) (307) (306) (323) (323) (308) (306) (312) PSID ,443-4,882-3,357-2,143 3,097 2, , ,097 (210) (326) (336) (403) (425) (317) (333) (357) (380) (409) (491) PSID-2 1,242-1,467-1,515 1, ,568 2,392 1,764 1,535 1,826 - (314) (216) (224) (468) (484) (473) (481) (472) (487) (537) PSID ,057 2,915 3,145 3,020 3,070 2,930 2,919 - PSID-4 (351) (202) (208) (532) (543) (557) (563) (531) (543) (592) 928-5,694-4,976-2,822-2,268 2,883 2,655 1, ,406 2,146 (311) (306) (323) (460) (491) (417) (434) (483) (503) (542) (652) CPS-SSA ,928-5,813-3,363-2,650 3,578 3,501 1,214 1, ,041 (64) (272) (309) (320) (365) (280) (282) (272) (309) (349) (503) CPS-SSA-2 1,595-2,888-2, ,215 2, (360) (204) (256) (428) (536) (438) (446) (468) (554) (651) CPS-SSA-3 1,207-3,715-3,150-1, ,603 2, ,246 CPS-SSA-4 (166) (226) (325) (311) (452) (307) (328) (305) (429) (481) (720) 1,684-1, ,126 1,833 1, (524) (249) (283) (630) (716) (654) (663) (637) (717) (814) a The columns above present the estimated training effect for each econometric model and comparison group. The dependent variable is earnings in Based on the experimental data, an unbiased estimate of the impact of training presented in col. 4 is $851. The first three columns present the difference between each comparison group's 1975 and 1979 earnings and the difference between the pre-training earnings of each comparison group and the NSW treatments. bestimates are in 1982 dollars. The numbers in parentheses are the standard errors. cthe exogenous variables used in the regression adjusted equations are age, age squared, years of schooling, high school dropout status, and race. dsee Table 2 for definitions of the comparison groups.

28 TABLE 5-EARNINGS COMPARISONS AND ESTIMATED TRAINING EFFECTS FOR THE NSW MALE PARTICIPANTS USING COMPARISON GROUPS FROM THE PSID AND THE CPS-SSAa,b Difference in NSW Treatment NsW Treatment Earni Earnings Differences: Difference in Unrestricted Difference in LessEComparnigso GroupEarnings Earnings Differences: Growth Quasi Difference Controlling for Comparison Pre-Training Post-Training Treatments Less in Earnings All Observed Year, 1975 Earnings Year, 1978 Comparisons Growth All and Name of Growth Unad- Ad- Unad- Ad- Without With Unad- Ad- Pre-Training Comparison justed justedc justed justed' Age Age justed justed' Earnings Groupd (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) Controls $2,063 $39 $-21 $886 $798 $847 $856 $897 $802 $662 (325) (383) (378) (476) (472) (560) (558) (467) (467) (506) PSID-1 $2,043 -$15,997 -$7,624 -$15,578 -$8,067 $425 -$749 -$2,380 -$2,119 -$1,228 (237) (795) (851) (913) (990) (650) (692) (680) (746) (896) PSID-2 $6,071 - $4,503 - $3,669 - $4,020 - $3,482 $484 - $650 - $1,364 - $1,694 - $792 (637) (608) (757) (781) (935) (738) (850) (729) (878) (1024) PSID-3 ($3,322 ($455 $455 $697 -$509 $242 -$1,325 $629 - $552 $397 (780) (539) (704) (760) (967) (884) (1078) (757) (967) (1103) CPS-SSA-1 $1,196 -$10,585 -$4,654 -$8,870 -$4,416 $1,714 $195 -$1,543 -$1,102 -$805 (61) (539) (509) (562) (557) (452) (441) (426) (450) (484) CPS-SSA-2 $2,684 -$4,321 -$1,824 -$4,095 -$1,675 $226 -$488 -$1,850 -$782 -$319 (229) (450) (535) (537) (672) (539) (530) (497) (621) (761) CPS-SSA-3 $4,548 $337 $878 -$1,300 $224 -$1,637 -$1,388 -$1,396 $17 $1,466 (409) (343) (447) (590) (766) (631) (655) (582) (761) (984) athe columns above present the estimated training effect for each econometric model and comparison group. The dependent variable is earnings in Based on the experimental data an unbiased estimatc of the impact of training presented in col. 4 is $886. The first three columns present the difference between each comparison group's 1975 and 1978 earnings and the difference between the pre-training earnings of each comparison group and the NSW treatments. bestimates are in 1982 dollars. The numbers in parentheses are the standard errors. 'The exogenous variables used in the regression adjusted equations are age, age squared, years of schooling, high school dropout status, and race. dsee Table 3 for definitions of the comnarison arouns.

29 Lessons What were the take- aways? Fairly pessimis<c findings observa<onal data and causal inference methods available at that <me performed poorly when trying to reproduce the known ATE from the randomiza<on What did he do? Linear regression, fixed effects, latent variable selec<on modeling His es<mated treatment effect for women tended to overes<mate the impact of the program posi<ve self- selec<on But it tended to underes<mate the impact of the program for men nega<ve self- selec<on Why should you care? Even though the control group might seem like a good guess for the treatment group, your answers may s<ll be significantly biased

30 Dehija and Wahba (1999; 2002) Dehejia, Rajeev H. and Sadek Wahba (1999). Causal Effects in Nonexperimental Studies: Reevalua<ng the Evalua<on of Training Programs. Journal of the American Sta.s.cal Associa.on, vol. 94(448): Dehejia, Rajeev H. and Sadek Wahba (2002). Propensity Score- Matching Methods for Nonexperimental Causal Studies. The Review of Economics and Sta.s.cs. February, 84(1): These two studies introduce propensity score matching methods to economists and perform a kind of replica<on of LaLonde s study

31 Dehejia and Wahba (1999) DW (1999) re- analyze the data using propensity score matching and stra<fica<on These were new at the <me to economists, although the method was first established in Rosenbaum and Rubin (1983) Iden<fying assump<ons: (Y 0,Y 1 ) D p(x) p(x) is propensity score 0<Pr(D X)<1 Common support Stable unit treatment value assump<on (SUTVA) The response of subject i to the treatment D doesn t depend on the treatment given to anyone else except i

32 Assump<ons e(x) = Pr(D X) which is the condi<onal probability of treatment. Also called the propensity score This is a scalar summary of all observed covariates, X Key Result is that the propensity score is a balancing score X e(x) Pr[D X, e(x)] = Pr[D e(x)] ATE at e(x) is the average difference between the observed responses in each treatment group at e(x) E[Y 1 Y 0 ) e(x) ] = E[Y e(x), D=1] E[Y e(x), D=0]

33 Interpreta<on The overall es<mated ATE from this method is the individual treatment effect averaged over the distribu.on of e(x)

34 Analy<cal use of propensity score Matching subsets consis<ng of both treatment and control subjects with the same propensity score are matched Stra<fica<on Data is divided into several strata (or blocks ) based on the propensity score, then regular analysis is carried out within each strata

35 Implementa<on Include as many observed pretreatment variables ( covariates ) as possible The sta<s<cal significance of individual terms isn t important Func<onal form of covariates Consider higher order polynomials as well as interac<on terms. Why? BALANCE BETWEEN TREATMENT AND CONTROL Selec<on of the model Probit or logit

36 Matching algorithm Nearest neighbor algorithm Itera<vely find the pair of subjects with the shortest distance Easy to understand and implement; offers good results in prac<ce; fast running <me; rarely offers the best matching results compared to some op<mal matching procedure

37 Implementa<on Choices of distance Exact match not possible because propensity score is a con<nuous variable and the probability of having the same value of a con<nuous score is zero Use one distance measure to summarize the informa<on Mahalanobis distance Propensity score Mahalanobis distance with propensity score caliper Any distance with the requirement of exact match on a specific variable

38 Sotware R func<ons by Ben Hansen hpp:// STATA func<ons STATA 13 has new treatment effects methods built into it which includes nearest neighbor matching as well as propensity score matching methods Pre- STATA 13: psmatch2(); pscore; nnmatch

39 Procedures for PSM Iden<fy the propensity score model (e.g., logit or probit; covariates) Es<mate the propensity score with all the data Compute the distance between any two subjects Created matched pair/group using a specific matching algorithm Check covariate balance between the treatment and control group among matched subjects; if not good enough, go back to improve the propensity score model Contrast between treated and control subjects within each pair/group Obtain the ATE by averaging over all pairs/groups

40 Why are we doing this? Remember the goal of DW: The goal is to inves<gate the credibility of the conven<onal analy<cal results from non- experimental data So the authors compared the results from the experimental data to the results from the non- experimental data by combining the treatment group with a comparable control dataset

41

42

43

44 Checking the balance ater matching

45 Comparison of the analy<cal results

46 Observa<ons The results ater the propensity score matching/ stra<fica<on was much closer to the truth (if we assume the randomized experiment is the correct benchmark) The variances seem to be larger due to the loss of the data The results aren t very sensi<ve to the func<onal form of the chosen covariates in the propensity score model; however they are sensi<ve to the selec<on of covariates included in the propensity score model

47 Comments Limita<on of propensity score method Relies on an unverified assump<on condi<onal independence, or selec<on on observables Unlike randomiza<on, propensity score matching cannot be used if there is unobserved counfounders, or selec<on on unobservables Overlap You need substan<al overlap between the treatment and the control groups, otherwise, it may result in significant loss of the data in your analysis

1. INTRODUCTION. Lalonde estimates the impact of the National Supported Work (NSW) Demonstration, a labor

1. INTRODUCTION. Lalonde estimates the impact of the National Supported Work (NSW) Demonstration, a labor 1. INTRODUCTION This paper discusses the estimation of treatment effects in observational studies. This issue, which is of great practical importance because randomized experiments cannot always be implemented,

More information

Practical propensity score matching: a reply to Smith and Todd

Practical propensity score matching: a reply to Smith and Todd Journal of Econometrics 125 (2005) 355 364 www.elsevier.com/locate/econbase Practical propensity score matching: a reply to Smith and Todd Rajeev Dehejia a,b, * a Department of Economics and SIPA, Columbia

More information

Sta$s$cs is Easy. Dennis Shasha From a book co- wri7en with Manda Wilson

Sta$s$cs is Easy. Dennis Shasha From a book co- wri7en with Manda Wilson Sta$s$cs is Easy Dennis Shasha From a book co- wri7en with Manda Wilson Is the Coin Fair? You toss a coin 17 $mes and it comes up heads 15 out of 17 $mes. How likely is it that coin is fair? Could look

More information

Propensity Score Methods for Longitudinal Data Analyses: General background, ra>onale and illustra>ons*

Propensity Score Methods for Longitudinal Data Analyses: General background, ra>onale and illustra>ons* Propensity Score Methods for Longitudinal Data Analyses: General background, ra>onale and illustra>ons* Bob Pruzek, University at Albany SUNY Summary Propensity Score Analysis (PSA) was introduced by Rosenbaum

More information

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research 2012 CCPRC Meeting Methodology Presession Workshop October 23, 2012, 2:00-5:00 p.m. Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy

More information

Casual Methods in the Service of Good Epidemiological Practice: A Roadmap

Casual Methods in the Service of Good Epidemiological Practice: A Roadmap University of California, Berkeley From the SelectedWorks of Maya Petersen 2013 Casual Methods in the Service of Good Epidemiological Practice: A Roadmap Maya Petersen, University of California, Berkeley

More information

Pros. University of Chicago and NORC at the University of Chicago, USA, and IZA, Germany

Pros. University of Chicago and NORC at the University of Chicago, USA, and IZA, Germany Dan A. Black University of Chicago and NORC at the University of Chicago, USA, and IZA, Germany Matching as a regression estimator Matching avoids making assumptions about the functional form of the regression

More information

Introduction to Observational Studies. Jane Pinelis

Introduction to Observational Studies. Jane Pinelis Introduction to Observational Studies Jane Pinelis 22 March 2018 Outline Motivating example Observational studies vs. randomized experiments Observational studies: basics Some adjustment strategies Matching

More information

Measuring Impact. Program and Policy Evaluation with Observational Data. Daniel L. Millimet. Southern Methodist University.

Measuring Impact. Program and Policy Evaluation with Observational Data. Daniel L. Millimet. Southern Methodist University. Measuring mpact Program and Policy Evaluation with Observational Data Daniel L. Millimet Southern Methodist University 23 May 2013 DL Millimet (SMU) Observational Data May 2013 1 / 23 ntroduction Measuring

More information

Are Job- Training Programs Effec4ve?

Are Job- Training Programs Effec4ve? Are Job- Training Programs Effec4ve? Donald B. Rubin Harvard University Presentation based on joint work with Fabrizia Mealli, Paolo Frumento, and Barbara Pacini. 1 The Na4onal Job Corps Study A randomized

More information

Methods for Addressing Selection Bias in Observational Studies

Methods for Addressing Selection Bias in Observational Studies Methods for Addressing Selection Bias in Observational Studies Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA What is Selection Bias? In the regression

More information

EMPIRICAL STRATEGIES IN LABOUR ECONOMICS

EMPIRICAL STRATEGIES IN LABOUR ECONOMICS EMPIRICAL STRATEGIES IN LABOUR ECONOMICS University of Minho J. Angrist NIPE Summer School June 2009 This course covers core econometric ideas and widely used empirical modeling strategies. The main theoretical

More information

Learning Objec1ves. Study Design Considera1ons in Clinical Pharmacy

Learning Objec1ves. Study Design Considera1ons in Clinical Pharmacy 9/28/15 Study Design Considera1ons in Clinical Pharmacy Ludmila Bakhireva, MD, PhD, MPH Pree Sarangarm, PharmD, BCPS Learning Objec1ves Describe the features, advantages and disadvantages of the observa1onal

More information

Class 1: Introduction, Causality, Self-selection Bias, Regression

Class 1: Introduction, Causality, Self-selection Bias, Regression Class 1: Introduction, Causality, Self-selection Bias, Regression Ricardo A Pasquini April 2011 Ricardo A Pasquini () April 2011 1 / 23 Introduction I Angrist s what should be the FAQs of a researcher:

More information

For Objec*ve Causal Inference, Design Trumps Analysis. Donald B. Rubin Department of Sta*s*cs Harvard University 16 March 2012

For Objec*ve Causal Inference, Design Trumps Analysis. Donald B. Rubin Department of Sta*s*cs Harvard University 16 March 2012 For Objec*ve Causal Inference, Design Trumps Analysis Donald B. Rubin Department of Sta*s*cs Harvard University 16 March 2012 1 Prologue to Objec*ve Causal Inference in Observa*onal Studies My Introduc*on

More information

Blue Cross Blue Shield of Michigan Building a Statewide PCMH Program: Design, Evalua>on Methods, and Results

Blue Cross Blue Shield of Michigan Building a Statewide PCMH Program: Design, Evalua>on Methods, and Results Blue Cross Blue Shield of Michigan Building a Statewide PCMH Program: Design, Evalua>on Methods, and Results Margaret Mason, MHSA Michael Paus6an, PhD, MS Amanda Markovitz, MPH 1 Overview of BCBSM Serving

More information

CASE STUDY 2: VOCATIONAL TRAINING FOR DISADVANTAGED YOUTH

CASE STUDY 2: VOCATIONAL TRAINING FOR DISADVANTAGED YOUTH CASE STUDY 2: VOCATIONAL TRAINING FOR DISADVANTAGED YOUTH Why Randomize? This case study is based on Training Disadvantaged Youth in Latin America: Evidence from a Randomized Trial by Orazio Attanasio,

More information

In this module we will cover Correla4on and Validity.

In this module we will cover Correla4on and Validity. In this module we will cover Correla4on and Validity. A correla4on coefficient is a sta4s4c that is o:en used as an es4mate of measurement, such as validity and reliability. You will learn the strength

More information

Which Comparison-Group ( Quasi-Experimental ) Study Designs Are Most Likely to Produce Valid Estimates of a Program s Impact?:

Which Comparison-Group ( Quasi-Experimental ) Study Designs Are Most Likely to Produce Valid Estimates of a Program s Impact?: Which Comparison-Group ( Quasi-Experimental ) Study Designs Are Most Likely to Produce Valid Estimates of a Program s Impact?: A Brief Overview and Sample Review Form February 2012 This publication was

More information

Learning Objec1ves. Study Design Strategies. Cohort Studies 9/28/15

Learning Objec1ves. Study Design Strategies. Cohort Studies 9/28/15 9/28/15 Learning Objec1ves Describe the features, advantages and disadvantages of the observa1onal study designs Explain why the overall study design is important when evalua1ng studies & applying their

More information

Instrumental Variables Estimation: An Introduction

Instrumental Variables Estimation: An Introduction Instrumental Variables Estimation: An Introduction Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA The Problem The Problem Suppose you wish to

More information

Public Health / Public Works Introduc6on, reitera6on + Lessons 1-2. Pioneer Winter IDH 3034, IDH 4007

Public Health / Public Works Introduc6on, reitera6on + Lessons 1-2. Pioneer Winter IDH 3034, IDH 4007 Public Health / Public Works Introduc6on, reitera6on + Lessons 1-2 Pioneer Winter IDH 3034, IDH 4007 What s the purpose of this class? Learning Objec6ves Understand the principles of public health, epidemiology,

More information

Applica(on of Causal Inference Methods to Improve Treatment of HIV in Resource Limited Se?ngs

Applica(on of Causal Inference Methods to Improve Treatment of HIV in Resource Limited Se?ngs Applica(on of Causal Inference Methods to Improve Treatment of HIV in Resource Limited Se?ngs Maya Petersen works.bepress.com/maya_petersen Divisions of Biosta,s,cs and Epidemiology, University of California,

More information

Instrumental Variables I (cont.)

Instrumental Variables I (cont.) Review Instrumental Variables Observational Studies Cross Sectional Regressions Omitted Variables, Reverse causation Randomized Control Trials Difference in Difference Time invariant omitted variables

More information

Version No. 7 Date: July Please send comments or suggestions on this glossary to

Version No. 7 Date: July Please send comments or suggestions on this glossary to Impact Evaluation Glossary Version No. 7 Date: July 2012 Please send comments or suggestions on this glossary to 3ie@3ieimpact.org. Recommended citation: 3ie (2012) 3ie impact evaluation glossary. International

More information

Pragma&c Clinical Trials

Pragma&c Clinical Trials Pragma&c Clinical Trials Susanne May, PhD Associate Professor Department of Biosta&s&cs, School of Public Health, University of Washington Preliminaries PLEASE ask ques&ons at any &me REALLY good clinical

More information

Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies

Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies Mostly Harmless Simulations? On the Internal Validity of Empirical Monte Carlo Studies Arun Advani and Tymon Sªoczy«ski 13 November 2013 Background When interested in small-sample properties of estimators,

More information

1. Introduction Consider a government contemplating the implementation of a training (or other social assistance) program. The decision to implement t

1. Introduction Consider a government contemplating the implementation of a training (or other social assistance) program. The decision to implement t 1. Introduction Consider a government contemplating the implementation of a training (or other social assistance) program. The decision to implement the program depends on the assessment of its likely

More information

Health Informa.cs. Lecture 9. Samantha Kleinberg

Health Informa.cs. Lecture 9. Samantha Kleinberg Health Informa.cs Lecture 9 Samantha Kleinberg samantha.kleinberg@stevens.edu Next week: journal club For all papers: read, and prepare to comment on For your paper: Read the ar.cle (+ other references

More information

3/31/2015. Designing Clinical Research Studies: So You Want to Be an

3/31/2015. Designing Clinical Research Studies: So You Want to Be an Designing Clinical Research Studies: So You Want to Be an Inves@gator Andrea Bonny, MD Ellen Lançon Connor, MD On behalf Of The NASPAG Research CommiPee Objec@ves Learn to design a clinical research project

More information

Welcome! Pragmatic Clinical Studies. David Hickam, MD, MPH Program Director Clinical Effectiveness Research. David Hickam, MD, MPH

Welcome! Pragmatic Clinical Studies. David Hickam, MD, MPH Program Director Clinical Effectiveness Research. David Hickam, MD, MPH Pragmatic Clinical Studies David Hickam, MD, MPH Program Director Clinical Effec2veness Research June 23, 2015 Welcome! David Hickam, MD, MPH Program Director Clinical Effectiveness Research 2 In this

More information

Discordant MIC Analysis: Tes5ng for Superiority within a Non- inferiority Trial

Discordant MIC Analysis: Tes5ng for Superiority within a Non- inferiority Trial Discordant MIC Analysis: Tes5ng for Superiority within a Non- inferiority Trial Dean Follmann, Erica BriDain, and John Powers Na5onal Ins5tute of Allergy and Infec5ous Diseases November 19, 2014 1 Current

More information

Understanding. Design & Sta/s/cs. in Clinical Trials

Understanding. Design & Sta/s/cs. in Clinical Trials Understanding Design & Sta/s/cs in Clinical Trials 1 Why bother? 2 Experience- Based Medicine? Benjamin Rush Father of American Psychiatry 3 Experience- Based Medicine? Supported blood- lejng as treatment

More information

Quasi-experimental analysis Notes for "Structural modelling".

Quasi-experimental analysis Notes for Structural modelling. Quasi-experimental analysis Notes for "Structural modelling". Martin Browning Department of Economics, University of Oxford Revised, February 3 2012 1 Quasi-experimental analysis. 1.1 Modelling using quasi-experiments.

More information

Mul$ Voxel Pa,ern Analysis (fmri) Mul$ Variate Pa,ern Analysis (more generally) Magic Voxel Pa,ern Analysis (probably not!)

Mul$ Voxel Pa,ern Analysis (fmri) Mul$ Variate Pa,ern Analysis (more generally) Magic Voxel Pa,ern Analysis (probably not!) Mul$ Voxel Pa,ern Analysis (fmri) Mul$ Variate Pa,ern Analysis (more generally) Magic Voxel Pa,ern Analysis (probably not!) all MVPA really shows is that there are places where, in most people s brain,

More information

Considera*ons when undergoing personal genotyping

Considera*ons when undergoing personal genotyping Considera*ons when undergoing personal genotyping Kelly Ormond, MS, CGC Louanne Hudgins, MD, FACMG January 19, 2011 GENE 210 Disclosures and introduc*ons Professor Ormond provided paid consulta*on for

More information

Impact Evaluation Toolbox

Impact Evaluation Toolbox Impact Evaluation Toolbox Gautam Rao University of California, Berkeley * ** Presentation credit: Temina Madon Impact Evaluation 1) The final outcomes we care about - Identify and measure them Measuring

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH PROPENSITY SCORE Confounding Definition: A situation in which the effect or association between an exposure (a predictor or risk factor) and

More information

Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha

Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha attrition: When data are missing because we are unable to measure the outcomes of some of the

More information

EC352 Econometric Methods: Week 07

EC352 Econometric Methods: Week 07 EC352 Econometric Methods: Week 07 Gordon Kemp Department of Economics, University of Essex 1 / 25 Outline Panel Data (continued) Random Eects Estimation and Clustering Dynamic Models Validity & Threats

More information

5 To Invest or not to Invest? That is the Question.

5 To Invest or not to Invest? That is the Question. 5 To Invest or not to Invest? That is the Question. Before starting this lab, you should be familiar with these terms: response y (or dependent) and explanatory x (or independent) variables; slope and

More information

Classifica4on. CSCI1950 Z Computa4onal Methods for Biology Lecture 18. Ben Raphael April 8, hip://cs.brown.edu/courses/csci1950 z/

Classifica4on. CSCI1950 Z Computa4onal Methods for Biology Lecture 18. Ben Raphael April 8, hip://cs.brown.edu/courses/csci1950 z/ CSCI1950 Z Computa4onal Methods for Biology Lecture 18 Ben Raphael April 8, 2009 hip://cs.brown.edu/courses/csci1950 z/ Binary classifica,on Given a set of examples (x i, y i ), where y i = + 1, from unknown

More information

Propensity Score Matching with Limited Overlap. Abstract

Propensity Score Matching with Limited Overlap. Abstract Propensity Score Matching with Limited Overlap Onur Baser Thomson-Medstat Abstract In this article, we have demostrated the application of two newly proposed estimators which accounts for lack of overlap

More information

Matched Cohort designs.

Matched Cohort designs. Matched Cohort designs. Stefan Franzén PhD Lund 2016 10 13 Registercentrum Västra Götaland Purpose: improved health care 25+ 30+ 70+ Registries Employed Papers Statistics IT Project management Registry

More information

The Mortality Effects of Re3rement: Evidence from Social Security Eligibility at Age 62

The Mortality Effects of Re3rement: Evidence from Social Security Eligibility at Age 62 The Mortality Effects of Re3rement: Evidence from Social Security Eligibility at Age 62 Maria D. Fitzpatrick Cornell University & NBER Timothy J. Moore George Washington University & NBER Funded by grants

More information

TRIPLL Webinar: Propensity score methods in chronic pain research

TRIPLL Webinar: Propensity score methods in chronic pain research TRIPLL Webinar: Propensity score methods in chronic pain research Felix Thoemmes, PhD Support provided by IES grant Matching Strategies for Observational Studies with Multilevel Data in Educational Research

More information

Multiple Linear Regression (Dummy Variable Treatment) CIVL 7012/8012

Multiple Linear Regression (Dummy Variable Treatment) CIVL 7012/8012 Multiple Linear Regression (Dummy Variable Treatment) CIVL 7012/8012 2 In Today s Class Recap Single dummy variable Multiple dummy variables Ordinal dummy variables Dummy-dummy interaction Dummy-continuous/discrete

More information

Can Quasi Experiments Yield Causal Inferences? Sample. Intervention 2/20/2012. Matthew L. Maciejewski, PhD Durham VA HSR&D and Duke University

Can Quasi Experiments Yield Causal Inferences? Sample. Intervention 2/20/2012. Matthew L. Maciejewski, PhD Durham VA HSR&D and Duke University Can Quasi Experiments Yield Causal Inferences? Matthew L. Maciejewski, PhD Durham VA HSR&D and Duke University Sample Study 1 Study 2 Year Age Race SES Health status Intervention Study 1 Study 2 Intervention

More information

Recent advances in non-experimental comparison group designs

Recent advances in non-experimental comparison group designs Recent advances in non-experimental comparison group designs Elizabeth Stuart Johns Hopkins Bloomberg School of Public Health Department of Mental Health Department of Biostatistics Department of Health

More information

Confounding by indication developments in matching, and instrumental variable methods. Richard Grieve London School of Hygiene and Tropical Medicine

Confounding by indication developments in matching, and instrumental variable methods. Richard Grieve London School of Hygiene and Tropical Medicine Confounding by indication developments in matching, and instrumental variable methods Richard Grieve London School of Hygiene and Tropical Medicine 1 Outline 1. Causal inference and confounding 2. Genetic

More information

Technical Track Session IV Instrumental Variables

Technical Track Session IV Instrumental Variables Impact Evaluation Technical Track Session IV Instrumental Variables Christel Vermeersch Beijing, China, 2009 Human Development Human Network Development Network Middle East and North Africa Region World

More information

The role of self-reporting bias in health, mental health and labor force participation: a descriptive analysis

The role of self-reporting bias in health, mental health and labor force participation: a descriptive analysis Empir Econ DOI 10.1007/s00181-010-0434-z The role of self-reporting bias in health, mental health and labor force participation: a descriptive analysis Justin Leroux John A. Rizzo Robin Sickles Received:

More information

Economic outcomes: Method for implementa5on

Economic outcomes: Method for implementa5on Economic outcomes: Method for implementa5on Philippe Beutels Centre for Health Economics Research & Modelling Infec

More information

TESTING FOR COVARIATE BALANCE IN COMPARATIVE STUDIES

TESTING FOR COVARIATE BALANCE IN COMPARATIVE STUDIES TESTING FOR COVARIATE BALANCE IN COMPARATIVE STUDIES by Yevgeniya N. Kleyman A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy (Statistics) in The

More information

Demographic Responses to a Political Transformation: Evidence of Women s Empowerment from a Natural Experiment in Nepal

Demographic Responses to a Political Transformation: Evidence of Women s Empowerment from a Natural Experiment in Nepal Demographic Responses to a Political Transformation: Evidence of Women s Empowerment from a Natural Experiment in Nepal Jayash Paudel 1 Pedro de Araujo 2 1 University of Massachusetts Amherst 2 Colorado

More information

Propensity Score Methods for Causal Inference with the PSMATCH Procedure

Propensity Score Methods for Causal Inference with the PSMATCH Procedure Paper SAS332-2017 Propensity Score Methods for Causal Inference with the PSMATCH Procedure Yang Yuan, Yiu-Fai Yung, and Maura Stokes, SAS Institute Inc. Abstract In a randomized study, subjects are randomly

More information

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover). STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical methods 2 Course code: EC2402 Examiner: Per Pettersson-Lidbom Number of credits: 7,5 credits Date of exam: Sunday 21 February 2010 Examination

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Dylan Small Department of Statistics, Wharton School, University of Pennsylvania. Based on joint work with Paul Rosenbaum

Dylan Small Department of Statistics, Wharton School, University of Pennsylvania. Based on joint work with Paul Rosenbaum Instrumental variables and their sensitivity to unobserved biases Dylan Small Department of Statistics, Wharton School, University of Pennsylvania Based on joint work with Paul Rosenbaum Overview Instrumental

More information

Problem Set 5 ECN 140 Econometrics Professor Oscar Jorda. DUE: June 6, Name

Problem Set 5 ECN 140 Econometrics Professor Oscar Jorda. DUE: June 6, Name Problem Set 5 ECN 140 Econometrics Professor Oscar Jorda DUE: June 6, 2006 Name 1) Earnings functions, whereby the log of earnings is regressed on years of education, years of on-the-job training, and

More information

Today. HW6 ques.ons? Next reading presenta.on: Friday (R25) Sta.s.cal methods

Today. HW6 ques.ons? Next reading presenta.on: Friday (R25) Sta.s.cal methods Today HW6 ques.ons? Next reading presenta.on: Friday (R25) Sta.s.cal methods Inferen.al sta.s.cs Inferen.al sta.s.cs: Sta.s.cal tests you apply to quan.ta.ve data to determine the likelihood that the results

More information

Health Informa.cs. Lecture 7. Samantha Kleinberg

Health Informa.cs. Lecture 7. Samantha Kleinberg Health Informa.cs Lecture 7 Samantha Kleinberg samantha.kleinberg@stevens.edu Prac.cali.es: preserving privacy Good prac.ces for preserving privacy Physical security Locked cabinet/office, don t print

More information

Working Paper: Designs of Empirical Evaluations of Non-Experimental Methods in Field Settings. Vivian C. Wong 1 & Peter M.

Working Paper: Designs of Empirical Evaluations of Non-Experimental Methods in Field Settings. Vivian C. Wong 1 & Peter M. EdPolicyWorks Working Paper: Designs of Empirical Evaluations of Non-Experimental Methods in Field Settings Vivian C. Wong 1 & Peter M. Steiner 2 Over the last three decades, a research design has emerged

More information

Early Release from Prison and Recidivism: A Regression Discontinuity Approach *

Early Release from Prison and Recidivism: A Regression Discontinuity Approach * Early Release from Prison and Recidivism: A Regression Discontinuity Approach * Olivier Marie Department of Economics, Royal Holloway University of London and Centre for Economic Performance, London School

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

Assessing Studies Based on Multiple Regression. Chapter 7. Michael Ash CPPA

Assessing Studies Based on Multiple Regression. Chapter 7. Michael Ash CPPA Assessing Studies Based on Multiple Regression Chapter 7 Michael Ash CPPA Assessing Regression Studies p.1/20 Course notes Last time External Validity Internal Validity Omitted Variable Bias Misspecified

More information

State-of-the-art Strategies for Addressing Selection Bias When Comparing Two or More Treatment Groups. Beth Ann Griffin Daniel McCaffrey

State-of-the-art Strategies for Addressing Selection Bias When Comparing Two or More Treatment Groups. Beth Ann Griffin Daniel McCaffrey State-of-the-art Strategies for Addressing Selection Bias When Comparing Two or More Treatment Groups Beth Ann Griffin Daniel McCaffrey 1 Acknowledgements This work has been generously supported by NIDA

More information

Estimating average treatment effects from observational data using teffects

Estimating average treatment effects from observational data using teffects Estimating average treatment effects from observational data using teffects David M. Drukker Director of Econometrics Stata 2013 Nordic and Baltic Stata Users Group meeting Karolinska Institutet September

More information

What is: regression discontinuity design?

What is: regression discontinuity design? What is: regression discontinuity design? Mike Brewer University of Essex and Institute for Fiscal Studies Part of Programme Evaluation for Policy Analysis (PEPA), a Node of the NCRM Regression discontinuity

More information

Propensity Score Analysis Shenyang Guo, Ph.D.

Propensity Score Analysis Shenyang Guo, Ph.D. Propensity Score Analysis Shenyang Guo, Ph.D. Upcoming Seminar: April 7-8, 2017, Philadelphia, Pennsylvania Propensity Score Analysis 1. Overview 1.1 Observational studies and challenges 1.2 Why and when

More information

Introduction to Program Evaluation

Introduction to Program Evaluation Introduction to Program Evaluation Nirav Mehta Assistant Professor Economics Department University of Western Ontario January 22, 2014 Mehta (UWO) Program Evaluation January 22, 2014 1 / 28 What is Program

More information

Moving beyond regression toward causality:

Moving beyond regression toward causality: Moving beyond regression toward causality: INTRODUCING ADVANCED STATISTICAL METHODS TO ADVANCE SEXUAL VIOLENCE RESEARCH Regine Haardörfer, Ph.D. Emory University rhaardo@emory.edu OR Regine.Haardoerfer@Emory.edu

More information

Final Exam - section 2. Thursday, December hours, 30 minutes

Final Exam - section 2. Thursday, December hours, 30 minutes Econometrics, ECON312 San Francisco State University Michael Bar Fall 2011 Final Exam - section 2 Thursday, December 15 2 hours, 30 minutes Name: Instructions 1. This is closed book, closed notes exam.

More information

August 29, Introduction and Overview

August 29, Introduction and Overview August 29, 2018 Introduction and Overview Why are we here? Haavelmo(1944): to become master of the happenings of real life. Theoretical models are necessary tools in our attempts to understand and explain

More information

Randomized Evaluations

Randomized Evaluations Randomized Evaluations Introduction, Methodology, & Basic Econometrics using Mexico s Progresa program as a case study (with thanks to Clair Null, author of 2008 Notes) Sept. 15, 2009 Not All Correlations

More information

Observational & Quasi-experimental Research Methods

Observational & Quasi-experimental Research Methods Observational & Quasi-experimental Research Methods 10th Annual Kathleen Foley Palliative Care Retreat Old Québec, October 24, 2016 Melissa M. Garrido, PhD 1 and Jay Magaziner, PhD 2 1. Department of Veterans

More information

ECON Microeconomics III

ECON Microeconomics III ECON 7130 - Microeconomics III Spring 2016 Notes for Lecture #5 Today: Difference-in-Differences (DD) Estimators Difference-in-Difference-in-Differences (DDD) Estimators (Triple Difference) Difference-in-Difference

More information

Key questions when starting an econometric project (Angrist & Pischke, 2009):

Key questions when starting an econometric project (Angrist & Pischke, 2009): Econometric & other impact assessment approaches to policy analysis Part 1 1 The problem of causality in policy analysis Internal vs. external validity Key questions when starting an econometric project

More information

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity Measurement & Variables - Initial step is to conceptualize and clarify the concepts embedded in a hypothesis or research question with

More information

PubH 7405: REGRESSION ANALYSIS. Propensity Score

PubH 7405: REGRESSION ANALYSIS. Propensity Score PubH 7405: REGRESSION ANALYSIS Propensity Score INTRODUCTION: There is a growing interest in using observational (or nonrandomized) studies to estimate the effects of treatments on outcomes. In observational

More information

WORKING PAPER SERIES

WORKING PAPER SERIES WORKING PAPER SERIES Evaluating School Vouchers: Evidence from a Within-Study Kaitlin P. Anderson Patrick J. Wolf, Ph.D. April 3, 2017 EDRE Working Paper 2017-10 The University of Arkansas, Department

More information

Causal Validity Considerations for Including High Quality Non-Experimental Evidence in Systematic Reviews

Causal Validity Considerations for Including High Quality Non-Experimental Evidence in Systematic Reviews Non-Experimental Evidence in Systematic Reviews OPRE REPORT #2018-63 DEKE, MATHEMATICA POLICY RESEARCH JUNE 2018 OVERVIEW Federally funded systematic reviews of research evidence play a central role in

More information

Common Data Elements: Making the Mass of NIH Measures More Useful

Common Data Elements: Making the Mass of NIH Measures More Useful Common Data Elements WG Common Data Elements: Making the Mass of NIH Measures More Useful Jerry Sheehan Assistant Director for Policy Development Na?onal Library of Medicine Gene/c Alliance Webinar Series

More information

Methods of Randomization Lupe Bedoya. Development Impact Evaluation Field Coordinator Training Washington, DC April 22-25, 2013

Methods of Randomization Lupe Bedoya. Development Impact Evaluation Field Coordinator Training Washington, DC April 22-25, 2013 Methods of Randomization Lupe Bedoya Development Impact Evaluation Field Coordinator Training Washington, DC April 22-25, 2013 Content 1. Important Concepts 2. What vs. Why 3. Some Practical Issues 4.

More information

Multiple Regression Models

Multiple Regression Models Multiple Regression Models Advantages of multiple regression Parts of a multiple regression model & interpretation Raw score vs. Standardized models Differences between r, b biv, b mult & β mult Steps

More information

Advanced Handling of Missing Data

Advanced Handling of Missing Data Advanced Handling of Missing Data One-day Workshop Nicole Janz ssrmcta@hermes.cam.ac.uk 2 Goals Discuss types of missingness Know advantages & disadvantages of missing data methods Learn multiple imputation

More information

Chapter 13 Estimating the Modified Odds Ratio

Chapter 13 Estimating the Modified Odds Ratio Chapter 13 Estimating the Modified Odds Ratio Modified odds ratio vis-à-vis modified mean difference To a large extent, this chapter replicates the content of Chapter 10 (Estimating the modified mean difference),

More information

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics What is Multilevel Modelling Vs Fixed Effects Will Cook Social Statistics Intro Multilevel models are commonly employed in the social sciences with data that is hierarchically structured Estimated effects

More information

AP Statistics Exam Review: Strand 2: Sampling and Experimentation Date:

AP Statistics Exam Review: Strand 2: Sampling and Experimentation Date: AP Statistics NAME: Exam Review: Strand 2: Sampling and Experimentation Date: Block: II. Sampling and Experimentation: Planning and conducting a study (10%-15%) Data must be collected according to a well-developed

More information

NBER WORKING PAPER SERIES HOW WAS THE WEEKEND? HOW THE SOCIAL CONTEXT UNDERLIES WEEKEND EFFECTS IN HAPPINESS AND OTHER EMOTIONS FOR US WORKERS

NBER WORKING PAPER SERIES HOW WAS THE WEEKEND? HOW THE SOCIAL CONTEXT UNDERLIES WEEKEND EFFECTS IN HAPPINESS AND OTHER EMOTIONS FOR US WORKERS NBER WORKING PAPER SERIES HOW WAS THE WEEKEND? HOW THE SOCIAL CONTEXT UNDERLIES WEEKEND EFFECTS IN HAPPINESS AND OTHER EMOTIONS FOR US WORKERS John F. Helliwell Shun Wang Working Paper 21374 http://www.nber.org/papers/w21374

More information

Establishing Causality Convincingly: Some Neat Tricks

Establishing Causality Convincingly: Some Neat Tricks Establishing Causality Convincingly: Some Neat Tricks Establishing Causality In the last set of notes, I discussed how causality can be difficult to establish in a straightforward OLS context If assumptions

More information

TREATMENT EFFECTIVENESS IN TECHNOLOGY APPRAISAL: May 2015

TREATMENT EFFECTIVENESS IN TECHNOLOGY APPRAISAL: May 2015 NICE DSU TECHNICAL SUPPORT DOCUMENT 17: THE USE OF OBSERVATIONAL DATA TO INFORM ESTIMATES OF TREATMENT EFFECTIVENESS IN TECHNOLOGY APPRAISAL: METHODS FOR COMPARATIVE INDIVIDUAL PATIENT DATA REPORT BY THE

More information

A large database study in the general population in England

A large database study in the general population in England Time-dependent markers of comorbidity severity and change are associated with increased risk of mortality in heart failure: A large database study in the general population in England Claire A. Rushton,

More information

Chapter 13 Summary Experiments and Observational Studies

Chapter 13 Summary Experiments and Observational Studies Chapter 13 Summary Experiments and Observational Studies What have we learned? We can recognize sample surveys, observational studies, and randomized comparative experiments. o These methods collect data

More information

Erratum and discussion of propensity-score reweighting

Erratum and discussion of propensity-score reweighting The Stata Journal (2008 8, Number 4, pp. 532 539 Erratum and discussion of propensity-score reweighting Austin Nichols Urban Institute Washington, DC austinnichols@gmail.com Keywords: st0136 1, xtreg,

More information

Simple Linear Regression One Categorical Independent Variable with Several Categories

Simple Linear Regression One Categorical Independent Variable with Several Categories Simple Linear Regression One Categorical Independent Variable with Several Categories Does ethnicity influence total GCSE score? We ve learned that variables with just two categories are called binary

More information

Introduction to Econometrics

Introduction to Econometrics Global edition Introduction to Econometrics Updated Third edition James H. Stock Mark W. Watson MyEconLab of Practice Provides the Power Optimize your study time with MyEconLab, the online assessment and

More information

Economics 345 Applied Econometrics

Economics 345 Applied Econometrics Economics 345 Applied Econometrics Lab Exam Version 1: Solutions Fall 2016 Prof: Martin Farnham TAs: Joffré Leroux Rebecca Wortzman Last Name (Family Name) First Name Student ID Open EViews, and open the

More information

Getting ready for propensity score methods: Designing non-experimental studies and selecting comparison groups

Getting ready for propensity score methods: Designing non-experimental studies and selecting comparison groups Getting ready for propensity score methods: Designing non-experimental studies and selecting comparison groups Elizabeth A. Stuart Johns Hopkins Bloomberg School of Public Health Departments of Mental

More information