Propensity Score Analysis: Its rationale & potential for applied social/behavioral research. Bob Pruzek University at Albany

Size: px
Start display at page:

Download "Propensity Score Analysis: Its rationale & potential for applied social/behavioral research. Bob Pruzek University at Albany"

Transcription

1 Propensity Score Analysis: Its rationale & potential for applied social/behavioral research Bob Pruzek University at Albany

2 Aims: First, to introduce key ideas that underpin propensity score (PS) methodology and to provide some history of this subject; several of the main references will be discussed briefly. An example involving analysis of data from a true experiment will be used to develop some of the basic PS logic. Methods for conditioning will be emphasized, as will randomization and some other essentials. Next, counterfactual logic will be introduced and briefly illustrated.

3 I have prepared a PSA wiki, at: where a number of files that may be found that you may find informative about PSA. Feel free to add others, and to see/add links there. Password PSA. As for software related to PSA, several of the packages in R provide functions for executing PSArelated computations and graphics. Particularly if you are just beginning to learn the R system, the Links and files there may help. See my wiki: Password R.

4 Propensity score analysis, as we shall see is strongly connected to true experiments; in fact, PSA owes its central rationale to the logic that underpins such studies. For true experiments, units are randomly allocated to the (two) treatment groups at the outset of an experiment, that is, before any treatments begin. This helps to ensure that the units allocated to each treatment group are in one fundamental sense comparable to one another, so that when the time for experimental assessment arrives, the investigator can argue that (in principle) it was the treatments that caused the observed differences. If the one group scores systematically higher than the other, thanks to randomized allocation of units to treatments, this finding can (with some qualifications) be attributed to the treatments, and not some other factors.

5 Failure to randomize implies a greater likelihood of the presence of selection bias, this being the central problem toward which propensity score methods are mostly directed, usually (but not always) in the context of observational studies. Even in the best of circumstances, however, I will argue below that true experiments may fail. The problem will be seen as one where, lacking a design soundly based on comprehensive subject matter knowledge, results of true experiments can be either uninformative or misleading (regardless of sample size). The problem generalizes to PS studies too so it will be worthy of some discussion.

6 Three people have written the key articles and books that underpin propensity score methods. They are William Cochran, his student Donald Rubin, and then his student Paul Rosenbaum. Rubin s most recent book is in fact a Compilation of articles he as at least coauthored; it is his Matched sampling for causal effects (2006) (Cambridge). Rosenbaum completed his Observational studies (2nd Edit.) In A review of one of Cochran s reports, done 40 years ago is worth brief examination.

7 Cochran (1968) studied death rates of smokers and nonsmokers. It had been found when using unstratified data that death rates for smokers and non-smokers were nearly identical (evidence that many smokers and manufacturers of tobacco products found greatly to their liking). But Cochran decided to sort both smokers and non-smokers by age; after re-calculating rates after age-based stratification he found death rates were on average 40 to 50% higher for smokers than non-smokers -- and this was for very large samples. Results of this kind represent early versions of what now can be seen as propensity score analysis a term that gave nearly a million hits in a recent Google search!

8 More will be said about PSA soon enough, but to help ensure that PSA might be fully understood, I want to take a few minutes to present an example pertaining to a true experiment; the issues I highlight generalize to virtually all comparisons of treatments, experimental and observational. While the virtues of randomization are often praised, there are reasons not to be sanguine even when randomization has been used. The first figure below displays data as if for an experiment with two independent groups. My aim is to focus thinking on one of the simplest possible true experiments, where I shall assume initially that randomization was been used to assign individuals to groups. Real life complexities are avoided when the data are simulated (but there is a cost to this simplification to which I aim to return, if only briefly).

9

10

11 (Let rows = Blocks, columns = Treatments; a balanced $aov.summary Cell Means 2 x 2 factorial ) A Interpretation (re: graphic): 1 2 mean Red dotted line above B depicts row B1; Blue dotted line above col.mean depicts row B2. Summary table for 2 x 2 design Df Sum Sq Mean Sq F value Pr(>F) factor(a) factor(b) factor(a):factor(b) Residuals

12 Clearly, the interaction effect is huge, but no notable main effects are seen for either Rows (Treatments) or Columns (Blocks). Although these are simulated data, as if for a true experiment, this analysis demonstrated that when an analysis ignores a key blocking factor, it can fail to show major features of treatment effects. In nearly any comparable analysis of simple effects (only), whether block effects are available to the analysis, or even if the blocking information was not collected (!), the experimenter will generally have no idea of what has been missed. And it is a notable point that a proper statistical conclusion in the preceding case was to infer no treatment effects (e.g., A-Factor, N.S.). But if blocking has been used, having identified important individual differences associated w/ blocks, then, since so many potential blocking variates are generally available, the likelihood may be high of finding (notable?) interaction effects. And prospects for such findings are chiefly dependent on the subject-matter, and the design, not on statistical considerations.

13 The latter effect would normally be called causal, given standard caveats associated with inductive inference. There could be other (possibly causal ) effects, were other (blocking) factors identified and incorporated in the analysis. Naturally, for a principle this basic it is to be expected that others have examined this issue before. A review of some of the best early experimental design books (e.g. Cox, 1958) shows that design specialists had convinced themselves in their agricultural work that collections must be relatively homogeneous (in a certain sense) if simple designs were not often to lead to confusion. Cutting to the essense, a good summary would be to say that when target populations are broadly heterogeneous, then blocking is likely to be of central importance, and perhaps essential. This basic point seems to have gotten lost in recent times.

14 Although the preceding analysis was focused the concept of a true experiment, there are good reasons to consider blocking in virtually all studies of treatment effects, whether the study is experimental or observational. The best blocking variables are ones thought likely to be most correlated with the response, and/or likely to interact with the treatments. Some of the most recent work on PSA methods has been directed toward related problems, where attempts are made to make effective use of covariates in two ways -- as will be discussed briefly if time permits. But the problems are notably more difficult in observational studies; for true experiments almost everything is easier (although not easy). The current status of applied observational studies is that most of them have not employed blocking, or at least have not done so in an exemplary way. The PSA analyses that are presented below are therefore not wholly exemplary, even though they seem to have answered key questions that were being asked.

15 With a look back at the brief example where Bill Cochran used a covariate (age) to stratify units before proceeding to his analysis, we can now introduce the basic features of typical propensity score analyses. There are two basic Phases in a PSA: In Phase I pre-treatment covariates are used to construct a single variable, a propensity score, that aims to summarize all key differences among respondents with respect to the two treatments being compared. (Except for some very recent work, nearly all PSA s to date have focused on two group comparisons; we shall focus exclusively on such studies.) A number of methods have been used to estimate propensity scores, and we shall examine two below. In Phase II respondents (from both groups) are sorted on their propensity scores, and then the two groups are compared on one or more outcome measures, conditional on the P-scores.

16 Formally, a Propensity Score (c.f., Rosenbaum and Rubin 1983, Biometrika) P(x) is defined as the conditional probability of receiving a treatment, given scores on the observed covariates x. Symbolically, P(x i ) = Propensity Score (also a balancing score) = Pr (z * i =1 x i ) = E (z * i x i ) x i = Vector of covariate values for i-th individual z * i = 1 (Treatment) or 0 (Control) Note that the Propensity Score can be defined just as well for true experiments, since for two groups, when randomization is used the probability of receiving the treatment (or being in the control group) is just 1/2, and is a constant for all units or individuals. The first application of PSA that we illustrate below is based on matching each member of the treatment group with a member of the control group, using covariates.

17 Data shown in the next slide* derive from an observational study by Morten, et. al (1982, Amer. Jour. Epidemiology, p. 549 ff). Children of parents who had worked in a factory where lead was used in making batteries were matched by age and neighborhood with children whose parents did not work in lead-related industries. Whole blood was assessed for lead content to provide responses. Results shown compare Exposed w/ Control Children in what can be seen as a paired samples design. Conventional dependent sample analysis shows that the (95%) C.I. for the population mean difference is far from zero. The mean of the difference scores is 5.78, and the results support the interpretation that parents lead-related occupations tend to influence how much lead is found in their children's blood. This plot is called a Propensity Score Assessment Plot and is produced by the function granova.ds in the R package granova (Pruzek & Helmreich, 2007). Note that the heavy black line on the diagonal corresponds to X = Y, so if if a X > Y its point is below the identity diagonal. Parallel projections to the lower left line segment show the distribution of difference scores corresponding to the pairs; the red dashed line shows the average difference score, and the green line segment shows the 95% C.I.

18

19 The graphic shows more, however. Note the wide dispersion of lead measurements for Exposed children in comparison with their Control counterparts. One interpretation is that it is the particular characteristics of parents experiences at work or home with their children that need to be taken into account to make comparison most informative. That is, given the wide variation in blood levels across the Exposed group, where those with the lowest levels are quite comparable with their Control counterparts, but not those corresponding to points on the far right side, it seems reasonable to say that the general hypothesis should be reformulated in terms of specifics. Children whose parents worked at the factory in jobs with lower levels of exposure to lead, and parents who practiced good personal hygiene, despite having high exposure to lead via their jobs tended to be parents of children with lower blood lead levels.

20 Although it is not certain that Control & Exposed children did not differ in other ways (than age and neighborhood of residence), Rosenbaum (2002), who discusses this example in detail, uses a sensitivity analysis to show that the hidden bias would have to be extreme to explain away differences this large. The essential question asked in a sensitivity analysis is this: is there at least one covariate (the investigator can think of) that could explain as large a difference between treatment & control groups as was observed? Or not? In other words, can the analyst rule out an alternative explanation for the effect that was observed after (comprehensive) consideration of covariates that were not used in construction of the propensity score vector? If not, then the PSA effect stands as likely to be evidence of a causal relationship of the treatment ; if yes, then the difference is too small, relative to the observed treatment effect, to make a strong case that the treatment caused the difference. Sensitivity analyses can be essential to a wrap-up of a PSA study (but they are often not completed, which, alas, is a feature of our following illustration).

21 Consider next, estimation of propensity scores for a real medical using data on 996 initial Percutaneous Coronary Interventions (PCIs) performed in 1997 at the Lindner Center, Christ Hospital, Cincinnati. Description: Data from an observational study of 996 patients receiving a PCI at Ohio Heart Health in 1997 and followed for at least 6 months by the staff of the Lindner Center. This is a landmark dataset in the literature on propensity score adjustment for treatment selection bias due to practice of evidence based medicine; patients receiving abciximab tended to be more severely diseased than those who did not receive a cascade blocker. The binary variable abcix indicates control (0) or treatment (1). Logistic regression was used initially to estimate propensity scores.

22 The next two slides show two graphics. In the first, (densities) for both of the Lindner treatment groups, where the Counts were 298 for Control, and 698 for Treatment. Five strata are also identified by vertical lines in the plot. The loess plot, the second figure, is particularly informative, as it shows the (non-linear) regression lines for predicting costs (log metric) of the two treatments. Interpretation of the loess graphic comes after it s presentation. What you need to know at this point is that a form of regression (logistic regression) was used to produce the baseline variable on the next slides (the propensity scores (PS)); this PS variable is itself a function of the covariates chosen in the regression model. The variables that predict or discriminate best between treatment and control groups are the one s most strongly associated with the PS variable.

23

24

25 The preceding graphic shows all 996 data points colored to correspond with the treatment/control designations. Note that the loess (non-linear) regression curve for abcix = 1 lies above the curve for the control across nearly the full range of the propensity scores. This indicates that the costs of treatments (this being a proxy for health problems) for abcix were almost universally higher than for their control counterparts, after adjusting for all covariate differences using LR-based P-scores; 95% C.I.=(.05,.21) in Log metric. Summary results for strata are shown in the table below, as well as in the graphic on the next page. Stratum counts.0 counts.1 means.0 means.1 diff.means

26

27 Summary for PSA of Lindner Data (only parts of which were seen here) Before matching (n=966: 298 treatments and 698 controls) (not shown) 5 out of 7 variables, including major covariates ejecfrac, Ves1pro, acutemi, daibetic and stent show statistically significant differences between treatment and control groups. For the LR-based quintile stratification method, five strata were defined. Treatment effects differed somewhat across strata, indicating some interaction between covariates and treatments. Strata 2,3 & 4 showed the strongest treatment effects (might aim to characterize covariate characteristics for these strata); stratum 5 showed a change of sign in the mean response difference, but essentially a null effect. The related loess regression plot shows the full range of adjusted effect results. For the classification tree stratification method (not shown), 6 strata were found and concordance of treatment effects across propensity score levels were observed. DAE was used to summarize treatment effects, and the t test yielded significance, P<0.05, in favor of the treatment. That there were notable differences in results for LR & Classification tree methods should not be surprising, due to the major algorithm differences. The graphical-visualization methods for PSA provide especially clear images of the sizes and direction of treatment effects, and help to clarify the differences found for the different PSA methods.

28 Going beyond these examples, consider comparing two behavioral regimes with one another, say two diets or two exercise plans, or two food supplement conditions. When medical researchers develop vaccines to prevent disease, drugs to treat illness or methods to aid victims of injury they may be said to be designing treatments. When biologists study the effects of clear cutting forests, or educational psychologists evaluate how class size affects learning, they are studying the consequences of treatments. In short, treatments are ubiquitous, they can take virtually any form, but it is treatment comparisons that are often the essence of applied scientific study. If a non-standard treatment is selected, then the particular question of how to define the control group may need special attention. The requirement that observational studies use effective or appropriate pre-treatment covariates is most notable.

Propensity Score Analysis to compare effects of radiation and surgery on survival time of lung cancer patients from National Cancer Registry (SEER)

Propensity Score Analysis to compare effects of radiation and surgery on survival time of lung cancer patients from National Cancer Registry (SEER) Propensity Score Analysis to compare effects of radiation and surgery on survival time of lung cancer patients from National Cancer Registry (SEER) Yan Wu Advisor: Robert Pruzek Epidemiology and Biostatistics

More information

Propensity Score Methods for Longitudinal Data Analyses: General background, ra>onale and illustra>ons*

Propensity Score Methods for Longitudinal Data Analyses: General background, ra>onale and illustra>ons* Propensity Score Methods for Longitudinal Data Analyses: General background, ra>onale and illustra>ons* Bob Pruzek, University at Albany SUNY Summary Propensity Score Analysis (PSA) was introduced by Rosenbaum

More information

Simple Sensitivity Analyses for Matched Samples Thomas E. Love, Ph.D. ASA Course Atlanta Georgia https://goo.

Simple Sensitivity Analyses for Matched Samples Thomas E. Love, Ph.D. ASA Course Atlanta Georgia https://goo. Goal of a Formal Sensitivity Analysis To replace a general qualitative statement that applies in all observational studies the association we observe between treatment and outcome does not imply causation

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Regression Discontinuity Analysis

Regression Discontinuity Analysis Regression Discontinuity Analysis A researcher wants to determine whether tutoring underachieving middle school students improves their math grades. Another wonders whether providing financial aid to low-income

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH PROPENSITY SCORE Confounding Definition: A situation in which the effect or association between an exposure (a predictor or risk factor) and

More information

Propensity Score Analysis to compare effects of radiation and surgery on survival time of lung cancer patients from National Cancer Registry (SEER)

Propensity Score Analysis to compare effects of radiation and surgery on survival time of lung cancer patients from National Cancer Registry (SEER) Propensity core Analysis to compare effects of radiation and surgery on survival time of lung cancer patients from National Cancer egistry (EE) Yan Wu Advisor: obert Pruzek Epidemiology and Biostatistics,

More information

PubH 7405: REGRESSION ANALYSIS. Propensity Score

PubH 7405: REGRESSION ANALYSIS. Propensity Score PubH 7405: REGRESSION ANALYSIS Propensity Score INTRODUCTION: There is a growing interest in using observational (or nonrandomized) studies to estimate the effects of treatments on outcomes. In observational

More information

Propensity Score Methods for Causal Inference with the PSMATCH Procedure

Propensity Score Methods for Causal Inference with the PSMATCH Procedure Paper SAS332-2017 Propensity Score Methods for Causal Inference with the PSMATCH Procedure Yang Yuan, Yiu-Fai Yung, and Maura Stokes, SAS Institute Inc. Abstract In a randomized study, subjects are randomly

More information

Introduction to Observational Studies. Jane Pinelis

Introduction to Observational Studies. Jane Pinelis Introduction to Observational Studies Jane Pinelis 22 March 2018 Outline Motivating example Observational studies vs. randomized experiments Observational studies: basics Some adjustment strategies Matching

More information

Chapter 13 Estimating the Modified Odds Ratio

Chapter 13 Estimating the Modified Odds Ratio Chapter 13 Estimating the Modified Odds Ratio Modified odds ratio vis-à-vis modified mean difference To a large extent, this chapter replicates the content of Chapter 10 (Estimating the modified mean difference),

More information

Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision

Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision ISPUB.COM The Internet Journal of Epidemiology Volume 7 Number 2 Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision Z Wang Abstract There is an increasing

More information

Assessing the impact of unmeasured confounding: confounding functions for causal inference

Assessing the impact of unmeasured confounding: confounding functions for causal inference Assessing the impact of unmeasured confounding: confounding functions for causal inference Jessica Kasza jessica.kasza@monash.edu Department of Epidemiology and Preventive Medicine, Monash University Victorian

More information

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale. Model Based Statistics in Biology. Part V. The Generalized Linear Model. Single Explanatory Variable on an Ordinal Scale ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10,

More information

Supplementary Appendix

Supplementary Appendix Supplementary Appendix This appendix has been provided by the authors to give readers additional information about their work. Supplement to: Weintraub WS, Grau-Sepulveda MV, Weiss JM, et al. Comparative

More information

Sawtooth Software. The Number of Levels Effect in Conjoint: Where Does It Come From and Can It Be Eliminated? RESEARCH PAPER SERIES

Sawtooth Software. The Number of Levels Effect in Conjoint: Where Does It Come From and Can It Be Eliminated? RESEARCH PAPER SERIES Sawtooth Software RESEARCH PAPER SERIES The Number of Levels Effect in Conjoint: Where Does It Come From and Can It Be Eliminated? Dick Wittink, Yale University Joel Huber, Duke University Peter Zandan,

More information

Unit 7 Comparisons and Relationships

Unit 7 Comparisons and Relationships Unit 7 Comparisons and Relationships Objectives: To understand the distinction between making a comparison and describing a relationship To select appropriate graphical displays for making comparisons

More information

OHDSI Tutorial: Design and implementation of a comparative cohort study in observational healthcare data

OHDSI Tutorial: Design and implementation of a comparative cohort study in observational healthcare data OHDSI Tutorial: Design and implementation of a comparative cohort study in observational healthcare data Faculty: Martijn Schuemie (Janssen Research and Development) Marc Suchard (UCLA) Patrick Ryan (Janssen

More information

EXPERIMENTAL RESEARCH DESIGNS

EXPERIMENTAL RESEARCH DESIGNS ARTHUR PSYC 204 (EXPERIMENTAL PSYCHOLOGY) 14A LECTURE NOTES [02/28/14] EXPERIMENTAL RESEARCH DESIGNS PAGE 1 Topic #5 EXPERIMENTAL RESEARCH DESIGNS As a strict technical definition, an experiment is a study

More information

A NEW TRIAL DESIGN FULLY INTEGRATING BIOMARKER INFORMATION FOR THE EVALUATION OF TREATMENT-EFFECT MECHANISMS IN PERSONALISED MEDICINE

A NEW TRIAL DESIGN FULLY INTEGRATING BIOMARKER INFORMATION FOR THE EVALUATION OF TREATMENT-EFFECT MECHANISMS IN PERSONALISED MEDICINE A NEW TRIAL DESIGN FULLY INTEGRATING BIOMARKER INFORMATION FOR THE EVALUATION OF TREATMENT-EFFECT MECHANISMS IN PERSONALISED MEDICINE Dr Richard Emsley Centre for Biostatistics, Institute of Population

More information

Chapter 7: Descriptive Statistics

Chapter 7: Descriptive Statistics Chapter Overview Chapter 7 provides an introduction to basic strategies for describing groups statistically. Statistical concepts around normal distributions are discussed. The statistical procedures of

More information

Today Retrospective analysis of binomial response across two levels of a single factor.

Today Retrospective analysis of binomial response across two levels of a single factor. Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.3 Single Factor. Retrospective Analysis ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9,

More information

Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers

Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers Tutorial in Biostatistics Received 21 November 2012, Accepted 17 July 2013 Published online 23 August 2013 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.5941 Graphical assessment of

More information

You must answer question 1.

You must answer question 1. Research Methods and Statistics Specialty Area Exam October 28, 2015 Part I: Statistics Committee: Richard Williams (Chair), Elizabeth McClintock, Sarah Mustillo You must answer question 1. 1. Suppose

More information

Confounding, Effect modification, and Stratification

Confounding, Effect modification, and Stratification Confounding, Effect modification, and Stratification Tunisia, 30th October 2014 Acknowledgment: Kostas Danis Takis Panagiotopoulos National Schoool of Public Health, Athens, Greece takis.panagiotopoulos@gmail.com

More information

UNSUPERVISED PROPENSITY SCORING: NN & IV PLOTS

UNSUPERVISED PROPENSITY SCORING: NN & IV PLOTS UNSUPERVISED PROPENSITY SCORING: NN & IV PLOTS Bob Obenchain, US Medical Outcomes Research Lilly Technology Center South, Indianapolis, IN 46285-5024 Key Words: unsupervised learning, propensity scores,

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector

More information

Trial designs fully integrating biomarker information for the evaluation of treatment-effect mechanisms in stratified medicine

Trial designs fully integrating biomarker information for the evaluation of treatment-effect mechanisms in stratified medicine Trial designs fully integrating biomarker information for the evaluation of treatment-effect mechanisms in stratified medicine Dr Richard Emsley Centre for Biostatistics, Institute of Population Health,

More information

Two-sample Categorical data: Measuring association

Two-sample Categorical data: Measuring association Two-sample Categorical data: Measuring association Patrick Breheny October 27 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 40 Introduction Study designs leading to contingency

More information

A Practical Guide to Getting Started with Propensity Scores

A Practical Guide to Getting Started with Propensity Scores Paper 689-2017 A Practical Guide to Getting Started with Propensity Scores Thomas Gant, Keith Crowland Data & Information Management Enhancement (DIME) Kaiser Permanente ABSTRACT This paper gives tools

More information

Understandable Statistics

Understandable Statistics Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement

More information

Supplementary Appendix

Supplementary Appendix Supplementary Appendix This appendix has been provided by the authors to give readers additional information about their work. Supplement to: Bucholz EM, Butala NM, Ma S, Normand S-LT, Krumholz HM. Life

More information

Propensity Score Analysis Shenyang Guo, Ph.D.

Propensity Score Analysis Shenyang Guo, Ph.D. Propensity Score Analysis Shenyang Guo, Ph.D. Upcoming Seminar: April 7-8, 2017, Philadelphia, Pennsylvania Propensity Score Analysis 1. Overview 1.1 Observational studies and challenges 1.2 Why and when

More information

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA ingle factor or One - Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group

More information

Research Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process

Research Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process Research Methods in Forest Sciences: Learning Diary Yoko Lu 285122 9 December 2016 1. Research process It is important to pursue and apply knowledge and understand the world under both natural and social

More information

Methods for Addressing Selection Bias in Observational Studies

Methods for Addressing Selection Bias in Observational Studies Methods for Addressing Selection Bias in Observational Studies Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA What is Selection Bias? In the regression

More information

Book review of Herbert I. Weisberg: Bias and Causation, Models and Judgment for Valid Comparisons Reviewed by Judea Pearl

Book review of Herbert I. Weisberg: Bias and Causation, Models and Judgment for Valid Comparisons Reviewed by Judea Pearl Book review of Herbert I. Weisberg: Bias and Causation, Models and Judgment for Valid Comparisons Reviewed by Judea Pearl Judea Pearl University of California, Los Angeles Computer Science Department Los

More information

Impact and adjustment of selection bias. in the assessment of measurement equivalence

Impact and adjustment of selection bias. in the assessment of measurement equivalence Impact and adjustment of selection bias in the assessment of measurement equivalence Thomas Klausch, Joop Hox,& Barry Schouten Working Paper, Utrecht, December 2012 Corresponding author: Thomas Klausch,

More information

Sensitivity Analysis in Observational Research: Introducing the E-value

Sensitivity Analysis in Observational Research: Introducing the E-value Sensitivity Analysis in Observational Research: Introducing the E-value Tyler J. VanderWeele Harvard T.H. Chan School of Public Health Departments of Epidemiology and Biostatistics 1 Plan of Presentation

More information

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Week 9 Hour 3 Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Stat 302 Notes. Week 9, Hour 3, Page 1 / 39 Stepwise Now that we've introduced interactions,

More information

UNIT 5 - Association Causation, Effect Modification and Validity

UNIT 5 - Association Causation, Effect Modification and Validity 5 UNIT 5 - Association Causation, Effect Modification and Validity Introduction In Unit 1 we introduced the concept of causality in epidemiology and presented different ways in which causes can be understood

More information

Mediation Analysis With Principal Stratification

Mediation Analysis With Principal Stratification University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 3-30-009 Mediation Analysis With Principal Stratification Robert Gallop Dylan S. Small University of Pennsylvania

More information

Biostatistics II

Biostatistics II Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,

More information

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections New: Bias-variance decomposition, biasvariance tradeoff, overfitting, regularization, and feature selection Yi

More information

Using Propensity Score Matching in Clinical Investigations: A Discussion and Illustration

Using Propensity Score Matching in Clinical Investigations: A Discussion and Illustration 208 International Journal of Statistics in Medical Research, 2015, 4, 208-216 Using Propensity Score Matching in Clinical Investigations: A Discussion and Illustration Carrie Hosman 1,* and Hitinder S.

More information

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES 24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter

More information

IAPT: Regression. Regression analyses

IAPT: Regression. Regression analyses Regression analyses IAPT: Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a student project

More information

Supplement 2. Use of Directed Acyclic Graphs (DAGs)

Supplement 2. Use of Directed Acyclic Graphs (DAGs) Supplement 2. Use of Directed Acyclic Graphs (DAGs) Abstract This supplement describes how counterfactual theory is used to define causal effects and the conditions in which observed data can be used to

More information

The Regression-Discontinuity Design

The Regression-Discontinuity Design Page 1 of 10 Home» Design» Quasi-Experimental Design» The Regression-Discontinuity Design The regression-discontinuity design. What a terrible name! In everyday language both parts of the term have connotations

More information

CHL 5225 H Advanced Statistical Methods for Clinical Trials. CHL 5225 H The Language of Clinical Trials

CHL 5225 H Advanced Statistical Methods for Clinical Trials. CHL 5225 H The Language of Clinical Trials CHL 5225 H Advanced Statistical Methods for Clinical Trials Two sources for course material 1. Electronic blackboard required readings 2. www.andywillan.com/chl5225h code of conduct course outline schedule

More information

Chapter 11. Experimental Design: One-Way Independent Samples Design

Chapter 11. Experimental Design: One-Way Independent Samples Design 11-1 Chapter 11. Experimental Design: One-Way Independent Samples Design Advantages and Limitations Comparing Two Groups Comparing t Test to ANOVA Independent Samples t Test Independent Samples ANOVA Comparing

More information

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 DETAILED COURSE OUTLINE Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002 Hal Morgenstern, Ph.D. Department of Epidemiology UCLA School of Public Health Page 1 I. THE NATURE OF EPIDEMIOLOGIC

More information

Measuring association in contingency tables

Measuring association in contingency tables Measuring association in contingency tables Patrick Breheny April 8 Patrick Breheny Introduction to Biostatistics (171:161) 1/25 Hypothesis tests and confidence intervals Fisher s exact test and the χ

More information

Title:The self-reported health of U.S. flight attendants compared to the general population

Title:The self-reported health of U.S. flight attendants compared to the general population Author's response to reviews Title:The self-reported health of U.S. flight attendants compared to the general population Authors: Eileen McNeely (emcneely@hsph.harvard.edu) Version:4Date:30 January 2014

More information

SELF-PROXY RESPONSE STATUS AND QUALITY OF CIGARETTE-RELATED INFORMATION

SELF-PROXY RESPONSE STATUS AND QUALITY OF CIGARETTE-RELATED INFORMATION SELF-PROXY RESPONSE STATUS AND QUALITY OF CIGARETTE-RELATED INFORMATION Donna Eisenhower, John Hall, Randy Brown, Mathematica Policy Research, Inc. Donna Eisenhower, Mathematica Policy Research, Inc.,

More information

Joseph W Hogan Brown University & AMPATH February 16, 2010

Joseph W Hogan Brown University & AMPATH February 16, 2010 Joseph W Hogan Brown University & AMPATH February 16, 2010 Drinking and lung cancer Gender bias and graduate admissions AMPATH nutrition study Stratification and regression drinking and lung cancer graduate

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

Controlling Bias & Confounding

Controlling Bias & Confounding Controlling Bias & Confounding Chihaya Koriyama August 5 th, 2015 QUESTIONS FOR BIAS Key concepts Bias Should be minimized at the designing stage. Random errors We can do nothing at Is the nature the of

More information

OUTLIER SUBJECTS PROTOCOL (art_groupoutlier)

OUTLIER SUBJECTS PROTOCOL (art_groupoutlier) OUTLIER SUBJECTS PROTOCOL (art_groupoutlier) Paul K. Mazaika 2/23/2009 Outlier subjects are a problem in fmri data sets for clinical populations. This protocol and program are a method to identify outlier

More information

MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1. Lecture 27: Systems Biology and Bayesian Networks

MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1. Lecture 27: Systems Biology and Bayesian Networks MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1 Lecture 27: Systems Biology and Bayesian Networks Systems Biology and Regulatory Networks o Definitions o Network motifs o Examples

More information

Inverse Probability of Censoring Weighting for Selective Crossover in Oncology Clinical Trials.

Inverse Probability of Censoring Weighting for Selective Crossover in Oncology Clinical Trials. Paper SP02 Inverse Probability of Censoring Weighting for Selective Crossover in Oncology Clinical Trials. José Luis Jiménez-Moro (PharmaMar, Madrid, Spain) Javier Gómez (PharmaMar, Madrid, Spain) ABSTRACT

More information

Causal Mediation Analysis with the CAUSALMED Procedure

Causal Mediation Analysis with the CAUSALMED Procedure Paper SAS1991-2018 Causal Mediation Analysis with the CAUSALMED Procedure Yiu-Fai Yung, Michael Lamm, and Wei Zhang, SAS Institute Inc. Abstract Important policy and health care decisions often depend

More information

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research 2012 CCPRC Meeting Methodology Presession Workshop October 23, 2012, 2:00-5:00 p.m. Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy

More information

Statistics 2. RCBD Review. Agriculture Innovation Program

Statistics 2. RCBD Review. Agriculture Innovation Program Statistics 2. RCBD Review 2014. Prepared by Lauren Pincus With input from Mark Bell and Richard Plant Agriculture Innovation Program 1 Table of Contents Questions for review... 3 Answers... 3 Materials

More information

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models

White Paper Estimating Complex Phenotype Prevalence Using Predictive Models White Paper 23-12 Estimating Complex Phenotype Prevalence Using Predictive Models Authors: Nicholas A. Furlotte Aaron Kleinman Robin Smith David Hinds Created: September 25 th, 2015 September 25th, 2015

More information

FACTORIAL DESIGN. 2.5 Types of Factorial Design

FACTORIAL DESIGN. 2.5 Types of Factorial Design Research Design UNIT 2 FACTORIAL DESIGN Structure 2.0 Introduction 2.1 Objectives 2.2 Meaning of Factorial Design 2.3 Terms Related to Factorial Design 2.4 Simple Two Factor Design 2.4.1 Lay Out of Factorial

More information

Summary. 20 May 2014 EMA/CHMP/SAWP/298348/2014 Procedure No.: EMEA/H/SAB/037/1/Q/2013/SME Product Development Scientific Support Department

Summary. 20 May 2014 EMA/CHMP/SAWP/298348/2014 Procedure No.: EMEA/H/SAB/037/1/Q/2013/SME Product Development Scientific Support Department 20 May 2014 EMA/CHMP/SAWP/298348/2014 Procedure No.: EMEA/H/SAB/037/1/Q/2013/SME Product Development Scientific Support Department evaluating patients with Autosomal Dominant Polycystic Kidney Disease

More information

Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida

Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida Adaptive Testing With the Multi-Unidimensional Pairwise Preference Model Stephen Stark University of South Florida and Oleksandr S. Chernyshenko University of Canterbury Presented at the New CAT Models

More information

Challenges of Observational and Retrospective Studies

Challenges of Observational and Retrospective Studies Challenges of Observational and Retrospective Studies Kyoungmi Kim, Ph.D. March 8, 2017 This seminar is jointly supported by the following NIH-funded centers: Background There are several methods in which

More information

Sawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc.

Sawtooth Software. MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc. Sawtooth Software RESEARCH PAPER SERIES MaxDiff Analysis: Simple Counting, Individual-Level Logit, and HB Bryan Orme, Sawtooth Software, Inc. Copyright 009, Sawtooth Software, Inc. 530 W. Fir St. Sequim,

More information

Propensity scores: what, why and why not?

Propensity scores: what, why and why not? Propensity scores: what, why and why not? Rhian Daniel, Cardiff University @statnav Joint workshop S3RI & Wessex Institute University of Southampton, 22nd March 2018 Rhian Daniel @statnav/propensity scores:

More information

Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha

Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha attrition: When data are missing because we are unable to measure the outcomes of some of the

More information

AnExaminationoftheQualityand UtilityofInterviewerEstimatesof HouseholdCharacteristicsinthe NationalSurveyofFamilyGrowth. BradyWest

AnExaminationoftheQualityand UtilityofInterviewerEstimatesof HouseholdCharacteristicsinthe NationalSurveyofFamilyGrowth. BradyWest AnExaminationoftheQualityand UtilityofInterviewerEstimatesof HouseholdCharacteristicsinthe NationalSurveyofFamilyGrowth BradyWest An Examination of the Quality and Utility of Interviewer Estimates of Household

More information

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods

Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods Identifying Peer Influence Effects in Observational Social Network Data: An Evaluation of Propensity Score Methods Dean Eckles Department of Communication Stanford University dean@deaneckles.com Abstract

More information

Instrumental Variables Estimation: An Introduction

Instrumental Variables Estimation: An Introduction Instrumental Variables Estimation: An Introduction Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA The Problem The Problem Suppose you wish to

More information

Matched Cohort designs.

Matched Cohort designs. Matched Cohort designs. Stefan Franzén PhD Lund 2016 10 13 Registercentrum Västra Götaland Purpose: improved health care 25+ 30+ 70+ Registries Employed Papers Statistics IT Project management Registry

More information

How to interpret results of metaanalysis

How to interpret results of metaanalysis How to interpret results of metaanalysis Tony Hak, Henk van Rhee, & Robert Suurmond Version 1.0, March 2016 Version 1.3, Updated June 2018 Meta-analysis is a systematic method for synthesizing quantitative

More information

Vocabulary. Bias. Blinding. Block. Cluster sample

Vocabulary. Bias. Blinding. Block. Cluster sample Bias Blinding Block Census Cluster sample Confounding Control group Convenience sample Designs Experiment Experimental units Factor Level Any systematic failure of a sampling method to represent its population

More information

Section 6: Analysing Relationships Between Variables

Section 6: Analysing Relationships Between Variables 6. 1 Analysing Relationships Between Variables Section 6: Analysing Relationships Between Variables Choosing a Technique The Crosstabs Procedure The Chi Square Test The Means Procedure The Correlations

More information

Live WebEx meeting agenda

Live WebEx meeting agenda 10:00am 10:30am Using OpenMeta[Analyst] to extract quantitative data from published literature Live WebEx meeting agenda August 25, 10:00am-12:00pm ET 10:30am 11:20am Lecture (this will be recorded) 11:20am

More information

Department of International Health

Department of International Health JOHNS HOPKINS U N I V E R S I T Y Center for Clinical Trials Department of Biostatistics Department of Epidemiology Department of International Health Memorandum Department of Medicine Department of Ophthalmology

More information

A NON-TECHNICAL INTRODUCTION TO REGRESSIONS. David Romer. University of California, Berkeley. January Copyright 2018 by David Romer

A NON-TECHNICAL INTRODUCTION TO REGRESSIONS. David Romer. University of California, Berkeley. January Copyright 2018 by David Romer A NON-TECHNICAL INTRODUCTION TO REGRESSIONS David Romer University of California, Berkeley January 2018 Copyright 2018 by David Romer CONTENTS Preface ii I Introduction 1 II Ordinary Least Squares Regression

More information

UMbRELLA interim report Preparatory work

UMbRELLA interim report Preparatory work UMbRELLA interim report Preparatory work This document is intended to supplement the UMbRELLA Interim Report 2 (January 2016) by providing a summary of the preliminary analyses which influenced the decision

More information

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data 1. Purpose of data collection...................................................... 2 2. Samples and populations.......................................................

More information

Technical Specifications

Technical Specifications Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically

More information

We define a simple difference-in-differences (DD) estimator for. the treatment effect of Hospital Compare (HC) from the

We define a simple difference-in-differences (DD) estimator for. the treatment effect of Hospital Compare (HC) from the Appendix A: Difference-in-Difference Estimation Estimation Strategy We define a simple difference-in-differences (DD) estimator for the treatment effect of Hospital Compare (HC) from the perspective of

More information

Logistic regression: Why we often can do what we think we can do 1.

Logistic regression: Why we often can do what we think we can do 1. Logistic regression: Why we often can do what we think we can do 1. Augst 8 th 2015 Maarten L. Buis, University of Konstanz, Department of History and Sociology maarten.buis@uni.konstanz.de All propositions

More information

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Chapter 3: Investigating associations between two variables Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Extract from Study Design Key knowledge

More information

Appendix B Statistical Methods

Appendix B Statistical Methods Appendix B Statistical Methods Figure B. Graphing data. (a) The raw data are tallied into a frequency distribution. (b) The same data are portrayed in a bar graph called a histogram. (c) A frequency polygon

More information

Computing discriminability and bias with the R software

Computing discriminability and bias with the R software Computing discriminability and bias with the R software Christophe Pallier June 26, 2002 Abstract Psychologists often model detection or discrimination decisions as being influenced by two components:

More information

Ambiguous Data Result in Ambiguous Conclusions: A Reply to Charles T. Tart

Ambiguous Data Result in Ambiguous Conclusions: A Reply to Charles T. Tart Other Methodology Articles Ambiguous Data Result in Ambiguous Conclusions: A Reply to Charles T. Tart J. E. KENNEDY 1 (Original publication and copyright: Journal of the American Society for Psychical

More information

Self-assessment test of prerequisite knowledge for Biostatistics III in R

Self-assessment test of prerequisite knowledge for Biostatistics III in R Self-assessment test of prerequisite knowledge for Biostatistics III in R Mark Clements, Karolinska Institutet 2017-10-31 Participants in the course Biostatistics III are expected to have prerequisite

More information

Overview of Perspectives on Causal Inference: Campbell and Rubin. Stephen G. West Arizona State University Freie Universität Berlin, Germany

Overview of Perspectives on Causal Inference: Campbell and Rubin. Stephen G. West Arizona State University Freie Universität Berlin, Germany Overview of Perspectives on Causal Inference: Campbell and Rubin Stephen G. West Arizona State University Freie Universität Berlin, Germany 1 Randomized Experiment (RE) Sir Ronald Fisher E(X Treatment

More information

Lesson 9: Two Factor ANOVAS

Lesson 9: Two Factor ANOVAS Published on Agron 513 (https://courses.agron.iastate.edu/agron513) Home > Lesson 9 Lesson 9: Two Factor ANOVAS Developed by: Ron Mowers, Marin Harbur, and Ken Moore Completion Time: 1 week Introduction

More information

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug? MMI 409 Spring 2009 Final Examination Gordon Bleil Table of Contents Research Scenario and General Assumptions Questions for Dataset (Questions are hyperlinked to detailed answers) 1. Is there a difference

More information

Worksheet 6 - Multifactor ANOVA models

Worksheet 6 - Multifactor ANOVA models Worksheet 6 - Multifactor ANOVA models Multifactor ANOVA Quinn & Keough (2002) - Chpt 9 Question 1 - Nested ANOVA - one between factor In an unusually detailed preparation for an Environmental Effects

More information

Appendix III Individual-level analysis

Appendix III Individual-level analysis Appendix III Individual-level analysis Our user-friendly experimental interface makes it possible to present each subject with many choices in the course of a single experiment, yielding a rich individual-level

More information

A brief history of the Fail Safe Number in Applied Research. Moritz Heene. University of Graz, Austria

A brief history of the Fail Safe Number in Applied Research. Moritz Heene. University of Graz, Austria History of the Fail Safe Number 1 A brief history of the Fail Safe Number in Applied Research Moritz Heene University of Graz, Austria History of the Fail Safe Number 2 Introduction Rosenthal s (1979)

More information

Propensity score methods : a simulation and case study involving breast cancer patients.

Propensity score methods : a simulation and case study involving breast cancer patients. University of Louisville ThinkIR: The University of Louisville's Institutional Repository Electronic Theses and Dissertations 5-2016 Propensity score methods : a simulation and case study involving breast

More information

NEW METHODS FOR SENSITIVITY TESTS OF EXPLOSIVE DEVICES

NEW METHODS FOR SENSITIVITY TESTS OF EXPLOSIVE DEVICES NEW METHODS FOR SENSITIVITY TESTS OF EXPLOSIVE DEVICES Amit Teller 1, David M. Steinberg 2, Lina Teper 1, Rotem Rozenblum 2, Liran Mendel 2, and Mordechai Jaeger 2 1 RAFAEL, POB 2250, Haifa, 3102102, Israel

More information