Experiments. ESP178 Research Methods Dillon Fitch 1/26/16. Adapted from lecture by Professor Susan Handy

Similar documents
Research Design. ESP178 Research Methods Dr. Susan Handy 1/19/16

Research Design. ESP 178 Applied Research Methods Calvin Thigpen 1/19/17

Review and Wrap-up! ESP 178 Applied Research Methods Calvin Thigpen 3/14/17 Adapted from presentation by Prof. Susan Handy

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity

Experimental Research. Types of Group Comparison Research. Types of Group Comparison Research. Stephen E. Brock, Ph.D.

VALIDITY OF QUANTITATIVE RESEARCH

04/12/2014. Research Methods in Psychology. Chapter 6: Independent Groups Designs. What is your ideas? Testing

Chapter 9 Experimental Research (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters.

Experimental and Quasi-Experimental designs

26:010:557 / 26:620:557 Social Science Research Methods

Threats to Validity in Experiments. The John Henry Effect

Educational Research. S.Shafiee. Expert PDF Trial

Design of Experiments & Introduction to Research

Experimental Design. Dewayne E Perry ENS C Empirical Studies in Software Engineering Lecture 8

2013/4/28. Experimental Research

STATISTICAL CONCLUSION VALIDITY

Lecture II: Difference in Difference. Causality is difficult to Show from cross

Instrumental Variables Estimation: An Introduction

In this chapter we discuss validity issues for quantitative research and for qualitative research.

Why randomize? Rohini Pande Harvard University and J-PAL.

Rival Plausible Explanations

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research

Validity and Quantitative Research. What is Validity? What is Validity Cont. RCS /16/04

Research Questions, Variables, and Hypotheses: Part 2. Review. Hypotheses RCS /7/04. What are research questions? What are variables?

What We Will Cover in This Section

Statistical Power Sampling Design and sample Size Determination

9 research designs likely for PSYC 2100

STAB22 Statistics I. Lecture 12

Conducting Research in the Social Sciences. Rick Balkin, Ph.D., LPC-S, NCC

Lecture II: Difference in Difference and Regression Discontinuity

The Practice of Statistics 1 Week 2: Relationships and Data Collection

You must answer question 1.

Previous Example. New. Tradition

More on Experiments: Confounding and Obscuring Variables (Part 1) Dr. Stefanie Drew

CHAPTER NINE DATA ANALYSIS / EVALUATING QUALITY (VALIDITY) OF BETWEEN GROUP EXPERIMENTS

Underlying Theory & Basic Issues

Levels of Evaluation. Question. Example of Method Does the program. Focus groups with make sense? target population. implemented as intended?

SINGLE-CASE RESEARCH. Relevant History. Relevant History 1/9/2018

Experimental Research I. Quiz/Review 7/6/2011

Research Methods & Design Outline. Types of research design How to choose a research design Issues in research design

The Logic of Data Analysis Using Statistical Techniques M. E. Swisher, 2016

Quizzes (and relevant lab exercises): 20% Midterm exams (2): 25% each Final exam: 30%

What We Will Cover in This Section

Challenges of Observational and Retrospective Studies

Experimental Validity

MBACATÓLICA JAN/APRIL Marketing Research. Fernando S. Machado. Experimentation

Goal: To become familiar with the methods that researchers use to investigate aspects of causation and methods of treatment

Variables Research involves trying to determine the relationship between two or more variables.

n Outline final paper, add to outline as research progresses n Update literature review periodically (check citeseer)

Evaluating Social Programs Course: Evaluation Glossary (Sources: 3ie and The World Bank)

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when.

Experimental design (continued)

Chapter 13 Summary Experiments and Observational Studies

Research Methodology. Characteristics of Observations. Variables 10/18/2016. Week Most important know what is being observed.

Research Design & Protocol Development

Research Approach & Design. Awatif Alam MBBS, Msc (Toronto),ABCM Professor Community Medicine Vice Provost Girls Section

Introduction to Experiments

Lecture 4: Research Approaches

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Reflection Questions for Math 58B

Psychology Research Process

Lecture 9 Internal Validity

Chapter Three: Sampling Methods

Psychology Research Process

Threats and Analysis. Shawn Cole. Harvard Business School

UNIT II: RESEARCH METHODS

Leaning objectives. Outline. 13-Jun-16 BASIC EXPERIMENTAL DESIGNS

One slide on research question Literature review: structured; holes you will fill in Your research design

Designing Experiments... Or how many times and ways can I screw that up?!?

CAUTIONS ABOUT THE PRACTICE EXAM

RESEARCH METHODS. Winfred, research methods, ; rv ; rv

Research Designs. Inferential Statistics. Two Samples from Two Distinct Populations. Sampling Error (Figure 11.2) Sampling Error

Vocabulary. Bias. Blinding. Block. Cluster sample

RESEARCH METHODS. Winfred, research methods,

Experimental and quasi-experimental designs:

MAT Mathematics in Today's World

The validity of inferences about the correlation (covariation) between treatment and outcome.

Study Design. Svetlana Yampolskaya, Ph.D. Summer 2013

HPS301 Exam Notes- Contents

Communication Research Practice Questions

Conceptualization. ESP178 Research Methods Dr. Susan Handy 1/7/16

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1

RESEARCH METHODS. A Process of Inquiry. tm HarperCollinsPublishers ANTHONY M. GRAZIANO MICHAEL L RAULIN

M6728. Goals. Depression Scores. Research Designs

Lecture 3. Previous lecture. Learning outcomes of lecture 3. Today. Trustworthiness in Fixed Design Research. Class question - Causality

EXPERIMENTAL AND EX POST FACTO DESIGNS M O N A R A H I M I

Regression Discontinuity Design

Causal Research Design- Experimentation

Lecture Start

EXPERIMENTAL RESEARCH DESIGNS

Independent Variables Variables (factors) that are manipulated to measure their effect Typically select specific levels of each variable to test

Methods of Randomization Lupe Bedoya. Development Impact Evaluation Field Coordinator Training Washington, DC April 22-25, 2013

Chapter 11. Experimental Design: One-Way Independent Samples Design

Randomization as a Tool for Development Economists. Esther Duflo Sendhil Mullainathan BREAD-BIRS Summer school

Topic #6. Quasi-experimental designs are research studies in which participants are selected for different conditions from pre-existing groups.

CHAPTER LEARNING OUTCOMES

9.0 L '- ---'- ---'- --' X

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

AP Psychology -- Chapter 02 Review Research Methods in Psychology

Public Policy & Evidence:

Transcription:

Experiments ESP178 Research Methods Dillon Fitch 1/26/16 Adapted from lecture by Professor Susan Handy

Recap Causal Validity Criterion Association Non-spurious Time order Causal Mechanism Context Explanation If cause happens, effect happens; if no cause, no effect. No extraneous third variable that can explain association. Cause comes before effect. Logical explanation for how cause leads to effect. Understand the conditions under which the relationship holds Experimental Designs

Movie Trailer BBC story

Experimental Design Terminology Depends on discipline Depends on sub-discipline Confusing Almost as bad as terminology in statistics.we ll get to that later as well.

Experimental Design Concepts Treatment (Condition)- unique IV of concern that can change Levels- discrete measures of the treatment Experimental unit what gets treated Manipulation- change in the treatment level Experimenter controlled (Intervention experiment) Non-experimenter controlled (Natural experiment) Assignment- treatments given to experimental units Random Non-Random (e.g. matching, counterbalanced) Control Experimental (manipulation, assignment, exclusion) Statistical (inclusion) Control is out of control Richard McElreath

Experimental Design Classification Quasi vs. True Between-Subject (parallel groups) vs. Within-Subject (repeated measures) By name (e.g. factorial, randomized block, etc.)

Experimental Design Criterion Association Aspect of Experimental Design Treatment group versus control group http://0ptimus.com/what-do-we-mean-when-we-say-experiment/?r=1&l=ri&fst=0

Experimental Design Criterion Association Aspect of Experimental Design Treatment group versus control group

Experimental Design Criterion Association Nonspuriousness Aspect of Experimental Design Treatment group versus control group Randomly assign participants to treatment and control

Experimental Design Criterion Association Nonspuriousness Aspect of Experimental Design Treatment group versus control group Randomly assign participants to treatment and control

Experimental Design Criterion Association Nonspuriousness Aspect of Experimental Design Treatment group versus control group Randomly assign participants to treatment and control Match control group members to treatment group members Match

Experimental Design Criterion Association Nonspuriousness Time Order Aspect of Experimental Design Treatment group versus control group Randomly assign participants to treatment and control Measure before and after treatment

Three Basic Experimental Designs Notation R/N = randomize/non-randomize assignment O = observation X = treatment E = effect size

Between-Subject Post-Test Experiment R X O1 Treatment Group E=O1-O2 O2 Control Group

Visualizing Effect Size Dependent (Response) Variable 10 Treatment Group Control Group Treatment effect = 10 Treatment X O1 O2 Time

Between-Subject Post-Test Experiment R X O1 Treatment Group E=O1-O2 O2 Control Group Pros: Simple Handles internal validity well: Maturation, Testing, Regression to the mean, Selection, pre-post test interaction Cons: Need large sample (for Randomization to work) Time order not fully accounted for Mortality may still be an issue Individual differences must be expected to washout (Intra subject variability)

Between-Subject Pre and Post- Test Experiment R O1 X O2 Treatment Group E=(O2-O1)- O3 O4 Control Group (O4-O3)

Visualizing Effect Size Dependent (Response) Variable 20 Treatment Group Control Group 5 Treatment effect = 15 O1 O3 Treatment X Time O2 O4

Between-Subject Pre and Post- Test Experiment R O1 X O2 Treatment Group E=(O2-O1)- O3 O4 Control Group (O4-O3) Pros: Simple Handles internal validity well: Maturation, Testing, Regression to the mean, Selection Time order Cons: Need large sample (for Randomization to work) Pre-post test interaction Mortality may still be an issue Individual differences must be expected to washout (Intra subject variability)

Within-Subject Experiment (AKA Repeated Measures, AKA Crossover) R/N O1 X1 O2 X2 O3 A Group O4 X2 O5 X1 O6 B Group E(X1)= 1 nn AA nn ii=1 AA (O2i O1i)+ 1 nn bb nn ii=1 BB 2 (O6i O4i)

Visualizing Effect Size Dependent (Response) Variable Subject 1 base Subject 2 base Group A: X1 effect = 2 X2 effect = 1 3 1 E: X1 effect = 0.5 X2 effect = 2.75 0 2 Group B: X1 effect = -1 X2 effect = 4.5 3 6-2 1 O1 X1 O2 X2 O3 O1 X2 O2 X1 O3 Time

Within-Subject Experiment (AKA Repeated Measures, AKA Crossover) R/N O1 X1 O2 X2 O3 A Group O4 X2 O5 X1 O6 Pros: Small sample sizes OK (individuals act as own control) Cost efficient Handles mortality well Confounding covariates reduced (Individual difference are OK) Cons: Time based effects: Order effects Carry-over effects Learning effects (testing) Pre-post test interaction (for all treatments) Regression to the mean Maturity B Group

My crossover study What is the relationship between the road environment and bicyclist stress?

Stress and Heart Rate Variability (HRV) HRV: variability in beat-to-beat time intervals, and shown to be related to stress RR INTERVAL (S) 1.2 1.1 1 0.9 0.8 0.7 0.6 Deep Breathing Stationary Bike 0.5 0.4

My crossover study Sample: 20 Female Undergrads Dependent Variable: Heart Rate Variability (SDNN) Independent Variables: Experimental Condition Exertion Personal Characteristics Weather etc.

My crossover study Treatments: B St. Russell Blvd. R REST O1 B St. O2 REST Russell O3 A Group O4 Russell O5 B St. O6 B Group Counterbalanced Treatments

Simulated Effect Size

Can we be sure that the experiment has the effect we think it has?

I.e. Are our inferences correct? Judgment of Null Hypothesis (H 0 ) Table of error types Reject Fail to reject Null hypothesis (H 0 ) is Valid/True Type I error (False Positive) Correct inference (True Negative) Invalid/False Correct inference (True Positive) Type II error (False Negative)

I.e. Are our inferences correct? Type S (Sign) error: Are we inferring the correct direction of treatment effect? Type M (Magnitude) error: Are we inflating (possibly deflating) the real magnitude of the treatment effect? Source: Gelman and Carlin (2014)

Susan Handy s quasilongitudinal Natural Experiment Movers Do your children play outside more or less than they did before you moved? Non-movers Do your children play outside more or less than they did a year ago?

Threats to Internal Validity for Experiments Threat Selection bias Definition when characteristics of treatment and control groups differ

Threats to Internal Validity for Experiments Threat Selection bias Endogenous change Definition when characteristics of treatment and control groups differ when subjects develop or change independent of treatment testing, maturation Testing

Threats to Internal Validity for Experiments Threat Selection bias Endogenous change Definition when characteristics of treatment and control groups differ when subjects develop or change independent of treatment testing, maturation Maturation

Threats to Internal Validity for Experiments Threat Selection bias Endogenous change History effects Definition when characteristics of treatment and control groups differ when subjects develop or change independent of treatment testing, maturation when something occurs during the experiment that influences outcomes

Threats to Internal Validity for Experiments Threat Selection bias Endogenous change History effects Contamination Definition when characteristics of treatment and control groups differ when subjects develop or change independent of treatment testing, maturation when something occurs during the experiment that influences outcomes when one group is aware of the other group and is influenced

Threats to Internal Validity for Experiments Threat Selection bias Endogenous change History effects Contamination Treatment misidentification Definition when characteristics of treatment and control groups differ when subjects develop or change independent of treatment testing, maturation when something occurs during the experiment that influences outcomes when one group is aware of the other group and is influenced a process the researcher is not aware of is responsible for apparent effect of treatment expectancy effect (nocebo vs. placebo), observer effect

Threats to Internal Validity for Experiments Threat Selection bias Endogenous change History effects Contamination Treatment misidentification Definition when characteristics of treatment and control groups differ when subjects develop or change independent of treatment testing, maturation when something occurs during the experiment that influences outcomes when one group is aware of the other group and is influenced a process the researcher is not aware of is responsible for apparent effect of treatment expectancy effect (nocebo vs. placebo), observer effect

Threats to Internal Validity for Experiments Threat Selection bias Endogenous change History effects Contamination Treatment misidentification Definition when characteristics of treatment and control groups differ when subjects develop or change independent of treatment testing, maturation when something occurs during the experiment that influences outcomes when one group is aware of the other group and is influenced a process the researcher is not aware of is responsible for apparent effect of treatment expectancy effect (nocebo vs. placebo), observer effect Control group gets a fake treatment

Threats to Internal Validity for Experiments Threat Selection bias Endogenous change History effects Contamination Treatment misidentification Definition when characteristics of treatment and control groups differ when subjects develop or change independent of treatment testing, maturation when something occurs during the experiment that influences outcomes when one group is aware of the other group and is influenced a process the researcher is not aware of is responsible for apparent effect of treatment expectancy effect (nocebo vs. placebo), observer effect Feeling expected to

Threats to Internal Validity for Experiments Threat Selection bias Endogenous change History effects Contamination Treatment misidentification Mortality Definition when characteristics of treatment and control groups differ when subjects develop or change independent of treatment testing, maturation when something occurs during the experiment that influences outcomes when one group is aware of the other group and is influenced a process the researcher is not aware of is responsible for apparent effect of treatment expectancy effect (nocebo vs. placebo), observer effect Participants drop out at differential rates by group for systematic reason

Careful design and execution of experiments is critical!

Replication is important!

Can we be sure that the effect is the same beyond the experiment?

Generalizability more next week Population Participants A Different Population

Interaction of testing and treatment

But we need them!

TRB Special Report 298 careful before-and-after studies of policy interventions to promote more compact, mixed-used development to help determine what works and what does not Natural experiments Intervention studies Policy evaluation

Intervention Studies = Researcher designs and implements the treatment Natural Experiments = Researcher does not control the treatment

Davis Natural Experiments Target Store opening: Shopping VMT before and after Fifth Street Road Diet: Mode split to downtown and bike/ped safety before and after West Cannery Village Project: Project: VMT and active travel before-and-after moving into net-zero energy community

Natural experiments for programs Educational programs Promotional programs Bike give-away programs 53

Natural Experiments for Infrastructure http://www.google.com/url?sa=i&rct=j&q=&esrc=s&frm=1&source=images& cd=&docid=gf_snfcppwuirm&tbnid=cbyuwdleswlnqm:&ved=0caeqjxw& url=http%3a%2f%2fgreenlaneproject.org%2f&ei=nr0zusovmom4iglcyydq CQ&bvm=bv.52164340,d.cGE&psig=AFQjCNFV1BMutZ6GjiFrxkM4Vidcp0TcJg &ust=1379208956696122 Green Lane Project Portland State University 5 cities http://latimesblogs.latimes.com/photos/uncat egorized/2009/02/20/expomap_2.gif Expo Line Opening UC Irvine, USC 1 line 54

To Test Housing Program, Some Are Denied Aid New York Times, 12/8/2010 Half of the test subjects people who are behind on rent and in danger of being evicted are being denied assistance from the program for two years, with researchers tracking them to see if they end up homeless. http://www.nytimes.com/2010/12/09/nyregion/09placebo.html?pagewanted=all&_r=0

Moving to Opportunity Moving to Opportunity for Fair Housing (MTO) is a 10-year research demonstration that combines tenant-based rental assistance with housing counseling to help very low-income families move from poverty-stricken urban areas to low-poverty neighborhoods. Treatment group: Randomly selected households with children receive housing counseling and vouchers that must be used in areas with less than 10 percent poverty Control groups: One already receiving vouchers, one just coming into voucher program http://portal.hud.gov/hudportal/hud?src=/programdescription/mto

Study progression Cross-sectional studies Establish associations Basis for designing interventions Intervention studies Before-and-after measures Establish causal relationship

To do Work on assignment 2!!! Office hours Friday Experiments exercise on Tuesday Read articles for Friday section see website Don t forget lecture notes on website!

Refining your conceptual model Your stated question should match your model Keep your question general, even if it is motivated by a specific place Make sure your variables are consistent with your level of analysis Do you have a policy lever? Simplify where possible, but include the important control variables!