Research Designs. Internal Validity


Reviewing a few things

Research Designs
  Review of a few things
  Demonstrations vs. Comparisons
  Experimental & Non-Experimental Designs
  IVs and DVs
  Between Group vs. Within-Group Designs

Kinds of bivariate research hypotheses (and evidence to support)
  Associative research hypothesis
    show a statistical relationship between the variables
  Causal research hypothesis
    temporal precedence
    statistical relationship between the variables
    no alternative explanation of the relationship -- no confounds

Kinds of Validity
  Measurement Validity
  Statistical Conclusion Validity
  External Validity
  Internal Validity

Two ways we show our studies have the validity we hope for...
  replication (same study) & convergence (variations)

Reviewing a few more things

What kind of validity relates to the generalizability of the results?
  External Validity
What are the components of this type of validity?
  Setting
  Task/Stimulus
  Social/Temporal
What validity relates to the causal interpretability of the results?
  Internal Validity
What are the components of this type of validity & what type of variable is each involved with?
  Initial Equivalence -- subject or measured variables
  Ongoing Equivalence -- procedural or manipulated variables

What are the three types of variable at the beginning of a study???
  Causal variable
  Effect variable
  Potential confounds

What are the five types at the end of the study??? Tell which are good and which are bad when testing a causal RH:
  Causal variable
  Effect variable
  Confound variable
  Control variable
  Constant

To test a causal research hypothesis, a design must provide:
  manipulation of the causal variable
  measurement of the effect variable
  elimination of confounds/alternative hypotheses (i.e., everything that isn't the causal or effect variable is either a constant or a control variable)

For practice...

Study purpose: to compare two different ways of teaching social skills (role playing vs. watching a videotape).
  Causal Variable? Teaching method
  Effect Variable? Social skills
  Potential Confounds? All other variables

Study procedure: 10 pairs of 6th grade girls role-played an initial meeting while 20 8th grade girls watched a video about meeting new people. Then all the participants took a social skills test.
  Any controls (var or const.)? Gender -- constant
  Any confounding variables? Age/grade difference
  How do you know what variables to control, so that they don't become confounds? Any variable that is not the causal variable must be controlled
  Can we causally interpret the results? Nope -- confounds!

There are two basic ways of providing evidence to support a RH: a demonstration and a comparison
  a demonstration involves using the treatment and showing that the results are good
  a comparison (an experiment) involves showing the difference between the results of the treatment and a control

Lots of commercials use demonstrations
  We washed these dirty clothes in Tide -- see how clean!!!
  After taking Tums her heartburn improved!!!
  He had a terrible headache. After taking Tylenol he's dancing with his daughter!

The evidence from a demonstration usually meets with the response -- "Compared to what??"
  a single demonstration is an implicit comparison
    doesn't this wash look better than yours?
    did your last heartburn improve this fast?
    didn't your last headache last longer than this?
  explicit comparisons are preferred!!!

When testing a causal RH: we must have a fair comparison or a well-run Experiment that provides
  initial equivalence of subject variables &
  ongoing equivalence of procedural variables

For example, what if our experiment intended to show that Tide works better compared...
  Really dirty light-colored clothes washed in a small amount of cold water for 5 minutes with a single rinse -- using Brand-X
  vs.
  Barely dirty dark-colored clothes washed in a large amount of hot water for 25 minutes with a double rinse -- using Tide

What is supposed to be the causal variable that produces the difference in the cleanness of the two loads of clothes?

Can you separate the initial and ongoing equivalence confounds?
  Initial Equivalence confounds
    dirtiness of clothes
    color of clothes
  Ongoing Equivalence confounds
    amount of water
    length of washing
    single vs. double rinse

Research Designs

True Experiments
  If well-done, can be used to test a causal RH: -- alternative hyp. are ruled out because there are no confounds!!!
Non-Experiments
  No version can be used to test a causal RH: -- can't rule out alternative hyp. because there are confounds!!

True Experiment
  random assignment of individual participants by researcher before IV manip (provides initial equivalence - subject variables - internal validity)
  treatment/manipulation performed by researcher (provides temporal precedence & ongoing equivalence - internal validity)
  good control of procedural variables during task completion & DV measurement (provides ongoing equivalence - internal validity)

Quasi-Experiment
  no random assignment of individuals (but perhaps random assignment of intact groups)
  treatment/manipulation performed by researcher
  poor or no control of procedural variables during task, etc.

Natural Groups Design (also called Concomitant Measures or Correlational Design)
  no random assignment of individuals (already in "IV groups")
  no treatment manipulation performed by researcher (all variables are measured) -- a comparison among participants already in groups
  no control of procedural variables during task, etc.

Words of Caution About the terms IVs, DVs & causal RH:s...

You might have noticed that we've not yet used these terms. Instead we've talked about causal variables and effect variables -- as you probably remember...
  the Independent Variable (IV) is the causal variable
  the Dependent Variable (DV) is the effect variable

However, from the last slide, you know that we can only say the IV causes the DV if we have a true experiment (and the internal validity it provides)
  initial equivalence (control of subject variables)
    random assignment of participants
  ongoing equivalence (control of procedural variables)
    experimenter manipulates IV, measures DV and controls all other procedural variables

The problem seems to come from there being at least three different meanings or uses of the term IV...
  1 the variable manipulated by the researcher
      it's the IV because it is independent of any naturally occurring contingencies or relationships between behaviors
      the researcher, and the researcher alone, determines the value of the IV for each participant
  2 the grouping, condition, or treatment variable
  3 the presumed causal variable in the cause-effect relationship

In these last two both the IV & DV might be measured!!! So you don't have a True Experiment...
  no IV manipulation to provide temporal precedence
  no random assignment to provide init. eq. for subject vars
  no control to provide ongoing eq. for procedural variables
  and can't test a causal RH:

This is important stuff -- so here's a different approach...

It is impossible to have sufficient internal validity to infer cause when studying some IV-DV relationships.

Say we wanted to test the idea that attending private colleges CAUSES people to be more politically conservative than does attending public universities.
  We wouldn't be able to randomly assign folks to the type of college they attend (no initial equivalence)
  We wouldn't be able to control all the other things that happen during those 4 years (no ongoing equivalence)

Here are some other categories of IVs with the same problem
  gender, age, # siblings
  ethnic background, race, neighborhood
  characteristics/behaviors of your parents
  things that happened earlier in your life

IVs vs Confounds

Both IVs and Confounds are causal variables!!!
  variables that may cause (influence, etc.) scores on the DVs

What's the difference???
  The IV is the intended causal variable in the study! We are trying to study if & how & how much the IV influences the DV!
  A confound interferes with our ability to study the causal relationship between the IV & the DV, because it is another causal variable that might be influencing the DV.

If the IV difference between the conditions is confounded, then if there is a DV difference between the conditions, we don't know if that difference was caused by the IV, the confound, or a combination of both!!!!
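
To make the IV/constant/confound distinction concrete, here is a minimal Python sketch that compares two conditions and labels each variable. The function name `classify_variables`, the dictionary keys, and the rule "same value in both conditions = constant, different value = confound" are assumptions made for illustration; the values come from the Tide vs. Brand-X wash described earlier. It does not model control variables that are balanced across conditions.

```python
def classify_variables(condition_a, condition_b, iv_name):
    """Label each variable as the IV, a constant, or a confound.

    Illustrative rule only: any variable other than the IV that differs
    between the two conditions is treated as a confound.
    """
    labels = {}
    for name in condition_a:
        if name == iv_name:
            labels[name] = "IV"
        elif condition_a[name] == condition_b[name]:
            labels[name] = "constant"
        else:
            labels[name] = "confound"
    return labels


# The unfair Tide vs. Brand-X comparison (hypothetical variable names).
brand_x_load = {
    "detergent": "Brand-X",
    "dirtiness": "really dirty",
    "color": "light",
    "water amount": "small",
    "water temperature": "cold",
    "wash minutes": 5,
    "rinse": "single",
}
tide_load = {
    "detergent": "Tide",
    "dirtiness": "barely dirty",
    "color": "dark",
    "water amount": "large",
    "water temperature": "hot",
    "wash minutes": 25,
    "rinse": "double",
}

labels = classify_variables(brand_x_load, tide_load, "detergent")
confounds = sorted(name for name, label in labels.items() if label == "confound")
print("Confounds:", confounds)

# A fair comparison: identical loads and procedure, only the detergent differs.
fair_tide_load = dict(brand_x_load, detergent="Tide")
fair_labels = classify_variables(brand_x_load, fair_tide_load, "detergent")
print("Confounds in the fair comparison:",
      [n for n, lab in fair_labels.items() if lab == "confound"])
```

In the unfair comparison every variable other than the detergent differs between the loads, so any difference in cleanness could come from the detergent, from any of those variables, or from a combination.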

Between Groups vs. Within-Groups Designs

Between Groups (also called Between Subjects or Cross-sectional)
  each participant is in one (& only one) of the treatments/conditions
  different groups of participants are in each treatment/condition
  typically used to study differences -- when, in application, a participant will usually be in one treatment/condition or another

Within-Groups Designs (also called Within-Subjects, Repeated Measures, or Longitudinal)
  each participant is in all (every one) of the treatments/conditions
  one group of participants, each one in every treatment/condition
  typically used to study changes -- when, in application, a participant will usually be moving from one condition to another

Between Groups Design
  Experimental Tx    Traditional Tx
  Pat                Glen
  Sam                Sally
  Kim                Kishon
  Lou                Phil
  Todd               Rae
  Bill               Kris
  Different participants in each treatment/condition

Within-Groups Design
  Experimental Tx    Traditional Tx
  Pat                Pat
  Sam                Sam
  Kim                Kim
  Lou                Lou
  Todd               Todd
  Bill               Bill
  All participants in each treatment/condition

Research Designs

Putting this all together -- here's a summary of the four types of designs we'll be working with...

                                   True Experiment                      Non-experiment
                                   w/ proper RA/CB - init equiv         no or poor RA/CB
                                   manip of IV by researcher            may have IV manip

  Between Groups                   Results might be causally            Results can not be
  (dif parts. in each              interpreted -- if good               causally interpreted
  IV condition)                    ongoing equivalence

  Within-Groups                    Results might be causally            Results can not be
  (each part. in all               interpreted -- if good               causally interpreted
  IV conditions)                   ongoing equivalence
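
The four-cell summary can also be written as a small decision rule. The sketch below is illustrative only: the function `classify_design` and its three boolean parameters are hypothetical names, and the rule simply restates the summary (a True Experiment needs proper RA/CB plus researcher manipulation of the IV; anything else is a Non-experiment).

```python
def classify_design(proper_ra_or_cb, researcher_manipulates_iv,
                    each_participant_in_all_conditions):
    """Return (design label, what can be said about causal interpretation)."""
    if each_participant_in_all_conditions:
        groups = "Within-Groups"
    else:
        groups = "Between Groups"

    # Both elements are required for a True Experiment.
    if proper_ra_or_cb and researcher_manipulates_iv:
        kind = "True Experiment"
        interpretation = "might be causally interpreted -- if good ongoing equivalence"
    else:
        kind = "Non-experiment"
        interpretation = "can not be causally interpreted"

    return groups + " " + kind, interpretation


# Different participants randomly assigned to Experimental vs. Traditional Tx.
print(classify_design(True, True, False))
# Participants already belong to the groups being compared.
print(classify_design(False, False, False))
```

Note that even the True Experiment cells return "might": the label says what the design permits, not that ongoing equivalence was actually achieved.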

Four versions of the same study -- which is which?
  Each participant in our object identification study was asked to select whether they wanted to complete the visual or the auditory condition. -- BG Non
  Each participant in our object identification study completed both the visual and the auditory conditions in a randomly chosen order for each participant. -- WG Exp
  Each participant in our object identification study was randomly assigned to complete either the visual or the auditory condition. -- BG Exp
  Each participant in our object identification study completed first the visual and then the auditory condition. -- WG Non

So, you gotta have a True Experiment for the results to be causally interpretable? But, does running a True Experiment guarantee that the results will be causally interpretable?

What are the elements of a True Experiment??
  Random Assignment of Individuals to IV conditions by the researcher before manipulation of the IV
    Supposed to give us initial equivalence of measured/subject variables.
  Manipulation of the IV by the researcher
    Supposed to give us temporal precedence & help control ongoing equivalence of manipulated/procedural variables

Please note: A true experiment is defined by these two elements! BUT there is an asymmetry between true exp and causal interp. Huh? True Exp is necessary, but not sufficient, for causal interpretability!

What could possibly go wrong???
  Random Assignment might not "take"
    RA is a probabilistic process; there's no guarantee that the groups will be equivalent on all subject variables!
  Might introduce a confound when doing the IV manipulation
    might treat the conditions differently in ways other than the IV
  May miss or even cause other ongoing equivalence confounds
    often, especially for younger researchers or newer research topics, we don't really know what to control
    we may know what to control and just not get it done
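
The point that random assignment is probabilistic can be illustrated with a short simulation. This is a sketch under stated assumptions: the 20 participants, their ages (standing in for any subject variable), the seed, and the helper names `randomly_assign` and `mean` are all invented for illustration and use only Python's standard `random` module.

```python
import random


def randomly_assign(participants, rng):
    """Shuffle the pool and split it into two equal-sized groups."""
    shuffled = list(participants)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]


def mean(values):
    return sum(values) / len(values)


rng = random.Random(2024)  # seeded only so the run is repeatable

# Hypothetical pool: 20 participants with one subject variable (age).
ages = {"P" + str(i): rng.randint(18, 60) for i in range(20)}

# Repeat the random assignment many times and record how far apart
# the two groups' mean ages end up each time.
gaps = []
for _ in range(1000):
    group_a, group_b = randomly_assign(ages, rng)
    gap = abs(mean([ages[p] for p in group_a]) - mean([ages[p] for p in group_b]))
    gaps.append(gap)

print("average gap in mean age:", round(mean(gaps), 2))
print("largest gap in mean age:", round(max(gaps), 2))
```

Across many assignments the groups are similar on average, but any single assignment can leave them noticeably different on a subject variable, which is what it means for RA not to "take".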

If only True Experiments can be causally interpreted, why even bother running non-experiments?

1st: Remember that we can't always run a true experiment!
  Lots of variables we care about can't be RA & manip -- gender, family background, histories and experiences, personality, etc.
  Even if we can RA & manip, lots of studies require long-term or field research that makes ongoing equivalence (also required for causal interp) very difficult or impossible.
  We would greatly limit the information we could learn about how variables are related to each other if we only ran studies that could be causally interpreted.

2nd: We get very useful information from non-experiments!
  True, if we don't run a True Experiment, we are limited to learning predictive information and testing associative RH:
  But associative information is the core of our understanding about what variables relate to each other and how they relate
  Most of the information we use in science, medicine, education, politics, and everyday decisions is based on only associative information, and things go pretty well!
  Also, designing and conducting True Experiments is made easier if we have a rich understanding of what variables are potential causes and confounds of the behavior we are studying

Between Groups True Experiment
  participant pool
    random participant assignment
      not-to-be-treated group -> no treatment -> control group (represents the Untreated population)
      to-be-treated group -> treatment -> experimental group (represents the Treated population)
  Rem -- samples & groups are intended to represent populations

Within-Groups True Experiment
  participant pool
    random participant assignment
      1/2 of subjects: untreated, then treated
      1/2 of subjects: treated, then untreated
  Each participant represents each target population (Untreated and Treated), in a counterbalanced order

Between Groups Non-experiment
  Untreated population -> control group
  Treated population -> experimental group
  The design has the external validity advantage that each subject REALLY is a member of the population of interest (but we still need a representative sample)
  The design has the internal validity disadvantages that...
    we don't know how participants end up in the populations
      no random participant assignment (no initial equivalence)
    we don't know how the populations differ in addition to the treatment per se
      no control of procedural variables (no ongoing equivalence)

Within-Groups Non-experiment
  Untreated population -> treatment occurs to the whole population -> Treated population
  control group (Untreated), treatment group (Treated)
  The design has the external validity advantage that each subject REALLY is a member of each population of interest (but we still need a representative sample)
  The design has the internal validity disadvantages that...
    we don't know how the populations differ in addition to the treatment per se
      no control of procedural variables (no ongoing equivalence)
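
The counterbalanced order in the Within-Groups True Experiment can be sketched in a few lines. This is one plausible way to do it, not a prescribed procedure: the function name `counterbalance` and the seed are assumptions, and the participant names are borrowed from the earlier Between/Within example.

```python
import random


def counterbalance(participants, conditions, rng):
    """Randomly give half the participants each order of two conditions."""
    first, second = conditions
    shuffled = list(participants)
    rng.shuffle(shuffled)  # random assignment of participants to an order
    half = len(shuffled) // 2

    orders = {}
    for person in shuffled[:half]:
        orders[person] = (first, second)
    for person in shuffled[half:]:
        orders[person] = (second, first)
    return orders


participants = ["Pat", "Sam", "Kim", "Lou", "Todd", "Bill"]
orders = counterbalance(participants, ("untreated", "treated"), random.Random(7))

for person in participants:
    print(person, "->", " then ".join(orders[person]))
```

Every participant completes both conditions, and because order is assigned at random with half in each sequence, order effects are spread evenly across the two conditions rather than piling up in one.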

There is always "just one more thing"...

Sometimes there is no counterbalancing in a Within-groups design, but there can still be causal interpretation.
  A good example is when the IV is amount of practice, with 10-practice and 50-practice conditions.
  There is no way a person can be in the 50-practice condition and then be in the 10-practice condition.
  Under these conditions (called a "seriated IV"), what matters is whether or not we can maintain ongoing equivalence so that the only reason for a change in performance would be the increased practice.
  The length of time involved is usually a very important consideration.

Which of these would you be more comfortable giving a causal interpretation?
  When we gave folks an initial test, 10 practices, and then the test again, we found that their performance went up!
  When we gave folks an initial assessment, 6 months of once-a-week therapy, and then the assessment again, their depression went down!