Lecture 9 Internal Validity

Similar documents
Experimental Psychology

26:010:557 / 26:620:557 Social Science Research Methods

VALIDITY OF QUANTITATIVE RESEARCH

I. Identifying the question Define Research Hypothesis and Questions

Experimental Design. Dewayne E Perry ENS C Empirical Studies in Software Engineering Lecture 8

A. Indicate the best answer to each the following multiple-choice questions (20 points)

Research Methods & Design Outline. Types of research design How to choose a research design Issues in research design

Validity and Quantitative Research. What is Validity? What is Validity Cont. RCS /16/04

STATISTICAL CONCLUSION VALIDITY

EXPERIMENTAL RESEARCH DESIGNS

Outline. What s inside this paper? My expectation. Software Defect Prediction. Traditional Method. What s inside this paper?

In this chapter we discuss validity issues for quantitative research and for qualitative research.

PLS 506 Mark T. Imperial, Ph.D. Lecture Notes: Reliability & Validity

Research Design. Beyond Randomized Control Trials. Jody Worley, Ph.D. College of Arts & Sciences Human Relations

MBios 478: Systems Biology and Bayesian Networks, 27 [Dr. Wyrick] Slide #1. Lecture 27: Systems Biology and Bayesian Networks

Conducting Research in the Social Sciences. Rick Balkin, Ph.D., LPC-S, NCC

9 research designs likely for PSYC 2100

MS&E 226: Small Data

Chapter 9 Experimental Research (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters.

Unit 1 Exploring and Understanding Data

Representation and Analysis of Medical Decision Problems with Influence. Diagrams

EPSE 594: Meta-Analysis: Quantitative Research Synthesis

Causal Research Design- Experimentation

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

Research Design. Source: John W. Creswell RESEARCH DESIGN. Qualitative, Quantitative, and Mixed Methods Approaches Third Edition

Political Science 15, Winter 2014 Final Review

Bayesian (Belief) Network Models,

Simpson s Paradox and the implications for medical trials

The Regression-Discontinuity Design

Supplement 2. Use of Directed Acyclic Graphs (DAGs)

Book review of Herbert I. Weisberg: Bias and Causation, Models and Judgment for Valid Comparisons Reviewed by Judea Pearl

Supplementary notes for lecture 8: Computational modeling of cognitive development

RESEARCH METHODS. Winfred, research methods, ; rv ; rv

RESEARCH METHODS. A Process of Inquiry. tm HarperCollinsPublishers ANTHONY M. GRAZIANO MICHAEL L RAULIN

RESEARCH METHODS. Winfred, research methods,

A Brief Introduction to Bayesian Statistics

Chapter 02 Developing and Evaluating Theories of Behavior

Student Performance Q&A:

Phil 12: Logic and Decision Making (Winter 2010) Directions and Sample Questions for Final Exam. Part I: Correlation

From Description to Causation

Clever Hans the horse could do simple math and spell out the answers to simple questions. He wasn t always correct, but he was most of the time.

04/12/2014. Research Methods in Psychology. Chapter 6: Independent Groups Designs. What is your ideas? Testing

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

Topic #2. A key criterion in evaluating any test, measure, or piece of research is validity.

ERA: Architectures for Inference

How was your experience working in a group on the Literature Review?

investigate. educate. inform.

Definitions of Nature of Science and Scientific Inquiry that Guide Project ICAN: A Cheat Sheet

Study Design. Svetlana Yampolskaya, Ph.D. Summer 2013

Empirical Knowledge: based on observations. Answer questions why, whom, how, and when.

Research Approach & Design. Awatif Alam MBBS, Msc (Toronto),ABCM Professor Community Medicine Vice Provost Girls Section

Confounding and Effect Modification

P(HYP)=h. P(CON HYP)=p REL. P(CON HYP)=q REP

Internal Validity and Experimental Design

Communication Research Practice Questions

BAYESIAN HYPOTHESIS TESTING WITH SPSS AMOS

Statistics. Dr. Carmen Bruni. October 12th, Centre for Education in Mathematics and Computing University of Waterloo

CHAPTER III RESEARCH METHODOLOGY

SUPPLEMENTARY INFORMATION

How many speakers? How many tokens?:

The following are questions that students had difficulty with on the first three exams.

Cognitive domain: Comprehension Answer location: Elements of Empiricism Question type: MC

Signal Detection Theory and Bayesian Modeling

Formative and Impact Evaluation. Formative Evaluation. Impact Evaluation

Research Approaches Quantitative Approach. Research Methods vs Research Design

SINGLE-CASE RESEARCH. Relevant History. Relevant History 1/9/2018

STATISTICS & PROBABILITY

Bayes Linear Statistics. Theory and Methods

Problem Set and Review Questions 2

Durkheim. Durkheim s fundamental task in Rules of the Sociological Method is to lay out

Insight Assessment Measuring Thinking Worldwide

Running head: MOBILITY RESEARCH CRITIQUE 1. Mobility Research Critique. Amy Bradley, Karilyn Bufka and Jessica Riley. Ferris State University

Experimental Design Part II

Research Strategies: How Psychologists Ask and Answer Questions. Module 2

Knowledge is rarely absolutely certain. In expert systems we need a way to say that something is probably but not necessarily true.

HPS301 Exam Notes- Contents

CHAPTER NINE DATA ANALYSIS / EVALUATING QUALITY (VALIDITY) OF BETWEEN GROUP EXPERIMENTS

Artificial Intelligence Programming Probability

Key Ideas. Explain how science is different from other forms of human endeavor. Identify the steps that make up scientific methods.

Chapter 1 Introduction to I/O Psychology

Research Plan And Design

STA630 Research Methods Solved MCQs By

Overview of Perspectives on Causal Inference: Campbell and Rubin. Stephen G. West Arizona State University Freie Universität Berlin, Germany

How to Do Experiments

Stat 13, Intro. to Statistical Methods for the Life and Health Sciences.

The Logic of Causal Inference

Introduction to Applied Research in Economics

Regression Discontinuity Analysis

6. A theory that has been substantially verified is sometimes called a a. law. b. model.

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Modeling State Space Search Technique for a Real World Adversarial Problem Solving

Lecture 3. Previous lecture. Learning outcomes of lecture 3. Today. Trustworthiness in Fixed Design Research. Class question - Causality

2013/4/28. Experimental Research

The Research Process: Coming to Terms

You must answer question 1.

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Part I - Multiple Choice. 10 points total.

Design of Experiments & Introduction to Research

(CORRELATIONAL DESIGN AND COMPARATIVE DESIGN)

Transcription:

Lecture 9 Internal Validity

Objectives Internal Validity Threats to Internal Validity Causality Bayesian Networks

Internal validity The extent to which the hypothesized relationship between 2 or more variables is supported by evidence in a given study.

Validity Internal Validity is the approximate truth about inferences regarding cause-effect or causal relationships. External Validity: Assuming that there is a causal relationship in this study between the constructs of the cause and the effect, can we generalize this effect to other persons, places or times? Statistical validity has to do with basing conclusions on proper use of statistics. Construct validity refers to the degree to which inferences can legitimately be made from your study to the theoretical constructs on which those operationalizations were based.

Internal validity

External Validity

Statistical Validity In Reality We Conclude H0 (null hypothesis) true H1 (alternative hypothesis) false H0 (null hypothesis) false H1 (alternative hypothesis) True We accept H0 We reject H1 1- I (e.g.,.95) THE CONFIDENCE LEVEL The probability we say there is no relationship when there is not J (e.g., 20) Type II Error The probability we say there is no relationship when there is one We reject H0 We accept H1 I (e.g.,.05) Type I Error The probability we say there is a relationship when there is not 1- J (e.g., 80) THE POWER The probability we say there is a relationship when there is one

Construct Validity

Threats to the Validity of Research Threats to internal validity Although it is claimed that the independent variable caused change in the dependent variable, the changes in the dependent variable may have actually been caused by a confounding variable. Threats to external validity Although claimed that the results are more general, the observed effects may actually only be found under limited conditions or for specific groups of people. Threats to construct validity Although it is claimed that the measured variables and the experimental manipulations relate to the conceptual variables of interests, they actually may not. Threats to statistical validity Conclusions regarding the research may be incorrect because a Type 1 or Type 2 error was made.

Challenges to Internal Validity a. History b. Maturation c. Experimental mortality d. Instrumentation e. Testing f. Interactions with selection

History Any events that occur during the course of the experiment which might effect outcome. Example: An important event not related to the experiment affects the measurement of pre and post-test.

Maturation Changes in the subjects over the course of the experiment. Example: Age, experience, physical development of participants that leads to increase in knowledge and understanding of the world or behavior which can affect program results.

Experimental Mortality Dropouts from the experiment; especially when the dropouts systematically bias the comparisons Example: If your include pretest subsequent dropouts in the pretest and not in the posttest you will bias the test based on the characteristics of dropouts. And, you won't necessarily solve this problem by comparing pre-post averages for only those who stayed in the study. This subsample would certainly not be representative even of the original entire sample

Instrumentation Any way in which the instrument used for observation or collecting data changes from the pre-test to the post-test. Example: Test, Interview, Measurement Technique or Instrument.

Regression Threat A regression threat, also known as a "regression artifact" or "regression to the mean" is a statistical phenomenon that occurs whenever you have a nonrandom sample from a population and two measures that are imperfectly correlated.

Regression Threat The highest and lowest scorers will regress toward the mean at a higher rate than those who scored close to the mean. There will be a higher degree of regression for unreliable measures than for more reliable ones.

The absolute amount of regression to the mean depends on two factors: The degree of asymmetry (i.e., how far from the overall mean of the first measure the selected group's mean is) The correlation between the two measures

Testing The observations gathered may influence the way subjects behave, thus effecting the outcome. Example: having had the experience of taking the GRE once, without any additional preparation, you are more likely to improve your score on a re-take.

Interactions with Selection When there is a relationship between the treatment and the selection of subjects, this causes a systematic bias which affects causal inference Example: A selection threat is any factor other than the program that leads to posttest differences between groups. These include: history, maturation, test, instrumentation, mortality, and regression.

SIMPSON S PARADOX Any statistical relationship between two variables may be reversed by including additional factors in the analysis.

SIMPSON S PARADOX Example by Judea Pearl The classical case demonstrating Simpson's paradox took place in 1975, when UC Berkeley was investigated for sex bias in graduate admission. In this study, overall data showed a higher rate of admission among male applicants, but, broken down by departments, data showed a slight bias in favor of admitting female applicants. The explanation is simple: female applicants tended to apply to more competitive departments than males, and in these departments, the rate of admission was low for both males and females..

FISHNET (JUDEA PEARL)

Which factors SHOULD be included in the analysis? All conclusions are extremely sensitive to which variables we choose to hold constant when we are comparing, and that is why the adjustment problem is so critical in the analysis of observational studies. According to Judea Pearl, such factors can now be identified by simple graphical means.

The basic foundation of probability theory follows from the following intuitive definition of conditional probability. P(A,B) = P(A B)P(B) In this definition events A and B are simultaneous an have no (explicit) temporal order we can write P(A,B) = P(B,A) = P(B A)P(A) Bayes Theorem This leads us to a common form of Bayes Theory, the equation: P(A) = P(B A)P(B)/P(A B) (marginalization) which allows us to compute the probability of one event in terms of observations of another and knowledge of joint distributions.

Use of Bayes Rule Often one is interested in particular conditional probability and discovers that The reverse conditional probabilities are more easily obtained. Example: One is interested in the P(disease symptom), but typically P (symptom disease) is better known. Causality Example: One is interested in the P(cause effect), but typically P (effect cause) is better known.

The heart of Bayesian inference lies in the inversion formula which States that the belief we accord to a hypothesis H upon obtaining Evidence e can be comnputed by multiplying our previous belief P(H) by the likelhood P(e H) that e will materialize if H is true. P(H e) = P(e H) P(H) / P(e) P(H e) = posterior probability P(H) = prior probability From the definition of condition probability P(A B) = P(A,B) / P(B) P(B A) = P(A,B) / P(A) Bayes Theorem

Bayesian networks Graphs in probabilistic form To provide convenient means of espressing substantive assumptions To facilitate economical representation of joint probability functions, and To facilitate efficient inferences from observation Bayesian networks provide a language for qualitatively representing the conditional independence properties of a distribution. This allows a natural and compact representation of the distribution, eases knowledge acquisition, and supports effective inference algorithms.