Introduction to Econometrics

Size: px
Start display at page:

Download "Introduction to Econometrics"

Transcription

1 Introduction to Econometrics James H. Stock HARVARD UNIVERSITY Mark W. Watson PRINCETON UNIVERSITY... ~ Boston San Francisco New York London Toronco Sydney ToJ..:yo Singapore Madrid Mexico City Munich Paris Cape Town Hong Kong Montreal

2 Brief Contents PART ONE CHAPTE R I CHAPTER 2 CHAPTER 3 Introduction and Review Economic Questions and Data 3 ReviewofProbability 17 Review ofstatistics 65 PART TWO Fundamentals of Regression Analysis 109 CHAPTER 4 Linear Regression with One Regressor 1II CHAPTER 5 Regression with a Single Regressor: Hypothesis Tests and Confidence In tervals 148 CHAPTER 6 Linear Regression with Multiple Regressors 186 CHAPTER 7 CHAPTER 8 CHAPTER 9 Hypothesis Tests and Confidence Intervals in Mult ipl e Regression 220 Nonlinear Regression Functions 254 Assessing Studies Based on Multiple Regression 312 PART THREE Further Topics in Regression Analysis 347 CHAPTER 10 Regression with Panel Data 349 CHAPTER II Regression with a Binary Dependent Variable 383 C HAPTER 12 Instrumental Variables Regression 421 CHAPTER 13 Experiments and Quasi-Experiments 468 PART FOUR CHAPTER 14 Regression Analysis of Economic Time Series Data 523 Introduction to T ime Series Regression and Forecasting 525 CHAPTER 15 Estimation of Dynamic Causal Effects 591 C HAPTER 16 Additional Topics in Time Series Regression 637 PART FIVE The EconometricTheory ofregression Analysis 675 CHAPTER 17 The Theory of Linear Regression wit h One Regressor 677 CHAPTER 18 The T heory of Multiple Regression 704 v

3 Contents Preface XXVIl PART ONE Introduction and Review CHAPTER I Economic Questions and Data Economic Questions We Examine 4 Question # 1: Does Reducing Class Size Improve Elementary School Education? 4 Question # 2: Is There Racial Discrimination in the Market for Home Loans? Question # 3: How Much Do Cigarette Taxes Reduce Smoking? 5 Question #4: W hat Will the Rate of Inflation Be Next Year? 6 Quantitative Questions, Quantitative Answers Causal Effects and Idealized Experime nts 8 Estimation ofcausal Effects 8 Forecasting and Causality Data: Sources and Types 10 Experimental versus Observational Data Cross-Sectional Data [I Time Series Data II Panel Data CHAPTER 2 Review of Probability Random Variables and Probability Distributions 18 Probabilities, the Sample Space, and Random Variables 18 Probability Distribution ofa Discrete Random Variable 19 Probability Distribution ofa Continuous Random Variable Expected Values, Mean, and Variance 23 The Expected Value ofa Random Variable 23 The Standard Deviation and Variance 24 Mean and Variance ofa Linear Function ofa Random Variable Other Measures ofthe Shape ofa Distribution Two Random Variables 29 Joint and Marginal Distributions Conditional Distributions vii

4 viii CO NTENTS 2.4 Independence 34 Covariance and Correlation 34 The Mean and Variance ofsums of Random Variables 35 The Normal, Chi-Squared,Student t, and F Distributions The Normal Distribution 39 The Chi-Squared Distribution 43 The Student t Distribution 44 The F Distribution Random Sampling and the Distribution ofthe Sample Average Random Sampling 45 The Sampling Distribution of the Sample Average Large-Sample Approximations to Sampling Distributions The Law of Large Numbers and Consistency 49 The Central Li mit Theorem 52 APPENDIX 2.1 Derivation of Results in Key Concept CHAPTER Review ofstatistics 65 Estimation ofthe Population Mean 66 Estimators and Their Properties 67 Properties of Y 68 The Importance of Random Sampling 70 Hypothesis Tests Concerning the Population Mean 71 Null and Alternative Hypotheses 72 The p-value 72 Calculating thep-value When O' y Is Known 74 The Sample Variance, Sample Standard Deviation, and Standard Error Calculating thep-value When O'y Is Unknown 76 The t-statistic 77 Hypothesis Testing with a Prespecifled Significance Level 78 One-Sided Alternatives 80 Confidence Interva ls for the Population Mean 81 Comparing Means from Different Populations 83 Hypothesis Tests for the Difference Between Two Means 83 Confidence Intervals for the Difference Between Two Population Means Differences-of-Means Estimat ion ofcausal Effects Using Experimental Data 85 The Causal Effect as a Difference of Conditional Expectations 85 Estimation of the Causal Effec t Using Differences of Means

5 CONTENTS ix 3.6 Using the t-statistic W hen the Sample Size Is Small 88 T he t-statistic and the Student t Distribution 88 Use ofthe Student t Distribution in Practice Scatterplot, the Sample Covariance, and the Sample Correlation 92 Scatterp[ots 93 Sample Covariance and Correlation 94 APPENDIX 3. 1 The U.S. Current Population Survey [05 APPENDIX 3.2 Two Proofs That Y [s the Least Squares Estimator ofj.ly [06 APPENDIX 3.3 A ProofThat the Sample Variance Is Consistent [07 PART TWO Fundamentals of Regression Analysis 109 CHAPTER 4 linear Regression w ith One Regressor III 4. 1 The Linear Regression Model Estimating the Coefficients of the Linear Regression Model 116 T he Ordinary Least Squares Estimator [18 OLS Estimates ofthe Re [ationship Between Test Scores and the Student-Teacher Ratio 120 Why Use the OLS Estimator? Measures of Fit 123 The R2 [23 The Standard Error of the Regression 124 Application to the Test Score Data T he Least Squares Assumptions 126 Assumption #1: The Conditional Distribution ofu, Given X, Has a Mean ofzero [26 Assumption #2: (X,. Y,). i = [..... n Are Independent[y and Ide ntically Distributed [28 Assumption #3: Large Outliers Are Unlike[y [29 Use of the Least Squares Assumptions The Sampling Distribution of the OLS Estimators 131 T he Sampling Distribution ofthe OLS Estimators Conclusion 135 APP ENDIX 4. 1 The California Test Score Data Set [43 A PPEND IX 4. 2 Derivation of the OLS Estimators 143 A PPEN DIX 4. 3 Sampling Di stribution of the O LS Estimator 144

6 x CO NTENTS II CHAPTER 5 Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals Testing Hypotheses About One ofthe Regression Coefficients 149 Two-Sided Hypotheses Concerning f O ne-sided Hypotheses Concerning f Testing Hypotheses About the Intercept f Confidence Intervals for a Regression Coefficient ISS 5.3 Regression W hen X Is a BinaryVariable 158 I. Interpretation ofthe Regression Coefficients 158 I 5.4 Heteroskedasticity and Homoskedasticity 160 W hat Are Heteroskedasticity and Homoskedasticity? 160 Mathematical Implications of Homoskedasticity 163 W hat Does This Mean in Practice? The Theoretical Foundations ofordinary Least Squares 166 Linear Conditionally Unbiased Estimators and the Gauss-Markov -r heorem Regression Estimators Other Than OLS Using the t-statistic in Regression When the Sample Si z~ Is Small 169 The t-statistic and the Student t Distribution 170 Use ofthe Student t Distribution in Practice Conclusion 171 APPE NDI X 5.1 Formulas for OLS Standard Errors 180 APPEND IX 5.2 The Gauss-Markov Conditions and a Proof of the Gauss-Markov Theorem 182 CHAPTER 6 Linear Regression with Multiple Regressors Omitted Variable Bias 186 Definition ofomitted Variable Bias A Formula for Omitted Va riable Bias Addressing Omitted Variable Bias by Dividing the Data into Groub s 6.2 The Multiple Regression Model 193 The Population Regression Line 193 The Population Multiple Regre ssion Model The OLS Estimator in Multiple Regression 196 The OLS Esti mator 197 Appl ication to Test Scores and the Student-Teacher Ratio

7 CONT ENTS xi 6.4 Measures of Fit in Multiple Regression 200 T he Standard Error ofthe Regression (SER) 200 The R2 200 The 'I\djusred R2" 20 I Application to Test Scores The Least Squares Assumptions in Multiple Regression 202 Assumption # 1: The Conditional Distribution ofu,given X I, ' X 2i,...,X k, Has a Mean ofzero 203 Assumption # 2: (X I,' X 2 "...,X iu ' Y,) i =I,...,n Are i.i.d. 203 Assum ption # 3: Large Outliers Are Unlikely 203 Assumption #4: No Perfect Mul ticollinearity The Distribution ofthe OLS Estimators in Multiple Regression Multicollinearity 206 Examples of Perfect Multicollinearity Imperfect Multicollinearity Conclusion APPENDIX 6.1 Derivation of Eq uation (6.1) 218 APPEN DIX 6.2 Distribution of the OLS Estimators W hen There Are Two Regressors and Homoskedastic Errors 218 CHAPTER 7 Hypothesis Tests and Confidence Intervals in Multiple Regression Hypothesis Tests and Confidence Intervals fo r a Single Coefficient 221 Standard Errors for the OLS Estimators 22 1 Hypothesis Tests for a Si ngle Coeffi cient 22 1 Confidence Intervals for a Single Coefficient 223 Application to Test Scores and the Student- Teacher Ratio 7.2 Tests ofjoint Hypotheses 225 Testing Hypotheses on Two or More Coefficients 225 T he F-Statistic 227 Application to Test Scores and the Student- Teacher Ratio T he Homoskedastici ry-only F-Statistic Testing Si ngle Restrictions Involving Multiple Coefficients Confidence Sets for Multiple Coefficients

8 xii CONTE NTS 7.S Model Specification for Multiple Regression 235 Omitted Variable Bias in Multiple Regression 236 Model Specification in Theory and in Practice 236 Interpreting the R2 and the Adjusted R2 in Practice 237 Analysis of the Test Score Data Set 239 Conclusion 244 APPE ND IX 7. 1 The Bonferroni Test ofa Joint Hypotheses 251 CHAPTER CHAPTER Nonlinear Regression Functions 254 A General Strategy for Modeling Nonlinear Regression Functions 256 Test Scores and District Income 256 The Effect on Yofa Change in X in Nonlinear Specifications 260 A General Approach to Modeling Nonlinearities Using Multiple Regression Nonlinear Functions ofa Single Independent Variable 264 Polynomials 265 Logarithms 267 Polynomial and Logarithmic Models oftest Scores and District Income Interactions Between Independent Variables 277 Interactions Between Two Binary Variable s 277 Interactions Between a Continuous and a Binary Variable 280 Interactions Between Two Continuous Variables 286 Nonlinear Effects on Test Scores ofthe Student-Teacher Ratio Discussion of Regression Results 291 Summary of Findings 295 Conclusion 296 APPENDI X 8.1 Regression Functions That Are Nonlinear in the Parameters 307 Assessing Studies Based on Multiple Regression Internal and External Validity Threats to Internal Validity 313 Threats to External Validity Threats to Internal Validity ofmultiple Regression Analysis 316 Omitted Variable Bias 316 Misspecification of the Functional Form ofthe Regression Function 319 Errors-in-Variables 319 Sample Selection

9 CONTE NTS xiii Simultaneous Causa li ty 324 Sources of Inconsistency of OLS Standard Errors Internal and External Validity When the Regression Is Used for Forecasting 327 Using Regression Models for Forecasting 327 Assessing the Va lidity of Regress ion Models for Forecasting Example: Test Scores and Class Size 329 External Validity 329 Internal Validity 336 Discuss ion and Implications S Conclusion 338 APPEND IX 9.1 The Massachusetts Elementary School Testing Data 344 PART THREE Further Topics in Regression Analysis 347 CHAPTER 10 Regression with Panel Data Panel Data 350 Example: Traffic Deaths and Alcohol Taxes Panel Data with Two Time Periods: "Before and After" Comparisons Fixed Effects Regression 356 The Fixed Effects Regression Model 356 Estimation and Inference 359 Application to Traffic Deaths Regression with Time Fixed Effects 361 Time Effects Only 36 1 Both En tity and Time Fixed Effects S The Fixed Effects Regression Assumptions and Standard Errors for Fixed Effects Regression 364 The Fixed Effects Regression Assumptions 364 Standard Errors for Fi xed Effects Regression Drunk Driving Laws and Traffic Deaths Conclusion 371 APPENDIX The State Traffic Fatality Data Set 378 APPENDIX 10.2 Standard Errors for Fixed Effects Regression with Serially Correlated Errors 379

10 xiv CONTENTS Regression with a Binary DependentVariable CHAPTER II 11.1 Binary Dependent Variables 385 The Linear Probability Model Probit and Logit Regression 389 Probit Regression 389 Logit Regression 394 Comparing the Linear Probability, Probit, and Logit Models 11.3 Nonlinear Least Squares Estimation 397 Maximum Likelihood Estimation 398 Measures of Fit Application to the Boston HMDA Data Binary Dependent Variables and the Linear Probability Model 396 Estimation and Inference in the Logit and Probit Models Summary 407 APPENDIX The Boston HMDA Data Set 415 APPENDIX Maximum Likelihood Estimati on 415 APPENDIX Other Limited Dependent Variable Models CHAPTER Instrumental Variables Regression 421 The IV Estimator w ith a Single Regressor and a Single Instrument 422 The IV Model and Assumptions 422 The Two Stage Least Squares Estimator 423 Why Does IV Regression Work? 424 The Sampling Distribution of the TSLS Estimator Application to the Demand for Cigarettes 430 The General IV Regression Model TSLS in the General IV Model Instrument Relevance and Exogeneity in the General IV Model The IV Regression Assumptions and Sampling Distribution ofthe TSLS Estimator 434 Inference Using the TSLS Estimator 437 Application to the Demand for Cigarettes 437 Checking Instrument Validity 439 Assumption # 1: Instrument Relevance 439 Assumption # 2: Ins trument Exogeneity 443 Application to the Demand for Cigarettes

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover). STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical methods 2 Course code: EC2402 Examiner: Per Pettersson-Lidbom Number of credits: 7,5 credits Date of exam: Sunday 21 February 2010 Examination

More information

Data Analysis with SPSS

Data Analysis with SPSS Data Analysis with SPSS A First Course in Applied Statistics Fourth Edition Stephen Sweet Ithaca College Karen Grace-Martin The Analysis Factor Allyn & Bacon Boston Columbus Indianapolis New York San Francisco

More information

Assessing Studies Based on Multiple Regression. Chapter 7. Michael Ash CPPA

Assessing Studies Based on Multiple Regression. Chapter 7. Michael Ash CPPA Assessing Studies Based on Multiple Regression Chapter 7 Michael Ash CPPA Assessing Regression Studies p.1/20 Course notes Last time External Validity Internal Validity Omitted Variable Bias Misspecified

More information

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS - CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS SECOND EDITION Raymond H. Myers Virginia Polytechnic Institute and State university 1 ~l~~l~l~~~~~~~l!~ ~~~~~l~/ll~~ Donated by Duxbury o Thomson Learning,,

More information

An Introduction to Modern Econometrics Using Stata

An Introduction to Modern Econometrics Using Stata An Introduction to Modern Econometrics Using Stata CHRISTOPHER F. BAUM Department of Economics Boston College A Stata Press Publication StataCorp LP College Station, Texas Contents Illustrations Preface

More information

Final Exam - section 2. Thursday, December hours, 30 minutes

Final Exam - section 2. Thursday, December hours, 30 minutes Econometrics, ECON312 San Francisco State University Michael Bar Fall 2011 Final Exam - section 2 Thursday, December 15 2 hours, 30 minutes Name: Instructions 1. This is closed book, closed notes exam.

More information

INTRODUCTION TO ECONOMETRICS (EC212)

INTRODUCTION TO ECONOMETRICS (EC212) INTRODUCTION TO ECONOMETRICS (EC212) Course duration: 54 hours lecture and class time (Over three weeks) LSE Teaching Department: Department of Economics Lead Faculty (session two): Dr Taisuke Otsu and

More information

Introduction to Econometrics

Introduction to Econometrics Global edition Introduction to Econometrics Updated Third edition James H. Stock Mark W. Watson MyEconLab of Practice Provides the Power Optimize your study time with MyEconLab, the online assessment and

More information

Ecological Statistics

Ecological Statistics A Primer of Ecological Statistics Second Edition Nicholas J. Gotelli University of Vermont Aaron M. Ellison Harvard Forest Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Brief Contents

More information

The preceding five chapters explain how to use multiple regression to analyze the

The preceding five chapters explain how to use multiple regression to analyze the CHAPTER 9 Assessing Studies Based on Multiple Regression The preceding five chapters explain how to use multiple regression to analyze the relationship among variables in a data set. In this chapter, we

More information

MTH 225: Introductory Statistics

MTH 225: Introductory Statistics Marshall University College of Science Mathematics Department MTH 225: Introductory Statistics Course catalog description Basic probability, descriptive statistics, fundamental statistical inference procedures

More information

Marno Verbeek Erasmus University, the Netherlands. Cons. Pros

Marno Verbeek Erasmus University, the Netherlands. Cons. Pros Marno Verbeek Erasmus University, the Netherlands Using linear regression to establish empirical relationships Linear regression is a powerful tool for estimating the relationship between one variable

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

EC352 Econometric Methods: Week 07

EC352 Econometric Methods: Week 07 EC352 Econometric Methods: Week 07 Gordon Kemp Department of Economics, University of Essex 1 / 25 Outline Panel Data (continued) Random Eects Estimation and Clustering Dynamic Models Validity & Threats

More information

Econometric Game 2012: infants birthweight?

Econometric Game 2012: infants birthweight? Econometric Game 2012: How does maternal smoking during pregnancy affect infants birthweight? Case A April 18, 2012 1 Introduction Low birthweight is associated with adverse health related and economic

More information

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Journal of Social and Development Sciences Vol. 4, No. 4, pp. 93-97, Apr 203 (ISSN 222-52) Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Henry De-Graft Acquah University

More information

Measurement Error in Nonlinear Models

Measurement Error in Nonlinear Models Measurement Error in Nonlinear Models R.J. CARROLL Professor of Statistics Texas A&M University, USA D. RUPPERT Professor of Operations Research and Industrial Engineering Cornell University, USA and L.A.

More information

Multiple Linear Regression (Dummy Variable Treatment) CIVL 7012/8012

Multiple Linear Regression (Dummy Variable Treatment) CIVL 7012/8012 Multiple Linear Regression (Dummy Variable Treatment) CIVL 7012/8012 2 In Today s Class Recap Single dummy variable Multiple dummy variables Ordinal dummy variables Dummy-dummy interaction Dummy-continuous/discrete

More information

Carrying out an Empirical Project

Carrying out an Empirical Project Carrying out an Empirical Project Empirical Analysis & Style Hint Special program: Pre-training 1 Carrying out an Empirical Project 1. Posing a Question 2. Literature Review 3. Data Collection 4. Econometric

More information

Limited dependent variable regression models

Limited dependent variable regression models 181 11 Limited dependent variable regression models In the logit and probit models we discussed previously the dependent variable assumed values of 0 and 1, 0 representing the absence of an attribute and

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

Introduction to Applied Research in Economics Kamiljon T. Akramov, Ph.D. IFPRI, Washington, DC, USA

Introduction to Applied Research in Economics Kamiljon T. Akramov, Ph.D. IFPRI, Washington, DC, USA Introduction to Applied Research in Economics Kamiljon T. Akramov, Ph.D. IFPRI, Washington, DC, USA Training Course on Applied Econometric Analysis June 1, 2015, WIUT, Tashkent, Uzbekistan Why do we need

More information

Linear Regression Analysis

Linear Regression Analysis Linear Regression Analysis WILEY SERIES IN PROBABILITY AND STATISTICS Established by WALTER A. SHEWHART and SAMUEL S. WILKS Editors: David J. Balding, Peter Bloomfield, Noel A. C. Cressie, Nicholas I.

More information

NPTEL Project. Econometric Modelling. Module 14: Heteroscedasticity Problem. Module 16: Heteroscedasticity Problem. Vinod Gupta School of Management

NPTEL Project. Econometric Modelling. Module 14: Heteroscedasticity Problem. Module 16: Heteroscedasticity Problem. Vinod Gupta School of Management 1 P age NPTEL Project Econometric Modelling Vinod Gupta School of Management Module 14: Heteroscedasticity Problem Module 16: Heteroscedasticity Problem Rudra P. Pradhan Vinod Gupta School of Management

More information

Modern Regression Methods

Modern Regression Methods Modern Regression Methods Second Edition THOMAS P. RYAN Acworth, Georgia WILEY A JOHN WILEY & SONS, INC. PUBLICATION Contents Preface 1. Introduction 1.1 Simple Linear Regression Model, 3 1.2 Uses of Regression

More information

Problem Set 3 ECN Econometrics Professor Oscar Jorda. Name. ESSAY. Write your answer in the space provided.

Problem Set 3 ECN Econometrics Professor Oscar Jorda. Name. ESSAY. Write your answer in the space provided. Problem Set 3 ECN 140 - Econometrics Professor Oscar Jorda Name ESSAY. Write your answer in the space provided. 1) Sir Francis Galton, a cousin of James Darwin, examined the relationship between the height

More information

Chapter 11 Regression with a Binary Dependent Variable

Chapter 11 Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable Solutions to Empirical Exercises 1. Smkban (1) (2) (3) Linear Probability 0.078** Linear Probability 0.047** Age 0.0097** (0.0018) Age 2 0.00013**

More information

isc ove ring i Statistics sing SPSS

isc ove ring i Statistics sing SPSS isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements

More information

Ec331: Research in Applied Economics Spring term, Panel Data: brief outlines

Ec331: Research in Applied Economics Spring term, Panel Data: brief outlines Ec331: Research in Applied Economics Spring term, 2014 Panel Data: brief outlines Remaining structure Final Presentations (5%) Fridays, 9-10 in H3.45. 15 mins, 8 slides maximum Wk.6 Labour Supply - Wilfred

More information

The Statistical Analysis of Failure Time Data

The Statistical Analysis of Failure Time Data The Statistical Analysis of Failure Time Data Second Edition JOHN D. KALBFLEISCH ROSS L. PRENTICE iwiley- 'INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Contents Preface xi 1. Introduction 1 1.1

More information

Part 1. Online Session: Math Review and Math Preparation for Course 5 minutes Introduction 45 minutes Reading and Practice Problem Assignment

Part 1. Online Session: Math Review and Math Preparation for Course 5 minutes Introduction 45 minutes Reading and Practice Problem Assignment Course Schedule PREREQUISITE (Pre-Class) Advanced Education Diagnostic Test 10 minutes Excel 2007 Exercise SECTION 1. (Completed before face-to-face sections begin) (2 hours) Part 1. Online Session: Math

More information

Methods for Addressing Selection Bias in Observational Studies

Methods for Addressing Selection Bias in Observational Studies Methods for Addressing Selection Bias in Observational Studies Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA What is Selection Bias? In the regression

More information

Ordinal Data Modeling

Ordinal Data Modeling Valen E. Johnson James H. Albert Ordinal Data Modeling With 73 illustrations I ". Springer Contents Preface v 1 Review of Classical and Bayesian Inference 1 1.1 Learning about a binomial proportion 1 1.1.1

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Understandable Statistics

Understandable Statistics Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement

More information

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0% Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of

More information

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing

More information

EMPIRICAL STRATEGIES IN LABOUR ECONOMICS

EMPIRICAL STRATEGIES IN LABOUR ECONOMICS EMPIRICAL STRATEGIES IN LABOUR ECONOMICS University of Minho J. Angrist NIPE Summer School June 2009 This course covers core econometric ideas and widely used empirical modeling strategies. The main theoretical

More information

IS BEER CONSUMPTION IN IRELAND ACYCLICAL?

IS BEER CONSUMPTION IN IRELAND ACYCLICAL? IS BEER CONSUMPTION IN IRELAND ACYCLICAL? GEARÓID GIBBS Senior Sophister In this econometric investigation, Gearóid Gibbs examines beer consumption in Ireland and its relation to the business cycle. Citing

More information

Instrumental Variables Estimation: An Introduction

Instrumental Variables Estimation: An Introduction Instrumental Variables Estimation: An Introduction Susan L. Ettner, Ph.D. Professor Division of General Internal Medicine and Health Services Research, UCLA The Problem The Problem Suppose you wish to

More information

Data Analysis Using Regression and Multilevel/Hierarchical Models

Data Analysis Using Regression and Multilevel/Hierarchical Models Data Analysis Using Regression and Multilevel/Hierarchical Models ANDREW GELMAN Columbia University JENNIFER HILL Columbia University CAMBRIDGE UNIVERSITY PRESS Contents List of examples V a 9 e xv " Preface

More information

Lecture 14: Adjusting for between- and within-cluster covariates in the analysis of clustered data May 14, 2009

Lecture 14: Adjusting for between- and within-cluster covariates in the analysis of clustered data May 14, 2009 Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences p. 1/3 Measurement, Design, and Analytic Techniques in Mental Health and Behavioral Sciences Lecture 14: Adjusting

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement

More information

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS) Chapter : Advanced Remedial Measures Weighted Least Squares (WLS) When the error variance appears nonconstant, a transformation (of Y and/or X) is a quick remedy. But it may not solve the problem, or it

More information

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A

More information

The University of North Carolina at Chapel Hill School of Social Work

The University of North Carolina at Chapel Hill School of Social Work The University of North Carolina at Chapel Hill School of Social Work SOWO 918: Applied Regression Analysis and Generalized Linear Models Spring Semester, 2014 Instructor Shenyang Guo, Ph.D., Room 524j,

More information

Regression Analysis II

Regression Analysis II Regression Analysis II Lee D. Walker University of South Carolina e-mail: walker23@gwm.sc.edu COURSE OVERVIEW This course focuses on the theory, practice, and application of linear regression. As Agresti

More information

DECISION ANALYSIS WITH BAYESIAN NETWORKS

DECISION ANALYSIS WITH BAYESIAN NETWORKS RISK ASSESSMENT AND DECISION ANALYSIS WITH BAYESIAN NETWORKS NORMAN FENTON MARTIN NEIL CRC Press Taylor & Francis Croup Boca Raton London NewYork CRC Press is an imprint of the Taylor Si Francis an Croup,

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Practical Multivariate Analysis

Practical Multivariate Analysis Texts in Statistical Science Practical Multivariate Analysis Fifth Edition Abdelmonem Afifi Susanne May Virginia A. Clark CRC Press Taylor & Francis Group Boca Raton London New York CRC Press is an imprint

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 5, 6, 7, 8, 9 10 & 11)

More information

STATISTICS & PROBABILITY

STATISTICS & PROBABILITY STATISTICS & PROBABILITY LAWRENCE HIGH SCHOOL STATISTICS & PROBABILITY CURRICULUM MAP 2015-2016 Quarter 1 Unit 1 Collecting Data and Drawing Conclusions Unit 2 Summarizing Data Quarter 2 Unit 3 Randomness

More information

PRACTICAL STATISTICS FOR MEDICAL RESEARCH

PRACTICAL STATISTICS FOR MEDICAL RESEARCH PRACTICAL STATISTICS FOR MEDICAL RESEARCH Douglas G. Altman Head of Medical Statistical Laboratory Imperial Cancer Research Fund London CHAPMAN & HALL/CRC Boca Raton London New York Washington, D.C. Contents

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Introductory Statistical Inference with the Likelihood Function

Introductory Statistical Inference with the Likelihood Function Introductory Statistical Inference with the Likelihood Function Charles A. Rohde Introductory Statistical Inference with the Likelihood Function 123 Charles A. Rohde Bloomberg School of Health Johns Hopkins

More information

Score Tests of Normality in Bivariate Probit Models

Score Tests of Normality in Bivariate Probit Models Score Tests of Normality in Bivariate Probit Models Anthony Murphy Nuffield College, Oxford OX1 1NF, UK Abstract: A relatively simple and convenient score test of normality in the bivariate probit model

More information

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions

More information

Session 1: Dealing with Endogeneity

Session 1: Dealing with Endogeneity Niehaus Center, Princeton University GEM, Sciences Po ARTNeT Capacity Building Workshop for Trade Research: Behind the Border Gravity Modeling Thursday, December 18, 2008 Outline Introduction 1 Introduction

More information

Understanding. Regression Analysis

Understanding. Regression Analysis Understanding Regression Analysis Understanding Regression Analysis Michael Patrick Allen Washington State University Pullman, Washington Plenum Press New York and London Llbrary of Congress Cataloging-in-Publication

More information

11/24/2017. Do not imply a cause-and-effect relationship

11/24/2017. Do not imply a cause-and-effect relationship Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

More information

Statistical Methods in Food and Consumer Research. Second Edition

Statistical Methods in Food and Consumer Research. Second Edition Statistical Methods in Food and Consumer Research Second Edition Food Science and Technology International Series Series Editor Steve L. Taylor University of Nebraska Lincoln, USA Advisory Board Ken Buckle

More information

Empirical Strategies

Empirical Strategies Empirical Strategies Joshua Angrist BGPE March 2012 These lectures cover many of the empirical modeling strategies discussed in Mostly Harmless Econometrics (MHE). The main theoretical ideas are illustrated

More information

Convolutional Coding: Fundamentals and Applications. L. H. Charles Lee. Artech House Boston London

Convolutional Coding: Fundamentals and Applications. L. H. Charles Lee. Artech House Boston London Convolutional Coding: Fundamentals and Applications L. H. Charles Lee Artech House Boston London Contents Preface xi Chapter 1 Introduction of Coded Digital Communication Systems 1 1.1 Introduction 1 1.2

More information

6. Unusual and Influential Data

6. Unusual and Influential Data Sociology 740 John ox Lecture Notes 6. Unusual and Influential Data Copyright 2014 by John ox Unusual and Influential Data 1 1. Introduction I Linear statistical models make strong assumptions about the

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

The Linear Regression Model Under Test

The Linear Regression Model Under Test Walter Kramer Harald Sonnberger The Linear Regression Model Under Test Physica-Verlag Heidelberg Wien Professor Dr. WALTER KRAMER, Fachbereich Wirtschaftswissenschaften, UniversWit Hannover, Wunstorfer

More information

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography

More information

Readings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14

Readings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14 Readings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14 Still important ideas Contrast the measurement of observable actions (and/or characteristics)

More information

Political Science 15, Winter 2014 Final Review

Political Science 15, Winter 2014 Final Review Political Science 15, Winter 2014 Final Review The major topics covered in class are listed below. You should also take a look at the readings listed on the class website. Studying Politics Scientifically

More information

STATISTICS APPLIED TO CLINICAL TRIALS SECOND EDITION

STATISTICS APPLIED TO CLINICAL TRIALS SECOND EDITION STATISTICS APPLIED TO CLINICAL TRIALS SECOND EDITION Statistics Applied to Clinical Trials, Second Edition by TON J. CLEOPHAS, MD, PhD, Associate-Professor, President American College of Angiology, Co-Chair

More information

f WILEY ANOVA and ANCOVA A GLM Approach Second Edition ANDREW RUTHERFORD Staffordshire, United Kingdom Keele University School of Psychology

f WILEY ANOVA and ANCOVA A GLM Approach Second Edition ANDREW RUTHERFORD Staffordshire, United Kingdom Keele University School of Psychology ANOVA and ANCOVA A GLM Approach Second Edition ANDREW RUTHERFORD Keele University School of Psychology Staffordshire, United Kingdom f WILEY A JOHN WILEY & SONS, INC., PUBLICATION Contents Acknowledgments

More information

Identification of population average treatment effects using nonlinear instrumental variables estimators : another cautionary note

Identification of population average treatment effects using nonlinear instrumental variables estimators : another cautionary note University of Iowa Iowa Research Online Theses and Dissertations Fall 2014 Identification of population average treatment effects using nonlinear instrumental variables estimators : another cautionary

More information

Lecture II: Difference in Difference. Causality is difficult to Show from cross

Lecture II: Difference in Difference. Causality is difficult to Show from cross Review Lecture II: Regression Discontinuity and Difference in Difference From Lecture I Causality is difficult to Show from cross sectional observational studies What caused what? X caused Y, Y caused

More information

Empirical Tools of Public Finance. 131 Undergraduate Public Economics Emmanuel Saez UC Berkeley

Empirical Tools of Public Finance. 131 Undergraduate Public Economics Emmanuel Saez UC Berkeley Empirical Tools of Public Finance 131 Undergraduate Public Economics Emmanuel Saez UC Berkeley 1 DEFINITIONS Empirical public finance: The use of data and statistical methods to measure the impact of government

More information

Applied Linear Regression

Applied Linear Regression Applied Linear Regression Applied Linear Regression Third Edition SANFORD WEISBERG University of Minnesota School of Statistics Minneapolis, Minnesota A JOHN WILEY & SONS, INC., PUBLICATION Copyright

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 11 + 13 & Appendix D & E (online) Plous - Chapters 2, 3, and 4 Chapter 2: Cognitive Dissonance, Chapter 3: Memory and Hindsight Bias, Chapter 4: Context Dependence Still

More information

Ordinary Least Squares Regression

Ordinary Least Squares Regression Ordinary Least Squares Regression March 2013 Nancy Burns (nburns@isr.umich.edu) - University of Michigan From description to cause Group Sample Size Mean Health Status Standard Error Hospital 7,774 3.21.014

More information

Multiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Multiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University Multiple Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Multiple Regression 1 / 19 Multiple Regression 1 The Multiple

More information

Problem Set 5 ECN 140 Econometrics Professor Oscar Jorda. DUE: June 6, Name

Problem Set 5 ECN 140 Econometrics Professor Oscar Jorda. DUE: June 6, Name Problem Set 5 ECN 140 Econometrics Professor Oscar Jorda DUE: June 6, 2006 Name 1) Earnings functions, whereby the log of earnings is regressed on years of education, years of on-the-job training, and

More information

Validity, Reliability and Classical Assumptions

Validity, Reliability and Classical Assumptions , Reliability and Classical Assumptions Presented by Mahendra AN Sources: www-psych.stanford.edu/~bigopp/.ppt http://ets.mnsu.edu/darbok/ethn402-502/reliability.ppt http://5martconsultingbandung.blogspot.com/2011/01/uji-asumsi-klasik.html

More information

STAT445 Midterm Project1

STAT445 Midterm Project1 STAT445 Midterm Project1 Executive Summary This report works on the dataset of Part of This Nutritious Breakfast! In this dataset, 77 different breakfast cereals were collected. The dataset also explores

More information

Econometrics II - Time Series Analysis

Econometrics II - Time Series Analysis University of Pennsylvania Economics 706, Spring 2008 Econometrics II - Time Series Analysis Instructor: Frank Schorfheide; Room 525, McNeil Building E-mail: schorf@ssc.upenn.edu URL: http://www.econ.upenn.edu/

More information

The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth

The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth 1 The Effects of Maternal Alcohol Use and Smoking on Children s Mental Health: Evidence from the National Longitudinal Survey of Children and Youth Madeleine Benjamin, MA Policy Research, Economics and

More information

Where and How Do Kids Get Their Cigarettes? Chaloupka Ross Peck

Where and How Do Kids Get Their Cigarettes? Chaloupka Ross Peck Where and How Do Kids Get Their Cigarettes? Chaloupka Ross Peck Prepared For the Illinois Economic Association October 2001 Not to be Quoted without permission Issues: Most Adult Smokers Begin Smoking

More information

Contents. Part 1 Introduction. Part 2 Cross-Sectional Selection Bias Adjustment

Contents. Part 1 Introduction. Part 2 Cross-Sectional Selection Bias Adjustment From Analysis of Observational Health Care Data Using SAS. Full book available for purchase here. Contents Preface ix Part 1 Introduction Chapter 1 Introduction to Observational Studies... 3 1.1 Observational

More information

Bayesian and Classical Approaches to Inference and Model Averaging

Bayesian and Classical Approaches to Inference and Model Averaging Bayesian and Classical Approaches to Inference and Model Averaging Course Tutors Gernot Doppelhofer NHH Melvyn Weeks University of Cambridge Location Norges Bank Oslo Date 5-8 May 2008 The Course The course

More information

Final Research on Underage Cigarette Consumption

Final Research on Underage Cigarette Consumption Final Research on Underage Cigarette Consumption Angie Qin An Hu New York University Abstract Over decades, we witness a significant increase in amount of research on cigarette consumption. Among these

More information

HANDBOOK OF SOCIAL ECONOMICS

HANDBOOK OF SOCIAL ECONOMICS HANDBOOK OF SOCIAL ECONOMICS VOLUME Edited by JESS BENHABIB ALBERTO BISIN MATTHEW O. JACKSON j & * f» rastitsi &isis«.'fwjiwm GLSEVIER Amsterdam Boston Heidelberg London New York Oxford Paris San

More information

Analyzing binary outcomes, going beyond logistic regression

Analyzing binary outcomes, going beyond logistic regression Analyzing binary outcomes, going beyond logistic regression 2018 EHE Forum presentation James O. Uanhoro Department of Educational Studies Premise Obtaining relative risk using Poisson regression Obtaining

More information

CHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to

CHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to CHAPTER - 6 STATISTICAL ANALYSIS 6.1 Introduction This chapter discusses inferential statistics, which use sample data to make decisions or inferences about population. Populations are group of interest

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis Basic Concept: Extend the simple regression model to include additional explanatory variables: Y = β 0 + β1x1 + β2x2 +... + βp-1xp + ε p = (number of independent variables

More information

What Are We Weighting For?

What Are We Weighting For? What Are We Weighting For? Gary Solon Steven J. Haider Jeffrey M. Wooldridge Solon, Haider, and Wooldridge abstract When estimating population descriptive statistics, weighting is called for if needed

More information

Chapter 2 Organizing and Summarizing Data. Chapter 3 Numerically Summarizing Data. Chapter 4 Describing the Relation between Two Variables

Chapter 2 Organizing and Summarizing Data. Chapter 3 Numerically Summarizing Data. Chapter 4 Describing the Relation between Two Variables Tables and Formulas for Sullivan, Fundamentals of Statistics, 4e 014 Pearson Education, Inc. Chapter Organizing and Summarizing Data Relative frequency = frequency sum of all frequencies Class midpoint:

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multiple Regression (MR) Types of MR Assumptions of MR SPSS procedure of MR Example based on prison data Interpretation of

More information

Estimating average treatment effects from observational data using teffects

Estimating average treatment effects from observational data using teffects Estimating average treatment effects from observational data using teffects David M. Drukker Director of Econometrics Stata 2013 Nordic and Baltic Stata Users Group meeting Karolinska Institutet September

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 10, 11) Please note chapter

More information

APPENDIX D REFERENCE AND PREDICTIVE VALUES FOR PEAK EXPIRATORY FLOW RATE (PEFR)

APPENDIX D REFERENCE AND PREDICTIVE VALUES FOR PEAK EXPIRATORY FLOW RATE (PEFR) APPENDIX D REFERENCE AND PREDICTIVE VALUES FOR PEAK EXPIRATORY FLOW RATE (PEFR) Lung function is related to physical characteristics such as age and height. In order to assess the Peak Expiratory Flow

More information

Examining Relationships Least-squares regression. Sections 2.3

Examining Relationships Least-squares regression. Sections 2.3 Examining Relationships Least-squares regression Sections 2.3 The regression line A regression line describes a one-way linear relationship between variables. An explanatory variable, x, explains variability

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Statistical Tolerance Regions: Theory, Applications and Computation

Statistical Tolerance Regions: Theory, Applications and Computation Statistical Tolerance Regions: Theory, Applications and Computation K. KRISHNAMOORTHY University of Louisiana at Lafayette THOMAS MATHEW University of Maryland Baltimore County Contents List of Tables

More information