August 29, Introduction and Overview

Similar documents
Carrying out an Empirical Project

Substantive Significance of Multivariate Regression Coefficients

Session 1: Dealing with Endogeneity

Public Policy & Evidence:

Economics 2010a. Fall Lecture 11. Edward L. Glaeser

Session 3: Dealing with Reverse Causality

Empirical Validation in Agent-Based Models

Analysis A step in the research process that involves describing and then making inferences based on a set of data.

Political Science 15, Winter 2014 Final Review

Qualitative Data Analysis. Richard Boateng, PhD. Arguments with Qualitative Data. Office: UGBS RT18 (rooftop)

Work, Employment, and Industrial Relations Theory Spring 2008

Glossary of Research Terms Compiled by Dr Emma Rowden and David Litting (UTS Library)

Causality and Statistical Learning

III. WHAT ANSWERS DO YOU EXPECT?

CHAPTER 7 QUALITATIVE RESEARCH TOOLS

Chapter 1 Introduction to Educational Research

Quasi-experimental analysis Notes for "Structural modelling".

You must answer question 1.

Chapter 1 Review Questions

Statistical reports Regression, 2010

Writing Reaction Papers Using the QuALMRI Framework

In this chapter we discuss validity issues for quantitative research and for qualitative research.

Hypothesis-Driven Research

Meta-Analysis and Publication Bias: How Well Does the FAT-PET-PEESE Procedure Work?

Section 3.2 Least-Squares Regression

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research

A NON-TECHNICAL INTRODUCTION TO REGRESSIONS. David Romer. University of California, Berkeley. January Copyright 2018 by David Romer

Establishing Causality Convincingly: Some Neat Tricks

School of Nursing, University of British Columbia Vancouver, British Columbia, Canada

Introduction to Applied Research in Economics Kamiljon T. Akramov, Ph.D. IFPRI, Washington, DC, USA

MODULE 3 APPRAISING EVIDENCE. Evidence-Informed Policy Making Training

Business Statistics Probability

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Research Methodology in Social Sciences. by Dr. Rina Astini

Assignment 4: True or Quasi-Experiment

INTERVIEWS II: THEORIES AND TECHNIQUES 5. CLINICAL APPROACH TO INTERVIEWING PART 1

Cost-Utility Analysis (CUA) Explained

CONCEPTUAL FRAMEWORK, EPISTEMOLOGY, PARADIGM, &THEORETICAL FRAMEWORK

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

Sociology 593 Exam 2 March 28, 2003

Multiple Linear Regression (Dummy Variable Treatment) CIVL 7012/8012

Still important ideas

9 research designs likely for PSYC 2100

Econ 270: Theoretical Modeling 1

Evidence Informed Practice Online Learning Module Glossary

6. A theory that has been substantially verified is sometimes called a a. law. b. model.

Chapter 02 Developing and Evaluating Theories of Behavior

Identification with Models and Exogenous Data Variation

Logic models to enhance program performance

The Thesis Writing Process and Literature Review

support support support STAND BY ENCOURAGE AFFIRM STRENGTHEN PROMOTE JOIN IN SOLIDARITY Phase 3 ASSIST of the SASA! Community Mobilization Approach

Speaker Notes: Qualitative Comparative Analysis (QCA) in Implementation Studies

Cognitive domain: Comprehension Answer location: Elements of Empiricism Question type: MC

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

Ordinary Least Squares Regression

The interplay of domain-specific and domain general processes, skills and abilities in the development of science knowledge

SMALL n IMPACT EVALUATION. Howard White and Daniel Phillips 3ie

UNLOCKING VALUE WITH DATA SCIENCE BAYES APPROACH: MAKING DATA WORK HARDER

What is: regression discontinuity design?

INTRODUCTION TO ECONOMETRICS (EC212)

CASE STUDY 2: VOCATIONAL TRAINING FOR DISADVANTAGED YOUTH

Unit 1 Exploring and Understanding Data

The Limits of Inference Without Theory

Running head: INDIVIDUAL DIFFERENCES 1. Why to treat subjects as fixed effects. James S. Adelman. University of Warwick.

Psych 1Chapter 2 Overview

Bradford Hill Criteria for Causal Inference Based on a presentation at the 2015 ANZEA Conference

Chapter Eight: Multivariate Analysis

GPP 501 Microeconomic Analysis for Public Policy Fall 2017

TECH 646 Analysis of Research in Industry and Technology

NEXT. Top 12 Myths about OEE

Unlike standard economics, BE is (most of the time) not based on rst principles, but on observed behavior of real people.

Measuring Impact. Conceptual Issues in Program and Policy Evaluation. Daniel L. Millimet. Southern Methodist University.

The Perils of Empirical Work on Institutions

SOCIOLOGICAL RESEARCH PART I. If you've got the truth you can demonstrate it. Talking doesn't prove it. Robert A. Heinlein

Causality and Statistical Learning

An Experimental Investigation of Self-Serving Biases in an Auditing Trust Game: The Effect of Group Affiliation: Discussion

Introduction to Behavioral Economics Like the subject matter of behavioral economics, this course is divided into two parts:

CHAPTER 2: TWO-VARIABLE REGRESSION ANALYSIS: SOME BASIC IDEAS

Sociological Research Methods and Techniques Alan S.Berger 1

Introduction to Applied Research in Economics

Audio: In this lecture we are going to address psychology as a science. Slide #2

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

Chapter Eight: Multivariate Analysis

Economics of obesity

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

Communication Research Practice Questions

ACADEMIC APPLICATION:

The Regression-Discontinuity Design

CSC2130: Empirical Research Methods for Software Engineering

What Constitutes a Good Contribution to the Literature (Body of Knowledge)?

The Logic of Data Analysis Using Statistical Techniques M. E. Swisher, 2016

Probability Models for Sampling

Chapter 13 Summary Experiments and Observational Studies

5 To Invest or not to Invest? That is the Question.

1.4 - Linear Regression and MS Excel

Handout: Instructions for 1-page proposal (including a sample)

1. Finding your audience (AKA, who you re explaining things to and whose interest you re trying to pique).

Why do Psychologists Perform Research?

Funnelling Used to describe a process of narrowing down of focus within a literature review. So, the writer begins with a broad discussion providing b

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

Transcription:

August 29, 2018 Introduction and Overview

Why are we here? Haavelmo(1944): to become master of the happenings of real life. Theoretical models are necessary tools in our attempts to understand and explain events in real life.... Whatever be the explanations we prefer, it is not to be forgotten that they are all our own artificial inventions in a search for an understanding of real life; they are not hidden truths to be discovered.

The point of empirical work is to tell stories that matter to us Understanding concrete developments ion the world around us Providing useful guides for action Persuading others Not technique! Every economic story we might want to tell has some reflection in data we can observe.

Our goal is to answer questions posed to us by the world to advance a political project to design a policy to make money Judgements of relevance or importance cannot be made from within economics, but only from some outside vantage point.

The map is not the territory Economic data is the product of some social process Concrete social activity comes, through a series of specific activities, to be represented in quantitative form Heterogeneous human activity and its products are classified into discrete categories Market transactions are recorded, then grouped into accounts Private accounts are reported to public authorities in standardized form Survey are conducted, and results compiled Quantities like output or profit or the price level are not preexisting objects in the world, but come into being through particular accounting practices

Products of all this activity comes to us as data, which we select from, assemble, transform through various statistical techniques One important set of techniques is to summarize the variation in the data in terms of a limited set of parameters. Standard statistical distributions are a useful tool for this. always keep in mind that distributions are tools for describing data, not objects existing independently in the world. We use these summaries to describe historical developments, tell stories, make and evaluate causal claims This activity then gets generalized as economic theory

Econometrics Econometrics is one set of tools for organizing data and bringing it into relation with economic theory It is just one set of tools among others!

Econometrics starts from premises The data we see is a reflection of a genuine economic process This data is drawn from an underlying population Variables in that population have their own well-defined statistical distributions The variables are causally linked in such a way that we can regard one of them (the dependent variable) as a function of a set of others (the independent variables) plus a random error term We can draw repeated samples from the same population in the ideal case with identical values of the independent variables So there is a data generating process with stable properties which we can describe statistically

What does a regression do? Seeks to establish the effect of a ceteris paribus change in an independent variable on a dependent variable... by looking at the variation shared by those variables that is not shared with other (independent) variables Express this as an equation in which the dependent variable is a (linear) function of the independent variables plus an error term Assesses the relationship between the dependent and independent variables by comparing it to hypothetical null of no relationship... Strictly speaking in econometrics we don t test our own claims, we test the null and treat rejection of the null as evidence for our preferred hypothesis

... on the basis of an assumed distribution of the error term Often focus on qualitative question of whether observed distribution is far enough from the null hypothesis Sometimes, main finding is the estimated parameter value, often it is simply that a variable was significant or not significant Only qualitative results possible in common case where independent variable used is supposed to be a proxy or instrument for true variable of interest...... or where independent variable is indicator rather than numerical Treats link between independent and dependent variable as black box we are only observing variation at beginning and end of causal chain

People doing regressions worry about: Specification Identification Endogeneity Distribution of the error term Heteroskedasticity Kurtosis Independence (from independent variables and each other) External validity

Econometrics Sometimes a regression is the best approach: We are confident about the relationship between data we observe and underlying relationship we are interested in There s well-defined population that our sample is drawn from The question we are interested in is causal...... and is about effect of a cause rather than cause of an effect Accepting or rejecting the null hypothesis of no relationship answers an interesting question... Negative answers (accepting the null) are more often interesting than positive ones... and/or we can interpret the coefficients quantitatively. The ceteris paribus ( all else equal ) assumption makes sense Reasonably direct link between the variables

For example: Synthetic control studies of minimum wage laws ask what happened to employment in a jurisdiction that raised the minimum wage compared with employment in comparable jurisdictions with similar employment trends that did not raise the minimum We are comparing one place and time to a well defined population of other places We are interested in effect of minimum wage change, not cause of employment change The qualitative question were there fewer jobs? is an interesting one It makes sense to think of the minimum wage increase as a distinct event that could be different without anything else changing

Herndon (2017) compares defaults losses on low documentation mortgages ( liar s loans ) with losses on otherwise similar conventional loans There is a well-defined population of mortgage loans issued in the given period Knowing whether default rates were significantly higher in the absence of documentation is a little interesting...... and the study also tells us how much additional default losses were incurred on low-documentation loans Comparing the experience of a low-documentation loan to the counterfactual of a conventional loan issued to an apparently similar borrower in the time period makes sense There is no ambiguity about the mechanism linking absent documentation to higher default rates

The limits of econometrics Often a regression is not sensible, because of one or more of: We can t directly observe what we are interested in and have to depend on some more or less indirect proxy The question of interest is not causal In many cases we are interested in the joint distribution of the variables and not a cause and effect relationship E.g. the question how much do women earn relative to men is different from the question how does being a woman cause your earnings to change, all else equal? We already know effect of cause (typically because of accounting) There is not a clear or sensible counterfactual

There is no meaningful sense in which the case of interest is drawn from a larger population Was the Great Depression drawn from the distribution of 19th century downturns? is a sensible question to an econometrician. But probably not to anyone else! Our goal is to explain the observed outcome ( cause of effect ) rather than isolate the influence of one variable ( effect of cause ) Important case: Our variables are linked by accounting identities, so we already know effects of causes, want to know what part of variation in dependent variable is attributed to each Rejecting the null doesn t tell us anything interesting, and it s not possible to interpret coefficients quantitatively The mechanism linking variables is unclear/ambiguous, or can be studied directly

Alternatives to econometrics What else is there? Descriptive statistics Characterize distribution of data in sample No reliance on underlying population or data generating process Examples: Principal component analysis, decompositions Accounting Focus on mechanism Qualitative approaches Narrative surveys interviews case studies

Summers, The Scientific Illusion in Empirical Macroeconomics (1991): Formal empirical work which... tries to take models seriously econometrically has had almost no influence... The only empirical research that has influenced thinking about substantive questions has been... attempts to gauge the strength of associations rather than to estimate structural parameters, verbal characterizations of how causal relations might operate rather than explicit mathematical models, and the skillful use of carefully chosen natural experiments rather than sophisticated statistical technique to achieve identification.... Formal econometric work has had little impact on the growth of economic knowledge.

Successful pieces of pragmatic empirical work have three elements in common... First and foremost, in each case, the bottom line was a stylized fact or collection of stylized facts characterizing an aspect of how the world worked rather than parameter estimates or formal tests of a point hypothesis.... Second, pragmatic pieces of empirical work produce regularities of a kind that theory can seek to explain.... Third, successful pieces of pragmatic empirical work have no scientific pretense... No single test is held out as decisive. Many different types of data are examined.

Empirical work in real world In practice, successful empirical work in most contexts depends on A compelling story A clear relationship between observable variables that is consistent with our story, and inconsistent with alternative Scatterplots do a lot of work in real-world debates Attention to mechanism linking cause and effect Accounting - dividing the outcome you are interested in into meaningful components Explaining specific facts/events that people are aware of and care about natural experiments

At the same time, we do need statistical training: Apparent patterns may be spurious may have alternative explanation Time series data is particularly susceptible to spurious patterns Data may need to be transformed appropriately before it is meaningful Regressions are a good way to answer some questions Other people are doing econometrics even if you aren t Joan Robinson: the point of studying economics is not to be fooled by economists

Good empirical work Asks clear, interesting, well-posed question Question comes from world outside of economics - some practical reason to know answer, whether political, business, story we want to tell to wider world Question is answerable. Many interesting sounding questions (especially causal questions) do not have answers! How much questions are more interesting than whether questions Looks for things that will be different about the world in the case where one answer to our question is right, versus if another answer is Even when we are doing exploratory analysis we should have some set of alternatives in mind

Good empirical work Spends a lot of time with the data internal questions how big, what kind of spread, what is it indexed over, observations by index values? maximum and minimum values? how many values are missing? external questions how was it collected, by whom? how are the variables defined? how are they measured? are some values imputed, topcoded, etc.? cleaning data there will be missing values, outliers that seem spurious (but be careful!), fields we don t want, numbers encoded as text, etc. transforming data may need to convert units, take logs, normalize values by something often not obvious what transformation appropriate

Good empirical work Makes use of broader knowledge of world thinks about causal links, does not treat relationship as black box makes use of accounting relationships Develops a clear, substantive claim about the world Describes concrete historical developments Or if it posits general relationship, is very clear about domain over which it applies Presents results in a small number of clear, self-explanatory tables or charts