Regression Analysis II

Size: px
Start display at page:

Download "Regression Analysis II"

Transcription

1 Regression Analysis II Lee D. Walker University of South Carolina COURSE OVERVIEW This course focuses on the theory, practice, and application of linear regression. As Agresti and Finlay argue concerning all social science, the goal of political science research is to understand, explain, and make inference about social phenomena. The goal of this course is to provide students with the intermediate level tools needed to design and implement studies using regression analysis, to read and examine literature that uses regression analysis, and to pursue advance methods in quantitative political analysis. This course assumes a basic understanding of statistics, probability and bivariate regression. Nevertheless, the course begins with a review of basic statistics and the bivariate regression model. We then study multiple regression in depth. We will cover both theory and practice of major aspects of multiple regression analysis. Specifically, we will discuss basic statistical and probability distributions, the bivariate regression model, the multiple regression model, model building, regression diagnostics, what to do when ordinary least squares regression assumptions are violated, linear alternatives to the OLS regression model, and two generalized linear model alternatives to the OLS model that do not assume linearity. I come to this course from a comparative politics background and take an applied approach to statistical methods. I attempt to minimize mathematics in favor of application and interpretation. That being said, some mathematics is unavoidable. I seek to make this class as assessable as possible for students who solely want a firm foundation in statistical application and interpretation. At the same time, I hope that the course will encourage students to seek instruction in more advanced statistical methodologies and approaches. We will use Agresti and Finlay s Statistical Methods for the Social Sciences as the primary text. I also use five Sage monographs that are invaluable as secondary texts for the course and for your personal libraries. From a Computing standpoint, you may complete assignments in any reliable statistical software package. In addition, we read several articles throughout the course. A few of these articles are useful applications, while others are methodological innovations. These articles are listed at the end of the syllabus. Literature Main Text: Agresti, Alan and Barbara Finlay. (1997). Statistical Methods for the Social Sciences 3 rd Edition. Upper Saddle River, NJ: Prentice Hall, Inc.

2 Secondary Text: Berry, William. (1993). Understanding Regression Assumptions. London: Sage. Fox, John. (1991). Regression Diagnostics: An Introduction. London: Sage Gill, Jeff (2000). Generalized Linear Models: A Unified Approach. London: Sage. Hardy, Melissa A. (1993). Regression with Dummy Variables, London: Sage Namboodiri, K. (1984). Matrix Algebra: An Introduction. London: Sage Optional but important and good: Fox, John. (2002). An R and S-PLUS Companion to Applied Regression. Thousand Oaks: Sage Publications. Kennedy, P. (2003). A Guide to Econometrics. Fifth Edition. Cambridge: MIT Press. Assignments: You will be asked to complete five homework assignments during the course, roughly one assignment each week. These assignments emphasize application and interpretation. Generally, you will replicate findings from a previous analysis and extend the analysis based on the specifications of the assignment. These assignments will involve the use of a statistical package. From time to time, you may have smaller assignments, which are designed to aid your understanding of regression concepts or computing operations. My Availability: I am at your disposal. Please do not hesitate to attend my office hours or make an appointment to meet with me. I am excited about working with you and welcome the interaction. Moreover, my teaching assistant is also available to assist you with substantive, statistical, or computing questions. COURSE SCHEDULE Class 1 and Prior Information: Introduction, Sampling, Descriptive Statistics, and Probability Distributions 1. Introduction to Class 2. Description and Inference 3. Sampling 4. Measures of Central Tendencies 5. Measures of Variation 6. Population Parameters 7. Probability distributions for Discrete and Continuous Variables 8. Theoretical Probability Distributions 9. Sampling Distributions 10. Population, Sample, and Sampling Distributions

3 11. Readings a. Agresti and Finlay (Chapter 1, 2, 3). b. King, Keohane, and Verba (Chapter 1) c. King (1986) Statistical inference 1. Point estimation 2. Confidence Interval for a mean 3. Confidence interval for a proportion 4. Choice of sample size 5. Confidence interval for a median 6. Introduction to STATA and R: Data Manipulation in STATA and R 7. Readings a. Agresti and Finlay (Chapter 4 and 5)} b. Fox (2002: )} c. Fox (2002: 34-84} (Reading and Manipulating Data in R) Hypothesis Testing 1. Decisions and error in Test of Hypotheses 2. Small sample Inference for Mean and Proportion 3. Test of Independence 4. Association in 2X2 Tables 5. Proportional Reduction of Error 6. Hypothesis Testing in STATA and R 7. Readings} a. Agresti and Finlay (Chapter 6 and 7) b. Gill (1999) c. Fox (2002: ) d. optional: Fox (2002: 34-84} (Reading and Manipulating Data in R) e. Handout: Hanushek and Jackson (1978) Pre-Assignment 1: Probability and Statistics Class 2: The Bivariate Linear Regression Model Regression Coefficients, R-square and Correlation 1. Least Squares Prediction Equation 2. Linear Regression model 3. Measuring linear association 4. Inference for slope and correlation 5. Test of Independence and Confidence Intervals 6. Coefficient of Determination R-square 7. Readings a. Agresti and Finlay (Chapter 9) b. Fox (2002:18-34) Class 3: The Bivariate Linear Regression Model Model Assumptions and Violations 2. Extrapolation 3. Outliers 4. Residuals

4 5. Data transformation and Graphical Presentations 6. Readings a. Berry pages 3-12 b. Fox (1991: 46-48) c. Fox (2002: ) Pre-Assignment1 due Assignment 2: Bivariate Regression Class 4: Multiple Linear Regression: Correlation and Multiple Regression 1. Multiple Regression Model 2. Multiple Correlation and R-Squared 3. Inference for Multiple Regression Coefficients 4. Modeling Interaction a. Agresti and Finlay: Chapter 11 b. Berry: c. Brambor, Clark, and Golder (2005) Class 5: Multiple Linear Regression 2 1. Comparing Regression Models: The F-Test 2. Partial Correlation-Partial Effects 3. Inference for Partial Correlations 4. Standardized Regression Coefficient 5. Problems with Standardized Regression Coefficients 6. Readings a. Agresti and Finlay (Chapter 11) Class 6: Model Building--Interactions and Dummy Variables 1. Comparing means and Regression Lines 2. Analysis of Covariance Models 3. Inference for Analysis of Covariance Model 4. Comparing Regression Models: The F-Test a. Hardy (1-21) b. Agresti and Finlay (Chapter 13) Assignment 2 due Assignment 3: Multiple Linear Regression Class 7: Model Building More on Categorical Independent Variables 1.Testing Hypotheses with Categorical Independent Variable 2.The nominal categorical independent Variables 3. Ordinal categorical independent Variables 4. Comparing Models: F-Test 5. Treating categorical variables as interval

5 6. Readings a. Hardy 21-29, 48-53, b. Fox c. McDaniel (1996) Class 8: Polynomial Regression as a solution to NonLinear Relationship 1. The Linearity Assumption 2. Detection of nonlinear relationship 3. Scatter Diagrams and Residual and Partial Residual Plots 4. Quadratic Regression a. Bollen and Jackman (1985) b. Agresti and Finlay (1996): Chapter 14, c. Fox (1991): pages 53 to 61. Assignment 3 due Assignment 4: Bollen/Jackman replication Class 9: Multiple Regression Diagnostics and Model Building Residual Analysis, Multicollinearity, Heterosecedasticity, and Influential Observations 1. Model Selection Procedures 2. Omitted Variable Bias 3. Missing Data 4. Measurement Error 5. Collinearity and Multicollinearity in Models 6. Readings a. Berry: b. Fox 1991: c. King, Keohane, and Verba: Chapter 5 d. Agresti and Finlay (Chapter 14: pages ) Class 10: Regression Diagnostics and Social Science Questions 1. Measure of Influence: Cook s Distance DFFITS and DFBETAS 2. Measure of Leverage: hat value 3. Measures of Distance: residuals, standardized residuals, studentized residuals 4. Use of regression diagnostics as explanatory statistic a. Fox (1991: 21-34) b. Berry (6-11) c. Agresti and Finlay (Chapter 14: pages ) d. Bollen and Jackman (1985) Class 11: Multiple Regression Using Matrices 1. Expressing Linear Equations using Matrix Form 2. The Linear Model 3. OLS Assumptions in Matrix Form 4. OLS using Matrices

6 5. Residuals 6. Variance-Covariance Matrices 7. Readings a. Namboodiri (1984) 7-55} b. Fox (80-82 and 83-85)} c. Bekker and Wansbeek (1996)} Assignment 4 due Assignment 5: Georgia Violent Crime Replication Class 12: Heteroscedasticity and Autocorrelation detection and Weighted Least Squares 1. Effects of non-constant errors on estimation and inference 2. Detection of non-constant errors 3. The Omega Matrix 4. Breusch and Pagan/Cook-Weisberg Test for Heteroscedasticity 5. White's Test for Heteroscedasticity 6. Weighted Least Squares and Generalized Least Squares as solutions to heteroscedasticity 7. Readings a. Berry: b.gill (2001: 42-44) c. Fox: d. Lewis and Linzer (2005) Class 13: Non-Normally Distributed Errors and Robust Estimation 1. The Effects on Non-normal errors on estimation and inference 2. Detection of non-normal errors 3. qq-normal plots 4. Huber-M Estimator a. Chatterjee and Wiseman (1983) Class 14: Linear Model and Discrete Data and Limited Dependent Variable 1. Consequence of Violating Continuous Variable assumption 2. Why and when to choose the linear model. 3. Strengths of alternative approaches to the discrete outcome variable 4. OLS versus WLS versus Proportional Odds a. Fujimoto (2005) b. Meier et al. (1999) c. Fox (1991): pages Class 15: The Generalized Linear Model and Analysis of Count Data: The Poisson Model 1. Comparison to OLS 2. MLE Estimation 3. The Poisson Regression Model 4. Interpreting Poisson Coefficients 5. Model Checking

7 6. Wald and Likelihood Ratio tests 7. Residuals and Overdispersion 8. Example: Nonresponse Count data 9. Readings a. Gill (2000) pages 1-7, 9-32, b. King (1988) c. Long (1997) Chapter 8: Count Outcomes Assignment 5 due Class16: Modeling Highly Skewed Data: The Gamma Regression Model 1. Gamma distribution 2. Exponential Family and Exponential Regression 3. Gamma link function 4. Gamma Regression Model 5. Interpreting Gamma regression coefficient estimates 6. Model Checking 7. Wald and Likelihood Ratio tests 8. Residuals and Overdispersion 9. Readings a. Gill (2000) pages 1-7, 9-32, 49-51, b. Agresti and Finlay (1996): pages ; Class 17: Wrap up, Extensions and a quick look at Logistic Regression 1. Dichotomous Choice Model 2. Logistic Regression Model 3. Interpreting Logit Coefficients 4. MLE estimates 5. Newton-Raphson Algorithm 6. Likelihood Ratio Test 7. Readings a. Gill (2000) pages 1-7, 9-32, b. Agresti and Finlay: Chapter 15

8 Articles and Other Useful References Agresti, Alan. (1996). An Introduction to Categorical Data Analysis. New York: John Wiley and Sons. Beck, Nathaniel and Jonathan N. Katz What to do (and not to do) with Time-Series Cross-Section Data. APSR 89(3): Bekker, paul A. and Tom J. Wansbeek. (1996). Proxies versus Omitted Variables in Regression Analysis. Linear Algebra and Its Application. 237/238: Bollen, Kenneth A. and Robert W. Jackman. (1985). Regression Diagnostics: An Expository Treatment of Outliers and Influential Cases. Sociological Methods and Research 13(4): Brambor, Thomas, William Roberts Clark and Matt Golder (2005), Understanding Interaction Models: Improving Empirical Analysis. Political Analysis 16(1): Lewis, Jeffrey B. and Drew A. Linzer. (2005). Estimating Regression Models in Which the Dependent Variable is Based on Estimates. Political Analysis 13: Chatterjee, Sangit and Frederick Wiseman. (1983). Use of Regression Diagnostics in Political Science. American Journal of Political Science 27(3): Fujimoto, Kayo. (2005). From Women s College to Work: Inter-Organizational Networks in the Japanese Female Labor Market. Social Science Research 34: Gibson, James L. Gregory A. Caldeira, and Venessa A. Baird On the Legitimacy of National High Courts. American Political Science Review 92(2): Gill, Jeff. (1999). The Insignificance of Null Hypothesis Significance Testing. Political Research Quarterly 52: King, Gary. (1986). How not to Lie with Statistics: Avoiding Common Mistakes in Quantitative Political Science. American Journal of Political Science 30(August 1986), 666:687. King, Gary. (1988). Statistical Models for Political Science Event Counts: Bias in Convetional Procedures and Evidence for the Exponential Poisson Regression Model, American Journal of Political Science 32(3): King, Gary, Robert O. Keohane, and Sidney Verba. (1994). Designing Social Inquiry. Princeton: Princeton University Press. Chapter 1 and Chapter 5. Long, J. Scott. (1997). Regression Models for Categorical and Limited Dependent Variables. Thousand Oaks: Sage.

9 McDaniel, Timothy. (1996). Categorical Independent Variables in Ordinary Least Squares Regression. Meier, Kenneth J., Robert D. Wrinkle and J.L. Polinard. (1999). Equity Versus Excellence in Organizations: A Substantively Weighted Least Squares Analysis. American Review of Public Administration 29(1): Samuels David J. (2000). "The gubernatorial coattails effect: federalism and congressional elections in Brazil." Journal of Politics 62 (1): Venables, W.N. and B. D. Ripley. (2001). Modern Applied Statistics with S-Plus. New York: Springer.

10

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS - CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS SECOND EDITION Raymond H. Myers Virginia Polytechnic Institute and State university 1 ~l~~l~l~~~~~~~l!~ ~~~~~l~/ll~~ Donated by Duxbury o Thomson Learning,,

More information

The University of North Carolina at Chapel Hill School of Social Work

The University of North Carolina at Chapel Hill School of Social Work The University of North Carolina at Chapel Hill School of Social Work SOWO 918: Applied Regression Analysis and Generalized Linear Models Spring Semester, 2014 Instructor Shenyang Guo, Ph.D., Room 524j,

More information

Biostatistics II

Biostatistics II Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,

More information

SW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits. Master Syllabus

SW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits. Master Syllabus SW 9300 Applied Regression Analysis and Generalized Linear Models 3 Credits Master Syllabus I. COURSE DOMAIN AND BOUNDARIES This is the second course in the research methods sequence for WSU doctoral students.

More information

Applied Regression The University of Texas at Dallas EPPS 6316, Spring 2013 Tuesday, 7pm 9:45pm Room: FO 2.410

Applied Regression The University of Texas at Dallas EPPS 6316, Spring 2013 Tuesday, 7pm 9:45pm Room: FO 2.410 Applied Regression The University of Texas at Dallas EPPS 6316, Spring 2013 Tuesday, 7pm 9:45pm Room: FO 2.410 Professor: J.C. Barnes, Ph.D. Email: jcbarnes@utdallas.edu Phone: (972) 883-2046 Office: GR

More information

INTRODUCTION TO ECONOMETRICS (EC212)

INTRODUCTION TO ECONOMETRICS (EC212) INTRODUCTION TO ECONOMETRICS (EC212) Course duration: 54 hours lecture and class time (Over three weeks) LSE Teaching Department: Department of Economics Lead Faculty (session two): Dr Taisuke Otsu and

More information

Marno Verbeek Erasmus University, the Netherlands. Cons. Pros

Marno Verbeek Erasmus University, the Netherlands. Cons. Pros Marno Verbeek Erasmus University, the Netherlands Using linear regression to establish empirical relationships Linear regression is a powerful tool for estimating the relationship between one variable

More information

11/24/2017. Do not imply a cause-and-effect relationship

11/24/2017. Do not imply a cause-and-effect relationship Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Understanding. Regression Analysis

Understanding. Regression Analysis Understanding Regression Analysis Understanding Regression Analysis Michael Patrick Allen Washington State University Pullman, Washington Plenum Press New York and London Llbrary of Congress Cataloging-in-Publication

More information

Understandable Statistics

Understandable Statistics Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement

More information

Ordinary Least Squares Regression

Ordinary Least Squares Regression Ordinary Least Squares Regression March 2013 Nancy Burns (nburns@isr.umich.edu) - University of Michigan From description to cause Group Sample Size Mean Health Status Standard Error Hospital 7,774 3.21.014

More information

Applied Linear Regression

Applied Linear Regression Applied Linear Regression Applied Linear Regression Third Edition SANFORD WEISBERG University of Minnesota School of Statistics Minneapolis, Minnesota A JOHN WILEY & SONS, INC., PUBLICATION Copyright

More information

Modern Regression Methods

Modern Regression Methods Modern Regression Methods Second Edition THOMAS P. RYAN Acworth, Georgia WILEY A JOHN WILEY & SONS, INC. PUBLICATION Contents Preface 1. Introduction 1.1 Simple Linear Regression Model, 3 1.2 Uses of Regression

More information

Linear Regression Analysis

Linear Regression Analysis Linear Regression Analysis WILEY SERIES IN PROBABILITY AND STATISTICS Established by WALTER A. SHEWHART and SAMUEL S. WILKS Editors: David J. Balding, Peter Bloomfield, Noel A. C. Cressie, Nicholas I.

More information

Ecological Statistics

Ecological Statistics A Primer of Ecological Statistics Second Edition Nicholas J. Gotelli University of Vermont Aaron M. Ellison Harvard Forest Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A. Brief Contents

More information

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill)

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill) Advanced Bayesian Models for the Social Sciences Instructors: Week 1&2: Skyler J. Cranmer Department of Political Science University of North Carolina, Chapel Hill skyler@unc.edu Week 3&4: Daniel Stegmueller

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Applications of Regression Models in Epidemiology

Applications of Regression Models in Epidemiology Applications of Regression Models in Epidemiology Applications of Regression Models in Epidemiology Erick Suárez, Cynthia M. Pérez, Roberto Rivera, and Melissa N. Martínez Copyright 2017 by John Wiley

More information

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0% Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of

More information

MEA DISCUSSION PAPERS

MEA DISCUSSION PAPERS Inference Problems under a Special Form of Heteroskedasticity Helmut Farbmacher, Heinrich Kögel 03-2015 MEA DISCUSSION PAPERS mea Amalienstr. 33_D-80799 Munich_Phone+49 89 38602-355_Fax +49 89 38602-390_www.mea.mpisoc.mpg.de

More information

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS) Chapter : Advanced Remedial Measures Weighted Least Squares (WLS) When the error variance appears nonconstant, a transformation (of Y and/or X) is a quick remedy. But it may not solve the problem, or it

More information

1.4 - Linear Regression and MS Excel

1.4 - Linear Regression and MS Excel 1.4 - Linear Regression and MS Excel Regression is an analytic technique for determining the relationship between a dependent variable and an independent variable. When the two variables have a linear

More information

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n. University of Groningen Latent instrumental variables Ebbes, P. IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Advanced Bayesian Models for the Social Sciences

Advanced Bayesian Models for the Social Sciences Advanced Bayesian Models for the Social Sciences Jeff Harden Department of Political Science, University of Colorado Boulder jeffrey.harden@colorado.edu Daniel Stegmueller Department of Government, University

More information

Interaction Effects: Centering, Variance Inflation Factor, and Interpretation Issues

Interaction Effects: Centering, Variance Inflation Factor, and Interpretation Issues Robinson & Schumacker Interaction Effects: Centering, Variance Inflation Factor, and Interpretation Issues Cecil Robinson Randall E. Schumacker University of Alabama Research hypotheses that include interaction

More information

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences. SPRING GROVE AREA SCHOOL DISTRICT PLANNED COURSE OVERVIEW Course Title: Basic Introductory Statistics Grade Level(s): 11-12 Units of Credit: 1 Classification: Elective Length of Course: 30 cycles Periods

More information

Dr. Kelly Bradley Final Exam Summer {2 points} Name

Dr. Kelly Bradley Final Exam Summer {2 points} Name {2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.

More information

Advanced Handling of Missing Data

Advanced Handling of Missing Data Advanced Handling of Missing Data One-day Workshop Nicole Janz ssrmcta@hermes.cam.ac.uk 2 Goals Discuss types of missingness Know advantages & disadvantages of missing data methods Learn multiple imputation

More information

AP Statistics. Semester One Review Part 1 Chapters 1-5

AP Statistics. Semester One Review Part 1 Chapters 1-5 AP Statistics Semester One Review Part 1 Chapters 1-5 AP Statistics Topics Describing Data Producing Data Probability Statistical Inference Describing Data Ch 1: Describing Data: Graphically and Numerically

More information

Score Tests of Normality in Bivariate Probit Models

Score Tests of Normality in Bivariate Probit Models Score Tests of Normality in Bivariate Probit Models Anthony Murphy Nuffield College, Oxford OX1 1NF, UK Abstract: A relatively simple and convenient score test of normality in the bivariate probit model

More information

STAT445 Midterm Project1

STAT445 Midterm Project1 STAT445 Midterm Project1 Executive Summary This report works on the dataset of Part of This Nutritious Breakfast! In this dataset, 77 different breakfast cereals were collected. The dataset also explores

More information

How to describe bivariate data

How to describe bivariate data Statistics Corner How to describe bivariate data Alessandro Bertani 1, Gioacchino Di Paola 2, Emanuele Russo 1, Fabio Tuzzolino 2 1 Department for the Treatment and Study of Cardiothoracic Diseases and

More information

isc ove ring i Statistics sing SPSS

isc ove ring i Statistics sing SPSS isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A

More information

Limited dependent variable regression models

Limited dependent variable regression models 181 11 Limited dependent variable regression models In the logit and probit models we discussed previously the dependent variable assumed values of 0 and 1, 0 representing the absence of an attribute and

More information

CHILD HEALTH AND DEVELOPMENT STUDY

CHILD HEALTH AND DEVELOPMENT STUDY CHILD HEALTH AND DEVELOPMENT STUDY 9. Diagnostics In this section various diagnostic tools will be used to evaluate the adequacy of the regression model with the five independent variables developed in

More information

M15_BERE8380_12_SE_C15.6.qxd 2/21/11 8:21 PM Page Influence Analysis 1

M15_BERE8380_12_SE_C15.6.qxd 2/21/11 8:21 PM Page Influence Analysis 1 M15_BERE8380_12_SE_C15.6.qxd 2/21/11 8:21 PM Page 1 15.6 Influence Analysis FIGURE 15.16 Minitab worksheet containing computed values for the Studentized deleted residuals, the hat matrix elements, and

More information

Staff Papers Series. Department of Agricultural and Applied Economics

Staff Papers Series. Department of Agricultural and Applied Economics Staff Paper P89-19 June 1989 Staff Papers Series CHOICE OF REGRESSION METHOD FOR DETRENDING TIME SERIES DATA WITH NONNORMAL ERRORS by Scott M. Swinton and Robert P. King Department of Agricultural and

More information

Political Science 15, Winter 2014 Final Review

Political Science 15, Winter 2014 Final Review Political Science 15, Winter 2014 Final Review The major topics covered in class are listed below. You should also take a look at the readings listed on the class website. Studying Politics Scientifically

More information

6. Unusual and Influential Data

6. Unusual and Influential Data Sociology 740 John ox Lecture Notes 6. Unusual and Influential Data Copyright 2014 by John ox Unusual and Influential Data 1 1. Introduction I Linear statistical models make strong assumptions about the

More information

Chapter 1: Explaining Behavior

Chapter 1: Explaining Behavior Chapter 1: Explaining Behavior GOAL OF SCIENCE is to generate explanations for various puzzling natural phenomenon. - Generate general laws of behavior (psychology) RESEARCH: principle method for acquiring

More information

Data Analysis Using Regression and Multilevel/Hierarchical Models

Data Analysis Using Regression and Multilevel/Hierarchical Models Data Analysis Using Regression and Multilevel/Hierarchical Models ANDREW GELMAN Columbia University JENNIFER HILL Columbia University CAMBRIDGE UNIVERSITY PRESS Contents List of examples V a 9 e xv " Preface

More information

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 3: Overview of Descriptive Statistics October 3, 2005 Lecture Outline Purpose

More information

Pitfalls in Linear Regression Analysis

Pitfalls in Linear Regression Analysis Pitfalls in Linear Regression Analysis Due to the widespread availability of spreadsheet and statistical software for disposal, many of us do not really have a good understanding of how to use regression

More information

Practical Multivariate Analysis

Practical Multivariate Analysis Texts in Statistical Science Practical Multivariate Analysis Fifth Edition Abdelmonem Afifi Susanne May Virginia A. Clark CRC Press Taylor & Francis Group Boca Raton London New York CRC Press is an imprint

More information

Performance of Median and Least Squares Regression for Slightly Skewed Data

Performance of Median and Least Squares Regression for Slightly Skewed Data World Academy of Science, Engineering and Technology 9 Performance of Median and Least Squares Regression for Slightly Skewed Data Carolina Bancayrin - Baguio Abstract This paper presents the concept of

More information

This tutorial presentation is prepared by. Mohammad Ehsanul Karim

This tutorial presentation is prepared by. Mohammad Ehsanul Karim STATA: The Red tutorial STATA: The Red tutorial This tutorial presentation is prepared by Mohammad Ehsanul Karim ehsan.karim@gmail.com STATA: The Red tutorial This tutorial presentation is prepared by

More information

Correlation and regression

Correlation and regression PG Dip in High Intensity Psychological Interventions Correlation and regression Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/ Correlation Example: Muscle strength

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement

More information

WELCOME! Lecture 11 Thommy Perlinger

WELCOME! Lecture 11 Thommy Perlinger Quantitative Methods II WELCOME! Lecture 11 Thommy Perlinger Regression based on violated assumptions If any of the assumptions are violated, potential inaccuracies may be present in the estimated regression

More information

An Introduction to Modern Econometrics Using Stata

An Introduction to Modern Econometrics Using Stata An Introduction to Modern Econometrics Using Stata CHRISTOPHER F. BAUM Department of Economics Boston College A Stata Press Publication StataCorp LP College Station, Texas Contents Illustrations Preface

More information

Adaptive Aspirations in an American Financial Services Organization: A Field Study

Adaptive Aspirations in an American Financial Services Organization: A Field Study Adaptive Aspirations in an American Financial Services Organization: A Field Study Stephen J. Mezias Department of Management and Organizational Behavior Leonard N. Stern School of Business New York University

More information

Data Analysis with SPSS

Data Analysis with SPSS Data Analysis with SPSS A First Course in Applied Statistics Fourth Edition Stephen Sweet Ithaca College Karen Grace-Martin The Analysis Factor Allyn & Bacon Boston Columbus Indianapolis New York San Francisco

More information

Analyzing binary outcomes, going beyond logistic regression

Analyzing binary outcomes, going beyond logistic regression Analyzing binary outcomes, going beyond logistic regression 2018 EHE Forum presentation James O. Uanhoro Department of Educational Studies Premise Obtaining relative risk using Poisson regression Obtaining

More information

Measurement Error in Nonlinear Models

Measurement Error in Nonlinear Models Measurement Error in Nonlinear Models R.J. CARROLL Professor of Statistics Texas A&M University, USA D. RUPPERT Professor of Operations Research and Industrial Engineering Cornell University, USA and L.A.

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 10, 11) Please note chapter

More information

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions

More information

Clincial Biostatistics. Regression

Clincial Biostatistics. Regression Regression analyses Clincial Biostatistics Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a

More information

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance The SAGE Encyclopedia of Educational Research, Measurement, Multivariate Analysis of Variance Contributors: David W. Stockburger Edited by: Bruce B. Frey Book Title: Chapter Title: "Multivariate Analysis

More information

Industrial and Manufacturing Engineering 786. Applied Biostatistics in Ergonomics Spring 2012 Kurt Beschorner

Industrial and Manufacturing Engineering 786. Applied Biostatistics in Ergonomics Spring 2012 Kurt Beschorner Industrial and Manufacturing Engineering 786 Applied Biostatistics in Ergonomics Spring 2012 Kurt Beschorner Note: This syllabus is not finalized and is subject to change up until the start of the class.

More information

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing

More information

Propensity Score Analysis Shenyang Guo, Ph.D.

Propensity Score Analysis Shenyang Guo, Ph.D. Propensity Score Analysis Shenyang Guo, Ph.D. Upcoming Seminar: April 7-8, 2017, Philadelphia, Pennsylvania Propensity Score Analysis 1. Overview 1.1 Observational studies and challenges 1.2 Why and when

More information

Online Appendix. According to a recent survey, most economists expect the economic downturn in the United

Online Appendix. According to a recent survey, most economists expect the economic downturn in the United Online Appendix Part I: Text of Experimental Manipulations and Other Survey Items a. Macroeconomic Anxiety Prime According to a recent survey, most economists expect the economic downturn in the United

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

Introduction to Econometrics

Introduction to Econometrics Global edition Introduction to Econometrics Updated Third edition James H. Stock Mark W. Watson MyEconLab of Practice Provides the Power Optimize your study time with MyEconLab, the online assessment and

More information

Chapter 4: More about Relationships between Two-Variables Review Sheet

Chapter 4: More about Relationships between Two-Variables Review Sheet Review Sheet 4. Which of the following is true? A) log(ab) = log A log B. D) log(a/b) = log A log B. B) log(a + B) = log A + log B. C) log A B = log A log B. 5. Suppose we measure a response variable Y

More information

Introduction to Survival Analysis Procedures (Chapter)

Introduction to Survival Analysis Procedures (Chapter) SAS/STAT 9.3 User s Guide Introduction to Survival Analysis Procedures (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation

More information

Introduction to the Logic of Comparative Research

Introduction to the Logic of Comparative Research SPS Seminar 1st term 2017-2018 Introduction to the Logic of Comparative Research Organised by Stefano Bartolini 26-27 September (14:00-16:00) and 4-6 October 2017 (11:00-13:00) Seminar Room 2, Badia Fiesolana

More information

Final Exam - section 2. Thursday, December hours, 30 minutes

Final Exam - section 2. Thursday, December hours, 30 minutes Econometrics, ECON312 San Francisco State University Michael Bar Fall 2011 Final Exam - section 2 Thursday, December 15 2 hours, 30 minutes Name: Instructions 1. This is closed book, closed notes exam.

More information

Introduction to Meta-Analysis

Introduction to Meta-Analysis Introduction to Meta-Analysis Nazım Ço galtay and Engin Karada g Abstract As a means to synthesize the results of multiple studies, the chronological development of the meta-analysis method was in parallel

More information

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations)

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) After receiving my comments on the preliminary reports of your datasets, the next step for the groups is to complete

More information

Course Information and Reading List

Course Information and Reading List Course Information and Reading List Please note the following: In case you need or want a grade for the course, the grade is determined as follows: - Participation in class: 30 % - Research paper: 70 %

More information

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis DSC 4/5 Multivariate Statistical Methods Applications DSC 4/5 Multivariate Statistical Methods Discriminant Analysis Identify the group to which an object or case (e.g. person, firm, product) belongs:

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Journal of Political Economy, Vol. 93, No. 2 (Apr., 1985)

Journal of Political Economy, Vol. 93, No. 2 (Apr., 1985) Confirmations and Contradictions Journal of Political Economy, Vol. 93, No. 2 (Apr., 1985) Estimates of the Deterrent Effect of Capital Punishment: The Importance of the Researcher's Prior Beliefs Walter

More information

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale. Model Based Statistics in Biology. Part V. The Generalized Linear Model. Single Explanatory Variable on an Ordinal Scale ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10,

More information

ICPSR Causal Inference in the Social Sciences. Course Syllabus

ICPSR Causal Inference in the Social Sciences. Course Syllabus ICPSR 2012 Causal Inference in the Social Sciences Course Syllabus Instructors: Dominik Hangartner London School of Economics Marco Steenbergen University of Zurich Teaching Fellow: Ben Wilson London School

More information

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA BIOSTATISTICAL METHODS AND RESEARCH DESIGNS Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA Keywords: Case-control study, Cohort study, Cross-Sectional Study, Generalized

More information

Correlated to: ACT College Readiness Standards Science (High School)

Correlated to: ACT College Readiness Standards Science (High School) ACT College Readiness Science Score Range - 1-12 Students who score in the 1 12 range are most likely beginning to develop the knowledge and skills assessed in the other score ranges. locate data in simple

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Objectives: Students will: Construct and interpret a scatterplot for a set of bivariate data. Compute and interpret the correlation, r, between two variables. Demonstrate

More information

IAPT: Regression. Regression analyses

IAPT: Regression. Regression analyses Regression analyses IAPT: Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a student project

More information

Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values

Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values Sutthipong Meeyai School of Transportation Engineering, Suranaree University of Technology,

More information

REGRESSION MODELLING IN PREDICTING MILK PRODUCTION DEPENDING ON DAIRY BOVINE LIVESTOCK

REGRESSION MODELLING IN PREDICTING MILK PRODUCTION DEPENDING ON DAIRY BOVINE LIVESTOCK REGRESSION MODELLING IN PREDICTING MILK PRODUCTION DEPENDING ON DAIRY BOVINE LIVESTOCK Agatha POPESCU University of Agricultural Sciences and Veterinary Medicine Bucharest, 59 Marasti, District 1, 11464,

More information

PRINCIPLES OF STATISTICS

PRINCIPLES OF STATISTICS PRINCIPLES OF STATISTICS STA-201-TE This TECEP is an introduction to descriptive and inferential statistics. Topics include: measures of central tendency, variability, correlation, regression, hypothesis

More information

Correlation and Regression

Correlation and Regression Dublin Institute of Technology ARROW@DIT Books/Book Chapters School of Management 2012-10 Correlation and Regression Donal O'Brien Dublin Institute of Technology, donal.obrien@dit.ie Pamela Sharkey Scott

More information

On the purpose of testing:

On the purpose of testing: Why Evaluation & Assessment is Important Feedback to students Feedback to teachers Information to parents Information for selection and certification Information for accountability Incentives to increase

More information

Bootstrapping Residuals to Estimate the Standard Error of Simple Linear Regression Coefficients

Bootstrapping Residuals to Estimate the Standard Error of Simple Linear Regression Coefficients Bootstrapping Residuals to Estimate the Standard Error of Simple Linear Regression Coefficients Muhammad Hasan Sidiq Kurniawan 1) 1)* Department of Statistics, Universitas Islam Indonesia hasansidiq@uiiacid

More information

Georgetown University ECON-616, Fall Macroeconometrics. URL: Office Hours: by appointment

Georgetown University ECON-616, Fall Macroeconometrics.   URL:  Office Hours: by appointment Georgetown University ECON-616, Fall 2016 Macroeconometrics Instructor: Ed Herbst E-mail: ed.herbst@gmail.com URL: http://edherbst.net/ Office Hours: by appointment Scheduled Class Time and Organization:

More information

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen

More information

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 5, 6, 7, 8, 9 10 & 11)

More information

Jake Bowers Wednesdays, 2-4pm 6648 Haven Hall ( ) CPS Phone is

Jake Bowers Wednesdays, 2-4pm 6648 Haven Hall ( ) CPS Phone is Political Science 688 Applied Bayesian and Robust Statistical Methods in Political Research Winter 2005 http://www.umich.edu/ jwbowers/ps688.html Class in 7603 Haven Hall 10-12 Friday Instructor: Office

More information

EMPIRICAL STRATEGIES IN LABOUR ECONOMICS

EMPIRICAL STRATEGIES IN LABOUR ECONOMICS EMPIRICAL STRATEGIES IN LABOUR ECONOMICS University of Minho J. Angrist NIPE Summer School June 2009 This course covers core econometric ideas and widely used empirical modeling strategies. The main theoretical

More information

Estimating Heterogeneous Choice Models with Stata

Estimating Heterogeneous Choice Models with Stata Estimating Heterogeneous Choice Models with Stata Richard Williams Notre Dame Sociology rwilliam@nd.edu West Coast Stata Users Group Meetings October 25, 2007 Overview When a binary or ordinal regression

More information

Logistic regression: Why we often can do what we think we can do 1.

Logistic regression: Why we often can do what we think we can do 1. Logistic regression: Why we often can do what we think we can do 1. Augst 8 th 2015 Maarten L. Buis, University of Konstanz, Department of History and Sociology maarten.buis@uni.konstanz.de All propositions

More information

10. LINEAR REGRESSION AND CORRELATION

10. LINEAR REGRESSION AND CORRELATION 1 10. LINEAR REGRESSION AND CORRELATION The contingency table describes an association between two nominal (categorical) variables (e.g., use of supplemental oxygen and mountaineer survival ). We have

More information

7 Statistical Issues that Researchers Shouldn t Worry (So Much) About

7 Statistical Issues that Researchers Shouldn t Worry (So Much) About 7 Statistical Issues that Researchers Shouldn t Worry (So Much) About By Karen Grace-Martin Founder & President About the Author Karen Grace-Martin is the founder and president of The Analysis Factor.

More information