Fitting discrete-data regression models in social science

Size: px
Start display at page:

Download "Fitting discrete-data regression models in social science"

Transcription

1 Fitting discrete-data regression models in social science Andrew Gelman Dept. of Statistics and Dept. of Political Science Columbia University For Greg Wawro sclass, 7 Oct 2010

2 Today s class Example: wells in Bangladesh Building a logistic regression model Logistic regression with interactions Evaluating, checking, and comparing models Probit regression and the latent-variable model Mapping the logit/probit regressions to a formal model of preferences Related methodological topics

3 Natural arsenic in well water

4 Mix of high and low-arsenic wells

5 Digging new wells

6 Today s class Example: wells in Bangladesh Building a logistic regression model Logistic regression with interactions Evaluating, checking, and comparing models Probit regression and the latent-variable model Mapping the logit/probit regressions to a formal model of preferences Related methodological topics

7 Survey data: would you switch wells? Logistic regression Predictor variables: Distance to nearest safe well Arsenic level of your current well Education Membership in community organizations (not predictive)

8 Predicting switching given distance Pr (switch) = logit -1 ( *dist100)

9 Predicting switching given distance and arsenic level Pr (switch) = logit -1 ( *dist *As)

10 Predicting switching given distance and arsenic level Pr (switch) = logit -1 ( *dist *As)

11 Today s class Example: wells in Bangladesh Building a logistic regression model Logistic regression with interactions Evaluating, checking, and comparing models Probit regression and the latent-variable model Mapping the logit/probit regressions to a formal model of preferences Related methodological topics

12 Adding the interaction

13 Using centered inputs

14

15 Fitted model with interactions Nonparallel lines (on logit scale)

16 Adding social predictors assoc has wrong sign and is not statistically significant, so discard!

17 After discarding assoc

18 Try more interactions (Interpret each coefficient )

19 Today s class Example: wells in Bangladesh Building a logistic regression model Logistic regression with interactions Evaluating, checking, and comparing models Probit regression and the latent-variable model Mapping the logit/probit regressions to a formal model of preferences Related methodological topics

20 Residual plot

21 Binned residual plot

22 Binned residuals vs. distance

23 Binned residuals vs. arsenic

24 Try the log scale:

25 New model (Qualitatively similar to earlier model)

26 Comparing old and new models (Education is held constant at its average value)

27 Binned residuals new model (Pretty good, not perfect)

28 Model for switching Distance to walk comes in linearly Does this make sense? Yes Current arsenic level comes in on the log scale Does this make sense? Yes (psychologically) No (physically) Positive interaction between distance and arsenic Does this make sense??

29 Today s class Example: wells in Bangladesh Building a logistic regression model Logistic regression with interactions Evaluating, checking, and comparing models Probit regression and the latent-variable model Mapping the logit/probit regressions to a formal model of preferences Related methodological topics

30 Probit regression Take the logistic regression coefficients and s.e. s: and just divide everything by 1.6

31 Latent-data model Logit: Probit: Same model, but Probit coefs are logit coefs divided by 1.6

32 Taking the latent data seriously Example: modeling political preferences Pr(person i votes Republican) = Latent data Interpret z i as a continuous attitude Can be measured using feeling thermometer questions Can be modeled as being stable across issues or over time

33 Today s class Example: wells in Bangladesh Building a logistic regression model Logistic regression with interactions Evaluating, checking, and comparing models Probit regression and the latent-variable model Mapping the logit/probit regressions to a formal model of preferences Related methodological topics

34 Logistic regression as a formal model Well-switching model: of choice Decision for household i with As level As i a i As i = benefit of switching to a safe well b i + c i x i = cost of switching to a well at distance x i Pr(switch) = Pr(y i =1) = Pr(a i As i > b i + c i x i ) = Pr((a i As i -b i )/c i > x i )

35 Decision for household i with As level As i a i As i = benefit of switching to a safe well b i + c i x i = cost of switching to a well at distance x i Pr(switch) = Pr(y i =1) = Pr(a i As i > b i + c i x i ) = Pr((a i As i -b i )/c i > x i ) All depends on distribution of (a i As i -b i )/c i The net benefit of switching, divided by the cost per distance traveled to a new well If (a i As i -b i )/c i has an approximately normal distin the population, then the logit/probit model makes sense

36 Discrete choice model

37 Today s class Example: wells in Bangladesh Building a logistic regression model Logistic regression with interactions Evaluating, checking, and comparing models Probit regression and the latent-variable model Mapping the logit/probit regressions to a formal model of preferences Related methodological topics

38 Some topics in fitting discrete-data regression models Poisson models Treating discrete variables as continuous Unordered categorical regression Robust alternative to logit-probit

39 Poisson models Always Poisson regression, never Poisson distribution Always allow for overdispersion

40 Treating discrete variables as continuous Binary variables Often you can just fit with a linear model Ordered categories Strong Dem, Dem, Weak Dem,..., Strong Rep Just treat them as 1-7 on a continuous scale

41 Unordered categorical regression Use an ordered model if possible or else try repeated binary splits Example: Bush/Clinton/Perot/no-vote Vote or no-vote If vote: major party or Perot If major party: Bush or Clinton

42 Robust alternative to logit-probit Bound the probabilities away from 0 and 1 to allow for contamination

43 Robit (robust) regression

44 Today s class Example: wells in Bangladesh Building a logistic regression model Logistic regression with interactions Evaluating, checking, and comparing models Probit regression and the latent-variable model Mapping the logit/probit regressions to a formal model of preferences Related methodological topics

Data Analysis Using Regression and Multilevel/Hierarchical Models

Data Analysis Using Regression and Multilevel/Hierarchical Models Data Analysis Using Regression and Multilevel/Hierarchical Models ANDREW GELMAN Columbia University JENNIFER HILL Columbia University CAMBRIDGE UNIVERSITY PRESS Contents List of examples V a 9 e xv " Preface

More information

Logistic regression. Department of Statistics, University of South Carolina. Stat 205: Elementary Statistics for the Biological and Life Sciences

Logistic regression. Department of Statistics, University of South Carolina. Stat 205: Elementary Statistics for the Biological and Life Sciences Logistic regression Department of Statistics, University of South Carolina Stat 205: Elementary Statistics for the Biological and Life Sciences 1 / 1 Logistic regression: pp. 538 542 Consider Y to be binary

More information

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale. Model Based Statistics in Biology. Part V. The Generalized Linear Model. Single Explanatory Variable on an Ordinal Scale ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10,

More information

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018 Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this

More information

Advanced Bayesian Models for the Social Sciences

Advanced Bayesian Models for the Social Sciences Advanced Bayesian Models for the Social Sciences Jeff Harden Department of Political Science, University of Colorado Boulder jeffrey.harden@colorado.edu Daniel Stegmueller Department of Government, University

More information

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill)

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill) Advanced Bayesian Models for the Social Sciences Instructors: Week 1&2: Skyler J. Cranmer Department of Political Science University of North Carolina, Chapel Hill skyler@unc.edu Week 3&4: Daniel Stegmueller

More information

Appendix D: Statistical Modeling

Appendix D: Statistical Modeling Appendix D: Statistical Modeling Cluster analysis Cluster analysis is a method of grouping people based on specific sets of characteristics. Often used in marketing and communication, its goal is to identify

More information

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016 UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Midterm, 2016 Exam policy: This exam allows one one-page, two-sided cheat sheet; No other materials. Time: 80 minutes. Be sure to write your name and

More information

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen

More information

Arab American Voters 2014

Arab American Voters 2014 Arab American Voters 2014 Their Identity and Political Concerns November 24, 2014 Executive Summary Identity and Personal Concerns Ethnic pride and identity remains high among Arab Americans. A majority

More information

Online appendix to The 2008 election: A preregistered replication analysis. 2. Switch to fully Bayesian inference and replications using old data

Online appendix to The 2008 election: A preregistered replication analysis. 2. Switch to fully Bayesian inference and replications using old data Online appendix to The 2008 election: A preregistered replication analysis Rayleigh Lei Andrew Gelman Yair Ghitza 1 Feb 2016 1. Introduction Lei et al. (2015) describe a set of post hoc replications and

More information

Technical appendix Strengthening accountability through media in Bangladesh: final evaluation

Technical appendix Strengthening accountability through media in Bangladesh: final evaluation Technical appendix Strengthening accountability through media in Bangladesh: final evaluation July 2017 Research and Learning Contents Introduction... 3 1. Survey sampling methodology... 4 2. Regression

More information

Statistical reports Regression, 2010

Statistical reports Regression, 2010 Statistical reports Regression, 2010 Niels Richard Hansen June 10, 2010 This document gives some guidelines on how to write a report on a statistical analysis. The document is organized into sections that

More information

CSE 258 Lecture 2. Web Mining and Recommender Systems. Supervised learning Regression

CSE 258 Lecture 2. Web Mining and Recommender Systems. Supervised learning Regression CSE 258 Lecture 2 Web Mining and Recommender Systems Supervised learning Regression Supervised versus unsupervised learning Learning approaches attempt to model data in order to solve a problem Unsupervised

More information

7 Statistical Issues that Researchers Shouldn t Worry (So Much) About

7 Statistical Issues that Researchers Shouldn t Worry (So Much) About 7 Statistical Issues that Researchers Shouldn t Worry (So Much) About By Karen Grace-Martin Founder & President About the Author Karen Grace-Martin is the founder and president of The Analysis Factor.

More information

In this module I provide a few illustrations of options within lavaan for handling various situations.

In this module I provide a few illustrations of options within lavaan for handling various situations. In this module I provide a few illustrations of options within lavaan for handling various situations. An appropriate citation for this material is Yves Rosseel (2012). lavaan: An R Package for Structural

More information

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data

Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data Analysis of Environmental Data Conceptual Foundations: En viro n m e n tal Data 1. Purpose of data collection...................................................... 2 2. Samples and populations.......................................................

More information

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized)

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized) Criminal Justice Doctoral Comprehensive Exam Statistics August 2016 There are two questions on this exam. Be sure to answer both questions in the 3 and half hours to complete this exam. Read the instructions

More information

Learning about Unobservable Quality: Empirical. Evidence from Choice of Health Facilities in Tanzania

Learning about Unobservable Quality: Empirical. Evidence from Choice of Health Facilities in Tanzania Learning about Unobservable Quality: Empirical Evidence from Choice of Health Facilities in Tanzania Kenneth L. Leonard February 5, 2004 Abstract Patients in rural Africa choose between available health

More information

University of Oxford Intermediate Social Statistics: Lecture One

University of Oxford Intermediate Social Statistics: Lecture One University of Oxford Intermediate Social Statistics: Lecture One Raymond M. Duch Nuffield College Oxford www.raymondduch.com @RayDuch January 17, 2012 Course Requirements Course Requirements Eight Lectures

More information

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics.

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics. Dae-Jin Lee dlee@bcamath.org Basque Center for Applied Mathematics http://idaejin.github.io/bcam-courses/ D.-J. Lee (BCAM) Intro to GLM s with R GitHub: idaejin 1/40 Modeling count data Introduction Response

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multinominal Logistic Regression SPSS procedure of MLR Example based on prison data Interpretation of SPSS output Presenting

More information

Analyzing binary outcomes, going beyond logistic regression

Analyzing binary outcomes, going beyond logistic regression Analyzing binary outcomes, going beyond logistic regression 2018 EHE Forum presentation James O. Uanhoro Department of Educational Studies Premise Obtaining relative risk using Poisson regression Obtaining

More information

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012 STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION by XIN SUN PhD, Kansas State University, 2012 A THESIS Submitted in partial fulfillment of the requirements

More information

Clincial Biostatistics. Regression

Clincial Biostatistics. Regression Regression analyses Clincial Biostatistics Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a

More information

STAT 503X Case Study 1: Restaurant Tipping

STAT 503X Case Study 1: Restaurant Tipping STAT 503X Case Study 1: Restaurant Tipping 1 Description Food server s tips in restaurants may be influenced by many factors including the nature of the restaurant, size of the party, table locations in

More information

Selection and Combination of Markers for Prediction

Selection and Combination of Markers for Prediction Selection and Combination of Markers for Prediction NACC Data and Methods Meeting September, 2010 Baojiang Chen, PhD Sarah Monsell, MS Xiao-Hua Andrew Zhou, PhD Overview 1. Research motivation 2. Describe

More information

+/ 4.0%. Strong oppose legalization. Value % 13.7% 6.8% 41.7% 36.3% 12.9% 5.9% 44.9%

+/ 4.0%. Strong oppose legalization. Value % 13.7% 6.8% 41.7% 36.3% 12.9% 5.9% 44.9% Colorado Cannabis Survey The following questions are from a survey of 6000 Colorado registered voters. The interviews were conductedd via land lines and cell phones on 8 July, 05. The sampling error for

More information

Modeling unobserved heterogeneity in Stata

Modeling unobserved heterogeneity in Stata Modeling unobserved heterogeneity in Stata Rafal Raciborski StataCorp LLC November 27, 2017 Rafal Raciborski (StataCorp) Modeling unobserved heterogeneity November 27, 2017 1 / 59 Plan of the talk Concepts

More information

Why Mixed Effects Models?

Why Mixed Effects Models? Why Mixed Effects Models? Mixed Effects Models Recap/Intro Three issues with ANOVA Multiple random effects Categorical data Focus on fixed effects What mixed effects models do Random slopes Link functions

More information

Media, Discussion and Attitudes Technical Appendix. 6 October 2015 BBC Media Action Andrea Scavo and Hana Rohan

Media, Discussion and Attitudes Technical Appendix. 6 October 2015 BBC Media Action Andrea Scavo and Hana Rohan Media, Discussion and Attitudes Technical Appendix 6 October 2015 BBC Media Action Andrea Scavo and Hana Rohan 1 Contents 1 BBC Media Action Programming and Conflict-Related Attitudes (Part 5a: Media and

More information

Introduction to Logistic Regression

Introduction to Logistic Regression Introduction to Logistic Regression Author: Nicholas G Reich This material is part of the statsteachr project Made available under the Creative Commons Attribution-ShareAlike 3.0 Unported License: http://creativecommons.org/licenses/by-sa/3.0/deed.en

More information

Reveal Relationships in Categorical Data

Reveal Relationships in Categorical Data SPSS Categories 15.0 Specifications Reveal Relationships in Categorical Data Unleash the full potential of your data through perceptual mapping, optimal scaling, preference scaling, and dimension reduction

More information

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 3: Overview of Descriptive Statistics October 3, 2005 Lecture Outline Purpose

More information

On the purpose of testing:

On the purpose of testing: Why Evaluation & Assessment is Important Feedback to students Feedback to teachers Information to parents Information for selection and certification Information for accountability Incentives to increase

More information

PRESIDENT OBAMA, THE SUPREME COURT AND THE ECONOMY May 6-12, 2009

PRESIDENT OBAMA, THE SUPREME COURT AND THE ECONOMY May 6-12, 2009 CBS NEWS POLL For Release: Wednesday, May 13, 2009 6:30 pm (EDT) PRESIDENT OBAMA, THE SUPREME COURT AND THE ECONOMY May 6-12, 2009 Americans continue to give President Barack Obama high marks, and increasing

More information

Lecture 21. RNA-seq: Advanced analysis

Lecture 21. RNA-seq: Advanced analysis Lecture 21 RNA-seq: Advanced analysis Experimental design Introduction An experiment is a process or study that results in the collection of data. Statistical experiments are conducted in situations in

More information

MULTIPLE REGRESSION OF CPS DATA

MULTIPLE REGRESSION OF CPS DATA MULTIPLE REGRESSION OF CPS DATA A further inspection of the relationship between hourly wages and education level can show whether other factors, such as gender and work experience, influence wages. Linear

More information

Class 7 Everything is Related

Class 7 Everything is Related Class 7 Everything is Related Correlational Designs l 1 Topics Types of Correlational Designs Understanding Correlation Reporting Correlational Statistics Quantitative Designs l 2 Types of Correlational

More information

THE ARAB AMERICAN VOTE September 27, 2012

THE ARAB AMERICAN VOTE September 27, 2012 THE ARAB AMERICAN VOTE 2012 September 27, 2012 Executive Summary New polling on Arab American voters in 2012* shows a continuing shift away from the Republican Party, a 15% drop in support for President

More information

IAPT: Regression. Regression analyses

IAPT: Regression. Regression analyses Regression analyses IAPT: Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a student project

More information

Correlation and regression

Correlation and regression PG Dip in High Intensity Psychological Interventions Correlation and regression Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/ Correlation Example: Muscle strength

More information

QPM Lab 9: Contingency Tables and Bivariate Displays in R

QPM Lab 9: Contingency Tables and Bivariate Displays in R QPM Lab 9: Contingency Tables and Bivariate Displays in R Department of Political Science Washington University, St. Louis November 3-4, 2016 QPM Lab 9: Contingency Tables and Bivariate Displays in R 1

More information

Supporting Information Identification of Amino Acids with Sensitive Nanoporous MoS 2 : Towards Machine Learning-Based Prediction

Supporting Information Identification of Amino Acids with Sensitive Nanoporous MoS 2 : Towards Machine Learning-Based Prediction Supporting Information Identification of Amino Acids with Sensitive Nanoporous MoS 2 : Towards Machine Learning-Based Prediction Amir Barati Farimani, Mohammad Heiranian, Narayana R. Aluru 1 Department

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression Assoc. Prof Dr Sarimah Abdullah Unit of Biostatistics & Research Methodology School of Medical Sciences, Health Campus Universiti Sains Malaysia Regression Regression analysis

More information

Regression so far... Lecture 22 - Logistic Regression. Odds. Recap of what you should know how to do... At this point we have covered: Sta102 / BME102

Regression so far... Lecture 22 - Logistic Regression. Odds. Recap of what you should know how to do... At this point we have covered: Sta102 / BME102 Background Regression so far... Lecture 22 - Sta102 / BME102 Colin Rundel November 23, 2015 At this point we have covered: Simple linear regression Relationship between numerical response and a numerical

More information

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Activity 1 Examining Data From Class Background Download

More information

11/24/2017. Do not imply a cause-and-effect relationship

11/24/2017. Do not imply a cause-and-effect relationship Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection

More information

A Practical Guide to Measuring Social Structure Using Indirectly Observed Network Data

A Practical Guide to Measuring Social Structure Using Indirectly Observed Network Data Journal of Statistical Theory and Practice, 7:120 132,2013 Copyright Grace Scientific Publishing, LLC ISSN: 1559-8608 print / 1559-8616 online DOI: 10.1080/15598608.2013.756360 A Practical Guide to Measuring

More information

You must answer question 1.

You must answer question 1. Research Methods and Statistics Specialty Area Exam October 28, 2015 Part I: Statistics Committee: Richard Williams (Chair), Elizabeth McClintock, Sarah Mustillo You must answer question 1. 1. Suppose

More information

Opium for the Masses: How Foreign Media Can Stabilize Authoritarian Regimes. Online Supplement

Opium for the Masses: How Foreign Media Can Stabilize Authoritarian Regimes. Online Supplement Opium for the Masses: How Foreign Media Can Stabilize Authoritarian Regimes Online Supplement This online supplement contains the results of additional analyses that had to be omitted from the paper because

More information

Inequality and Happiness

Inequality and Happiness Inequality and Happiness Claudia Biancotti and Giovanni D Alessio Bank of Italy, Economic Research Department Unequal results Theoretical work, field experiments and surveybased studies show that inequality

More information

Sociology Exam 3 Answer Key [Draft] May 9, 201 3

Sociology Exam 3 Answer Key [Draft] May 9, 201 3 Sociology 63993 Exam 3 Answer Key [Draft] May 9, 201 3 I. True-False. (20 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. Bivariate regressions are

More information

Estimating Heterogeneous Choice Models with Stata

Estimating Heterogeneous Choice Models with Stata Estimating Heterogeneous Choice Models with Stata Richard Williams Notre Dame Sociology rwilliam@nd.edu West Coast Stata Users Group Meetings October 25, 2007 Overview When a binary or ordinal regression

More information

Least likely observations in regression models for categorical outcomes

Least likely observations in regression models for categorical outcomes The Stata Journal (2002) 2, Number 3, pp. 296 300 Least likely observations in regression models for categorical outcomes Jeremy Freese University of Wisconsin Madison Abstract. This article presents a

More information

War and Relatedness Enrico Spolaore and Romain Wacziarg September 2015

War and Relatedness Enrico Spolaore and Romain Wacziarg September 2015 War and Relatedness Enrico Spolaore and Romain Wacziarg September 2015 Online Appendix Supplementary Empirical Results, described in the main text as "Available in the Online Appendix" 1 Table AUR0 Effect

More information

1.4 - Linear Regression and MS Excel

1.4 - Linear Regression and MS Excel 1.4 - Linear Regression and MS Excel Regression is an analytic technique for determining the relationship between a dependent variable and an independent variable. When the two variables have a linear

More information

Political Science 15, Winter 2014 Final Review

Political Science 15, Winter 2014 Final Review Political Science 15, Winter 2014 Final Review The major topics covered in class are listed below. You should also take a look at the readings listed on the class website. Studying Politics Scientifically

More information

(CORRELATIONAL DESIGN AND COMPARATIVE DESIGN)

(CORRELATIONAL DESIGN AND COMPARATIVE DESIGN) UNIT 4 OTHER DESIGNS (CORRELATIONAL DESIGN AND COMPARATIVE DESIGN) Quasi Experimental Design Structure 4.0 Introduction 4.1 Objectives 4.2 Definition of Correlational Research Design 4.3 Types of Correlational

More information

RECOMMENDED CITATION: Pew Research Center, October 2014, Most Are Confident in Government s Ability to Prevent Major Ebola Outbreak in U.S.

RECOMMENDED CITATION: Pew Research Center, October 2014, Most Are Confident in Government s Ability to Prevent Major Ebola Outbreak in U.S. NUMBERS, FACTS AND TRENDS SHAPING THE WORLD FOR RELEASE OCTOBER 6, 2014 FOR FURTHER INFORMATION ON THIS REPORT: Carroll Doherty, Director of Political Research Alec Tyson, Senior Researcher Rachel Weisel,

More information

TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS)

TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS) TITLE: A Data-Driven Approach to Patient Risk Stratification for Acute Respiratory Distress Syndrome (ARDS) AUTHORS: Tejas Prahlad INTRODUCTION Acute Respiratory Distress Syndrome (ARDS) is a condition

More information

Use of GEEs in STATA

Use of GEEs in STATA Use of GEEs in STATA 1. When generalised estimating equations are used and example 2. Stata commands and options for GEEs 3. Results from Stata (and SAS!) 4. Another use of GEEs Use of GEEs GEEs are one

More information

A practical guide to measuring social structure using. indirectly observed network data

A practical guide to measuring social structure using. indirectly observed network data A practical guide to measuring social structure using indirectly observed network data Tyler H. McCormick Amal Moussa Johannes Ruf Thomas A. DiPrete Andrew Gelman Julien Teitler Tian Zheng Abstract Aggregated

More information

On-line Appendix Learning Together Slowly: Bayesian Learning About Political Facts Journal of Politics, 79 (4), 2017

On-line Appendix Learning Together Slowly: Bayesian Learning About Political Facts Journal of Politics, 79 (4), 2017 On-line Appendix Learning Together Slowly: Bayesian Learning About Political Facts Journal of Politics, 79 (4), 2017 Seth J. Hill University of California, San Diego August 7, 2017 Contents A Crossover

More information

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers SOCY5061 RELATIVE RISKS, RELATIVE ODDS, LOGISTIC REGRESSION RELATIVE RISKS: Suppose we are interested in the association between lung cancer and smoking. Consider the following table for the whole population:

More information

Survey of Young Americans Attitudes toward Politics and Public Service 30th Edition: June 21 July 3, 2016

Survey of Young Americans Attitudes toward Politics and Public Service 30th Edition: June 21 July 3, 2016 Survey of Young Americans Attitudes toward Politics and Public Service 30th Edition: June 21 July 3, 2016 N=1,001 18-29 Year Olds in English and Spanish (with GfK KnowledgePanel i ) Margin of Error: +/-

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Index. Springer International Publishing Switzerland 2017 T.J. Cleophas, A.H. Zwinderman, Modern Meta-Analysis, DOI /

Index. Springer International Publishing Switzerland 2017 T.J. Cleophas, A.H. Zwinderman, Modern Meta-Analysis, DOI / Index A Adjusted Heterogeneity without Overdispersion, 63 Agenda-driven bias, 40 Agenda-Driven Meta-Analyses, 306 307 Alternative Methods for diagnostic meta-analyses, 133 Antihypertensive effect of potassium,

More information

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Week 9 Hour 3 Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Stat 302 Notes. Week 9, Hour 3, Page 1 / 39 Stepwise Now that we've introduced interactions,

More information

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A

More information

Logistic regression: Why we often can do what we think we can do 1.

Logistic regression: Why we often can do what we think we can do 1. Logistic regression: Why we often can do what we think we can do 1. Augst 8 th 2015 Maarten L. Buis, University of Konstanz, Department of History and Sociology maarten.buis@uni.konstanz.de All propositions

More information

Ethnicity and Maternal Health Care Utilization in Nigeria: the Role of Diversity and Homogeneity

Ethnicity and Maternal Health Care Utilization in Nigeria: the Role of Diversity and Homogeneity Ethnicity and Maternal Health Care Utilization in Nigeria: the Role of Diversity and Homogeneity In spite of the significant improvements in the health of women worldwide, maternal mortality ratio has

More information

Ana Miriam Romo Anaya 1 English version: Gerardo Piña 1 INEGI, Abstract

Ana Miriam Romo Anaya 1 English version: Gerardo Piña 1 INEGI,   Abstract The development of multivariate econometric logit type models as a methodology for estimating the determinants in the measurement of subjective wellbeing in Mexico Ana Miriam Romo Anaya 1 English version:

More information

Midterm project due next Wednesday at 2 PM

Midterm project due next Wednesday at 2 PM Course Business Midterm project due next Wednesday at 2 PM Please submit on CourseWeb Next week s class: Discuss current use of mixed-effects models in the literature Short lecture on effect size & statistical

More information

Capability, Credibility, and Extended General Deterrence. Supplemental Material

Capability, Credibility, and Extended General Deterrence. Supplemental Material Capability, Credibility, and Extended General Deterrence Supplemental Material Jesse C. Johnson University of Kentucky Brett Ashley Leeds Rice University and Ahra Wu Rice University This document includes

More information

Statistical questions for statistical methods

Statistical questions for statistical methods Statistical questions for statistical methods Unpaired (two-sample) t-test DECIDE: Does the numerical outcome have a relationship with the categorical explanatory variable? Is the mean of the outcome the

More information

Chapter 3 CORRELATION AND REGRESSION

Chapter 3 CORRELATION AND REGRESSION CORRELATION AND REGRESSION TOPIC SLIDE Linear Regression Defined 2 Regression Equation 3 The Slope or b 4 The Y-Intercept or a 5 What Value of the Y-Variable Should be Predicted When r = 0? 7 The Regression

More information

Case Studies of Signed Networks

Case Studies of Signed Networks Case Studies of Signed Networks Christopher Wang December 10, 2014 Abstract Many studies on signed social networks focus on predicting the different relationships between users. However this prediction

More information

2 Assumptions of simple linear regression

2 Assumptions of simple linear regression Simple Linear Regression: Reliability of predictions Richard Buxton. 2008. 1 Introduction We often use regression models to make predictions. In Figure?? (a), we ve fitted a model relating a household

More information

Executive Summary Survey of Oregon Voters Oregon Voters Have Strong Support For Increasing the Cigarette Tax

Executive Summary Survey of Oregon Voters Oregon Voters Have Strong Support For Increasing the Cigarette Tax Executive Summary Survey of Oregon Voters Oregon Voters Have Strong Support For Increasing the Cigarette Tax Despite hesitation towards new tax increases, a strong majority of voters support an increase

More information

Predicting Breast Cancer Survival Using Treatment and Patient Factors

Predicting Breast Cancer Survival Using Treatment and Patient Factors Predicting Breast Cancer Survival Using Treatment and Patient Factors William Chen wchen808@stanford.edu Henry Wang hwang9@stanford.edu 1. Introduction Breast cancer is the leading type of cancer in women

More information

Small-area estimation of mental illness prevalence for schools

Small-area estimation of mental illness prevalence for schools Small-area estimation of mental illness prevalence for schools Fan Li 1 Alan Zaslavsky 2 1 Department of Statistical Science Duke University 2 Department of Health Care Policy Harvard Medical School March

More information

Basic Biostatistics. Chapter 1. Content

Basic Biostatistics. Chapter 1. Content Chapter 1 Basic Biostatistics Jamalludin Ab Rahman MD MPH Department of Community Medicine Kulliyyah of Medicine Content 2 Basic premises variables, level of measurements, probability distribution Descriptive

More information

Online Appendix. According to a recent survey, most economists expect the economic downturn in the United

Online Appendix. According to a recent survey, most economists expect the economic downturn in the United Online Appendix Part I: Text of Experimental Manipulations and Other Survey Items a. Macroeconomic Anxiety Prime According to a recent survey, most economists expect the economic downturn in the United

More information

14/07/2014. Disclosures and acknowledgements. Study Design (NCT ) Rationale for a QIV efficacy study in children

14/07/2014. Disclosures and acknowledgements. Study Design (NCT ) Rationale for a QIV efficacy study in children /7/ Disclosures and acknowledgements I am employed by the GlaxoSmithKline group of companies and I own stocks/options of the GlaxoSmithKline group of companies; my travel to this meeting was funded by

More information

Chapter Eight: Multivariate Analysis

Chapter Eight: Multivariate Analysis Chapter Eight: Multivariate Analysis Up until now, we have covered univariate ( one variable ) analysis and bivariate ( two variables ) analysis. We can also measure the simultaneous effects of two or

More information

Longitudinal and Hierarchical Analytic Strategies for OAI Data

Longitudinal and Hierarchical Analytic Strategies for OAI Data Longitudinal and Hierarchical Analytic Strategies for OAI Data Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and Biostatistics, UCSF OARSI Montreal September 10, 2009 Outline 1.

More information

Propensity Score Methods for Causal Inference with the PSMATCH Procedure

Propensity Score Methods for Causal Inference with the PSMATCH Procedure Paper SAS332-2017 Propensity Score Methods for Causal Inference with the PSMATCH Procedure Yang Yuan, Yiu-Fai Yung, and Maura Stokes, SAS Institute Inc. Abstract In a randomized study, subjects are randomly

More information

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Ho (null hypothesis) Ha (alternative hypothesis) Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Hypothesis: Ho:

More information

A Latent Variable Approach for the Construction of Continuous Health Indicators

A Latent Variable Approach for the Construction of Continuous Health Indicators A Latent Variable Approach for the Construction of Continuous Health Indicators David Conne and Maria-Pia Victoria-Feser No 2004.07 Cahiers du département d économétrie Faculté des sciences économiques

More information

Generalized Linear Models and Logistic Regression

Generalized Linear Models and Logistic Regression Generalized Linear Models and Logistic Regression Author: Nicholas G Reich, OpenIntro This material is part of the statsteachr project Made available under the Creative Commons Attribution-ShareAlike 3.0

More information

Practical Multivariate Analysis

Practical Multivariate Analysis Texts in Statistical Science Practical Multivariate Analysis Fifth Edition Abdelmonem Afifi Susanne May Virginia A. Clark CRC Press Taylor & Francis Group Boca Raton London New York CRC Press is an imprint

More information

Extending Rungie et al. s model of brand image stability to account for heterogeneity

Extending Rungie et al. s model of brand image stability to account for heterogeneity University of Wollongong Research Online Faculty of Commerce - Papers (Archive) Faculty of Business 2007 Extending Rungie et al. s model of brand image stability to account for heterogeneity Sara Dolnicar

More information

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Journal of Social and Development Sciences Vol. 4, No. 4, pp. 93-97, Apr 203 (ISSN 222-52) Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm Henry De-Graft Acquah University

More information

Modern Regression Methods

Modern Regression Methods Modern Regression Methods Second Edition THOMAS P. RYAN Acworth, Georgia WILEY A JOHN WILEY & SONS, INC. PUBLICATION Contents Preface 1. Introduction 1.1 Simple Linear Regression Model, 3 1.2 Uses of Regression

More information

A comparison of direct and indirect methods for the estimation of health utilities from clinical outcomes

A comparison of direct and indirect methods for the estimation of health utilities from clinical outcomes A comparison of direct and indirect methods for the estimation of health utilities from clinical outcomes Mónica Hernández Alava, Allan Wailoo, Fred Wolfe and Kaleb Michaud 2 Introduction Mapping (or cross-walking

More information

Statistics: A Brief Overview Part I. Katherine Shaver, M.S. Biostatistician Carilion Clinic

Statistics: A Brief Overview Part I. Katherine Shaver, M.S. Biostatistician Carilion Clinic Statistics: A Brief Overview Part I Katherine Shaver, M.S. Biostatistician Carilion Clinic Statistics: A Brief Overview Course Objectives Upon completion of the course, you will be able to: Distinguish

More information

Modest Rise in Percentage Favoring General Legalization BROAD PUBLIC SUPPORT FOR LEGALIZING MEDICAL MARIJUANA

Modest Rise in Percentage Favoring General Legalization BROAD PUBLIC SUPPORT FOR LEGALIZING MEDICAL MARIJUANA NEWS Release 1615 L Street, N.W., Suite 700 Washington, D.C. 20036 Tel (202) 419-4350 Fax (202) 419-4399 FOR IMMEDIATE RELEASE: Thursday April 1, 2010 Modest Rise in Percentage Favoring General Legalization

More information