General Example: Gas Mileage (Stat 5044 Schabenberger & J.P.Morgen)

Size: px
Start display at page:

Download "General Example: Gas Mileage (Stat 5044 Schabenberger & J.P.Morgen)"

Transcription

1 General Example: Gas Mileage (Stat 5044 Schabenberger & J.P.Morgen) From Motor Trend magazine data were obtained for n=32 cars on the following variables: Y= Gas Mileage (miles per gallon, MPG) X1= Engine size in cubic inches (SIZE). X2= Horsepower (HP). X3= Weight in pounds (Weight). X4= Engine shape (1=straight, 0= V, ShAPE). X5= Number of cylinders (CYLIN). X6= Transmission type (1= manual, 0= automatic, TRANS). X7= Number of forward gears (GEARS). (1) If we were to perform a SLR of MPG on TRANS, the slope of the fitted Regression line estimates the difference in the mean mileage of cars with (TRANS=1) and without (TRANS=0) manual transmission. Model <.0001 Error Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept <.0001 trans <.0001 A manual transmission increases the gas mileage by mile per gallon on average. This increase is significant. However, we know that the cars in the two groups are different with respect to other characteristics, not only the transmission type. Does having a manual transmission still improve the gas mileage if, e.g., the car engine size and horsepower are taken into account?? To answer this question, we need to fit a model with three variables simultaneously, A multiple Regression Model.

2 Model <.0001 Error Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept <.0001 size hp trans The Correlation between the response and the seven regressors in our example is as follow: The SAS System 1 The CORR Procedure 1 With Variables: mpg 7 Variables: size hp weight shape cylin trans gears Pearson Correlation Coefficients, N = 32 Prob > r under H0: Rho=0 size hp weight shape cylin trans gears mpg <.0001 The Correlation among the seven regressors in our example is as follow: The CORR Procedure 7 Variables: size hp weight shape cylin trans gears Pearson Correlation Coefficients, N = 32 Prob > r under H0: Rho=0 size hp weight shape cylin trans gears size hp weight shape cylin trans

3 gears This indicator for A Multicolinearity problem Correlations among predictors resulting in an increase in variance Reduces the significance value of the variable Occurs when several predictor variables are used in the model Model <.0001 Error Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept size hp weight shape cylin trans gears Model Selection 1- Forward Selection Forward Selection: Step 1 Variable size Entered: R-Square = and C(p) = Model <.0001 Error Parameter Standard

4 Intercept <.0001 size <.0001 Bounds on condition number: 1, 1 Forward Selection: Step 2 Variable trans Entered: R-Square = and C(p) = Model <.0001 Error Intercept <.0001 size <.0001 trans Bounds on condition number: , Forward Selection: Step 3 The SAS System 11 Forward Selection: Step 3 Variable weight Entered: R-Square = and C(p) = Model <.0001 Error Intercept <.0001 size weight trans Bounds on condition number: , Forward Selection: Step 4 Variable cylin Entered: R-Square = and C(p) = Model <.0001 Error Intercept <.0001 size weight cylin trans

5 Bounds on condition number: , Forward Selection: Step 5 The SAS System 12 Forward Selection: Step 5 Variable hp Entered: R-Square = and C(p) = Model <.0001 Error Intercept <.0001 size hp weight cylin trans Bounds on condition number: , No other variable met the significance level for entry into the model. Summary of Forward Selection Variable Number Partial Model Step Entered Vars In R-Square R-Square C(p) F Value Pr > F 1 size < trans weight cylin hp Backward Elimination Backward Elimination: Step 0 All Variables Entered: R-Square = and C(p) = Model <.0001 Error Intercept size hp weight

6 shape cylin trans gears Bounds on condition number: , Backward Elimination: Step 1 Variable trans Removed: R-Square = and C(p) = Model <.0001 Error The SAS System 14 Backward Elimination: Step 1 Intercept size hp weight shape cylin gears Bounds on condition number: , Backward Elimination: Step 2 Variable shape Removed: R-Square = and C(p) = Model <.0001 Error Intercept <.0001 size hp weight cylin gears Bounds on condition number: , All variables left in the model are significant at the level. The SAS System 15 Summary of Backward Elimination

7 Variable Number Partial Model Step Removed Vars In R-Square R-Square C(p) F Value Pr > F 1 trans shape Stepwise Selection Stepwise Selection: Step 1 Variable size Entered: R-Square = and C(p) = Model <.0001 Error Intercept <.0001 size <.0001 Bounds on condition number: 1, 1 Stepwise Selection: Step 2 Variable trans Entered: R-Square = and C(p) = Model <.0001 Error Intercept <.0001 size <.0001 trans Bounds on condition number: , Stepwise Selection: Step 3 The SAS System 17 Stepwise Selection: Step 3 Variable weight Entered: R-Square = and C(p) = Model <.0001 Error

8 Intercept <.0001 size weight trans Bounds on condition number: , Stepwise Selection: Step 4 Variable size Removed: R-Square = and C(p) = Model <.0001 Error Intercept <.0001 weight <.0001 trans Bounds on condition number: , Stepwise Selection: Step 5 The SAS System 18 Stepwise Selection: Step 5 Variable shape Entered: R-Square = and C(p) = Model <.0001 Error Intercept <.0001 weight shape trans Bounds on condition number: , All variables left in the model are significant at the level. No other variable met the significance level for entry into the model. Summary of Stepwise Selection Variable Variable Number Partial Model Step Entered Removed Vars In R-Square R-Square C(p) F Value Pr > F 1 size < trans weight size shape

9 Model Comparison Reduced Model Model <.0001 Error Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept <.0001 size <.0001 trans Full Model Model <.0001 Error Root MSE R-Square Dependent Adj R-Sq Coeff Var Parameter Estimates Variable DF Estimate Error t Value Pr > t Intercept size hp weight shape cylin trans gears

10 F-TEST Test 1 Results for Dependent Variable mpg Source DF Square F Value Pr > F Numerator Denominator SAS Code data Gasmileage; input mpg size hp weight shape cylin trans gears; datalines; ;;; model mpg=trans; model mpg=size hp trans; proc corr data=gasmileage nosimple; var size hp weight shape cylin trans gears; with mpg; proc corr data=gasmileage nosimple; var size hp weight shape cylin trans gears; model mpg=size hp weight shape cylin trans gears; model mpg=size hp weight shape cylin trans gears/ ss1 ss2; /* model mpg=size hp weight shape cylin trans gears/ selection=rsquare cp; */ model mpg=size hp weight shape cylin trans gears/ selection=forward slentry=.25; model mpg=size hp weight shape cylin trans gears/ selection=backward slstay=.25; model mpg=size hp weight shape cylin trans gears/ selection=stepwise slstay=.25 slentry=.25;

11 model mpg=size trans; model mpg=size hp weight shape cylin trans gears; test weight=0, shape=0, cylin=0, hp=0, gears=0;

HZAU MULTIVARIATE HOMEWORK #2 MULTIPLE AND STEPWISE LINEAR REGRESSION

HZAU MULTIVARIATE HOMEWORK #2 MULTIPLE AND STEPWISE LINEAR REGRESSION HZAU MULTIVARIATE HOMEWORK #2 MULTIPLE AND STEPWISE LINEAR REGRESSION Using the malt quality dataset on the class s Web page: 1. Determine the simple linear correlation of extract with the remaining variables.

More information

Use the above variables and any you might need to construct to specify the MODEL A/C comparisons you would use to ask the following questions.

Use the above variables and any you might need to construct to specify the MODEL A/C comparisons you would use to ask the following questions. Fall, 2002 Grad Stats Final Exam There are four questions on this exam, A through D, and each question has multiple sub-questions. Unless otherwise indicated, each sub-question is worth 3 points. Question

More information

Psych 5741/5751: Data Analysis University of Boulder Gary McClelland & Charles Judd. Exam #2, Spring 1992

Psych 5741/5751: Data Analysis University of Boulder Gary McClelland & Charles Judd. Exam #2, Spring 1992 Exam #2, Spring 1992 Question 1 A group of researchers from a neurobehavioral institute are interested in the relationships that have been found between the amount of cerebral blood flow (CB FLOW) to the

More information

4Stat Wk 10: Regression

4Stat Wk 10: Regression 4Stat 342 - Wk 10: Regression Loading data with datalines Regression (Proc glm) - with interactions - with polynomial terms - with categorical variables (Proc glmselect) - with model selection (this is

More information

The Association Design and a Continuous Phenotype

The Association Design and a Continuous Phenotype PSYC 5102: Association Design & Continuous Phenotypes (4/4/07) 1 The Association Design and a Continuous Phenotype The purpose of this note is to demonstrate how to perform a population-based association

More information

Multiple Linear Regression Analysis

Multiple Linear Regression Analysis Revised July 2018 Multiple Linear Regression Analysis This set of notes shows how to use Stata in multiple regression analysis. It assumes that you have set Stata up on your computer (see the Getting Started

More information

MULTIPLE REGRESSION OF CPS DATA

MULTIPLE REGRESSION OF CPS DATA MULTIPLE REGRESSION OF CPS DATA A further inspection of the relationship between hourly wages and education level can show whether other factors, such as gender and work experience, influence wages. Linear

More information

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Ho (null hypothesis) Ha (alternative hypothesis) Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Hypothesis: Ho:

More information

Data Science and Statistics in Research: unlocking the power of your data

Data Science and Statistics in Research: unlocking the power of your data Data Science and Statistics in Research: unlocking the power of your data Session 1.4: Data and variables 1/ 33 OUTLINE Types of data Types of variables Presentation of data Tables Summarising Data 2/

More information

ANOVA. Thomas Elliott. January 29, 2013

ANOVA. Thomas Elliott. January 29, 2013 ANOVA Thomas Elliott January 29, 2013 ANOVA stands for analysis of variance and is one of the basic statistical tests we can use to find relationships between two or more variables. ANOVA compares the

More information

Effects of Nutrients on Shrimp Growth

Effects of Nutrients on Shrimp Growth Data Set 5: Effects of Nutrients on Shrimp Growth Statistical setting This Handout is an example of extreme collinearity of the independent variables, and of the methods used for diagnosing this problem.

More information

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized)

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized) Criminal Justice Doctoral Comprehensive Exam Statistics August 2016 There are two questions on this exam. Be sure to answer both questions in the 3 and half hours to complete this exam. Read the instructions

More information

Notes for laboratory session 2

Notes for laboratory session 2 Notes for laboratory session 2 Preliminaries Consider the ordinary least-squares (OLS) regression of alcohol (alcohol) and plasma retinol (retplasm). We do this with STATA as follows:. reg retplasm alcohol

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression Assoc. Prof Dr Sarimah Abdullah Unit of Biostatistics & Research Methodology School of Medical Sciences, Health Campus Universiti Sains Malaysia Regression Regression analysis

More information

Introduction to regression

Introduction to regression Introduction to regression Regression describes how one variable (response) depends on another variable (explanatory variable). Response variable: variable of interest, measures the outcome of a study

More information

Data Analysis in Practice-Based Research. Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine

Data Analysis in Practice-Based Research. Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine Data Analysis in Practice-Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine Multilevel Data Statistical analyses that fail to recognize

More information

STAT Factor Analysis in SAS

STAT Factor Analysis in SAS STAT 5600 Factor Analysis in SAS The data for this example come from the decathlon results in the 1988 Olympics. The decathlon is a two-day competition, with the 100 m race, long jump, shot put, high jump,

More information

Effect of Source and Level of Protein on Weight Gain of Rats

Effect of Source and Level of Protein on Weight Gain of Rats Effect of Source and Level of Protein on of Rats 1 * two factor analysis of variance with interaction; 2 option ls=120 ps=75 nocenter nodate; 3 4 title Effect of Source of Protein and Level of Protein

More information

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests

More information

Chapter 9. Factorial ANOVA with Two Between-Group Factors 10/22/ Factorial ANOVA with Two Between-Group Factors

Chapter 9. Factorial ANOVA with Two Between-Group Factors 10/22/ Factorial ANOVA with Two Between-Group Factors Chapter 9 Factorial ANOVA with Two Between-Group Factors 10/22/2001 1 Factorial ANOVA with Two Between-Group Factors Recall that in one-way ANOVA we study the relation between one criterion variable and

More information

All Possible Regressions Using IBM SPSS: A Practitioner s Guide to Automatic Linear Modeling

All Possible Regressions Using IBM SPSS: A Practitioner s Guide to Automatic Linear Modeling Georgia Southern University Digital Commons@Georgia Southern Georgia Educational Research Association Conference Oct 7th, 1:45 PM - 3:00 PM All Possible Regressions Using IBM SPSS: A Practitioner s Guide

More information

M15_BERE8380_12_SE_C15.6.qxd 2/21/11 8:21 PM Page Influence Analysis 1

M15_BERE8380_12_SE_C15.6.qxd 2/21/11 8:21 PM Page Influence Analysis 1 M15_BERE8380_12_SE_C15.6.qxd 2/21/11 8:21 PM Page 1 15.6 Influence Analysis FIGURE 15.16 Minitab worksheet containing computed values for the Studentized deleted residuals, the hat matrix elements, and

More information

Hungry Mice. NP: Mice in this group ate as much as they pleased of a non-purified, standard diet for laboratory mice.

Hungry Mice. NP: Mice in this group ate as much as they pleased of a non-purified, standard diet for laboratory mice. Hungry Mice When laboratory mice (and maybe other animals) are fed a nutritionally adequate but near-starvation diet, they may live longer on average than mice that eat a normal amount of food. In this

More information

Using SAS to Conduct Pilot Studies: An Instructors Guide

Using SAS to Conduct Pilot Studies: An Instructors Guide Using SAS to Conduct Pilot Studies: An Instructors Guide Sean W. Mulvenon, University of Arkansas, Fayetteville, AR Ronna C. Turner, University of Arkansas, Fayetteville, AR ABSTRACT An important component

More information

Moderated Mediation Analysis Andrew Hayes, Ph.D. Upcoming Seminar: October 27-28, 2017, Philadelphia, PA

Moderated Mediation Analysis Andrew Hayes, Ph.D. Upcoming Seminar: October 27-28, 2017, Philadelphia, PA oderated ediation Analysis Andrew Hayes, Ph.D. Upcoming Seminar: October 27-28, 2017, Philadelphia, PA Two moderators of the same path A first stage dual moderated mediation model A second stage dual moderated

More information

Bangor University Laboratory Exercise 1, June 2008

Bangor University Laboratory Exercise 1, June 2008 Laboratory Exercise, June 2008 Classroom Exercise A forest land owner measures the outside bark diameters at.30 m above ground (called diameter at breast height or dbh) and total tree height from ground

More information

Class 7 Everything is Related

Class 7 Everything is Related Class 7 Everything is Related Correlational Designs l 1 Topics Types of Correlational Designs Understanding Correlation Reporting Correlational Statistics Quantitative Designs l 2 Types of Correlational

More information

7) Briefly explain why a large value of r 2 is desirable in a regression setting.

7) Briefly explain why a large value of r 2 is desirable in a regression setting. Directions: Complete each problem. A complete problem has not only the answer, but the solution and reasoning behind that answer. All work must be submitted on separate pieces of paper. 1) Manatees are

More information

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS

CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS - CLASSICAL AND. MODERN REGRESSION WITH APPLICATIONS SECOND EDITION Raymond H. Myers Virginia Polytechnic Institute and State university 1 ~l~~l~l~~~~~~~l!~ ~~~~~l~/ll~~ Donated by Duxbury o Thomson Learning,,

More information

CHAPTER TWO REGRESSION

CHAPTER TWO REGRESSION CHAPTER TWO REGRESSION 2.0 Introduction The second chapter, Regression analysis is an extension of correlation. The aim of the discussion of exercises is to enhance students capability to assess the effect

More information

SPSS output for 420 midterm study

SPSS output for 420 midterm study Ψ Psy Midterm Part In lab (5 points total) Your professor decides that he wants to find out how much impact amount of study time has on the first midterm. He randomly assigns students to study for hours,

More information

WELCOME! Lecture 11 Thommy Perlinger

WELCOME! Lecture 11 Thommy Perlinger Quantitative Methods II WELCOME! Lecture 11 Thommy Perlinger Regression based on violated assumptions If any of the assumptions are violated, potential inaccuracies may be present in the estimated regression

More information

HW 3.2: page 193 #35-51 odd, 55, odd, 69, 71-78

HW 3.2: page 193 #35-51 odd, 55, odd, 69, 71-78 35. What s My Line? You use the same bar of soap to shower each morning. The bar weighs 80 grams when it is new. Its weight goes down by 6 grams per day on average. What is the equation of the regression

More information

Midterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do.

Midterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do. Midterm STAT-UB.0003 Regression and Forecasting Models The exam is closed book and notes, with the following exception: you are allowed to bring one letter-sized page of notes into the exam (front and

More information

NEUROBLASTOMA DATA -- TWO GROUPS -- QUANTITATIVE MEASURES 38 15:37 Saturday, January 25, 2003

NEUROBLASTOMA DATA -- TWO GROUPS -- QUANTITATIVE MEASURES 38 15:37 Saturday, January 25, 2003 NEUROBLASTOMA DATA -- TWO GROUPS -- QUANTITATIVE MEASURES 38 15:37 Saturday, January 25, 2003 Obs GROUP I DOPA LNDOPA 1 neurblst 1 48.000 1.68124 2 neurblst 1 133.000 2.12385 3 neurblst 1 34.000 1.53148

More information

1. Objective: analyzing CD4 counts data using GEE marginal model and random effects model. Demonstrate the analysis using SAS and STATA.

1. Objective: analyzing CD4 counts data using GEE marginal model and random effects model. Demonstrate the analysis using SAS and STATA. LDA lab Feb, 6 th, 2002 1 1. Objective: analyzing CD4 counts data using GEE marginal model and random effects model. Demonstrate the analysis using SAS and STATA. 2. Scientific question: estimate the average

More information

Manuscript Presentation: Writing up APIM Results

Manuscript Presentation: Writing up APIM Results Manuscript Presentation: Writing up APIM Results Example Articles Distinguishable Dyads Chung, M. L., Moser, D. K., Lennie, T. A., & Rayens, M. (2009). The effects of depressive symptoms and anxiety on

More information

Between-Person and Within-Person Effects of Negative Mood Predicting Next-Morning Glucose

Between-Person and Within-Person Effects of Negative Mood Predicting Next-Morning Glucose CLP 945 Example 4b page 1 Between-Person and Within-Person Effects of Negative Mood Predicting Next-Morning Glucose These data were simulated loosely based on real data reported in the citation below.

More information

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies.

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies. Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam May 28 th, 2015: 9am to 1pm Instructions: 1. There are seven questions and 12 pages. 2. Read each question carefully. Answer

More information

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph. STAT 280 Sample Test Problems Page 1 of 1 1. An English survey of 3000 medical records showed that smokers are more inclined to get depressed than non-smokers. Does this imply that smoking causes depression?

More information

NORTH SOUTH UNIVERSITY TUTORIAL 2

NORTH SOUTH UNIVERSITY TUTORIAL 2 NORTH SOUTH UNIVERSITY TUTORIAL 2 AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 Correlation Analysis INTRODUCTION In correlation analysis, we estimate

More information

Lab 5a Exploring Correlation

Lab 5a Exploring Correlation Lab 5a Exploring Correlation The correlation coefficient measures how tightly the points on a scatterplot cluster around a line. In this lab we will examine scatterplots and correlation coefficients for

More information

A macro of building predictive model in PROC LOGISTIC with AIC-optimal variable selection embedded in cross-validation

A macro of building predictive model in PROC LOGISTIC with AIC-optimal variable selection embedded in cross-validation SESUG Paper AD-36-2017 A macro of building predictive model in PROC LOGISTIC with AIC-optimal variable selection embedded in cross-validation Hongmei Yang, Andréa Maslow, Carolinas Healthcare System. ABSTRACT

More information

SPSS output for 420 midterm study

SPSS output for 420 midterm study Ψ Psy Midterm Part In lab (5 points total) Your professor decides that he wants to find out how much impact amount of study time has on the first midterm. He randomly assigns students to study for hours,

More information

Answer all three questions. All questions carry equal marks.

Answer all three questions. All questions carry equal marks. UNIVERSITY OF DUBLIN TRINITY COLLEGE Faculty of Engineering, Mathematics and Science School of Computer Science and Statistics Postgraduate Diploma in Statistics Trinity Term 2 Introduction to Regression

More information

Chapter 6 Measures of Bivariate Association 1

Chapter 6 Measures of Bivariate Association 1 Chapter 6 Measures of Bivariate Association 1 A bivariate relationship involves relationship between two variables. Examples: Relationship between GPA and SAT score Relationship between height and weight

More information

Between-Person and Within-Person Effects of Negative Mood Predicting Next-Morning Glucose COMPLETED VERSION

Between-Person and Within-Person Effects of Negative Mood Predicting Next-Morning Glucose COMPLETED VERSION Psyc 944 Example 9b page 1 Between-Person and Within-Person Effects of Negative Mood Predicting Next-Morning Glucose COMPLETED VERSION These data were simulated loosely based on real data reported in the

More information

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES 24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter

More information

Regression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions

Regression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 10 Correlation and Regression 10 1 Review and Preview 10 2 Correlation 10 3 Regression 10 4 Variation and Prediction Intervals

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis Basic Concept: Extend the simple regression model to include additional explanatory variables: Y = β 0 + β1x1 + β2x2 +... + βp-1xp + ε p = (number of independent variables

More information

Quick Selections. How to Use Service Factor. Worm Gear Reducers. Single Reduction

Quick Selections. How to Use Service Factor.  Worm Gear Reducers. Single Reduction Quick Selections How to Use Based on required output RPM and input motor horsepower, read across chart for the appropriate reducer size. As a rule of thumb, use 1.00 service factor chart for applications

More information

Correlation & Regression Exercises Chapters 14-15

Correlation & Regression Exercises Chapters 14-15 Correlation & Regression Exercises Chapters 14-15 1. Which of these are true and which are false? Explain why the false statements are wrong. a. If the slope of the line is 1, then the correlation must

More information

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Statistics as a Tool A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Descriptive Statistics Numerical facts or observations that are organized describe

More information

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Activity 1 Examining Data From Class Background Download

More information

Attachment E, Exhibit 1 Los Vaqueros Reservoir Recreation Model

Attachment E, Exhibit 1 Los Vaqueros Reservoir Recreation Model Attachment E, Exhibit 1 Los Vaqueros Reservoir Recreation Model Proposition 1 February 2018 This page is intentionally left blank. LOS VAQUEROS RESERVOIR EXISTING DAY-USE FACILITIES Parking Equestrian/

More information

2. Scientific question: Determine whether there is a difference between boys and girls with respect to the distance and its change over time.

2. Scientific question: Determine whether there is a difference between boys and girls with respect to the distance and its change over time. LDA lab Feb, 11 th, 2002 1 1. Objective:analyzing dental data using ordinary least square (OLS) and Generalized Least Square(GLS) in STATA. 2. Scientific question: Determine whether there is a difference

More information

Multiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Multiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University Multiple Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Multiple Regression 1 / 19 Multiple Regression 1 The Multiple

More information

Measuring Regressor Information Content In The Presence of Collinearity

Measuring Regressor Information Content In The Presence of Collinearity Measuring Regressor Information Content In The Presence of Collinearity Joseph Retzer, Market Probe, Inc., Milwaukee WI Kurt Pflughoeft, University of Texas, EI Paso TX Abstract Design matrix collinearity

More information

Answer Key to Problem Set #1

Answer Key to Problem Set #1 Answer Key to Problem Set #1 Two notes: q#4e: Please disregard q#5e: The frequency tables of the total CESD scales of 94, 96 and 98 in question 5e should sum up to 328 observation not 924 (the student

More information

REGRESSION ANALYSIS ON THE COMPUTER. Dependent variable is %BF. Variable Coefficient Constant waist SE(b 0.

REGRESSION ANALYSIS ON THE COMPUTER. Dependent variable is %BF. Variable Coefficient Constant waist SE(b 0. 672 CHAPER 27 Inferences for Regression REGRESSION ANALYSIS ON HE COMPUER All statistics packages make a table of results for a regression. hese tables differ slightly from one package to another, but

More information

This tutorial presentation is prepared by. Mohammad Ehsanul Karim

This tutorial presentation is prepared by. Mohammad Ehsanul Karim STATA: The Red tutorial STATA: The Red tutorial This tutorial presentation is prepared by Mohammad Ehsanul Karim ehsan.karim@gmail.com STATA: The Red tutorial This tutorial presentation is prepared by

More information

Predicting Relations between Variables

Predicting Relations between Variables Predicting Relations between Variables Where we have been Hypotheses: conjectures about the way some phenomenon behaves Need to be tested by predictions Can be stated in terms of variables Variables: things

More information

Introduction billion gallons of ethanol were produced in the U.S. during 2009.

Introduction billion gallons of ethanol were produced in the U.S. during 2009. Introduction 10.6 billion gallons of ethanol were produced in the U.S. during 2009. (Renewable Fuels Association) As corn-milling technologies increase efficiency, corn co-products continue to evolve.

More information

REGRESSION ANALYSIS ON THE COMPUTER. Dependent variable is %BF. Variable Constant waist. Coefficient SE(b 0 ) b 0 SE(b 0 )

REGRESSION ANALYSIS ON THE COMPUTER. Dependent variable is %BF. Variable Constant waist. Coefficient SE(b 0 ) b 0 SE(b 0 ) 672 CHAPER 27 Inferences for Regression REGRESSION ANALYSIS ON HE COMPUER All statistics packages make a table of results for a regression. hese tables differ slightly from one package to another, but

More information

LOGLINK Example #1. SUDAAN Statements and Results Illustrated. Input Data Set(s): EPIL.SAS7bdat ( Thall and Vail (1990)) Example.

LOGLINK Example #1. SUDAAN Statements and Results Illustrated. Input Data Set(s): EPIL.SAS7bdat ( Thall and Vail (1990)) Example. LOGLINK Example #1 SUDAAN Statements and Results Illustrated Log-linear regression modeling MODEL TEST SUBPOPN EFFECTS Input Data Set(s): EPIL.SAS7bdat ( Thall and Vail (1990)) Example Use the Epileptic

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Objectives: Students will: Construct and interpret a scatterplot for a set of bivariate data. Compute and interpret the correlation, r, between two variables. Demonstrate

More information

Dr. Kelly Bradley Final Exam Summer {2 points} Name

Dr. Kelly Bradley Final Exam Summer {2 points} Name {2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.

More information

Homework Linear Regression Problems should be worked out in your notebook

Homework Linear Regression Problems should be worked out in your notebook Homework Linear Regression Problems should be worked out in your notebook 1. Following are the mean heights of Kalama children: Age (months) 18 19 20 21 22 23 24 25 26 27 28 29 Height (cm) 76.1 77.0 78.1

More information

ID# Final Exam PS 217, Spring 2002 (You must use your Skidmore ID#!)

ID# Final Exam PS 217, Spring 2002 (You must use your Skidmore ID#!) ID# Final Exam PS 217, Spring 2002 (You must use your Skidmore ID#!) OK, here s the last stats exam the one that you ve been anxiously awaiting. As always, you should adhere to the Skidmore Honor Code.

More information

Chapter 10: Moderation, mediation and more regression

Chapter 10: Moderation, mediation and more regression Chapter 10: Moderation, mediation and more regression Smart Alex s Solutions Task 1 McNulty et al. (2008) found a relationship between a person s Attractiveness and how much Support they give their partner

More information

Math 124: Module 2, Part II

Math 124: Module 2, Part II , Part II David Meredith Department of Mathematics San Francisco State University September 15, 2009 What we will do today 1 Explanatory and Response Variables When you study the relationship between two

More information

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography

More information

MODEL I: DRINK REGRESSED ON GPA & MALE, WITHOUT CENTERING

MODEL I: DRINK REGRESSED ON GPA & MALE, WITHOUT CENTERING Interpreting Interaction Effects; Interaction Effects and Centering Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 20, 2015 Models with interaction effects

More information

1. You want to find out what factors predict achievement in English. Develop a model that

1. You want to find out what factors predict achievement in English. Develop a model that Questions and answers for Chapter 10 1. You want to find out what factors predict achievement in English. Develop a model that you think can explain this. As usual many alternative predictors are possible

More information

Regression models, R solution day7

Regression models, R solution day7 Regression models, R solution day7 Exercise 1 In this exercise, we shall look at the differences in vitamin D status for women in 4 European countries Read and prepare the data: vit

More information

Name MATH0021Final Exam REVIEW UPDATED 8/18. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Name MATH0021Final Exam REVIEW UPDATED 8/18. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Name MATH0021Final Exam REVIEW UPDATED 8/18 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Solve the equation. 1) 6x - (3x - 1) = 2 1) A) - 1 3 B)

More information

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression! Equation of Regression Line; Residuals! Effect of Explanatory/Response Roles! Unusual Observations! Sample

More information

Correlation and regression

Correlation and regression PG Dip in High Intensity Psychological Interventions Correlation and regression Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/ Correlation Example: Muscle strength

More information

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES CHAPTER SIXTEEN Regression NOTE TO INSTRUCTORS This chapter includes a number of complex concepts that may seem intimidating to students. Encourage students to focus on the big picture through some of

More information

Correlation and Regression

Correlation and Regression Dublin Institute of Technology ARROW@DIT Books/Book Chapters School of Management 2012-10 Correlation and Regression Donal O'Brien Dublin Institute of Technology, donal.obrien@dit.ie Pamela Sharkey Scott

More information

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India 20th International Congress on Modelling and Simulation, Adelaide, Australia, 1 6 December 2013 www.mssanz.org.au/modsim2013 Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision

More information

Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction

Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction In this exercise, we will gain experience assessing scatterplots in regression and

More information

Study Guide #2: MULTIPLE REGRESSION in education

Study Guide #2: MULTIPLE REGRESSION in education Study Guide #2: MULTIPLE REGRESSION in education What is Multiple Regression? When using Multiple Regression in education, researchers use the term independent variables to identify those variables that

More information

manipulation influences other variables, the researcher is conducting a(n)

manipulation influences other variables, the researcher is conducting a(n) Math 1342 Finals Review Selective Name SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 1) If a researcher manipulates one of the variables and tries to

More information

10. LINEAR REGRESSION AND CORRELATION

10. LINEAR REGRESSION AND CORRELATION 1 10. LINEAR REGRESSION AND CORRELATION The contingency table describes an association between two nominal (categorical) variables (e.g., use of supplemental oxygen and mountaineer survival ). We have

More information

Motor fitness variables as predictors in Kho-Kho playing ability among women

Motor fitness variables as predictors in Kho-Kho playing ability among women Available online at: http://euroasiapub.org, pp. 160~165 Motor fitness variables as predictors in Kho-Kho playing ability among women Dr. Mahender Singh 1, Joint Director Sports Department Chandigarh Chander

More information

Final Exam - section 2. Thursday, December hours, 30 minutes

Final Exam - section 2. Thursday, December hours, 30 minutes Econometrics, ECON312 San Francisco State University Michael Bar Fall 2011 Final Exam - section 2 Thursday, December 15 2 hours, 30 minutes Name: Instructions 1. This is closed book, closed notes exam.

More information

3.2A Least-Squares Regression

3.2A Least-Squares Regression 3.2A Least-Squares Regression Linear (straight-line) relationships between two quantitative variables are pretty common and easy to understand. Our instinct when looking at a scatterplot of data is to

More information

Regression. Page 1. Variables Entered/Removed b Variables. Variables Removed. Enter. Method. Psycho_Dum

Regression. Page 1. Variables Entered/Removed b Variables. Variables Removed. Enter. Method. Psycho_Dum Regression Model Variables Entered/Removed b Variables Entered Variables Removed Method Meds_Dum,. Enter Psycho_Dum a. All requested variables entered. b. Dependent Variable: Beck's Depression Score Model

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multiple Regression (MR) Types of MR Assumptions of MR SPSS procedure of MR Example based on prison data Interpretation of

More information

STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points.

STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points. STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points. For Questions 1 & 2: It is known that the distribution of starting salaries for MSU Education majors has

More information

Math 215, Lab 7: 5/23/2007

Math 215, Lab 7: 5/23/2007 Math 215, Lab 7: 5/23/2007 (1) Parametric versus Nonparamteric Bootstrap. Parametric Bootstrap: (Davison and Hinkley, 1997) The data below are 12 times between failures of airconditioning equipment in

More information

AP Statistics Practice Test Ch. 3 and Previous

AP Statistics Practice Test Ch. 3 and Previous AP Statistics Practice Test Ch. 3 and Previous Name Date Use the following to answer questions 1 and 2: A researcher measures the height (in feet) and volume of usable lumber (in cubic feet) of 32 cherry

More information

isc ove ring i Statistics sing SPSS

isc ove ring i Statistics sing SPSS isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements

More information

ROC Curves. I wrote, from SAS, the relevant data to a plain text file which I imported to SPSS. The ROC analysis was conducted this way:

ROC Curves. I wrote, from SAS, the relevant data to a plain text file which I imported to SPSS. The ROC analysis was conducted this way: ROC Curves We developed a method to make diagnoses of anxiety using criteria provided by Phillip. Would it also be possible to make such diagnoses based on a much more simple scheme, a simple cutoff point

More information

Stats for Clinical Trials, Math 150 Jo Hardin Logistic Regression example: interaction & stepwise regression

Stats for Clinical Trials, Math 150 Jo Hardin Logistic Regression example: interaction & stepwise regression Stats for Clinical Trials, Math 150 Jo Hardin Logistic Regression example: interaction & stepwise regression Interaction Consider data is from the Heart and Estrogen/Progestin Study (HERS), a clinical

More information

SAS Data Setup: SPSS Data Setup: STATA Data Setup: Hoffman ICPSR Example 5 page 1

SAS Data Setup: SPSS Data Setup: STATA Data Setup: Hoffman ICPSR Example 5 page 1 Hoffman ICPSR Example 5 page 1 Example 5: Examining BP and WP Effects of Negative Mood Predicting Next-Morning Glucose (complete data, syntax, and output available for SAS, SPSS, and STATA electronically)

More information

CHAPTER ONE CORRELATION

CHAPTER ONE CORRELATION CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to

More information

STAT362 Homework Assignment 5

STAT362 Homework Assignment 5 STAT362 Homework Assignment 5 Sharon O Boyle Problem 1, Problem 3.6, p. 117 SAS Program * Sharon O'Boyle; * Stat 362; * Homework Assignment 5; * Problem 3.6, p. 117; * Program to compute Odds Ratio and

More information

THE UNIVERSITY OF SUSSEX. BSc Second Year Examination DISCOVERING STATISTICS SAMPLE PAPER INSTRUCTIONS

THE UNIVERSITY OF SUSSEX. BSc Second Year Examination DISCOVERING STATISTICS SAMPLE PAPER INSTRUCTIONS C8552 THE UNIVERSITY OF SUSSEX BSc Second Year Examination DISCOVERING STATISTICS SAMPLE PAPER INSTRUCTIONS Do not, under any circumstances, remove the question paper, used or unused, from the examination

More information