Biology 345: Biometry, Fall 2005
SONOMA STATE UNIVERSITY
Lab Exercise 5: Residuals and multiple regression

Introduction


In this exercise, we will gain experience assessing scatterplots in regression and work through examples that illustrate the principles of multiple regression.

Objectives

- Learn how to assess scatterplots and residuals in a single linear regression.
- Use an example of multiple regression with no intercorrelation among X variables to illustrate principles of multiple regression.
- Use an example of multiple regression with intercorrelation among X variables to illustrate additional principles of multiple regression.
- Learn to interpret JMP output for multiple regression analyses.

Exercise 1- Assessing scatterplots and residuals to identify problems in linear regression

Quinn and Keough (pp. 97-98) and the JMP manual authors (pp ) propose that researchers use scatterplots of X values, Y values, and residuals to examine the results of regression analyses. We performed similar assessments as part of last week's lab when we identified outliers in our measurement data. Today, we will examine four pairs of variables and assess how useful scatterplots are for identifying potential problems with the data and with the linear regression.

Opening the data file

Open the file scatterplotsf03.jmp. You will see four pairs of ExampleX and ExampleY variables.

Conducting regressions

Select the Analyze and Fit Y by X commands, and drag ExampleX1 into the X, Factor box and ExampleY1 into the Y, Response box. Select the red triangle under Bivariate Fit of ExampleY1 By ExampleX1 and select Fit Line. You will see the predicted linear relationship between the two variables. Make a table listing the example number and the slope, intercept, and r square reported in your output window. Repeat this procedure for all four pairs of variables. What do you conclude about the regression statistics?
Now examine each scatterplot closely and write a brief description stating which of the following applies: a typical bivariate normal distribution of values for both variables, a nonlinear relationship between X and Y, outliers on the Y axis, or extreme values on the X axis. Once you have completed this, click the red triangle under Linear Fit and select Plot Residuals for each regression. For each example, write an additional brief description of how the pattern of residuals relates to your first assessment. Finally, determine whether your examples conform to the four examples in the Anscombe data set (page 97 of your text), and if so, identify which one each matches.
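The point of this exercise can be previewed outside JMP. The sketch below fits a least-squares line to each of the four data sets published by Anscombe (1973) and prints the summary statistics. It uses NumPy rather than JMP, and the values are Anscombe's published ones, which are not necessarily the values stored in scatterplotsf03.jmp. The helper name fit_line is made up for this illustration.

```python
import numpy as np

# Anscombe's (1973) published quartet. Assumption: these are NOT read from
# scatterplotsf03.jmp; they only illustrate the idea behind Exercise 1.
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
quartet = {
    1: (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    2: (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    3: (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    4: ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
        [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def fit_line(x, y):
    """Return slope, intercept, r square, and residuals of a least-squares line."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    slope, intercept = np.polyfit(x, y, 1)  # coefficients, highest degree first
    residuals = y - (intercept + slope * x)
    r_square = 1.0 - np.sum(residuals ** 2) / np.sum((y - y.mean()) ** 2)
    return slope, intercept, r_square, residuals


results = {k: fit_line(x, y) for k, (x, y) in quartet.items()}
for k, (slope, intercept, r_square, residuals) in results.items():
    print(f"Example {k}: slope={slope:.3f}  intercept={intercept:.3f}  "
          f"r2={r_square:.3f}  largest |residual|={np.abs(residuals).max():.2f}")
```

The slope, intercept, and r square are nearly the same in all four fits, so only the scatterplots and residuals reveal how different the data sets are.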

Exercise 2- Using an example to illustrate principles of multiple regression

We will use experimental data from my research on beetle/host plant relationships to illustrate principles of multiple regression. This data set is useful because there are relatively few predictor variables (only 3), and they are all potentially important. Thus, we can concentrate on the differences between multiple and single linear regression and postpone discussion of how to select X variables for regression models until a later exercise. In addition to the actual predictor variables, which are intercorrelated, we will also work with three uncorrelated predictor variables to study the effects of intercorrelation on regression coefficients and correlations.[1]

Opening the data file

Open the file calif_beetle_survival.jmp.

In August 1989 I performed a series of experiments comparing growth and survival of beetle larvae on four willow species in the laboratory and field. I wanted to know whether larval survival was related to the concentration of chemicals in the leaves that beetles used to make a defensive secretion. But I also wanted to know whether the nutritional quality of the plants was important. I measured

1. water content of the leaves [mass of H2O in leaves / total leaf mass]
2. mean growth rate of larvae in the laboratory on each plant [Ln(final mass - initial mass)/# days]
3. the total amount of host plant salicylates in the leaves [log(mg of salicylates / leaf mass in g)]
4. dependent variable: the average survival of beetle larvae per plant in nature (% survival)

I originally created the three variables Prin1, Prin2, and Prin3 during an exploratory analysis of this data set, which I included in my Ph.D. dissertation.
Regressing survival on the uncorrelated predictor variables

Conducting single linear regressions

Construct a table with six columns: variable name, slope, intercept, correlation coefficient, r square, and MS ERR. Use the Analyze and Multivariate commands with the four variables Prin1, Prin2, Prin3, and survival to obtain correlations between the three X variables and Y. Note whether the X variables appear to be intercorrelated. Use the Fit Y by X platform to conduct three regressions, one of survival on each X variable. Fill in your table with the values from the regression output.

[1] The uncorrelated predictor variables were constructed through a multivariate technique that we will cover later in the semester, Principal Components Analysis. We will discuss how this analysis constructs uncorrelated variables at a later time.
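The footnote's claim that principal components are uncorrelated can be checked numerically. The sketch below uses NumPy and synthetic numbers (hypothetical stand-ins, not the values in calif_beetle_survival.jmp) to build three intercorrelated variables, rotate them onto their principal axes, and compare the two correlation matrices. It shows the general idea only and is not necessarily the exact procedure used to create Prin1, Prin2, and Prin3.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins for three intercorrelated measurements
# (synthetic numbers, not the beetle data).
n = 40
shared = rng.normal(size=n)
X = np.column_stack([
    shared + 0.5 * rng.normal(size=n),
    -0.8 * shared + 0.5 * rng.normal(size=n),
    0.6 * shared + 0.7 * rng.normal(size=n),
])

# Standardize each column, then rotate onto the principal axes via the SVD.
Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
_, _, Vt = np.linalg.svd(Z, full_matrices=False)
scores = Z @ Vt.T  # columns play the role of Prin1, Prin2, Prin3

corr_original = np.corrcoef(X, rowvar=False)
corr_scores = np.corrcoef(scores, rowvar=False)

print("Correlations among the original variables:")
print(np.round(corr_original, 2))
print("Correlations among the principal component scores:")
print(np.round(corr_scores, 2))
```

The off-diagonal entries of the second matrix are zero up to rounding error, which is what makes such scores convenient for illustrating regression with uncorrelated X variables.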

Conducting multiple linear regression

Select the Analyze and Fit Model commands, drag Prin1, Prin2, and Prin3 into the Construct Model Effects box, and drag survival into the Y box. Select Run Model and examine the output.

Interpreting the regression output

At the top of the output, you will see a plot of the predicted versus actual values of Y. To the right of this are leverage plots, which we will discuss later. There are many similarities to the output from single linear regression. The Summary of Fit window shows the value of multiple R square. It also shows R square adj, the R square adjusted for the number of predictors. Finally, the Summary of Fit window shows the mean of the dependent (response) variable and the number of observations.

The ANOVA table shows the Sum of Squares explained by the regression variable on the Model line. The total Sum of Squares is shown on the C Total line. Finally, the SS shown on the Error line represents the remaining variation not explained by the regression variable.

The Parameter Estimates window shows the intercept and the partial regression coefficient estimate for each variable, their standard errors, and t tests for significance of the slopes. In the Effect Tests window below this, the Sum of Squares attributed to each regression variable and an F test are shown. Using a hand calculator, confirm that the Sum of Squares values for the individual variables, added together, equal the Sum of Squares on the Model line of the Analysis of Variance window.

Calculating predicted and residual values

Click on the red triangle at the top of the output window (next to Response survival), and select Save Columns and Predicted Values. Click on the red triangle again and save the residuals. You should see two new columns added to the data set, corresponding to the regression and error variables.
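The quantities described above can be reproduced by hand. The following is a minimal sketch with NumPy, using synthetic data (hypothetical values, not the beetle data). It fits a three-predictor regression by least squares, saves predicted and residual values, and computes the Model, Error, and C Total sums of squares along with R square and adjusted R square. The variable names are chosen for this illustration and are not JMP's.

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthetic example: three predictors and one response (hypothetical values).
n = 30
X = rng.normal(size=(n, 3))
y = 50 + 4.0 * X[:, 0] - 2.0 * X[:, 1] + 1.0 * X[:, 2] + rng.normal(scale=3.0, size=n)

# Least-squares fit with an intercept column.
design = np.column_stack([np.ones(n), X])
coef, *_ = np.linalg.lstsq(design, y, rcond=None)

predicted = design @ coef   # the regression variable ("Predicted" column)
residual = y - predicted    # the error variable ("Residual" column)

ss_total = np.sum((y - y.mean()) ** 2)          # C Total line
ss_model = np.sum((predicted - y.mean()) ** 2)  # Model line
ss_error = np.sum(residual ** 2)                # Error line

p = X.shape[1]                                  # number of predictors
r_square = ss_model / ss_total
r_square_adj = 1 - (ss_error / (n - p - 1)) / (ss_total / (n - 1))
ms_error = ss_error / (n - p - 1)
f_ratio = (ss_model / p) / ms_error

print(f"intercept and slopes: {np.round(coef, 3)}")
print(f"SS model={ss_model:.2f}  SS error={ss_error:.2f}  SS total={ss_total:.2f}")
print(f"R square={r_square:.3f}  R square adj={r_square_adj:.3f}  F={f_ratio:.2f}")
```

Adjusted R square divides each sum of squares by its degrees of freedom, so it is always somewhat smaller than R square when the model contains at least one predictor.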
Calculating regression coefficients

To obtain values for intercorrelation among X variables in the multiple regression, click on the red triangle at the top of the output window (next to Response survival), and select Estimates and Correlation of Estimates. The values are shown at the bottom of the output. To obtain partial correlation coefficients, return to the data table, select Analyze and Multivariate, and drag your X variables and Y into the Y, Columns box. In the red triangle next to Multivariate in the output window, select Partial Correlations. You will see two correlation matrices at the top of the output window, one for the raw correlations between the X variables and Y, and another for the partial correlations in the multiple regression.

Calculating Beta coefficients

For some reason, it is much harder to obtain beta coefficients in JMP 4 than it was in JMP 3. We will not conduct this analysis here, but you can do it by creating new columns representing the standardized values of each independent variable and the dependent variable (in the formula editor this is performed by the Col Standardize function). Then you can rerun the regression using the standardized variables. The slope values in that multiple regression are the beta coefficients.

Illustrating general regression principles

Return to your data table and locate the column window on the left side of the table. Scroll down until you see the Predicted survival and Residual survival variables. Now use the Analyze and Distribution commands to obtain distributions for three variables: survival, Predicted survival, and Residual survival. Examine the moments carefully. Which principles of regression, covered in lecture and shown on the handout on multiple regression principles, are illustrated here? Now click the red triangle beneath each variable name, and select Display and More Moments. What additional principle of regression is now evident?

Now select the Analyze and Fit Y by X commands, and place Prin1, Prin2, Prin3, and Predicted survival into the X box and Residual survival into the Y box. Then select the red triangle under each bivariate fit and select Fit Line. Which additional multiple regression principle is now evident?

Multiple regression when X variables are uncorrelated

Return to the output of your multiple regression analysis and compare the partial regression slopes shown there to the slope values from the individual regressions that you wrote into your table above. Calculate the sum of the r square values from the individual regressions and compare this value to multiple r square. What additional principle of multiple regression is illustrated here?

Multiple regression when X variables are correlated

Now we will analyze the relationship between the original independent variables (water content, growth, host plant chemistry) and survival. First, we will run the single regressions. Construct another table with six columns: variable name, slope, intercept, correlation coefficient, r square, and MS ERR.
Use the Analyze and Multivariate commands with the four variables watcont, growth, logsaln, and survival to obtain correlations between the three X variables and Y. Note that the X variables are intercorrelated. Use the Fit Y by X platform to conduct three regressions, one of survival on each X variable. Fill in your table with the values from the regression output.

To run the multiple regression, select the Analyze and Fit Model commands, drag watcont, growth, and logsaln into the Construct Model Effects box, and drag survival into the Y box. Select Run Model and compare the partial regression slopes shown there to the slope values from the individual regressions. Calculate the sum of the r square values from the individual regressions and compare this value to multiple r square. What principle of multiple regression is illustrated here?

Exercise 3- Comparison of multiple regression to simple regression models

Advantages of multiple regression

When one suspects that several factors influence the dependent variable, it can be better to include all of them. Multiple regression allows a more sensitive test for multiple effects than a series of single regressions. Specifically, the effect of each independent variable is adjusted for the effects of the other independent variables.

Effects of intercorrelation

It is best to conduct multiple regression when independent variables are mildly intercorrelated. If there is high intercorrelation, it is difficult to separate the contribution of each independent variable in explaining the variation in the dependent variable. With high intercorrelation, two independent variables are measuring much the same thing, and it is not really fair to consider them separately. Another problem is that the partial regression coefficients become very unstable and susceptible to outliers. See the JMP manual (pages ) and Quinn and Keough (pages ) for more on the collinearity problem. If there is no intercorrelation, single regressions yield the same information as the multiple regression.

Application to the data set

In the data we are analyzing, we can test the effect of host plant salicylates on larval survival while adjusting for larval growth and water content. The multiple regression reveals new information because the single regression does not account for intercorrelation among plant characteristics. Specifically, high salicylate plants tended to have low water contents (and low beetle growth). In a single regression we may not detect a relationship between salicylate content and survival because the effect of salicylates is cancelled out by the low water content of high salicylate plants. In the multiple regression, both variables are included, and the effect of salicylate content is adjusted for the relationship between water content and survival.

Comparing the multiple regression to the single regressions

Return to the output from your multiple regression model (which shows up as a Fit Least Squares window), click the red triangle next to Response survival, and select Save Columns and Effect Leverage Pairs.
This operation saves the data from the leverage plots shown on the right-hand side of the regression output. Save your file as calif_beetle_survivalmod.jmp. The Effect Leverage plots show the relationship between each predictor variable and Y after adjustment for the other predictor variables in the model. Data from multiple regressions are often presented this way (see Fig 6.4, page 126 of Quinn and Keough for an example).

Use the Fit Y by X platform to conduct three regressions, one of survival on each X variable. Compare the degree of scatter around the regression lines in the single regressions to that observed in the leverage plots (which you can also examine using the Fit Y by X platform). In which plots do you see less scatter: the leverage plots from the multiple regression or the simple linear regressions? Write a brief paragraph describing how the effect of each variable depended on whether it was regressed against survival alone or included in a multiple regression model.

Tasks

1. Evaluate the four scatterplots found in the file scatterplotsf03.jmp. Write a brief description of whether any of the scatterplots indicate problems with the data or with the relationship between X and Y. Use residual plots to help you evaluate the plots.
2. Learn to interpret the output from a multiple regression.

3. Describe how two examples conform to the principles of multiple regression discussed in class: one without intercorrelation among independent variables and the other with intercorrelation. You will include some tables here.
4. Write a brief paragraph summarizing your observations about the scatter around the regression line for leverage plots resulting from a multiple regression versus single linear regressions, and describing how the effect of each independent variable depended on whether it was regressed against the dependent variable alone or included in a multiple regression model.
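The adjustment behind the leverage plots in Exercise 3 and Task 4 can be sketched with an added-variable (partial regression) calculation: remove the other predictor from both X and Y, then regress the adjusted Y on the adjusted X. The example below uses NumPy and synthetic data (hypothetical values, not the beetle data). JMP's leverage plots may be scaled or centered differently, so treat this as an illustration of the principle rather than a reproduction of JMP's output. The helper residualize is a name invented here.

```python
import numpy as np

rng = np.random.default_rng(2)


def residualize(target, others):
    """Residuals of target after least-squares regression on others plus an intercept."""
    design = np.column_stack([np.ones(len(target)), others])
    coef, *_ = np.linalg.lstsq(design, target, rcond=None)
    return target - design @ coef


# Synthetic, negatively correlated predictors whose effects partly cancel.
n = 40
x1 = rng.normal(size=n)
x2 = -0.7 * x1 + 0.5 * rng.normal(size=n)
y = 10 + 3.0 * x1 + 4.0 * x2 + rng.normal(size=n)

# Full multiple regression of y on x1 and x2.
design = np.column_stack([np.ones(n), x1, x2])
full_coef, *_ = np.linalg.lstsq(design, y, rcond=None)

# Adjust both x1 and y for x2, then regress adjusted y on adjusted x1.
x1_adj = residualize(x1, x2)
y_adj = residualize(y, x2)
partial_slope = np.sum(x1_adj * y_adj) / np.sum(x1_adj ** 2)

# Simple regression of y on x1 alone, for comparison.
simple_fit = np.polyfit(x1, y, 1)
simple_slope = simple_fit[0]
scatter_simple = np.sum((y - np.polyval(simple_fit, x1)) ** 2)
scatter_adjusted = np.sum((y_adj - partial_slope * x1_adj) ** 2)

print(f"simple slope for x1:            {simple_slope:.3f}")
print(f"partial slope (adjusted plot):  {partial_slope:.3f}")
print(f"slope for x1 in the full model: {full_coef[1]:.3f}")
print(f"residual SS, simple regression: {scatter_simple:.2f}")
print(f"residual SS, adjusted plot:     {scatter_adjusted:.2f}")
```

The slope in the adjusted plot equals the partial regression coefficient from the full model, and the scatter around the adjusted line is never larger than the scatter around the simple regression line.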

Principles of multiple regression

- The mean of the regression variable (RV, the predicted values) is equal to the mean of Y.
- The mean of the error variable equals zero.
- $SS_{RV} + SS_{Err} = SS_Y$
- Multiple $r^2 = SS_{RV} / SS_Y$
- The error variable is uncorrelated with the RV and with the X variables.

If X variables are not intercorrelated

- Partial regression slopes in the multiple regression are equal to the slopes obtained when Y is regressed on each X variable individually.
- Sum of squared correlation coefficients = multiple $r^2$

If X variables are intercorrelated

- Partial regression slopes in the multiple regression differ from the slopes obtained when Y is regressed on each X variable individually.
- Sum of squared correlation coefficients is typically > multiple $r^2$ (it can be smaller when the effect of one X variable is masked by another).
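These principles can be verified numerically. The sketch below, written with NumPy on synthetic data (hypothetical values, not the beetle data), constructs one set of exactly uncorrelated predictors and one set of intercorrelated predictors, then compares partial slopes with single-regression slopes and multiple r square with the sum of squared correlations. The function names fit and summarize are invented for this illustration.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 40


def fit(X, y):
    """Multiple regression with intercept: return slopes, predicted values, residuals."""
    design = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    predicted = design @ coef
    return coef[1:], predicted, y - predicted


def summarize(X, y):
    """Collect the quantities that the principles above refer to."""
    slopes, predicted, residual = fit(X, y)
    ss_y = np.sum((y - y.mean()) ** 2)
    multiple_r2 = np.sum((predicted - y.mean()) ** 2) / ss_y
    columns = range(X.shape[1])
    single_slopes = np.array([np.polyfit(X[:, j], y, 1)[0] for j in columns])
    sum_r2 = sum(np.corrcoef(X[:, j], y)[0, 1] ** 2 for j in columns)
    return {"slopes": slopes, "single_slopes": single_slopes,
            "multiple_r2": multiple_r2, "sum_r2": sum_r2,
            "predicted": predicted, "residual": residual}


# Exactly uncorrelated predictors: orthogonalize centered random columns.
raw = rng.normal(size=(n, 3))
q, _ = np.linalg.qr(raw - raw.mean(axis=0))
X_uncorr = q * np.sqrt(n - 1)  # rescale so each column has unit variance

# Intercorrelated predictors: mix the uncorrelated columns together.
mix = np.array([[1.0, 0.8, 0.6],
                [0.0, 0.6, 0.3],
                [0.0, 0.0, 0.7]])
X_corr = X_uncorr @ mix

true_slopes = np.array([3.0, -2.0, 1.0])
noise = rng.normal(scale=2.0, size=n)
y_uncorr = 60 + X_uncorr @ true_slopes + noise
y_corr = 60 + X_corr @ true_slopes + noise

uncorr = summarize(X_uncorr, y_uncorr)
corr = summarize(X_corr, y_corr)

for label, s in (("uncorrelated X", uncorr), ("intercorrelated X", corr)):
    print(label)
    print("  partial slopes:", np.round(s["slopes"], 3))
    print("  single slopes: ", np.round(s["single_slopes"], 3))
    print(f"  multiple r2={s['multiple_r2']:.3f}  sum of r2={s['sum_r2']:.3f}")
```

In the uncorrelated case the two sets of slopes and the two r square totals agree; in the intercorrelated case they do not. In both cases the residuals have mean zero and are uncorrelated with the predicted values.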


More information

STATISTICS INFORMED DECISIONS USING DATA

STATISTICS INFORMED DECISIONS USING DATA STATISTICS INFORMED DECISIONS USING DATA Fifth Edition Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation Learning Objectives 1. Draw and interpret scatter diagrams

More information

Introduction to regression

Introduction to regression Introduction to regression Regression describes how one variable (response) depends on another variable (explanatory variable). Response variable: variable of interest, measures the outcome of a study

More information

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing

More information

Problem Set 3 ECN Econometrics Professor Oscar Jorda. Name. ESSAY. Write your answer in the space provided.

Problem Set 3 ECN Econometrics Professor Oscar Jorda. Name. ESSAY. Write your answer in the space provided. Problem Set 3 ECN 140 - Econometrics Professor Oscar Jorda Name ESSAY. Write your answer in the space provided. 1) Sir Francis Galton, a cousin of James Darwin, examined the relationship between the height

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

First of two parts Joseph Hogan Brown University and AMPATH

First of two parts Joseph Hogan Brown University and AMPATH First of two parts Joseph Hogan Brown University and AMPATH Overview What is regression? Does regression have to be linear? Case study: Modeling the relationship between weight and CD4 count Exploratory

More information

Analysis of Variance (ANOVA) Program Transcript

Analysis of Variance (ANOVA) Program Transcript Analysis of Variance (ANOVA) Program Transcript DR. JENNIFER ANN MORROW: Welcome to Analysis of Variance. My name is Dr. Jennifer Ann Morrow. In today's demonstration, I'll review with you the definition

More information

MULTIPLE REGRESSION OF CPS DATA

MULTIPLE REGRESSION OF CPS DATA MULTIPLE REGRESSION OF CPS DATA A further inspection of the relationship between hourly wages and education level can show whether other factors, such as gender and work experience, influence wages. Linear

More information

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA ingle factor or One - Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

This tutorial presentation is prepared by. Mohammad Ehsanul Karim

This tutorial presentation is prepared by. Mohammad Ehsanul Karim STATA: The Red tutorial STATA: The Red tutorial This tutorial presentation is prepared by Mohammad Ehsanul Karim ehsan.karim@gmail.com STATA: The Red tutorial This tutorial presentation is prepared by

More information

Statistical reports Regression, 2010

Statistical reports Regression, 2010 Statistical reports Regression, 2010 Niels Richard Hansen June 10, 2010 This document gives some guidelines on how to write a report on a statistical analysis. The document is organized into sections that

More information

Class 7 Everything is Related

Class 7 Everything is Related Class 7 Everything is Related Correlational Designs l 1 Topics Types of Correlational Designs Understanding Correlation Reporting Correlational Statistics Quantitative Designs l 2 Types of Correlational

More information

Psychology of Perception Psychology 4165, Spring 2003 Laboratory 1 Weight Discrimination

Psychology of Perception Psychology 4165, Spring 2003 Laboratory 1 Weight Discrimination Psychology 4165, Laboratory 1 Weight Discrimination Weight Discrimination Performance Probability of "Heavier" Response 1.0 0.8 0.6 0.4 0.2 0.0 50.0 100.0 150.0 200.0 250.0 Weight of Test Stimulus (grams)

More information

Psychology of Perception Psychology 4165, Fall 2001 Laboratory 1 Weight Discrimination

Psychology of Perception Psychology 4165, Fall 2001 Laboratory 1 Weight Discrimination Psychology 4165, Laboratory 1 Weight Discrimination Weight Discrimination Performance Probability of "Heavier" Response 1.0 0.8 0.6 0.4 0.2 0.0 50.0 100.0 150.0 200.0 250.0 Weight of Test Stimulus (grams)

More information

Measuring the User Experience

Measuring the User Experience Measuring the User Experience Collecting, Analyzing, and Presenting Usability Metrics Chapter 2 Background Tom Tullis and Bill Albert Morgan Kaufmann, 2008 ISBN 978-0123735584 Introduction Purpose Provide

More information

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE Spring 20 11, Volume 1, Issue 1 THE STATSWHISPERER The StatsWhisperer Newsletter is published by staff at StatsWhisperer. Visit us at: www.statswhisperer.com Introduction to this Issue The current issue

More information

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions

More information

An Introduction to Modern Econometrics Using Stata

An Introduction to Modern Econometrics Using Stata An Introduction to Modern Econometrics Using Stata CHRISTOPHER F. BAUM Department of Economics Boston College A Stata Press Publication StataCorp LP College Station, Texas Contents Illustrations Preface

More information

Normal Q Q. Residuals vs Fitted. Standardized residuals. Theoretical Quantiles. Fitted values. Scale Location 26. Residuals vs Leverage

Normal Q Q. Residuals vs Fitted. Standardized residuals. Theoretical Quantiles. Fitted values. Scale Location 26. Residuals vs Leverage Residuals 400 0 400 800 Residuals vs Fitted 26 42 29 Standardized residuals 2 0 1 2 3 Normal Q Q 26 42 29 360 400 440 2 1 0 1 2 Fitted values Theoretical Quantiles Standardized residuals 0.0 0.5 1.0 1.5

More information

Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties

Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties Bob Obenchain, Risk Benefit Statistics, August 2015 Our motivation for using a Cut-Point

More information

NORTH SOUTH UNIVERSITY TUTORIAL 2

NORTH SOUTH UNIVERSITY TUTORIAL 2 NORTH SOUTH UNIVERSITY TUTORIAL 2 AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 Correlation Analysis INTRODUCTION In correlation analysis, we estimate

More information

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance

The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance The SAGE Encyclopedia of Educational Research, Measurement, Multivariate Analysis of Variance Contributors: David W. Stockburger Edited by: Bruce B. Frey Book Title: Chapter Title: "Multivariate Analysis

More information

Section 3.2 Least-Squares Regression

Section 3.2 Least-Squares Regression Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.

More information

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Chapter 3: Investigating associations between two variables Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Extract from Study Design Key knowledge

More information

Intro to SPSS. Using SPSS through WebFAS

Intro to SPSS. Using SPSS through WebFAS Intro to SPSS Using SPSS through WebFAS http://www.yorku.ca/computing/students/labs/webfas/ Try it early (make sure it works from your computer) If you need help contact UIT Client Services Voice: 416-736-5800

More information

Reveal Relationships in Categorical Data

Reveal Relationships in Categorical Data SPSS Categories 15.0 Specifications Reveal Relationships in Categorical Data Unleash the full potential of your data through perceptual mapping, optimal scaling, preference scaling, and dimension reduction

More information

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Activity 1 Examining Data From Class Background Download

More information

Biostatistics II

Biostatistics II Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,

More information

Multiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Multiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University Multiple Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Multiple Regression 1 / 19 Multiple Regression 1 The Multiple

More information

Chapter 14: More Powerful Statistical Methods

Chapter 14: More Powerful Statistical Methods Chapter 14: More Powerful Statistical Methods Most questions will be on correlation and regression analysis, but I would like you to know just basically what cluster analysis, factor analysis, and conjoint

More information

How to Conduct On-Farm Trials. Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona

How to Conduct On-Farm Trials. Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona How to Conduct On-Farm Trials Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona How can you determine whether a treatment (this might be an additive, a fertilizer, snake

More information

3.2A Least-Squares Regression

3.2A Least-Squares Regression 3.2A Least-Squares Regression Linear (straight-line) relationships between two quantitative variables are pretty common and easy to understand. Our instinct when looking at a scatterplot of data is to

More information

SPSS Portfolio. Brittany Murray BUSA MWF 1:00pm-1:50pm

SPSS Portfolio. Brittany Murray BUSA MWF 1:00pm-1:50pm SPSS Portfolio Brittany Murray BUSA 2182 MWF 1:00pm-1:50pm Table Of Contents I) SPSS Computer Lab Assignment # 1 Frequency Distribution a) Cover Page b) Explanatory Paragraph c) Appendix II) SPSS Computer

More information

Effects of Nutrients on Shrimp Growth

Effects of Nutrients on Shrimp Growth Data Set 5: Effects of Nutrients on Shrimp Growth Statistical setting This Handout is an example of extreme collinearity of the independent variables, and of the methods used for diagnosing this problem.

More information

Dr. Kelly Bradley Final Exam Summer {2 points} Name

Dr. Kelly Bradley Final Exam Summer {2 points} Name {2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.

More information

Lab 8: Multiple Linear Regression

Lab 8: Multiple Linear Regression Lab 8: Multiple Linear Regression 1 Grading the Professor Many college courses conclude by giving students the opportunity to evaluate the course and the instructor anonymously. However, the use of these

More information

ANOVA in SPSS (Practical)

ANOVA in SPSS (Practical) ANOVA in SPSS (Practical) Analysis of Variance practical In this practical we will investigate how we model the influence of a categorical predictor on a continuous response. Centre for Multilevel Modelling

More information

The Effectiveness of Captopril

The Effectiveness of Captopril Lab 7 The Effectiveness of Captopril In the United States, pharmaceutical manufacturers go through a very rigorous process in order to get their drugs approved for sale. This process is designed to determine

More information

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression! Equation of Regression Line; Residuals! Effect of Explanatory/Response Roles! Unusual Observations! Sample

More information

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data

Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today s Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this

More information

Survey Project Data Analysis Guide

Survey Project Data Analysis Guide Survey Project Data Analysis Guide I. Computing Scale Scores. - In the data file that I have given you, I have already done the following. - Selected the items that will be used for the Radford Morality

More information

A Penny for Your Thoughts: Scientific Measurements and Introduction to Excel

A Penny for Your Thoughts: Scientific Measurements and Introduction to Excel A Penny for Your Thoughts: Scientific Measurements and Introduction to Excel Pre-lab Assignment: Introduction Reading: 1. Chapter sections 1.4 through 1.6 in your course text. 2. This lab handout. Questions:

More information

Lesson 9: Two Factor ANOVAS

Lesson 9: Two Factor ANOVAS Published on Agron 513 (https://courses.agron.iastate.edu/agron513) Home > Lesson 9 Lesson 9: Two Factor ANOVAS Developed by: Ron Mowers, Marin Harbur, and Ken Moore Completion Time: 1 week Introduction

More information

EXPERIMENT 3 ENZYMATIC QUANTITATION OF GLUCOSE

EXPERIMENT 3 ENZYMATIC QUANTITATION OF GLUCOSE EXPERIMENT 3 ENZYMATIC QUANTITATION OF GLUCOSE This is a team experiment. Each team will prepare one set of reagents; each person will do an individual unknown and each team will submit a single report.

More information