Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction
In this exercise, we will gain experience assessing scatterplots in regression and work through examples that illustrate the principles of multiple regression.

Objectives

Learn how to assess scatterplots and residuals in a single linear regression.
Use an example of multiple regression with no intercorrelation among X variables to illustrate principles of multiple regression.
Use an example of multiple regression with intercorrelation among X variables to illustrate additional principles of multiple regression.
Learn to interpret JMP output for multiple regression analyses.

Exercise 1- Assessing scatterplots and residuals to identify problems in linear regression

Quinn and Keough (pp 97-98) and the JMP manual authors (pp ) propose that researchers use scatterplots of X values, Y values, and residuals to examine results of regression analyses. We performed similar assessments as a part of last week's lab when we identified outliers in our measurement data. Today, we will examine four pairs of variables and assess the utility of scatterplots for identifying potential problems with the data and with linear regression.

Opening the data file

Open the file scatterplotsf03.jmp. You will see four pairs of ExampleX and ExampleY variables.

Conducting regressions

Select the Analyze and Fit Y by X commands, and drag ExampleX1 into the X, Factor box and ExampleY1 into the Y, Response box. Select the red triangle under Bivariate Fit of ExampleY1 By ExampleX1 and select Fit Line. You will see the predicted linear relationship between the two variables. Make a table listing the example number, slope, intercept, and r square shown in your output window. Repeat this procedure for all four pairs of variables. What do you conclude about the regression statistics?
Now examine each scatterplot closely and write a brief description of which of the following applies: a typical bivariate normal distribution of values for both variables, a nonlinear relationship between X and Y, outliers on the Y axis, or extreme values on the X axis. Once you have completed this, click the red triangle under Linear Fit and select Plot Residuals for each regression. For each example, write an additional brief description of how the pattern of residuals relates to your first assessment. Finally, determine whether your examples match the four examples in the Anscombe data set (page 97 of your text), and if so, identify which one each matches.
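The point of Exercise 1 can be previewed outside JMP. The sketch below is not the lab file scatterplotsf03.jmp; it assumes the published Anscombe (1973) quartet values as a stand-in, and the function name fit_line is our own. It fits a least-squares line to each pair and reports the statistics you are asked to tabulate, plus the largest residual.

```python
# Published Anscombe (1973) quartet values, used here as a stand-in for the
# lab file (assumption: the lab's four examples may or may not match these).
x123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
quartet = {
    1: (x123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    2: (x123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    3: (x123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    4: ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
        [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def fit_line(x, y):
    """Least-squares line: returns (slope, intercept, r square, residuals)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_square = sxy ** 2 / (sxx * syy)
    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
    return slope, intercept, r_square, residuals


results = {k: fit_line(x, y) for k, (x, y) in quartet.items()}
for k, (slope, intercept, r_square, residuals) in results.items():
    largest = max(residuals, key=abs)  # residual with the largest magnitude
    print(f"Example {k}: slope={slope:.3f} intercept={intercept:.3f} "
          f"r square={r_square:.3f} largest residual={largest:.2f}")
```

The four rows of regression statistics come out nearly identical, so the differences among the examples only show up in the scatterplots and residuals.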
Exercise 2- Using an example to illustrate principles of multiple regression

We will use experimental data from my research on beetle/host plant relationships to illustrate principles of multiple regression. This data set is useful because there are relatively few predictor variables (only 3), and they are all potentially important. Thus, we can concentrate on the differences between multiple and single linear regression and postpone discussion of selecting X variables for regression models until a later exercise. In addition to the actual predictor variables, which are intercorrelated, we will also work with three uncorrelated predictor variables to study the effects of intercorrelation on regression coefficients and correlations (see footnote 1).

Opening the data file

Open the file calif_beetle_survival.jmp

In August 1989 I performed a series of experiments comparing growth and survival of beetle larvae on four willow species in the laboratory and field. I wanted to know whether larval survival was related to the concentration of chemicals in the leaves that beetles used to make a defensive secretion. But I also wanted to know whether the nutritional quality of the plants was important. I measured
1. water content of the leaves [mass of H2O in leaves/total leaf mass]
2. mean growth rate of larvae in the laboratory on each plant [Ln(final mass - initial mass)/# days]
3. the total amount of host plant salicylates in the leaves [log(mg of salicylates/leaf mass in g)]
4. dependent variable: the average survival of beetle larvae per plant in nature (% survival)

I originally created the three variables Prin1, Prin2, and Prin3 during an exploratory analysis of this data, which I included in my Ph.D. dissertation.
Regressing survival on the uncorrelated predictor variables

Conducting single linear regressions

Construct a table with six columns: Variable name, slope, intercept, correlation coefficient, r square, and MS ERR. Use the Analyze and Multivariate commands with the four variables Prin1, Prin2, Prin3, and survival to obtain correlations between the three X variables and Y. Note whether the X variables appear to be intercorrelated. Use the Fit Y by X platform to conduct three regressions, one of survival on each X variable. Fill in the values in your table with the regression output.

1 The uncorrelated predictor variables were constructed through a multivariate technique that we will cover later in the semester, Principal Components Analysis. We will discuss how this analysis constructs uncorrelated variables at a later time.
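The correlation check in this step (Analyze and Multivariate) amounts to computing a matrix of pairwise Pearson correlations. A minimal sketch follows, assuming made-up columns rather than the beetle data: the three predictors are constructed to be exactly uncorrelated, standing in for Prin1 to Prin3, and the response values are hypothetical.

```python
from math import sqrt


def pearson_r(a, b):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    sab = sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b))
    saa = sum((ai - mean_a) ** 2 for ai in a)
    sbb = sum((bi - mean_b) ** 2 for bi in b)
    return sab / sqrt(saa * sbb)


# Hypothetical values (NOT the beetle data). The three predictors are
# mutually orthogonal by construction; the response is invented.
data = {
    "Prin1": [-3, -1, 1, 3, -3, -1, 1, 3],
    "Prin2": [1, -1, -1, 1, 1, -1, -1, 1],
    "Prin3": [-1, -1, -1, -1, 1, 1, 1, 1],
    "survival": [42, 50, 61, 75, 38, 49, 58, 67],
}

names = list(data)
corr = {(a, b): pearson_r(data[a], data[b]) for a in names for b in names}

# Print the correlation matrix row by row
print(" " * 10 + "".join(f"{name:>10}" for name in names))
for a in names:
    print(f"{a:>10}" + "".join(f"{corr[(a, b)]:10.3f}" for b in names))
```

Off-diagonal entries among the X variables that are at or near zero indicate no intercorrelation; the last row and column give the correlation of each X with Y.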
Conducting multiple linear regression

Select the Analyze and Fit Model commands, drag Prin1, Prin2, and Prin3 into the Construct Model Effects box, and drag survival into the Y box. Select Run Model and examine the output.

Interpreting the regression output

At the top of the output, you will see a plot of the predicted versus actual values of Y. To the right of this are leverage plots, which we will discuss later. There are many similarities to the output from single linear regression. The Summary of Fit window shows you the value for multiple R square. It also shows the R square adj. Finally, the Summary of Fit window shows the mean of the dependent (response) variable and the number of observations. The ANOVA table shows the Sum of Squares explained by the regression variable on the Model line. The total Sum of Squares is shown on the C Total line. Finally, the SS shown on the Error line in the ANOVA table represents the remaining variation not explained by the regression variable. The Parameter Estimates window shows the intercept and partial regression coefficient estimates for each variable, their standard errors, and t tests for significance of the slopes. In the Effect Tests window below this, the Sum of Squares attributed to each regression variable and an F test are indicated. Using a hand calculator, confirm that the Sum of Squares values for the individual variables add up to the Sum of Squares on the Model line in the Analysis of Variance window.

Calculating predicted and residual values

Click on the red triangle at the top of the output window (next to Response survival), and select Save Columns and Predicted Values. Click on the red triangle again and save the residuals. You should see two new columns added to the data set, corresponding to the regression and error variables.
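To see where the numbers in the Summary of Fit and ANOVA windows come from, the sketch below fits a multiple regression by solving the normal equations and then rebuilds the Model, Error, and C Total sums of squares, R square, R square adj, and the predicted and residual columns. Everything here is an assumption for illustration: the data are hypothetical (not the beetle data), fit_ols is our own helper, and the per-variable sum of squares uses a shortcut that is valid only because these made-up predictors are mutually uncorrelated.

```python
def fit_ols(xs, y):
    """Least-squares coefficients [intercept, b1, b2, ...] via the normal equations."""
    n, p = len(y), len(xs) + 1
    rows = [[1.0] + [col[i] for col in xs] for i in range(n)]
    # Augmented matrix [X'X | X'y]
    a = [[sum(r[j] * r[k] for r in rows) for k in range(p)]
         + [sum(r[j] * yi for r, yi in zip(rows, y))] for j in range(p)]
    # Gauss-Jordan elimination with partial pivoting
    for c in range(p):
        pivot = max(range(c, p), key=lambda i: abs(a[i][c]))
        a[c], a[pivot] = a[pivot], a[c]
        pivot_value = a[c][c]
        a[c] = [v / pivot_value for v in a[c]]
        for i in range(p):
            if i != c:
                factor = a[i][c]
                a[i] = [vi - factor * vc for vi, vc in zip(a[i], a[c])]
    return [row[p] for row in a]


# Hypothetical data (NOT the beetle data); predictors are mutually orthogonal.
xs = [
    [-3, -1, 1, 3, -3, -1, 1, 3],
    [1, -1, -1, 1, 1, -1, -1, 1],
    [-1, -1, -1, -1, 1, 1, 1, 1],
]
y = [42, 50, 61, 75, 38, 49, 58, 67]

coef = fit_ols(xs, y)
n, k = len(y), len(xs)
predicted = [coef[0] + sum(b * col[i] for b, col in zip(coef[1:], xs)) for i in range(n)]
residual = [yi - pi for yi, pi in zip(y, predicted)]

mean_y = sum(y) / n
ss_total = sum((yi - mean_y) ** 2 for yi in y)          # C Total line
ss_model = sum((pi - mean_y) ** 2 for pi in predicted)  # Model line
ss_error = sum(e ** 2 for e in residual)                # Error line
r_square = ss_model / ss_total
r_square_adj = 1 - (ss_error / (n - k - 1)) / (ss_total / (n - 1))

# Per-variable SS; this shortcut (slope squared times the predictor's corrected SS)
# is valid only because the predictors here are uncorrelated.
effect_ss = []
for b, col in zip(coef[1:], xs):
    mean_col = sum(col) / n
    effect_ss.append(b ** 2 * sum((v - mean_col) ** 2 for v in col))

print("coefficients:", [round(c, 3) for c in coef])
print("SS model, error, total:", round(ss_model, 3), round(ss_error, 3), round(ss_total, 3))
print("R square, R square adj:", round(r_square, 4), round(r_square_adj, 4))
print("sum of per-variable SS:", round(sum(effect_ss), 3))
```

With uncorrelated predictors the per-variable sums of squares add up to the Model sum of squares, which is the hand-calculator check requested above.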
Calculating regression coefficients

To obtain values for intercorrelation among X variables in the multiple regression, click on the red triangle at the top of the output window (next to Response survival), select Estimates and Correlation of Estimates. The values are shown at the bottom of the output. To obtain partial correlation coefficients, return to the data table, select Analyze and Multivariate, and drag your X variables and Y into the Y, Columns box. In the red triangle next to Multivariate in the output window, select Partial Correlations. You will see two correlation matrices at the top of the output window, one for the raw correlations between the X variables and Y, and another for the partial correlations in the multiple regression.

Calculating Beta coefficients

For some reason, it is much harder to obtain beta coefficients in JMP 4 than it was in JMP 3. We will not conduct this analysis here, but you can do this by creating new columns representing the standardized values for each independent variable and the dependent variable (in the formula editor this is performed by the Col Standardize function). Then you can rerun the regression
using the standardized variables. The slope values in this multiple regression are the beta coefficients.

Illustrating general regression principles

Return to your data table and locate the column window on the left side of the table. Scroll down until you see the Predicted survival and Residual survival variables. Now use the Analyze and Distribution commands to obtain distributions for three variables: survival, Predicted survival, and Residual survival. Examine the moments carefully. Which principles of regression, covered in lecture and shown on the handout on multiple regression principles, are illustrated here? Now click the red triangle beneath each variable name, select Display and More Moments. What additional principle of regression is now evident? Now select the Analyze and Fit Y by X commands, and place Prin1, Prin2, Prin3, and Predicted survival into the X box and Residual survival into the Y box. Then select the red triangle under each bivariate fit and select Fit Line. Which additional multiple regression principle is now evident?

Multiple regression when Xs are uncorrelated

Return to the output of your multiple regression analysis and compare the partial regression slopes shown here to the slope values from the individual regressions that you wrote into your table above. Calculate the sum of the r square values from the individual regressions and compare this value to multiple r square. What additional principle of multiple regression is illustrated here?

Multiple regression when Xs are correlated

Now we will analyze the relationship between the original independent variables (water content, growth, host plant chemistry) and survival. First, we will run the single regressions. Construct another table with six columns: Variable name, slope, intercept, correlation coefficient, r square, and MS ERR.
Use the Analyze and Multivariate commands with the four variables watcont, growth, logsaln, and survival to obtain correlations between the three X variables and Y. Note that the X variables are intercorrelated. Use the Fit Y by X platform to conduct three regressions, one of survival on each X variable. Fill in the values in your table with the regression output. To run the multiple regression, select the Analyze and Fit Model commands, drag watcont, growth, and logsaln into the Construct Model Effects box, and drag survival into the Y box. Select Run Model and compare the partial regression slopes shown here to the slope values from the individual regressions. Calculate the sum of the r square values from the individual regressions and compare this value to multiple r square. What principle of multiple regression is illustrated here?

Exercise 3- Comparison of multiple regression to simple regression models

Advantages of multiple regression

When one suspects that several factors influence the dependent variable, it can be better to include all of them. The multiple regression allows for a more sensitive test for multiple effects
than a series of single regressions. Specifically, the effects of each independent variable are adjusted for the effects of the other independent variables.

Effects of intercorrelation

It is best to conduct multiple regression when independent variables are mildly intercorrelated. If there is high intercorrelation, it is difficult to separate the contribution of each independent variable in explaining the variation in the dependent variable. Highly intercorrelated independent variables are largely measuring the same thing, and it is not really fair to consider them separately. Another problem is that the partial regression coefficients are very unstable and susceptible to outliers. See the JMP manual (pages ) and Quinn and Keough (pages ) for more on the collinearity problem. If there is no intercorrelation, single regressions would yield the same slopes as the multiple regression.

Application to the data set

In the data we are analyzing, we can test the effect of host plant salicylates on larval survival while adjusting for larval growth and water content. The multiple regression reveals new information because the single regression does not account for intercorrelation among plant characteristics. Specifically, high salicylate plants tended to have low water contents (and low beetle growth). In a single regression we may not detect a relationship between salicylate content and survival because the effect of salicylates is cancelled out by the low water content of high salicylate plants. In the multiple regression, both variables are included, and the effect of salicylate content is adjusted for the relationship between water content and survival.

Comparing the multiple regression to the single regressions

Return to the output from your multiple regression model (which shows up as a Fit Least Squares window), click the red triangle next to Response survival, select Save Columns, and Effect Leverage Pairs.
This operation saves the data from the leverage plots shown on the right hand side of the regression output. Save your file as calif_beetle_survivalmod.jmp. The Effect Leverage plots show the relationship between each predictor variable and Y, after adjustment by inclusion of the other predictor variables in the model. Data from multiple regressions are often presented this way (see Fig 6.4, page 126 of Quinn and Keough for an example). Use the Fit Y by X platform to conduct three regressions, one of survival on each X variable. Compare the degree of scatter around the regression lines in the single regressions to that observed in the leverage plots (which you can also examine using the Fit Y by X platform). In which plots do you see less scatter (leverage from the multiple regression or the simple linear regression)? Write a brief paragraph describing how the effect of each variable depended on whether it was regressed against survival alone or included in a multiple regression model.

Tasks

1. Evaluate the four scatterplots found in the file scatterplotsf03.jmp. Write a brief description of whether any of the scatterplots indicate problems with the data or with the relationship between X and Y. Use residual plots to help you evaluate the plots.
2. Learn to interpret the output from a multiple regression.
3. Describe how two examples conform to the principles of multiple regression discussed in class: one without intercorrelation among independent variables and the other with intercorrelation. You will include some tables here.
4. Write a brief paragraph summarizing your observations about the scatter around the regression line for leverage plots resulting from a multiple regression versus single linear regressions, and describing how the effect of each independent variable depended on whether it was regressed against the dependent variable alone or included in a multiple regression model.
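The comparison asked for in Task 4 can be sketched with the partial regression (added-variable) construction, which is closely related to the leverage plots JMP draws for a continuous effect: remove the other predictor from both Y and the predictor of interest, then regress residuals on residuals. The values below are hypothetical (not the beetle data), and the names focus_x and other_x are our own. They are chosen so that the two predictors are negatively correlated, which masks the effect of focus_x in a single regression, much like the salicylate and water content situation described in Exercise 3.

```python
def centered(v):
    """Subtract the mean from every value."""
    m = sum(v) / len(v)
    return [vi - m for vi in v]


def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))


def residuals_after(y, x):
    """Residuals of y after a simple regression on x (intercept included)."""
    yc, xc = centered(y), centered(x)
    slope = dot(xc, yc) / dot(xc, xc)
    return [yi - slope * xi for yi, xi in zip(yc, xc)]


# Hypothetical values (NOT the beetle data); other_x and focus_x are
# negatively correlated, and y depends on both.
other_x = [1, 2, 3, 4, 5, 6, 7, 8]
focus_x = [8, 6, 7, 5, 4, 2, 3, 1]
y = [18.5, 15.5, 20.0, 18.5, 17.5, 16.5, 19.5, 18.0]

# Single regression of y on focus_x alone
fc, yc = centered(focus_x), centered(y)
single_slope = dot(fc, yc) / dot(fc, fc)
single_resid = residuals_after(y, focus_x)
single_scatter = dot(single_resid, single_resid)   # SS around the single-regression line

# Added-variable version: adjust both y and focus_x for other_x first
ey = residuals_after(y, other_x)
ex = residuals_after(focus_x, other_x)
adjusted_slope = dot(ex, ey) / dot(ex, ex)
adjusted_resid = [a - adjusted_slope * b for a, b in zip(ey, ex)]
adjusted_scatter = dot(adjusted_resid, adjusted_resid)  # SS around the adjusted line

print(f"single regression:  slope={single_slope:.3f}  SS around line={single_scatter:.3f}")
print(f"after adjustment:   slope={adjusted_slope:.3f}  SS around line={adjusted_scatter:.3f}")
```

In this made-up case the adjusted slope is much steeper and the scatter around the adjusted line is much smaller than in the single regression; whether the same happens for each beetle variable is what you should check in your own output.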
Principles of multiple regression

The mean of the RV (the regression variable, i.e. the predicted values) is equal to the mean of Y.
The mean of the error variable equals zero.
SS RV + SS Err = SS Y
Multiple r² = SS RV / SS Y
The error variable is uncorrelated with the RV and with the X variables.

If X variables are not intercorrelated
Partial regression slopes in the multiple regression are equal to slopes of X variables individually regressed on Y.
Sum of squared correlation coefficients = multiple r²

If X variables are intercorrelated
Partial regression slopes in the multiple regression differ from slopes of X variables individually regressed on Y.
Sum of squared correlation coefficients ≠ multiple r² (usually the sum is greater, because correlated X variables explain overlapping variation in Y; it can be smaller when one X variable masks the effect of another)
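These principles can be checked numerically. The sketch below assumes hypothetical data (not the beetle data) and, to stay short, uses the closed-form solution for a regression with exactly two predictors; check_principles is our own helper name. It is run once with uncorrelated predictors and once with intercorrelated ones.

```python
def centered(v):
    """Subtract the mean from every value."""
    m = sum(v) / len(v)
    return [vi - m for vi in v]


def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))


def check_principles(x1, x2, y):
    """Two-predictor least squares (closed form) plus the handout's quantities."""
    c1, c2, cy = centered(x1), centered(x2), centered(y)
    s11, s22, s12 = dot(c1, c1), dot(c2, c2), dot(c1, c2)
    s1y, s2y, syy = dot(c1, cy), dot(c2, cy), dot(cy, cy)
    det = s11 * s22 - s12 ** 2
    b1 = (s1y * s22 - s2y * s12) / det   # partial regression slopes
    b2 = (s2y * s11 - s1y * s12) / det
    mean_y = sum(y) / len(y)
    rv = [mean_y + b1 * u + b2 * v for u, v in zip(c1, c2)]  # predicted values
    err = [yi - pi for yi, pi in zip(y, rv)]                 # error variable
    ss_rv = sum((pi - mean_y) ** 2 for pi in rv)
    return {
        "single_slopes": (s1y / s11, s2y / s22),
        "partial_slopes": (b1, b2),
        "sum_r_square": s1y ** 2 / (s11 * syy) + s2y ** 2 / (s22 * syy),
        "multiple_r_square": ss_rv / syy,
        "mean_rv_minus_mean_y": sum(rv) / len(rv) - mean_y,
        "mean_error": sum(err) / len(err),
        "ss_rv_plus_ss_err_minus_ss_y": ss_rv + dot(err, err) - syy,
        "error_dot_x": (dot(err, c1), dot(err, c2)),  # zero means uncorrelated
    }


# Hypothetical response and predictors (NOT the beetle data)
y = [3, 4, 6, 6, 9, 9, 12, 11]
uncorrelated = check_principles([-3, -1, 1, 3, -3, -1, 1, 3],
                                [-1, -1, -1, -1, 1, 1, 1, 1], y)
correlated = check_principles([1, 2, 3, 4, 5, 6, 7, 8],
                              [2, 1, 4, 3, 6, 5, 8, 7], y)

for label, out in (("uncorrelated Xs", uncorrelated), ("intercorrelated Xs", correlated)):
    print(label)
    for name, value in out.items():
        print(f"  {name}: {value}")
```

In the uncorrelated run the single and partial slopes agree and the sum of the squared correlations matches multiple r²; in the intercorrelated run both differ, and for these particular numbers the sum of squared correlations exceeds multiple r².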
Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 8 One Way ANOVA and comparisons among means Introduction
Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 8 One Way ANOVA and comparisons among means Introduction In this exercise, we will conduct one-way analyses of variance using two different
More informationMultiple Linear Regression Analysis
Revised July 2018 Multiple Linear Regression Analysis This set of notes shows how to use Stata in multiple regression analysis. It assumes that you have set Stata up on your computer (see the Getting Started
More informationCRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys
Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests
More informationExample of Interpreting and Applying a Multiple Regression Model
Example of Interpreting and Applying a Multiple Regression We'll use the same data set as for the bivariate correlation example -- the criterion is 1 st year graduate grade point average and the predictors
More informationCHAPTER TWO REGRESSION
CHAPTER TWO REGRESSION 2.0 Introduction The second chapter, Regression analysis is an extension of correlation. The aim of the discussion of exercises is to enhance students capability to assess the effect
More information10. LINEAR REGRESSION AND CORRELATION
1 10. LINEAR REGRESSION AND CORRELATION The contingency table describes an association between two nominal (categorical) variables (e.g., use of supplemental oxygen and mountaineer survival ). We have
More informationSimple Linear Regression
Simple Linear Regression Assoc. Prof Dr Sarimah Abdullah Unit of Biostatistics & Research Methodology School of Medical Sciences, Health Campus Universiti Sains Malaysia Regression Regression analysis
More informationPreliminary Report on Simple Statistical Tests (t-tests and bivariate correlations)
Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) After receiving my comments on the preliminary reports of your datasets, the next step for the groups is to complete
More informationMultiple Regression Using SPSS/PASW
MultipleRegressionUsingSPSS/PASW The following sections have been adapted from Field (2009) Chapter 7. These sections have been edited down considerablyandisuggest(especiallyifyou reconfused)thatyoureadthischapterinitsentirety.youwillalsoneed
More informationFrom Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1
From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Contents Dedication... iii Acknowledgments... xi About This Book... xiii About the Author... xvii Chapter 1: Introduction...
More informationDaniel Boduszek University of Huddersfield
Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multiple Regression (MR) Types of MR Assumptions of MR SPSS procedure of MR Example based on prison data Interpretation of
More informationSCATTER PLOTS AND TREND LINES
1 SCATTER PLOTS AND TREND LINES LEARNING MAP INFORMATION STANDARDS 8.SP.1 Construct and interpret scatter s for measurement to investigate patterns of between two quantities. Describe patterns such as
More informationSimple Linear Regression the model, estimation and testing
Simple Linear Regression the model, estimation and testing Lecture No. 05 Example 1 A production manager has compared the dexterity test scores of five assembly-line employees with their hourly productivity.
More informationCHAPTER ONE CORRELATION
CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to
More informationbivariate analysis: The statistical analysis of the relationship between two variables.
bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for
More informationSimple Linear Regression One Categorical Independent Variable with Several Categories
Simple Linear Regression One Categorical Independent Variable with Several Categories Does ethnicity influence total GCSE score? We ve learned that variables with just two categories are called binary
More informationRegression Including the Interaction Between Quantitative Variables
Regression Including the Interaction Between Quantitative Variables The purpose of the study was to examine the inter-relationships among social skills, the complexity of the social situation, and performance
More informationCHILD HEALTH AND DEVELOPMENT STUDY
CHILD HEALTH AND DEVELOPMENT STUDY 9. Diagnostics In this section various diagnostic tools will be used to evaluate the adequacy of the regression model with the five independent variables developed in
More informationUnderstandable Statistics
Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement
More informationChapter 3 CORRELATION AND REGRESSION
CORRELATION AND REGRESSION TOPIC SLIDE Linear Regression Defined 2 Regression Equation 3 The Slope or b 4 The Y-Intercept or a 5 What Value of the Y-Variable Should be Predicted When r = 0? 7 The Regression
More informationBusiness Statistics Probability
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationChapter 3: Examining Relationships
Name Date Per Key Vocabulary: response variable explanatory variable independent variable dependent variable scatterplot positive association negative association linear correlation r-value regression
More informationisc ove ring i Statistics sing SPSS
isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements
More informationChapter 3: Describing Relationships
Chapter 3: Describing Relationships Objectives: Students will: Construct and interpret a scatterplot for a set of bivariate data. Compute and interpret the correlation, r, between two variables. Demonstrate
More informationMULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES
24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter
More informationDaniel Boduszek University of Huddersfield
Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Correlation SPSS procedure for Pearson r Interpretation of SPSS output Presenting results Partial Correlation Correlation
More informationApplied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business
Applied Medical Statistics Using SAS Geoff Der Brian S. Everitt CRC Press Taylor Si Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Croup, an informa business A
More information12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2
PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 Selecting a statistical test Relationships among major statistical methods General Linear Model and multiple regression Special
More informationCorrelation and Regression
Dublin Institute of Technology ARROW@DIT Books/Book Chapters School of Management 2012-10 Correlation and Regression Donal O'Brien Dublin Institute of Technology, donal.obrien@dit.ie Pamela Sharkey Scott
More informationStatistics for Psychology
Statistics for Psychology SIXTH EDITION CHAPTER 12 Prediction Prediction a major practical application of statistical methods: making predictions make informed (and precise) guesses about such things as
More informationStill important ideas
Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement
More informationSTATISTICS & PROBABILITY
STATISTICS & PROBABILITY LAWRENCE HIGH SCHOOL STATISTICS & PROBABILITY CURRICULUM MAP 2015-2016 Quarter 1 Unit 1 Collecting Data and Drawing Conclusions Unit 2 Summarizing Data Quarter 2 Unit 3 Randomness
More information2 Assumptions of simple linear regression
Simple Linear Regression: Reliability of predictions Richard Buxton. 2008. 1 Introduction We often use regression models to make predictions. In Figure?? (a), we ve fitted a model relating a household
More informationMath 075 Activities and Worksheets Book 2:
Math 075 Activities and Worksheets Book 2: Linear Regression Name: 1 Scatterplots Intro to Correlation Represent two numerical variables on a scatterplot and informally describe how the data points are
More informationLAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival*
LAB ASSIGNMENT 4 1 INFERENCES FOR NUMERICAL DATA In this lab assignment, you will analyze the data from a study to compare survival times of patients of both genders with different primary cancers. First,
More informationLab 5a Exploring Correlation
Lab 5a Exploring Correlation The correlation coefficient measures how tightly the points on a scatterplot cluster around a line. In this lab we will examine scatterplots and correlation coefficients for
More informationTEACHING REGRESSION WITH SIMULATION. John H. Walker. Statistics Department California Polytechnic State University San Luis Obispo, CA 93407, U.S.A.
Proceedings of the 004 Winter Simulation Conference R G Ingalls, M D Rossetti, J S Smith, and B A Peters, eds TEACHING REGRESSION WITH SIMULATION John H Walker Statistics Department California Polytechnic
More informationSection 3 Correlation and Regression - Teachers Notes
The data are from the paper: Exploring Relationships in Body Dimensions Grete Heinz and Louis J. Peterson San José State University Roger W. Johnson and Carter J. Kerk South Dakota School of Mines and
More informationBangor University Laboratory Exercise 1, June 2008
Laboratory Exercise, June 2008 Classroom Exercise A forest land owner measures the outside bark diameters at.30 m above ground (called diameter at breast height or dbh) and total tree height from ground
More informationStat 13, Lab 11-12, Correlation and Regression Analysis
Stat 13, Lab 11-12, Correlation and Regression Analysis Part I: Before Class Objective: This lab will give you practice exploring the relationship between two variables by using correlation, linear regression
More information5 To Invest or not to Invest? That is the Question.
5 To Invest or not to Invest? That is the Question. Before starting this lab, you should be familiar with these terms: response y (or dependent) and explanatory x (or independent) variables; slope and
More informationData Analysis with SPSS
Data Analysis with SPSS A First Course in Applied Statistics Fourth Edition Stephen Sweet Ithaca College Karen Grace-Martin The Analysis Factor Allyn & Bacon Boston Columbus Indianapolis New York San Francisco
More informationUsing SPSS for Correlation
Using SPSS for Correlation This tutorial will show you how to use SPSS version 12.0 to perform bivariate correlations. You will use SPSS to calculate Pearson's r. This tutorial assumes that you have: Downloaded
More informationLinear Regression in SAS
1 Suppose we wish to examine factors that predict patient s hemoglobin levels. Simulated data for six patients is used throughout this tutorial. data hgb_data; input id age race $ bmi hgb; cards; 21 25
More informationPitfalls in Linear Regression Analysis
Pitfalls in Linear Regression Analysis Due to the widespread availability of spreadsheet and statistical software for disposal, many of us do not really have a good understanding of how to use regression
More informationRegression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES
CHAPTER SIXTEEN Regression NOTE TO INSTRUCTORS This chapter includes a number of complex concepts that may seem intimidating to students. Encourage students to focus on the big picture through some of
More informationThe North Carolina Health Data Explorer
The North Carolina Health Data Explorer The Health Data Explorer provides access to health data for North Carolina counties in an interactive, user-friendly atlas of maps, tables, and charts. It allows
More informationMULTIPLE OLS REGRESSION RESEARCH QUESTION ONE:
1 MULTIPLE OLS REGRESSION RESEARCH QUESTION ONE: Predicting State Rates of Robbery per 100K We know that robbery rates vary significantly from state-to-state in the United States. In any given state, we
More information11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES
Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are
More informationOverview of Lecture. Survey Methods & Design in Psychology. Correlational statistics vs tests of differences between groups
Survey Methods & Design in Psychology Lecture 10 ANOVA (2007) Lecturer: James Neill Overview of Lecture Testing mean differences ANOVA models Interactions Follow-up tests Effect sizes Parametric Tests
More informationSample Exam Paper Answer Guide
Sample Exam Paper Answer Guide Notes This handout provides perfect answers to the sample exam paper. I would not expect you to be able to produce such perfect answers in an exam. So, use this document
More informationMath 215, Lab 7: 5/23/2007
Math 215, Lab 7: 5/23/2007 (1) Parametric versus Nonparamteric Bootstrap. Parametric Bootstrap: (Davison and Hinkley, 1997) The data below are 12 times between failures of airconditioning equipment in
More information1 Version SP.A Investigate patterns of association in bivariate data
Claim 1: Concepts and Procedures Students can explain and apply mathematical concepts and carry out mathematical procedures with precision and fluency. Content Domain: Statistics and Probability Target
More information6. Unusual and Influential Data
Sociology 740 John ox Lecture Notes 6. Unusual and Influential Data Copyright 2014 by John ox Unusual and Influential Data 1 1. Introduction I Linear statistical models make strong assumptions about the
More informationSTATISTICS INFORMED DECISIONS USING DATA
STATISTICS INFORMED DECISIONS USING DATA Fifth Edition Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation Learning Objectives 1. Draw and interpret scatter diagrams
More informationIntroduction to regression
Introduction to regression Regression describes how one variable (response) depends on another variable (explanatory variable). Response variable: variable of interest, measures the outcome of a study
More informationList of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition
List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing
More informationProblem Set 3 ECN Econometrics Professor Oscar Jorda. Name. ESSAY. Write your answer in the space provided.
Problem Set 3 ECN 140 - Econometrics Professor Oscar Jorda Name ESSAY. Write your answer in the space provided. 1) Sir Francis Galton, a cousin of James Darwin, examined the relationship between the height
Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
First of two parts Joseph Hogan Brown University and AMPATH
First of two parts Joseph Hogan Brown University and AMPATH Overview What is regression? Does regression have to be linear? Case study: Modeling the relationship between weight and CD4 count Exploratory
Analysis of Variance (ANOVA) Program Transcript
Analysis of Variance (ANOVA) Program Transcript DR. JENNIFER ANN MORROW: Welcome to Analysis of Variance. My name is Dr. Jennifer Ann Morrow. In today's demonstration, I'll review with you the definition
MULTIPLE REGRESSION OF CPS DATA
MULTIPLE REGRESSION OF CPS DATA A further inspection of the relationship between hourly wages and education level can show whether other factors, such as gender and work experience, influence wages. Linear
BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA
BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA Single factor or One-Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group
Chapter 1: Exploring Data
Chapter 1: Exploring Data Key Vocabulary: individual, variable, frequency table, relative frequency table, distribution, pie chart, bar graph, two-way table, marginal distributions, conditional distributions
This tutorial presentation is prepared by. Mohammad Ehsanul Karim
STATA: The Red tutorial This tutorial presentation is prepared by Mohammad Ehsanul Karim ehsan.karim@gmail.com
Statistical reports Regression, 2010
Statistical reports Regression, 2010 Niels Richard Hansen June 10, 2010 This document gives some guidelines on how to write a report on a statistical analysis. The document is organized into sections that
Class 7 Everything is Related
Class 7 Everything is Related Correlational Designs Topics Types of Correlational Designs Understanding Correlation Reporting Correlational Statistics Quantitative Designs Types of Correlational
Psychology of Perception Psychology 4165, Spring 2003 Laboratory 1 Weight Discrimination
Psychology 4165, Laboratory 1 Weight Discrimination [Figure: Weight Discrimination Performance; y-axis: Probability of "Heavier" Response; x-axis: Weight of Test Stimulus (grams)]
Psychology of Perception Psychology 4165, Fall 2001 Laboratory 1 Weight Discrimination
Psychology 4165, Laboratory 1 Weight Discrimination [Figure: Weight Discrimination Performance; y-axis: Probability of "Heavier" Response; x-axis: Weight of Test Stimulus (grams)]
Measuring the User Experience
Measuring the User Experience Collecting, Analyzing, and Presenting Usability Metrics Chapter 2 Background Tom Tullis and Bill Albert Morgan Kaufmann, 2008 ISBN 978-0123735584 Introduction Purpose Provide
THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE
Spring 2011, Volume 1, Issue 1 THE STATSWHISPERER The StatsWhisperer Newsletter is published by staff at StatsWhisperer. Visit us at: www.statswhisperer.com Introduction to this Issue The current issue
Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F
Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions
An Introduction to Modern Econometrics Using Stata
An Introduction to Modern Econometrics Using Stata CHRISTOPHER F. BAUM Department of Economics Boston College A Stata Press Publication StataCorp LP College Station, Texas Contents Illustrations Preface
Normal Q-Q. Residuals vs Fitted. Standardized residuals. Theoretical Quantiles. Fitted values. Scale-Location. Residuals vs Leverage
[Figure: regression diagnostic plots. Panels: Residuals vs Fitted (Residuals against Fitted values), Normal Q-Q (Standardized residuals against Theoretical Quantiles)]
Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties
Application of Local Control Strategy in analyses of the effects of Radon on Lung Cancer Mortality for 2,881 US Counties Bob Obenchain, Risk Benefit Statistics, August 2015 Our motivation for using a Cut-Point
NORTH SOUTH UNIVERSITY TUTORIAL 2
NORTH SOUTH UNIVERSITY TUTORIAL 2 AHMED HOSSAIN, PhD Data Management and Analysis Correlation Analysis INTRODUCTION In correlation analysis, we estimate
The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation Multivariate Analysis of Variance
The SAGE Encyclopedia of Educational Research, Measurement, Multivariate Analysis of Variance Contributors: David W. Stockburger Edited by: Bruce B. Frey Book Title: Chapter Title: "Multivariate Analysis
Section 3.2 Least-Squares Regression
Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.
Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables
Chapter 3: Investigating associations between two variables Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Extract from Study Design Key knowledge
Intro to SPSS. Using SPSS through WebFAS
Intro to SPSS Using SPSS through WebFAS http://www.yorku.ca/computing/students/labs/webfas/ Try it early (make sure it works from your computer) If you need help contact UIT Client Services Voice: 416-736-5800
Reveal Relationships in Categorical Data
SPSS Categories 15.0 Specifications Reveal Relationships in Categorical Data Unleash the full potential of your data through perceptual mapping, optimal scaling, preference scaling, and dimension reduction
Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.
Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Activity 1 Examining Data From Class Background Download
Biostatistics II
Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,
Multiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University
Multiple Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University Multiple Regression 1 The Multiple
Chapter 14: More Powerful Statistical Methods
Chapter 14: More Powerful Statistical Methods Most questions will be on correlation and regression analysis, but I would like you to know just basically what cluster analysis, factor analysis, and conjoint
How to Conduct On-Farm Trials. Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona
How to Conduct On-Farm Trials Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona How can you determine whether a treatment (this might be an additive, a fertilizer, snake
3.2A Least-Squares Regression
3.2A Least-Squares Regression Linear (straight-line) relationships between two quantitative variables are pretty common and easy to understand. Our instinct when looking at a scatterplot of data is to
SPSS Portfolio. Brittany Murray BUSA MWF 1:00pm-1:50pm
SPSS Portfolio Brittany Murray BUSA 2182 MWF 1:00pm-1:50pm Table Of Contents I) SPSS Computer Lab Assignment # 1 Frequency Distribution a) Cover Page b) Explanatory Paragraph c) Appendix II) SPSS Computer
Effects of Nutrients on Shrimp Growth
Data Set 5: Effects of Nutrients on Shrimp Growth Statistical setting This Handout is an example of extreme collinearity of the independent variables, and of the methods used for diagnosing this problem.
Dr. Kelly Bradley Final Exam Summer {2 points} Name
{2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.
Lab 8: Multiple Linear Regression
Lab 8: Multiple Linear Regression 1 Grading the Professor Many college courses conclude by giving students the opportunity to evaluate the course and the instructor anonymously. However, the use of these
ANOVA in SPSS (Practical)
ANOVA in SPSS (Practical) Analysis of Variance practical In this practical we will investigate how we model the influence of a categorical predictor on a continuous response. Centre for Multilevel Modelling
The Effectiveness of Captopril
Lab 7 The Effectiveness of Captopril In the United States, pharmaceutical manufacturers go through a very rigorous process in order to get their drugs approved for sale. This process is designed to determine
Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression
Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression • Equation of Regression Line; Residuals • Effect of Explanatory/Response Roles • Unusual Observations • Sample
Introduction to Multilevel Models for Longitudinal and Repeated Measures Data
Introduction to Multilevel Models for Longitudinal and Repeated Measures Data Today's Class: Features of longitudinal data Features of longitudinal models What can MLM do for you? What to expect in this
Survey Project Data Analysis Guide
Survey Project Data Analysis Guide I. Computing Scale Scores. - In the data file that I have given you, I have already done the following. - Selected the items that will be used for the Radford Morality
A Penny for Your Thoughts: Scientific Measurements and Introduction to Excel
A Penny for Your Thoughts: Scientific Measurements and Introduction to Excel Pre-lab Assignment: Introduction Reading: 1. Chapter sections 1.4 through 1.6 in your course text. 2. This lab handout. Questions:
Lesson 9: Two Factor ANOVAS
Published on Agron 513 (https://courses.agron.iastate.edu/agron513) Lesson 9: Two Factor ANOVAS Developed by: Ron Mowers, Marin Harbur, and Ken Moore Completion Time: 1 week Introduction
EXPERIMENT 3 ENZYMATIC QUANTITATION OF GLUCOSE
EXPERIMENT 3 ENZYMATIC QUANTITATION OF GLUCOSE This is a team experiment. Each team will prepare one set of reagents; each person will do an individual unknown and each team will submit a single report.