Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression

Similar documents
Lecture 12: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression

Chapter 3 CORRELATION AND REGRESSION

Section 3.2 Least-Squares Regression

Regression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions

3.2 Least- Squares Regression

Chapter 3: Examining Relationships

AP Statistics Practice Test Ch. 3 and Previous

Chapter 3: Describing Relationships

Chapter 3 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

3.2A Least-Squares Regression

HW 3.2: page 193 #35-51 odd, 55, odd, 69, 71-78

Math 075 Activities and Worksheets Book 2:

STATISTICS INFORMED DECISIONS USING DATA

STAT 201 Chapter 3. Association and Regression

CHAPTER 3 Describing Relationships

Homework #3. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Unit 1 Exploring and Understanding Data

bivariate analysis: The statistical analysis of the relationship between two variables.

INTERPRET SCATTERPLOTS

Relationships. Between Measurements Variables. Chapter 10. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

Introduction to regression

Midterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do.

Examining Relationships Least-squares regression. Sections 2.3

Lecture 12 Cautions in Analyzing Associations

Chapter 1: Exploring Data

1.4 - Linear Regression and MS Excel

Lecture 10: Chapter 5, Section 2 Relationships (Two Categorical Variables)

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

Reminders/Comments. Thanks for the quick feedback I ll try to put HW up on Saturday and I ll you

7) Briefly explain why a large value of r 2 is desirable in a regression setting.

Practice First Midterm Exam

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review

10/4/2007 MATH 171 Name: Dr. Lunsford Test Points Possible

Homework Linear Regression Problems should be worked out in your notebook

Chapter 4: Scatterplots and Correlation

Correlation and regression

Scatter Plots and Association

REVIEW PROBLEMS FOR FIRST EXAM

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

CHAPTER ONE CORRELATION

3.4 What are some cautions in analyzing association?

MEASURES OF ASSOCIATION AND REGRESSION

14.1: Inference about the Model

CHILD HEALTH AND DEVELOPMENT STUDY

Chapter 14. Inference for Regression Inference about the Model 14.1 Testing the Relationship Signi!cance Test Practice

IAPT: Regression. Regression analyses

6. Unusual and Influential Data

STATISTICS & PROBABILITY

3. For a $5 lunch with a 55 cent ($0.55) tip, what is the value of the residual?

Math 124: Module 2, Part II

This means that the explanatory variable accounts for or predicts changes in the response variable.

Multiple Regression Analysis

Eating and Sleeping Habits of Different Countries

SCATTER PLOTS AND TREND LINES

Homework 2 Math 11, UCSD, Winter 2018 Due on Tuesday, 23rd January

(a) 50% of the shows have a rating greater than: impossible to tell

Simple Linear Regression the model, estimation and testing

Chapter 3, Section 1 - Describing Relationships (Scatterplots and Correlation)

NORTH SOUTH UNIVERSITY TUTORIAL 2

Regression. Regression lines CHAPTER 5

12.1 Inference for Linear Regression. Introduction

Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points.

M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60

Example The median earnings of the 28 male students is the average of the 14th and 15th, or 3+3

Statistical Methods Exam I Review

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016

5 To Invest or not to Invest? That is the Question.

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES

EXECUTIVE SUMMARY DATA AND PROBLEM

MULTIPLE REGRESSION OF CPS DATA

Understandable Statistics

M 140 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

CHAPTER TWO REGRESSION

Chapter 1 Where Do Data Come From?

Multiple Choice Questions

Business Statistics Probability

(a) 50% of the shows have a rating greater than: impossible to tell

Correlation & Regression Exercises Chapters 14-15

STATS Relationships between variables: Correlation

Multiple Linear Regression Analysis

Political Science 15, Winter 2014 Final Review

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

Ordinary Least Squares Regression

How Faithful is the Old Faithful? The Practice of Statistics, 5 th Edition 1

Stat 13, Lab 11-12, Correlation and Regression Analysis

Paper Airplanes & Scientific Methods

WELCOME! Lecture 11 Thommy Perlinger

M15_BERE8380_12_SE_C15.6.qxd 2/21/11 8:21 PM Page Influence Analysis 1

Simple Linear Regression

appstats26.notebook April 17, 2015

Brad Pilon & John Barban

BIVARIATE DATA ANALYSIS

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.

UF#Stats#Club#STA#2023#Exam#1#Review#Packet# #Fall#2013#

TEACHING REGRESSION WITH SIMULATION. John H. Walker. Statistics Department California Polytechnic State University San Luis Obispo, CA 93407, U.S.A.

8.SP.1 Hand span and height

STATISTICS 201. Survey: Provide this Info. How familiar are you with these? Survey, continued IMPORTANT NOTE. Regression and ANOVA 9/29/2013

Name MATH0021Final Exam REVIEW UPDATED 8/18. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

about Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat.

MATH 2560 C F03 Elementary Statistics I LECTURE 6: Scatterplots (Continuation).

Transcription:

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression! Equation of Regression Line; Residuals! Effect of Explanatory/Response Roles! Unusual Observations! Sample vs. Population Least Squares Regression Line Summarize linear relationship between explanatory (x) and response (y) values with line that minimizes sum of squared prediction errors (called residuals).! Slope: predicted change in response y for every unit increase in explanatory value x! Intercept: where best-fitting line crosses y-axis (predicted response for x=0?) Cengage Learning Elementary Statistics: Looking at the Big Picture 1 Elementary Statistics: Looking at the Big Picture L12.4 Example: Least Squares Regression Line! Background: Car-buyer used software to regress price on age for 14 used Grand Am s. Example: Extrapolation! Background: Car-buyer used software to regress price on age for 14 used Grand Am s.! Question: What do the slope (-1,288) and intercept (14,690) tell us?! Response: " Slope: For each additional year in age, predict price " Intercept: Best-fitting line! Question: Should we predict a new Grand Am to cost $14,690-$1,288(0)=$14,690?! Response: Practice: 5.70f p.203 Elementary Statistics: Looking at the Big Picture L12.6 Practice: 5.54c p.197 Elementary Statistics: Looking at the Big Picture L12.8

Definition! Extrapolation: using the regression line to predict responses for explanatory values outside the range of those used to construct the line. Example: More Extrapolation! Background: A regression of 17 male students weights (lbs.) on heights (inches) yields the equation! Question: What weight does the line predict for a 20-inch-long infant?! Response: Elementary Statistics: Looking at the Big Picture L12.9 Practice: 5.36e p.193 Elementary Statistics: Looking at the Big Picture L12.11 Expressions for slope and intercept Consider slope and intercept of the least squares regression line! Slope: so if x increases by a standard deviation, predict y to increase by r standard deviations! Intercept: so when predict #the line passes through the point of averages Elementary Statistics: Looking at the Big Picture L12.13 Example: Individual Summaries on Scatterplot! Background: Car-buyer plotted price vs. age for 14 used Grand Ams [(4, 13,000), (8, 4,000), etc.]! Question: Guess the means and sds of age and price?! Response: Age has approx. mean yrs, sd yrs; price has approx. mean $, sd $. Practice: 5.50a-d p.196 Elementary Statistics: Looking at the Big Picture L12.17

Definitions! Residual: error in using regression line to predict y given x. It equals the vertical distance observed minus predicted which can be written! s: denotes typical residual size, calculated as Note: s just averages out the residuals Example: Considering Residuals! Background: Car-buyer regressed price on age for 14 used Grand Ams [(4, 13,000), (8, 4,000), etc.]. Note: Slope is negative so r is negative sq. root of R-Sq: -0.89.! Question: What does s = 2,175 tell us?! Response: Regression line predictions not perfect: " x=4$predict = actual y=13,000$prediction error = " x=8$predict = actual y=4,000$prediction error = " Typical size of 14 prediction errors is (dollars) Elementary Statistics: Looking at the Big Picture L12.18 Practice: 5.56a p.197 Elementary Statistics: Looking at the Big Picture L12.20 Example: Considering Residuals " Typical size of 14 prediction errors is s = 2,175 (dollars): Some points vertical distance from line more, some less; 2,175 is typical distance. Example: Typical Residual Size s close to or 0! Background: Scatterplots show relationships " Price per kilogram vs. price per lb. for groceries " Students final exam score vs. (number) order handed in Regression line approx. same as line at average y-value.! Questions: Which has s = 0? Which has s close to?! Responses: Plot on left has s = : no prediction errors. Plot on right: s close to. (Regressing on x doesn t help; regression line is approximately horizontal.) Elementary Statistics: Looking at the Big Picture L12.21 Practice: 5.58 p.198 Elementary Statistics: Looking at the Big Picture L12.23

Explanatory/Response Roles in Regression Our choice of roles, explanatory or response, does not affect the value of the correlation r, but it does affect the regression line. Example: Regression Line when Roles are Switched! Background: Compare regression of y on x (left) and regression of x on y (right) for same 4 points: Elementary Statistics: Looking at the Big Picture L12.24! Question: Do we get the same line regressing y on x as we do regressing x on y? Context needed;! Response: The lines are very different. consider variables " Regressing y on x: slope and their roles " Regressing x on y: slope before regressing. Elementary Statistics: Looking at the Big Picture L12.26 Practice: 5.60b p.198 Definitions! Outlier: (in regression) point with unusually large residual! Influential observation: point with high degree of influence on regression line. Example: Outliers and Influential Observations! Background: Exploring relationship between orders for new planes and fleet size. (r=+0.69) Elementary Statistics: Looking at the Big Picture L12.27! Question: Are Southwest and JetBlue outliers or influential?! Response: " Southwest: (omit it$slope changes a lot) " JetBlue: (large residual; omit it$r increases to +0.97) Elementary Statistics: Looking at the Big Picture L12.29 Practice: 5.70d p.203

Example: Outliers and Influential Observations! Background: Exploring relationship between orders for new planes and fleet size. (r = +0.69)! Question: How does Minitab classify Southwest and JetBlue?! Response: " Southwest: (marked in Minitab) " JetBlue: (marked in Minitab) Influential observations tend to be extreme in horizontal direction. Definitions! Slope : how much response y changes in general (for entire population) for every unit increase in explanatory variable x! Intercept : where the line that best fits all explanatory/response points (for entire population) crosses the y-axis Looking Back: Greek letters often refer to population parameters. Elementary Statistics: Looking at the Big Picture L12.31 Practice: 5.70g p.203 Elementary Statistics: Looking at the Big Picture L12.32 Line for Sample vs. Population! Sample: line best fitting sampled points: predicted response is Example: Role of Sample Size! Background: Relationship between ages of students mothers and fathers; both scatterplots have r = +0.78, but sample size is over 400 (on left) or just 5 (on right):! Population: line best fitting all points in population from which given points were sampled: mean response is A larger sample helps provide more evidence of a relationship between two quantitative variables in the general population. Elementary Statistics: Looking at the Big Picture L12.33! Question: Which plot provides more evidence of strong positive relationship in population?! Response: Plot on Can believe configuration on occurred by chance. Elementary Statistics: Looking at the Big Picture L12.35 Practice: 5.64 p.200

Additional Variables in Regression! Confounding Variable: Combining two groups that differ with respect to a variable that is related to both explanatory and response variables can affect the nature of their relationship.! Multiple Regression: More advanced treatments consider impact of not just one but two or more quantitative explanatory variables on a quantitative response. Example: Additional Variables! Background: A regression of phone time (in minutes the day before) and weight shows a negative relationship.! Questions: Do heavy people talk on the phone less? Do light people talk more?! Response: is confounding variable$regress separately for no relationship Elementary Statistics: Looking at the Big Picture L12.36 Practice: 5.113a p.219 Elementary Statistics: Looking at the Big Picture L12.38 Example: Multiple Regression! Background: We used a car s age to predict its price.! Question: What additional quantitative variable would help predict a car s price?! Response: Lecture Summary (Regression)! Equation of regression line " Interpreting slope and intercept " Extrapolation! Residuals: typical size is s! Line affected by explanatory/response roles! Outliers and influential observations! Line for sample or population; role of sample size! Additional variables Practice: 5.69b-d p.201 Elementary Statistics: Looking at the Big Picture L12.40 Elementary Statistics: Looking at the Big Picture L12.41