Homework Linear Regression Problems should be worked out in your notebook

Size: px
Start display at page:

Download "Homework Linear Regression Problems should be worked out in your notebook"

Transcription

1 Homework Linear Regression Problems should be worked out in your notebook 1. Following are the mean heights of Kalama children: Age (months) Height (cm) a) Sketch a scatter plot b) Describe the pattern of the scatterplot. c) What is the correlation coefficient? Interpret in terms of the problem. d) Calculate and interpret the slope. e) Calculate and interpret the y-intercept. f) Write the equation of the regression line. Draw the regression line. g) Predict the height of a 32 month old child. h) Make a residual plot and comment on whether a linear model is appropriate. 2. The average prices (in dollars) per ounce of gold and silver for the years 1986 through 1994 are given below. Year Gold Silver a. What is the explanatory variable? Explain. b. Find the regression line for gold predicting silver. c. Interpret the slope and y-intercept. d. What is the correlation coefficient? Interpret. e. Find the regression line for silver predicting gold. f. Interpret the slope and y-intercept. g. What is the correlation coefficient? Interpret. Compare your answer to part d. h. What is the coefficient of determination? Interpret. 3. Good runners take more steps per second as they speed up. Here are the average numbers of steps per second for a group of top female runners at different speeds. The speeds are in feet per second. Speed (ft/s) Steps per second a) You want to predict steps per second from running speed. Which is the explanatory variable? Make a scatterplot of the data with this goal in mind. b) Describe the pattern of the scatterplot. c) What is the correlation coefficient? Interpret in terms of the problem. d) Calculate and interpret the slope. e) Calculate and interpret the y-intercept. f) Write the equation of the regression line. Draw the regression line. g) If you need to cover 20 ft/s to win a race, predict the steps per second you ll need to maintain. h) Make a residual plot and comment on whether a linear model is appropriate.

2 4. Car dealers across North America use the Red Book to help them determine the value of used cars that their customers trade in when purchasing new cars. The book lists on a monthly basis the amount paid at recent used-car auctions and indicates the values according to condition and optional features, but does not inform the dealers as to how odometer readings affect the trade-in value. In an experiment to determine whether the odometer reading should be included, ten 3-year-old cars are randomly selected of the same make, condition, and options. The trade-in value (in $100) and mileage (in 1000s of miles) are shown below. Odometer Trade-in a) Describe the pattern of the scatterplot. b) Find the sample regression line for determining how the odometer reading affects the trade-in value of the car. c) Interpret the slope in terms of the problem. d) Calculate and interpret the correlation coefficient. e) Calculate and interpret the coefficient of determination. f) Predict the trade-in value of a car with 60,000 miles. g) What would be the odometer reading of a car with a trade-in value of $4200? h) Make a residual plot and comment on whether a linear model is appropriate. i) What is the residual for the car with 92,000 miles on the odometer? 5. In one of the Boston city parks there has been a problem with muggings in the summer months. A police cadet took a random sample of 10 days (out of the 90-day summer) and compiled the following data. For each day, x represents the number of police officers on duty in the park and y represents the number of reported muggings on that day.. x y a) Sketch a scatter plot. Describe the pattern of the scatterplot. b) What is the regression line? c) What is the correlation coefficient? Interpret in terms of the problem. d) Interpret the slope in terms of the problem. e) Find the coefficient of determination and interpret in terms of the problem. f) Predict the number of muggings if there are 9 police officers on duty. 6. Each of the following statements contains a blunder. Explain in each case what is wrong. a. There is a high correlation between the gender of American workers and their income b. We found a high correlation (r = 1.09) between students ratings of faculty teaching and ratings made by other faculty members. c. The correlation between planting rate and yield of corn was found to be r =.23 bushel.

3 7. Foal weight at birth is an indicator of health, so it is of interest to breeders of thoroughbred horses. Is foal weight related to the weight of the mare? The accompanying data are from the article Suckling Behavior Does Not Measure Milk Intake in Horses (animal Behavior [1999]) Observation Mare weight(kg) Foal weight(kg) a) Describe the pattern of the scatterplot. b) Find the equation of the regression line. c) Interpret the slope in terms of the problem. d) Interpret the y-intercept in terms of the problem. e) Calculate and interpret the correlation coefficient. f) Calculate and interpret the coefficient of determination. 8. The scatterplot shows the advertised prices (in thousands of dollars) plotted against ages (in years) for a random sample of Plymouth Voyagers on several dealers lots. A computer printout showing the results of a straight line to the data by the method of least squares gives: Price = Age R-sq = 75.5% a) Find the correlation coefficient for the relationship between price and age of Voyagers based on these data. b) What is the slope of the regression line? Interpret it in the context of these data. c) How will the size of the correlation coefficient change if the 10-year-old Voyager is removed from the data set? Explain. d) How will the slope of the LSRL change if the 10- year-old Voyager is removed from the data? Plymouth Voyagers Scatter Plot Price_ Age_in_years 9. One measure of the success of knee surgery is postsurgical range of motion for the knee joint. Postsurgical range of motion was recorded for 12 patients who had surgery following a knee dislocation. The age of each patient was also recorded ( Reconstruction American Journal of Sports Medicine). The average age was years and standard deviation of years. The average range of motion was degrees with a standard deviation of degrees. The correlation coefficient was r = a) If we use age to try and predict the range of motion, what is the slope? What is the y-intercept? Interpret the two in context of the problem. b) Use the regression line to predict the range of motion of someone 32 years of age. c) Use the regression line to predict the range of motion of someone 50 years of age. Do you feel this is an accurate prediction? Explain your thoughts.

4 10. Newsweek gave the following 1994 average weekly earnings from allowances, chores, work, and gifts for children of ages 4 through 12. Age Earnings $5. 87 $7. 42 $7. 62 $ $ $ $ $ $ a. Construct a scatter plot. Describe the pattern of the scatterplot. b. Interpret the slope in terms of the problem. c. Find the coefficient of determination and interpret in terms of the problem. d. Find the correlation coefficient and interpret in terms of the problem. e. Predict the weekly earnings of a child who is age 16. Do you think this is a good prediction? Explain. 11. The paper A Cross-National Relationship between Sugar Consumption and Major Depression? (Depression and Anxiety [2002]) concluded that there was a strong correlation ( r.9444 ) between refined sugar consumption (calories per person per day) and annual rate of major depression (cases per 100 people) based on data from 6 countries. The average sugar consumption was calories per person per day with a standard deviation of calories while the annual rate of depression was 4.26 cases with a standard deviation of cases. a) What is the slope of the regression line of annual rate of depression based on sugar consumption? What is the y-intercept? Interpret the two in context of the problem. b) Use the regression line to predict the depression rate of the United States if the average person consumes 300 calories per person per day. c) New Zealand s depression rate is 5.7 annual cases per 100 people. Use the model to find the possible sugar consumption. Does the regression line allow us to make this prediction? Explain. 12. How quickly can athletes return to their sport following injuries requiring surgery? The paper Arthroscopic Distal Clavicle Resection for Isolated Atraumatic Osteolysis in Weight Lifters (American Journal of Sports Medicine, 1998) discovered there was a moderate positive (r =.55) linear relationship between a lifters age and the number of days after arthroscopic shoulder surgery before being able to return to their sport between 10 weight lifters. The average age of the weight lifters was 30.4 with standard deviation of years. The average number of days before being able to return to their sport was 3.2 days with a standard deviation of days. a. Determine the line to predict the number of days based on the age of the weight lifter. b. Determine the coefficient of determination and interpret in terms of the problem. c. Given the spread of the lifters was from 26 to 34 years old, predict the number of days for a 28 year old lifter. Do you feel this prediction is accurate? Explain.

5 13. Success in hunting varies greatly among species of animals. Lions, who hunt singly, are rarely successful in more than 10 percent of their hunts. Wild African dogs, who hunt in packs, are among the most efficient of all hunters, succeeding at a rate of over 90 percent of their hunts. In the early 1960 s, researcher Jane Goodall discovered that chimpanzees were not solely vegetarian in their diets, as had previously been thought. This discovery spurred a tremendous amount of primate research. Some of the latest primatology research has been done on chimpanzees to find out if larger hunting parties increase the chances of a successful hunt. The results of one such research project are summarized in the table for the number of chimpanzees in the hunting party versus the percentage of successful hunts. Number of Chimps Percent of Success a. Construct a scatter plot. b. Determine the regression line. c. Interpret the y-intercept. Does the interpretation make sense in this context? d. Interpret the slope. e. Find the correlation coefficient and interpret in terms of the problem. f. Find the coefficient of determination and interpret in terms of the problem. g. Sketch the residual plot. Interpret in terms of the problem. 14. The following is a table of the number of registered automatic weapons (in thousands) of selected states and their corresponding murder rates. Weapons Rates a. Determine the regression line. b. Predict the number of weapons for a state with a rate of 8.5? c. Predict the murder rate for a state with 10,000 registered automatic weapons. 15. The following output data from MINITAB shows the height of girls (in cm) based on the number of years old. Predictor Coef Stdev t-ratio p Constant Age(yrs) s=1.518 R-sq=99.5% a) What is the equation of the least squares line? Interpret the slope. b) Find the correlation coefficient and coefficient of determination. Interpret in the context of the problem. c) Predict the height of a 3 year old girl. d) Predict the age if a girl is 135 cm.

6 16. Women made significant gains in the 1970 s in terms of their acceptance into professions that had been traditionally populated by men. To measure just how big these gains were, we will compare the percentage of professional degrees award to women in to the percentage awarded in for selected fields of student. Field Degrees in Degrees in Dentistry 2.0% 11.9% Law Medicine Optometry Osteopathic medicine Podiatry Theology Veterinary medicine a) What is the regression line? b) Interpret the slope in terms of the problem. c) Find the coefficient of determination and interpret in terms of the problem. d) Sketch the residual plot. Interpret. e) Find the residual for optometry. f) Find the residual for veterinary medicine. Did the regression line over or under predict? Explain. 17. Shells of mollusks function as both part of the skeletal system and as protective armor. It has been argued that many features of these shells were the result of natural selection in the constant battle against predators. The paper Postmortem Changes in Strength of Gastropod Shells included scatter plot of data on x = shell height (cm) and y = breaking strength (newtons). The least squares line for a sample of 38 hermit crab shells was y x. a. What are the slope and intercept of this line? b. When shell height increases by 1 cm, by how much does breaking strength tend to change? c. What breaking strength would you predict when shell height is 2 cm? d. Does this approximate linear relationship appear to hold for shell heights as small as 1 cm? Explain your thoughts. 18. Given the following data sets, find the regression line. Sketch the residual plot and comment on the likelihood of the regression line being a good model. x y x y

7 19. The data come from a study of ice cream consumption that spanned the springs and summers of three years. The ice cream consumption (pints per capita per year), family income of consumers ($1000 per year) and the temperature (degrees Fahrenheit) is listed below. Consumption Income Temperature a. Complete two scatter plots with consumption being the response variable for each plot. b. Find the two regression lines. c. Interpret the slopes. d. Interpret the coefficient of determinations. e. Sketch and interpret both residual plots. f. Which do you think is the better predictor of consumption? Explain. g. Predict the consumption for a temperature of 53 degrees. h. Predict the consumption for an income of $17,500. i. Predict the income and temperature for 3 gallons a year. 20. People with diabetes measure their fasting plasma glucose (FPG; measured in units of milligrams per milliliter) after fasting for at least 8 hours. Another measurement, made at regular medical checkups is called HbA. This is roughly the percent of red blood cells that have a glucose molecule attached. It measures average exposure to glucose over a period of several months. The table below gives data on both HbA and FPG for 18 diabetics five months after they had completed a diabetes education class. HbA FPG HbA FPG Subject (%) (mg/ml) Subject (%) (mg/ml) a) Sketch a scatter plot. Describe the scatterplot. Subject 15 is an outlier in the y direction. Subject 18 is an outlier in the x direction. b) Find the correlation and the regression line for all 18 subjects c) Find the correlation and the regression line when only subject 15 is removed. d) Find the correlation and the regression line when only subject 18 is removed. e) Are either or both of these points influential for the correlation? Explain why r changes in opposite directions when we remove each of these points. f) Is either Subject 15 or Subject 18 strongly influential for the least-squares line?

HW 3.2: page 193 #35-51 odd, 55, odd, 69, 71-78

HW 3.2: page 193 #35-51 odd, 55, odd, 69, 71-78 35. What s My Line? You use the same bar of soap to shower each morning. The bar weighs 80 grams when it is new. Its weight goes down by 6 grams per day on average. What is the equation of the regression

More information

AP Statistics Practice Test Ch. 3 and Previous

AP Statistics Practice Test Ch. 3 and Previous AP Statistics Practice Test Ch. 3 and Previous Name Date Use the following to answer questions 1 and 2: A researcher measures the height (in feet) and volume of usable lumber (in cubic feet) of 32 cherry

More information

Math 075 Activities and Worksheets Book 2:

Math 075 Activities and Worksheets Book 2: Math 075 Activities and Worksheets Book 2: Linear Regression Name: 1 Scatterplots Intro to Correlation Represent two numerical variables on a scatterplot and informally describe how the data points are

More information

Midterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do.

Midterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do. Midterm STAT-UB.0003 Regression and Forecasting Models The exam is closed book and notes, with the following exception: you are allowed to bring one letter-sized page of notes into the exam (front and

More information

3.2A Least-Squares Regression

3.2A Least-Squares Regression 3.2A Least-Squares Regression Linear (straight-line) relationships between two quantitative variables are pretty common and easy to understand. Our instinct when looking at a scatterplot of data is to

More information

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression! Equation of Regression Line; Residuals! Effect of Explanatory/Response Roles! Unusual Observations! Sample

More information

Lecture 12: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression

Lecture 12: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Lecture 12: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Equation of Regression Line; Residuals Effect of Explanatory/Response Roles Unusual Observations Sample

More information

M 140 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

M 140 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75 M 140 est 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDI! Problem Max. Points Your Points 1-10 10 11 10 12 3 13 4 14 18 15 8 16 7 17 14 otal 75 Multiple choice questions (1 point each) For questions

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Name Date Per Key Vocabulary: response variable explanatory variable independent variable dependent variable scatterplot positive association negative association linear correlation r-value regression

More information

3.4 What are some cautions in analyzing association?

3.4 What are some cautions in analyzing association? 3.4 What are some cautions in analyzing association? Objectives Extrapolation Outliers and Influential Observations Correlation does not imply causation Lurking variables and confounding Simpson s Paradox

More information

14.1: Inference about the Model

14.1: Inference about the Model 14.1: Inference about the Model! When a scatterplot shows a linear relationship between an explanatory x and a response y, we can use the LSRL fitted to the data to predict a y for a given x. However,

More information

Regression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions

Regression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 10 Correlation and Regression 10 1 Review and Preview 10 2 Correlation 10 3 Regression 10 4 Variation and Prediction Intervals

More information

Chapter 3 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Chapter 3 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question. Name: Class: Date: Chapter 3 Review Multiple Choice Identify the choice that best completes the statement or answers the question. Scenario 3-1 The height (in feet) and volume (in cubic feet) of usable

More information

Section 3.2 Least-Squares Regression

Section 3.2 Least-Squares Regression Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.

More information

Chapter 14. Inference for Regression Inference about the Model 14.1 Testing the Relationship Signi!cance Test Practice

Chapter 14. Inference for Regression Inference about the Model 14.1 Testing the Relationship Signi!cance Test Practice Chapter 14 Inference for Regression Our!nal topic of the year involves inference for the regression model. In Chapter 3 we learned how to!nd the Least Squares Regression Line for a set of bivariate data.

More information

STAT 201 Chapter 3. Association and Regression

STAT 201 Chapter 3. Association and Regression STAT 201 Chapter 3 Association and Regression 1 Association of Variables Two Categorical Variables Response Variable (dependent variable): the outcome variable whose variation is being studied Explanatory

More information

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph. STAT 280 Sample Test Problems Page 1 of 1 1. An English survey of 3000 medical records showed that smokers are more inclined to get depressed than non-smokers. Does this imply that smoking causes depression?

More information

3. For a $5 lunch with a 55 cent ($0.55) tip, what is the value of the residual?

3. For a $5 lunch with a 55 cent ($0.55) tip, what is the value of the residual? STATISTICS 216, SPRING 2006 Name: EXAM 1; February 21, 2006; 100 points. Instructions: Closed book. Closed notes. Calculator allowed. Double-sided exam. NO CELL PHONES. Multiple Choice (3pts each). Circle

More information

Lesson 1: Distributions and Their Shapes

Lesson 1: Distributions and Their Shapes Lesson 1 Name Date Lesson 1: Distributions and Their Shapes 1. Sam said that a typical flight delay for the sixty BigAir flights was approximately one hour. Do you agree? Why or why not? 2. Sam said that

More information

Homework #3. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Homework #3. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Homework #3 Name Due Due on on February Tuesday, Due on February 17th, Sept Friday 28th 17th, Friday SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill

More information

BIVARIATE DATA ANALYSIS

BIVARIATE DATA ANALYSIS BIVARIATE DATA ANALYSIS Sometimes, statistical studies are done where data is collected on two variables instead of one in order to establish whether there is a relationship between the two variables.

More information

STATISTICS INFORMED DECISIONS USING DATA

STATISTICS INFORMED DECISIONS USING DATA STATISTICS INFORMED DECISIONS USING DATA Fifth Edition Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation Learning Objectives 1. Draw and interpret scatter diagrams

More information

10/4/2007 MATH 171 Name: Dr. Lunsford Test Points Possible

10/4/2007 MATH 171 Name: Dr. Lunsford Test Points Possible Pledge: 10/4/2007 MATH 171 Name: Dr. Lunsford Test 1 100 Points Possible I. Short Answer and Multiple Choice. (36 points total) 1. Circle all of the items below that are measures of center of a distribution:

More information

7) Briefly explain why a large value of r 2 is desirable in a regression setting.

7) Briefly explain why a large value of r 2 is desirable in a regression setting. Directions: Complete each problem. A complete problem has not only the answer, but the solution and reasoning behind that answer. All work must be submitted on separate pieces of paper. 1) Manatees are

More information

3.2 Least- Squares Regression

3.2 Least- Squares Regression 3.2 Least- Squares Regression Linear (straight- line) relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these

More information

Reminders/Comments. Thanks for the quick feedback I ll try to put HW up on Saturday and I ll you

Reminders/Comments. Thanks for the quick feedback I ll try to put HW up on Saturday and I ll  you Reminders/Comments Thanks for the quick feedback I ll try to put HW up on Saturday and I ll email you Final project will be assigned in the last week of class You ll have that week to do it Participation

More information

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests

More information

Answer all three questions. All questions carry equal marks.

Answer all three questions. All questions carry equal marks. UNIVERSITY OF DUBLIN TRINITY COLLEGE Faculty of Engineering, Mathematics and Science School of Computer Science and Statistics Postgraduate Diploma in Statistics Trinity Term 2 Introduction to Regression

More information

SCATTER PLOTS AND TREND LINES

SCATTER PLOTS AND TREND LINES 1 SCATTER PLOTS AND TREND LINES LEARNING MAP INFORMATION STANDARDS 8.SP.1 Construct and interpret scatter s for measurement to investigate patterns of between two quantities. Describe patterns such as

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Objectives: Students will: Construct and interpret a scatterplot for a set of bivariate data. Compute and interpret the correlation, r, between two variables. Demonstrate

More information

STAT 135 Introduction to Statistics via Modeling: Midterm II Thursday November 16th, Name:

STAT 135 Introduction to Statistics via Modeling: Midterm II Thursday November 16th, Name: STAT 135 Introduction to Statistics via Modeling: Midterm II Thursday November 16th, 2017 Name: 1 1 Short Answer a) For each of these five regression scenarios, name an appropriate visualization (along

More information

Practice First Midterm Exam

Practice First Midterm Exam Practice First Midterm Exam Statistics 200 (Pfenning) This is a closed book exam worth 150 points. You are allowed to use a calculator and a two-sided sheet of notes. There are 9 problems, with point values

More information

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Activity 1 Examining Data From Class Background Download

More information

Regression. Regression lines CHAPTER 5

Regression. Regression lines CHAPTER 5 CHAPTER 5 NASA/GSFC Can scientists predict in advance how many hurricanes the coming season will bring? Exercise 5.44 has some data. Regression IN THIS CHAPTER WE COVER... Linear (straight-line) relationships

More information

5 To Invest or not to Invest? That is the Question.

5 To Invest or not to Invest? That is the Question. 5 To Invest or not to Invest? That is the Question. Before starting this lab, you should be familiar with these terms: response y (or dependent) and explanatory x (or independent) variables; slope and

More information

Unit 8 Day 1 Correlation Coefficients.notebook January 02, 2018

Unit 8 Day 1 Correlation Coefficients.notebook January 02, 2018 [a] Welcome Back! Please pick up a new packet Get a Chrome Book Complete the warm up Choose points on each graph and find the slope of the line. [b] Agenda 05 MIN Warm Up 25 MIN Notes Correlation 15 MIN

More information

Unit 8 Bivariate Data/ Scatterplots

Unit 8 Bivariate Data/ Scatterplots Unit 8 Bivariate Data/ Scatterplots Oct 20 9:19 PM Scatterplots are used to determine if there is a relationship between two variables. /Correlation /Correlation /Correlation Line of best fit cuts the

More information

Correlation & Regression Exercises Chapters 14-15

Correlation & Regression Exercises Chapters 14-15 Correlation & Regression Exercises Chapters 14-15 1. Which of these are true and which are false? Explain why the false statements are wrong. a. If the slope of the line is 1, then the correlation must

More information

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES 24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter

More information

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0% Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of

More information

STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS

STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS Circle the best answer. This scenario applies to Questions 1 and 2: A study was done to compare the lung capacity of coal miners to the lung

More information

M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60

M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60 M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-10 10 11 3 12 4 13 3 14 10 15 14 16 10 17 7 18 4 19 4 Total 60 Multiple choice questions (1 point each) For questions

More information

A response variable is a variable that. An explanatory variable is a variable that.

A response variable is a variable that. An explanatory variable is a variable that. Name:!!!! Date: Scatterplots The most common way to display the relation between two quantitative variable is a scatterplot. Statistical studies often try to show through scatterplots, that changing one

More information

Chapter 3, Section 1 - Describing Relationships (Scatterplots and Correlation)

Chapter 3, Section 1 - Describing Relationships (Scatterplots and Correlation) Chapter 3, Section 1 - Describing Relationships (Scatterplots and Correlation) Investigating relationships between variables is central to what we do in statistics. Why is it important to investigate and

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) 1) A) B) C) D)

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) 1) A) B) C) D) Exam Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) 1) A) B) C) D) Decide whether or not the conditions and assumptions for inference with

More information

(a) 50% of the shows have a rating greater than: impossible to tell

(a) 50% of the shows have a rating greater than: impossible to tell q 1. Here is a histogram of the Distribution of grades on a quiz. How many students took the quiz? What percentage of students scored below a 60 on the quiz? (Assume left-hand endpoints are included in

More information

Chapter 4: More about Relationships between Two-Variables

Chapter 4: More about Relationships between Two-Variables 1. Which of the following scatterplots corresponds to a monotonic decreasing function f(t)? A) B) C) D) G Chapter 4: More about Relationships between Two-Variables E) 2. Which of the following transformations

More information

UF#Stats#Club#STA#2023#Exam#1#Review#Packet# #Fall#2013#

UF#Stats#Club#STA#2023#Exam#1#Review#Packet# #Fall#2013# UF#Stats#Club#STA##Exam##Review#Packet# #Fall## The following data consists of the scores the Gators basketball team scored during the 8 games played in the - season. 84 74 66 58 79 8 7 64 8 6 78 79 77

More information

Chapter 3 CORRELATION AND REGRESSION

Chapter 3 CORRELATION AND REGRESSION CORRELATION AND REGRESSION TOPIC SLIDE Linear Regression Defined 2 Regression Equation 3 The Slope or b 4 The Y-Intercept or a 5 What Value of the Y-Variable Should be Predicted When r = 0? 7 The Regression

More information

Pre-Test Unit 9: Descriptive Statistics

Pre-Test Unit 9: Descriptive Statistics Pre-Test Unit 9: Descriptive Statistics You may use a calculator. The following table shows how many text messages different students sent this week. Answer the following questions using the table. 20

More information

Statistical Reasoning in Public Health 2009 Biostatistics 612, Homework #2

Statistical Reasoning in Public Health 2009 Biostatistics 612, Homework #2 Statistical Reasoning in Public Health 2009 Biostatistics 612, Homework #2 1. Suppose it is the year 1985 and you are doing research on the differences in wages earned by men and women in the U.S. workforce.

More information

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Chapter 3: Investigating associations between two variables Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Extract from Study Design Key knowledge

More information

(a) 50% of the shows have a rating greater than: impossible to tell

(a) 50% of the shows have a rating greater than: impossible to tell KEY 1. Here is a histogram of the Distribution of grades on a quiz. How many students took the quiz? 15 What percentage of students scored below a 60 on the quiz? (Assume left-hand endpoints are included

More information

Lab 5a Exploring Correlation

Lab 5a Exploring Correlation Lab 5a Exploring Correlation The correlation coefficient measures how tightly the points on a scatterplot cluster around a line. In this lab we will examine scatterplots and correlation coefficients for

More information

Chapter 4: More about Relationships between Two-Variables Review Sheet

Chapter 4: More about Relationships between Two-Variables Review Sheet Review Sheet 4. Which of the following is true? A) log(ab) = log A log B. D) log(a/b) = log A log B. B) log(a + B) = log A + log B. C) log A B = log A log B. 5. Suppose we measure a response variable Y

More information

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016 UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016 STAB22H3 Statistics I, LEC 01 and LEC 02 Duration: 1 hour and 45 minutes Last Name: First Name:

More information

Homework 2 Math 11, UCSD, Winter 2018 Due on Tuesday, 23rd January

Homework 2 Math 11, UCSD, Winter 2018 Due on Tuesday, 23rd January PID: Last Name, First Name: Section: Approximate time spent to complete this assignment: hour(s) Readings: Chapters 7, 8 and 9. Homework 2 Math 11, UCSD, Winter 2018 Due on Tuesday, 23rd January Exercise

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

REVIEW PROBLEMS FOR FIRST EXAM

REVIEW PROBLEMS FOR FIRST EXAM M358K Sp 6 REVIEW PROBLEMS FOR FIRST EXAM Please Note: This review sheet is not intended to tell you what will or what will not be on the exam. However, most of these problems have appeared on or are very

More information

CHILD HEALTH AND DEVELOPMENT STUDY

CHILD HEALTH AND DEVELOPMENT STUDY CHILD HEALTH AND DEVELOPMENT STUDY 9. Diagnostics In this section various diagnostic tools will be used to evaluate the adequacy of the regression model with the five independent variables developed in

More information

STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points.

STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points. STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points. For Questions 1 & 2: It is known that the distribution of starting salaries for MSU Education majors has

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

INTERPRET SCATTERPLOTS

INTERPRET SCATTERPLOTS Chapter2 MODELING A BUSINESS 2.1: Interpret Scatterplots 2.2: Linear Regression 2.3: Supply and Demand 2.4: Fixed and Variable Expenses 2.5: Graphs of Expense and Revenue Functions 2.6: Breakeven Analysis

More information

Semester 1 Final Scientific calculators are allowed, NO GRAPHING CALCULATORS. You must show all your work to receive full credit.

Semester 1 Final Scientific calculators are allowed, NO GRAPHING CALCULATORS. You must show all your work to receive full credit. Algebra 1 Name: Semester 1 Final Scientific calculators are allowed, NO GRAPHING CALCULATORS. You must show all your work to receive full credit. (F.IF.2 DOK 1) (1 point) 1. Evaluate the function when

More information

Introduction to regression

Introduction to regression Introduction to regression Regression describes how one variable (response) depends on another variable (explanatory variable). Response variable: variable of interest, measures the outcome of a study

More information

Stat 13, Lab 11-12, Correlation and Regression Analysis

Stat 13, Lab 11-12, Correlation and Regression Analysis Stat 13, Lab 11-12, Correlation and Regression Analysis Part I: Before Class Objective: This lab will give you practice exploring the relationship between two variables by using correlation, linear regression

More information

Section I: Multiple Choice Select the best answer for each question.

Section I: Multiple Choice Select the best answer for each question. Chapter 1 AP Statistics Practice Test (TPS- 4 p78) Section I: Multiple Choice Select the best answer for each question. 1. You record the age, marital status, and earned income of a sample of 1463 women.

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis Basic Concept: Extend the simple regression model to include additional explanatory variables: Y = β 0 + β1x1 + β2x2 +... + βp-1xp + ε p = (number of independent variables

More information

AP Stats Chap 27 Inferences for Regression

AP Stats Chap 27 Inferences for Regression AP Stats Chap 27 Inferences for Regression Finally, we re interested in examining how slopes of regression lines vary from sample to sample. Each sample will have it s own slope, b 1. These are all estimates

More information

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Visit us on the World Wide Web at: www.pearsoned.co.uk Pearson Education Limited 2014

More information

Beware of Confounding Variables

Beware of Confounding Variables Beware of Confounding Variables If I wanted to prove that smoking causes heart issues, what are some confounding variables? The object of an experiment is to prove that A causes B. A confounding variable

More information

12.1 Inference for Linear Regression. Introduction

12.1 Inference for Linear Regression. Introduction 12.1 Inference for Linear Regression vocab examples Introduction Many people believe that students learn better if they sit closer to the front of the classroom. Does sitting closer cause higher achievement,

More information

about Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat.

about Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat. Brad Pilon 1 2 3 ! For many people, the best thing about Eat Stop Eat is that there is the equivalent of two days a week where you don t have to worry about what you eat.! However, this still means there

More information

1.4 - Linear Regression and MS Excel

1.4 - Linear Regression and MS Excel 1.4 - Linear Regression and MS Excel Regression is an analytic technique for determining the relationship between a dependent variable and an independent variable. When the two variables have a linear

More information

STATISTICS 201. Survey: Provide this Info. How familiar are you with these? Survey, continued IMPORTANT NOTE. Regression and ANOVA 9/29/2013

STATISTICS 201. Survey: Provide this Info. How familiar are you with these? Survey, continued IMPORTANT NOTE. Regression and ANOVA 9/29/2013 STATISTICS 201 Survey: Provide this Info Outline for today: Go over syllabus Provide requested information on survey (handed out in class) Brief introduction and hands-on activity Name Major/Program Year

More information

Multiple Choice Questions

Multiple Choice Questions ACTM State Statistics Work the multiple choice questions first, selecting the single best response from those provided and entering it on your scantron form. You may write on this test and keep the portion

More information

EXECUTIVE SUMMARY DATA AND PROBLEM

EXECUTIVE SUMMARY DATA AND PROBLEM EXECUTIVE SUMMARY Every morning, almost half of Americans start the day with a bowl of cereal, but choosing the right healthy breakfast is not always easy. Consumer Reports is therefore calculated by an

More information

STAT445 Midterm Project1

STAT445 Midterm Project1 STAT445 Midterm Project1 Executive Summary This report works on the dataset of Part of This Nutritious Breakfast! In this dataset, 77 different breakfast cereals were collected. The dataset also explores

More information

Unit 3 Lesson 2 Investigation 4

Unit 3 Lesson 2 Investigation 4 Name: Investigation 4 ssociation and Causation Reports in the media often suggest that research has found a cause-and-effect relationship between two variables. For example, a newspaper article listed

More information

AP Statistics Practice Test Unit Seven Sampling Distributions. Name Period Date

AP Statistics Practice Test Unit Seven Sampling Distributions. Name Period Date AP Statistics Practice Test Unit Seven Sampling Distributions Name Period Date Vocabulary: 1. Define and provide an example of a statistic.. Define Sampling Distribution. 3. Define the variability of a

More information

Chapter 5: Summarizing Bivariate Data Review Pack

Chapter 5: Summarizing Bivariate Data Review Pack Chapter 5: Summarizing Bivariate Data Review Pack Name 1. What is it that the Pearson correlation coefficient quantifies? 2. If a scatter plot exhibits a strong positive relationship, what can be said

More information

MEASURES OF GROUP CHARACTERISTICS

MEASURES OF GROUP CHARACTERISTICS MEASURES OF GROUP CHARACTERISTICS You are familiar with the idea of measuring things -- a person s height, a steak s weight, a car s value (its selling price). The purpose of any measurement is that it

More information

CHAPTER 3 Describing Relationships

CHAPTER 3 Describing Relationships CHAPTER 3 Describing Relationships 3.1 Scatterplots and Correlation The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Reading Quiz 3.1 True/False 1.

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Statistics Final Review Semeter I Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) The Centers for Disease

More information

Caffeine & Calories in Soda. Statistics. Anthony W Dick

Caffeine & Calories in Soda. Statistics. Anthony W Dick 1 Caffeine & Calories in Soda Statistics Anthony W Dick 2 Caffeine & Calories in Soda Description of Experiment Does the caffeine content in soda have anything to do with the calories? This is the question

More information

STATISTICS & PROBABILITY

STATISTICS & PROBABILITY STATISTICS & PROBABILITY LAWRENCE HIGH SCHOOL STATISTICS & PROBABILITY CURRICULUM MAP 2015-2016 Quarter 1 Unit 1 Collecting Data and Drawing Conclusions Unit 2 Summarizing Data Quarter 2 Unit 3 Randomness

More information

Welcome to OSA Training Statistics Part II

Welcome to OSA Training Statistics Part II Welcome to OSA Training Statistics Part II Course Summary Using data about a population to draw graphs Frequency distribution and variability within populations Bell Curves: What are they and where do

More information

Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points.

Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points. Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points. 1. The bell-shaped frequency curve is so common that if a population has this shape, the measurements are

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Exam Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the 1) Which of the following is the properly rounded mean for the given data? 7, 8, 13, 9, 10, 11 A)

More information

Examining Relationships Least-squares regression. Sections 2.3

Examining Relationships Least-squares regression. Sections 2.3 Examining Relationships Least-squares regression Sections 2.3 The regression line A regression line describes a one-way linear relationship between variables. An explanatory variable, x, explains variability

More information

Exponential Decay. Lesson2

Exponential Decay. Lesson2 Lesson2 Exponential Decay In 1989, the oil tanker Exxon Valdez ran aground in waters near the Kenai peninsula of Alaska. Over 10 million gallons of oil spread on the waters and shoreline of the area, endangering

More information

Introduction to Econometrics

Introduction to Econometrics Global edition Introduction to Econometrics Updated Third edition James H. Stock Mark W. Watson MyEconLab of Practice Provides the Power Optimize your study time with MyEconLab, the online assessment and

More information

Pitfalls in Linear Regression Analysis

Pitfalls in Linear Regression Analysis Pitfalls in Linear Regression Analysis Due to the widespread availability of spreadsheet and statistical software for disposal, many of us do not really have a good understanding of how to use regression

More information

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Ho (null hypothesis) Ha (alternative hypothesis) Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Hypothesis: Ho:

More information

STOR 155 Section 2 Midterm Exam 1 (9/29/09)

STOR 155 Section 2 Midterm Exam 1 (9/29/09) STOR 155 Section 2 Midterm Exam 1 (9/29/09) Name: PID: Instructions: Both the exam and the bubble sheet will be collected. On the bubble sheet, print your name and ID number, sign the honor pledge, also

More information

INTERMEDIATE ALGEBRA Review for Exam 3

INTERMEDIATE ALGEBRA Review for Exam 3 INTERMEDIATE ALGEBRA Review for Eam 3 Consider the polnomials below. Answer the questions. 1) a) -163 + 6 + 34-2 - 82 b) u + 9u9v2 + u4v3 + 7u + 4v6 i) Determine the degree of each term of the polnomial.

More information

Department of Statistics TEXAS A&M UNIVERSITY STAT 211. Instructor: Keith Hatfield

Department of Statistics TEXAS A&M UNIVERSITY STAT 211. Instructor: Keith Hatfield Department of Statistics TEXAS A&M UNIVERSITY STAT 211 Instructor: Keith Hatfield 1 Topic 1: Data collection and summarization Populations and samples Frequency distributions Histograms Mean, median, variance

More information

Simple Linear Regression the model, estimation and testing

Simple Linear Regression the model, estimation and testing Simple Linear Regression the model, estimation and testing Lecture No. 05 Example 1 A production manager has compared the dexterity test scores of five assembly-line employees with their hourly productivity.

More information