Correlation & Regression Exercises Chapters 14-15

Size: px
Start display at page:

Download "Correlation & Regression Exercises Chapters 14-15"

Transcription

1 Correlation & Regression Exercises Chapters Which of these are true and which are false? Explain why the false statements are wrong. a. If the slope of the line is 1, then the correlation must also be 1. b. A correlation of 0.8 means that 80% of the points in the scatterplot lie above the regression line. c. If the correlation between two lists of numbers is zero, then there can be no relationship between them. d. For all of the books in the Library of Congress, the correlation between the thickness of the books (in inches) and their number of pages would be positive. e. For all of the cars registered in the state of Ohio, the correlation between their fuel efficiency (in miles per gallon) and their weight (in pounds) would be positive. f. If the correlation between height (in inches) and weight (in pounds) for a group of people is 0.7, then the correlation between their heights (in centimeters) and their weights (in kilograms) will still be 0.7. g. If the correlation between two variables is negative, then high values of one variable tend to be associated with low values of the other variable. 2. If the standard deviation of x is equal to the standard deviation of y then the slope of the regression line relating x and y will be: A. equal to 1. B. equal to the correlation. C. equal to the mean of x. D. equal to the standard deviation of x. Explain your choice. 3. Consider the following two correlations: I The correlation between weight (in pounds) and height (in inches) for all of the babies born in Brooklyn, N.Y. this year. II The correlation between weight (in kilograms) and height (in centimeters) for all of the babies born in Brooklyn, N.Y. this year. There are about 2.2 pounds to the kilogram and about 2.54 centimeters to the inch. Which statement is true? A. Correlation I is larger. B. Correlation II is larger. C. Correlations I and II are equal. D. There is not enough information to tell which correlation is larger. Explain your choice. 1

2 4. A realtor took a random sample of records of sales of homes from the files maintained by the Albuquerque Board of Realtors. From this sample he recorded the amount paid in real estate taxes (in dollars), and the sales price of the home (in thousands of dollars). From this information the following output was created. a. What is the correlation between the sales price and the taxes paid? b. We know that a home sold for a price of $180,000. Use the least squares line presented above to predict the average taxes for homes that sold at this price. c. We know that a home sold for $400,000. Would it be appropriate to predict the average taxes for a home that sold at this price? If so, make the prediction. If not explain why not. 5. Over 400 students in a statistics class were asked their GPA in high school and their GPA so far in college. The results were analyzed giving the following regression output: a. One student in the class had a high school GPA of 3.2. What would you predict for her GPA at Ohio State? Show your work. b. How do you interpret the GPA coefficient of given in the output? 2

3 6. A survey of homes in Whitehall, Ohio recorded the market value (market) of the home, the size of the home in square feet (sqft), and the number of bedrooms (bed) that each home had. Two resulting regression outputs are given below: The regression equation is market = sqft Predictor Coef StDev T P Constant sqft S = R-Sq = 75.2% R-Sq(adj) = 75.1% The regression equation is market = bed Predictor Coef StDev T P Constant bed S = R-Sq = 15.7% R-Sq(adj) = 15.4% a. What is the correlation between the number of bedrooms and the market value? b. We know that a home in Whitehall has three bedrooms. What market value would we predict for this home? c. If you can choose either square footage or number of bedrooms to use as a predictor of market value, which would make the best predictor? Explain your reasoning. 7. A study measures the average annual snowfall (in inches) for 10 cities over the last decade along with the greatest Earth movement (on the Richter scale) over this same time period. The study included data from five cities in California s San Francisco Bay Area and five cities from Canada s province of Ontario. The study found a very strong negative correlation between the two variables. Does this mean that a strong snowfall will prevent earthquakes? Explain your answer briefly (identify the type of spurious argument involved and draw a picture to illustrate). 8. As the price of gasoline increases many people are considering purchasing gasoline electric hybrids. Are hybrids really different from other cars? We can use scatterplots and correlation to explore the relationship of variables and see how hybrids fall in these groups. Use the computer software of your choice to open the data set Cars2006. a. Create a scatterplot between the highway and city gas mileage. Illuminate the hybrid cars like the Toyota Prius, Honda Insight, Toyota Highlander Hybrid, Lexus RX400H, and Ford Escape Hybrid. Do these cars appear as outliers in the scatterplot? b. If the Toyota Prius and Honda Insight were removed from the scatterplot, would the correlation increase or decrease? Explain. c. Create a scatterplot between the highway mileage and engine displacement. Do the hybrids appear as outliers in this scatterplot? Explain. 3

4 9. Below is a scatterplot of the relationship between the Infant Mortality Rate and the Percent of Juveniles Not Enrolled in School for each of the 50 states plus the District of Columbia. The correlation is If the District of Columbia (identified by the X) had been left out of the data set, then the correlation between these two variables for the 50 states would: A. be higher than B. not change at all. C. be lower than Pick one and explain briefly. 10. The director of admissions in a small college administered a newly designed entrance test to 100 students selected at random from the upcoming freshman class. The purpose of this study was to determine whether students' grade point average (GPA) at the end of the freshman year can be predicted from the entrance test score. At the end of the year when all the data are available, what would be the graph you would use to display the data? A. A histogram of the entrance test scores. B. A histogram of the GPAs. C. A scatterplot with GPA on the y-axis and the entrance test scores on the x-axis. D. A scatter plot with the entrance test scores on the y-axis and GPA on the x-axis. 11. Climatologists can estimate the amount of rainfall in California on a year by year basis over the last two thousand years by looking at the distance between the rings in very old redwood trees that have recently fallen (the idea being that the tree would grow faster and hence the rings would be farther apart for years with more rainfall). In this situation, which of the two variables below should be plotted on the Y-axis of a scatterplot and which should be plotted on the X-axis? Explain why. Variable 1: The distance between the rings Variable 2: The amount of rainfall 12. In an observational study, correlation does not imply causation because A. a high correlation may result from both X and Y being related to an unknown confounding variable. B. a high correlation may result from an outlier in the scatter plot. C. correlations may be negative. D. The regression line may have a steep slope. 4

5 13. The scatterplot and regression output below describe the relationship between the gestation age (in weeks) and the birth weight (in grams) of 100 low birth weight infants born in Boston b i r t h w t a. Is it appropriate to use the regression method to estimate the birth weight of an infant born with a gestational age of 30 weeks? If so explain why and make the estimate showing your work. If not, explain why not. b. Is it appropriate to use the regression method to estimate the birth weight of an infant born with a gestational age of 40 weeks? If so explain why and make the estimate showing your work. If not, explain why not. 14. Correlations will give a deceiving impression of the strength of an association A. when the pattern of points in the scatterplot is not linear. B. when the X and Y variables have a negative association. C. when the standard deviation of both X and Y are large. D. all of the above Pick one and explain. 15. Match the correlation to the situation. Your choices are and a. The correlation between the size of the home loan and the purchase price of the house. b. The correlation between the weight of the infant and the length of time they stay at the hospital after birth. c. The correlation between a college student s grade on an English class final exam and the student s score on the math part of the ACT college entrance exam. Explain your reasoning for deciding which correlation goes with which situation gestage (weeks) 16. The correlation between the price of the dinner and the tips left by customers at a restaurant is True or False: If every customer decided to give one dollar less in tips then this correlation would go down. 17. The correlation between the ages of a group of students and the ages of their fathers is 0.8. Two years later all of the ages of the students and of their fathers would have increased by 2 years. True or False: The correlation here would still be

6 18. Put the following four correlations in order from lowest to highest (be sure to remember that negative numbers are lower than positive numbers). Explain your reasoning. A. The correlation between the ages of all the husband and wife pairs in Ohio. B. The correlation between the weights of all the husbands and wives in Ohio. C. The correlation between the number of questions wrong and the number of questions right for all the students taking a test. D. The correlation between the weight and the miles per gallon of all the cars in Ohio. 19. The weights of 148 sets of twins born at the MetroHealth Medical Center in Cleveland, Ohio were recorded for a full year. How strongly is the weight of the first born associated with the weight of the twin? A scatterplot is shown below (all weights are in kilograms). a. The correlation between the weight of the first born and the weight of the twin is about A B C D Explain your choice. b. If the weights of the first born had been measured in pounds instead of kilograms, then: A. the value of the median weight of the first born twin would. B. the value of the standard deviation of the weights of the first born would. C. the value of the correlation between the two twins' weights would. Fill in the blanks from the possible choices listed below (Note. There are about 2.2 pounds in one kilogram). You may use an answer more than once. (1) be multiplied by 2.2 (2) be divided by 2.2 (3) stay the same (4) be multiplied by 2.2 times the correlation 20. The correlation between X and Y is 0.8. This says that A. Larger than average values of X are associated with smaller than average values of Y. B. Larger than average values of X are associated with negative values of Y. C. X tends to cause Y not to happen D. This is not possible. A correlation cannot be negative. Explain your choice. 6

7 21. The height, in cm, and length of the middle metacarpal bone, in mm, of 10 skeletons were measured. (The metacarpal bones are in the hand between the wrist and fingers.) The scatter diagram is given below. a. If the height and metacarpal length of the skeletons had been measured in inches instead of centimeters and millimeters, then the correlation between stature and metacarpal length for these 10 skeletons would go up, go down, or stay the same. Pick one and explain. b. One of these skeletons (identified by the X) had a metacarpal size of 52 mm and a height of 183 cm. If the height of this skeleton had been misrecorded as 153 cm, then the correlation between stature and metacarpal length for these 10 skeletons (including the misrecorded value) would go up, go down, or stay the same. Pick one and explain. Using the data in the scatterplot above (i.e., without the error mentioned in part b), a researcher gets the following output for the regression of stature on metacarpal length: Dependent variable is: stature No Selector R squared = 78.5% R squared (adjusted) = 75.8% s = with 10-2 = 8 degrees of freedom Source Regression Residual Sum of Squares df 1 8 Mean Square F-ratio 29.2 Variable Constant metacarpal Coefficient s.e. of Coeff t-ratio prob c. A new metacarpal bone, which is 45 mm long, is found at an archeological dig. An investigator wants to use the data from the 10 skeletons mentioned above to make a prediction about the height of the person this new metacarpal bone came from. For the new metacarpal bone that was found, you would expect it to come from a skeleton that was cm tall. Fill in the blank and explain. 7

8 22. Each year, g3 Mystery Shopping, a market research company based in Sylvania, Ohio conducts a study of the drive-thru windows of the national fast-food restaurant chains. In one part of the study, a g3 Mystery Shopping employee orders a main item, a side item, and a drink at a drive-thru window (for example, a sandwich, a fries, and a soft drink) and then keeps track of how long it takes to be served. The time, in seconds, reported for each chain is then a summary of visits to that chain s locations nationwide. Below are a scatterplot and a regression output using the times for 24 chains that were evaluated in both the 1998 and 1999 surveys. Dependent variable is: 1999 time No Selector 26 total cases of which 2 are missing R squared = 80.5% R squared (adjusted) = 79.7% s = with 24-2 = 22 degrees of freedom Source Regression Residual Variable Constant 1998 time Sum of Squares Coefficient df 1 22 s.e. of Coeff Mean Square t-ratio F-ratio 91.0 prob t i m e time a. One chain took 180 seconds to serve customers in What would you predict as the time for that chain to serve drive-thru customers in 1999? Show your work. b. The correlation between the 1998 times and the 1999 times was. c. The Steak n Shake chain did poorly in this survey taking 361 seconds to serve drive-thru customers in 1998 and 340 seconds in If Steak n Shake was not included in the survey, then the correlation between the 1998 and 1999 values would have been A. lower than the answer to part b above. B. higher than the answer to part b above. C. would not have changed the answer to part b above. Pick one and explain. 23. The correlation between the height and the age of a group of students in 2012 was In 2013 the ages of the students had, of course, all gone up by one year but none of the heights had changed. True or False: For this group, the correlation between height and age would thus be greater than 0.08 in

9 24. Subjects taking part in an experimental test of a new drug have a blood test taken before the experiment begins to be sure that a variety of tests are within normal limits. Two of the tests measure the amount of hemoglobin (the protein in the red blood cells that carries the oxygen from the lungs to the body s tissues) and the Red Blood Cell Count (the number of red blood cells per milliliter). Since hemoglobin is carried in the red blood cells it stands to reason that the more red blood cells a person has, the higher their hemoglobin levels will be. A researcher carries out a regression analysis to study this relationship. Here is the output from this analysis: Dependent variable is: No Selector 1357 total cases of which 29 are missing Hemoglobin R squared = 64.6% R squared (adjusted) = 64.6% s = with = 1326 degrees of freedom Source Regression Residual Variable Constant Red Blood C Sum of Squares Coefficient df s.e. of Coeff Mean Square t-ratio F-ratio 2419 prob Red Blood Cell Count a. One man had a red blood cell count of 5.5 per ml. Based on this output what hemoglobin level would you predict for this man? Show your calculations. b. Explain what aspects of the scatterplot above helps you know that your calculations in part a) were appropriate. c. What is the correlation between the hemoglobin level and the red blood cell count? 25. In searching for the causes of a disease, a researcher discovers that high levels of a certain protein are always present in subjects with the disease and is found at low levels in healthy subjects. Is this strong evidence that the protein causes the disease? Explain why or why not. H e m o g l o b i n 9

10 26. Which of these are true and which are false? Explain why the false statements are wrong. a. If two variables have a correlation of 0.3 and you add 0.1 to every value of both variables, then the correlation will become 0.4. b. A correlation of 0.5 means that half of the points in the scatterplot fall in a linear pattern. c. If the correlation is close to negative one, then the regression line will have a negative slope. d. The square of the correlation coefficient tells you the percentage of the variability in Y that is explained by knowing X. e. An outlier in the scatterplot can heavily influence a regression line. f. An outlier that falls right on the regression line can still have an important effect on the correlation. g. The regression line is not appropriate for making predictions when X and Y have a nonlinear relationship. h. If there is a linear pattern to the data, then linear regression can be appropriately used for extrapolation. i. If there is a non-linear relationship between x and y, then the correlation will always be zero. j. The correlation r measures both the direction and strength of a straight-line relationship. k. When the correlation between two variables is nearly negative 1, then there is a causeand-effect relationship between them. l. A correlation close to negative one indicates there were no outliers on the scatterplot. m. If there is a linear pattern to the data and there are no outliers driving the regression, then linear regression can be appropriately used for predictions within the range of the data. 10

11 EESEE Exercises The following exercises make use of stories in the Electronic Encyclopedia of Statistics Examples and Exercises, or EESEE (pronounced ee-zee). EESEE is included in the StatsPortal materials that accompany the textbook under the Resources tab. You will find the specific stories listed alphabetically by title. 27. EESEE Story Hubble Recession Velocity. In 1929 the astronomer Edwin Hubble investigated the relationship between the distance from Earth (in millions of light years) and the recession velocity (in kilometers/sec) of 24 galaxies. Read through this story s protocol. Hubble theorized that the relationship should be approximately linear. The data from Hubble s investigation are in the data file called Hubble. a. Suppose you find a galaxy with a recession velocity of 500 km/sec and want to estimate that galaxy s distance from Earth using Hubble s data. Which variable would you choose to be the y variable, and which would you choose to be the x variable? Explain. b. Use the computer to fit the regression line for distance on recession velocity. How far away from Earth do you predict the galaxy from part a to be? 28. EESEE Story Nutrition and Breakfast Cereals. This story details a study of the nutritional content of popular breakfast cereals. Read through the introduction to this story and open the data file called Cereals. This data gives the nutritional information from the box labels of 77 brands of breakfast cereal. a. Examine the amount of sodium in the cereals. Make a histogram and describe its shape. Calculate the values needed for the five-number summary and make a boxplot. Does the five-number summary do a good job of describing the distribution in this case? b. Examine the relationship between the amount of potassium in the cereals and the amount of dietary fiber. What is the correlation? c. Suppose a new breakfast cereal comes on the market with 300 milligrams of potassium per serving. Would it be appropriate to use the regression method to predict the amount of dietary fiber in a serving of this cereal? If yes, what is the prediction? If no, explain why not. 11

12 29. EESEE Story Blood Alcohol Content. How much does drinking beer increase the alcohol content of your blood? Read the introduction and protocol for this story. This question was addressed in an experiment at the Drackett Towers dormitory on The Ohio State University campus just before the State of Ohio raised the drinking age to 21. Sixteen students volunteered to take part in the experiment. Before the experiment each of the subjects blew into a Breathalyzer to show that their blood alcohol content (BAC) was at the zero mark. The student volunteers then drank a varying number of 12 ounce beers (between one and nine). How much each student drank was assigned by drawing tickets from a bowl. About 30 minutes later, an officer from the OSU Police Department measured their BAC using the Breathalyzer machine. Data from this experiment is in the datafile called bloodalc. Details of the variables are given in the results section of the story. a. Suppose you want to estimate how a person s Blood Alcohol Content is affected by the number of beers they drink. Make a scatter plot of BAC versus beers. Which variable did you choose to be the y variable and which did you choose to be the x variable? Explain. b. Use the computer to find the correlation between BAC and beers. Is the correlation coefficient an appropriate measure of the strength of the association between BAC and beers? Explain briefly. c. Use the computer to fit the regression line for BAC on beers. If a student drinks five beers, on average what do you predict the student s BAC will be? Show your work. c. Would the regression method be as accurate for predicting BAC for a person who drinks 15 beers? Explain. 12

13 Online Problems 30. To understand the ideas of this section try the Correlation and Regression applet in the StatsPortal website (you can find the collection of applets under the Resources tab). Read through the directions to the applet. Notice that you can add points to the scatter plot just by clicking. The correlation of the points will appear in the upper-left corner. To clear the points and start again just click on the Clear button. a. Create a scatter of points in the lower-left corner that has a correlation that is near zero. b. Now add a single point to your scatterplot in the upper-right corner. Click and drag the point to different places on the scatterplot. How much can you change the correlation by manipulating this single point? c. Clear your scatter plot using the trash icon. Create a new scatterplot that has a straight line of points and a correlation that is near 1. d. Now add a single point to your scatterplot in the upper-right corner. Click and drag the point to different places on the scatter plot. How much can you change the correlation by manipulating this single point? e. Based on what you have learned above, an outlier in a scatterplot can: A. increase the correlation. B. decrease the correlation. C. either increase or decrease the correlation. D. have no influence on the correlation. 13

14 31. What is the best line that fits a pattern of points? The least squares line is the line that minimizes the squared vertical distance from the points and the line. How well can you determine this line? You can use the Correlation and Regression applet in the StatsPortal website to experiment with determining this line. a. Open the applet. Create a scatterplot that has a linear pattern and a correlation around 0.7. b. Click the Draw your own line radio button. The next two points you create when you click on the plot will then form your line. You can change your line by dragging one of these two endpoints. The resulting relative sum of squares for your line is shown on the left. A value of 1 is for the best line possible so, for example, the value of 1.12 in the picture below indicates that the green line drawn has 12% high sum of squares than the least-squares line. Try moving the line you have created to try to reduce the relative sum of squares. c. When you have the line that you think is best you can click on Show least-squares line to see the actual best fit line. How did your line compare? d. Draw a new set of points and see if you can draw a line close to the least squares line on your first attempt. Was the line you drew centered correctly? Was its slope too steep or too shallow? 14

15 32. How well can you match correlations to their scatterplots? To find out try the following online applet: a. Click the New Plots button. The applet will present four scatterplots and four correlations. b. Examine each plot and pick the correlation coefficient that matches the scatterplot. When you have made your guesses click the Answers button to find out if you are correct. c. You may continue to generate new plots. Try to achieve a streak of at least 20 in a row. 15

Lab 5a Exploring Correlation

Lab 5a Exploring Correlation Lab 5a Exploring Correlation The correlation coefficient measures how tightly the points on a scatterplot cluster around a line. In this lab we will examine scatterplots and correlation coefficients for

More information

3.2A Least-Squares Regression

3.2A Least-Squares Regression 3.2A Least-Squares Regression Linear (straight-line) relationships between two quantitative variables are pretty common and easy to understand. Our instinct when looking at a scatterplot of data is to

More information

HW 3.2: page 193 #35-51 odd, 55, odd, 69, 71-78

HW 3.2: page 193 #35-51 odd, 55, odd, 69, 71-78 35. What s My Line? You use the same bar of soap to shower each morning. The bar weighs 80 grams when it is new. Its weight goes down by 6 grams per day on average. What is the equation of the regression

More information

Math 075 Activities and Worksheets Book 2:

Math 075 Activities and Worksheets Book 2: Math 075 Activities and Worksheets Book 2: Linear Regression Name: 1 Scatterplots Intro to Correlation Represent two numerical variables on a scatterplot and informally describe how the data points are

More information

Section 3.2 Least-Squares Regression

Section 3.2 Least-Squares Regression Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.

More information

AP Statistics Practice Test Ch. 3 and Previous

AP Statistics Practice Test Ch. 3 and Previous AP Statistics Practice Test Ch. 3 and Previous Name Date Use the following to answer questions 1 and 2: A researcher measures the height (in feet) and volume of usable lumber (in cubic feet) of 32 cherry

More information

10/4/2007 MATH 171 Name: Dr. Lunsford Test Points Possible

10/4/2007 MATH 171 Name: Dr. Lunsford Test Points Possible Pledge: 10/4/2007 MATH 171 Name: Dr. Lunsford Test 1 100 Points Possible I. Short Answer and Multiple Choice. (36 points total) 1. Circle all of the items below that are measures of center of a distribution:

More information

Midterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do.

Midterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do. Midterm STAT-UB.0003 Regression and Forecasting Models The exam is closed book and notes, with the following exception: you are allowed to bring one letter-sized page of notes into the exam (front and

More information

3.2 Least- Squares Regression

3.2 Least- Squares Regression 3.2 Least- Squares Regression Linear (straight- line) relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these

More information

Chapter 3 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Chapter 3 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question. Name: Class: Date: Chapter 3 Review Multiple Choice Identify the choice that best completes the statement or answers the question. Scenario 3-1 The height (in feet) and volume (in cubic feet) of usable

More information

(a) 50% of the shows have a rating greater than: impossible to tell

(a) 50% of the shows have a rating greater than: impossible to tell KEY 1. Here is a histogram of the Distribution of grades on a quiz. How many students took the quiz? 15 What percentage of students scored below a 60 on the quiz? (Assume left-hand endpoints are included

More information

STOR 155 Section 2 Midterm Exam 1 (9/29/09)

STOR 155 Section 2 Midterm Exam 1 (9/29/09) STOR 155 Section 2 Midterm Exam 1 (9/29/09) Name: PID: Instructions: Both the exam and the bubble sheet will be collected. On the bubble sheet, print your name and ID number, sign the honor pledge, also

More information

REVIEW PROBLEMS FOR FIRST EXAM

REVIEW PROBLEMS FOR FIRST EXAM M358K Sp 6 REVIEW PROBLEMS FOR FIRST EXAM Please Note: This review sheet is not intended to tell you what will or what will not be on the exam. However, most of these problems have appeared on or are very

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Name Date Per Key Vocabulary: response variable explanatory variable independent variable dependent variable scatterplot positive association negative association linear correlation r-value regression

More information

Chapter 3 CORRELATION AND REGRESSION

Chapter 3 CORRELATION AND REGRESSION CORRELATION AND REGRESSION TOPIC SLIDE Linear Regression Defined 2 Regression Equation 3 The Slope or b 4 The Y-Intercept or a 5 What Value of the Y-Variable Should be Predicted When r = 0? 7 The Regression

More information

(a) 50% of the shows have a rating greater than: impossible to tell

(a) 50% of the shows have a rating greater than: impossible to tell q 1. Here is a histogram of the Distribution of grades on a quiz. How many students took the quiz? What percentage of students scored below a 60 on the quiz? (Assume left-hand endpoints are included in

More information

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph. STAT 280 Sample Test Problems Page 1 of 1 1. An English survey of 3000 medical records showed that smokers are more inclined to get depressed than non-smokers. Does this imply that smoking causes depression?

More information

STATISTICS INFORMED DECISIONS USING DATA

STATISTICS INFORMED DECISIONS USING DATA STATISTICS INFORMED DECISIONS USING DATA Fifth Edition Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation Learning Objectives 1. Draw and interpret scatter diagrams

More information

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression! Equation of Regression Line; Residuals! Effect of Explanatory/Response Roles! Unusual Observations! Sample

More information

Lecture 12: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression

Lecture 12: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Lecture 12: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Equation of Regression Line; Residuals Effect of Explanatory/Response Roles Unusual Observations Sample

More information

Homework #3. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Homework #3. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Homework #3 Name Due Due on on February Tuesday, Due on February 17th, Sept Friday 28th 17th, Friday SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill

More information

Lesson 1: Distributions and Their Shapes

Lesson 1: Distributions and Their Shapes Lesson 1 Name Date Lesson 1: Distributions and Their Shapes 1. Sam said that a typical flight delay for the sixty BigAir flights was approximately one hour. Do you agree? Why or why not? 2. Sam said that

More information

Practice First Midterm Exam

Practice First Midterm Exam Practice First Midterm Exam Statistics 200 (Pfenning) This is a closed book exam worth 150 points. You are allowed to use a calculator and a two-sided sheet of notes. There are 9 problems, with point values

More information

CHAPTER 3 Describing Relationships

CHAPTER 3 Describing Relationships CHAPTER 3 Describing Relationships 3.1 Scatterplots and Correlation The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Reading Quiz 3.1 True/False 1.

More information

CHAPTER TWO REGRESSION

CHAPTER TWO REGRESSION CHAPTER TWO REGRESSION 2.0 Introduction The second chapter, Regression analysis is an extension of correlation. The aim of the discussion of exercises is to enhance students capability to assess the effect

More information

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES 24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter

More information

This means that the explanatory variable accounts for or predicts changes in the response variable.

This means that the explanatory variable accounts for or predicts changes in the response variable. Lecture Notes & Examples 3.1 Section 3.1 Scatterplots and Correlation (pp. 143-163) Most statistical studies examine data on more than one variable. We will continue to use tools we have already learned

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Objectives: Students will: Construct and interpret a scatterplot for a set of bivariate data. Compute and interpret the correlation, r, between two variables. Demonstrate

More information

Chapter 3, Section 1 - Describing Relationships (Scatterplots and Correlation)

Chapter 3, Section 1 - Describing Relationships (Scatterplots and Correlation) Chapter 3, Section 1 - Describing Relationships (Scatterplots and Correlation) Investigating relationships between variables is central to what we do in statistics. Why is it important to investigate and

More information

10. LINEAR REGRESSION AND CORRELATION

10. LINEAR REGRESSION AND CORRELATION 1 10. LINEAR REGRESSION AND CORRELATION The contingency table describes an association between two nominal (categorical) variables (e.g., use of supplemental oxygen and mountaineer survival ). We have

More information

INTERPRET SCATTERPLOTS

INTERPRET SCATTERPLOTS Chapter2 MODELING A BUSINESS 2.1: Interpret Scatterplots 2.2: Linear Regression 2.3: Supply and Demand 2.4: Fixed and Variable Expenses 2.5: Graphs of Expense and Revenue Functions 2.6: Breakeven Analysis

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

3. For a $5 lunch with a 55 cent ($0.55) tip, what is the value of the residual?

3. For a $5 lunch with a 55 cent ($0.55) tip, what is the value of the residual? STATISTICS 216, SPRING 2006 Name: EXAM 1; February 21, 2006; 100 points. Instructions: Closed book. Closed notes. Calculator allowed. Double-sided exam. NO CELL PHONES. Multiple Choice (3pts each). Circle

More information

7) Briefly explain why a large value of r 2 is desirable in a regression setting.

7) Briefly explain why a large value of r 2 is desirable in a regression setting. Directions: Complete each problem. A complete problem has not only the answer, but the solution and reasoning behind that answer. All work must be submitted on separate pieces of paper. 1) Manatees are

More information

Relationships. Between Measurements Variables. Chapter 10. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

Relationships. Between Measurements Variables. Chapter 10. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Chapter 10 Between Measurements Variables Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc. Thought topics Price of diamonds against weight Male vs female age for dating Animals

More information

Homework Linear Regression Problems should be worked out in your notebook

Homework Linear Regression Problems should be worked out in your notebook Homework Linear Regression Problems should be worked out in your notebook 1. Following are the mean heights of Kalama children: Age (months) 18 19 20 21 22 23 24 25 26 27 28 29 Height (cm) 76.1 77.0 78.1

More information

Dr. Kelly Bradley Final Exam Summer {2 points} Name

Dr. Kelly Bradley Final Exam Summer {2 points} Name {2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.

More information

CHAPTER ONE CORRELATION

CHAPTER ONE CORRELATION CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to

More information

Section 6: Analysing Relationships Between Variables

Section 6: Analysing Relationships Between Variables 6. 1 Analysing Relationships Between Variables Section 6: Analysing Relationships Between Variables Choosing a Technique The Crosstabs Procedure The Chi Square Test The Means Procedure The Correlations

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Exam Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Identify the W's for the description of data. 1) A survey of bicycles parked outside college

More information

14.1: Inference about the Model

14.1: Inference about the Model 14.1: Inference about the Model! When a scatterplot shows a linear relationship between an explanatory x and a response y, we can use the LSRL fitted to the data to predict a y for a given x. However,

More information

Chapter 4: More about Relationships between Two-Variables Review Sheet

Chapter 4: More about Relationships between Two-Variables Review Sheet Review Sheet 4. Which of the following is true? A) log(ab) = log A log B. D) log(a/b) = log A log B. B) log(a + B) = log A + log B. C) log A B = log A log B. 5. Suppose we measure a response variable Y

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Eating and Sleeping Habits of Different Countries

Eating and Sleeping Habits of Different Countries 9.2 Analyzing Scatter Plots Now that we know how to draw scatter plots, we need to know how to interpret them. A scatter plot graph can give us lots of important information about how data sets are related

More information

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Ho (null hypothesis) Ha (alternative hypothesis) Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Hypothesis: Ho:

More information

M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60

M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60 M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-10 10 11 3 12 4 13 3 14 10 15 14 16 10 17 7 18 4 19 4 Total 60 Multiple choice questions (1 point each) For questions

More information

Unit 8 Bivariate Data/ Scatterplots

Unit 8 Bivariate Data/ Scatterplots Unit 8 Bivariate Data/ Scatterplots Oct 20 9:19 PM Scatterplots are used to determine if there is a relationship between two variables. /Correlation /Correlation /Correlation Line of best fit cuts the

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Evaluate the expression for the given value or values. ) 42 + y for y = 43 A) 76 B) 58 C) 85 D) 67

More information

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016 UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016 STAB22H3 Statistics I, LEC 01 and LEC 02 Duration: 1 hour and 45 minutes Last Name: First Name:

More information

STAT 201 Chapter 3. Association and Regression

STAT 201 Chapter 3. Association and Regression STAT 201 Chapter 3 Association and Regression 1 Association of Variables Two Categorical Variables Response Variable (dependent variable): the outcome variable whose variation is being studied Explanatory

More information

AP Statistics Practice Test Unit Seven Sampling Distributions. Name Period Date

AP Statistics Practice Test Unit Seven Sampling Distributions. Name Period Date AP Statistics Practice Test Unit Seven Sampling Distributions Name Period Date Vocabulary: 1. Define and provide an example of a statistic.. Define Sampling Distribution. 3. Define the variability of a

More information

Chapter 14. Inference for Regression Inference about the Model 14.1 Testing the Relationship Signi!cance Test Practice

Chapter 14. Inference for Regression Inference about the Model 14.1 Testing the Relationship Signi!cance Test Practice Chapter 14 Inference for Regression Our!nal topic of the year involves inference for the regression model. In Chapter 3 we learned how to!nd the Least Squares Regression Line for a set of bivariate data.

More information

CP Statistics Sem 1 Final Exam Review

CP Statistics Sem 1 Final Exam Review Name: _ Period: ID: A CP Statistics Sem 1 Final Exam Review Multiple Choice Identify the choice that best completes the statement or answers the question. 1. A particularly common question in the study

More information

Regression. Lelys Bravo de Guenni. April 24th, 2015

Regression. Lelys Bravo de Guenni. April 24th, 2015 Regression Lelys Bravo de Guenni April 24th, 2015 Outline Regression Simple Linear Regression Prediction of an individual value Estimate Percentile Ranks Regression Simple Linear Regression The idea behind

More information

M 140 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

M 140 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75 M 140 est 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDI! Problem Max. Points Your Points 1-10 10 11 10 12 3 13 4 14 18 15 8 16 7 17 14 otal 75 Multiple choice questions (1 point each) For questions

More information

STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points.

STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points. STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points. For Questions 1 & 2: It is known that the distribution of starting salaries for MSU Education majors has

More information

Regression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions

Regression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 10 Correlation and Regression 10 1 Review and Preview 10 2 Correlation 10 3 Regression 10 4 Variation and Prediction Intervals

More information

AP Stats Review for Midterm

AP Stats Review for Midterm AP Stats Review for Midterm NAME: Format: 10% of final grade. There will be 20 multiple-choice questions and 3 free response questions. The multiple-choice questions will be worth 2 points each and the

More information

Chapter Which of these are true and which are false? Explain why the false statements are wrong

Chapter Which of these are true and which are false? Explain why the false statements are wrong Chapter 8 1. Which of these are true and which are false? Explain why the false statements are wrong 1 a. By repeating a measurement many times, a researcher can improve the validity of the results. False

More information

Test 1C AP Statistics Name:

Test 1C AP Statistics Name: Test 1C AP Statistics Name: Part 1: Multiple Choice. Circle the letter corresponding to the best answer. 1. At the beginning of the school year, a high-school teacher asks every student in her classes

More information

Chapter 4: More about Relationships between Two-Variables

Chapter 4: More about Relationships between Two-Variables 1. Which of the following scatterplots corresponds to a monotonic decreasing function f(t)? A) B) C) D) G Chapter 4: More about Relationships between Two-Variables E) 2. Which of the following transformations

More information

Chapter Three in-class Exercises. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Chapter Three in-class Exercises. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Name Chapter Three in-class Exercises MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) The table below lists the populations, in thousands, of several

More information

3.4 What are some cautions in analyzing association?

3.4 What are some cautions in analyzing association? 3.4 What are some cautions in analyzing association? Objectives Extrapolation Outliers and Influential Observations Correlation does not imply causation Lurking variables and confounding Simpson s Paradox

More information

STAT445 Midterm Project1

STAT445 Midterm Project1 STAT445 Midterm Project1 Executive Summary This report works on the dataset of Part of This Nutritious Breakfast! In this dataset, 77 different breakfast cereals were collected. The dataset also explores

More information

ANALYZING BIVARIATE DATA

ANALYZING BIVARIATE DATA Analyzing bivariate data 1 ANALYZING BIVARIATE DATA Lesson 1: Creating frequency tables LESSON 1: OPENER There are two types of data: categorical and numerical. Numerical data provide numeric measures

More information

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review Results & Statistics: Description and Correlation The description and presentation of results involves a number of topics. These include scales of measurement, descriptive statistics used to summarize

More information

CHILD HEALTH AND DEVELOPMENT STUDY

CHILD HEALTH AND DEVELOPMENT STUDY CHILD HEALTH AND DEVELOPMENT STUDY 9. Diagnostics In this section various diagnostic tools will be used to evaluate the adequacy of the regression model with the five independent variables developed in

More information

manipulation influences other variables, the researcher is conducting a(n)

manipulation influences other variables, the researcher is conducting a(n) Math 1342 Finals Review Selective Name SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 1) If a researcher manipulates one of the variables and tries to

More information

EXECUTIVE SUMMARY DATA AND PROBLEM

EXECUTIVE SUMMARY DATA AND PROBLEM EXECUTIVE SUMMARY Every morning, almost half of Americans start the day with a bowl of cereal, but choosing the right healthy breakfast is not always easy. Consumer Reports is therefore calculated by an

More information

Lesson Using Lines to Make Predictions

Lesson Using Lines to Make Predictions STTWY STUDENT HNDOUT STUDENT NME DTE INTRODUCTION Statistical methods are used in forensics to identify human remains based on the measurements of bones. In the 1950s, Dr. Mildred Trotter and Dr. Goldine

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Regression. Regression lines CHAPTER 5

Regression. Regression lines CHAPTER 5 CHAPTER 5 NASA/GSFC Can scientists predict in advance how many hurricanes the coming season will bring? Exercise 5.44 has some data. Regression IN THIS CHAPTER WE COVER... Linear (straight-line) relationships

More information

Test 1: Professor Symanzik Statistics

Test 1: Professor Symanzik Statistics Page 1 of 11 1 (6 Points) A researcher wants to learn whether regularly taking chromium picolinate may reduce elevated cholesterol values. The researcher is considering two approaches to study this issue:

More information

STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS

STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS Circle the best answer. This scenario applies to Questions 1 and 2: A study was done to compare the lung capacity of coal miners to the lung

More information

Bangor University Laboratory Exercise 1, June 2008

Bangor University Laboratory Exercise 1, June 2008 Laboratory Exercise, June 2008 Classroom Exercise A forest land owner measures the outside bark diameters at.30 m above ground (called diameter at breast height or dbh) and total tree height from ground

More information

Welcome to OSA Training Statistics Part II

Welcome to OSA Training Statistics Part II Welcome to OSA Training Statistics Part II Course Summary Using data about a population to draw graphs Frequency distribution and variability within populations Bell Curves: What are they and where do

More information

Chapter 7: Descriptive Statistics

Chapter 7: Descriptive Statistics Chapter Overview Chapter 7 provides an introduction to basic strategies for describing groups statistically. Statistical concepts around normal distributions are discussed. The statistical procedures of

More information

MEASURES OF ASSOCIATION AND REGRESSION

MEASURES OF ASSOCIATION AND REGRESSION DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 MEASURES OF ASSOCIATION AND REGRESSION I. AGENDA: A. Measures of association B. Two variable regression C. Reading: 1. Start Agresti

More information

Scatter Plots and Association

Scatter Plots and Association ? LESSON 1.1 ESSENTIAL QUESTION Scatter Plots and Association How can you construct and interpret scatter plots? Measurement and data 8.11.A Construct a scatterplot and describe the observed data to address

More information

Correlation and Regression

Correlation and Regression Dublin Institute of Technology ARROW@DIT Books/Book Chapters School of Management 2012-10 Correlation and Regression Donal O'Brien Dublin Institute of Technology, donal.obrien@dit.ie Pamela Sharkey Scott

More information

How to assess the strength of relationships

How to assess the strength of relationships Publishing Date: April 1994. 1994. All rights reserved. Copyright rests with the author. No part of this article may be reproduced without written permission from the author. Meta Analysis 3 How to assess

More information

Simple Linear Regression the model, estimation and testing

Simple Linear Regression the model, estimation and testing Simple Linear Regression the model, estimation and testing Lecture No. 05 Example 1 A production manager has compared the dexterity test scores of five assembly-line employees with their hourly productivity.

More information

Chapter 4: Scatterplots and Correlation

Chapter 4: Scatterplots and Correlation Chapter 4: Scatterplots and Correlation http://www.yorku.ca/nuri/econ2500/bps6e/ch4-links.pdf Correlation text exr 4.10 pg 108 Ch4-image Ch4 exercises: 4.1, 4.29, 4.39 Most interesting statistical data

More information

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Chapters 6 & 7 Exam Review Math 0306 Name SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Find fraction notation for the ratio. You need not simplify.

More information

Identify two variables. Classify them as explanatory or response and quantitative or explanatory.

Identify two variables. Classify them as explanatory or response and quantitative or explanatory. OLI Module 2 - Examining Relationships Objective Summarize and describe the distribution of a categorical variable in context. Generate and interpret several different graphical displays of the distribution

More information

Review for Final Exam Math 20

Review for Final Exam Math 20 Review for Final Exam Math 20 Write the number in words. 1) 135,060 1) Rewrite the following number using digits. 2) Eight thousand, one hundred sixty-seven 2) Fill in the digits for the given place values

More information

Homework 2 Math 11, UCSD, Winter 2018 Due on Tuesday, 23rd January

Homework 2 Math 11, UCSD, Winter 2018 Due on Tuesday, 23rd January PID: Last Name, First Name: Section: Approximate time spent to complete this assignment: hour(s) Readings: Chapters 7, 8 and 9. Homework 2 Math 11, UCSD, Winter 2018 Due on Tuesday, 23rd January Exercise

More information

Part I: Alcohol Metabolization Explore and Explain

Part I: Alcohol Metabolization Explore and Explain Name Date Part I: Alcohol Metabolization Explore and Explain Just like any other type of food or beverage, alcohol is digested and then metabolized by the body. When a substance is metabolized by the body,

More information

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests

More information

Reminders/Comments. Thanks for the quick feedback I ll try to put HW up on Saturday and I ll you

Reminders/Comments. Thanks for the quick feedback I ll try to put HW up on Saturday and I ll  you Reminders/Comments Thanks for the quick feedback I ll try to put HW up on Saturday and I ll email you Final project will be assigned in the last week of class You ll have that week to do it Participation

More information

Test 1 Version A STAT 3090 Spring 2018

Test 1 Version A STAT 3090 Spring 2018 Multiple Choice: (Questions 1 20) Answer the following questions on the scantron provided using a #2 pencil. Bubble the response that best answers the question. Each multiple choice correct response is

More information

Lesson 11 Correlations

Lesson 11 Correlations Lesson 11 Correlations Lesson Objectives All students will define key terms and explain the difference between correlations and experiments. All students should be able to analyse scattergrams using knowledge

More information

Paper Airplanes & Scientific Methods

Paper Airplanes & Scientific Methods Paper Airplanes & Scientific Methods Scientific Inquiry refers to the many different ways in which scientists investigate the world. Scientific investigations are done to answer questions and solve problems.

More information

NORTH SOUTH UNIVERSITY TUTORIAL 2

NORTH SOUTH UNIVERSITY TUTORIAL 2 NORTH SOUTH UNIVERSITY TUTORIAL 2 AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 Correlation Analysis INTRODUCTION In correlation analysis, we estimate

More information

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.

Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Activity 1 Examining Data From Class Background Download

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Statistics Final Review Semeter I Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) The Centers for Disease

More information

Medical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months?

Medical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months? Medical Statistics 1 Basic Concepts Farhad Pishgar Defining the data Population and samples Except when a full census is taken, we collect data on a sample from a much larger group called the population.

More information

Multiple Choice Questions

Multiple Choice Questions ACTM State Statistics Work the multiple choice questions first, selecting the single best response from those provided and entering it on your scantron form. You may write on this test and keep the portion

More information

STATISTICS 201. Survey: Provide this Info. How familiar are you with these? Survey, continued IMPORTANT NOTE. Regression and ANOVA 9/29/2013

STATISTICS 201. Survey: Provide this Info. How familiar are you with these? Survey, continued IMPORTANT NOTE. Regression and ANOVA 9/29/2013 STATISTICS 201 Survey: Provide this Info Outline for today: Go over syllabus Provide requested information on survey (handed out in class) Brief introduction and hands-on activity Name Major/Program Year

More information

STAT 503X Case Study 1: Restaurant Tipping

STAT 503X Case Study 1: Restaurant Tipping STAT 503X Case Study 1: Restaurant Tipping 1 Description Food server s tips in restaurants may be influenced by many factors including the nature of the restaurant, size of the party, table locations in

More information