M 140 Test 1 A
Name: ________
SHOW YOUR WORK FOR FULL CREDIT!

Problem   Max. Points   Your Points
1-10      10
11        3
12        4
13        3
14        10
15        14
16        10
17        7
18        4
19        4
Total     60

Multiple choice questions (1 point each)

For questions 1 and 2 consider the following: The distribution of scores on a certain statistics test is strongly skewed to the left.

1. Which set of measures of center and spread is more appropriate for the distribution of scores?
a) Mean and standard deviation
b) Median and interquartile range
c) Mean and interquartile range
d) Median and standard deviation

2. What does this suggest about the difficulty of the test?
a) It was an easy test
b) It was a hard test
c) It wasn't too hard or too easy
d) It is impossible to tell

3. Which one of the following variables is NOT categorical?
a) whether or not an individual has pierced ears
b) religion
c) diameter of sequoia trees
d) zipcode

4. Which one of the following statements is FALSE?
a) The only way the standard deviation can be 0 is when all the observations have the same value.
b) If you interchange the explanatory variable and the response variable, the correlation coefficient remains the same.
c) If the correlation coefficient between two variables is 0, that means that there is no possible relationship between the two variables.
d) The correlation coefficient has no units.

5. X and Y are two categorical variables. The best way to determine if there is a relation between them is
a) to calculate the correlation between X and Y.
b) to draw a scatterplot of the X and Y values.
c) to make a two-way table of the X and Y values.
d) all of the above

6. For an exam given to a class, the students' scores ranged from 35 to 98, with a mean of 74. Which of the following is the most likely value for the standard deviation?
a) -10
b) 0
c) 63
d) 13

7. The typical amount of sleep per night for college students can be assumed to be normally distributed with a mean of 7 hours and a standard deviation of 1.2 hours. From the Standard Deviation Rule we know that about 95% of the college students typically sleep between
a) 5.8 and 8.2 hours per night.
b) 4.6 and 9.4 hours per night.
c) 6 and 8 hours per night.
d) It is impossible to determine the answer from the given information.

8. A study found a correlation of r = -0.61 between the gender of a worker and his or her income. You may correctly conclude that:
a) women earn less than men on the average.
b) women earn more than men on the average.
c) this is incorrect because r makes no sense here.
d) an arithmetic mistake was made. Correlation must be positive.

9. A study of the salaries of professors at Smart University shows that the median salary for female professors is considerably less than the median male salary. Further investigation shows that the median salary for female professors is more than the median male salary in every department (English, Physics, etc.) of the university. This apparent contradiction is an example of
a) extrapolation
b) Simpson's paradox
c) causation
d) correlation

10. Consider the following data set: 2 3 5 7 8 10 12 15 20 31. According to the 1.5(IQR) rule,
a) only 31 is an outlier.
b) only 2 is an outlier.
c) both 2 and 31 are outliers.
d) there are no outliers.

11. (3 points) Match each of the five scatterplots with its correlation.
Correlations: 0.7, -0.75, -0.95, 0, 0.95

12. (4 points) Answer the following questions:
a) Which measure of spread indicates variation about the mean?
Standard deviation
b) Which graphical display shows the five-number summary?
Boxplot
c) Which of the following statistical measures are resistant? Circle those that are:
mean, median, range, IQR, standard deviation, correlation coefficient
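Questions 7 and 10 above both reduce to short calculations, and the following is a minimal Python sketch of them. It assumes the quartiles are taken as medians of the lower and upper halves of the sorted data (the same convention the worked answer to problem 17 uses); other quartile conventions can give slightly different fences. The function names are illustrative, not part of the test.

```python
from statistics import median


def empirical_rule_interval(mean, sd, k=2):
    """Return (mean - k*sd, mean + k*sd); k=2 covers about 95% under a normal model."""
    return (round(mean - k * sd, 10), round(mean + k * sd, 10))


def quartiles(values):
    """Q1 and Q3 as medians of the lower and upper halves of the sorted data.

    For an odd number of observations the middle value is left out of both halves.
    """
    data = sorted(values)
    half = len(data) // 2
    return median(data[:half]), median(data[-half:])


def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [x for x in values if x < low or x > high]


# Question 7: mean 7 hours, standard deviation 1.2 hours
sleep_interval = empirical_rule_interval(7, 1.2)

# Question 10: the data set given in the question
outliers = iqr_outliers([2, 3, 5, 7, 8, 10, 12, 15, 20, 31])

print(sleep_interval)  # (4.6, 9.4)
print(outliers)        # [31]
```

For question 10 this gives Q1 = 5, Q3 = 15 and IQR = 10, so the fences sit at -10 and 30 and only 31 falls outside them.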

13. (3 points) Given below is a dotplot with two arrows. Indicate on the graph which one of the two arrows marks the value of the mean, and which one marks the median. Explain your decision.
Arrow labels on the dotplot: median, mean
Since the mean is sensitive to outliers, it is pulled toward the tail.

14. (10 points) For each of the situations described below, write the appropriate graphical display(s). You need to use ALL of these once (that means you need to list more than one graph for some parts):
histogram, scatterplot, dotplot, pie chart, double bargraph, stemplot, side-by-side boxplots, bargraph

a) You want to explore the relationship between weight of the brain and IQ scores.
Graphical display(s): scatterplot.
b) You want to explore the relationship between nationality and party affiliation.
Graphical display(s): double bargraph.
c) You want to explore the distribution of lengths of adult snakes living in California.
Graphical display(s): histogram, dotplot, stemplot.
d) You want to explore the relationship between work shift (morning, afternoon, night shift) and the number of accidents during the different shifts.
Graphical display(s): side-by-side boxplots.
e) You want to explore the distribution of taste preferences for a new soft drink (awful, OK, good, excellent).
Graphical display(s): bargraph, pie chart.

15. (14 points) Given below is a scatterplot showing the relationship between the horsepower of a car and the weight of the car.

                     Horsepower   Weight
Mean                 116          3.30
Standard deviation   24           0.38

a) Describe the scatterplot. Make sure you mention all four features.
Form: linear
Strength: strong
Direction: positive
Outliers: no outliers

b) Guess the correlation coefficient (circle the best answer): 0.94 0.13 1.2 0.9 0.23 1

c) The line drawn on the graph is called the least squares regression line. Find the equation of the line (Y = a + bX) using the statistics given above in the table, and the correlation coefficient you guessed in part (b). Use two decimal digits in your answers.
$b = r \cdot \frac{s_y}{s_x} = 0.9 \cdot \frac{0.38}{24} = 0.01425 \approx 0.01$
$a = \bar{y} - b\bar{x} = 3.30 - 0.01(116) = 2.14$
The equation of the least squares regression line is Y = 2.14 + 0.01X

d) Provide an interpretation of the slope in this context.
The slope is 0.01. That is, when the horsepower of a car increases by one unit, the weight of the car increases by about 0.01 tons on average.

e) A car has 150 horsepower. Using the equation from part (c), predict the weight of the car.
Y = 2.14 + 0.01(150) = 3.64 tons

f) Would it be OK to use the line to predict the weight of a car with 50 horsepower? Explain.
No, 50 is outside of the range of the explanatory variable. It is not reliable to use the regression line in this case; doing so would be extrapolation.

16. (10 points) Students in Introductory Statistics were presented with a page containing 30 colored rectangles. Their task was to name the colors as quickly as possible; their times were recorded. We can compare the scores for the 16 men and 31 women who participated in the experiment by making separate box plots for each gender. Are the following statements true or false?
T F The median time for women is about the same as the lower quartile for men.
T F The times women needed to name the colors are less variable than the times for men.
T F About 8 of the men needed 18-25 seconds to name the colors.
T F About half of the women needed 14-17 seconds to name the colors.
T F Since the distribution of the times for men is fairly symmetric, we can conclude that the mean time for men is about 23 seconds.
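The regression calculation in problem 15 can be checked with a short Python sketch. It assumes r = 0.9 (the value used in the worked answer) and rounds the slope to two decimals before computing the intercept, as the problem instructs. The function name `least_squares_line` is illustrative only.

```python
def least_squares_line(r, mean_x, sd_x, mean_y, sd_y, digits=2):
    """Return (intercept, slope) of the least squares line from summary statistics.

    slope = r * sd_y / sd_x, intercept = mean_y - slope * mean_x.
    The slope is rounded first, mirroring the hand calculation in the answer key.
    """
    slope = round(r * sd_y / sd_x, digits)
    intercept = round(mean_y - slope * mean_x, digits)
    return intercept, slope


# x = horsepower (mean 116, SD 24), y = weight (mean 3.30, SD 0.38)
a, b = least_squares_line(r=0.9, mean_x=116, sd_x=24, mean_y=3.30, sd_y=0.38)

# Part (e): predicted weight of a car with 150 horsepower
predicted_weight = round(a + b * 150, 2)

print(a, b)              # 2.14 0.01
print(predicted_weight)  # 3.64
```

Rounding the slope this early has a visible effect: with the unrounded slope 0.01425 the intercept would be about 1.65 and the prediction about 3.78, so the figures in the key depend on the two-decimal instruction.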

17. (7 points) The unemployment rates for the twenty largest cities in the United States are given below. A student created a stemplot for the data:

 4 | 1 2 4 5 9
 5 | 1 3 6
 6 | 3 5 8 9
 7 | 1
 8 | 3
10 | 0

a. The stemplot is not correct. Find the mistakes, and correct them.

 4 | 1 2 4 4 5 9
 5 | 1 3 6 6 6
 6 | 3 5 5 8 9
 7 | 1
 8 | 3 3
 9 |
10 | 0

b. Describe the shape of the correct stemplot.
Skewed to the right

c. Find the range of the data.
Range = max - min = 10.0% - 4.1% = 5.9%

d. Find the interquartile range of the data, and interpret it in context.
Q1 = (4.5 + 4.9) / 2 = 4.7
Q3 = (6.8 + 6.9) / 2 = 6.85
IQR = Q3 - Q1 = 6.85 - 4.7 = 2.15
The unemployment rate in the middle 10 of the 20 cities ranges between 4.7% and 6.85%. So the range of the unemployment rate for those 10 middle cities is 2.15%.
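The range and quartile arithmetic in problem 17 can be reproduced with a small Python sketch. It assumes the data are exactly the values read off the corrected stemplot (key: 4 | 1 = 4.1%) and that quartiles are medians of the lower and upper halves, as in the worked answer. Values are kept in tenths of a percent so that the arithmetic stays exact.

```python
from statistics import median

# Corrected stemplot from part (a); key: 4 | 1 means 4.1%
stemplot = {
    4: [1, 2, 4, 4, 5, 9],
    5: [1, 3, 6, 6, 6],
    6: [3, 5, 5, 8, 9],
    7: [1],
    8: [3, 3],
    9: [],
    10: [0],
}

# Rebuild the sorted data in tenths of a percent (4 | 1 -> 41)
tenths = sorted(10 * stem + leaf for stem, leaves in stemplot.items() for leaf in leaves)

half = len(tenths) // 2
q1_tenths = median(tenths[:half])    # median of the 10 smallest values
q3_tenths = median(tenths[-half:])   # median of the 10 largest values

data_range = (tenths[-1] - tenths[0]) / 10
q1 = q1_tenths / 10
q3 = q3_tenths / 10
iqr = (q3_tenths - q1_tenths) / 10

print(len(tenths))   # 20
print(data_range)    # 5.9
print(q1, q3, iqr)   # 4.7 6.85 2.15
```

The count of 20 is a quick check that the corrected stemplot accounts for all twenty cities.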

18. (4 points) Consider the following two distributions of scores on a quiz of 14 students in class A, and 14 students in class B.
[Figure: score distributions for Class A and Class B]
Which distribution has the lower standard deviation and why?
Class B has the lower standard deviation because on average the data points are closer to the mean.

19. (4 points) There is a high correlation between ice cream sales and the number of shark attacks. Does this mean that we can lower the number of shark attacks by eating less ice cream? Explain using appropriate statistical terms.
No, we can't lower the number of shark attacks by eating less ice cream. Even a high correlation doesn't mean that one of the variables causes the changes in the other one. Correlation does not imply causation. The lurking variable connecting these two variables is temperature, or season. In summer, when the temperature is high, many people eat ice cream, and many people swim in the ocean.