ACTIVITY 10 A Little Tacky!

Size: px
Start display at page:

Download "ACTIVITY 10 A Little Tacky!"

Transcription

1 Introduction to Inference ACTIVITY 10 A Little Tacky! Materials: Small box of thumbtacks When you flip a fair coin, it is equally likely to land "heads" or "tails." Do thumbtacks behave in the same way? In this activity, you will toss a thumbtack several times and observe whether it comes to rest with the point up (U) or point down (D). The question you are trying to answer is: what proportion of the time does a tossed thumbtack settle with its point up (U)? 1. Before you begin the activity, make a guess about what will happen. If you could toss your thumbtack over and over and over, what proportion of all tosses do you think would settle with the point up (U)? 2. Toss your thumbtack 50 times. Record the result of each toss (U or D) in a table like the one shown. In the third column, calculate the proportion of point up (U) tosses you have obtained so far. Toss. Outcome Cumulative proportion of U's 3. Make a scatterplot with the number of tosses on the horizontal axis and the cumulative proportion of U's on the vertical axis. Connect consecutive points with a line segment. Does the overall proportion of U's seem to be approaching a single value? 4. Your set of 50 tosses can be thought of as a simple random sample from the population of all possible tosses of your thumbtack. The parameter p is the (unknown) population proportion of tosses that would land point up (U). What is your best estimate for p? It's 6, the proportion of U's in your 50 thumbtack tosses. Record your value ofb. How does it compare with the conjecture you made in step l? 5. If you tossed your thumbtack 50 more times (don't do it!), would you expect to get the same value of j? In chapter 9, we learned that the values of6 in repeated samples could be described by a sampling distribution. The mean of the sampling distribution 4 is equal to the population proportion p. How far will your sample proportion j be from the true value p? If the sampling distribution is approximately normal, then the rule tells us that about 95% of all j-values will be within two standard deviations of p.

2 ACTIVITY 10 A Little Tacky! (continued) 6. The sampling distribution of$ will be approximately normal if n$ 2 10 and n(l -6) Verify that these conditions are satisfied for your sample. 7. Estimate /? the standard deviation of the sampling distribution by computing using your value of$. This is the formula we developed in Chapter 9 with p replaced by$. I. Construct the interval b i 2 / 9 based on your sample of 50 tosses. This is called a confidence interval for Your teacher will draw a number line with a scale marked off from 0 to 1 that has tick marks every 0.05 units. Draw your confidence interval above the number line. Your classmates will do the same. Do most of the intervals overlap? If so, what values are contained in all of the overlapping intervals? 10. About 95% of the time, the sample proportion $ of point up (U) tosses will be within two standard deviations of the actual population praportion of point up(u) tosses of a thumbtack. But if 6 is within two standard devia- -.. tions of p, then p is within two standard deviations of$. So about 95% of the time, the interval bf 2 will contain the true proportion p. n 11. There is no way to know whether the confidence interval you constructed in step 8 actually "catches" the true proportion p of times that your thumbtack will land point up. What we can say is that the method you used in step 8 will succeed in capturing the unknown population parameter about 95% of the time. Likewise, we would expect about 95% of all the confidence intervals drawn by the members of your class in step 9 to capture p. This activity shows you how sample statistics can be used to estimate unknown population parameters. This is one of the two types of statistical inference that you will meet in this chapter. INTRODUCTION When we select a sample, we know the responses of the individuals in the sample. Often we are not content with information about the sample. We want to infer from the sample data some conclusion about a wider population that the sample represents.

3 / _Z 4 - Chapter 10 Introduction to Inference I I I STATISTICAL INFERENCE Statistical inference provides methods for drawing conclusions about a population from sample data. I I I We have, of course, been drawing conclusions from data all along. What is new in formal inference is that we use probability to express the strength of our conclusions. Probability allows us to take chance variation into account and so to correct our judgment by calculation.- ere are two examples of how probability can correct our judgment. EXAMPLE 10.1 DRAFT LOTTERIES AND DRUG STUDIES In the Vietnam War years, a lottery determined the order in which men were drafted for army service. The lottery assigned draft numbers by choosing birth dates in random order. We expect a correlation near zero between birth dates and draft numbers if the draft numbers come from random choice. The actual correlation between birth date and draft number in the first draft lottery was r = That is, men born later in the year tended to get lower draft numbers. Is this small correlation evidence that the lottery was biased? Our unaided judgment can't tell because any two variables will have some association in practice, just by chance. So we calculate that a correlation this far from zero has probability less than in a truly random lottery. Because a correlation as strong as that observed would almost never occur in a random lottery, there is strong evidence that the lottery was unfair. Probability calculations can also protect us from jumping to a conclusion when only chance variation is at work. Give a new drug and a placebo to 20 patients each; 12 of those taking the drug show improvement, but only 8 of the placebo patients improve. Is the drug more effective than the placebo? Perhaps, but a difference this large or larger between the results in the two groups would occur about one time in five simply because of chance variation. An effect that could so easily be just chance is not convincing. In this chapter, we will meet the two most common types of formal statistical inference. Section 10.1 concerns confidence intervals for estimating the value of a population parameter. Section 10.2 presents tests of significance, which assess the evidence for a claim about a population. Both types of inference are based on the sampling distributions of statistics. That is, both report probabilities that state what would happen if we used the inference method many times. This kind of probability statement is characteristic of statistical inference. Users of statistics must understand the meaning of the probability statements that appear, for example, on computer output for statistical procedures. The methods of formal inference require the long-run regular behavior that probability describes. Inference is most reliable when the data are produced by a properly randomized design. When you use statistical inference you are acting as if the data are a random sample or come porn a randomized experiment. If this is not true, your conclusions may be open to challenge. Do not be overly impressed by the complex details of formal inference. This elaborate machinery

4 Estimating with C o n f i d e n ~ e 4, ~.I -_ ~ < - cannot remedy basic flaws in producing the data, such as voluntary response samples and uncontrolled experiments. Use the common sense developed in your study of the first nine chapters of this book, and proceed to formal inference only when you are satisfied that the data deserve such analysis. The purpose of this chapter is to describe the reasoning used in statistical inference. We will illustrate the reasoning by a few specific inference techniques, but these are oversimplified so that they are not very useful in practice. Later chapters will first show how to moq these techniques to make them practically useful and will then introduce inference methods for use in most of the settings we met in learning to explore data. There are libraries-both of books and of computer softwardull of more elaborate statistical techniques. Informed use of any of these methods requires an understanding of the underlying reasoning. A computer will do the arithmetic, but you must still exercise judgment based on understanding ESTIMATING WITH CONFIDENCE What decides whether you will gain admission to the college or university of your choice? By taking challenging courses (AP courses, for example) and earning hlgh grades in those courses, you certainly improve your chances. Many schools also look closely at your performance on standardued tests, such as the SAT I: Reasoning Test (more commonly referred to as the SAT). This test has two one for verbal reasoning ability and one for mathematical reasoning ability. In 2000, 1,260,278 collegebound seniors took the SAT. Their mean SAT Math score was 514 with a standard deviation of 113. For the SAT Verbal, the mean was 505 with a standard deviation of l l l. In early 2000, University of California President Richard Atkinson stirred considerable controversy when he suggested that the University of California system drop the SAT I: Reasoning Test as a factor in college admissions decisions. He suggested replacing this test with tests that reflect course content better. EXAMPLE 10.2 SAT MATH SCORES IN CALIFORNIA Suppose you want to estimate the mean SAT Math score for the more than 350,000 high school seniors in California. Only about 49% of California students take the SAT. These self-selected seniors are planning to attend college and so are not representative of all California seniors. You know better than to make inferences about the population based on any sample data. At considerable effortmd expense, you give the test to a simple random sample (SRS) of 500 California high school seniors. The mean for your sample is T= 461. What can vou say about the mean score in the ~o~ulation of all 350,000 seniors? The law of large numbers tells us that the sample mean X from a large SRS will be close to the unknown population mean p. Because 5s = 461, we guess that p is "somewhere around 461." To make "somewhere around 461" more precise, we ask: How would the sample mean vary if we took many samples of 500 seniors fiom this same population? Recall the essential facts about the sampling distribution of E

5 k Chapter 10 ~ntr~duction.- to Inference The central limit theorem tells us that the mean 55 of 500 scores has a distribution that is close to normal.. 'I, The mean of this normal sampling distribution is the same as the unknown mean p of the entire population. The standard deviation of P for an SRS of 500 students is U I ~ where, u is the standard deviation of individual SAT Math scores among all California high school seniors. Let us suppose that we know that the standard deviation of SAT Math scores in the population of all California seniors is a = 100. The standard deviation of X is then (It is usually not realistic to assume we know a. We will see in the next chapter how to proceed when u is not known. For now, we are more interested in statistical reasoning than in details of realistic methods.) If we choose many samples of size 500 and find the mean SAT Math score for each sample, we might get mean T = 461 from the first sample, X = 455 from the second, js = 463 from the third sample, and so on. If we collect all these sample means and display their distribution, we get the normal distribution with mean equal to the unknown p and standard deviation 4.5. Inference about the unknown p, starts from this sampling distribution. Figure 10.1 displays the distribution. The different values of T appear along the axis in the figure, and the normal curve shows how probable these values are. ". FIGURE 10.1 The sarnpllngdsstribution of the mean score X of an SRS of 500 California seniors on the SAT Math quantitative test.

6 10.1 Estimating with ~onfiden-! Statistical confidence Figure 10.2 is another picture of the same sampling distribution. It illustrates the following line of thought: The rule says that in 95% of all samples, the mean score 55 for the sample will be within two standard deviations of the population mean score p. SO the mean of 500 SAT Math scores will be within 9 points of p in 95% of all samples. Whenever F is within 9 points of the unknown p, p is within 9 points of the observed 55. This happens in 95% of all samples. So in 95% of all samples, the unknown p lies between 55-9 and Figure 10.3 displays this fact in picture form. FIGURE 10.2 In 95% of all samples, X lies within 29 of the unknown population mean p. Sop also lies within +9 of ii in those samples. FIGURE 10.3 To say that ji + 9 is a 95% confidence interval for the population mean p is to say that in repeated samples, 95% of these intervals capture p.

7 /' ===--=ZZOi-.Chapter 10 Introduction to Inference This conclusion just restates a fact about the sampling distribution of 15. The language of statistical inference uses this fact about what would happen in the long run to express our confidence in the results of any one sample. EXAMPLE % CONFIDENCE Our sample of 500 California seniors gave jt = 461. We say that we are 95% confident that the unknown mean SAT Math score for all California high school seniors lies between and X-9=461-9=452 T+9=461 +9=470 Be sure you understand the grounds for our confidence. There are only two possibilities: 1. The interval between 452 and 470 contains the true p. 2. Our SRS was one of the few samples for which 55 is not within 9 points of the true p. Only 5% of all samples give such inaccurate results. We cannot know whether our sample is one of the 95% for which the interval Xf 9 catches p, or if it is one of the unlucky 5%. The statement that we are 95% confident that the unknown p lies between 452 and 470 is shorthand for saying, 'We got these numbers by a method that gives correct results 95% of the time." The interval of numbers between the values Zf 9 is called a 95% confidence interval for p. Like most confidence intervals we will meet, this one has the form estimate * margin of error margin of error The estimate (Tin this case) is our guess for the value of the unknown parameter. The margin of error +9 shows how accurate we believe our guess is, based on the variability of the estimate. This is a 95% confidence interval because it catches the unknown p in 95% of all possible samples. CONFIDENCE INTERVAL A level C confidence interval for a parameter has two parts: An interval calculated from the data, usually of the form estimate + margin of error A confidence level C, which gives the probability that the interval will capture the true parameter value in repeated samples.

8 10.1 Estimating with ~ o n a ~ Users can choose the confidence level, most often 90% or higher because we most often want to be quite sure of our conclusions. We will use C to stand for the confidence level in decimal form. For example, a 95% confidence level corresponds to C = Figure 10.3 is one way to picture the idea of a 95% confidence interval. Figure 10.4 illustrates the idea in a different form. Study these figures carefully. If you understand what they say, you have mastered one of the big ideas of statistics. Figure 10.4 shows the result of drawing many SRSs from the same population and calculating a 95% confidence interval from each sample. The center of each interval is at Kand therefore varies from sample to sample. The sampling distribution of yappears at the top of the figure to show the long-term pattern of this variation. The 95% confidence intervals from 25 SRSs appear below. The center jl of each interval is marked by a dot. The arrows on either side of the dot span the confidence interval. All except one of these 25 intervals cover the true value of p. In a very large number of samples, 95% of the confidence intervals would contain p. FIGURE 10.4 Twenty-five samples from the same population gave these 95% confidence intervals. In the long run, 95% of all samples give an interval that contains the population mean p.

9 / Chapter 10 Introduction to Inference EXERCISES 10.1 POLLING WOMEN A New York Times poll on women's issues interviewed 1025 women randomly selected from the United States, excluding Alaska and Hawaii. The poll found that 47% of the women said they do not get enough time for themselves. (a) The poll announced a margin of error of +3 percentage points for 95% confidence in its conclusions. What is the 95% confidence interval for the percent of all adult women who think they do not get enough time for.themselves? (b) Explain to someone who knows no statistics why we can't just say that 47% of all adult women do not get enough time for themselves. (c) Then explain clearly what "95% confidence" means NAEP SCORES Young people have a better chance of full-time employment and good wages if they are good with numbers. How strong are the quantitative skills of young Americans of working age? One source of data is the National Assessment of Educational Progress (NAEP) Young Adult Literacy Assessment Survey, which is based on a nationwide probability sample of households. The NAEP survey includes a short test of quantitative skills, covering mainly basic arithmetic and the ability to apply it to realistic problems. Scores on the test range from 0 to 500. For example, a person who scores 233 can add the amounts of two checks appearing on a bank deposit slip; someone scoring 325 can determine the price of a meal from a menu; a person scoring 375 can transform a price in cents per ounce into dollars per pound.' Suppose that you give the NAEP test to an SRS of 840 people from a large population in which the scores have mean 280 and standard deviation a = 60. The mean f of the 840 scores will vary if you take repeated samples. (a) Describe the shape, center, and spread of the sampling distribution of l (b) Sketch the normal curve that describes how T varies in many samples from this population. Mark its mean and the values one, two, and three standard deviations on either side of the mean. (c) According to the rule, about 95% of all the values of ST fall within of the mean of this curve. What is the missing number? Call it m for "margin of error." Shade the region from the mean minus m to the mean plus m on the axis of your sketch, as in Figure (d) Whenever ST falls in the region you shaded, the true value of the population mean, p = 280, lies in the confidence interval between ST- m and E+ m. Draw the confidence interval below your sketch for one value off inside the shaded region and one value of T outside the shaded region. (Use Figure 10.4 as a model for the drawing.) (e) In what percent of all samples will the true mean p = 280 be covered by the confidence interval f i m? 10.3 EXPLAINING CONFIDENCE A student reads that a 95% confidence interval for the mean NAEP quantitative score for men of ages 21 to 25 is to Asked to explain the meaning of this interval, the student says, "95% of all young men have scores between and " Is the student right? Justify your answer AUTO EMISSIONS Oxides of nitrogen (called NOX for short) emitted by cars and trucks are important contributors to air pollution. The amount of NOX emitted by a

10 Estimating with Confidence particular model varies from vehicle to vehicle. For one light truck model, NOX emissions vary with mean p that is unknown and standard deviation o = 0.4 grams per mile. You test an SRS of 50 of these trucks. The sample mean NOX level f estimates the unknown p. You will get different values of f if you repeat your sampling. (a) Describe the shape, center, and spread of the sampling distribution of ST. (b) Sketch the normal curve for the sampling distribution of f. Mark its mean and the values one, two, and three standard deviations on either side of the mean. (c) According to the rule, about 95% of all values of f lie within a distance m of the mean of the sampling distribution. What is m? Shade the region on the axis of your sketch that is within m of the mean, as in Figure (d) Whenever f falls in the region you shaded, the unknown population mean p lies in the confidence interval Yf m. For what percent of all possible samples does this happen? (e) Following the style of Figure 10.4, draw the confidence intervals below your sketch for two values of ST, one that falls within the shaded region and one that falls outside it. Confidence interval for a population mean We can now give the recipe for a level C confidence interval for the mean p of a population when the data come from an SRS of size n. The construction of the interval depends on the fact that the sampling distribution of the sample mean 55 is at least approximately normal. This distribution is exactly normal if the population distribution is normal. When the population distribution is not normal, the central limit theorem tells us that the sampling distribution of will be approximately normal if n is sufficiently large. Be sure to check that these conditions are satisfied before you construct a confidence interval. I CONDITIONS FOR CONSTRUCTING A CONFIDENCE INTERVAL FORp The construction of a confidence interval for a population mean p is appropriate when the data come from an SRS from the population of interest, and. the sampling distribution of 5~ is approximately normal. b Our construction of a 95% confidence interval for the mean SAT Math score began by noting that any normal distribution has probability about 0.95 within 2 standard deviations of its mean. To construct a level C confidence interval, we want to catch the central probability C under a normal curve. To do that, we must go out z" standard deviations on either side of the mean. Since any normal distribution can be standardized, we can get the value z" from the standard normal table. Here is an example of how to find 2'.

11 P Chapter 10 Introduction to Inference EXAMPLE 10.4 FINDING z* To find an 80% confidence interval, we must catch the central 80% of the normal sampling distribution of 7. In catching the central 80% we leave out 20%, or 10% in each tail. So Z* is the point with area 0.1 to its right (and 0.9 to its left) under the standard normal curve. Search the body of Table A to find the point with area 0.9 to its left. The closest entry is z" = There is area 0.8 under the standard normal curve between and Figure 10.5 shows how z" is related to areas under the curve. FIGURE 10.5 The central probability 0.8 under a standard normal curve lies between and That is. there is area 0.1 to the riaht of 1.28 under the curve. Figure 10.6 shows the general situation for any confidence level C. If we catch the central area C, the leftover tail area is 1 - C, or (1- C)/2 on each side. You can find z' for any C by searching Table A. Here are the results for the most common confidence levels: Confidence level Tail Area Notice that for 95% confidence we use z* = This is more exact than the approximate value zq = 2 given by the rule. The bottom row in Table C gives the values of zq for many confidence levels C. This row is labeled z*. (You can find Table C inside the rear cover. We will use the other rows of the table in the next chapter.) Values z" that mark off a specified area under the standard normal curve are often called critical values of the distribution.

12 10.1 Estimating with ~ o n % % i ~ n I I I FIGURE 10.6 In general, the central probability C under a standard normal curve lies between -z* and z*. Because z* has area (1-C)/2 to its right under the curve, we call it the upper (1-C)/2 critical value. CRITICAL VALUES The number z* with probability fl lying to its right under the standard normal curve is called the upper P critical value of the standard normal distribution.. b I Here's the thinking that leads to the level C confidence interval: Any normal curve has probability C between the point z" standard deviations below its mean and the point z" standard deviations above its mean. The standard deviation of the sampling distribution of r is UI &, and its mean is the population mean p. SO there is probability C that the observed sample mean T takes a value between p-zq- (T 0- & and p+z"- Whenever this happens, the population mean,u is contained between & * cr c Z-z - and Z+z*- & That is our confidence interval. The estimate of the unknown p is y, and the margin of error is z" (T / &. &

13 -10 Introduction to Inference CONFIDENCE INTERVAL FOR A POPULATION MEAN Choose an SRS of size n from a population having unknown mean p and known standard deviation cr. A level C confidence interval for p is Here 29s the value with area C between -2" and z' under the standard normal curve. This interval is exact when the population distribution is normal and is approximately correct for large n in other cases. EXAMPLE VIDEO SCREEN TENSION A manufacturer of high-resolution video terminals must control the tension on the mesh of fine wires that lies behind the surface of the viewing screen. Too much tension will tear the mesh and too little will allow wrinkles. The tension is measured by an electrical device with output readings in millivolts (mv). Some variation is inherent in the production process. Careful study has shown that when the process is operating properly, the standard deviation of the tension readings is cr = 43 mv. Here are the tension readings from an SRS of 20 screens from a single day's production Construct a 90% confidence interval for the mean tension p of all the screens produced on this day. St+ 1: Identify the population of interest and the parameter you want to draw conclu- sions about. The population of interest is all of the video terminals produced on the day in question. We want to estimate p, the mean tension for all of these screens. Step 2: Choose the appropriate inference procedure. Vm'fy the conditions for using the selected procedure. Since we know a, we should use the confidence interval for a pop ulation mean that was just introduced to estimate p. Now we must check that the two required conditions-(1) SRS from the population of interest and (2) sampling distribution of f approximately normal-are met. The data come from an SRS of 20 screens from the population of all screens produced that day. Is the sampling distribution off approximately normal? Past experience suggests that the tension readings of screens produced on a single day follow a normal distribution quite closely. If the population distribution is normal, the sampling distribution of f will be normal. Let's examine the sample data.

14 Estimating with Confideng- ' - - Y 1,y: 312 means to I I I FIGURE 10.7 A rternplot (a) and a normal probability plot (b) of the video screen tension r e g ings for Example A stemplot of the tension readings (Figure 10.7(a)) shows no outliers or strong skewness. The norman probability plot in Figure 10.7(b) tells us that the sample data are approximately normally distributed. These data give us no reason to doubt the normality of the population from which they came. Step 3: lfthe conditions are met, cany out fhe inference procedure. You can check that the mean tension reading for the 20 screens in our sample is T = mv. The confidence interval formula is jz f z *a 16. For a 90% confidence level, the critical value is Z* = So the 90% confidence interval for p is Stefi 4: Interpret your results in the context ofthe problem. We are90% confident that the true mean tension in the entire batch of video terminals produced that day is between and mv. Suppose that a single computer screen had a tension reading of mv, the same value as the sample mean in Example Repeating the calculation with n = 1 shows that the 90% confidence interval based on a single measurement is The mean of twenty measurements gives a smaller margin of error and therefore a shorter interval than a single measurement. ~ i ~ i10.8 r e illustrates the gain from using 20 observations. FIGURE 10.8 Confidence intervals for n = 20 ar~d ~n = 1 for Example Larger samples give shorter intervals.

15 0 Introduction to Inference We will use the four-step process of Example 10.5 throughout our study of statistical inference. You can think of this general structure as your Inference Toolbox. Specific inference procedures (tools) will be added to your toolbox for use in a variety of settings. Examples that use the Inference Toolbox will be INFERENCE TOOLBOX Confidence intervals ' To construct a oonfi$pncs interval: -!-.. 2 SF0 I: ~$p~~~~~~~fintexst Band the parameteeii'pu.+g&!,o &ia&~~4rd~_s~ns about. - sikf0: choose the ~ ~~~n3@&~h@e'. P ~ P ~ ~ ~ 'venfythe d ~ ccjn&@i&~&&i&&$e.&]:mi& e e pro~duh. stp~ 1; (. 3: If he rond~hf&&$?&$.h&$6utike infigenice procgdui&'!'!. --7 U.. ' '.GI * V &6ijiratk&,fmgagiil of error ;,,, c,:.;. 1 St@ 4: Interpret your rd~its in the kqg~eq&$5:$he_@#~blgm~.,. I The form of confidence intervals for the population mean p rests on the fact that the statistic 55 used to estimate p has a normal distribution. Because many sample statistics have normal distributions (at least approximately), it is useful to notice that the confidence interval has the form estimate f z"u,,imate The estimate based on the sample is the center of the confidence interval. The margin of error is zqaestimate. The desired confidence level determines z" from Table A. The standard deviation of the estimate, aestimate, depends on the particular estimate we use. When the estimate is Tfrom an SRS, the standard deviation of the estimate is u l&. EXERCISES 10.5 ANALYZING PHARMACEUTICALS A manufacturer of pharmaceutical products analyzes a specimen from each batch of a product to verify the concentration of the active ingredient. The chemical analysis is not perfectly precise. Repeated measurements on the same specimen give slightly different results. The results of repeated measurements follow a normal distribution quite closely. The analysis procedure has no bias, so the mean p of the population of all measurements is the true concentration in the specimen. The standard deviation of this distribution is known to be cr = grams per liter. The laboratory analyzes each specimen three times and reports the mean result.

16 10.1 Estimating with ~once- _ Three analyses of one specimen give concentrations Construct a 99% confidence interval for the true concentration p. Use the Inference Toolbox as a guide SURVEYING HOTEL MANAGERS A study of the career paths of hotel general managers sent questionnaires to an SRS of 160 hotels belonging to major U.S. hotel chains. There were 114 responses. The average time these 114 general managers had spent with their current company was years. Give a 99% confidence interval for the mean number of years general managers of major-chain hotels have spent with their current company. (Take it as known that the standard deviation of time with the company for all general managers is 3.2 years.) Use the Inference Toolbox as a guide ENGINE CRANKSHAFTS Here are measurements (in millimeters) of a critical dimension on a sample of auto engine crankshafts: The data come from a production process that is known to have standard deviation rr = mm. The process mean is supposed to be p = 224 mm but can drift away from this target during production. (a) We expect the distribution of the dimension to be close to normal. Make a plot of these data and describe the shape of the distribution. (b) Give a 95% confidence interval for the process mean at the time these crankshafts were produced. How confidence intervals behave The confidence interval X * z " a I & for the mean of a normal population illustrates several important properties that are shared by all confidence intervals in common use. The user chooses the confidence level, and the margin of error follows from this choice. We would like high confidence and also a small margin of error. High confidence says that our method almost always gives correct answers. A small margin of error says that we have pinned down the parameter quite precisely. The margin of error is 0 margin of error = z " - & This expression has z" and 0 in the numerator and & in the denominator. So the margin of error gets smaller when z" gets smaller. Smaller z" is the same as smaller confidence level C (look at Figure 10.6 again). There is a trade-off between the confidence level and the

17 _ Chapter 10 Introduction to Inference margin of error. To obtain a smaller margin of error from the same data, you must be willing to accept lower confidence. u gets smaller. The standard deviation u measures the variation in the population. You can think of the variation among individuals in the population as noise that obscures the average value p. It is easier to pin down p when a is small. n gets larger. Increasing the sample size n reduces the margin of error for any fixed confidence level. Because n appears under a square root sign, we must take four times as many observations in order to cut the margin of error in half. EXAMPLE 10.6 CHANGING THE CONFIDENCE LEVEL Suppose that the manufacturer in Example 10.5 wants 99% confidence rather than 90%. Table C gives the critical value for 99% confidence as z* = The 99% confidence interval for p based on an SRS of 20 video monitors with mean F = is Zf z*-= 0- & k =306.3k 24.8 =(281.5, fi Demanding 99% confidence instead of 90% confidence has increased the margin of error from 15.8 to Figure 10.9 compares these two measurements FIGURE % and 99% confidence intervals for Example Higher confidence requires a longer interval. EXERCISES 10.8 CORN YIELD Crop researchers plant 15 plots with a new variety of corn. The yields in bushels per acre are Assume that (+ = 10 bushels per acre. (a) Find the 90% confidence interval for the mean yield p for this variety of corn. Use your Inference Toolbox. (b) Find the 95% confidence interval. (c) Find the 99% confidence interval. (d) How do the margins of error in (a), (b), and (c) change as the confidence level increases?

18 10.1 Estimating with ton MORE CORN Suppose that the crop researchers in Exercise 10.8 obtained the same value of T from a sample of 60 plots rather than 15. (a) Compute the 95% confidence interval for the mean yield p. (b) Is the margin of error larger or smaller than the margin of error found for the sample of 15 plots in Exercise 10.8? Explain in plain language why the change occurs. (c) Will the 90% and 99% intervals for a sample of size 60 be wider or narrower than those for n = 15? (You need not actually calculate these intervals.) CONFIDENCE LEVEL AND INTERVAL LENGTH Examples 10.5 and 10.6 give confidence intervals for the screen tension p based on 20 measurements with F = and a = 43. The 99% confidence interval is to and the 90% confidence interval is to (a) Find the 80% confidence interval for p. (b) Find the 99.9% confidence interval for p. (c) Make a sketch like Figure 10.9 to compare all four intervals. How does increasing the confidence level affect the length of the confidence interval? SAMPLE SIZE AND MARGIN OF ERROR Find the margin of error for 90% confidence in Example 10.5 if the manufacturer measures the tension of 80 video monitors. Check that your result is half as large as the margin of error based on 20 measurements in Example Choosing the sample size A wise user of statistics never plans data collection without planning the inference at the same time. You can arrange to have both high confidence and a small margin of error by taking enough observations. The margin of error of the confidence interval for the mean of a normally distributed population is rn = z" u/&. To obtain a desired margin of error m, substitute the value of zd for your desired confidence level, set the expression for m less than or equal to the specified margin of error, and solve the inequality for n. The procedure is best illustrated with an example. EXAMPLE 10.7 DETERMINING SAMPLE SIZE Company management wants a report of the mean screen tension for the day's production accurate to within f 5 mv with 95% confidence. How large a sample of video monitors must be measured to comply with this request? For 95% confidence, Table C gives z" = We know that a = 43. Set the margin of error to be at most 5:

19 Introduction to Inference < 5 &- J;;, 0.96)(43) 5 & n so take n = 285 Because n is a whole number, the company must measure the tension of 285 video screens to meet management's demand. On learning the cost of this many measurements, management may reconsider this request! Here is the principle: SAMPLE SIZE FOR DESIRED MARGIN OF ERROR To determine the sample size n that will yield a confidence interval for a population mean with a specified margin of error m, set the expression for the margin of error to be less than or equal to m and solve for n: 0 z*-ern &- In practice, taking observations costs time and money. The required sample size may be impossibly expensive. Do notice once again that it is the size of the sample that determines the margin of error. The size of the fiofiulation (as long as the population is much larger than the sample) does not influence the sample size we need. EXERCISES A BALANCED SCALE? To assess the accuracy of a laboratory scale, a standard weight known to weigh 10 grams is weighed repeatedly. The scale readings are normally distributed with unknown mean (this mean is 10 grams if the scale has no bias). The standard deviation of the scale readings is known to be gram. (a) The weight is weighed five times. The mean result is grams. Give a 98% confidence interval for the mean of repeated measurements of the weight. (b) How many measurements must be averaged to get a margin of error of * with 98% confidence? SURVEYING HOTEL MANAGERS, II How large a sample of the hotel managers in Exercise 10.6 (page 549) would be needed to estimate the mean p within *l year with 99% confidence? ENGINE CRANKSHAFTS, II How large a sample of the crankshafts in Exercise 10.7 (page 549) would be needed to estimate the mean p within k0.020 mm with 95% confidence?

20 Estimating with ~onfi>ence Some cautions Any formula for inference is correct only in specific circumstances. If statistical procedures carried warning labels like those on drugs, most inference methods would have long labels indeed. Our handy formula i f z * u /& for estimating a normal mean comes with the following list of warnings for the user: The data must be an SRS from the population. We are completely safe if we actually carried out the random selection of an SRS. We are not in great danger if the data can plausibly be thought of as observations taken at random from a pop ulation. That is the case in Exercise 10.5 (page 548), where we have in mind the population resulting from a very large number of repeated analyses of the same specimen. The formula is not correct for probability sampling designs more complex than an SRS. Correct methods for other designs are available. We will not discuss confidence intervals based on multistage or stratified samples. If you plan such samples, be sure that you (or your statistical consultant) know how to carry out the inference you desire. There is no correct method for inference from data haphazardly collected with bias of unknown size. Fancy formulas cannot rescue badly produced data. Because X is strongly influenced by a few extreme observations, outliers can have a large effect on the confidence interval. You should search for outliers and try to correct them or justify their removal before computing the interval. If the outliers cannot be removed, ask your statistical consultant about procedures that are not sensitive to outliers. If the sample size is small and the population is not normal, the true confidence level will be different from the value C used in computing the interval. Examine your data carefully for skewness and other signs of nonnormality. The interval relies only on the distribution of T, which even for quite small sample sizes is much closer to normal than the individual observations. When n r 15, the confidence level is not greatly disturbed by nonnormal populations unless extreme outliers or quite strong skewness are present. We will discuss this issue in more detail in the next chapter. You must know the standard deviation a of the population. This unrealistic requirement renders the interval Z f z * a /& of little use in statistical practice. We will learn in the next chapter what to do when u is unknown. However, if the sample is large, the sample standard deviation s will be close to the unknown a. Then i k z * s 1 & is an approximate confidence interval for p. The most important caution concerning confidence intervals is a consequence of the first of these warnings. The margin of error in a confidence interval covers only random sampling errors. The margin of error is obtained from the sampling distribution and indicates how much error can be expected because of chance variation in randomized data production. Practical difficulties, such as undercoverage and nonresponse in a sample survey, can cause additional errors

21 / =-==SK Chapter 10 Introduction to Inference that may be larger than the random sampling error. Remember this unpleasant fact when reading the results of an opinion poll or other sample survey. The practical conduct of the survey influences the trustworthiness of its results in ways that are not included in the announced margin of error. Every inference procedure that we will meet has its own list of warnings. Because many of the warnings are similar to those above, we will not print the full warning label each time. It is easy to state (from the mathematics of probability) conditions under which a method of inference is exactly correct. These conditions are nmr fully met in practice. For example, no population is exactly normal. Deciding when a statistical procedure should be used in practice often requires judgment assisted by exploratory analysis of the data. Finally, you should understand what statistical confidence does not say. We are 95% confident that the mean SAT Math score for all California high school seniors lies between 452 and 470. That is, these numbers were calculated by a method that gives correct results in 95% of all possible samples. We cannot say that the probability is 95% that the true mean falls between 452 and 470. No randomness remains after we draw one particular sample and get from it one particular interval. The true mean either is or is not between 452 and 470. The probability calculations of standard statistical inference describe how often the method gives correct answers. EXERCISES ATALK SHOW OPINION POLL A radio talk show invites listeners to enter a dispute about a proposed pay increase for city council members. "What yearly pay do you think council members should get? Call us with your number." In all, 958 people call. The mean pay they suggest is T = $8740 per year, and the standard deviation of the responses is s = $ For a large sample such as this, s. is very close to the unknown population a. The station calculates the 95% confidence interval for the mean pay p that all citizens would propose for council members to be $8669 to $ (a) Is the station's calculation correct? (b) Does their conclusion describe the population of all the city's citizens? Explain your answer THE 2000 PRESIDENTIAL ELECTION A closely contested presidential election pitted George W. Bush against A1 Gore in A poll taken immediately before the 2000 election showed that 5 1% of the sample intended to vote for Gore. The polling organization announced that they were 95% confident that the sample result was within f 2 points of the true percent of all voters who favored Gore. (a) Explain in plain language to someone who knows no statistics what "95% confident" means in this announcement. (b) The poll showed Gore leading. Yet the polling organization said the election was too close to call. Explain why. (c) On hearing of the poll, a nervous politician asked, "What is the probability that over half the voters prefer Gore?" A statistician replied that this question can't be 1 '

22 10.1 Estimating with confidence 555 answered from the poll results, and that it doesn't even make sense to talk about such a probability. Explain why PRAYER IN THE SCHOOLS The New York TimesICBS News Poll asked the question, "Do you favor an amendment to the Constitution that would permit organized prayer in public schools?" Sixty-six percent of the sample answered 'Yes." The article describing the poll says that it "is based on telephone interviews conducted from Sept. 13 to Sept. 18 with 1,664 adults around the United States, excluding Alaska and Hawaii.... the telephone numbers were formed by random digits, thus permitting access to both listed and unlisted residential numbers." (a) The article gives the margin of error as 3 percentage points. Make a confidence statement about the percent of.all adults who favor a school prayer amendment. (b) The news article goes on to say: "The theoretical errors do not take into account a margin of additional error resulting from the various practical difficulties in taking any survey of public opinion." List some of the "practical difficulties" that may cause errors in addition to the *3% margin of error. Pay particular attention to the news article's description of the sampling method % CONFIDENCE A student reads that a 95% confidence interval for the mean SAT Math score of California high school seniors is 452 to 470. Asked to explain the. meaning of this interval, the student says, "95% of California high school seniors have SAT Math scores between 452 and 470." Is the student right? Justify your answer. SUMMARY %. '.. - I I A confidence interval uses sample data to estimate an unknown population parameter with an indication of how accurate the estimate is and of how confident we are that the result is correct. Any confidence interval has two parts: an interval computed from the data and a confidence level. The interval often has the form estimate f margin of error The confidence level states the probability that the method will give a correct answer. That is, if you use 95% confidence intervals often, in the long run 95% of your intervals will contain the true parameter value. You do not know whether a 95% confidence interval calculated from a particular set of data contains the true parameter value. A level C confidence interval for the mean p of a normal population with known standard deviation 0, based on an SRS of size n, is given by I The critical value z" is chosen so that the standard normal curve has area C between -zq and zq. Because of the central limit theorem, this interval is approximately correct for large samples when the population is not normal.

23 Introduction to Inference Other things being equal, the margin of error of a confidence interval gets smaller as the confidence level C decreases the population standard deviation a decreases the sample size n increases The sample size required to obtain a confidence interval with specified margin of error m for a normal mean is found by setting u z"-ern.if;- and solving for n, where z' is the critical value for the desired level of confidence. Always round n up when you use this formula. A specific confidence interval recipe is correct only under specific conditions. The most important conditions concern the method used to produce the data. Other factors such as the form of the population distribution may also be important. Use the Inference Toolbox (page 548) as a guide when you construct a confidence interval. SECTION 10.1 EXERCISES WHO SHOULD GET WELFARE? A news article on a Gallup Poll noted that "28 percent of the 1548 adults questioned felt that those who were able to work should be taken off welfare." The article also said, "The margin of error for a sample size of 1548 is plus or minus three percentage points." Opinion polls usually announce margins of error for 95% confidence. Using this fact, explain to someone who knows no statistics what "margin of error plus or minus three percentage points" means HOTEL COMPUTER SYSTEMS How satisfied are hotel managers with the computer systems their hotels use? A survey was sent to 560 managers in hotels of size 200 to 500 rooms in Chicago and Detr~it.~ In all, 135 managers returned the survey. Two questions concerned their degree of satisfaction with the ease of use of their computer systems and with the level of computer training they had received. The managers responded using a seven-point scale, with 1 meaning "not satisfied," 4 meaning "moderately satisfied," and 7 meaning "very satisfied." (a) What do you think is the population for this study? There are some major shortcomings in the data production. What are they? These shortcomings reduce the value of the formal inference you are about to do. (b) The measurements of satisfaction are certainly not normally distributed, because they take only whole-number values from 1 to 7. Nonetheless, the use of confidence intervals based on the normal distribution is justified for this study. Why?

24 10.1 Estimating with ~onfiden~-1=~ s: - (c) The mean response for satisfaction with ease of use was jt = Give a 95% confidence interval for the mean in the entire population. (Assume that the population standard deviation is u = 1.75.) Use your Inference Toolbox. (d) For satisfaction with training, the mean response was?i = Taking u = 1.75, give a 99% confidence interval for the population mean A NEWSPAPER POLL A New York Times poll on women's issues interviewed 1025 women and 472 men randomly selected from the United States, excluding Alaska and Hawaii. The poll announced a margin of error off 3 percentage points for 95% confidence in conclusions about women. The margin of error for results concerning men was 24 percentage points. Why is this larger than the margin of error for women? HEALING OF SKIN WOUNDS Biologists studying the healing of skin wounds measured the rate at which new cells closed a razor cut made in the skin of an anesthetized newt. Here are data from 18 newts, measured in micrometers (millionths of a meter) per hour:? (a) Make a stemplot of the healing rates (split the stems). It is difficult to assess normality from 18 observations, but look for outliers or extreme skewness. Now make a normal probability plot. What do you find? (b) Scientists usually assume that animal subjects are SRSs from their species or genetic type. Treat these newts as an SRS and suppose you know that the standard deviation of healing rates for this species of newt is 8 micrometers per hour. Give a 90% confidence interval for the mean healing rate for the species. (c) A friend who knows almost no statistics follows the formula X f r * u 1 & in a biology lab manual to get a 95% confidence interval for the mean. Is her interval wider or narrower than yours? Explain to her why it makes sense that higher confidence changes the length of the interval MORE NEWTS! How large a sample would enable you to estimate the mean healing rate of skin wounds in newts (see Exercise 10.22) within a margin of error of 1 micrometer per hour with 90% co&dence? DETERMINING SAMPLE SIZE Researchers planning a study of the reading ability of third-grade children want to obtain a 95% confidence interval for the population mean score on a reading test, with margin of error no greater than 3 points. They carry out a small pilot study to estimate the variability of test scores. The sample standard deviation is s = 12 points in the pilot study, so in preliminary calculations the researchers take the population standard deviation to be u = 12. (a) The study budget will allow as many as 100 students. Calculate the margin of error of the 95% confidence interval for the population mean based on n = 100. (b) There are many other demands on the research budget. If all of these demands were met, there would be funds to measure only 10 children. What is the margin of error of the confidence interval based on n = 10 measurements?

25 - 358 chapter 10 Introduction to Inference! (c) Find the smallest value of n that would satisfy the goal of a 95% confidence interval with margin of error 3 or less. Is this sample size within the limits of the budget? HOW THE POLL WAS CONDUCTED The New York Times includes a box entitled "How the poll was conducted" in news articles about its own opinion polls. Here are quotations from one such box (March 26,1995). The box also announced a margin of error off 3%, with 95% confidence. The latest New York TimeslCBS News poll is based on telephone intervims conducted March 9 through 12 with 1,156 adults around the United States, excluding Alaska and Hawaii. [The box then describes random digit dialing, the method used to select the sample.] In addition to sampling error, the practical difficulties of conducting any survey of public opinion may introduce other sources of error into the poll. Variations in question wording or in the order of questions, for instance, can lead to somewhat different results. (a) This account mentions several sources of possible errors in the poll's results. List these sources. (b) Which of the sources of error you listed in (a) are covered by the announced margin of error? CALCULATING CONFIDENCE INTERVALS (a) Use your TI-83/89 to find the confidence interval for the mean rate of healing for newts in Exercise Follow the Technology Toolbox below. (b) If you have summary statistics but not raw data, you would select "Stats" as your input method and then provide the sample mean 5i, the population standard deviation a, and the number n of observations. Use your calculator in this way to find the confidence interval for the mean number of years hotel managers have spent with their current company in Exercise 10.6 (page 549). Caution: Calculating the confidence interval is only one part of the inference process. Follow the steps in your Inference Toolbox. TECHNOLOGY TOOLBOX Calculator confidence intervals You can use your TI to construct confidence interval's, using dher data stored in a list or summaj statistics. In Exercise 10.5, for example, we would begin by entering the three specimen concentration; ' into Llnistl. TI-8 3 TI-89 Press m, choose TESTS, and then choose ress 12nd1m(m), choose 1:Zinterval P 7:Zinterval... Adjust your settings as shown. Then choose "Calculate." I hoose Input Method = Data. djust your settings is shown. ~resi-.

26 10.2 TESTS OF SIGNIFICANCE Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population parameter. The second common type of inference, called tests of significance, has a different goal: to assess the evidence provided by data about some claim concerning a population. Here is the reasoning of statistical tests in a nutshell. EXAMPLE 10.8 I'M A GREAT FREE-THROW SHOOTER I claim that I make 80% of my basketball free throws. To test my claim, you ask me to shoot 20 free throws. I make only 8 of the 20. "Aha!" you say. "Someone who makes 80% of his free throws would almost never make only 8 out of 20. So I don't believe your claim." Your reasoning is based on asking what would happen if my claim were true and we repeated the sample of 20 free throws many times-i would almost never make as few as 8. This outcome is so unlikely that it gives strong evidence that my claim is not true. You can say how strong the evidence against my claim is by giving the probability that I would make as few as 8 out of 20 free throws if I really make 80% in the long run. This probability is I would make as few as 8 of 20 only once in 10,000 tries in the long run if my claim to make 80% is true. The small probability convinces you that my claim is false.

***SECTION 10.1*** Confidence Intervals: The Basics

***SECTION 10.1*** Confidence Intervals: The Basics SECTION 10.1 Confidence Intervals: The Basics CHAPTER 10 ~ Estimating with Confidence How long can you expect a AA battery to last? What proportion of college undergraduates have engaged in binge drinking?

More information

Confidence Intervals. Chapter 10

Confidence Intervals. Chapter 10 Confidence Intervals Chapter 10 Confidence Intervals : provides methods of drawing conclusions about a population from sample data. In formal inference we use to express the strength of our conclusions

More information

Statistical Inference

Statistical Inference Statistical Inference Chapter 10: Intro to Inference Section 10.1 Estimating with Confidence "How good is your best guess?" "How confident are you in your method?" provides methods for about a from the.

More information

10.1 Estimating with Confidence. Chapter 10 Introduction to Inference

10.1 Estimating with Confidence. Chapter 10 Introduction to Inference 10.1 Estimating with Confidence Chapter 10 Introduction to Inference Statistical Inference Statistical inference provides methods for drawing conclusions about a population from sample data. Two most common

More information

The following command was executed on their calculator: mean(randnorm(m,20,16))

The following command was executed on their calculator: mean(randnorm(m,20,16)) 8.1- Confidence Intervals: The Basics Introduction How long does a new model of laptop battery last? What proportion of college undergraduates have engaged in binge drinking? How much does the weight of

More information

CHAPTER 8 Estimating with Confidence

CHAPTER 8 Estimating with Confidence CHAPTER 8 Estimating with Confidence 8.1 Confidence Intervals: The Basics The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Confidence Intervals: The

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimating with Confidence Section 8.1 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Introduction Our goal in many statistical settings is to use a sample statistic

More information

Chapter 8 Estimating with Confidence

Chapter 8 Estimating with Confidence Chapter 8 Estimating with Confidence Introduction Our goal in many statistical settings is to use a sample statistic to estimate a population parameter. In Chapter 4, we learned if we randomly select the

More information

CHAPTER 5: PRODUCING DATA

CHAPTER 5: PRODUCING DATA CHAPTER 5: PRODUCING DATA 5.1: Designing Samples Exploratory data analysis seeks to what data say by using: These conclusions apply only to the we examine. To answer questions about some of individuals

More information

Thinking about Inference

Thinking about Inference CHAPTER 15 c Jupiterimages/Age fotostock Thinking about Inference IN THIS CHAPTER WE COVER... To this point, we have met just two procedures for statistical inference. Both concern inference about the

More information

Chapter 5: Producing Data

Chapter 5: Producing Data Chapter 5: Producing Data Key Vocabulary: observational study vs. experiment confounded variables population vs. sample sampling vs. census sample design voluntary response sampling convenience sampling

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

CHAPTER 8 Estimating with Confidence

CHAPTER 8 Estimating with Confidence CHAPTER 8 Estimating with Confidence 8.1b Confidence Intervals: The Basics The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Confidence Intervals: The

More information

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion Chapter 8 Estimating with Confidence Lesson 2: Estimating a Population Proportion What proportion of the beads are yellow? In your groups, you will find a 95% confidence interval for the true proportion

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Module 28 - Estimating a Population Mean (1 of 3)

Module 28 - Estimating a Population Mean (1 of 3) Module 28 - Estimating a Population Mean (1 of 3) In "Estimating a Population Mean," we focus on how to use a sample mean to estimate a population mean. This is the type of thinking we did in Modules 7

More information

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion Chapter 8 Estimating with Confidence Lesson 2: Estimating a Population Proportion Conditions for Estimating p These are the conditions you are expected to check before calculating a confidence interval

More information

Section 3.2 Least-Squares Regression

Section 3.2 Least-Squares Regression Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.

More information

Statistics for Psychology

Statistics for Psychology Statistics for Psychology SIXTH EDITION CHAPTER 3 Some Key Ingredients for Inferential Statistics Some Key Ingredients for Inferential Statistics Psychologists conduct research to test a theoretical principle

More information

PROBABILITY Page 1 of So far we have been concerned about describing characteristics of a distribution.

PROBABILITY Page 1 of So far we have been concerned about describing characteristics of a distribution. PROBABILITY Page 1 of 9 I. Probability 1. So far we have been concerned about describing characteristics of a distribution. That is, frequency distribution, percentile ranking, measures of central tendency,

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Math 140 Introductory Statistics Professor Silvia Fernández Sample surveys and experiments Most of what we ve done so far is data exploration ways to uncover, display, and describe patterns in data. Unfortunately,

More information

Handout 16: Opinion Polls, Sampling, and Margin of Error

Handout 16: Opinion Polls, Sampling, and Margin of Error Opinion polls involve conducting a survey to gauge public opinion on a particular issue (or issues). In this handout, we will discuss some ideas that should be considered both when conducting a poll and

More information

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions Copyright 2010, 2007, 2004 Pearson Education, Inc. Standard Error Both of the sampling distributions we ve looked at are Normal. For proportions For means

More information

Chapter 19. Confidence Intervals for Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 19. Confidence Intervals for Proportions. Copyright 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions Copyright 2010 Pearson Education, Inc. Standard Error Both of the sampling distributions we ve looked at are Normal. For proportions For means SD pˆ pq n

More information

Probability Models for Sampling

Probability Models for Sampling Probability Models for Sampling Chapter 18 May 24, 2013 Sampling Variability in One Act Probability Histogram for ˆp Act 1 A health study is based on a representative cross section of 6,672 Americans age

More information

Quiz 4.1C AP Statistics Name:

Quiz 4.1C AP Statistics Name: Quiz 4.1C AP Statistics Name: 1. The school s newspaper has asked you to contact 100 of the approximately 1100 students at the school to gather information about student opinions regarding food at your

More information

Math 1680 Class Notes. Chapters: 1, 2, 3, 4, 5, 6

Math 1680 Class Notes. Chapters: 1, 2, 3, 4, 5, 6 Math 1680 Class Notes Chapters: 1, 2, 3, 4, 5, 6 Chapter 1. Controlled Experiments Salk vaccine field trial: a randomized controlled double-blind design 1. Suppose they gave the vaccine to everybody, and

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimating with Confidence Key Vocabulary: point estimator point estimate confidence interval margin of error interval confidence level random normal independent four step process level C confidence

More information

9. Interpret a Confidence level: "To say that we are 95% confident is shorthand for..

9. Interpret a Confidence level: To say that we are 95% confident is shorthand for.. Mrs. Daniel AP Stats Chapter 8 Guided Reading 8.1 Confidence Intervals: The Basics 1. A point estimator is a statistic that 2. The value of the point estimator statistic is called a and it is our "best

More information

State of Connecticut Department of Education Division of Teaching and Learning Programs and Services Bureau of Special Education

State of Connecticut Department of Education Division of Teaching and Learning Programs and Services Bureau of Special Education State of Connecticut Department of Education Division of Teaching and Learning Programs and Services Bureau of Special Education Introduction Steps to Protect a Child s Right to Special Education: Procedural

More information

1. Find the appropriate value for constructing a confidence interval in each of the following settings:

1. Find the appropriate value for constructing a confidence interval in each of the following settings: AP Statistics Unit 06 HW #4 Review Name Period 1. Find the appropriate value for constructing a confidence interval in each of the following settings: a. Estimating a population proportion p at a 94% confidence

More information

Ch. 1 Collecting and Displaying Data

Ch. 1 Collecting and Displaying Data Ch. 1 Collecting and Displaying Data In the first two sections of this chapter you will learn about sampling techniques and the different levels of measurement for a variable. It is important that you

More information

Confidence Intervals and Sampling Design. Lecture Notes VI

Confidence Intervals and Sampling Design. Lecture Notes VI Confidence Intervals and Sampling Design Lecture Notes VI Statistics 112, Fall 2002 Announcements For homework question 3(b), assume that the true is expected to be about in calculating the sample size

More information

Lecture 12A: Chapter 9, Section 1 Inference for Categorical Variable: Confidence Intervals

Lecture 12A: Chapter 9, Section 1 Inference for Categorical Variable: Confidence Intervals Looking Back: Review Lecture 12A: Chapter 9, Section 1 Inference for Categorical Variable: Confidence Intervals! Probability vs. Confidence! Constructing Confidence Interval! Sample Size; Level of Confidence!

More information

Chapter 3. Producing Data

Chapter 3. Producing Data Chapter 3. Producing Data Introduction Mostly data are collected for a specific purpose of answering certain questions. For example, Is smoking related to lung cancer? Is use of hand-held cell phones associated

More information

Vocabulary. Bias. Blinding. Block. Cluster sample

Vocabulary. Bias. Blinding. Block. Cluster sample Bias Blinding Block Census Cluster sample Confounding Control group Convenience sample Designs Experiment Experimental units Factor Level Any systematic failure of a sampling method to represent its population

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Statistics Final Review Semeter I Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) The Centers for Disease

More information

Chapter 20 Confidence Intervals with proportions!

Chapter 20 Confidence Intervals with proportions! Chapter 20 Confidence Intervals with proportions! Statistic or Type of Variable Parameter Point Estimate Quantitative Categorical (Binary) Any Confidence Interval Point Estimate ± Margin of Error Point

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Exam Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Identify the W's for the description of data. 1) A survey of bicycles parked outside college

More information

Name Class Date. Even when random sampling is used for a survey, the survey s results can have errors. Some of the sources of errors are:

Name Class Date. Even when random sampling is used for a survey, the survey s results can have errors. Some of the sources of errors are: Name Class Date 8-3 Surveys, Experiments, and Observational Studies Going Deeper Essential question: What kinds of statistical research are there, and which ones can establish cause-and-effect relationships

More information

Problems for Chapter 8: Producing Data: Sampling. STAT Fall 2015.

Problems for Chapter 8: Producing Data: Sampling. STAT Fall 2015. Population and Sample Researchers often want to answer questions about some large group of individuals (this group is called the population). Often the researchers cannot measure (or survey) all individuals

More information

Chapter 1 Data Collection

Chapter 1 Data Collection Chapter 1 Data Collection OUTLINE 1.1 Introduction to the Practice of Statistics 1.2 Observational Studies versus Designed Experiments 1.3 Simple Random Sampling 1.4 Other Effective Sampling Methods 1.5

More information

You can t fix by analysis what you bungled by design. Fancy analysis can t fix a poorly designed study.

You can t fix by analysis what you bungled by design. Fancy analysis can t fix a poorly designed study. You can t fix by analysis what you bungled by design. Light, Singer and Willett Or, not as catchy but perhaps more accurate: Fancy analysis can t fix a poorly designed study. Producing Data The Role of

More information

10/4/2007 MATH 171 Name: Dr. Lunsford Test Points Possible

10/4/2007 MATH 171 Name: Dr. Lunsford Test Points Possible Pledge: 10/4/2007 MATH 171 Name: Dr. Lunsford Test 1 100 Points Possible I. Short Answer and Multiple Choice. (36 points total) 1. Circle all of the items below that are measures of center of a distribution:

More information

One-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels;

One-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels; 1 One-Way ANOVAs We have already discussed the t-test. The t-test is used for comparing the means of two groups to determine if there is a statistically significant difference between them. The t-test

More information

Data = collections of observations, measurements, gender, survey responses etc. Sample = collection of some members (a subset) of the population

Data = collections of observations, measurements, gender, survey responses etc. Sample = collection of some members (a subset) of the population Chapter 1: Basic Ideas 1.1 Sampling Statistics = the Science of Data By collecting a limited amount of data, we want to say something about the whole group that we want to study, i.e. we want to say something

More information

SAMPLING AND SAMPLE SIZE

SAMPLING AND SAMPLE SIZE SAMPLING AND SAMPLE SIZE Andrew Zeitlin Georgetown University and IGC Rwanda With slides from Ben Olken and the World Bank s Development Impact Evaluation Initiative 2 Review We want to learn how a program

More information

t-test for r Copyright 2000 Tom Malloy. All rights reserved

t-test for r Copyright 2000 Tom Malloy. All rights reserved t-test for r Copyright 2000 Tom Malloy. All rights reserved This is the text of the in-class lecture which accompanied the Authorware visual graphics on this topic. You may print this text out and use

More information

Sampling Controlled experiments Summary. Study design. Patrick Breheny. January 22. Patrick Breheny Introduction to Biostatistics (BIOS 4120) 1/34

Sampling Controlled experiments Summary. Study design. Patrick Breheny. January 22. Patrick Breheny Introduction to Biostatistics (BIOS 4120) 1/34 Sampling Study design Patrick Breheny January 22 Patrick Breheny to Biostatistics (BIOS 4120) 1/34 Sampling Sampling in the ideal world The 1936 Presidential Election Pharmaceutical trials and children

More information

3.2 Least- Squares Regression

3.2 Least- Squares Regression 3.2 Least- Squares Regression Linear (straight- line) relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these

More information

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010 OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010 SAMPLING AND CONFIDENCE INTERVALS Learning objectives for this session:

More information

STA 291 Lecture 4 Jan 26, 2010

STA 291 Lecture 4 Jan 26, 2010 STA 291 Lecture 4 Jan 26, 2010 Methods of Collecting Data Survey Experiment STA 291 - Lecture 4 1 Review: Methods of Collecting Data Observational Study vs. Experiment An observational study (survey) passively

More information

Margin of Error = Confidence interval:

Margin of Error = Confidence interval: NAME: DATE: Algebra 2: Lesson 16-7 Margin of Error Learning 1. How do we calculate and interpret margin of error? 2. What is a confidence interval 3. What is the relationship between sample size and margin

More information

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES 24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter

More information

CHAPTER 3 METHOD AND PROCEDURE

CHAPTER 3 METHOD AND PROCEDURE CHAPTER 3 METHOD AND PROCEDURE Previous chapter namely Review of the Literature was concerned with the review of the research studies conducted in the field of teacher education, with special reference

More information

Math 124: Module 3 and Module 4

Math 124: Module 3 and Module 4 Experimental Math 124: Module 3 and Module 4 David Meredith Department of Mathematics San Francisco State University September 24, 2009 What we will do today Experimental 1 What we will do today Experimental

More information

If you could interview anyone in the world, who. Polling. Do you think that grades in your school are inflated? would it be?

If you could interview anyone in the world, who. Polling. Do you think that grades in your school are inflated? would it be? Do you think that grades in your school are inflated? Polling If you could interview anyone in the world, who would it be? Wh ic h is be s t Snapchat or Instagram? Which is your favorite sports team? Have

More information

Moore, IPS 6e Chapter 03

Moore, IPS 6e Chapter 03 Page 1 of 7 Moore, IPS 6e Chapter 03 Quizzes prepared by Dr. Patricia Humphrey, Georgia Southern University Researchers are studying the absorption of two drugs into the bloodstream. Each drug is to be

More information

AP Stats Review for Midterm

AP Stats Review for Midterm AP Stats Review for Midterm NAME: Format: 10% of final grade. There will be 20 multiple-choice questions and 3 free response questions. The multiple-choice questions will be worth 2 points each and the

More information

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc. Chapter 23 Inference About Means Copyright 2010 Pearson Education, Inc. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it d be nice to be able

More information

MATH& 146 Lesson 4. Section 1.3 Study Beginnings

MATH& 146 Lesson 4. Section 1.3 Study Beginnings MATH& 146 Lesson 4 Section 1.3 Study Beginnings 1 Populations and Samples The population is the complete collection of individuals or objects that you wish to learn about. To study larger populations,

More information

(a) 50% of the shows have a rating greater than: impossible to tell

(a) 50% of the shows have a rating greater than: impossible to tell q 1. Here is a histogram of the Distribution of grades on a quiz. How many students took the quiz? What percentage of students scored below a 60 on the quiz? (Assume left-hand endpoints are included in

More information

CHAPTER ONE CORRELATION

CHAPTER ONE CORRELATION CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to

More information

Conduct an Experiment to Investigate a Situation

Conduct an Experiment to Investigate a Situation Level 3 AS91583 4 Credits Internal Conduct an Experiment to Investigate a Situation Written by J Wills MathsNZ jwills@mathsnz.com Achievement Achievement with Merit Achievement with Excellence Conduct

More information

Section 1.2 Displaying Quantitative Data with Graphs. Dotplots

Section 1.2 Displaying Quantitative Data with Graphs. Dotplots Section 1.2 Displaying Quantitative Data with Graphs Dotplots One of the simplest graphs to construct and interpret is a dotplot. Each data value is shown as a dot above its location on a number line.

More information

P. 266 #9, 11. p. 289 # 4, 6 11, 14, 17

P. 266 #9, 11. p. 289 # 4, 6 11, 14, 17 P. 266 #9, 11 9. Election. a) Answers will vary. A component is one voter voting. An outcome is a vote for our candidate. Using two random digits, 00-99, let 01-55 represent a vote for your candidate,

More information

Abdul Latif Jameel Poverty Action Lab Executive Training: Evaluating Social Programs Spring 2009

Abdul Latif Jameel Poverty Action Lab Executive Training: Evaluating Social Programs Spring 2009 MIT OpenCourseWare http://ocw.mit.edu Abdul Latif Jameel Poverty Action Lab Executive Training: Evaluating Social Programs Spring 2009 For information about citing these materials or our Terms of Use,

More information

ISC- GRADE XI HUMANITIES ( ) PSYCHOLOGY. Chapter 2- Methods of Psychology

ISC- GRADE XI HUMANITIES ( ) PSYCHOLOGY. Chapter 2- Methods of Psychology ISC- GRADE XI HUMANITIES (2018-19) PSYCHOLOGY Chapter 2- Methods of Psychology OUTLINE OF THE CHAPTER (i) Scientific Methods in Psychology -observation, case study, surveys, psychological tests, experimentation

More information

Introduction; Study design

Introduction; Study design ; Study design Patrick Breheny January 12 Patrick Breheny STA 580: Biostatistics I 1/43 What is statistics? What is biostatistics, and why is it important? The statistical framework Statistics is the science

More information

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1 Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola 1.1-1 Chapter 1 Introduction to Statistics 1-1 Review and Preview 1-2 Statistical Thinking 1-3

More information

Section I: Multiple Choice Select the best answer for each question.

Section I: Multiple Choice Select the best answer for each question. Chapter 1 AP Statistics Practice Test (TPS- 4 p78) Section I: Multiple Choice Select the best answer for each question. 1. You record the age, marital status, and earned income of a sample of 1463 women.

More information

Estimation. Preliminary: the Normal distribution

Estimation. Preliminary: the Normal distribution Estimation Preliminary: the Normal distribution Many statistical methods are only valid if we can assume that our data follow a distribution of a particular type, called the Normal distribution. Many naturally

More information

Chapter 7: Descriptive Statistics

Chapter 7: Descriptive Statistics Chapter Overview Chapter 7 provides an introduction to basic strategies for describing groups statistically. Statistical concepts around normal distributions are discussed. The statistical procedures of

More information

Chapter 3, Section 1 - Describing Relationships (Scatterplots and Correlation)

Chapter 3, Section 1 - Describing Relationships (Scatterplots and Correlation) Chapter 3, Section 1 - Describing Relationships (Scatterplots and Correlation) Investigating relationships between variables is central to what we do in statistics. Why is it important to investigate and

More information

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA

CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA Data Analysis: Describing Data CHAPTER 3 DATA ANALYSIS: DESCRIBING DATA In the analysis process, the researcher tries to evaluate the data collected both from written documents and from other sources such

More information

AP Statistics Chapter 5 Multiple Choice

AP Statistics Chapter 5 Multiple Choice AP Statistics Chapter 5 Multiple Choice 1. A nutritionist wants to study the effect of storage time (6, 12, and 18 months) on the amount of vitamin C present in freeze dried fruit when stored for these

More information

Creative Commons Attribution-NonCommercial-Share Alike License

Creative Commons Attribution-NonCommercial-Share Alike License Author: Brenda Gunderson, Ph.D., 2015 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution- NonCommercial-Share Alike 3.0 Unported License:

More information

Chi Square Goodness of Fit

Chi Square Goodness of Fit index Page 1 of 24 Chi Square Goodness of Fit This is the text of the in-class lecture which accompanied the Authorware visual grap on this topic. You may print this text out and use it as a textbook.

More information

Lesson 1: Distributions and Their Shapes

Lesson 1: Distributions and Their Shapes Lesson 1 Name Date Lesson 1: Distributions and Their Shapes 1. Sam said that a typical flight delay for the sixty BigAir flights was approximately one hour. Do you agree? Why or why not? 2. Sam said that

More information

Never P alone: The value of estimates and confidence intervals

Never P alone: The value of estimates and confidence intervals Never P alone: The value of estimates and confidence Tom Lang Tom Lang Communications and Training International, Kirkland, WA, USA Correspondence to: Tom Lang 10003 NE 115th Lane Kirkland, WA 98933 USA

More information

6 Relationships between

6 Relationships between CHAPTER 6 Relationships between Categorical Variables Chapter Outline 6.1 CONTINGENCY TABLES 6.2 BASIC RULES OF PROBABILITY WE NEED TO KNOW 6.3 CONDITIONAL PROBABILITY 6.4 EXAMINING INDEPENDENCE OF CATEGORICAL

More information

Statistics: Bar Graphs and Standard Error

Statistics: Bar Graphs and Standard Error www.mathbench.umd.edu Bar graphs and standard error May 2010 page 1 Statistics: Bar Graphs and Standard Error URL: http://mathbench.umd.edu/modules/prob-stat_bargraph/page01.htm Beyond the scatterplot

More information

Study Design. Study design. Patrick Breheny. January 23. Patrick Breheny Introduction to Biostatistics (171:161) 1/34

Study Design. Study design. Patrick Breheny. January 23. Patrick Breheny Introduction to Biostatistics (171:161) 1/34 Study design Patrick Breheny January 23 Patrick Breheny Introduction to Biostatistics (171:161) 1/34 in the ideal world In an ideal world, We have a list of everyone in the population of interest We randomly

More information

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research Chapter 11 Nonexperimental Quantitative Research (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters.) Nonexperimental research is needed because

More information

An Experimental Investigation of Self-Serving Biases in an Auditing Trust Game: The Effect of Group Affiliation: Discussion

An Experimental Investigation of Self-Serving Biases in an Auditing Trust Game: The Effect of Group Affiliation: Discussion 1 An Experimental Investigation of Self-Serving Biases in an Auditing Trust Game: The Effect of Group Affiliation: Discussion Shyam Sunder, Yale School of Management P rofessor King has written an interesting

More information

(a) 50% of the shows have a rating greater than: impossible to tell

(a) 50% of the shows have a rating greater than: impossible to tell KEY 1. Here is a histogram of the Distribution of grades on a quiz. How many students took the quiz? 15 What percentage of students scored below a 60 on the quiz? (Assume left-hand endpoints are included

More information

Summer AP Statistic. Chapter 4 : Sampling and Surveys: Read What s the difference between a population and a sample?

Summer AP Statistic. Chapter 4 : Sampling and Surveys: Read What s the difference between a population and a sample? Chapter 4 : Sampling and Surveys: Read 207-208 Summer AP Statistic What s the difference between a population and a sample? Alternate Example: Identify the population and sample in each of the following

More information

Sampling for Success. Dr. Jim Mirabella President, Mirabella Research Services, Inc. Professor of Research & Statistics

Sampling for Success. Dr. Jim Mirabella President, Mirabella Research Services, Inc. Professor of Research & Statistics Sampling for Success Dr. Jim Mirabella President, Mirabella Research Services, Inc. Professor of Research & Statistics Session Objectives Upon completion of this workshop, participants will be able to:

More information

MATH-134. Experimental Design

MATH-134. Experimental Design Experimental Design Controlled Experiment: Researchers assign treatment and control groups and examine any resulting changes in the response variable. (cause-and-effect conclusion) Observational Study:

More information

STA Module 9 Confidence Intervals for One Population Mean

STA Module 9 Confidence Intervals for One Population Mean STA 2023 Module 9 Confidence Intervals for One Population Mean Learning Objectives Upon completing this module, you should be able to: 1. Obtain a point estimate for a population mean. 2. Find and interpret

More information

CAUTIONS ABOUT THE PRACTICE EXAM

CAUTIONS ABOUT THE PRACTICE EXAM Dr. Michael Passer Psych 209, U. of Washington CAUTIONS ABOUT THE PRACTICE EXAM DEAR STUDENT: This practice tests consists of questions from past exams. By taking this practice test, you should gain an

More information

Scientific Inquiry Review

Scientific Inquiry Review Scientific Inquiry Review Adapted from Regentsprep.org Be able to USE AND APPLY the steps of the scientific method to solve a problem and design an experiment: Scientific Method: 1. Make observations about

More information

This means that the explanatory variable accounts for or predicts changes in the response variable.

This means that the explanatory variable accounts for or predicts changes in the response variable. Lecture Notes & Examples 3.1 Section 3.1 Scatterplots and Correlation (pp. 143-163) Most statistical studies examine data on more than one variable. We will continue to use tools we have already learned

More information

Comparing Different Studies

Comparing Different Studies 32 LESSON Comparing Different Studies Types of Studies UNDERSTAND By studying a small group within a larger group, you can make inferences about the larger group. The larger group, called the population,

More information

AP Statistics Exam Review: Strand 2: Sampling and Experimentation Date:

AP Statistics Exam Review: Strand 2: Sampling and Experimentation Date: AP Statistics NAME: Exam Review: Strand 2: Sampling and Experimentation Date: Block: II. Sampling and Experimentation: Planning and conducting a study (10%-15%) Data must be collected according to a well-developed

More information

Chapter 1. Picturing Distributions with Graphs

Chapter 1. Picturing Distributions with Graphs Chapter 1 Picturing Distributions with Graphs Statistics Statistics is a science that involves the extraction of information from numerical data obtained during an experiment or from a sample. It involves

More information

Section 1: Exploring Data

Section 1: Exploring Data Section 1: Exploring Data The following maps the videos in this section to the Texas Essential Knowledge and Skills for Mathematics TAC 111.47(c). 1.01 Introduction to Statistics 1.02 Statistics and Parameters

More information

Planning for a time when you cannot make decisions for yourself

Planning for a time when you cannot make decisions for yourself Planning for a time when you cannot make decisions for yourself An information leaflet for members of the public Version: October 2013 Introduction The Mental Capacity Act 2005 allows you to plan ahead

More information

Chapter 13. Experiments and Observational Studies

Chapter 13. Experiments and Observational Studies Chapter 13 Experiments and Observational Studies 1 /36 Homework Read Chpt 13 Do p312 1, 7, 9, 11, 17, 20, 25, 27, 29, 33, 40, 41 2 /36 Observational Studies In an observational study, researchers do not

More information

Population. population. parameter. Census versus Sample. Statistic. sample. statistic. Parameter. Population. Example: Census.

Population. population. parameter. Census versus Sample. Statistic. sample. statistic. Parameter. Population. Example: Census. Population Population the complete collection of ALL individuals (scores, people, measurements, etc.) to be studied the population is usually too big to be studied directly, then statistics is used Parameter

More information