STAT 110: Chapter 1 Introduction to Thinking Statistically

In the Evaluating a Claim of Hearing Loss activity, we encountered several basic examples that required us to think statistically in order to investigate a question of interest. Before we move on to slightly more complex examples, we will discuss some basic definitions that will be used throughout the semester.

SOME BASIC DEFINITIONS

Statistics:

Categorical (or qualitative) data: Measurements that are classified into one of a group of categories.

Numerical (or quantitative) data: Measurements that are recorded on a naturally occurring numerical scale.

Most of what we'll be doing in this course centers on trying to understand a set of information. This set of information is from a...

Population: The complete collection of ALL elements that are of interest for a given problem.

The population is often so big that obtaining all information about its elements is either difficult or impossible. So, we work with a more manageable set of data that we obtain from a...

Sample: A subcollection of elements drawn from a population.

Observation: The collection of measurements from a particular unit in a sample.

Variable: Any measurable characteristic of an observation.

Example 1.1: Alleged Hearing Loss

Consider once again the example presented in an article by Pankratz, Fausti, and Peed titled "A Forced-Choice Technique to Evaluate Deafness in the Hysterical or Malingering Patient." Recall that the subject was correct on 36 out of 100 trials when he was asked to identify whether the tone was played with the red or the blue lightbulb.

Source: Journal of Consulting and Clinical Psychology, 1975, Vol. 43, pp. 421-422.

Identify the following in the context of this example:

Variable of interest:
Data type:
Population of interest:
Sample:

Recall that we carried out a simulation study to determine whether this patient, who was suspected of malingering, had obtained too few correct answers. The results of the simulation study indicate what outcomes we expect from a guessing subject:

Question: Based on the results of this simulation study, do you believe the patient's outcome of 36 correct out of 100 was consistent with guessing, or do these results indicate that he may have been answering incorrectly on purpose in order to mislead the researchers into believing he was hearing impaired?
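The guessing-subject simulation recalled above can be sketched in Python. This is a minimal sketch under stated assumptions, not the tool used in the original activity: it assumes a guessing subject is correct on each trial with probability 0.5, independently of the other trials, and the function names, the seed, and the choice of 1000 repetitions are illustrative.

```python
import random


def simulate_guessing(num_trials=100, num_repeats=1000, seed=0):
    """Simulate many 100-trial sessions for a subject who guesses on every trial.

    Returns a list with the number of correct answers in each simulated session.
    Assumption: each guess is correct with probability 0.5.
    """
    rng = random.Random(seed)
    results = []
    for _ in range(num_repeats):
        correct = sum(rng.random() < 0.5 for _ in range(num_trials))
        results.append(correct)
    return results


def proportion_at_most(results, observed):
    """Share of simulated sessions with `observed` or fewer correct answers."""
    return sum(r <= observed for r in results) / len(results)


counts = simulate_guessing()
print("Average number correct:", sum(counts) / len(counts))
print("Share of sessions with 36 or fewer correct:", proportion_at_most(counts, 36))
```

If the share printed on the last line is very small, an outcome as low as 36 correct would rarely be produced by guessing alone.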

A Statistical Investigation for Example 1.1 from Start to Finish

Research Question
Is there statistical evidence that the subject is intentionally answering incorrectly in order to convince his doctors that he can't hear?

Design Study/Collect Data
The researchers presented the subject with 100 trials of the red/blue lightbulb experiment. They kept track of the number of trials in which he gave a correct answer.

Explore/Summarize Data
Of the 100 trials, he answered only 36 correctly (36%).

Draw Appropriate Inferences Beyond the Sample
The subject's outcome (only 36 correct out of 100 trials) was much lower than what was expected if he were truly deaf, and it was shown to be very unlikely to occur by chance if he were truly deaf. Even though these 100 trials provide just a sample of the subject's overall behavior, the statistical evidence is strong enough to indicate that the actual long-run probability of the subject answering correctly is much lower than we would expect if he were truly deaf (i.e., there is evidence he is not deaf).

Example 1.2: Helper vs. Hinderer?

In a study reported in a November 2007 issue of Nature, researchers investigated whether infants take into account an individual's actions towards others in evaluating that individual as appealing or aversive, perhaps laying the foundation for social interaction (Hamlin, Wynn, and Bloom, 2007). In one component of the study, sixteen 10-month-old infants were shown a "climber" character (a piece of wood with googly eyes glued onto it) that could not make it up a hill in two tries. Then they were shown two scenarios for the climber's next try, one where the climber was pushed to the top of the hill by another character ("helper") and one where the climber was pushed back down the hill by another character ("hinderer"). The infant was alternately shown these two scenarios several times. Then the child was presented with both pieces of wood (the helper and the hinderer) and asked to pick one to play with. The color, shape, and order (left/right) of the toys were varied and balanced out among the 16 infants.

References:
Hamlin, J. Kiley, Karen Wynn, and Paul Bloom. "Social evaluation by preverbal infants." Nature, Volume 450, November 22, 2007.
Rossman, Chance, Cobb, and Holcomb. Introducing Concepts of Statistical Inference. NSF/DUE/CCLI # 0633349.

Research Question: Do 10-month-old infants show a preference for the helper toy over the hinderer toy?

Questions:

1. Why was it important for the researchers to balance out the color, shape, and order of the toys across the study? For example, how would the study results have been affected if the researchers always made the helper toy a blue circle and the hinderer a yellow triangle?

2. Identify the following in the context of this example:
Variable of interest:
Data type:
Population of interest:
Sample:

3. Recall that this study involves 16 infants. If the population of all 10-month-old infants has no real preference for one toy over the other, how many infants do you expect to choose the helper toy? Explain.

4. Suppose that 10 out of 16 infants choose the helper toy (62.5%). Since this value is higher than 50%, a researcher argues that these data show that the majority of all 10-month-old infants would choose the helper toy. What is wrong with this reasoning?

Once again, the key question is how to determine whether the study's result is surprising under the assumption that there is no real preference for one toy over the other in the population of all 10-month-old infants. To answer this, we will simulate the process of 16 infants simply choosing a toy at random, over and over again. Each time we simulate the process, we'll keep track of how many infants out of the 16 chose the helper toy (note that you could also keep track of the number that chose the hinderer toy). Once we've repeated this process a large number of times, we'll have a pretty good sense for what outcomes would be very surprising, somewhat surprising, or not so surprising if the population of all 10-month-old infants has no real preference.

Carry out the Tinkerplots simulation. Note that you should consider the following questions when designing your simulation study:

What are the two possible outcomes on each of the trials? Change the values on your spinner accordingly.

What is the probability that each outcome occurs under the assumption that the population of all 10-month-old infants has no real preference for either toy? Change your spinner accordingly.

Be sure to change the Draw value to 1 since only one infant is choosing a toy at a time.

How many infants were used in this study? Keep this value in mind when setting the Repeat value.

Carry out the simulation study 1000 times overall, keeping track of the number of infants that choose the helper toy in each of the simulated experiments. Sketch in your results below:
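For readers without access to Tinkerplots, the spinner procedure can be mimicked in Python. This is only a rough analogue, not the Tinkerplots workflow itself: it assumes each of the 16 infants independently picks the helper toy with probability 0.5, and the function names, the seed, and the text-based plot are illustrative choices.

```python
import random
from collections import Counter


def simulate_choices(num_infants=16, p_helper=0.5, num_repeats=1000, seed=1):
    """Repeat the 16-infant experiment many times under the no-preference assumption.

    Each simulated infant plays the role of one spin of the spinner.
    Returns the number of infants choosing the helper toy in each repetition.
    """
    rng = random.Random(seed)
    return [
        sum(rng.random() < p_helper for _ in range(num_infants))
        for _ in range(num_repeats)
    ]


def text_dotplot(results, per_mark=10):
    """Build a rough text plot: one '*' per `per_mark` simulated experiments (rounded down)."""
    tally = Counter(results)
    lines = []
    for value in range(min(results), max(results) + 1):
        marks = "*" * (tally[value] // per_mark)
        lines.append(f"{value:2d} | {marks} ({tally[value]})")
    return "\n".join(lines)


helper_counts = simulate_choices()
print(text_dotplot(helper_counts))
```

Each row of the printed plot corresponds to one possible number of infants choosing the helper toy, and the count in parentheses is how many of the 1000 simulated experiments produced that number.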

Questions:

5. What does each dot on this plot represent?

6. Suppose that in the actual study 10 out of 16 infants chose the helper toy. Would this convince you that the majority of the population of all 10-month-old infants had a preference for the helper toy? Why or why not?

7. The actual study results are as follows: 14 out of 16 infants chose the helper toy. Mark this on the axis of your simulation study results above. Based on this statistical investigation, what should the researchers conclude? Recall that their research question was stated as follows: Do 10-month-old infants show a preference for the helper toy over the hinderer toy?

A Statistical Investigation for Example 1.2 from Start to Finish

Research Question
Do 10-month-old infants show a preference for the helper toy over the hinderer toy?

Design Study/Collect Data
The study was conducted as described in the introduction to Example 1.2. Researchers kept track of which toy was chosen by the infant (helper or hinderer).

Explore/Summarize Data
Of the 16 infants, 14 chose the helper toy (87.5%).

Draw Appropriate Inferences Beyond the Sample
The study's outcome (14 of 16 infants choosing the helper) indicated that a strong majority of the sample showed a preference for the helper toy. Furthermore, the outcome was shown to be very unlikely to occur by chance if there were no real preference for one toy over the other in the population of all 10-month-olds. This provides strong statistical evidence that there is a preference for the helper toy in the population of all 10-month-olds, as well.

Big idea: Even though the 16 infants studied are just a sample of the population of all 10-month-olds, we can draw a conclusion about the population of all 10-month-olds.
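The handout judges how surprising an outcome is by simulation alone. As a cross-check, the same chances can be computed exactly, under the assumption that each of the 16 infants chooses independently and picks the helper toy with probability 0.5. The sketch below uses the standard binomial formula; the function names are illustrative and this calculation is not part of the original activity.

```python
from math import comb


def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials with success chance p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def upper_tail(observed, n, p):
    """Probability of `observed` or more successes in n trials."""
    return sum(binomial_pmf(k, n, p) for k in range(observed, n + 1))


# Chance of 10 or more of 16 infants choosing the helper toy with no real preference
print(upper_tail(10, 16, 0.5))  # about 0.227
# Chance of 14 or more of 16 infants choosing the helper toy with no real preference
print(upper_tail(14, 16, 0.5))  # about 0.002
```

Under these assumptions, 10 or more helper choices happens in roughly 23% of experiments, while 14 or more happens in roughly 0.2%, which matches the contrast between the hypothetical outcome in question 6 and the actual outcome in question 7.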

Example 1.3: Are Women Passed Over for Managerial Training?

This example involves possible discrimination against female employees. Suppose a large supermarket chain occasionally selects employees to receive management training. A group of female employees has claimed that they are less likely than male employees of similar qualifications to be chosen for this training. The large employee pool that can be tapped for management training is 60% female and 40% male; however, since the management program began, 9 of the 20 employees chosen for management training were female (only 45%). Do the female employees have a valid statistical argument that they are being discriminated against?

Research Question: Is there statistical evidence for gender discrimination against females?

Questions:

1. Identify the following in the context of this example:
Variable of interest:
Data type:
Population of interest:
Sample:

2. If the selection process were unbiased, how many of the 20 employees selected for management training do you expect to be women? Explain.

Once again, the key question is how to determine whether the result is surprising under the assumption that the selection process is unbiased. To answer this, we will simulate an unbiased selection process, over and over again. Each time we simulate the process, we'll keep track of how many of the 20 employees selected for management training were female. Once we've repeated this process a large number of times, we'll have a pretty good sense for what outcomes would be very surprising, somewhat surprising, or not so surprising if there were no discrimination in the selection process.

Carry out the Tinkerplots simulation. Note that you should consider the following questions when designing your simulation study:

What are the two possible outcomes on each of the trials? Change the values on your spinner accordingly.

What is the probability that each outcome occurs under the assumption that there is no gender discrimination in the selection process? Change your spinner accordingly.

Be sure to change the Draw value to 1 since only one employee was chosen at a time.

How many subjects were there in this study? Keep this value in mind when setting the Repeat value.

Carry out the simulation study 1000 times overall, keeping track of the number of employees chosen for management training who were female in each of the simulated experiments. Sketch in your results below:
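The key difference from the earlier examples is that the spinner is no longer split 50/50. A minimal Python sketch of this unequal spinner follows, assuming each of the 20 selections is independent and lands on "Female" with probability 0.6 (the share of women in the employee pool). The function name, labels, and seed are illustrative, and this is an analogue of the Tinkerplots setup rather than a reproduction of it.

```python
import random


def simulate_selection(num_selected=20, p_female=0.6, num_repeats=1000, seed=2):
    """Simulate an unbiased selection process many times.

    Each selection is one spin of a spinner that is 60% "Female" and 40% "Male".
    Returns the number of women among the 20 selected in each repetition.
    """
    rng = random.Random(seed)
    results = []
    for _ in range(num_repeats):
        draws = rng.choices(
            ["Female", "Male"], weights=[p_female, 1 - p_female], k=num_selected
        )
        results.append(draws.count("Female"))
    return results


female_counts = simulate_selection()
share_at_most_9 = sum(c <= 9 for c in female_counts) / len(female_counts)
print("Average number of women selected:", sum(female_counts) / len(female_counts))
print("Share of simulations with 9 or fewer women:", share_at_most_9)
```

The second printed value estimates how often an unbiased process would select 9 or fewer women out of 20, which is the quantity needed to judge whether the observed result is surprising.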

Questions:

3. What does each dot on this plot represent?

4. Recall that since the management program began, only 9 of the 20 employees chosen for management training were female. Does this outcome convince you that the selection process is biased against women? Why or why not?

A Statistical Investigation for Example 1.3 from Start to Finish

Research Question
Is there statistical evidence for gender discrimination against females?

Design Study/Collect Data
Researchers kept track of the gender of employees chosen for management training since the inception of the management program.

Explore/Summarize Data
Of the 20 employees chosen, 9 were female (45%).

Draw Appropriate Inferences Beyond the Sample
The study's outcome (9 of 20 employees chosen were female) was a bit lower than what was expected if the company were not discriminating. However, the outcome was shown to be somewhat likely to occur by chance even if the company were not discriminating. Even though the 20 employees chosen so far are just a sample of all employees the company might ever choose for management training, we can still draw a conclusion about the true long-run probability of the company choosing a female for management training. In this case, the statistical evidence is not strong enough to indicate that the company is discriminating against females.

Example 1.4: Font Preferences

Researchers carried out a marketing field study in order to study preferences of potential consumers in the U.S. They used silver cardboard boxes to contain chocolate truffles in a forced-choice task. All of the box tops were decorated in the same way, and a white label was attached to each bearing the name "Indulgence" in either Signet font or Salem font. The text on each label was approximately equal-sized. For each of the 40 subjects in the study, one box labeled with the Signet font and another box labeled with the Salem font were placed on a tray, and the subject was simply asked to choose a truffle from one of the two boxes that were on the tray in front of them.

[Figure: the Signet and Salem font styles, and the two tray arrangements. Half of the people were presented one arrangement; the other half were presented the other.]

The researchers aren't sure which font is more appropriate for the label and simply want to know whether the majority of all consumers will choose the truffles with one font more than the other. In the sample of 40 subjects, 30 chose to take a truffle from the box that had Signet font.

Research Question: Do the majority of consumers have a preference for one font over the other?

Questions:

1. Identify the following in the context of this example:
Variable of interest:
Data type:
Population of interest:
Sample:

2. If there were no preference in the population, how many of the 40 consumers do you expect to choose Signet font? Explain.

To gain an understanding of what outcomes we expect to see if there is no real preference in the population of all consumers, we will simulate this experiment under the condition that there is no preference for one font over the other.

Carry out the Tinkerplots simulation. Note that you should consider the following questions when designing your simulation study:

What are the two possible outcomes on each of the trials? Change the values on your spinner accordingly.

What is the probability that each outcome occurs under the assumption that there is no preference in the population? Change your spinner accordingly.

Be sure to change the Draw value to 1.

How many subjects were there in this study? Keep this value in mind when setting the Repeat value.

Carry out the simulation study 1000 times overall, keeping track of the number that choose Signet in each of the simulated experiments. Sketch in your results on the following graph:
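Because the research question asks about a preference for either font, an outcome far above or far below the expected count would both count as surprising. The Python sketch below illustrates one way to account for that. It assumes each of the 40 subjects independently chooses Signet with probability 0.5; the function names, the seed, and the two-sided comparison are illustrative assumptions rather than part of the original activity.

```python
import random


def simulate_font_choices(num_subjects=40, num_repeats=1000, seed=3):
    """Simulate the 40-subject study many times under the no-preference assumption.

    Returns the number of subjects choosing Signet in each repetition.
    """
    rng = random.Random(seed)
    return [
        sum(rng.random() < 0.5 for _ in range(num_subjects))
        for _ in range(num_repeats)
    ]


def share_at_least_as_extreme(results, observed, expected):
    """Share of results at least as far from `expected` as `observed` is, in either direction."""
    distance = abs(observed - expected)
    return sum(abs(r - expected) >= distance for r in results) / len(results)


signet_counts = simulate_font_choices()
# With no preference we expect 20 of 40 to choose Signet; the study observed 30.
print(share_at_least_as_extreme(signet_counts, observed=30, expected=20))
```

The printed value estimates how often a no-preference population would produce a split at least as lopsided as 30 to 10 in favor of either font.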

Questions:

3. What does each dot on this plot represent?

4. In the actual study, 30 of the 40 selected the Signet font. Does this outcome convince you that there is a preference for one font over the other? Why or why not?

A Statistical Investigation for Example 1.4 from Start to Finish

Research Question
Do the majority of consumers have a preference for one font over the other?

Design Study/Collect Data
The study was conducted as described in the introduction to Example 1.4. Researchers kept track of which font was chosen by the consumer (Signet or Salem).

Explore/Summarize Data
Of the 40 consumers, 30 chose the Signet font (75%).

Draw Appropriate Inferences Beyond the Sample
The study's outcome (30 of 40 choosing the Signet font) was much higher than what was expected if there were no real font preference in the population. Furthermore, the outcome was shown to be very unlikely to occur by chance if there were no real preference for one font over the other in the population of all consumers. This provides strong statistical evidence that there is a preference for one font over the other in the population of consumers.

Big idea: Even though the 40 consumers studied are just a sample of the population of all consumers, we can draw a conclusion about the population of all consumers.