Elementary statistics. Michael Ernst CSE 140 University of Washington


A dice-rolling game
Two players each roll a die.
The higher roll wins.
Goal: roll as high as you can!
Repeat the game 6 times.

Hypotheses regarding the outcome
Luck
Fraud: a loaded die, or inaccurate reporting
How likely is luck? How do we decide?

Questions that statistics can answer
I am flipping a coin. Is it fair? How confident am I in my answer?
I have two bags of beans, each containing some black and some white beans. I have a handful of beans. Which bag did the handful come from?
I have a handful of beans, and a single bag. Did the handful come from that bag?
Does this drug improve patient outcomes?
Which website design yields greater revenue?
Which baseball player should my team draft?
What premium should an insurer charge?
Which chemical process leads to the best-tasting beer?

What can happen when you roll a die? What is the likelihood of each?

A dice-rolling experiment
Game: roll one die and get paid accordingly:

Roll    1      2      3      4      5      6
Payoff  1 CHF  2 CHF  3 CHF  4 CHF  5 CHF  0 CHF

The player self-reports the die roll and takes the money; there is no verification of the actual roll.
From "Lies in disguise: An experimental study on cheating" by Urs Fischbacher and Franziska Heusi
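As a baseline for judging the self-reported rolls, it may help to know what an honest player earns on average. The sketch below works that out from the payoff table above, assuming a fair die; the names `PAYOFF_CHF` and `expected_payoff` are illustrative and do not come from the slides or the paper.

```python
from fractions import Fraction

# Payoff table from the slide: roll -> payoff in CHF (a 6 pays nothing).
PAYOFF_CHF = {1: 1, 2: 2, 3: 3, 4: 4, 5: 5, 6: 0}

def expected_payoff(payoffs):
    """Average payoff of a fair die when every roll is reported truthfully."""
    return sum(Fraction(p, len(payoffs)) for p in payoffs.values())

honest_mean = expected_payoff(PAYOFF_CHF)
print(float(honest_mean))  # 2.5
```

A group whose average reported payoff sits well above 2.5 CHF is reporting more high-paying rolls than fair dice would typically produce.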

What can happen when you roll two dice?
The possible totals are 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, and 12.
How likely are you to roll 11 or higher? This probability is known as the p value.
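For two fair dice the question can be answered exactly by listing all 36 equally likely outcomes. The following is a minimal sketch of that count; `prob_at_least` is a hypothetical helper name.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# All 36 equally likely (first die, second die) outcomes.
outcomes = list(product(range(1, 7), repeat=2))

# How many outcomes produce each total from 2 to 12.
sum_counts = Counter(a + b for a, b in outcomes)

def prob_at_least(threshold):
    """Exact probability that the total of two fair dice is >= threshold."""
    favorable = sum(n for total, n in sum_counts.items() if total >= threshold)
    return Fraction(favorable, len(outcomes))

print(prob_at_least(11))  # 1/12
```

Only three outcomes qualify, (5, 6), (6, 5) and (6, 6), so the probability is 3/36, or about 8.3%.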

How to compute p values
Via a statistical formula
  Requires you to make assumptions and know which formula to use
Computationally (simulation)
  Run many experiments
  Count the fraction with a better result
  Requires a metric/measurement for "better"
  Requires you to be able to run the experiments
  We will use this approach exclusively
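The computational approach might look like the sketch below, applied to the earlier question of rolling 11 or higher with two dice. The function names, the trial count, and the fixed seed are assumptions made for illustration. The sketch also counts results that are at least as good as the observed one, which is one common reading of "better".

```python
import random

def simulate_p_value(run_experiment, observed, trials=100_000, seed=0):
    """Fraction of simulated experiments whose result is at least `observed`."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    hits = sum(1 for _ in range(trials) if run_experiment(rng) >= observed)
    return hits / trials

def roll_two_dice(rng):
    """One experiment: the total shown by two fair dice (the metric)."""
    return rng.randint(1, 6) + rng.randint(1, 6)

estimate = simulate_p_value(roll_two_dice, observed=11)
print(estimate)  # should land near the exact value 3/36, about 0.083
```

More trials give a steadier estimate at the cost of a longer run.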

Interpreting p values
p value of 5% or less = statistically significant
  This is a convention; there is nothing magical about 5%
Two types of errors may occur in statistical tests:
  False positive (or false alarm, or Type I error): no real effect, but report an effect (through good/bad luck or coincidence)
    If no real effect, a false positive occurs about 1 time in 20
    If there is a real effect, a false positive occurs less often
  False negative (or miss, or Type II error): real effect, but report no effect (through good/bad luck or coincidence)
    The smaller the effect, the more likely a false negative is
    How many die rolls to detect a die that is only slightly loaded?
The larger the sample, the less the likelihood of a false positive or negative
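To get a feel for the slightly loaded die question, the sketch below estimates how often a test would miss the loading for several sample sizes. Everything specific here is an assumption for illustration: the loaded die shows a six with probability 1/5 instead of 1/6, the metric is the number of sixes, and the miss rate is computed exactly from binomial probabilities rather than by the simulation approach these slides favor.

```python
from fractions import Fraction
from math import comb

def pmf(n, p):
    """Exact Binomial(n, p) probabilities for 0..n sixes in n rolls."""
    return [comb(n, i) * p**i * (1 - p)**(n - i) for i in range(n + 1)]

def miss_rate(n, p_null, p_real, alpha=Fraction(1, 20)):
    """False-negative rate of a one-sided test at level alpha with n rolls."""
    null_pmf = pmf(n, p_null)
    # Find the smallest count k with P(sixes >= k | fair die) <= alpha.
    tail, k = Fraction(0), n + 1
    while k > 0 and tail + null_pmf[k - 1] <= alpha:
        k -= 1
        tail += null_pmf[k]
    # Power: chance the loaded die reaches that count; a miss is the rest.
    power = sum(pmf(n, p_real)[k:])
    return 1 - power

FAIR = Fraction(1, 6)
LOADED = Fraction(1, 5)  # hypothetical "slightly loaded" die
for rolls in (60, 240, 600):
    print(rolls, round(float(miss_rate(rolls, FAIR, LOADED)), 2))
```

Under these assumptions the miss rate falls as the number of rolls grows, and a loading closer to 1/6 would need more rolls still.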

A false positive http://xkcd.com/882/
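The arithmetic behind this kind of false positive is short. Assuming twenty independent tests, each using the 5% threshold and none with a real effect (the setup in the linked comic, as I read it), the chance that at least one comes out "significant" can be sketched as follows; the function name is illustrative.

```python
def prob_any_false_positive(num_tests, alpha=0.05):
    """Chance that at least one of `num_tests` independent tests of a
    true null hypothesis falls below the significance threshold."""
    return 1 - (1 - alpha) ** num_tests

print(round(prob_any_false_positive(1), 2))   # 0.05
print(round(prob_any_false_positive(20), 2))  # 0.64
```

So running many tests and reporting only the one that "worked" makes a false alarm more likely than not.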


Correlation ≠ causation
Ice cream sales and murder rates are correlated.
http://xkcd.com/552/

Statistical significance ≠ practical importance

Summary of statistical methodology
1. Decide on a metric (bigger value = better)
2. Observe what you see in the real world
3. Hypothesize that what you saw is normal/typical. This is the null hypothesis.
4. Simulate the real world many times
5. How different is what you observed from the simulations? What percent of the simulation values is the real-world value bigger than?
6. If the percentage is 95% or more, reject the null hypothesis
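The six steps can be sketched end to end on the coin question from earlier in the slides. The observation used here (61 heads in 100 flips) is made up for illustration, as are the function names, the number of simulations, and the seed.

```python
import random

def count_heads(rng, flips):
    """Step 1, the metric: number of heads in `flips` flips of a fair coin."""
    return sum(rng.random() < 0.5 for _ in range(flips))

def percent_below(observed, simulated):
    """Step 5: percent of simulated values that `observed` is bigger than."""
    return 100 * sum(1 for value in simulated if observed > value) / len(simulated)

rng = random.Random(0)  # fixed seed so the run is repeatable
FLIPS = 100
observed_heads = 61     # step 2: a hypothetical real-world observation

# Steps 3 and 4: assume the coin is fair (null hypothesis) and simulate it.
simulated = [count_heads(rng, FLIPS) for _ in range(10_000)]

percent = percent_below(observed_heads, simulated)
reject_null = percent >= 95  # step 6
print(percent, reject_null)
```

With this made-up observation the percentage comes out well above 95, so the sketch rejects the hypothesis that the coin is fair.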

Analogy between hypothesis testing and mathematical proofs
"The underlying logic [of hypothesis testing] is similar to a proof by contradiction. To prove a mathematical statement, A, you assume temporarily that A is false. If that assumption leads to a contradiction, you conclude that A must actually be true."
From the book Think Statistics by Allen Downey

A common error
1. Observe what you see in the real world
2. Decide on a metric (bigger value = better)
This is backwards.
For any observation, there is something unique about it.
Example: Roll dice, then be amazed because "what are the odds you would get exactly that combination of rolls?"
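A quick calculation shows why the "exactly that combination" reaction is misleading: every specific sequence of rolls is equally improbable, so whichever one appears will look remarkable after the fact. This is a minimal sketch assuming fair, independent rolls; `prob_exact_sequence` is a hypothetical helper name.

```python
from fractions import Fraction

def prob_exact_sequence(num_rolls):
    """Probability of any one specific ordered sequence of fair die rolls."""
    return Fraction(1, 6) ** num_rolls

print(prob_exact_sequence(5))  # 1/7776
```

Five rolls always produce some sequence with probability 1/7776, which is why the metric has to be chosen before looking at the data.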

Don't trust your intuition
People have very bad statistical intuition.
It's much better to follow the methodology and do the experiments.