Stat 13, Intro. to Statistical Methods for the Life and Health Sciences.

Similar documents
Stat 13, Intro. to Statistical Methods for the Life and Health Sciences.

Study Methodology: Tricks and Traps

Chapter 1: Exploring Data

Reflection Questions for Math 58B

STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS

Biostatistics 3. Developed by Pfizer. March 2018

lab exam lab exam Experimental Design Experimental Design when: Nov 27 - Dec 1 format: length = 1 hour each lab section divided in two

Controlled experiments

What is Statistics? (*) Collection of data Experiments and Observational studies. (*) Summarizing data Descriptive statistics.

CHAPTER 4 Designing Studies

[Source: Freedman, Pisani, Purves (1998) Statistics, 3rd ed., Ch 2.]

STATISTICS & PROBABILITY

AMS 5 EXPERIMENTAL DESIGN

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

1. What is the difference between positive and negative correlations?

Lesson Plan. Class Policies. Designed Experiments. Observational Studies. Weighted Averages

Variable Data univariate data set bivariate data set multivariate data set categorical qualitative numerical quantitative

Unit 1 Exploring and Understanding Data

UNIT II: RESEARCH METHODS

t-test for r Copyright 2000 Tom Malloy. All rights reserved

Chapter 5: Producing Data

Review+Practice. May 30, 2012

Chapter 11: Experiments and Observational Studies p 318

Sampling Controlled experiments Summary. Study design. Patrick Breheny. January 22. Patrick Breheny Introduction to Biostatistics (BIOS 4120) 1/34

Sheila Barron Statistics Outreach Center 2/8/2011

Section 6.1 Sampling. Population each element (or person) from the set of observations that can be made (entire group)

Designing Psychology Experiments: Data Analysis and Presentation

Observational studies; descriptive statistics

Lesson 1: Distributions and Their Shapes

The t-test: Answers the question: is the difference between the two conditions in my experiment "real" or due to chance?

MAT Mathematics in Today's World

Stat 111 Final. 1. What are the null and alternative hypotheses for Alex and Sarah having the same mean lap time?

EPIDEMIOLOGY-BIOSTATISTICS EXAM Midterm 2004 PRINT YOUR LEGAL NAME:

Observational Substance Use Epidemiology: A case study

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

*Karle Laska s Sections: There is NO class Thursday or Friday! Have a great Valentine s Day weekend!

MIDTERM EXAM. Total 150. Problem Points Grade. STAT 541 Introduction to Biostatistics. Name. Spring 2008 Ismor Fischer

Study Design. Study design. Patrick Breheny. January 23. Patrick Breheny Introduction to Biostatistics (171:161) 1/34

Psychology Research Process

Evaluation: Controlled Experiments. Title Text

Chapter 2. The Data Analysis Process and Collecting Data Sensibly. Copyright 2005 Brooks/Cole, a division of Thomson Learning, Inc.

CHAPTER 9: Producing Data: Experiments

THIS PROBLEM HAS BEEN SOLVED BY USING THE CALCULATOR. A 90% CONFIDENCE INTERVAL IS ALSO SHOWN. ALL QUESTIONS ARE LISTED BELOW THE RESULTS.

Georgina Salas. Topics EDCI Intro to Research Dr. A.J. Herrera

Making comparisons. Previous sessions looked at how to describe a single group of subjects However, we are often interested in comparing two groups

Overview: Part I. December 3, Basics Sources of data Sample surveys Experiments

Chapter 4 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

In the broadest sense of the word, the definition of research includes any gathering of data, information, and facts for the advancement of knowledge.

66 Questions, 5 pages - 1 of 5 Bio301D Exam 3

CHAPTER 5: PRODUCING DATA

STAT 111 SEC 006 PRACTICE EXAM 1: SPRING 2007

IAPT: Regression. Regression analyses

Evaluation: Scientific Studies. Title Text

CHAPTER OBJECTIVES - STUDENTS SHOULD BE ABLE TO:

The Practice of Statistics 1 Week 2: Relationships and Data Collection

STAT 113: PAIRED SAMPLES (MEAN OF DIFFERENCES)

Welcome to this series focused on sources of bias in epidemiologic studies. In this first module, I will provide a general overview of bias.

Lies, damn lies, & clinical trials. Paul Young

Simple Sensitivity Analyses for Matched Samples Thomas E. Love, Ph.D. ASA Course Atlanta Georgia

NORTH SOUTH UNIVERSITY TUTORIAL 1

In this second module in the clinical trials series, we will focus on design considerations for Phase III clinical trials. Phase III clinical trials

Week 10 Hour 1. Shapiro-Wilks Test (from last time) Cross-Validation. Week 10 Hour 2 Missing Data. Stat 302 Notes. Week 10, Hour 2, Page 1 / 32

Understandable Statistics

Research Methods. Observational Methods. Correlation - Single Score. Basic Methods. Elaine Blakemore. Title goes here 1. Observational.

One-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels;

The degree to which a measure is free from error. (See page 65) Accuracy

P. 266 #9, 11. p. 289 # 4, 6 11, 14, 17

Clever Hans the horse could do simple math and spell out the answers to simple questions. He wasn t always correct, but he was most of the time.

observational studies Descriptive studies

Experimental and survey design

Analysis of data in within subjects designs. Analysis of data in between-subjects designs

How Causal Heterogeneity Can Influence Statistical Significance in Clinical Trials

3. For a $5 lunch with a 55 cent ($0.55) tip, what is the value of the residual?

Midterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do.

Never P alone: The value of estimates and confidence intervals

APPENDIX N. Summary Statistics: The "Big 5" Statistical Tools for School Counselors

Political Science 15, Winter 2014 Final Review

Review. Chapter 5. Common Language. Ch 3: samples. Ch 4: real world sample surveys. Experiments, Good and Bad

Name: Open binders and StatCrunch are allowed. For each problem marked with a, follow these directions:

Basic Biostatistics. Chapter 1. Content

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

Risk Aversion in Games of Chance

Lecture 9 Internal Validity

Section 4.3 Using Studies Wisely. Honors Statistics. Aug 23-8:26 PM. Daily Agenda. 1. Check homework C4# Group Quiz on

DO NOT OPEN THIS BOOKLET UNTIL YOU ARE TOLD TO DO SO

A guide to conversations with young people about DRUGS & ALCOHOL

CHAPTER ONE CORRELATION

Sampling Reminders about content and communications:

Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points.

Patrick Breheny. January 28

A Spreadsheet for Deriving a Confidence Interval, Mechanistic Inference and Clinical Inference from a P Value

Introduction to Statistics Design of Experiments

International Statistical Literacy Competition of the ISLP Training package 3

the problem with 'mental illnesses' by Tio

Missy Wittenzellner Big Brother Big Sister Project

Test Bank Questions for Chapter 1

What is the Scientific Method?

15.301/310, Managerial Psychology Prof. Dan Ariely Lecture 2: The Validity of Intuitions

Science in Natural Resource Management ESRM 304

Transcription:

Stat 13, Intro. to Statistical Methods for the Life and Health Sciences. 1. Hand in HW. 2. Causation and prediction. 3. Multiple testing and publication bias. 3. Relationship between CIs and tests. 4. Review list. NO LECTURE THU NOV 3 or TUE NOV 8. Also no office hour Nov 8. The midterm Thu Nov 10 will be on ch1-7. Bring a PENCIL and CALCULATOR and any books or notes you want. No computers. All numerical answers will be rounded to 3 significant digits. http://www.stat.ucla.edu/~frederic/13/f16. 1

1. Hand in HW1. 2. Causation and prediction. Note that for prediction, you sometimes do not care about confounding factors. * Forecasting wildfire activity using temperature. Warmer weather may directly cause wildfires via increased ease of ignition, or due to confounding with people chooseing to go camping in warmer weather. It does not really matter for the purpose of merely predicting how many wildfires will occur in the coming month. * The same goes for predicting lifespan, or liver disease rates, etc., using smoking as a predictor variable.

3. Multiple testing and publication bias. A p-value is the probability, assuming the null hypothesis of no relationship is true, that you will see a difference as extreme as, or more extreme than, you observed. So, 5% of the time you are looking at unrelated things, you will find a statistically significant relationship. This underscores the need for followup confirmation studies. If testing many explanatory variables simultaneously, it can become very likely to find something significant even if nothing is actually related to the response variable.

3. Multiple testing and publication bias. * For example, if the significance level is 5%, then for 100 tests where all null hypotheses are true, the expected number of incorrect rejections (Type I errors) is 5. If the tests are independent, the probability of at least one Type I error would be 99.4%. * To address this problem, scientists sometimes change the significance level so that, under the null hypothesis that none of the explanatory variables is related to the response variable, the probability of rejecting any of them is 5%. * One way is to use Bonferroni's correction: with m explanatory variables, use significance level 5%/m. P(at least 1 Type I error) will be m (5%/m) = 5%.

P(Type I error on explanatory 1) = 5%/m. P(Type I error on explanatory 2) = 5%/m. P(Type 1 error on at least one explanatory) P(error on 1) + P(error on 2) +... + P(error on m) = m x 5%/m.

Multiple testing and publication bias. Imagine a scenario where a drug is tested many times to see if it reduces the incidence of some response variable. If the drug is testes 100 times by 100 different researchers, the results will be stat. sig. about 5 times. If only the stat. sig. results are published, then the published record will be very misleading.

Multiple testing and publication bias. A drug called Reboxetine made by Pfizer was approved as a treatment for depression in Europe and the UK in 2001, based on positive trials. A meta-analysis in 2010 found that it was not only ineffective but also potentially harmful. The report found that 74% of the data on patients who took part in the trials of Reboxetine were not published because the findings were negative. Published data about reboxetine overestimated its benefits and underestimated its harm. A subsequent 2011 analysis indicated Reboxetine might be effective for severe depression though.

4. CIs and tests. Suppose we are comparing death rates in a treatment group and a control group. We observe a difference of 10.2%, do a test, and find a p-value of 8%. Does this mean the 95%-CI for the difference in death rates between the two groups would contain zero?

4. CIs and tests. Suppose we are comparing blood pressures in a treatment group and a control group. We observe a difference of 10.2 mm, do a 2-sided test, and find a p-value of 3%. Would the 95%-CI for the difference in blood pressures between the two groups contain zero?

4. CIs and tests. Suppose we are comparing blood pressures in a treatment group and a control group. We observe a difference of 10.2 mm, do a 2-sided test, and find a p-value of 3%. Would the 95%-CI for the difference in blood pressures between the two groups contain zero or not? No. It would not contain zero. For what confidence level would the CI just barely contain 0? 97%.

4. CIs and tests. The p-value is 3%. A 97%-CI would just contain zero. Ho ( 95% ) ( 95%-CI ) ----------------------------------------------------------------------------- 0 10.2mm Ho ( 97% ) ( 97%-CI ) ----------------------------------------------------------------------------- 0 10.2mm

4. Review list. 1. Meaning of SD. 19. Random sampling and random 2. Parameters and statistics. assignment. 3. Z statistic for proportions. 20. Two proportion CIs and testing. 4. Simulation and meaning of pvalues. 21. IQR and 5 number summaries. 5. SE for proportions. 22. CIs for 2 means and testing. 6. What influences pvalues. 23. Paired data. 7. CLT and validity conditions for tests. 24. Placebo effect, adherer bias, 8. 1-sided and 2-sided tests. and nonresponse bias. 9. Reject the null vs. accept the alternative. 25. Prediction and causation. 10. Sampling and bias. 26. Multiple testing and publication bias 11. Significance level. 12. Type I, type II errors, and power. 13. CIs for a proportion. 14. CIs for a mean. 15. Margin of error. 16. Practical significance. 17. Confounding. 18. Observational studies and experiments.

Some good hw problems from the book are 1.2.18, 1.2.19, 1.2.20, 1.3.17, 1.5.18, 2.1.38, 2.2.6, 2.2.24, 2.3.3, 2.3.25, 3.2.11, 3.2.12, 3.3.8, 3.3.19, 3.3.22, 3.5.23, 4.1.14, 4.1.18, 5.2.2, 5.2.10, 5.2.24, 5.3.11, 5.3.21, 5.3.24, 6.2.23, 6.3.1, 6.3.12, 6.3.22, 6.3.23, 7.2.20, 7.2.24, 7.3.7, 7.3.24.

NCIS was a top-rated tv show in 2014. It is currently 3 rd in 2016. A study finds that in a certain city, people who watch NCIS are much more likely to die than people who do not watch NCIS. Can we conclude that NCIS is a dangerous tv show to watch?

NCIS was a top-rated tv show in 2014. It is currently 3 rd in 2016. A study finds that in a certain city, people who watch NCIS are much more likely to die than people who do not watch NCIS. Can we conclude that NCIS is a dangerous tv show to watch? No. Age is a confounding factor. The median age of a viewer is 61 years old.

In the portacaval shunt example, why did the studies with historical controls find that the portacaval shunt seemed to be associated with lower death rates? a. Those getting the shunt smoked more. b. Those getting the shunt were healthier. c. Those getting the shunt were genetically predisposed to die younger. d. The explanatory variable is a confounding factor t-test with 95% central limit theorem.

In the portacaval shunt example, why did the studies with historical controls find that the portacaval shunt seemed to be associated with lower death rates? b. Those getting the shunt were healthier.

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. a. Find a 95%-CI for how much less an average UCLA student's blood glucose level is than an average 2 nd grader.

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. a. Find a 95%-CI for how much less an average UCLA student's blood glucose level is than an average 2 nd grader. 2.0 +/- 1.96 (1.5 2 /100 + 2.2 2 /80) = 2.0 +/- 0.564.

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. b. Is the difference observed between the mean blood glucose at UCLA and in 2 nd grade statistically significant?

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. b. Is the difference observed between the mean blood glucose at UCLA and in 2 nd grade statistically significant? Yes. The 95%-CI does not come close to containing 0.

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. c. Is this an observational study or an experiment?

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. c. Is this an observational study or an experiment? Observational study.

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. d. Does going to UCLA cause your blood glucose level to drop?

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. d. Does going to UCLA cause your blood glucose level to drop? No. Age is a confounding factor. Young kids eat more candy.

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. e. The mean blood glucose level of all 43,301 UCLA students is a parameter random variable t-test

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. e. The mean blood glucose level of all 43,301 UCLA students is a parameter

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. f. If we took another sample of 100 UCLA students and 80 2 nd graders, and used the difference in sample means to estimate the difference in population means, how much would it typically be off by?

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. f. If we took another sample of 100 UCLA students and 80 2 nd graders, and used the difference in sample means to estimate the difference in population means, how much would it typically be off by? SE = (1.5 2 /100 + 2.2 2 /80) =.288 mmol/l

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. g. How much does one UCLA student's blood glucose level typically differ from the mean of UCLA students?

Suppose you sample 100 UCLA students and 80 2 nd graders and record their blood glucose levels. The mean at UCLA is 5.5 mmol/l, with an SD of 1.5, and the mean in 2 nd grade is 7.5 mmol/l, with an SD of 2.2. g. How much does one UCLA student's blood glucose level typically differ from the mean of UCLA students? 1.5 mmol/l.

Do bike helmets make you less likely to get into a collision? Study A looked at data from 40 States, the year before and the year after implementing mandatory helmet laws. They had millions of observations. In every State, a significantly higher percentage of cyclists wore helmets the year after the law was passed than the year before. Combining data from all 40 States, study A found no significant difference in collision rates before or after the law was passed. Study B surveyed 1000 people who bought a bicycle in the previous year, and found that a significantly lower percentage of those who wore helmets had been in collisions.

Study A looked at data from 40 States, and found no significant difference in collision rates before or after the law was passed. Study B surveyed 1000 people who bought a bicycle in the previous year, and found that a significantly lower percentage of those who wore helmets had been in collisions. Which study is more convincing, and why? a. Study A, because the sample size in study B is too small to be representative of the population. b. Study B, because it is unclear whether study A is an experiment or an observational study. c. Study A, because a confounding factor in study B is how conscientious the bicyclists are. d. Study A, because it has higher power and is statistically significant. e. Study A, because a confounding factor in study B is the weight of the bicycle.

Study A looked at data from 40 States, and found no significant difference in collision rates before or after the law was passed. Study B surveyed 1000 people who bought a bicycle in the previous year, and found that a significantly lower percentage of those who wore helmets had been in collisions. Which study is more convincing, and why? a. Study A, because the sample size in study B is too small to be representative of the population. b. Study B, because it is unclear whether study A is an experiment or an observational study. c. Study A, because a confounding factor in study B is how conscientious the bicyclists are. d. Study A, because it has higher power and is statistically significant. e. Study A, because a confounding factor in study B is the weight of the bicycle.

A histogram of the simulated mean difference between the bicycling to work with bike 1 minus bike 2, under Ho, is shown. What is the SE for the difference?

A histogram of the simulated mean difference between the bicycling to work with bike 1 minus bike 2, under Ho, is shown. What is the SE for the difference? 0.080.

A histogram of the simulated mean difference between the bicycling to work with bike 1 minus bike 2, under Ho, is shown. What would the margin of error be for a 95% CI? 1.96 (0.080) = 0.157.