Reflection Questions for Math 58B

Similar documents
Contingency Tables Summer 2017 Summer Institutes 187

9. Interpret a Confidence level: "To say that we are 95% confident is shorthand for..

Sections 10.7 and 10.9

Understandable Statistics

Unit 1 Exploring and Understanding Data

Assignment #6. Chapter 10: 14, 15 Chapter 11: 14, 18. Due tomorrow Nov. 6 th by 2pm in your TA s homework box

Risk Ratio and Odds Ratio

Chapter 8: Estimating with Confidence

Two-sample Categorical data: Measuring association

Types of data and how they can be analysed

Chapter 1: Exploring Data

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

Applied Statistical Analysis EDUC 6050 Week 4

Measuring association in contingency tables

Sheila Barron Statistics Outreach Center 2/8/2011

Probability Models for Sampling

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Inferential Statistics

Business Statistics Probability

Stat 13, Intro. to Statistical Methods for the Life and Health Sciences.

SUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK

PTHP 7101 Research 1 Chapter Assignments

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

Still important ideas

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

STATISTICS & PROBABILITY

Common Statistical Issues in Biomedical Research

Psychology Research Process

Still important ideas

SEM: the precision of the mean of the sample in predicting the population parameter and is a way of relating the sample mean to population mean

Chapter 25. Paired Samples and Blocks. Copyright 2010 Pearson Education, Inc.

Chapter 8: Estimating with Confidence

Making comparisons. Previous sessions looked at how to describe a single group of subjects However, we are often interested in comparing two groups

Readings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14

The t-test: Answers the question: is the difference between the two conditions in my experiment "real" or due to chance?

Fundamental Clinical Trial Design

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Comparing Proportions between Two Independent Populations. John McGready Johns Hopkins University

04/12/2014. Research Methods in Psychology. Chapter 6: Independent Groups Designs. What is your ideas? Testing

Measuring association in contingency tables

Analysis of Variance (ANOVA)

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010

Statistical Hocus Pocus? Assessing the Accuracy of a Diagnostic Screening Test When You Don t Even Know Who Has the Disease

appstats26.notebook April 17, 2015

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Chapter 1 Data Types and Data Collection. Brian Habing Department of Statistics University of South Carolina. Outline

POST GRADUATE DIPLOMA IN BIOETHICS (PGDBE) Term-End Examination June, 2016 MHS-014 : RESEARCH METHODOLOGY

MTH 225: Introductory Statistics

Chapter 13 Summary Experiments and Observational Studies

Study Guide for the Final Exam

Psychology Research Process

Chapter 13. Experiments and Observational Studies. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Basic Biostatistics. Chapter 1. Content

STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS

Midterm project due next Wednesday at 2 PM

Chapter 19. Confidence Intervals for Proportions. Copyright 2010 Pearson Education, Inc.

What is the Scientific Method?

BMI 541/699 Lecture 16

Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points.

APPENDIX N. Summary Statistics: The "Big 5" Statistical Tools for School Counselors

Chapter 14. Inference for Regression Inference about the Model 14.1 Testing the Relationship Signi!cance Test Practice

MAT Mathematics in Today's World

Exam 4 Review Exercises

*Karle Laska s Sections: There is NO class Thursday or Friday! Have a great Valentine s Day weekend!

Barriers to concussion reporting. Qualitative Study of Barriers to Concussive Symptom Reporting in High School Athletics

Something to think about. What happens, however, when we have a sample with less than 30 items?

Lec 02: Estimation & Hypothesis Testing in Animal Ecology

Observational study is a poor way to gauge the effect of an intervention. When looking for cause effect relationships you MUST have an experiment.

7 Statistical Issues that Researchers Shouldn t Worry (So Much) About

AP Stats Chap 27 Inferences for Regression

Understanding Statistics for Research Staff!

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

Outline. Chapter 3: Random Sampling, Probability, and the Binomial Distribution. Some Data: The Value of Statistical Consulting

Big Idea 1 The Practice of Science. Big Idea 2 The Characteristics of Scientific Knowledge

Lecture 1 An introduction to statistics in Ichthyology and Fisheries Science

Hypothesis Testing. Richard S. Balkin, Ph.D., LPC-S, NCC

Other Designs. What Aids Our Causal Arguments? Time Designs Cohort. Time Designs Cross-Sectional

Lessons in biostatistics

AP Statistics. Semester One Review Part 1 Chapters 1-5

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Designing Psychology Experiments: Data Analysis and Presentation


Chapter 8 Estimating with Confidence

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics.

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Inferential Statistics: An Introduction. What We Will Cover in This Section. General Model. Population. Sample

Non-parametric methods for linkage analysis

USING STATCRUNCH TO CONSTRUCT CONFIDENCE INTERVALS and CALCULATE SAMPLE SIZE

CHL 5225 H Advanced Statistical Methods for Clinical Trials. CHL 5225 H The Language of Clinical Trials

Student Performance Q&A:

Simple Linear Regression the model, estimation and testing

Substantive Significance of Multivariate Regression Coefficients

Glossary From Running Randomized Evaluations: A Practical Guide, by Rachel Glennerster and Kudzai Takavarasha

Chapter 9: Comparing two means

Statistical reports Regression, 2010

What makes us special? Ages 3-5

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 1:

Math 1680 Class Notes. Chapters: 1, 2, 3, 4, 5, 6

Transcription:

Reflection Questions for Math 58B Johanna Hardin Spring 2017 Chapter 1, Section 1 binomial probabilities 1. What is a p-value? 2. What is the difference between a one- and two-sided hypothesis? 3. What is the difference between type I and type II errors? 4. Why is it never okay to accept H 0? 5. What is power? How is power calculated? What does power depend on? Chapter 1, Section 2 norm approx to binom 1. What does it mean for a variable to have a normal distribution? 2. What does the Central Limit Theorem say? 3. How is the normal distribution different from the binomial distribution? (one answer is that the normal describes a continuous variable and the binomial describes a discrete variable. what does that mean? what is another distinction?) 4. What are the technical conditions allowing the normal distribution to approximate the binomial distribution? 5. What does a z-score measure? 6. What is the difference between a z-critical value (z ) and a z test statistics (z 0 = ˆp π 0 )? π0 (1 π 0 ) n 7. What does it mean to say that to find normal probabilities compute the area under the normal curve? 8. When computing a confidence interval (i.e., when we don t have a preconceived idea for π), how is the standard deviation of ˆp estimated? 1

9. When using the normal distribution to create a confidence interval for π, how is the critical value for, say, a 94.7% interval calculated? 10. What is one reason to choose to use the normal distribution? 11. What is one reason to choose to use the binomial distribution? Chapter 1, Section 3 sampling 1. Why is it better to take random samples? 2. What is a simple random sample? 3. Why don t researchers always take random samples? 4. What does a large sample provide? 5. What is the difference between practical significance and statistical significance? (See Inv 1.17) Chapter 2, Section 1 quantitative variables 1. What changed about the studies (data structure) from Chapter 1? 2. What is the statistic we use now? 3. What is the difference between the distribution of the data and the distribution of the statistic? There is a theoretical difference as well as a computational difference. Chapter 2, Section 2 inference on 1 mean 1. What is the sampling distribution of the statistic? (Note, the answer here is only for big samples, that is the Central Limit Theorem works only where there is a limit... i.e., the sample size is big.) 2. What is the difference between the distribution of the data and the distribution of the statistic? There is a theoretical difference as well as a computational difference. 3. What is the difference between a normal distribution and a t distribution? 4. When do we use a z and when do we use a t? 5. When would you use a confidence interval and when would you use a hypothesis test? 6. What different information does a boxplot give versus a histogram? 2

Chapter 3, Section 1 2 pop proportions 1. How does the inference change now that there is binary (response) data taken from two populations? 2. How does the inference stay the same now that there is binary (response) data taken from two populations? 3. What does the Central Limit Theorem say? 4. When is it appropriate to use a hypothesis test to evaluate the data? And when is it appropriate to use a confidence interval to evaluate the data? 5. Which formula is used for SD(ˆp 1 ˆp 2 ) under which conditions / scenarios? 6. What technical conditions must hold for the Central Limit Theorem to apply? Chapter 3, Section 2 types of studies 1. What is the difference between an observational study and an experiment? 2. Why aren t all studies done as experiments? 3. What is a confounding variable? 4. Have you looked at page 139? Do you understand that page? [Random sampling vs. Random allocation] 5. how is the statistical meaning of the word cause different from the usage in the sentence The ball that hit me in the head caused me to get a headache. 6. What are the meanings of the words: randomized, double-blind (single-blind), control, placebo, significant, and comparative. Why are these ideas important to interpreting study results? Chapter 3, Section 3 prob vs prop 1. How is the random process different from the examples at the beginning of chapter 2? And, therefore, how is the inference different? 2. What (exact) probability model is best used for data that has been randomly allocated? [note: there is no exact model at the beginning of chapter 2, but the simulation using binomial probabilities allowed inference without implementing the CLT. ] 3. Fisher s exact test has an additional advantage that hasn t been discussed in class: it works for small sample sizes. Which method doesn t work for small sample sizes? 3

4. How is the normal approximation for two sample proportions different when the data are a random sample as compared to when they come from a randomized experiment? How is it the same? Chapter 3, Section 4 odds ratios and RR 1. What is the differences between cross-classification, cohort, and case-control studies? 2. When is it not appropriate to calculate differences or ratios of proportions? Why isn t it appropriate? 3. How are odds calculated? How is OR calculated? 4. What do we do when we we can t calculate statistics based on proportions? Why does this fix work? 5. Why do we look at the natural log of the RR and the natural log of the OR when finding confidence intervals for the respective parameters? 6. How do you calculate the SE for the ln( ˆ RR) and ln( ˆ OR)? 7. Once you have the CI for ln(rr) or for ln(or), what do you do? Why does that process work? Chapter 4, Section 1&2&3 2 means 1. What changed about the studies (data structure) from chapter 2? 2. What is the statistic of interest now? What is the parameter of interest? 3. What is the sampling distribution for the statistic of interest? 4. Why does the t-distribution become relevant? 5. How does the t-distribution become relevant? 6. What are degrees of freedom in general? What are the actual degrees of freedom for the test in section 3.2? 7. How is the null mechanism different across the three analysis methods in section 3.2: randomization test, two-sample t-test, random sampling test (n.b. this is also called the parametric bootstrap)? 8. How do you create a CI? How do you interpret the CI? 9. What if your data are NOT normal? What strategies can you try out? 4

Chapter 4, Section 4 paired samples 1. What changed about the studies (data structure) in section 3.3? 2. What is the statistic of interest now? What is the parameter of interest? 3. What is the sampling distribution for the statistic of interest? 4. What does pairing do for us? 5. What happens if we analyze a paired study as if we had an independent two sample study? (What happens to the p-value? What happens to the CI?) 6. What is the easiest way to think of / analyze paired data? Chapter 5, Section 1 2 categorical variables 1. How would you describe the data we see in r c tables? 2. Describe the applet mechanism that creates a sampling distribution under the assumption that the null hypothesis is true. 3. What is the test statistic (for both the randomization applet and the chi-square test!!)? Why do we need a complicated test statistic here and we didn t need one with 2 2 tables? 4. How do you compute the expected count? Why does that computation make sense? 5. What is one benefit that the two sample z-test of proportions has? That is, what is one thing we can do if we have a 2 2 table instead of an r c table? 6. Describe the directionality of the test statistic. That is, what values of X 2 make you reject H 0? 7. What are the technical assumptions for the chi-square test? Why do you need the technical assumptions? 8. What are the null and alternative hypotheses? Chapter 5, Sections 3 & 4 correlation & regression 1. How do we find the values of b 0 and b 1 for estimating the least squares line? 2. Why is it dangerous to extrapolate? 3. How do we interpret R 2? Why is that? (Look at SSE values on page 325, Inv 4.8 (aa) & (bb).) 5

4. What does it mean to say that b 1 has a sampling distribution? Why is it that we would never talk about the sampling distribution of β 1? 5. Why do we need the LINE technical conditions for the inference parts of the analysis but not for the estimation parts of the analysis? 6. What are the LINE technical conditions? 7. What are the three factors that influence the SE(b 1 )? (Note: when something influences SE(b 1 ), that means the inference is also effected. If you have a huge SE(b 1 ), it will be hard to tell if the slope is significant because the t value will be small.) 8. What does it mean to do a randomization test for the slope? That is, explain the process of doing a randomization test here. (See shuffle options in the Analyzing Two Quantitative Variables applet.) 9. Why would someone transform either of the variables? 10. What is the difference between a confidence interval and a prediction interval? Which is bigger? Why does that make sense? How do the centers of the intervals differ? (They don t. Why not?) 6