AP Statistics. Semester One Review Part 1 Chapters 1-5

Similar documents
Unit 1 Exploring and Understanding Data

Understandable Statistics

Chapter 1: Exploring Data

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

CHAPTER 3 Describing Relationships

M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60

M 140 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

STATISTICS & PROBABILITY

Still important ideas

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

Chapter 1. Picturing Distributions with Graphs

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

Business Statistics Probability

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

12.1 Inference for Linear Regression. Introduction

Population. Sample. AP Statistics Notes for Chapter 1 Section 1.0 Making Sense of Data. Statistics: Data Analysis:

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Examining Relationships Least-squares regression. Sections 2.3

10/4/2007 MATH 171 Name: Dr. Lunsford Test Points Possible

CP Statistics Sem 1 Final Exam Review

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions

AP STATISTICS 2010 SCORING GUIDELINES (Form B)

Still important ideas

What you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Identify two variables. Classify them as explanatory or response and quantitative or explanatory.

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions

Outline. Practice. Confounding Variables. Discuss. Observational Studies vs Experiments. Observational Studies vs Experiments

Statistics and Probability

Readings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14

STT315 Chapter 2: Methods for Describing Sets of Data - Part 2

AP Stats Review for Midterm

bivariate analysis: The statistical analysis of the relationship between two variables.

Chapter 3: Examining Relationships

Normal Distribution. Many variables are nearly normal, but none are exactly normal Not perfect, but still useful for a variety of problems.

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Table of Contents. Plots. Essential Statistics for Nursing Research 1/12/2017

STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS

UF#Stats#Club#STA#2023#Exam#1#Review#Packet# #Fall#2013#

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Biostatistics. Donna Kritz-Silverstein, Ph.D. Professor Department of Family & Preventive Medicine University of California, San Diego

14.1: Inference about the Model

STT 200 Test 1 Green Give your answer in the scantron provided. Each question is worth 2 points.

Chapter 14. Inference for Regression Inference about the Model 14.1 Testing the Relationship Signi!cance Test Practice

AP Statistics Practice Test Ch. 3 and Previous

Chapter 3: Describing Relationships

Standard Scores. Richard S. Balkin, Ph.D., LPC-S, NCC

Chapter 4. More On Bivariate Data. More on Bivariate Data: 4.1: Transforming Relationships 4.2: Cautions about Correlation

Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points.

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

MULTIPLE REGRESSION OF CPS DATA

Announcement. Homework #2 due next Friday at 5pm. Midterm is in 2 weeks. It will cover everything through the end of next week (week 5).

Methodological skills

1.4 - Linear Regression and MS Excel

Basic Statistics 01. Describing Data. Special Program: Pre-training 1

INTERPRET SCATTERPLOTS

LAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival*

Survey research (Lecture 1) Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.

Survey research (Lecture 1)

3. For a $5 lunch with a 55 cent ($0.55) tip, what is the value of the residual?

On the purpose of testing:

Readings: Textbook readings: OpenStax - Chapters 1 4 Online readings: Appendix D, E & F Online readings: Plous - Chapters 1, 5, 6, 13

In This Section An Introductory Example Obesity in America

Chapter 3 CORRELATION AND REGRESSION

3.2A Least-Squares Regression

Biostatistics for Med Students. Lecture 1

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables

SCATTER PLOTS AND TREND LINES

9 research designs likely for PSYC 2100

Quantitative Methods in Computing Education Research (A brief overview tips and techniques)

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

V. Gathering and Exploring Data

Example The median earnings of the 28 male students is the average of the 14th and 15th, or 3+3

Multiple Bivariate Gaussian Plotting and Checking

Chapter 1: Explaining Behavior

Statistics: Making Sense of the Numbers

Choosing a Significance Test. Student Resource Sheet

Lesson 1: Distributions and Their Shapes

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Reminders/Comments. Thanks for the quick feedback I ll try to put HW up on Saturday and I ll you

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

10. LINEAR REGRESSION AND CORRELATION

Medical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months?

Previously, when making inferences about the population mean,, we were assuming the following simple conditions:

Pitfalls in Linear Regression Analysis

AP Stats Chap 27 Inferences for Regression

Introduction to Statistical Data Analysis I

Section 1.2 Displaying Quantitative Data with Graphs. Dotplots

Conditional Distributions and the Bivariate Normal Distribution. James H. Steiger

Averages and Variation

Undertaking statistical analysis of

AP Psych - Stat 1 Name Period Date. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

STATISTICS AND RESEARCH DESIGN

Statistics: A Brief Overview Part I. Katherine Shaver, M.S. Biostatistician Carilion Clinic

Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution 4.0

Chapter 4: More about Relationships between Two-Variables Review Sheet

Section 3.2 Least-Squares Regression

AP Psych - Stat 2 Name Period Date. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Transcription:

AP Statistics Semester One Review Part 1 Chapters 1-5

AP Statistics Topics Describing Data Producing Data Probability Statistical Inference

Describing Data Ch 1: Describing Data: Graphically and Numerically Ch 2: The Normal Distributions Ch 3: Describing BiVariate Relationships Ch 4: More BiVariate Relationships

Chapter 1: Describing Data Our Introductory Chapter taught us how to describe a set of data graphically and numerically. Our focus in this chapter was describing the Shape, Outliers, Center, and Spread of a dataset.

Describing Data When starting any data analysis, you should first PLOT your data and describe what you see... Dotplot Stemplot Box-n-Whisker Plot Histogram

Describe the SOCS After plotting the data, note the SOCS: Shape: Skewed, Mound, Uniform, Bimodal Outliers: Any extreme observations Center: Typical representative value Spread: Amount of variability

Numeric Descriptions While a plot provides a nice visual description of a dataset, we often want a more detailed numeric summary of the center and spread.

Measures of Center When describing the center of a set of data, we can use the mean or the median. Mean: Average value x = Median: Center value Q2! x n

Measures of Variability When describing the spread of a set of data, we can use: Range: Max-Min InterQuartile Range: IQR=Q3-Q1 Standard Deviation:! = (x " x) 2 # n " 1

Numeric Descriptions When describing the center and spread of a set of data, be sure to provide a numeric description of each: Mean and Standard Deviation 5-Number Summary: Min, Q1, Med, Q3, Max {Box-n-Whisker Plot}

Determining Outliers When an observation appears to be an outlier, we will want to provide numeric evidence that it is or isn t extreme We will consider observations outliers if: More than 3 standard deviations from the mean. Or More than 1.5 IQR s outside the box

Chapter 1 Summary

Chapter 2: Normal Distributions Many distributions in statistics can be described as approximately Normal. In this chapter, we learned how to identify and describe normal distributions and how to do Standard Normal Calculations.

Density Curves A Density Curve is a smooth, idealized mathematical model of a distribution. The area under every density curve is 1.

The Normal Distribution Many distributions of data and many statistical applications can be described by an approximately normal distribution. Symmetric, Bell-shaped Curve Centered at Mean µ Described as N(µ,! )

Empirical Rule One particularly useful fact about approximately Normal distributions is that 68% of observations fall within one standard deviation of µ 95% fall within 2 standard deviations of µ 99.7% fall within 3 standard deviations of µ

Standard Normal Calculations The empirical rule is useful when an observation falls exactly 1,2,or 3 standard deviations from µ. When it doesn t, we must standardize the value {zscore} and use a table to calculate percentiles, etc. z = x! µ "

Assessing Normality To assess the normality of a set of data, we can t rely on the naked eye alone - not all mound shaped distributions are normal. Instead, we should make a Normal Quantile Plot and look for linearity. Linearity Normality

Chapter 3 Describing BiVariate Relationships In this chapter, we learned how to describe bivariate relationships. We focused on quantitative data and learned how to perform least squares regression.

Bivariate Relationships Like describing univariate data, the first thing you should do with bivariate data is make a plot. Scatterplot Note Strength, Direction, Form

Correlation r We can describe the strength of a linear relationship with the Correlation Coefficient, r -1! r! 1 The closer r is to 1 or -1, the stronger the linear relationship between x and y.

Least Squares Regression When we observe a linear relationship between x and y, we often want to describe it with a line of best fit y=a+bx. We can find this line by performing least-squares regression. We can use the resulting equation to predict y-values for given x- values.

Assessing the Fit If we hope to make useful predictions of y we must assess whether or not the LSRL is indeed the best fit. If not, we may need to find a different model. Residual Plot

Making Predictions If you are satisfied that the LSRL provides an appropriate model for predictions, you can use it to predict a y-hat for x s within the observed range of x-values. ŷ = a + bx Predictions for observed x-values can be assessed by noting the residual. Residual = observed y - predicted y

Chapter 3 Summary

Chapter 4 More BiVariate Relationships In this chapter, we learned how to find models that fit some nonlinear relationships. We also explored how to describe categorical relationships.

NonLinear Relationships If data is not best described by a LSRL, we may be able to find a Power or Exponential model that can be used for more accurate predictions. Power Model: Exponential Model: ˆ y =10 a x b y ˆ =10 a 10 bx

Transforming Data If (x,y) is non-linear, we can transform it to try to achieve a linear relationship. If transformed data appears linear, we can find a LSRL and then transform back to the original terms of the data (x, log y) LSRL > Exponential Model (log x, log y) LSRL > Power Model

The Question of Causation Just because we observe a strong relationship or strong correlation between x and y, we can not assume it is a causal relationship.

Relations in Categorical Data When categorical data is presented in a two-way table, we can explore the marginal and conditional distributions to describe the relationship between the variables.

Chapter 5 Producing Data In this chapter, we learned methods for collecting data through sampling and experimental design.

Sampling Design Our goal in statistics is often to answer a question about a population using information from a sample. Observational Study vs. Experiment There are a number of ways to select a sample. We must be sure the sample is representative of the population in question.

Sampling If you are performing an observational study, your sample can be obtained in a number of ways: Convenience - Cluster Systematic Simple Random Sample Stratified Random Sample

Experimental Design In an experiment, we impose a treatment with the hopes of establishing a causal relationship. Experiments exhibit 3 Principles Randomization Control Replication

Experimental Designs Like Observational Studies, Experiments can take a number of different forms: Completely Controlled Randomized Comparative Experiment Blocked Matched Pairs

Chapters 6-9 Tomorrow