IAS 3.9 Bivariate Data

Size: px
Start display at page:

Download "IAS 3.9 Bivariate Data"

Transcription

1 Year 13 Mathematics IAS 3.9 Bivariate Data Robert Lakeland & Carl Nugent Contents Achievement Standard Bivariate Data Scatter Plots Relationships Correlation Coefficient Regression Analsis Outliers Residuals Residual Plots Causation Non-Linear Regression Bivariate Investigation Practice Internal Assessment Answers Innovative Publisher of Mathematics Tets

2 2 IAS 3.9 Bivariate Data NCEA 3 Internal Achievement Standard 3.9 Bivariate Data This achievement standard involves students investigating bivariate measurement data. Achievement Achievement with Merit Achievement with Ecellence Investigate bivariate measurement data. Investigate bivariate measurement data, with justification Investigate bivariate measurement data, with statistical insight. This achievement standard is derived from Level 8 of The New Zealand Curriculum and is related to the achievement objectives Carr out investigations of phenomena, using the statistical enquir ccle using eisting data sets finding, using, and assessing appropriate models (including linear regression for bivariate data), seeking eplanations, and making predictions using informed contetual knowledge and statistical inference communicating findings and evaluating all stages of the ccle in the Statistics strand of the Mathematics and Statistics Learning Area. Investigate bivariate measurement data involves showing evidence of using each component of the statistical enquir ccle. Investigate bivariate measurement data, with justification involves linking components of the statistical enquir ccle to the contet, and referring to evidence such as statistics, data values, trends, or features of visual displas in support of statements made. Investigate bivariate measurement data, with statistical insight involves integrating statistical and contetual knowledge throughout the investigation process, and ma include reflecting about the process; considering other relevant variables; evaluating the adequac of an models, or showing a deeper understanding of the models. Using the statistical enquir ccle to investigate bivariate measurement data involves: posing an appropriate relationship question using a given multivariate data set selecting and using appropriate displa(s) identifing features in the data finding an appropriate model describing the nature and strength of the relationship and relating this to the contet using the model to make a prediction communicating findings in a conclusion. Measurement data can either be discrete or continuous in nature. In regression analsis the -variable, or response variable, must be a continuous variable. The variable or eplanator variable can be either a discrete or continuous variable. The relationship ma be non-linear. Use and interpretation of R 2 is not epected at this level. IAS 3.9 Year 13 Mathematics and Statistics Published b N New Zealand Robert Lakeland & Carl Nugent

3 IAS 3.9 Bivariate Data 3 Bivariate Data Introduction In statistics we are often interested in identifing relationships involving more than one variable. In this booklet we stud Bivariate Data which deals with the stud of two quantitative variables (pairs of variables) and identifing relationships between them. We aim to identif the relationship between the variables, graphicall with the use of a scatter plot and quantitativel b looking at correlation (the degree to which two or more quantities are linearl associated). In addition we use regression to enable us to make quantitative predictions (interpolation) of one variable from the other. Much of the emphasis in this Achievement Standard is on the visual interpretation of scatter plots as well as linking statistical knowledge to the contet of the question and using appropriate reasoning and reflection. To this end we are suggesting that students have access to inzight a simple data analsis sstem which encourages eploring what data is saing without the distractions of driving comple software (inzight website). INZight can be downloaded from auckland.ac.nz/~wild/inzight/dlw.html and can be installed on either a Mac or Windows computer. All datasets used in this booklet can be downloaded from our website at co.nz under the Downloads link and imported directl into inzight or Ecel. IAS 3.9 Year 13 Mathematics and Statistics Published b N New Zealand Robert Lakeland & Carl Nugent

4 IAS 3.9 Bivariate Data 7 Relationships Inspecting a Scatter Plot When we are given a scatter plot and asked to comment on the relationship between the variables our first approach is a visual one. The first step is to make sure we understand the precise meaning of the variables being used as well as the units applied to the variables. Sometimes it ma be necessar to research the meaning of the variables so that ou have a better understanding prior to investigating a possible relationship. Net look at the degree of scatter of the points. Are the points clumped together or spread out? Are the points in a number of distinct clusters or is there just a single cluster of points with one or two points at the etremes of the scatter plot? Are there an unusual observations that go against the general trend of the scatter plot? Are these points still realistic, i.e. can the be eplained? What is the variation of scatter in the scatter plot? Are the points close to each other or is there significant gaps between them? Do the points follow a discernible pattern or trend, e.g. following a straight line? Correlation When we draw a scatter plot the shape of the plot is usuall defined as linear or non-linear (curved). A correlation eists between two variables when one of them is related or can be influenced b the other in some wa. The strength of a relationship can be identified visuall, on a scatter plot, b how tightl or spread out the plotted points are and the direction of the relationship can be identified b the general slope of the pattern of the data. Stud the scatter plots below and note the visual description of shape, strength and direction of the relationship between and. linear, strong positive relationship linear, moderate positive relationship linear, weak positive relationship non-linear, strong positive relationship linear, strong negative relationship linear, moderate negative relationship linear, weak negative relationship non-linear, moderate negative relationship An outlier is one or more points that do not follow the trend. no relationship strong relationship with outliers When determining whether a relationship eists between two variables we also want to be able to measure the strength and direction of a relationship quantitativel, and to do this for linear relationships we can calculate the (linear) correlation coefficient (r). This is eplained in detail on page 12. IAS 3.9 Year 13 Mathematics and Statistics Published b N New Zealand Robert Lakeland & Carl Nugent

5 IAS 3.9 Bivariate Data 29 Eample A heater is turned on in a cold room and the temperature ( C) of the room is recorded ever five minutes. The results are given below. Find the least squares regression line using inzight. Time () C () Import the dataset Eample into inzight and then draw a scatter plot of time versus temperature. To find the regression line click on the Get Summar button. Both the correlation coefficient and the regression line are displaed in the window. Add a linear trend line (regression line) to the scatter plot b clicking on the button Add to Plot and then on the radio button Add trend curves. Click on the checkbo linear followed b Show Changes and Done. The regression line is written in terms of the variable names rather than and, i.e. Degrees.C () = * Time.mins () IAS 3.9 Year 13 Mathematics and Statistics Published b N New Zealand Robert Lakeland & Carl Nugent

6 IAS 3.9 Bivariate Data 37 Achievement Answer the following questions. 48. Download the dataset called Question 48.csv from under the Downloads link on our website ( Import the file into inzight. a) Using inzight produce a scatter plot of d) Would our chosen point in part b) be versus. better identified as an influential Visuall describe our scatter plot and point rather than an outlier or both? identif an features that stand out, e.g. Justif our answer. strength (degree of scatter), groupings (clusters), unusual observations, and the variation of scatter etc. Comment on the linear correlation coefficient for our scatter plot and find the least squares regression line. Predict the value of when = 5.0. e) Import the file Question 48e.csv which is the same dataset as above with some additional points included. Draw a scatter plot of the data. Is there an point that ou could consider a possible outlier? If so identif the point. f) What effect does our chosen point in e) have on the linear correlation coefficient and regression line? b) Import the file Question 48b.csv which is the same dataset as above with some additional points included. Draw a scatter plot of the data. Is there an point that ou could consider a possible outlier? If so identif the point. g) Would our chosen point in part e) be better identified as an influential point or an outlier or both or neither? Justif our answer. c) What effect does our chosen point in b) have on the linear correlation coefficient and regression line? IAS 3.9 Year 13 Mathematics and Statistics Published b N New Zealand Robert Lakeland & Carl Nugent

7 42 IAS 3.9 Bivariate Data Eample Open the dataset Question 16 in Ecel, which comprises the mean lifespan (in ears), metabolic rate (cm 3 /g/h), gestation period (das) and brain weight (g) of 27 mammals. B either using the Analsis Toolpak which can be installed using Add-Ins for Ecel 2010 or StatPlus mac LE which is a separate program that can be downloaded from en/products/statplusmacle/ for Mac, produce a plot of the residuals for gestation period versus lifespan of the 27 mammals and comment on what the plot tells us about using a linear model for the association between these two variables. Instructions for StatPlus mac LE are given below. See the following page for information on the instructions for using the Analsis ToolPak. cont... the independent variable of our data pair, that is, Gestation Period. When ou return to StatPlus [Workbook1]Sheet1!$B$1:$B$28 should appear. Now click the checkboes Plot residuals vs fitted and Plot line fit followed b OK. Generated are the residuals, a plot of the residuals and a scatter plot in an Ecel spreadsheet. The spreadsheet includes a large amount of information including the least squares regression line, (Lifespan = * Gestation period), the linear correlation coefficient (r = ) as well as a table of residuals (see below and the net page). If ou are using StatPlus mac LE on a Mac begin b booting the application StatPlus. When StatPlus boots it will also boot Ecel. Open a new worksheet in Ecel. Regression equation Using Ecel open the dataset Question 16.csv. Cop the two columns C and D, i.e. Lifespan and Gestation Period from the Question 16.csv spreadsheet into columns A and B of our blank Correlation coefficient worksheet. Now close the Question 16.csv spreadsheet. In the StatPlus application choose the menu option Statistics, then Regression, then Multiple Linear Regression. The following window appears. Click on the button to the right of the Dependent variable field. Remember the dependent variable is the response or variable. The Ecel window will be activated so ou can select the cells representing the dependent variable of our data pair, i.e. Lifespan. When ou return to StatPlus [Workbook1] Sheet1!$S$1:$A$28 should appear. Click on the button to the right of the Independent variable field. The independent variable is the eplanator or variable. The Ecel window will be activated so ou can select the cells representing The predicted values are those generated b substituting the appropriate gestation period into the least squares regression line. For observation 1 the gestation period is 14. Substituting this into Lifespan = * Gestation period gives us a predicted value, i.e. Lifespan () of The observed point for a gestation period of 14 das is a lifespan of 50 ears so the difference between 50 and , i.e is our residual. IAS 3.9 Year 13 Mathematics and Statistics Published b N New Zealand Robert Lakeland & Carl Nugent

8 IAS 3.9 Bivariate Data 49 Non-Linear Regression cont... An alternative to fitting a logarithmic model could be to investigate using a negative parabola, especiall if ou are using inzight where ou have limited options (see below). inzight at present onl allows quadratic and cubic models for non-linear functions. If ou require eponential, logarithmic or power models the authors suggest ou use Ecel. You can start from the scatter plots produced b the Analsis Toolpak or the StatPlus mac LE. The equation for the quadratic trend is Strength = Da 19.35Da kpa which gives a strength of kPa for 2 das curing and a strength of kpa for 10 das curing. Comparing the three trends (linear, logarithmic and quadratic) the linear in the long term (> 25 das) will become less reliable as it will continue to increase over time. The same will be true of the quadratic because it will begin to decrease once the maimum point of the quadratic is reached. In the long term the logarithmic model is likel to be the best fit as it approaches an asmptote. Another option is to adopt a piecewise function which is a combination of the different functions over different subsets of. IAS 3.9 Year 13 Mathematics and Statistics Published b N New Zealand Robert Lakeland & Carl Nugent

9 58 IAS 3.9 Bivariate Data Page 10 Q16. cont... g) There appears to be a moderate positive linear association between the two variables. Mammals with a longer gestation period have a longer lifespan. Two unusual observations are the Echnida which has a gestation period of onl 14 das but a long lifespan of 50 ears and man with a gestation period of 267 das and a ver long lifespan of 86 ears. Most points fall within the gestation range of 0 to <200 das and lifespan of 0 to 20 ears. h) Metabolic rate (eplanator) and gestation period (response). i) Investigating how metabolic rate predicts gestation period not the other wa round n) Gestation period is a r = reasonable predictor of 19. r = lifespan although man and the E. American Page 17 (Answers ma var) mole go against the trend. A linear model could be Constant difference. suitable for these two 21. Close to 1. The older a car the variables. less its value. Page 11 Q16. cont... j) The have a low gestation period. k) Not an obvious linear one although greater metabolic rate does result in a lower gestation period. More likel a negative non-linear relationship. Page 11 Q16. cont... l) There appears to be weak negative non-linear association between the two variables. Mammals with a greater metabolic rate have a shorter gestation period. Most points are clustered in the metabolic range 0 to < 1.0 and gestation period of 0 to < 300 das. Etreme points are that of the Asian elephant, little brown bat and E. American mole although the would still fit a non-linear model. m) Metabolic rate versus 22. Somewhat negative. Less well lifespan would be educated people smoke. better suited to a non-linear model. 23. Close to 1. People with big feet Again the E. American are generall taller. mole goes against 24. Somewhat negative. More the trend. A suitable cigarettes smoked the lower the non-linear model would level of fitness. probabl be a good predictor of lifespan. With brain weight versus lifespan a linear model is affected b the Asian elephant and E. American mole. Page 11 Q16 n) cont... Page Overall gestation looks like the better predictor from a linear perspective but metabolic rate from a non-linear perspective. Brain weight is not as good as the other two r = The more ou spend the less ou save. 26. Close to 1. Tall men often choose tall wives. 27. Somewhat positive. More land area greater price. IAS 3.9 Year 13 Mathematics and Statistics Published b N New Zealand Robert Lakeland & Carl Nugent

1.4 - Linear Regression and MS Excel

1.4 - Linear Regression and MS Excel 1.4 - Linear Regression and MS Excel Regression is an analytic technique for determining the relationship between a dependent variable and an independent variable. When the two variables have a linear

More information

Exemplar for Internal Assessment Resource Mathematics Level 3. Resource title: Sport Science. Investigate bivariate measurement data

Exemplar for Internal Assessment Resource Mathematics Level 3. Resource title: Sport Science. Investigate bivariate measurement data Exemplar for internal assessment resource Mathematics 3.9A for Achievement Standard 91581 Exemplar for Internal Assessment Resource Mathematics Level 3 Resource title: Sport Science This exemplar supports

More information

10. LINEAR REGRESSION AND CORRELATION

10. LINEAR REGRESSION AND CORRELATION 1 10. LINEAR REGRESSION AND CORRELATION The contingency table describes an association between two nominal (categorical) variables (e.g., use of supplemental oxygen and mountaineer survival ). We have

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Objectives: Students will: Construct and interpret a scatterplot for a set of bivariate data. Compute and interpret the correlation, r, between two variables. Demonstrate

More information

Lab 5a Exploring Correlation

Lab 5a Exploring Correlation Lab 5a Exploring Correlation The correlation coefficient measures how tightly the points on a scatterplot cluster around a line. In this lab we will examine scatterplots and correlation coefficients for

More information

Understandable Statistics

Understandable Statistics Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement

More information

Using SPSS for Correlation

Using SPSS for Correlation Using SPSS for Correlation This tutorial will show you how to use SPSS version 12.0 to perform bivariate correlations. You will use SPSS to calculate Pearson's r. This tutorial assumes that you have: Downloaded

More information

Section 3 Correlation and Regression - Teachers Notes

Section 3 Correlation and Regression - Teachers Notes The data are from the paper: Exploring Relationships in Body Dimensions Grete Heinz and Louis J. Peterson San José State University Roger W. Johnson and Carter J. Kerk South Dakota School of Mines and

More information

Pitfalls in Linear Regression Analysis

Pitfalls in Linear Regression Analysis Pitfalls in Linear Regression Analysis Due to the widespread availability of spreadsheet and statistical software for disposal, many of us do not really have a good understanding of how to use regression

More information

Pre-Algebra, Unit 8 Practice Test: Investigate Patterns of Association in Bivariate Data (Scatter Plots and Two-Way Tables)

Pre-Algebra, Unit 8 Practice Test: Investigate Patterns of Association in Bivariate Data (Scatter Plots and Two-Way Tables) Name: Date: Pre-Algebra, Unit 8 Practice Test: Investigate Patterns of Association in Bivariate Data (Scatter Plots and Two-Wa Tables). Define bivariate data.. Define relative frequenc.. (a) Define and

More information

Section 3.2 Least-Squares Regression

Section 3.2 Least-Squares Regression Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.

More information

AP Statistics Practice Test Ch. 3 and Previous

AP Statistics Practice Test Ch. 3 and Previous AP Statistics Practice Test Ch. 3 and Previous Name Date Use the following to answer questions 1 and 2: A researcher measures the height (in feet) and volume of usable lumber (in cubic feet) of 32 cherry

More information

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES 24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter

More information

SCATTER PLOTS AND TREND LINES

SCATTER PLOTS AND TREND LINES 1 SCATTER PLOTS AND TREND LINES LEARNING MAP INFORMATION STANDARDS 8.SP.1 Construct and interpret scatter s for measurement to investigate patterns of between two quantities. Describe patterns such as

More information

Eating and Sleeping Habits of Different Countries

Eating and Sleeping Habits of Different Countries 9.2 Analyzing Scatter Plots Now that we know how to draw scatter plots, we need to know how to interpret them. A scatter plot graph can give us lots of important information about how data sets are related

More information

Level 3 AS Credits Internal Investigate Bivariate Measurement Data Written by Jake Wills MathsNZ

Level 3 AS Credits Internal Investigate Bivariate Measurement Data Written by Jake Wills MathsNZ Level 3 AS91581 4 Credits Internal Investigate Bivariate Measurement Data Written by Jake Wills MathsNZ jwills@mathsnz.com NCEA Level 3-3.9 Bivariate Data Achievement Achievement with Merit Achievement

More information

3.2 Least- Squares Regression

3.2 Least- Squares Regression 3.2 Least- Squares Regression Linear (straight- line) relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these

More information

CHAPTER 3 Describing Relationships

CHAPTER 3 Describing Relationships CHAPTER 3 Describing Relationships 3.1 Scatterplots and Correlation The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Reading Quiz 3.1 True/False 1.

More information

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you? WDHS Curriculum Map Probability and Statistics Time Interval/ Unit 1: Introduction to Statistics 1.1-1.3 2 weeks S-IC-1: Understand statistics as a process for making inferences about population parameters

More information

One-Way Independent ANOVA

One-Way Independent ANOVA One-Way Independent ANOVA Analysis of Variance (ANOVA) is a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment.

More information

Modeling and Environmental Science: In Conclusion

Modeling and Environmental Science: In Conclusion Modeling and Environmental Science: In Conclusion Environmental Science It sounds like a modern idea, but if you view it broadly, it s a very old idea: Our ancestors survival depended on their knowledge

More information

LAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival*

LAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival* LAB ASSIGNMENT 4 1 INFERENCES FOR NUMERICAL DATA In this lab assignment, you will analyze the data from a study to compare survival times of patients of both genders with different primary cancers. First,

More information

Lesson 1: Distributions and Their Shapes

Lesson 1: Distributions and Their Shapes Lesson 1 Name Date Lesson 1: Distributions and Their Shapes 1. Sam said that a typical flight delay for the sixty BigAir flights was approximately one hour. Do you agree? Why or why not? 2. Sam said that

More information

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations)

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) After receiving my comments on the preliminary reports of your datasets, the next step for the groups is to complete

More information

Math 075 Activities and Worksheets Book 2:

Math 075 Activities and Worksheets Book 2: Math 075 Activities and Worksheets Book 2: Linear Regression Name: 1 Scatterplots Intro to Correlation Represent two numerical variables on a scatterplot and informally describe how the data points are

More information

STATISTICS INFORMED DECISIONS USING DATA

STATISTICS INFORMED DECISIONS USING DATA STATISTICS INFORMED DECISIONS USING DATA Fifth Edition Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation Learning Objectives 1. Draw and interpret scatter diagrams

More information

CHAPTER TWO REGRESSION

CHAPTER TWO REGRESSION CHAPTER TWO REGRESSION 2.0 Introduction The second chapter, Regression analysis is an extension of correlation. The aim of the discussion of exercises is to enhance students capability to assess the effect

More information

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests

More information

Correlation and Regression

Correlation and Regression Dublin Institute of Technology ARROW@DIT Books/Book Chapters School of Management 2012-10 Correlation and Regression Donal O'Brien Dublin Institute of Technology, donal.obrien@dit.ie Pamela Sharkey Scott

More information

Simple Linear Regression One Categorical Independent Variable with Several Categories

Simple Linear Regression One Categorical Independent Variable with Several Categories Simple Linear Regression One Categorical Independent Variable with Several Categories Does ethnicity influence total GCSE score? We ve learned that variables with just two categories are called binary

More information

1. To review research methods and the principles of experimental design that are typically used in an experiment.

1. To review research methods and the principles of experimental design that are typically used in an experiment. Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab Exercise Lab #7 (there was no Lab #6) Treatment for Depression: A Randomized Controlled Clinical Trial Objectives: 1. To review

More information

7. Bivariate Graphing

7. Bivariate Graphing 1 7. Bivariate Graphing Video Link: https://www.youtube.com/watch?v=shzvkwwyguk&index=7&list=pl2fqhgedk7yyl1w9tgio8w pyftdumgc_j Section 7.1: Converting a Quantitative Explanatory Variable to Categorical

More information

AP Statistics. Semester One Review Part 1 Chapters 1-5

AP Statistics. Semester One Review Part 1 Chapters 1-5 AP Statistics Semester One Review Part 1 Chapters 1-5 AP Statistics Topics Describing Data Producing Data Probability Statistical Inference Describing Data Ch 1: Describing Data: Graphically and Numerically

More information

3.2A Least-Squares Regression

3.2A Least-Squares Regression 3.2A Least-Squares Regression Linear (straight-line) relationships between two quantitative variables are pretty common and easy to understand. Our instinct when looking at a scatterplot of data is to

More information

Business Statistics Probability

Business Statistics Probability Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world Visit us on the World Wide Web at: www.pearsoned.co.uk Pearson Education Limited 2014

More information

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review

Results & Statistics: Description and Correlation. I. Scales of Measurement A Review Results & Statistics: Description and Correlation The description and presentation of results involves a number of topics. These include scales of measurement, descriptive statistics used to summarize

More information

Examining Relationships Least-squares regression. Sections 2.3

Examining Relationships Least-squares regression. Sections 2.3 Examining Relationships Least-squares regression Sections 2.3 The regression line A regression line describes a one-way linear relationship between variables. An explanatory variable, x, explains variability

More information

The North Carolina Health Data Explorer

The North Carolina Health Data Explorer The North Carolina Health Data Explorer The Health Data Explorer provides access to health data for North Carolina counties in an interactive, user-friendly atlas of maps, tables, and charts. It allows

More information

Cancer: A Global View

Cancer: A Global View Cancer: A Global View Key Facts 1 Cancer is a leading cause of death worldwide: it accounted for 7.4 million deaths (around 13% of all deaths) in 2004. Lung, stomach, colorectal, liver, and breast cancer

More information

Intro to SPSS. Using SPSS through WebFAS

Intro to SPSS. Using SPSS through WebFAS Intro to SPSS Using SPSS through WebFAS http://www.yorku.ca/computing/students/labs/webfas/ Try it early (make sure it works from your computer) If you need help contact UIT Client Services Voice: 416-736-5800

More information

1 Version SP.A Investigate patterns of association in bivariate data

1 Version SP.A Investigate patterns of association in bivariate data Claim 1: Concepts and Procedures Students can explain and apply mathematical concepts and carry out mathematical procedures with precision and fluency. Content Domain: Statistics and Probability Target

More information

Psychology of Perception PSYC Spring 2017 Laboratory 2: Perception of Loudness

Psychology of Perception PSYC Spring 2017 Laboratory 2: Perception of Loudness PSYC 4165-100 Laboratory 2: Perception of Loudness Lab Overview Loudness is a psychological dimension of sound experience that depends on several physical dimensions of the sound stimulus (intensity, frequency,

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!

More information

Bangor University Laboratory Exercise 1, June 2008

Bangor University Laboratory Exercise 1, June 2008 Laboratory Exercise, June 2008 Classroom Exercise A forest land owner measures the outside bark diameters at.30 m above ground (called diameter at breast height or dbh) and total tree height from ground

More information

Multiple Linear Regression Analysis

Multiple Linear Regression Analysis Revised July 2018 Multiple Linear Regression Analysis This set of notes shows how to use Stata in multiple regression analysis. It assumes that you have set Stata up on your computer (see the Getting Started

More information

CHILD HEALTH AND DEVELOPMENT STUDY

CHILD HEALTH AND DEVELOPMENT STUDY CHILD HEALTH AND DEVELOPMENT STUDY 9. Diagnostics In this section various diagnostic tools will be used to evaluate the adequacy of the regression model with the five independent variables developed in

More information

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics Psy201 Module 3 Study and Assignment Guide Using Excel to Calculate Descriptive and Inferential Statistics What is Excel? Excel is a spreadsheet program that allows one to enter numerical values or data

More information

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1 From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Contents Dedication... iii Acknowledgments... xi About This Book... xiii About the Author... xvii Chapter 1: Introduction...

More information

Two-Way Independent ANOVA

Two-Way Independent ANOVA Two-Way Independent ANOVA Analysis of Variance (ANOVA) a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment. There

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Name Date Per Key Vocabulary: response variable explanatory variable independent variable dependent variable scatterplot positive association negative association linear correlation r-value regression

More information

Problem set 2: understanding ordinary least squares regressions

Problem set 2: understanding ordinary least squares regressions Problem set 2: understanding ordinary least squares regressions September 12, 2013 1 Introduction This problem set is meant to accompany the undergraduate econometrics video series on youtube; covering

More information

Homework #3. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

Homework #3. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Homework #3 Name Due Due on on February Tuesday, Due on February 17th, Sept Friday 28th 17th, Friday SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill

More information

STATISTICS & PROBABILITY

STATISTICS & PROBABILITY STATISTICS & PROBABILITY LAWRENCE HIGH SCHOOL STATISTICS & PROBABILITY CURRICULUM MAP 2015-2016 Quarter 1 Unit 1 Collecting Data and Drawing Conclusions Unit 2 Summarizing Data Quarter 2 Unit 3 Randomness

More information

MA 250 Probability and Statistics. Nazar Khan PUCIT Lecture 7

MA 250 Probability and Statistics. Nazar Khan PUCIT Lecture 7 MA 250 Probability and Statistics Nazar Khan PUCIT Lecture 7 Regression For bivariate data, we have studied that the correlation coefficient measures the spread of the data. Now we want to know how to

More information

Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction

Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction In this exercise, we will gain experience assessing scatterplots in regression and

More information

Unit 1 Exploring and Understanding Data

Unit 1 Exploring and Understanding Data Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile

More information

Quadratic Functions I

Quadratic Functions I Quadratic Functions I by Frank C. Wilson Activity Collection Featuring real-world contexts: Autism Awareness Autism Awareness Business Growth - USAA Changing Population Kentucky Changing Population - New

More information

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Chapter 3: Investigating associations between two variables Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Extract from Study Design Key knowledge

More information

TEACHING REGRESSION WITH SIMULATION. John H. Walker. Statistics Department California Polytechnic State University San Luis Obispo, CA 93407, U.S.A.

TEACHING REGRESSION WITH SIMULATION. John H. Walker. Statistics Department California Polytechnic State University San Luis Obispo, CA 93407, U.S.A. Proceedings of the 004 Winter Simulation Conference R G Ingalls, M D Rossetti, J S Smith, and B A Peters, eds TEACHING REGRESSION WITH SIMULATION John H Walker Statistics Department California Polytechnic

More information

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA ingle factor or One - Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group

More information

The Jumping Dog Quadratic Activity

The Jumping Dog Quadratic Activity Standards: The Jumping Dog Quadratic Activity A2.4.1 Identify the family of function best suited for modeling a given real-world situation. A2.4.3 Using the adapted general symbolic form, draw reasonable

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multiple Regression (MR) Types of MR Assumptions of MR SPSS procedure of MR Example based on prison data Interpretation of

More information

Study Guide #2: MULTIPLE REGRESSION in education

Study Guide #2: MULTIPLE REGRESSION in education Study Guide #2: MULTIPLE REGRESSION in education What is Multiple Regression? When using Multiple Regression in education, researchers use the term independent variables to identify those variables that

More information

Diurnal Pattern of Reaction Time: Statistical analysis

Diurnal Pattern of Reaction Time: Statistical analysis Diurnal Pattern of Reaction Time: Statistical analysis Prepared by: Alison L. Gibbs, PhD, PStat Prepared for: Dr. Principal Investigator of Reaction Time Project January 11, 2015 Summary: This report gives

More information

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug? MMI 409 Spring 2009 Final Examination Gordon Bleil Table of Contents Research Scenario and General Assumptions Questions for Dataset (Questions are hyperlinked to detailed answers) 1. Is there a difference

More information

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 Selecting a statistical test Relationships among major statistical methods General Linear Model and multiple regression Special

More information

Regression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions

Regression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 10 Correlation and Regression 10 1 Review and Preview 10 2 Correlation 10 3 Regression 10 4 Variation and Prediction Intervals

More information

Lab 3: Perception of Loudness

Lab 3: Perception of Loudness Lab 3: Perception of Loudness Lewis O. Harvey, Jr. and Samuel P. Paskewitz PSYC 4165: Psychology of Perception, Fall 2018 Department of Psychology and Neuroscience University of Colorado Boulder Boulder,

More information

Chapter Eight: Multivariate Analysis

Chapter Eight: Multivariate Analysis Chapter Eight: Multivariate Analysis Up until now, we have covered univariate ( one variable ) analysis and bivariate ( two variables ) analysis. We can also measure the simultaneous effects of two or

More information

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0% Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of

More information

Chapter 3 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.

Chapter 3 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question. Name: Class: Date: Chapter 3 Review Multiple Choice Identify the choice that best completes the statement or answers the question. Scenario 3-1 The height (in feet) and volume (in cubic feet) of usable

More information

SCAF Workshop Project Cost Control

SCAF Workshop Project Cost Control The following presentation was given at: SCAF Workshop Project Cost Control Tuesday 2nd February 2016 The Royal Institution of Naval Architects, London Released for distribution by the Author www.scaf.org.uk/library

More information

Statistics and Probability

Statistics and Probability Statistics and a single count or measurement variable. S.ID.1: Represent data with plots on the real number line (dot plots, histograms, and box plots). S.ID.2: Use statistics appropriate to the shape

More information

Unit 8 Bivariate Data/ Scatterplots

Unit 8 Bivariate Data/ Scatterplots Unit 8 Bivariate Data/ Scatterplots Oct 20 9:19 PM Scatterplots are used to determine if there is a relationship between two variables. /Correlation /Correlation /Correlation Line of best fit cuts the

More information

Comparing Two ROC Curves Independent Groups Design

Comparing Two ROC Curves Independent Groups Design Chapter 548 Comparing Two ROC Curves Independent Groups Design Introduction This procedure is used to compare two ROC curves generated from data from two independent groups. In addition to producing a

More information

Before we get started:

Before we get started: Before we get started: http://arievaluation.org/projects-3/ AEA 2018 R-Commander 1 Antonio Olmos Kai Schramm Priyalathta Govindasamy Antonio.Olmos@du.edu AntonioOlmos@aumhc.org AEA 2018 R-Commander 2 Plan

More information

INTERPRET SCATTERPLOTS

INTERPRET SCATTERPLOTS Chapter2 MODELING A BUSINESS 2.1: Interpret Scatterplots 2.2: Linear Regression 2.3: Supply and Demand 2.4: Fixed and Variable Expenses 2.5: Graphs of Expense and Revenue Functions 2.6: Breakeven Analysis

More information

Q: How do I get the protein concentration in mg/ml from the standard curve if the X-axis is in units of µg.

Q: How do I get the protein concentration in mg/ml from the standard curve if the X-axis is in units of µg. Photometry Frequently Asked Questions Q: How do I get the protein concentration in mg/ml from the standard curve if the X-axis is in units of µg. Protein standard curves are traditionally presented as

More information

Still important ideas

Still important ideas Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement

More information

Lab 4: Perception of Loudness

Lab 4: Perception of Loudness Lab 4: Perception of Loudness Lewis O. Harvey, Jr. and Dillon J. McGovern PSYC 4165: Psychology of Perception, Spring 2019 Department of Psychology and Neuroscience University of Colorado Boulder Boulder,

More information

Lecture 12 Cautions in Analyzing Associations

Lecture 12 Cautions in Analyzing Associations Lecture 12 Cautions in Analyzing Associations MA 217 - Stephen Sawin Fairfield University August 8, 2017 Cautions in Linear Regression Three things to be careful when doing linear regression we have already

More information

WELCOME! Lecture 11 Thommy Perlinger

WELCOME! Lecture 11 Thommy Perlinger Quantitative Methods II WELCOME! Lecture 11 Thommy Perlinger Regression based on violated assumptions If any of the assumptions are violated, potential inaccuracies may be present in the estimated regression

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression Assoc. Prof Dr Sarimah Abdullah Unit of Biostatistics & Research Methodology School of Medical Sciences, Health Campus Universiti Sains Malaysia Regression Regression analysis

More information

METHOD VALIDATION: WHY, HOW AND WHEN?

METHOD VALIDATION: WHY, HOW AND WHEN? METHOD VALIDATION: WHY, HOW AND WHEN? Linear-Data Plotter a,b s y/x,r Regression Statistics mean SD,CV Distribution Statistics Comparison Plot Histogram Plot SD Calculator Bias SD Diff t Paired t-test

More information

Chapter 4. More On Bivariate Data. More on Bivariate Data: 4.1: Transforming Relationships 4.2: Cautions about Correlation

Chapter 4. More On Bivariate Data. More on Bivariate Data: 4.1: Transforming Relationships 4.2: Cautions about Correlation Chapter 4 More On Bivariate Data Chapter 3 discussed methods for describing and summarizing bivariate data. However, the focus was on linear relationships. In this chapter, we are introduced to methods

More information

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions

More information

MA 151: Using Minitab to Visualize and Explore Data The Low Fat vs. Low Carb Debate

MA 151: Using Minitab to Visualize and Explore Data The Low Fat vs. Low Carb Debate MA 151: Using Minitab to Visualize and Explore Data The Low Fat vs. Low Carb Debate September 5, 2018 1 Introduction to the Data We will be working with a synthetic data set meant to resemble the weight

More information

Basic Biostatistics. Chapter 1. Content

Basic Biostatistics. Chapter 1. Content Chapter 1 Basic Biostatistics Jamalludin Ab Rahman MD MPH Department of Community Medicine Kulliyyah of Medicine Content 2 Basic premises variables, level of measurements, probability distribution Descriptive

More information

Charts Worksheet using Excel Obesity Can a New Drug Help?

Charts Worksheet using Excel Obesity Can a New Drug Help? Worksheet using Excel 2000 Obesity Can a New Drug Help? Introduction Obesity is known to be a major health risk. The data here arise from a study which aimed to investigate whether or not a new drug, used

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment

More information

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition

List of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing

More information

Multivariable Systems. Lawrence Hubert. July 31, 2011

Multivariable Systems. Lawrence Hubert. July 31, 2011 Multivariable July 31, 2011 Whenever results are presented within a multivariate context, it is important to remember that there is a system present among the variables, and this has a number of implications

More information

Bivariate &/vs. Multivariate

Bivariate &/vs. Multivariate Bivariate &/vs. Multivariate Differences between correlations, simple regression weights & multivariate regression weights Patterns of bivariate & multivariate effects Prox variables Multiple regression

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Statistics Final Review Semeter I Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) The Centers for Disease

More information

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression

Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression! Equation of Regression Line; Residuals! Effect of Explanatory/Response Roles! Unusual Observations! Sample

More information

6. Unusual and Influential Data

6. Unusual and Influential Data Sociology 740 John ox Lecture Notes 6. Unusual and Influential Data Copyright 2014 by John ox Unusual and Influential Data 1 1. Introduction I Linear statistical models make strong assumptions about the

More information

General Biostatistics Concepts

General Biostatistics Concepts General Biostatistics Concepts Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. What is Biostatistics? 2. Types of Measurements

More information