Introduction to regression
|
|
- Rodger McBride
- 6 years ago
- Views:
Transcription
1 Introduction to regression Regression describes how one variable (response) depends on another variable (explanatory variable). Response variable: variable of interest, measures the outcome of a study Explanatory variable: explains (or even causes) changes in response variable Examples: Hearing difficulties: response - sound level (decibels), explanatory - age (years) Real estate market: response - listing prize ($), explanatory - house size (sq. ft.) Salaries: response - salary ($), explanatory - experience (years), education, sex Least squares regression, Jan 4, 4 - -
2 Introduction to regression Example: Food expenditures and income Data: Sample of households 6 food expenditure income Questions: How does food expenditure (Y ) depend on income ()? Suppose we know that = x, what can we tell about Y? Linear regression: If the response Y depends linearly on the explanatory variable, we can use a straight line (regression line) to predict Y from. Least squares regression, Jan 4, 4 - -
3 Least squares regression How to find the regression line 6 food expenditure income food expenditure Observed y Difference y y^ Predicted y^ income Since we intend to predict Y from, the errors of interest are mispredictions of Y for fixed. The least squares regression line of Y on is the line that minimizes the sum of squared errors. For observations (x, y ),..., (x n, y n ), the regression line is given by where Ŷ = a + b b = r s y s x and a = ȳ b x (r correlation coefficient, s x, s x standard deviations, x, ȳ means) Least squares regression, Jan 4, 4-3 -
4 Least squares regression Example: Food expenditure and income Y Y The summary statistics are: x = 45.5 s x = 3.96 ȳ = 7.97 s y = 4.66 r =.946 The regression coefficients are: b = r s y s x = =.84 a = ȳ b x = =.4 food expenditure income Least squares regression, Jan 4, 4-4 -
5 Interpreting the regression model The response in the model is denoted Ŷ to indicate that these are predicted Y values, not the true Y values. The hat denotes prediction. The slope of the line indicates how much change in. Ŷ changes for a unit The intercept is the value of Ŷ for =. It may or not have a physical interpretation, depending on whether or not can take values near. To make a prediction for an unobserved, just plug it in and calculate Ŷ. Note that the line need not pass through the observed data points. In fact, it often will not pass through any of them. Least squares regression, Jan 4, 4-5 -
6 Regression and correlation Correlation analysis: We are interested in the joint distribution of two (or more) quantitive variables. Example: Heights of,78 fathers and sons Son s height (inches) Father s height (inches) Points are scattered around the SD line: (y ȳ) = s y s x (x x) goes through center ( x, ȳ) has slope s y /s x The correlation r measures how much the points spread around the SD line. Least squares regression, Jan 4, 4-6 -
7 Regression analysis: Regression and correlation We are interested how the distribution of one response variable depends on one (or more) explanatory variables. Example: Heights of,78 fathers and sons Son s height (inches) 8 Father s height = 64 inches Density Density Son s height (inches) Father s height = 68 inches x x Father s height (inches) Density Son s height (inches) Father s height = 7 inches Son s height (inches) x Son s height (inches) In each vertical strip, the points are distributed around the regression line Father s height (inches) Least squares regression, Jan 4, 4-7 -
8 Properties of least squares regression The distinction between explanatory and response variables is essential. Looking at vertical deviations means that changing the axes would change the regression line x^ = a + b y Son s height (inches) y^ = a + bx Father s height (inches) A change of sd in corresponds to a change of r sds in Y. The least squares regression line always passes through the point ( x, ȳ). r (the square of the correlation) is the fraction of the variation in the values of y that is explained by the least squares regression on x. When reporting the results of a linear regression, you should report r. These properties depend on the least-squares fitting criterion and are one reason why that criterion is used. Least squares regression, Jan 4, 4-8 -
9 Regression effect The regression effect In virtually all test-retest situations, the bottom group on the first test will on average show some improvement on the second test - and the top group will on average fall back. This is the regression effect. The statistician and geneticist Sir Francis Galton (8-9) called this effect regression to mediocrity Son s height (inches) Father s height (inches) Regression fallacy Thinking that the regression effect must be due to something important, not just the spread around the line, is the regression fallacy. Least squares regression, Jan 4, 4-9 -
10 Regression in STATA. infile food income size using food.txt. graph twoway scatter food income lfit food income, legend(off) > ytitle(food). regress food income Source SS df MS Number of obs = F(, 8) = 5.97 Model Prob > F =. Residual R-squared = Adj R-squared =.888 Total Root MSE = food Coef. Std. Err. t P> t [95% Conf. Interval] income _cons Food expenditure Income This graph has been generated using the graphical user interface of STATA. The complete command is:. twoway (scatter food income, msymbol(circle) msize(medium) mcolor(black)) > (lfit food income, range( ) clcolor(black) clpat(solid) clwidth(medium)), > ytitle(food expenditure, size(large)) ylabel(, valuelabel angle(horizontal) > labsize(medlarge)) xtitle(income, size(large)) xscale(range( )) > xlabel((), labsize(medlarge)) legend(off) ysize() xsize(3) Least squares regression, Jan 4, 4 - -
11 Residual plots : difference of observed and predicted values e i = observed y predicted y = y i ŷ i = y i (a + b x i ) For a least squares regression, the residuals always have mean zero. Residual plot A residual plot is a scatterplot of the residuals against the explanatory variable. It is a diagnostic tool to assess the fit of the regression line. Patterns to look for: Curvature indicates that the relationship is not linear. Increasing or decreasing spread indicates that the prediction will be less accurate in the range of explanatory variables where the spread is larger. Points with large residuals are outliers in the vertical direction. Points that are extreme in the x direction are potential high influence points. Influential observations are individuals with extreme x values that exert a strong influence on the position of the regression line. Removing them would significantly change the regression line. Least squares regression, Jan 4, 4 - -
12 Regression Diagnostics Example: First data set Y Fitted values 5 5 residuals are regularly distributed Least squares regression, Jan 4, 4 - -
13 Regression Diagnostics Example: Second data set Y Fitted values 5 5 functional relationship other than linear Least squares regression, Jan 4, 4-3 -
14 Regression Diagnostics Example: Third data set 5 Y Fitted values outlier, regression line misfits majority of data Least squares regression, Jan 4, 4-4 -
15 Regression Diagnostics Example: Fourth data set 5 Y Fitted values 5 5 heteroscedasticity Least squares regression, Jan 4, 4-5 -
16 Regression Diagnostics Example: Fifth data set 5 Y Fitted values 5 5 one separate point in direction of x, highly influential Least squares regression, Jan 4, 4-6 -
Multiple Linear Regression Analysis
Revised July 2018 Multiple Linear Regression Analysis This set of notes shows how to use Stata in multiple regression analysis. It assumes that you have set Stata up on your computer (see the Getting Started
More informationChapter 3: Examining Relationships
Name Date Per Key Vocabulary: response variable explanatory variable independent variable dependent variable scatterplot positive association negative association linear correlation r-value regression
More informationNotes for laboratory session 2
Notes for laboratory session 2 Preliminaries Consider the ordinary least-squares (OLS) regression of alcohol (alcohol) and plasma retinol (retplasm). We do this with STATA as follows:. reg retplasm alcohol
More informationChapter 3 CORRELATION AND REGRESSION
CORRELATION AND REGRESSION TOPIC SLIDE Linear Regression Defined 2 Regression Equation 3 The Slope or b 4 The Y-Intercept or a 5 What Value of the Y-Variable Should be Predicted When r = 0? 7 The Regression
More informationMODEL I: DRINK REGRESSED ON GPA & MALE, WITHOUT CENTERING
Interpreting Interaction Effects; Interaction Effects and Centering Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 20, 2015 Models with interaction effects
More informationLecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression
Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression! Equation of Regression Line; Residuals! Effect of Explanatory/Response Roles! Unusual Observations! Sample
More informationCRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys
Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests
More informationAge (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized)
Criminal Justice Doctoral Comprehensive Exam Statistics August 2016 There are two questions on this exam. Be sure to answer both questions in the 3 and half hours to complete this exam. Read the instructions
More informationThis tutorial presentation is prepared by. Mohammad Ehsanul Karim
STATA: The Red tutorial STATA: The Red tutorial This tutorial presentation is prepared by Mohammad Ehsanul Karim ehsan.karim@gmail.com STATA: The Red tutorial This tutorial presentation is prepared by
More information3.2 Least- Squares Regression
3.2 Least- Squares Regression Linear (straight- line) relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these
More informationReminders/Comments. Thanks for the quick feedback I ll try to put HW up on Saturday and I ll you
Reminders/Comments Thanks for the quick feedback I ll try to put HW up on Saturday and I ll email you Final project will be assigned in the last week of class You ll have that week to do it Participation
More informationHW 3.2: page 193 #35-51 odd, 55, odd, 69, 71-78
35. What s My Line? You use the same bar of soap to shower each morning. The bar weighs 80 grams when it is new. Its weight goes down by 6 grams per day on average. What is the equation of the regression
More informationExamining Relationships Least-squares regression. Sections 2.3
Examining Relationships Least-squares regression Sections 2.3 The regression line A regression line describes a one-way linear relationship between variables. An explanatory variable, x, explains variability
More informationAP Statistics Practice Test Ch. 3 and Previous
AP Statistics Practice Test Ch. 3 and Previous Name Date Use the following to answer questions 1 and 2: A researcher measures the height (in feet) and volume of usable lumber (in cubic feet) of 32 cherry
More informationFinal Exam - section 2. Thursday, December hours, 30 minutes
Econometrics, ECON312 San Francisco State University Michael Bar Fall 2011 Final Exam - section 2 Thursday, December 15 2 hours, 30 minutes Name: Instructions 1. This is closed book, closed notes exam.
More informationStat 13, Lab 11-12, Correlation and Regression Analysis
Stat 13, Lab 11-12, Correlation and Regression Analysis Part I: Before Class Objective: This lab will give you practice exploring the relationship between two variables by using correlation, linear regression
More information3.2A Least-Squares Regression
3.2A Least-Squares Regression Linear (straight-line) relationships between two quantitative variables are pretty common and easy to understand. Our instinct when looking at a scatterplot of data is to
More informationModeling unobserved heterogeneity in Stata
Modeling unobserved heterogeneity in Stata Rafal Raciborski StataCorp LLC November 27, 2017 Rafal Raciborski (StataCorp) Modeling unobserved heterogeneity November 27, 2017 1 / 59 Plan of the talk Concepts
More informationChapter 3: Describing Relationships
Chapter 3: Describing Relationships Objectives: Students will: Construct and interpret a scatterplot for a set of bivariate data. Compute and interpret the correlation, r, between two variables. Demonstrate
More informationANOVA. Thomas Elliott. January 29, 2013
ANOVA Thomas Elliott January 29, 2013 ANOVA stands for analysis of variance and is one of the basic statistical tests we can use to find relationships between two or more variables. ANOVA compares the
More informationCHAPTER ONE CORRELATION
CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to
More informationLecture 12: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression
Lecture 12: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Equation of Regression Line; Residuals Effect of Explanatory/Response Roles Unusual Observations Sample
More information5 To Invest or not to Invest? That is the Question.
5 To Invest or not to Invest? That is the Question. Before starting this lab, you should be familiar with these terms: response y (or dependent) and explanatory x (or independent) variables; slope and
More information1.4 - Linear Regression and MS Excel
1.4 - Linear Regression and MS Excel Regression is an analytic technique for determining the relationship between a dependent variable and an independent variable. When the two variables have a linear
More informationChapter 3 Review. Name: Class: Date: Multiple Choice Identify the choice that best completes the statement or answers the question.
Name: Class: Date: Chapter 3 Review Multiple Choice Identify the choice that best completes the statement or answers the question. Scenario 3-1 The height (in feet) and volume (in cubic feet) of usable
More informationMath 124: Module 2, Part II
, Part II David Meredith Department of Mathematics San Francisco State University September 15, 2009 What we will do today 1 Explanatory and Response Variables When you study the relationship between two
More informationSection 3.2 Least-Squares Regression
Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.
More informationLab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups.
Lab 4 (M13) Objective: This lab will give you more practice exploring the shape of data, and in particular in breaking the data into two groups. Activity 1 Examining Data From Class Background Download
More informationSociology 63993, Exam1 February 12, 2015 Richard Williams, University of Notre Dame,
Sociology 63993, Exam1 February 12, 2015 Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ I. True-False. (20 points) Indicate whether the following statements are true or false.
More informationCHILD HEALTH AND DEVELOPMENT STUDY
CHILD HEALTH AND DEVELOPMENT STUDY 9. Diagnostics In this section various diagnostic tools will be used to evaluate the adequacy of the regression model with the five independent variables developed in
More informationbivariate analysis: The statistical analysis of the relationship between two variables.
bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for
More informationMULTIPLE REGRESSION OF CPS DATA
MULTIPLE REGRESSION OF CPS DATA A further inspection of the relationship between hourly wages and education level can show whether other factors, such as gender and work experience, influence wages. Linear
More informationSTATISTICS INFORMED DECISIONS USING DATA
STATISTICS INFORMED DECISIONS USING DATA Fifth Edition Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation Learning Objectives 1. Draw and interpret scatter diagrams
More informationCHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships 3.1 Scatterplots and Correlation The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Reading Quiz 3.1 True/False 1.
More informationSTAT 201 Chapter 3. Association and Regression
STAT 201 Chapter 3 Association and Regression 1 Association of Variables Two Categorical Variables Response Variable (dependent variable): the outcome variable whose variation is being studied Explanatory
More informationNORTH SOUTH UNIVERSITY TUTORIAL 2
NORTH SOUTH UNIVERSITY TUTORIAL 2 AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 Correlation Analysis INTRODUCTION In correlation analysis, we estimate
More informationProblem Set 3 ECN Econometrics Professor Oscar Jorda. Name. ESSAY. Write your answer in the space provided.
Problem Set 3 ECN 140 - Econometrics Professor Oscar Jorda Name ESSAY. Write your answer in the space provided. 1) Sir Francis Galton, a cousin of James Darwin, examined the relationship between the height
More informationCHAPTER TWO REGRESSION
CHAPTER TWO REGRESSION 2.0 Introduction The second chapter, Regression analysis is an extension of correlation. The aim of the discussion of exercises is to enhance students capability to assess the effect
More informationm 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers
SOCY5061 RELATIVE RISKS, RELATIVE ODDS, LOGISTIC REGRESSION RELATIVE RISKS: Suppose we are interested in the association between lung cancer and smoking. Consider the following table for the whole population:
More informationUNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016
UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016 STAB22H3 Statistics I, LEC 01 and LEC 02 Duration: 1 hour and 45 minutes Last Name: First Name:
More informationMath 075 Activities and Worksheets Book 2:
Math 075 Activities and Worksheets Book 2: Linear Regression Name: 1 Scatterplots Intro to Correlation Represent two numerical variables on a scatterplot and informally describe how the data points are
More informationSimple Linear Regression the model, estimation and testing
Simple Linear Regression the model, estimation and testing Lecture No. 05 Example 1 A production manager has compared the dexterity test scores of five assembly-line employees with their hourly productivity.
More informationMultiple Regression Analysis
Multiple Regression Analysis Basic Concept: Extend the simple regression model to include additional explanatory variables: Y = β 0 + β1x1 + β2x2 +... + βp-1xp + ε p = (number of independent variables
More informationSTATISTICS 201. Survey: Provide this Info. How familiar are you with these? Survey, continued IMPORTANT NOTE. Regression and ANOVA 9/29/2013
STATISTICS 201 Survey: Provide this Info Outline for today: Go over syllabus Provide requested information on survey (handed out in class) Brief introduction and hands-on activity Name Major/Program Year
More informationUnit 1 Exploring and Understanding Data
Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile
More informationUse the above variables and any you might need to construct to specify the MODEL A/C comparisons you would use to ask the following questions.
Fall, 2002 Grad Stats Final Exam There are four questions on this exam, A through D, and each question has multiple sub-questions. Unless otherwise indicated, each sub-question is worth 3 points. Question
More informationMA 250 Probability and Statistics. Nazar Khan PUCIT Lecture 7
MA 250 Probability and Statistics Nazar Khan PUCIT Lecture 7 Regression For bivariate data, we have studied that the correlation coefficient measures the spread of the data. Now we want to know how to
More informationIAPT: Regression. Regression analyses
Regression analyses IAPT: Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a student project
More informationHomework #3. SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
Homework #3 Name Due Due on on February Tuesday, Due on February 17th, Sept Friday 28th 17th, Friday SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Fill
More informationStatistics for Psychology
Statistics for Psychology SIXTH EDITION CHAPTER 12 Prediction Prediction a major practical application of statistical methods: making predictions make informed (and precise) guesses about such things as
More informationName: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies.
Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam May 28 th, 2015: 9am to 1pm Instructions: 1. There are seven questions and 12 pages. 2. Read each question carefully. Answer
More information2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%
Capstone Test (will consist of FOUR quizzes and the FINAL test grade will be an average of the four quizzes). Capstone #1: Review of Chapters 1-3 Capstone #2: Review of Chapter 4 Capstone #3: Review of
More informationBiology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction
Biology 345: Biometry Fall 2005 SONOMA STATE UNIVERSITY Lab Exercise 5 Residuals and multiple regression Introduction In this exercise, we will gain experience assessing scatterplots in regression and
More informationBusiness Statistics Probability
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationResults & Statistics: Description and Correlation. I. Scales of Measurement A Review
Results & Statistics: Description and Correlation The description and presentation of results involves a number of topics. These include scales of measurement, descriptive statistics used to summarize
More informationAnswer all three questions. All questions carry equal marks.
UNIVERSITY OF DUBLIN TRINITY COLLEGE Faculty of Engineering, Mathematics and Science School of Computer Science and Statistics Postgraduate Diploma in Statistics Trinity Term 2 Introduction to Regression
More informationRegression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions
MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 10 Correlation and Regression 10 1 Review and Preview 10 2 Correlation 10 3 Regression 10 4 Variation and Prediction Intervals
More informationINTERPRET SCATTERPLOTS
Chapter2 MODELING A BUSINESS 2.1: Interpret Scatterplots 2.2: Linear Regression 2.3: Supply and Demand 2.4: Fixed and Variable Expenses 2.5: Graphs of Expense and Revenue Functions 2.6: Breakeven Analysis
More informationClass 7 Everything is Related
Class 7 Everything is Related Correlational Designs l 1 Topics Types of Correlational Designs Understanding Correlation Reporting Correlational Statistics Quantitative Designs l 2 Types of Correlational
More informationEffects of Nutrients on Shrimp Growth
Data Set 5: Effects of Nutrients on Shrimp Growth Statistical setting This Handout is an example of extreme collinearity of the independent variables, and of the methods used for diagnosing this problem.
More informationPitfalls in Linear Regression Analysis
Pitfalls in Linear Regression Analysis Due to the widespread availability of spreadsheet and statistical software for disposal, many of us do not really have a good understanding of how to use regression
More information1. Objective: analyzing CD4 counts data using GEE marginal model and random effects model. Demonstrate the analysis using SAS and STATA.
LDA lab Feb, 6 th, 2002 1 1. Objective: analyzing CD4 counts data using GEE marginal model and random effects model. Demonstrate the analysis using SAS and STATA. 2. Scientific question: estimate the average
More informationSTATISTICS & PROBABILITY
STATISTICS & PROBABILITY LAWRENCE HIGH SCHOOL STATISTICS & PROBABILITY CURRICULUM MAP 2015-2016 Quarter 1 Unit 1 Collecting Data and Drawing Conclusions Unit 2 Summarizing Data Quarter 2 Unit 3 Randomness
More informationMidterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do.
Midterm STAT-UB.0003 Regression and Forecasting Models The exam is closed book and notes, with the following exception: you are allowed to bring one letter-sized page of notes into the exam (front and
More informationM 140 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75
M 140 est 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDI! Problem Max. Points Your Points 1-10 10 11 10 12 3 13 4 14 18 15 8 16 7 17 14 otal 75 Multiple choice questions (1 point each) For questions
More informationUnderstandable Statistics
Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement
More informationRegression. Regression lines CHAPTER 5
CHAPTER 5 NASA/GSFC Can scientists predict in advance how many hurricanes the coming season will bring? Exercise 5.44 has some data. Regression IN THIS CHAPTER WE COVER... Linear (straight-line) relationships
More informationSimple Linear Regression
Simple Linear Regression Assoc. Prof Dr Sarimah Abdullah Unit of Biostatistics & Research Methodology School of Medical Sciences, Health Campus Universiti Sains Malaysia Regression Regression analysis
More information3. For a $5 lunch with a 55 cent ($0.55) tip, what is the value of the residual?
STATISTICS 216, SPRING 2006 Name: EXAM 1; February 21, 2006; 100 points. Instructions: Closed book. Closed notes. Calculator allowed. Double-sided exam. NO CELL PHONES. Multiple Choice (3pts each). Circle
More information2. Scientific question: Determine whether there is a difference between boys and girls with respect to the distance and its change over time.
LDA lab Feb, 11 th, 2002 1 1. Objective:analyzing dental data using ordinary least square (OLS) and Generalized Least Square(GLS) in STATA. 2. Scientific question: Determine whether there is a difference
More informationHZAU MULTIVARIATE HOMEWORK #2 MULTIPLE AND STEPWISE LINEAR REGRESSION
HZAU MULTIVARIATE HOMEWORK #2 MULTIPLE AND STEPWISE LINEAR REGRESSION Using the malt quality dataset on the class s Web page: 1. Determine the simple linear correlation of extract with the remaining variables.
More informationMethod Comparison Report Semi-Annual 1/5/2018
Method Comparison Report Semi-Annual 1/5/2018 Prepared for Carl Commissioner Regularatory Commission 123 Commission Drive Anytown, XX, 12345 Prepared by Dr. Mark Mainstay Clinical Laboratory Kennett Community
More informationEXECUTIVE SUMMARY DATA AND PROBLEM
EXECUTIVE SUMMARY Every morning, almost half of Americans start the day with a bowl of cereal, but choosing the right healthy breakfast is not always easy. Consumer Reports is therefore calculated by an
More informationRegression Including the Interaction Between Quantitative Variables
Regression Including the Interaction Between Quantitative Variables The purpose of the study was to examine the inter-relationships among social skills, the complexity of the social situation, and performance
More informationCorrelation and regression
PG Dip in High Intensity Psychological Interventions Correlation and regression Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/ Correlation Example: Muscle strength
More informationChapter 1: Exploring Data
Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!
More informationFurther Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables
Chapter 3: Investigating associations between two variables Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Extract from Study Design Key knowledge
More informationSCATTER PLOTS AND TREND LINES
1 SCATTER PLOTS AND TREND LINES LEARNING MAP INFORMATION STANDARDS 8.SP.1 Construct and interpret scatter s for measurement to investigate patterns of between two quantities. Describe patterns such as
More informationGeneral Example: Gas Mileage (Stat 5044 Schabenberger & J.P.Morgen)
General Example: Gas Mileage (Stat 5044 Schabenberger & J.P.Morgen) From Motor Trend magazine data were obtained for n=32 cars on the following variables: Y= Gas Mileage (miles per gallon, MPG) X1= Engine
More information12.1 Inference for Linear Regression. Introduction
12.1 Inference for Linear Regression vocab examples Introduction Many people believe that students learn better if they sit closer to the front of the classroom. Does sitting closer cause higher achievement,
More informationM 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60
M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-10 10 11 3 12 4 13 3 14 10 15 14 16 10 17 7 18 4 19 4 Total 60 Multiple choice questions (1 point each) For questions
More informationRESPONSE SURFACE MODELING AND OPTIMIZATION TO ELUCIDATE THE DIFFERENTIAL EFFECTS OF DEMOGRAPHIC CHARACTERISTICS ON HIV PREVALENCE IN SOUTH AFRICA
RESPONSE SURFACE MODELING AND OPTIMIZATION TO ELUCIDATE THE DIFFERENTIAL EFFECTS OF DEMOGRAPHIC CHARACTERISTICS ON HIV PREVALENCE IN SOUTH AFRICA W. Sibanda 1* and P. Pretorius 2 1 DST/NWU Pre-clinical
More informationTable of Contents. Plots. Essential Statistics for Nursing Research 1/12/2017
Essential Statistics for Nursing Research Kristen Carlin, MPH Seattle Nursing Research Workshop January 30, 2017 Table of Contents Plots Descriptive statistics Sample size/power Correlations Hypothesis
More informationOrdinary Least Squares Regression
Ordinary Least Squares Regression March 2013 Nancy Burns (nburns@isr.umich.edu) - University of Michigan From description to cause Group Sample Size Mean Health Status Standard Error Hospital 7,774 3.21.014
More information6. Unusual and Influential Data
Sociology 740 John ox Lecture Notes 6. Unusual and Influential Data Copyright 2014 by John ox Unusual and Influential Data 1 1. Introduction I Linear statistical models make strong assumptions about the
More informationPsych 5741/5751: Data Analysis University of Boulder Gary McClelland & Charles Judd. Exam #2, Spring 1992
Exam #2, Spring 1992 Question 1 A group of researchers from a neurobehavioral institute are interested in the relationships that have been found between the amount of cerebral blood flow (CB FLOW) to the
More informationTEACHING REGRESSION WITH SIMULATION. John H. Walker. Statistics Department California Polytechnic State University San Luis Obispo, CA 93407, U.S.A.
Proceedings of the 004 Winter Simulation Conference R G Ingalls, M D Rossetti, J S Smith, and B A Peters, eds TEACHING REGRESSION WITH SIMULATION John H Walker Statistics Department California Polytechnic
More informationM15_BERE8380_12_SE_C15.6.qxd 2/21/11 8:21 PM Page Influence Analysis 1
M15_BERE8380_12_SE_C15.6.qxd 2/21/11 8:21 PM Page 1 15.6 Influence Analysis FIGURE 15.16 Minitab worksheet containing computed values for the Studentized deleted residuals, the hat matrix elements, and
More information14.1: Inference about the Model
14.1: Inference about the Model! When a scatterplot shows a linear relationship between an explanatory x and a response y, we can use the LSRL fitted to the data to predict a y for a given x. However,
More informationAn Introduction to Statistical Thinking Dan Schafer Table of Contents
An Introduction to Statistical Thinking Dan Schafer Table of Contents PART I: CONCLUSIONS AND THEIR UNCERTAINTY NUMERICAL AND ELEMENTS OF Chapter1 Statistics as a Branch of Human Reasoning Chapter 2 What
More informationHow Faithful is the Old Faithful? The Practice of Statistics, 5 th Edition 1
How Faithful is the Old Faithful? The Practice of Statistics, 5 th Edition 1 Who Has Been Eating My Cookies????????? Someone has been steeling the cookie I bought for your class A teacher from the highschool
More informationAP Statistics. Semester One Review Part 1 Chapters 1-5
AP Statistics Semester One Review Part 1 Chapters 1-5 AP Statistics Topics Describing Data Producing Data Probability Statistical Inference Describing Data Ch 1: Describing Data: Graphically and Numerically
More informationHomework Linear Regression Problems should be worked out in your notebook
Homework Linear Regression Problems should be worked out in your notebook 1. Following are the mean heights of Kalama children: Age (months) 18 19 20 21 22 23 24 25 26 27 28 29 Height (cm) 76.1 77.0 78.1
More informationCaffeine & Calories in Soda. Statistics. Anthony W Dick
1 Caffeine & Calories in Soda Statistics Anthony W Dick 2 Caffeine & Calories in Soda Description of Experiment Does the caffeine content in soda have anything to do with the calories? This is the question
More information11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES
Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are
More informationRegression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES
CHAPTER SIXTEEN Regression NOTE TO INSTRUCTORS This chapter includes a number of complex concepts that may seem intimidating to students. Encourage students to focus on the big picture through some of
More informationDr. Kelly Bradley Final Exam Summer {2 points} Name
{2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.
More informationF1: Introduction to Econometrics
F1: Introduction to Econometrics Feng Li Department of Statistics, Stockholm University General information Homepage of this course: http://gauss.stat.su.se/gu/ekonometri.shtml Lecturer F1 F7: Feng Li,
More informationLecture 12 Cautions in Analyzing Associations
Lecture 12 Cautions in Analyzing Associations MA 217 - Stephen Sawin Fairfield University August 8, 2017 Cautions in Linear Regression Three things to be careful when doing linear regression we have already
More informationStatistical Reasoning in Public Health 2009 Biostatistics 612, Homework #2
Statistical Reasoning in Public Health 2009 Biostatistics 612, Homework #2 1. Suppose it is the year 1985 and you are doing research on the differences in wages earned by men and women in the U.S. workforce.
More information