Chapter 5. Describing numerical data
|
|
- Spencer Lawson
- 6 years ago
- Views:
Transcription
1 Chapter 5 Describing numerical data 1
2 Topics Numerical descriptive measures Location Variability Other measurements Graphical methods Histogram Boxplot, Stem and leaf plot Scatter plot for bivariate data 2
3 Exploring numerical data The number of observations in the data set The center of the data Mean Median The variability of the data Variance or standard deviation Range Interquartile range Coefficient of Variation 3
4 Other measures Minimum and Maximum Exploring numerical data First quartile (25th percentile) and third quartile (75th percentile) Percentiles Skewness Kurtosis 4
5 Example 5.1 The percentage returns achieved by 76 average risk funds
6 Some preliminary observations What can we say about the returns of these funds? Almost all the funds have a positive return. The spread of the returns ranges from -2.7% to 91.15%. There appears to be one very high return, 91.15%. 6
7 Descriptive statistics: SAS SAS:Proc Univariate Program data ex5 1ar; infile F:\ST2137\lecdata\ex5 1ar.txt firstobs=2; input return; proc univariate data=ex5 1ar; title Simple Descriptive Statistics ; var return; run; 7
8 Output from Proc Univariate 8
9 Output from Proc Univariate 9
10 Output from Proc Univariate 10
11 Output from Proc Univariate 11
12 Output from Proc Univariate 12
13 SAS: Proc Mean proc means data=ex5 1ar n mean std min max stderr maxdec=4; title Procedure Means ; var return; run; 13
14 Output from Proc Mean 14
15 Descriptive statistics: R > ex5.1=read.table ( F:/ST2137/lecdata/ex5 1ar.txt,header=T) > ex5.1ar=ex5.1[,1] > summary(ex5.1ar) Min 1st Qu Median Mean 3rd Qu. Max > mean(ex5.1ar) [1] > median(ex5.1ar) [1]22.98 > min(ex5.1ar) [1] -2.7 > max(ex5.1ar) [1]
16 Descriptive statistics: R > range(ex5.1ar) [1] > quantile(ex5.1ar) 0% 25% 50% 75% 100% > var(ex5.1ar) [1] > sd(ex5.1ar) [1] > IQR(ex5.1ar) #interquartile range [1]
17 Descriptive statistics: R > ex5.1ar[order(ex5.1ar)[1:5]] #the smallest 5 observations [1] > nar=length(ex5.1ar) > ex5.1ar[order(ex5.1ar)[(nar-5):nar]] #the biggest 5 observations [1] > cv=function(x)sd(x)/mean(x)*100 # Compute coefficient of variation (ex5.1ar) > cv(ex5.1ar) return
18 Calculating sample skewness: R Unbiased estimator of skewness is given by n(n 1) n 2 > skew=function(x){ n=length(x) m3=sum((x-mean(x)) 3)/n m2=sum((x-mean(x)) 2)/n m3/m2 (3/2) sqrt(n (n-1))/(n-2)} > skew(ex5.1ar) [1] ( m 3 ) m 3/2 2 18
19 Calculating sample kurtosis: R Unbiased estimator of kurtosis is given by (n 1) (n 2)(n 3) ((n + 1)m 4 m 2 2 3(n 1)) > kurt=function(x){ n=length(x) m4=sum((x-mean(x)) 4)/n m2=sum((x-mean(x)) 2)/n (n-1)/((n-2)*(n-3))*((n+1)*m4/m2 2-3*(n-1))} > kurt(ex5.1ar) [1]
20 Descriptive Statistics: SPSS Import ex5.1ar.txt into the SPSS and call it ex5.1ar.sav Analyze Descriptive Statistics Descriptive... Move the variable return from the left panel to the right panel Click Option Choose the statistics that you want to compute Click Continue OK 20
21 Descriptive Statistics: SPSS 21
22 Output of descriptive statistics: SPSS Descriptive Statistics N Range Minimum Maximum Return Statistic Statistic Statistic Statistic Mean Std Deviation Variance Statistic Std.Error Statistic Statistic Skewness Kurtosis Statistic Std.Error Statistic Std.Error
23 Graphical Methods: SAS Box Plot Stem and leaf plot Histogram QQ Plot 23
24 Plot in Proc Univariate: SAS Program /*Boxplot, stem and leaf plot and qq plot */ proc univariate data=ex5 1ar plot; var return; run; 24
25 Plot in Proc Univariate: SAS output 25
26 Plot in Proc Univariate: SAS output 26
27 Program /*Histogram and qqplot */ options reset=all ftext= Arial/bo cback=white colors=(black) gunit=pct htext=2 hpos=15; Histogram: SAS 27
28 Histogram: SAS Program proc univariate data=ex5 1ar; var return; histogram return/normal; /*Option normal creates a normal curve */ inset mean= Mean std= Standard Deviation /font= Arial pos=ne height=4; qqplot return; run; quit; 28
29 Output: Histogram in SAS 29
30 Output: Histogram in SAS 30
31 Histogram: R > hist(return,include.lowest=true,freq=true, main=paste( Histogram of return ), xlab= return, ylab= frequency, axes=true) > # Normal curve imposed on the histogram >xpt=seq(-10,100,0.1) >ypt=dnorm(seq(-10,100,0.1),mean(return),sd(return)) >ypt=ypt*length(return)*10 >lines(xpt,ypt) 31
32 32
33 QQplot: R > # qqplot of normal distribution > qqnorm(return) >qqline(return) 33
34 Stem and leaf plot: R > stem(return) 34
35 Boxplot: R >boxplot(return) 35
36 Plot: SPSS Open the SPSS data file ex5.1ar.sav Analyze Descriptive Statistics Explore.. Click plots Choose the plots you want Click Continue OK 36
37 Plot: SPSS 37
38 Output: Histogram in SPSS 38
39 Output: QQ plot in SPSS 39
40 Output: Boxplot in SPSS 40
41 Alternative way to get histogram in SPSS Open the SPSS data file ex5.1ar.sav Graphs Histogram... Move the variable return from the left panel to the right panel under Variable Choose Display normal curve, then click OK 41
42 Alternative way to get QQplot in SPSS Open the SPSS data file ex5.1ar.sav Analyze Descriptive Statistics QQ plot. Move the variable return from the left panel to the right panel under Variable Choose Normal in the Test distribution, then click OK 42
43 Alternative way to get Boxplot in SPSS Open the SPSS data file ex5.1ar.sav Graphs Legacy Dialogs Box plot... Choose Simple in the Summaries in separate variables, then click Define Move the variable return from the left panel to the right panel under Variable Click OK 43
44 Example 5.2 In addition to the percentage returns of the 76 average-risk funds, we have the following percentage returns achieved by 47 high-risk funds:
45 Preparing the dataset For a given fund, there are two variables: Return: Percentage return of the fund and Risk: 1 for average-risk fund and 2 for high-risk fund. Notice that the variable Risk is a classification (categorical) variable. 45
46 How to enter the data Two columns with the first column representing the return percentage while the second column identifies the type of funds There are 123 data in both columns. 46
47 Descriptive Statistics by groups: SAS proc format; value $risk 1 = Average Risk 2 = High Risk ; data ex5 2; infile F:\ST2137\lecdata\ex5 2.txt ; input return risk$; label return= Return Percentage ; format risk $risk.; 47
48 Descriptive Statistics by groups: SAS proc univariate data=ex5 2 plot; histogram return/midpoints=-30 to 90 by 15 normal; class risk; inset mean= Mean std= Standard Deviation /font= Arial pos=nw height=3; qqplot return; run; quit; 48
49 Output by groups: SAS 49
50 Output by groups: SAS 50
51 proc boxplot data=ex5 2; plot return*risk; run; Boxplot by groups: SAS m 51
52 Descriptive statistics by groups: R >ex5.2=read.table( G:/ST2137/lecdata/ex5 2.txt,header=T) >attach(ex5.2) >ex5.2ar=ex5.2[risk==1, Return ] >ex5.2hr=ex5.2[risk==2, Return ] >summary(ex5.2hr) Min 1st Qu Median Mean 3rd Qu. Max
53 >stem(ex5.2hr) Stem and leaf plot by groups: R 53
54 >boxplot(return risk) Boxplot by groups: R
55 Descriptive statistics by groups: SPSS Import ex5.2.txt into the SPSS and call it ex5.2.sav Analyze Descriptive Statistics Explore... Move Return from the left panel to the Dependent List panel Move Risk from left panel to the Factor List panel 55
56 Descriptive statistics by groups: SPSS Click Plots Choose the plots that you want Click Continue OK 56
57 Boxplot by groups:spss 57
58 Plot of bivariate data: SAS Refer to the data set htwt1 in Tutorial 1 data htwt1; infile F:\ST2137\tutorialdata\htwt1.txt ; input id gender$ height weight siblings; proc gplot data=htwt1; title Scatter Plot of WEIGHT by HEIGHT ; plot weight* height; symbol value=dot color=black; run; 58
59 Scatter plot of bivariate data: SAS 59
60 Plot of bivariate data: SAS proc gplot data=htwt1; title Using gender to generate the plotting symbols ; plot weight height=gender; symbol1 value=circle color=red; symbol2 value=square color=black; run; 60
61 Scatter plot of bivariate data for groups: SAS 61
62 Plot of bivariate data: SAS proc sort data=htwt1; by gender; proc gplot data=htwt1; title Separate Plots by GENDER ; by gender; plot weight*height; run; 62
63 Scatter plot of bivariate data for groups: SAS 63
64 Scatter plot of bivariate data for groups: SAS 64
65 Plot of bivariate data: R > htwt1=read.table( F:/ST2137/tutorialdata/htwt1.txt,header=TRUE) > names(htwt1)=c( id, gender, height, weight, siblings ) > attach(htwt1) > plot(height,weight,main= Plot of Weight by Height ) 65
66 Scatter plot of bivariate data for groups: R Plot of Weight by Height weight height 66
67 Plot of bivariate data: R >plot(height[gender== M ],weight[gender== M ], +main= Use Gender to geneerate the plotting symbol, +ylab= Weight,xlab= Height, +xlim=c(150,190), ylim=c(40,80)) >par(new=t) >plot(height[gender== F ],weight[gender== F ], +main=,ylab=,xlab=,xlim=c(150,190), ylim=c(40,80), +axes=f,pch=0,col=2) 67
68 Scatter plot of bivariate data for groups: R Use Gender to geneerate the plotting symbol Weight Height 68
69 Plot of bivariate data: R >par(mfrow=c(2,1)) >plot(height[gender== M ],weight[gender== M ], +main= Plot of Weight by Height for Male, +ylab= Weight,xlab= Height, +xlim=c(160,190), ylim=c(40,80)) >plot(height[gender== F ],weight[gender== F ], +main= Plot of Weight by Height for Female, +ylab= Weight,xlab= Height, +xlim=c(140,180), ylim=c(40,80)) 69
70 Output of separate plots by gender : R Plot of Weight by Height for Male Weight Height Plot of Weight by Height for Female Weight Height 70
71 Import the data set htwt1 Plot of bivariate data: SPSS Graph Legacy Dialogs Scatter/Dot... Choose Simple Scatter Define Move the variable height to the horizontal axis in the 2-D coordinate and weight to the vertical axis in the 2-D coordinate. 71
72 Scatter plot of bivariate data: SPSS 72
73 Plot of bivariate data for groups: SPSS Graph Legacy Dialogs Scatter/Dot... Move the variable height to the horizontal axis in the 2-D coordinate and weight to the vertical axis in the 2-D coordinate. Move the variable gender to the label cases by window. 73
74 Plot of bivariate data for groups: SPSS 74
75 Plot of bivariate data for groups: SPSS Click Options, choose Display chart with case labels 75
76 Output of plot by groups: SPSS 76
77 Separate plots by gender : SPSS Graph Legacy Dialogs Scatter/Dot... Move the variable height to the horizontal axis in the 2-D coordinate and weight to the vertical axis in the 2-D coordinate. Move the variable gender to the Panel Columns window. 77
78 Output of plot by gender :SPSS 78
7. Bivariate Graphing
1 7. Bivariate Graphing Video Link: https://www.youtube.com/watch?v=shzvkwwyguk&index=7&list=pl2fqhgedk7yyl1w9tgio8w pyftdumgc_j Section 7.1: Converting a Quantitative Explanatory Variable to Categorical
More informationHere are the various choices. All of them are found in the Analyze menu in SPSS, under the sub-menu for Descriptive Statistics :
Descriptive Statistics in SPSS When first looking at a dataset, it is wise to use descriptive statistics to get some idea of what your data look like. Here is a simple dataset, showing three different
More information2.4.1 STA-O Assessment 2
2.4.1 STA-O Assessment 2 Work all the problems and determine the correct answers. When you have completed the assessment, open the Assessment 2 activity and input your responses into the online grading
More informationPopulation. Sample. AP Statistics Notes for Chapter 1 Section 1.0 Making Sense of Data. Statistics: Data Analysis:
Section 1.0 Making Sense of Data Statistics: Data Analysis: Individuals objects described by a set of data Variable any characteristic of an individual Categorical Variable places an individual into one
More informationDaniel Boduszek University of Huddersfield
Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Correlation SPSS procedure for Pearson r Interpretation of SPSS output Presenting results Partial Correlation Correlation
More informationCHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships 3.1 Scatterplots and Correlation The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Reading Quiz 3.1 True/False 1.
More informationUnderstandable Statistics
Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement
More informationProbability and Statistics. Chapter 1
Probability and Statistics Chapter 1 Individuals and Variables Individuals and Variables Individuals are objects described by data. Individuals and Variables Individuals are objects described by data.
More informationANOVA in SPSS (Practical)
ANOVA in SPSS (Practical) Analysis of Variance practical In this practical we will investigate how we model the influence of a categorical predictor on a continuous response. Centre for Multilevel Modelling
More informationUsing SPSS for Correlation
Using SPSS for Correlation This tutorial will show you how to use SPSS version 12.0 to perform bivariate correlations. You will use SPSS to calculate Pearson's r. This tutorial assumes that you have: Downloaded
More informationChapter 1: Exploring Data
Chapter 1: Exploring Data Key Vocabulary:! individual! variable! frequency table! relative frequency table! distribution! pie chart! bar graph! two-way table! marginal distributions! conditional distributions!
More informationLab 7 (100 pts.): One-Way ANOVA Objectives: Analyze data via the One-Way ANOVA
STAT 350 (Spring 2015) Lab 7: SAS Solution 1 Lab 7 (100 pts.): One-Way ANOVA Objectives: Analyze data via the One-Way ANOVA A. (50 pts.) Do isoflavones increase bone mineral density? (ex12-45bmd.txt) Kudzu
More informationUnit 1 Exploring and Understanding Data
Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile
More informationStat Wk 8: Continuous Data
Stat 342 - Wk 8: Continuous Data proc iml Loading and saving to datasets proc means proc univariate proc sgplot proc corr Stat 342 Notes. Week 3, Page 1 / 71 PROC IML - Reading other datasets. If you want
More informationIntroduction to Statistical Data Analysis I
Introduction to Statistical Data Analysis I JULY 2011 Afsaneh Yazdani Preface What is Statistics? Preface What is Statistics? Science of: designing studies or experiments, collecting data Summarizing/modeling/analyzing
More informationOne-Way Independent ANOVA
One-Way Independent ANOVA Analysis of Variance (ANOVA) is a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment.
More informationExamining differences between two sets of scores
6 Examining differences between two sets of scores In this chapter you will learn about tests which tell us if there is a statistically significant difference between two sets of scores. In so doing you
More informationTable of Contents. Plots. Essential Statistics for Nursing Research 1/12/2017
Essential Statistics for Nursing Research Kristen Carlin, MPH Seattle Nursing Research Workshop January 30, 2017 Table of Contents Plots Descriptive statistics Sample size/power Correlations Hypothesis
More informationMA 151: Using Minitab to Visualize and Explore Data The Low Fat vs. Low Carb Debate
MA 151: Using Minitab to Visualize and Explore Data The Low Fat vs. Low Carb Debate September 5, 2018 1 Introduction to the Data We will be working with a synthetic data set meant to resemble the weight
More informationWDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?
WDHS Curriculum Map Probability and Statistics Time Interval/ Unit 1: Introduction to Statistics 1.1-1.3 2 weeks S-IC-1: Understand statistics as a process for making inferences about population parameters
More informationUndertaking statistical analysis of
Descriptive statistics: Simply telling a story Laura Delaney introduces the principles of descriptive statistical analysis and presents an overview of the various ways in which data can be presented by
More informationSTT315 Chapter 2: Methods for Describing Sets of Data - Part 2
Chapter 2.5 Interpreting Standard Deviation Chebyshev Theorem Empirical Rule Chebyshev Theorem says that for ANY shape of data distribution at least 3/4 of all data fall no farther from the mean than 2
More informationQ: How do I get the protein concentration in mg/ml from the standard curve if the X-axis is in units of µg.
Photometry Frequently Asked Questions Q: How do I get the protein concentration in mg/ml from the standard curve if the X-axis is in units of µg. Protein standard curves are traditionally presented as
More informationDepartment of Statistics TEXAS A&M UNIVERSITY STAT 211. Instructor: Keith Hatfield
Department of Statistics TEXAS A&M UNIVERSITY STAT 211 Instructor: Keith Hatfield 1 Topic 1: Data collection and summarization Populations and samples Frequency distributions Histograms Mean, median, variance
More informationAP Statistics. Semester One Review Part 1 Chapters 1-5
AP Statistics Semester One Review Part 1 Chapters 1-5 AP Statistics Topics Describing Data Producing Data Probability Statistical Inference Describing Data Ch 1: Describing Data: Graphically and Numerically
More informationStats 95. Statistical analysis without compelling presentation is annoying at best and catastrophic at worst. From raw numbers to meaningful pictures
Stats 95 Statistical analysis without compelling presentation is annoying at best and catastrophic at worst. From raw numbers to meaningful pictures Stats 95 Why Stats? 200 countries over 200 years http://www.youtube.com/watch?v=jbksrlysojo
More informationUnit 7 Comparisons and Relationships
Unit 7 Comparisons and Relationships Objectives: To understand the distinction between making a comparison and describing a relationship To select appropriate graphical displays for making comparisons
More informationIntro to SPSS. Using SPSS through WebFAS
Intro to SPSS Using SPSS through WebFAS http://www.yorku.ca/computing/students/labs/webfas/ Try it early (make sure it works from your computer) If you need help contact UIT Client Services Voice: 416-736-5800
More informationLesson 9 Presentation and Display of Quantitative Data
Lesson 9 Presentation and Display of Quantitative Data Learning Objectives All students will identify and present data using appropriate graphs, charts and tables. All students should be able to justify
More informationMedical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months?
Medical Statistics 1 Basic Concepts Farhad Pishgar Defining the data Population and samples Except when a full census is taken, we collect data on a sample from a much larger group called the population.
More informationWhat you should know before you collect data. BAE 815 (Fall 2017) Dr. Zifei Liu
What you should know before you collect data BAE 815 (Fall 2017) Dr. Zifei Liu Zifeiliu@ksu.edu Types and levels of study Descriptive statistics Inferential statistics How to choose a statistical test
More informationExplore. sexcntry Sex according to country. [DataSet1] D:\NORA\NORA Main File.sav
EXAMINE VARIABLES=nc228 BY sexcntry /PLOT BOXPLOT HISTOGRAM NPPLOT /COMPARE GROUPS /STATISTICS DESCRIPTIVES /CINTERVAL 95 /MISSING LISTWISE /NOTOTAL. Explore Notes Output Created Comments Input Missing
More informationOutline. Practice. Confounding Variables. Discuss. Observational Studies vs Experiments. Observational Studies vs Experiments
1 2 Outline Finish sampling slides from Tuesday. Study design what do you do with the subjects/units once you select them? (OI Sections 1.4-1.5) Observational studies vs. experiments Descriptive statistics
More informationHow to interpret scientific & statistical graphs
How to interpret scientific & statistical graphs Theresa A Scott, MS Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott 1 A brief introduction Graphics:
More informationBefore we get started:
Before we get started: http://arievaluation.org/projects-3/ AEA 2018 R-Commander 1 Antonio Olmos Kai Schramm Priyalathta Govindasamy Antonio.Olmos@du.edu AntonioOlmos@aumhc.org AEA 2018 R-Commander 2 Plan
More informationSTAT Factor Analysis in SAS
STAT 5600 Factor Analysis in SAS The data for this example come from the decathlon results in the 1988 Olympics. The decathlon is a two-day competition, with the 100 m race, long jump, shot put, high jump,
More informationSPSS Correlation/Regression
SPSS Correlation/Regression Experimental Psychology Lab Session Week 6 10/02/13 (or 10/03/13) Due at the Start of Lab: Lab 3 Rationale for Today s Lab Session This tutorial is designed to ensure that you
More informationList of Figures. List of Tables. Preface to the Second Edition. Preface to the First Edition
List of Figures List of Tables Preface to the Second Edition Preface to the First Edition xv xxv xxix xxxi 1 What Is R? 1 1.1 Introduction to R................................ 1 1.2 Downloading and Installing
More informationQuantitative Methods in Computing Education Research (A brief overview tips and techniques)
Quantitative Methods in Computing Education Research (A brief overview tips and techniques) Dr Judy Sheard Senior Lecturer Co-Director, Computing Education Research Group Monash University judy.sheard@monash.edu
More informationSPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.
SPRING GROVE AREA SCHOOL DISTRICT PLANNED COURSE OVERVIEW Course Title: Basic Introductory Statistics Grade Level(s): 11-12 Units of Credit: 1 Classification: Elective Length of Course: 30 cycles Periods
More informationEmpirical Rule ( rule) applies ONLY to Normal Distribution (modeled by so called bell curve)
Chapter 2.5 Interpreting Standard Deviation Chebyshev Theorem Empirical Rule Chebyshev Theorem says that for ANY shape of data distribution at least 3/4 of all data fall no farther from the mean than 2
More informationV. Gathering and Exploring Data
V. Gathering and Exploring Data With the language of probability in our vocabulary, we re now ready to talk about sampling and analyzing data. Data Analysis We can divide statistical methods into roughly
More informationNORTH SOUTH UNIVERSITY TUTORIAL 1
NORTH SOUTH UNIVERSITY TUTORIAL 1 REVIEW FROM BIOSTATISTICS I AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 DATA TYPES/ MEASUREMENT SCALES Categorical:
More information4.3 Measures of Variation
4.3 Measures of Variation! How much variation is there in the data?! Look for the spread of the distribution.! What do we mean by spread? 1 Example Data set:! Weight of contents of regular cola (grams).
More informationSection 6: Analysing Relationships Between Variables
6. 1 Analysing Relationships Between Variables Section 6: Analysing Relationships Between Variables Choosing a Technique The Crosstabs Procedure The Chi Square Test The Means Procedure The Correlations
More informationLab #7: Confidence Intervals-Hypothesis Testing (2)-T Test
A. Objectives: Lab #7: Confidence Intervals-Hypothesis Testing (2)-T Test 1. Subsetting based on variable 2. Explore Normality 3. Explore Hypothesis testing using T-Tests Confidence intervals and initial
More informationResearch Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process
Research Methods in Forest Sciences: Learning Diary Yoko Lu 285122 9 December 2016 1. Research process It is important to pursue and apply knowledge and understand the world under both natural and social
More informationChapter 1. Picturing Distributions with Graphs
Chapter 1 Picturing Distributions with Graphs Statistics Statistics is a science that involves the extraction of information from numerical data obtained during an experiment or from a sample. It involves
More informationMULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES
24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter
More informationBusiness Statistics Probability
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationStatistical Methods Exam I Review
Statistical Methods Exam I Review Professor: Dr. Kathleen Suchora SI Leader: Camila M. DISCLAIMER: I have created this review sheet to supplement your studies for your first exam. I am a student here at
More informationPRINTABLE VERSION. Quiz 1. True or False: The amount of rainfall in your state last month is an example of continuous data.
Question 1 PRINTABLE VERSION Quiz 1 True or False: The amount of rainfall in your state last month is an example of continuous data. a) True b) False Question 2 True or False: The standard deviation is
More informationDay 11: Measures of Association and ANOVA
Day 11: Measures of Association and ANOVA Daniel J. Mallinson School of Public Affairs Penn State Harrisburg mallinson@psu.edu PADM-HADM 503 Mallinson Day 11 November 2, 2017 1 / 45 Road map Measures of
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Please note the page numbers listed for the Lind book may vary by a page or two depending on which version of the textbook you have. Readings: Lind 1 11 (with emphasis on chapters 10, 11) Please note chapter
More informationTypes of Statistics. Censored data. Files for today (June 27) Lecture and Homework INTRODUCTION TO BIOSTATISTICS. Today s Outline
INTRODUCTION TO BIOSTATISTICS FOR GRADUATE AND MEDICAL STUDENTS Files for today (June 27) Lecture and Homework Descriptive Statistics and Graphically Visualizing Data Lecture #2 (1 file) PPT presentation
More informationSummarizing Data. (Ch 1.1, 1.3, , 2.4.3, 2.5)
1 Summarizing Data (Ch 1.1, 1.3, 1.10-1.13, 2.4.3, 2.5) Populations and Samples An investigation of some characteristic of a population of interest. Example: You want to study the average GPA of juniors
More informationCRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys
Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests
More informationbivariate analysis: The statistical analysis of the relationship between two variables.
bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for
More informationLOTS of NEW stuff right away 2. The book has calculator commands 3. About 90% of technology by week 5
1.1 1. LOTS of NEW stuff right away 2. The book has calculator commands 3. About 90% of technology by week 5 1 Three adventurers are in a hot air balloon. Soon, they find themselves lost in a canyon in
More informationisc ove ring i Statistics sing SPSS
isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements
More informationFrom Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1
From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Contents Dedication... iii Acknowledgments... xi About This Book... xiii About the Author... xvii Chapter 1: Introduction...
More informationCHAPTER ONE CORRELATION
CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to
More information1. To review research methods and the principles of experimental design that are typically used in an experiment.
Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab Exercise Lab #7 (there was no Lab #6) Treatment for Depression: A Randomized Controlled Clinical Trial Objectives: 1. To review
More informationTest 1C AP Statistics Name:
Test 1C AP Statistics Name: Part 1: Multiple Choice. Circle the letter corresponding to the best answer. 1. At the beginning of the school year, a high-school teacher asks every student in her classes
More informationTwo-Way Independent ANOVA
Two-Way Independent ANOVA Analysis of Variance (ANOVA) a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment. There
More informationSection I: Multiple Choice Select the best answer for each question.
Chapter 1 AP Statistics Practice Test (TPS- 4 p78) Section I: Multiple Choice Select the best answer for each question. 1. You record the age, marital status, and earned income of a sample of 1463 women.
More informationChapter 1: Explaining Behavior
Chapter 1: Explaining Behavior GOAL OF SCIENCE is to generate explanations for various puzzling natural phenomenon. - Generate general laws of behavior (psychology) RESEARCH: principle method for acquiring
More informationSurvey research (Lecture 1) Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.
Summary & Conclusion Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.0 Overview 1. Survey research 2. Survey design 3. Descriptives & graphing 4. Correlation
More informationSurvey research (Lecture 1)
Summary & Conclusion Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.0 Overview 1. Survey research 2. Survey design 3. Descriptives & graphing 4. Correlation
More informationPsychology of Perception Psychology 4165, Fall 2001 Laboratory 1 Weight Discrimination
Psychology 4165, Laboratory 1 Weight Discrimination Weight Discrimination Performance Probability of "Heavier" Response 1.0 0.8 0.6 0.4 0.2 0.0 50.0 100.0 150.0 200.0 250.0 Weight of Test Stimulus (grams)
More informationUNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016
UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test February 2016 STAB22H3 Statistics I, LEC 01 and LEC 02 Duration: 1 hour and 45 minutes Last Name: First Name:
More informationPsychology of Perception Psychology 4165, Spring 2003 Laboratory 1 Weight Discrimination
Psychology 4165, Laboratory 1 Weight Discrimination Weight Discrimination Performance Probability of "Heavier" Response 1.0 0.8 0.6 0.4 0.2 0.0 50.0 100.0 150.0 200.0 250.0 Weight of Test Stimulus (grams)
More informationMINUTE TO WIN IT: NAMING THE PRESIDENTS OF THE UNITED STATES
MINUTE TO WIN IT: NAMING THE PRESIDENTS OF THE UNITED STATES THE PRESIDENTS OF THE UNITED STATES Project: Focus on the Presidents of the United States Objective: See how many Presidents of the United States
More informationM 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 60
M 140 Test 1 A Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-10 10 11 3 12 4 13 3 14 10 15 14 16 10 17 7 18 4 19 4 Total 60 Multiple choice questions (1 point each) For questions
More informationChapter 9. Factorial ANOVA with Two Between-Group Factors 10/22/ Factorial ANOVA with Two Between-Group Factors
Chapter 9 Factorial ANOVA with Two Between-Group Factors 10/22/2001 1 Factorial ANOVA with Two Between-Group Factors Recall that in one-way ANOVA we study the relation between one criterion variable and
More informationQPM Lab 9: Contingency Tables and Bivariate Displays in R
QPM Lab 9: Contingency Tables and Bivariate Displays in R Department of Political Science Washington University, St. Louis November 3-4, 2016 QPM Lab 9: Contingency Tables and Bivariate Displays in R 1
More informationLecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics
Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 3: Overview of Descriptive Statistics October 3, 2005 Lecture Outline Purpose
More informationHour 2: lm (regression), plot (scatterplots), cooks.distance and resid (diagnostics) Stat 302, Winter 2016 SFU, Week 3, Hour 1, Page 1
Agenda for Week 3, Hr 1 (Tuesday, Jan 19) Hour 1: - Installing R and inputting data. - Different tools for R: Notepad++ and RStudio. - Basic commands:?,??, mean(), sd(), t.test(), lm(), plot() - t.test()
More informationStatistics. Nur Hidayanto PSP English Education Dept. SStatistics/Nur Hidayanto PSP/PBI
Statistics Nur Hidayanto PSP English Education Dept. RESEARCH STATISTICS WHAT S THE RELATIONSHIP? RESEARCH RESEARCH positivistic Prepositivistic Postpositivistic Data Initial Observation (research Question)
More informationSCATTER PLOTS AND TREND LINES
1 SCATTER PLOTS AND TREND LINES LEARNING MAP INFORMATION STANDARDS 8.SP.1 Construct and interpret scatter s for measurement to investigate patterns of between two quantities. Describe patterns such as
More information10/4/2007 MATH 171 Name: Dr. Lunsford Test Points Possible
Pledge: 10/4/2007 MATH 171 Name: Dr. Lunsford Test 1 100 Points Possible I. Short Answer and Multiple Choice. (36 points total) 1. Circle all of the items below that are measures of center of a distribution:
More information1) What is the independent variable? What is our Dependent Variable?
1) What is the independent variable? What is our Dependent Variable? Independent Variable: Whether the font color and word name are the same or different. (Congruency) Dependent Variable: The amount of
More informationThe Effectiveness of Captopril
Lab 7 The Effectiveness of Captopril In the United States, pharmaceutical manufacturers go through a very rigorous process in order to get their drugs approved for sale. This process is designed to determine
More informationSTP226 Brief Class Notes Instructor: Ela Jackiewicz
CHAPTER 2 Organizing Data Statistics=science of analyzing data. Information collected (data) is gathered in terms of variables (characteristics of a subject that can be assigned a numerical value or nonnumerical
More informationStill important ideas
Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement
More informationPreliminary Report on Simple Statistical Tests (t-tests and bivariate correlations)
Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) After receiving my comments on the preliminary reports of your datasets, the next step for the groups is to complete
More informationLesson 1: Distributions and Their Shapes
Lesson 1 Name Date Lesson 1: Distributions and Their Shapes 1. Sam said that a typical flight delay for the sixty BigAir flights was approximately one hour. Do you agree? Why or why not? 2. Sam said that
More informationDescriptive statistics
CHAPTER 3 Descriptive statistics 41 Descriptive statistics 3 CHAPTER OVERVIEW In Chapter 1 we outlined some important factors in research design. In this chapter we will be explaining the basic ways of
More informationReadings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F
Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions
More informationHow to Conduct On-Farm Trials. Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona
How to Conduct On-Farm Trials Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona How can you determine whether a treatment (this might be an additive, a fertilizer, snake
More informationSummary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution 4.0
Summary & Conclusion Lecture 10 Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution 4.0 Overview 1. Survey research and design 1. Survey research 2. Survey design 2. Univariate
More informationINTERPRET SCATTERPLOTS
Chapter2 MODELING A BUSINESS 2.1: Interpret Scatterplots 2.2: Linear Regression 2.3: Supply and Demand 2.4: Fixed and Variable Expenses 2.5: Graphs of Expense and Revenue Functions 2.6: Breakeven Analysis
More informationLAB ASSIGNMENT 4 INFERENCES FOR NUMERICAL DATA. Comparison of Cancer Survival*
LAB ASSIGNMENT 4 1 INFERENCES FOR NUMERICAL DATA In this lab assignment, you will analyze the data from a study to compare survival times of patients of both genders with different primary cancers. First,
More informationChapter 3: Describing Relationships
Chapter 3: Describing Relationships Objectives: Students will: Construct and interpret a scatterplot for a set of bivariate data. Compute and interpret the correlation, r, between two variables. Demonstrate
More informationNEUROBLASTOMA DATA -- TWO GROUPS -- QUANTITATIVE MEASURES 38 15:37 Saturday, January 25, 2003
NEUROBLASTOMA DATA -- TWO GROUPS -- QUANTITATIVE MEASURES 38 15:37 Saturday, January 25, 2003 Obs GROUP I DOPA LNDOPA 1 neurblst 1 48.000 1.68124 2 neurblst 1 133.000 2.12385 3 neurblst 1 34.000 1.53148
More informationImpact of Group Rejections on Self Esteem
Impact of Group Rejections on Self Esteem In 2005, researchers from the School of Physical Education at the University of Otago carried out a study to assess the impact of rejection on physical self esteem.
More informationStill important ideas
Readings: OpenStax - Chapters 1 11 + 13 & Appendix D & E (online) Plous - Chapters 2, 3, and 4 Chapter 2: Cognitive Dissonance, Chapter 3: Memory and Hindsight Bias, Chapter 4: Context Dependence Still
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationMULTIPLE OLS REGRESSION RESEARCH QUESTION ONE:
1 MULTIPLE OLS REGRESSION RESEARCH QUESTION ONE: Predicting State Rates of Robbery per 100K We know that robbery rates vary significantly from state-to-state in the United States. In any given state, we
More informationKnowledge discovery tools 381
Knowledge discovery tools 381 hours, and prime time is prime time precisely because more people tend to watch television at that time.. Compare histograms from di erent periods of time. Changes in histogram
More information