Analysing and Comparing the Final Grade in Mathematics by Linear Regression Using Excel and SPSS
|
|
- Liliana Cannon
- 5 years ago
- Views:
Transcription
1 Analysing and Comparing the Final Grade in Mathematics by Linear Regression Using Excel and SPSS Elena Karamazova #, Teuta Jusufi Zenku *, Zoran Trifunov #3 Faculty of Computer Sciences Department of Mathematics and Statistics University "Goce Delcev" Stip Republic of Macedonia Faculty of Technical Sciences University "Mother Teresa" Skopje 000 Skopje, Republic of Macedonia 3 Faculty of Technical Sciences University "Mother Teresa" Skopje 000 Skopje, Republic of Macedonia Abstract: In this paper simple and multiple linear regression model is developed to analyze and compare the final grades of students from two Universities, University "Goce Delcev" - Stip, more specifically a group of students who studied in Kavadarci and students from "Mother Teresa" University in Skopje, for the subject of Mathematics. The models were based on the data of students' scores in three tests, st periodical exam, nd periodical exam and final examination. Statistical significance of the relationship between the variables has been provided. For obtaining our results some solvers like Excel and SPSS were used. Keywords: simple linear regression, multiple linear regression, st periodical exam, nd periodical exam, final examination. I. INTRODUCTION Simple linear regression is a statistical method that allows us to summarize and study relationships between two continuous variables: one variable, denoted x, is regarded as the predictor, explanatory, or independent variable. The other variable, denoted y, is regarded as the response, outcome, or dependent variable. Multiple linear regression can be used to analyze data from causal-comparative, correlational, or experimental research. In addition, it provides estimates both of the magnitude and statistical significance of relationships between variables. Multiple linear regression is one of the most widely used statistical techniques in educational research. Multiple linear regression is defined as a multivariate technique for determining the correlation between a response variable y and some combination of two or more predictor variables, X. In [], such a model is developed to predict anomalies of westward-moving intraseasonal precipitable water by utilizing the first through fourth powers of a time series of outgoing longwave radiation that is filtered for eastward propagation and for the temporal and spatial scales of the tropical intraseasonal oscillations. An independent and simpler compositing method is applied to show that the results of this multiple linear regression model provide better description of the actual relationships between eastward and westward moving intraseasonal modes than a regression model that includes only the linear predictor. In [] the application of regression models in macroeconomic analyses is emphasized. The particular situation approached is the influence of final consumption and gross investments on the evolution of Romania s Gross Domestic Product. II. SIMPLE AND MULTIPLE LINEAR REGRESSION MODEL A simple linear regression is carried out to estimate the relationship between a dependent variable, y, and a single explanatory variable, x, given a set of data that includes observations for both of these variables for a particular population. form is: y x () 0 where β s denote the population regression coefficients, and ε is a random error. A multiple linear regression analysis is carried out to predict the values of a dependent variable, y, given a set of k explanatory variables (x, x,.,x k ). form is: y x x x () 0 kk where β s denote the population regression coefficients, and ε is a random error. Excel and SPSS regression computer programs were used to determine the regression coefficients and analyse the data. ISSN: Page 334
2 III. PROBLEM AND OBJECT OF STUDY The purpose of this study was to contribute to knowledge relating to the use of simple and multiple linear regression in educational research by establishing an appropriate linear regression model to analyze the relationship between variables as a final grade in the student exam in Mathematics (considered as a dependent or variable Y) depending on the st periodical exam and nd periodical exam (considered as independent or predictive X variables). The data was collected by a sample of 40 students, 0 of University Goce Delcev - Stip (UGD) and 0 from students of Mother Teresa University - Skopje (MTU). In order to determine the regression coefficients and to analyze the data, math applicative software was used. IV. EXCEL ANALYSIS OF DATA FROM MTU The equation of the multiple regression model is: Y X X. for the multiple regression model is given in figure. The equation of the simple regression model for the X variable is: Y X. for the simple regression model for the X variable is given in figure. The equation of the simple regression model for X variable is: Y X. for the simple regression model for the X variable is given in figure3. V. SPSS ANALYSIS OF DATA FROM MTU In figure 4, 5 and 6 are given model summary for the simple regression model for the X variable, for simple regression model for the X variable and for multiple regression model respectively with SPSS for MTU. VI. INTERPRETING THE RESULTS FROM MTU From the analysis made using two application softwares we see the same results in the tables obtained. Some individual parameters are interpreted as follows: represents the extent of the relation between the dependent variables and two or more independent variables. The closer to one is, the greater is the connectivity. In our results we see that R> 0.93 in all cases, which means that the correlation between the Y variable and The multiple correlation coefficient R the X and X variables is relatively strong. The coefficient of determination R represents the variance of the variables interpreted by the model. The coefficient of determination serves as a measure of representation of the model. Value R 0. 87, therefore it can be said that found linear regression models are representative. Where, as we can see from tables the most representative is the multiple linear regression model. The statistical significance of the regression model is determined based on the value of the empirical ratio F, i.e. of the corresponding value p (so-called group test of the importance of the regression model). Where for our case of multiple model is , i.e. with the level of risk p <0.0, we claim that at least one of the regression variables has a statistically significant impact on the final grade in Mathematics, respectively, the multiple linear regression model is statistically significant. In the previous case, the statistical significance of the overall regression model was assessed. The data found in the regression output tables provide information on the statistical significance of the respective regression coefficients (so-called relevant regression model significance test). The statistical significance of the regression coefficients is determined on the basis of t-test i.e., of the corresponding values P. For our cases this value is less than 0.05 so we conclude that in the linear regression models both two variables have an impact of statistical significance on the "final grade in Mathematics" variable. ISSN: Page 335
3 SUMMARY OUTPUT Multiple R 0,9777 R 0,9559 0,9507 Error 3,7763 Observations 0,0000 Df SS MS F Significance F Regression 556,37 68,9 84,30 0, Residual 7 4,43 4,6 Total ,80 Error t Stat P-value Intercept,837,35,0696 0,998 -,0 6,7884 -,0 6,7884 X Variable 3,300 0,5650 5,770 0,0000,0380 4,40,0380 4,40 X Variable -0,4359 0,6469-0,6738 0,5095 -,8007 0,990 -,8007 0,990 Fig. Regression Statistic for the multiple regression model SUMMARY OUTPUT Multiple R 0,977 R 0,9547 0,95 Error 3,786 Observations 0,0000 Df SS MS F Significance F Regression 549, , ,6608 0, Residual 8 48,906 3,879 Total ,8000 Error t Stat P-value Intercept,538,7980 0,8554 0,4036 -,394 5,355 -,394 5,355 X Variable,869 0,469 9,4849 0,0000,554 3,75,554 3,75 Fig. Regression Statistic for the simple regression model for the X variable ISSN: Page 336
4 SUMMARY OUTPUT Multiple R 0,9334 R 0,87 0,8640 Error 6,740 Observations 0,0000 df SS MS F Significance F Regression 4790, ,733,696 0, Residual 8 708,567 39,366 Total ,8000 Error t Stat P-value Intercept -,3747 3,384-0,406 0,6894-8,4847 5,7353-8,4847 5,7353 X Variable 3,33 0,838,036 0,0000,5350 3,777,5350 3,777 Fig.3 Regression Statistic for the simple regression model for the X variable Summary R R.977 a,955,95 3,7858 a. Predictors: (Constant), x a Sum of s Df Mean F Sig. Regression 549, , , b Residual 48,90 8 3,88 Total 5498,800 9 b. Predictors: (Constant), x a ized Unstandardized (Constant),538,798,855,404 x,863,47,977 9,485,000 Fig. 4 Summary with SPSS for the simple regression model for the X variable for MTU ISSN: Page 337
5 Summary R R.933 a,87,864 6,7396 a. Predictors: (Constant), x a Sum of s df Mean F Sig. Regression 4790, ,73, b Residual 708, ,363 Total 5498,800 9 b. Predictors: (Constant), x a ized Unstandardized (Constant) -,375 3,384 -,406,689 x 3,3,84,933,03,000 Fig. 5 Summary with SPSS for the simple regression model for the X variable for MTU R R.978 a,956,95 3,77630 a. Predictors: (Constant), x, x a Sum of s df Mean F Sig. Regression 556,37 68,86 84, b Residual 4,48 7 4,60 Total 5498,800 9 b. Predictors: (Constant), x, x ized Unstandardized (Constant),84,35,070,300 x 3,30,565,0 5,77,000 x -,436,647 -,30 -,674,50 Fig. 6 Summary for the multiple regression model with SPSS for MTU VII. EXCEL ANALYSIS OF DATA FROM UGD The equation of the multiple regression model is Y X.43X. for the multiple regression model is given in figure 7. ISSN: Page 338
6 The equation of the simple regression model for the X variable is Y X. for the simple regression model for the X variable is given in figure 8. The equation of the simple regression model for the X variable is Y X. for the simple regression model for the X variable is given in figure 9. VIII. SPSS ANALYSIS OF DATA FROM UGD In figure 0, and are given model summary for the simple regression model for the X variable, for simple regression model for the X variable and for the multiple regression model respectively with SPSS for U. IX. INTERPRETING THE RESULTS FROM UGD From the analysis made using two application softwares we see the same results in the tables obtained. Individual parameters are interpreted as follows: The multiple correlation coefficient R represents the extent of the relation between the dependent variables and two or more independent variables. The closer to one is, the greater is the connectivity. In our results we see that R> 0.94 in all cases, which means that the correlation between the Y variable and the X and X variables is relatively strong. The coefficient of determination R represents the variance of the variables interpreted by the model. The coefficient of determination serves as a measure of representation of the model. Value R 0. 87, therefore it can be said that found linear regression models are representative. Where, as we can see from tables the most representative is the multiple linear regression model. The statistical significance of the regression model is determined based on the value of the empirical ratio F, i.e. of the corresponding value p (so-called group test of the importance of the regression model). Where for our case of multiple model is , i.e. with the level of risk p <0.0, we claim that at least one of the regression variables has a statistically significant impact on the final grade in Mathematics, respectively, the multiple linear regression model is statistically significant. In the previous case, the statistical significance of the overall regression model was assessed. The data found in the regression output tables gives us information on the statistical significance of the respective regression coefficients (so-called relevant regression model significance test). The statistical significance of the regression coefficients is determined on the basis of t-test i.e., of the corresponding values P. For our cases this value is less than 0.05 so we conclude that in the linear regression models both two variables have an impact of statistical significance on the "final grade in Mathematics" variable. X. CONCLUSIONS From the analysis above, our multiple regression model as well as simple linear regression models for predicting and analyzing the final grade in Mathematics, Y, from both Universities are useful and adequate. For further work, we can consider developing and studying similar models in the field of education, social sciences, economics, etc., adding different variables. ISSN: Page 339
7 SUMMARY OUTPUT Multiple R 0,976 R 0,9530 0,9475 Error 3,765 Observations 0,0000 Df SS MS F Significance F Regression 489, ,775 7,583 0, Residual 7 4,0069 4,769 Total 9 53,5500 Error t Stat P-value -,5343 7,4555 Intercept,9606,305,3897 0,86 -,5343 7,4555 X Variable,7538 0,3360 5,95 0,000,0449,467,0449,467 X Variable,49 0,409 3,0375 0,0074 0,3796,063 0,3796,063 Fig. 7 for the multiple regression model SUMMARY OUTPUT Multiple R 0,963 R 0,976 0,935 Error 4,5449 Observations 0,0000 Df SS MS F Significance F Regression 4760, , ,4759 0, Residual 8 37,803 0,656 Total 9 53,5500 Error t Stat P-value Intercept 6,5465,407 3,058 0,0068,049,0440,049,0440 X Variable,673 0,76 5,84 0,0000,3033 3,043,3033 3,043 Fig. 8 for the simple regression model for the X variable ISSN: Page 340
8 SUMMARY OUTPUT Multiple R 0,9369 R 0,8778 0,870 Error 5,9030 Observations 0,0000 Df SS MS F Significance F Regression 4505, ,33 9,94 0, Residual 8 67,77 34,8460 Total 9 53,5500 Error t Stat P-value Intercept 0,9468 3,849 0,88 0,7765-5,9545 7,848-5,9545 7,848 X Variable 3,670 0,785,3707 0,0000,588 3,75,588 3,75 Fig. 9 for the simple regression model for the X variable Summary R R.963 a,98,94 4,54490 a. Predictors: (Constant), x a Sum of s df Mean F Sig. Regression 4760, ,740 30, b Residual 37,80 8 0,656 Total 53,550 9 b. Predictors: (Constant), x a Unstandardized ized (Constant) 6,547,4 3,058,007 x,673,76,963 5,8,000 Fig. 0 Summary with SPSS for the simple regression model for the X variable ISSN: Page 34
9 Summary R R.937 a,878,87 5,90305 a. Predictors: (Constant), x a Sum of s Df Mean F Sig. Regression 4505,3 4505,3 9,9.000 b Residual 67,8 8 34,846 Total 53,550 9 b. Predictors: (Constant), x a ized Unstandardized (Constant),947 3,85,88,776 x 3,67,79,937,37,000 Fig. Summary with SPSS for the simple regression model for the X variable for UGD Summary R R.976 a,953,948 3,765 a. Predictors: (Constant), x, x a Sum of s df Mean F Sig. Regression 489, ,77 7, b Residual 4, ,77 Total 53,550 9 b. Predictors: (Constant), x, x a Unstandardized ized (Constant),96,30,390,83 x,754,336,63 5,9,000 x,43,409,368 3,038,007 Fig. Summary with SPSS for for the multiple regression model for UGD ISSN: Page 34
10 REFERENCES [] C. Anghelache, M.G. Anghel, L. Prodan, C. Sacală and M. Popovici, Multiple Linear Regression Used in Economic Analyses, Romanian Statistical Review Supplement, Vol. 6. Issue 0, pp. 0-7, (October), 04 [] C. Anghelache, A. Manole and M. G. Anghel, Analysis of final consumption and gross investment influence on GDP multiple linear regression model, Theoretical and Applied Economics, Volume XXII No. 3(604), pp. 37-4, 05 [3] A.C Cameron, and P.K Trivedi, Regression Analysis of Cout Data, Cambridge, 998. [4] F. Hoxha, Metoda tё analizёs numerike, Tiranё, 008. [5] D. Jaksimovic, Zbirka Zadataka iz poslovne statistike, Beograd, 004. [6] Sh. Leka, Teoria e probabiliteteve dhe statistika matematike, Tiranë, 004. [7] J. T. McClove, P. George Benson, Terry Sincich, Statistics for Business and Economics, USA, 998. [8] M. Papiq, Statistika e aplikuar në MS EXEL, Prishtinë, 007. [9] L. Puka, Probabliliteti dhe Statistika e zbatuar, Tiranё, 00. [0] J. M. Rojo, Regresion lineal multiple, Madrid, 007. [] P. E. Roundy and W. M. Frank, Applications of a Multiple Linear Regression to the Analysis of Relationships between Eastward and Westward Moving Intraseasonal Modes, Journal of the atmospheric sciences, pp , 004 [] M. Shakil, A multiple Linear regression to Predict the Student s Final Grade in a Mathematics Class, [3] A. F. Siegel, Pratical Statistics, USA, 996. [4] D. S. Wilks, Statistical Methods in the Atmospheric Sciences: An Introduction, International Geophysical Series, Vol. 59, Academic Press, pp. 467, 995 Appendix Fig. 3 Appendix with Data for MTU ISSN: Page 343
11 Fig. 4 Appendix with Data for UGD ISSN: Page 344
Simple Linear Regression
Simple Linear Regression Assoc. Prof Dr Sarimah Abdullah Unit of Biostatistics & Research Methodology School of Medical Sciences, Health Campus Universiti Sains Malaysia Regression Regression analysis
More informationStudy Guide #2: MULTIPLE REGRESSION in education
Study Guide #2: MULTIPLE REGRESSION in education What is Multiple Regression? When using Multiple Regression in education, researchers use the term independent variables to identify those variables that
More informationResearch paper. Split-plot ANOVA. Split-plot design. Split-plot design. SPSS output: between effects. SPSS output: within effects
Research paper Effects of alcohol and caffeine on driving ability Split-plot ANOVA conditions: No alcohol; no caffeine alcohol; no caffeine No alcohol; caffeine Alcohol; caffeine Driving in simulator Error
More informationCHAPTER TWO REGRESSION
CHAPTER TWO REGRESSION 2.0 Introduction The second chapter, Regression analysis is an extension of correlation. The aim of the discussion of exercises is to enhance students capability to assess the effect
More informationClass 7 Everything is Related
Class 7 Everything is Related Correlational Designs l 1 Topics Types of Correlational Designs Understanding Correlation Reporting Correlational Statistics Quantitative Designs l 2 Types of Correlational
More informationSUMMER 2011 RE-EXAM PSYF11STAT - STATISTIK
SUMMER 011 RE-EXAM PSYF11STAT - STATISTIK Full Name: Årskortnummer: Date: This exam is made up of three parts: Part 1 includes 30 multiple choice questions; Part includes 10 matching questions; and Part
More informationCRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys
Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests
More informationData Analysis with SPSS
Data Analysis with SPSS A First Course in Applied Statistics Fourth Edition Stephen Sweet Ithaca College Karen Grace-Martin The Analysis Factor Allyn & Bacon Boston Columbus Indianapolis New York San Francisco
More informationSimple Linear Regression One Categorical Independent Variable with Several Categories
Simple Linear Regression One Categorical Independent Variable with Several Categories Does ethnicity influence total GCSE score? We ve learned that variables with just two categories are called binary
More informationREGRESSION MODELLING IN PREDICTING MILK PRODUCTION DEPENDING ON DAIRY BOVINE LIVESTOCK
REGRESSION MODELLING IN PREDICTING MILK PRODUCTION DEPENDING ON DAIRY BOVINE LIVESTOCK Agatha POPESCU University of Agricultural Sciences and Veterinary Medicine Bucharest, 59 Marasti, District 1, 11464,
More informationMultiple Regression Analysis
Multiple Regression Analysis Basic Concept: Extend the simple regression model to include additional explanatory variables: Y = β 0 + β1x1 + β2x2 +... + βp-1xp + ε p = (number of independent variables
More informationF1: Introduction to Econometrics
F1: Introduction to Econometrics Feng Li Department of Statistics, Stockholm University General information Homepage of this course: http://gauss.stat.su.se/gu/ekonometri.shtml Lecturer F1 F7: Feng Li,
More informationProspect Theory and Gain-Framing: The Relationship Between Cost Perception and Satisfaction
Advances in Behavioral Finance & Economics: The Journal of the Academy of Behavioral Finance Volume 1, Issue 1, Winter 2011 87-99 Copyright 2011 Academy of Behavioral Finance, Inc. All rights reserved.
More informationMidterm STAT-UB.0003 Regression and Forecasting Models. I will not lie, cheat or steal to gain an academic advantage, or tolerate those who do.
Midterm STAT-UB.0003 Regression and Forecasting Models The exam is closed book and notes, with the following exception: you are allowed to bring one letter-sized page of notes into the exam (front and
More information1. You want to find out what factors predict achievement in English. Develop a model that
Questions and answers for Chapter 10 1. You want to find out what factors predict achievement in English. Develop a model that you think can explain this. As usual many alternative predictors are possible
More informationChapter 11 Multiple Regression
Chapter 11 Multiple Regression PSY 295 Oswald Outline The problem An example Compensatory and Noncompensatory Models More examples Multiple correlation Chapter 11 Multiple Regression 2 Cont. Outline--cont.
More informationA COMPARISON BETWEEN MULTIVARIATE AND BIVARIATE ANALYSIS USED IN MARKETING RESEARCH
Bulletin of the Transilvania University of Braşov Vol. 5 (54) No. 1-2012 Series V: Economic Sciences A COMPARISON BETWEEN MULTIVARIATE AND BIVARIATE ANALYSIS USED IN MARKETING RESEARCH Cristinel CONSTANTIN
More informationAnswer all three questions. All questions carry equal marks.
UNIVERSITY OF DUBLIN TRINITY COLLEGE Faculty of Engineering, Mathematics and Science School of Computer Science and Statistics Postgraduate Diploma in Statistics Trinity Term 2 Introduction to Regression
More informationCorrelation and Regression
Dublin Institute of Technology ARROW@DIT Books/Book Chapters School of Management 2012-10 Correlation and Regression Donal O'Brien Dublin Institute of Technology, donal.obrien@dit.ie Pamela Sharkey Scott
More informationFertility Rate & Infant Mortality Rate
Sacred Heart University DigitalCommons@SHU Academic Festival Apr 20th, 1:00 PM - 3:00 PM Fertility Rate & Infant Mortality Rate Elena Burke Follow this and additional works at: http://digitalcommons.sacredheart.edu/acadfest
More informationSimple Linear Regression the model, estimation and testing
Simple Linear Regression the model, estimation and testing Lecture No. 05 Example 1 A production manager has compared the dexterity test scores of five assembly-line employees with their hourly productivity.
More informationRegression Including the Interaction Between Quantitative Variables
Regression Including the Interaction Between Quantitative Variables The purpose of the study was to examine the inter-relationships among social skills, the complexity of the social situation, and performance
More informationNORTH SOUTH UNIVERSITY TUTORIAL 2
NORTH SOUTH UNIVERSITY TUTORIAL 2 AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 Correlation Analysis INTRODUCTION In correlation analysis, we estimate
More informationStatistical Methods and Reasoning for the Clinical Sciences
Statistical Methods and Reasoning for the Clinical Sciences Evidence-Based Practice Eiki B. Satake, PhD Contents Preface Introduction to Evidence-Based Statistics: Philosophical Foundation and Preliminaries
More informationRegression. Page 1. Variables Entered/Removed b Variables. Variables Removed. Enter. Method. Psycho_Dum
Regression Model Variables Entered/Removed b Variables Entered Variables Removed Method Meds_Dum,. Enter Psycho_Dum a. All requested variables entered. b. Dependent Variable: Beck's Depression Score Model
More information11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES
Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are
More informationBusiness Statistics Probability
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationSPSS output for 420 midterm study
Ψ Psy Midterm Part In lab (5 points total) Your professor decides that he wants to find out how much impact amount of study time has on the first midterm. He randomly assigns students to study for hours,
More informationTwo Factor Analysis of Variance
BIOL 310 Two Factor Analysis of Variance In the previous discussions of analysis of variance (ANOVA), only one factor was involved. For example, in Chapter 7 the variable of interest in the sample problem
More informationisc ove ring i Statistics sing SPSS
isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements
More informationGROSS FIXED CAPITAL FORMATION SUFFICES AS A DETERMINANT FOR GROSS DOMESTIC PRODUCT IN INDIA
GROSS FIXED CAPITAL FORMATION SUFFICES AS A DETERMINANT FOR GROSS DOMESTIC PRODUCT IN INDIA 1 PALLABI MUKHERJEE, 2 KANHAIYAAHUJA 1 Assistant Prof., IBMR, IPS Academy, Indore, India 2 Dean School of Social
More informationAppendix. I. Map of former Car Doctors site. Soil for the greenhouse study was collected in the area indicated by the arrow.
Appendix I. Map of former Car Doctors site. Soil for the greenhouse study was collected in the area indicated by the arrow. 73 II. Photo of former Car Doctors site. Site, facing Burlington Drive, before
More informationAddendum: Multiple Regression Analysis (DRAFT 8/2/07)
Addendum: Multiple Regression Analysis (DRAFT 8/2/07) When conducting a rapid ethnographic assessment, program staff may: Want to assess the relative degree to which a number of possible predictive variables
More informationSPSS output for 420 midterm study
Ψ Psy Midterm Part In lab (5 points total) Your professor decides that he wants to find out how much impact amount of study time has on the first midterm. He randomly assigns students to study for hours,
More informationTHE UNIVERSITY OF SUSSEX. BSc Second Year Examination DISCOVERING STATISTICS SAMPLE PAPER INSTRUCTIONS
C8552 THE UNIVERSITY OF SUSSEX BSc Second Year Examination DISCOVERING STATISTICS SAMPLE PAPER INSTRUCTIONS Do not, under any circumstances, remove the question paper, used or unused, from the examination
More informationWELCOME! Lecture 11 Thommy Perlinger
Quantitative Methods II WELCOME! Lecture 11 Thommy Perlinger Regression based on violated assumptions If any of the assumptions are violated, potential inaccuracies may be present in the estimated regression
More informationDr. Kelly Bradley Final Exam Summer {2 points} Name
{2 points} Name You MUST work alone no tutors; no help from classmates. Email me or see me with questions. You will receive a score of 0 if this rule is violated. This exam is being scored out of 00 points.
More informationApplications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis
DSC 4/5 Multivariate Statistical Methods Applications DSC 4/5 Multivariate Statistical Methods Discriminant Analysis Identify the group to which an object or case (e.g. person, firm, product) belongs:
More informationA response variable is a variable that. An explanatory variable is a variable that.
Name:!!!! Date: Scatterplots The most common way to display the relation between two quantitative variable is a scatterplot. Statistical studies often try to show through scatterplots, that changing one
More information3 CONCEPTUAL FOUNDATIONS OF STATISTICS
3 CONCEPTUAL FOUNDATIONS OF STATISTICS In this chapter, we examine the conceptual foundations of statistics. The goal is to give you an appreciation and conceptual understanding of some basic statistical
More informationLecture 12: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression
Lecture 12: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression Equation of Regression Line; Residuals Effect of Explanatory/Response Roles Unusual Observations Sample
More informationSubstantive Significance of Multivariate Regression Coefficients
Substantive Significance of Multivariate Regression Coefficients Jane E. Miller Institute for Health, Health Care Policy and Aging Research Rutgers University Aims of this talk Provide concrete approaches
More informationDescribe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo
Business Statistics The following was provided by Dr. Suzanne Delaney, and is a comprehensive review of Business Statistics. The workshop instructor will provide relevant examples during the Skills Assessment
More informationCHAPTER 4: FINDINGS 4.1 Introduction This chapter includes five major sections. The first section reports descriptive statistics and discusses the
CHAPTER 4: FINDINGS 4.1 Introduction This chapter includes five major sections. The first section reports descriptive statistics and discusses the respondent s representativeness of the overall Earthwatch
More informationPerformance of Median and Least Squares Regression for Slightly Skewed Data
World Academy of Science, Engineering and Technology 9 Performance of Median and Least Squares Regression for Slightly Skewed Data Carolina Bancayrin - Baguio Abstract This paper presents the concept of
More informationNormal Q Q. Residuals vs Fitted. Standardized residuals. Theoretical Quantiles. Fitted values. Scale Location 26. Residuals vs Leverage
Residuals 400 0 400 800 Residuals vs Fitted 26 42 29 Standardized residuals 2 0 1 2 3 Normal Q Q 26 42 29 360 400 440 2 1 0 1 2 Fitted values Theoretical Quantiles Standardized residuals 0.0 0.5 1.0 1.5
More informationOverview of Lecture. Survey Methods & Design in Psychology. Correlational statistics vs tests of differences between groups
Survey Methods & Design in Psychology Lecture 10 ANOVA (2007) Lecturer: James Neill Overview of Lecture Testing mean differences ANOVA models Interactions Follow-up tests Effect sizes Parametric Tests
More informationCopyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and
Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and private study only. The thesis may not be reproduced elsewhere
More informationWhat Causes Stress in Malaysian Students and it Effect on Academic Performance: A case Revisited
Advanced Journal of Technical and Vocational Education 1 (1): 155-160, 2017 eissn: 2550-2174 RMP Publications, 2017 What Causes Stress in Malaysian Students and it Effect on Academic Performance: A case
More informationSTAT 135 Introduction to Statistics via Modeling: Midterm II Thursday November 16th, Name:
STAT 135 Introduction to Statistics via Modeling: Midterm II Thursday November 16th, 2017 Name: 1 1 Short Answer a) For each of these five regression scenarios, name an appropriate visualization (along
More informationStill important ideas
Readings: OpenStax - Chapters 1 13 & Appendix D & E (online) Plous Chapters 17 & 18 - Chapter 17: Social Influences - Chapter 18: Group Judgments and Decisions Still important ideas Contrast the measurement
More information3.2A Least-Squares Regression
3.2A Least-Squares Regression Linear (straight-line) relationships between two quantitative variables are pretty common and easy to understand. Our instinct when looking at a scatterplot of data is to
More informationDaniel Boduszek University of Huddersfield
Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multiple Regression (MR) Types of MR Assumptions of MR SPSS procedure of MR Example based on prison data Interpretation of
More informationRegional convergence in Romania: from theory to empirics
Available online at www.sciencedirect.com ScienceDirect Procedia Economics and Finance 32 ( 2015 ) 160 165 Emerging Markets Queries in Finance and Business Regional convergence in Romania: from theory
More informationExample of Interpreting and Applying a Multiple Regression Model
Example of Interpreting and Applying a Multiple Regression We'll use the same data set as for the bivariate correlation example -- the criterion is 1 st year graduate grade point average and the predictors
More informationSPSS Portfolio. Brittany Murray BUSA MWF 1:00pm-1:50pm
SPSS Portfolio Brittany Murray BUSA 2182 MWF 1:00pm-1:50pm Table Of Contents I) SPSS Computer Lab Assignment # 1 Frequency Distribution a) Cover Page b) Explanatory Paragraph c) Appendix II) SPSS Computer
More informationOrdinary Least Squares Regression
Ordinary Least Squares Regression March 2013 Nancy Burns (nburns@isr.umich.edu) - University of Michigan From description to cause Group Sample Size Mean Health Status Standard Error Hospital 7,774 3.21.014
More informationChapter 3: Examining Relationships
Name Date Per Key Vocabulary: response variable explanatory variable independent variable dependent variable scatterplot positive association negative association linear correlation r-value regression
More informationDoing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto
Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling Olli-Pekka Kauppila Daria Kautto Session VI, September 20 2017 Learning objectives 1. Get familiar with the basic idea
More informationANOVA in SPSS (Practical)
ANOVA in SPSS (Practical) Analysis of Variance practical In this practical we will investigate how we model the influence of a categorical predictor on a continuous response. Centre for Multilevel Modelling
More informationChapter 14: More Powerful Statistical Methods
Chapter 14: More Powerful Statistical Methods Most questions will be on correlation and regression analysis, but I would like you to know just basically what cluster analysis, factor analysis, and conjoint
More informationTEACHING REGRESSION WITH SIMULATION. John H. Walker. Statistics Department California Polytechnic State University San Luis Obispo, CA 93407, U.S.A.
Proceedings of the 004 Winter Simulation Conference R G Ingalls, M D Rossetti, J S Smith, and B A Peters, eds TEACHING REGRESSION WITH SIMULATION John H Walker Statistics Department California Polytechnic
More informationInternational Journal on Future Revolution in Computer Science & Communication Engineering ISSN: Volume: 4 Issue:
Application of the Variance Function of the Difference Between two estimated responses in regulating Blood Sugar Level in a Diabetic patient using Herbal Formula Karanjah Anthony N. School of Science Maasai
More informationTesting the Predictability of Consumption Growth: Evidence from China
Auburn University Department of Economics Working Paper Series Testing the Predictability of Consumption Growth: Evidence from China Liping Gao and Hyeongwoo Kim Georgia Southern University and Auburn
More informationSTATISTICS INFORMED DECISIONS USING DATA
STATISTICS INFORMED DECISIONS USING DATA Fifth Edition Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation Learning Objectives 1. Draw and interpret scatter diagrams
More informationScore Tests of Normality in Bivariate Probit Models
Score Tests of Normality in Bivariate Probit Models Anthony Murphy Nuffield College, Oxford OX1 1NF, UK Abstract: A relatively simple and convenient score test of normality in the bivariate probit model
More informationMultiple Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University
Multiple Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Multiple Regression 1 / 19 Multiple Regression 1 The Multiple
More informationMultiple Linear Regression Analysis
Revised July 2018 Multiple Linear Regression Analysis This set of notes shows how to use Stata in multiple regression analysis. It assumes that you have set Stata up on your computer (see the Getting Started
More informationIntroduction to Quantitative Methods (SR8511) Project Report
Introduction to Quantitative Methods (SR8511) Project Report Exploring the variables related to and possibly affecting the consumption of alcohol by adults Student Registration number: 554561 Word counts
More informationRegression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES
CHAPTER SIXTEEN Regression NOTE TO INSTRUCTORS This chapter includes a number of complex concepts that may seem intimidating to students. Encourage students to focus on the big picture through some of
More informationSTAT 201 Chapter 3. Association and Regression
STAT 201 Chapter 3 Association and Regression 1 Association of Variables Two Categorical Variables Response Variable (dependent variable): the outcome variable whose variation is being studied Explanatory
More informationMEASURES OF ASSOCIATION AND REGRESSION
DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 MEASURES OF ASSOCIATION AND REGRESSION I. AGENDA: A. Measures of association B. Two variable regression C. Reading: 1. Start Agresti
More informationLecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression
Lecture 6B: more Chapter 5, Section 3 Relationships between Two Quantitative Variables; Regression! Equation of Regression Line; Residuals! Effect of Explanatory/Response Roles! Unusual Observations! Sample
More informationReadings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F
Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F Plous Chapters 17 & 18 Chapter 17: Social Influences Chapter 18: Group Judgments and Decisions
More information11/24/2017. Do not imply a cause-and-effect relationship
Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are highly extraverted people less afraid of rejection
More informationValidation of Coping Styles and Emotions Undergraduate Inventory on Romanian Psychology Students
Available online at www.sciencedirect.com Procedia - Social and Behavioral Sciences 83 ( 2013 ) 1116 1120 2 nd World Conference on Educational Technology Researches WCETR2012 Validation of Coping Styles
More informationUse the above variables and any you might need to construct to specify the MODEL A/C comparisons you would use to ask the following questions.
Fall, 2002 Grad Stats Final Exam There are four questions on this exam, A through D, and each question has multiple sub-questions. Unless otherwise indicated, each sub-question is worth 3 points. Question
More informationRussian Journal of Agricultural and Socio-Economic Sciences, 3(15)
ON THE COMPARISON OF BAYESIAN INFORMATION CRITERION AND DRAPER S INFORMATION CRITERION IN SELECTION OF AN ASYMMETRIC PRICE RELATIONSHIP: BOOTSTRAP SIMULATION RESULTS Henry de-graft Acquah, Senior Lecturer
More informationChapter 2 Organizing and Summarizing Data. Chapter 3 Numerically Summarizing Data. Chapter 4 Describing the Relation between Two Variables
Tables and Formulas for Sullivan, Fundamentals of Statistics, 4e 014 Pearson Education, Inc. Chapter Organizing and Summarizing Data Relative frequency = frequency sum of all frequencies Class midpoint:
More informationSPSS Learning Objectives:
Tasks Two lab reports Three homeworks Shared file of possible questions Three annotated bibliographies Interviews with stakeholders 1 SPSS Learning Objectives: Be able to get means, SDs, frequencies and
More informationINVESTIGATION OF ROUNDOFF NOISE IN IIR DIGITAL FILTERS USING MATLAB
Clemson University TigerPrints All Theses Theses 5-2009 INVESTIGATION OF ROUNDOFF NOISE IN IIR DIGITAL FILTERS USING MATLAB Sierra Williams Clemson University, sierraw@clemson.edu Follow this and additional
More information14.1: Inference about the Model
14.1: Inference about the Model! When a scatterplot shows a linear relationship between an explanatory x and a response y, we can use the LSRL fitted to the data to predict a y for a given x. However,
More informationLuci Karbonini. Pmf St. SWAP May 27,2011
Seasonal dependence of low frequency variability in an idealised GCM Luci Karbonini Pmf St SWAP May 27,2011 Motivation Planetary-Scale Baroclinic Instability and MJO, Straus and Lindzen (observations)
More informationd =.20 which means females earn 2/10 a standard deviation more than males
Sampling Using Cohen s (1992) Two Tables (1) Effect Sizes, d and r d the mean difference between two groups divided by the standard deviation for the data Mean Cohen s d 1 Mean2 Pooled SD r Pearson correlation
More informationUnit 1 Exploring and Understanding Data
Unit 1 Exploring and Understanding Data Area Principle Bar Chart Boxplot Conditional Distribution Dotplot Empirical Rule Five Number Summary Frequency Distribution Frequency Polygon Histogram Interquartile
More informationBasic concepts and principles of classical test theory
Basic concepts and principles of classical test theory Jan-Eric Gustafsson What is measurement? Assignment of numbers to aspects of individuals according to some rule. The aspect which is measured must
More informationBangor University Laboratory Exercise 1, June 2008
Laboratory Exercise, June 2008 Classroom Exercise A forest land owner measures the outside bark diameters at.30 m above ground (called diameter at breast height or dbh) and total tree height from ground
More informationAntecedents Concerning Student s Identity and Role Performance in the Development and Promotion of Science and Technology Talents Project (DPST)
Antecedents Concerning Student s Identity and Role Performance in the Development and Promotion of Science and Technology Talents Project (DPST) Pinyapan Roamchart 1, Assoc.Prof.Dr.Dusadee Yoelao, Dr.Somsak
More information2 Assumptions of simple linear regression
Simple Linear Regression: Reliability of predictions Richard Buxton. 2008. 1 Introduction We often use regression models to make predictions. In Figure?? (a), we ve fitted a model relating a household
More informationPrediction of sheep milk chemical composition using ph, electrical conductivity and refractive index
Prediction of sheep milk chemical composition using ph, electrical conductivity and refractive index A.I. Gelasakis *, R. Giannakou *, A. Kominakis, G. Antonakos, G. Arsenos * * Department of Animal Production,
More informationProblem 1) Match the terms to their definitions. Every term is used exactly once. (In the real midterm, there are fewer terms).
Problem 1) Match the terms to their definitions. Every term is used exactly once. (In the real midterm, there are fewer terms). 1. Bayesian Information Criterion 2. Cross-Validation 3. Robust 4. Imputation
More informationRegression Equation. November 29, S10.3_3 Regression. Key Concept. Chapter 10 Correlation and Regression. Definitions
MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 10 Correlation and Regression 10 1 Review and Preview 10 2 Correlation 10 3 Regression 10 4 Variation and Prediction Intervals
More informationSTATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012
STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION by XIN SUN PhD, Kansas State University, 2012 A THESIS Submitted in partial fulfillment of the requirements
More informationCHAPTER III RESEARCH METHODOLOGY
CHAPTER III RESEARCH METHODOLOGY In this chapter, the researcher will elaborate the methodology of the measurements. This chapter emphasize about the research methodology, data source, population and sampling,
More informationInfrastructure Development and Economic Growth Nexus in Nigeria
Infrastructure Development and Economic Growth Nexus in Nigeria Owolabi-Merus, O School of Business, Economics and Law, University of New England, Armidale, NSW, 2351, Australia Email: oowolabimerus@gmail.com
More informationbivariate analysis: The statistical analysis of the relationship between two variables.
bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for
More informationSTATS8: Introduction to Biostatistics. Overview. Babak Shahbaba Department of Statistics, UCI
STATS8: Introduction to Biostatistics Overview Babak Shahbaba Department of Statistics, UCI The role of statistical analysis in science This course discusses some biostatistical methods, which involve
More informationMath 124: Module 2, Part II
, Part II David Meredith Department of Mathematics San Francisco State University September 15, 2009 What we will do today 1 Explanatory and Response Variables When you study the relationship between two
More informationDeterminants and Status of Vaccination in Bangladesh
Dhaka Univ. J. Sci. 60(): 47-5, 0 (January) Determinants and Status of Vaccination in Bangladesh Institute of Statistical Research and Training (ISRT), University of Dhaka, Dhaka-000, Bangladesh Received
More informationPreliminary Report on Simple Statistical Tests (t-tests and bivariate correlations)
Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) After receiving my comments on the preliminary reports of your datasets, the next step for the groups is to complete
More information