Part 8 Logistic Regression

Size: px
Start display at page:

Download "Part 8 Logistic Regression"

Transcription

1 1 Quantitative Methods for Health Research A Practical Interactive Guide to Epidemiology and Statistics Practical Course in Quantitative Data Handling SPSS (Statistical Package for the Social Sciences) Part 8 Logistic Regression Quantitative Methods for Health Research: A Practical Interactive Guide to Epidemiology and Statistics, Second Edition. Nigel Bruce, Daniel Pope, and Debbi Stanistreet John Wiley & Sons Ltd. Published 2018 by John Wiley & Sons Ltd.

2 Contents 1 OVERVIEW/REMINDER STUDY DATASET Reminder Study Dataset Coding Sheet for Low Back Pain Dataset IMPORTING THE STUDY DATASET PRACTICAL EXAMPLE OF MULTIPLE LOGISTIC REGRESSION Background to the Analysis Selection of Predictors for the Regression Model Is there a Univariate Association Between the Predictors and Low Back Pain? Relationship Between Hectic Work and Low Back Pain Interpreting the SPSS Output Relationship Between Monotonous Work and Low Back Pain Interpreting the SPSS Output Relationship Between Stressful Work and Low Back Pain Interpreting the SPSS Output Are There Univariate Associations Between the Potential Confounders and Low Back Pain? Relationship Between Psychological Distress and Low Back Pain Interpreting the SPSS Output Relationship Between Age and Low Back Pain Interpreting the SPSS Output Relationship Between Sex and Low Back Pain Interpreting the SPSS Output What Are the Independent Effects of the Predictor Variables (Psychological Working Environment) on Low Back Pain? Multivariate Logistic Regression in SPSS Interpreting the SPSS Output

3 3 1 Overview/Reminder Study Dataset This practical session is a continuation of multiple regression and describes the use of unconditional logistic regression in identifying associations between a dichotomous categorical outcome variable and predictor variables that are continuous and categorical. 1. Univariate and Multivariate Logistic Regression Using SPSS for Windows Throughout the practical session there are questions relating to the SPSS output obtained from the logistic regression analysis. Questions relating to the practical exercises are included in boxes like the one shown below. The answers to these questions are located at the end of this workbook. 1.1 Reminder Study Dataset The dataset for this exercise is a slightly different variation to the low back pain dataset (note the change in name). The dataset relates to information collected from 765 employees selected randomly from North West manual occupational settings. The aim of the study was to see what features of the occupational environment were associated with low back pain. The dataset includes information on demography (age, sex, height, weight, and social class), physical working environment (working postures, manual handling activities, and repetitive upper limb movements the duration of these activities was recorded for 60 minutes of one shift), psychosocial working environment (psychological demands of work), and psychological distress (a score based on responses to a psychological questionnaire a higher score indicates a higher level of psychological distress). The coding sheet for the low back pain dataset is shown below (note the type of variable, variable label, and value labels (if applicable)).

4 4 1.2 Coding Sheet for Low Back Pain Dataset Name Type Width Dec Label Values Id Numeric/continuous 4 0 Study number (unique identifier) compno Numeric/categorical 1 0 Type of company 1 = Post office, 2 = Supermarket, 3 = Store, 4 = Factory, 5 = Hospital Age Numeric/continuous 3 0 Age of subject Sex Numeric/nominal 1 0 Sex of subject 1 = Male, 2 = Female height Numeric/continuous 3 0 Height of subject (cm) weight Numeric/continuous 5 1 Weight of subject (kg) class Numeric/categorical 1 0 Social class 1 = I, 2 = II, 3 = IIIN, 4 = IIIM, 5 = IV, 6 = V backpain Numeric/nominal 1 0 Reported back pain Sit Numeric/continuous 4 1 Minutes seated stand Numeric/continuous 4 1 Minutes stood liftone Numeric/continuous 4 1 Minutes lifting with one hand liftboth Numeric/continuous 4 1 Minutes lifting with both hands onesho Numeric/continuous 4 1 Minutes carrying on one shoulder liftsho Numeric/continuous 4 1 Minutes lifting above shoulder level push Numeric/continuous 4 1 Minutes pushing weights Pull Numeric/continuous 4 1 Minutes pulling weights repwrist Numeric/continuous 4 1 Minutes with repetitive wrists reparm Numeric/continuous 4 1 Minutes with repetitive arms hectic Numeric/categorical 1 0 Finds work hectic/too fast 1 = Yes, 0 = No 0 = Never, 1 = Occasionally, 2 = Half the

5 5 monot Numeric/categorical 1 0 Finds work monotonous/boring stress Numeric/categorical 1 0 Finds work too stressful/anxious psycho Numeric/continuous 3 0 Psychological distress score time, 3 = Always 0 = Never, 1 = Occasionally, 2 = Half the time, 3 = Always 0 = Never, 1 = Occasionally, 2 = Half the time, 3 = Always

6 6 2 Importing the Study Dataset 1. Double click on SPSS Statistics 24 icon, When asked What would you like to do? : 2. Click on Open an existing file 3. Click on OK Note: The database for the following exercises is called backpain(logreg).sav: 4. Click on in the Look in box to generate a list of the drives 5. Click on backpain(log-reg).sav 6. Click on Open 3 Practical Example of Multiple Logistic Regression 3.1 Background to the Analysis To demonstrate the practical application of multiple logistic regression in SPSS, we will investigate the relationship between psychosocial working environment and low back pain. Although the back pain dataset has not come from a case control study, it will be treated as case control data in this practical with cases being those employees reporting back pain and controls being those employees not reporting back pain. In total there were 765 employees, of whom 198 (25.9%) were cases and 567 (74.1%) were controls. Therefore, there are approximately three controls for each case, but they are not matched. We are interested in identifying the relationship between psychosocial working environment (in terms of self-reported psychological demands associated with manual work) and low back pain. In particular we would

7 7 like to measure the independent effects of work speed (whether workers find their work hectic or too fast), work monotony (whether workers find their work unstimulating), and work stress (whether the work carried out by employees causes them anxiety or stress). The study hypothesis is that a poor psychosocial working environment, in terms of these psychological demands, may increase muscular tension through psychological stress, which in turn increases the risk of low back pain. 3.2 Selection of Predictors for the Regression Model The variables in the database that are of interest for this investigation include: Dependent (outcome) variable Low back pain categorical (dichotomous) Independent (explanatory) variables (including potential confounders) Psychosocial Working Environment Hectic work Monotonous work Stressful work All variables relating to psychosocial working environment are categorical variables with four point response scales (0 = never, 1 = occasionally, 2 = half the time, 3 = always). As we shall see, these categorical independent variables can be compared to the dependent variable (back pain) by creating dummy variables based on the response categories. Psychological Distress Psycho This is a continuous variable relating to a score on a questionnaire measuring psychological distress. The score ranges from 12 (no distress) to 48 (severely distressed). This variable will be included in the

8 8 logistic regression as a possible confounder to the relationship between psychosocial working environment and low back pain. Demographic Characteristics Age (years) continuous variable Sex a dichotomous (categorical) variable 3.3 Is there a Univariate Association Between the Predictors and Low Back Pain? Before constructing a multivariate model, we need to examine the association between the independent variables of interest (representing psychosocial working environment) and the dependent variable (low back pain) by carrying out univariate analysis using logistic regression Relationship Between Hectic Work and Low Back Pain You will find the logistic regression command in the Regression menu: 1. Click on Analyze 2. Select Regression 3. Select Binary Logistic The main dialog box is similar to the standard regression option box. There is a space to place a dependent variable (outcome).

9 9 4. Enter back pain into the Dependent variable box There is also a box for specifying the covariates (predictor variables). 5. Enter Is work hectic? into the Covariates box The next step is to convert the four categories of hectic work (never, occasionally, half the time, and always) into dummy variables. 6. Click on in the dialogue box 7. Enter Is work hectic? into the Categorical Covariates box

10 10 We also need to specify which category of hectic work will be used as the reference group. The reference category for this variable is never (thus employees who find their work is too hectic or too fast for varying lengths of their shift will be compared to those employees who never find their work hectic or too fast). Because never is the first category of hectic work we will need to tell SPSS this: 8. Check the First box under Reference Category 9. Click on the box 10. Click on Continue Before carrying out the logistic regression there are various options that can be selected to further describe how well the selected independent variable(s) predict the dependent variable. For the purposes of this practical we will only select one option (addition of confidence intervals). 11. Click on Options We need to specify that we want confidence intervals around the odds ratios (the exponential of the Beta coefficients): 12. Check the CI for exp(b) box 13. Click on Continue The logistic regression for hectic work and back pain can now be carried out.

11 Click on OK to run the analysis Interpreting the SPSS Output The first three tables provide general information about the analysis that has just been carried out. We can see that only one person was excluded from the analysis. The reason for this will have been that there was missing data relating to the employees exposure to hectic work. We can also see that there was no need to change the coding of back pain into 0 and 1 for the logistic regression (a 0 and 1 coding was already used for back pain). If other values had been used (e.g., 1 and 2) SPSS would automatically convert the values into 0 and 1 for the analysis (0 representing the lowest category).

12 12 Finally we can see that, because there are four categories for Hectic, three dummy variables have been created based on the response categories of 1 (occasionally), 2 (half the time), and 3 (always). Thus, the reference category (never) has values of zero (.000 in the table), zero, zero for each dummy variable. The occasionally category has values of one (1.000 in the table), zero, zero, and so on. The next three tables relate to the model when only the constant is included (i.e., no predictor variables are included). Because these tables do not directly tell us about the association between the dependent variable and independent variable(s), we will not consider these further. For additional information about interpretation of these tables, refer to the text Discovering Statistics using SPSS for Windows by Andy Field (Sage Publications, 2001 Chapter 5). The remaining output shows the results from the new model (including hectic as an independent variable). The Omnibus Tests of Model Coefficients (OTMC) Table gives an estimation of the goodness-of-fit of the model in a similar way to the ANOVA Table in linear regression. The Chi-square value for the model is interpreted in a similar way to the F-ratio; a measure of how much the model has improved the prediction of the outcome compared to the

13 13 level of inaccuracy (random error) of the model. If the model is a good one, then we expect the improvement in prediction due to the model to be large and the difference between the model and the observed data to be small. In short, a good model should have a large Chi-square. The significance for the Chi-square is calculated using critical values of the Chi-distribution, for the corresponding degrees of freedom. The Chi-square value for this univariate model is statistically significant (p < 0.05). Therefore, there is less than a 5% chance that a Chi-square value this large would happen by chance alone. In short, the logistic regression model with hectic work predicts low back pain significantly well. The next (Model Summary) table gives additional information regarding the predictive properties of the model. The interpretation of this table will not be described in detail, except to point out that the R Square values can be interpreted in a similar way to those in Multiple Linear Regression. In short, taking the Nagelkerke R Square value, we can see that 1.6% of the variation in the probability of having low back pain is predicted by the model (including hectic work as a predictor). For additional information about interpretation of the Model Summary table refer to Discovering Statistics using SPSS for Windows by Andy Field (Sage Publications, 2001 Chapter 5). The third piece of SPSS output relates to the Classification Table. Again, this does not provide much additional information to logistic regression analysis and will not be considered further. See Discovering Statistics using SPSS for Windows by Andy Field (Sage Publications, 2001 Chapter 5).

14 14 The final table to be considered is the Variables in the Equation Table (interpreted in the same way as the Coefficients Table when carrying out linear regression in SPSS). It is important to note that the Beta values for the logistic regression are log odds and, as such, are difficult to interpret as they stand. To obtain the odds ratio describing the association between the dependent variable (low back pain) and the independent variables (psychosocial working environment), it is therefore necessary to take the exponential of the Beta. This is given as Exp(B) in SPSS. The SPSS table providing the output for the univariate logistic regression analysis of the hectic work variable is shown below: We can see from the model investigating the association between hectic work and low back pain that the odds ratio increases with the amount of time employees spend carrying out work they believe to be too hectic or fast. For employees who report their work to be occasionally too hectic or fast the odds ratio (Exp(B)) is This odds ratio of 1.97 means that employees are twice as likely to experience low back pain if they occasionally carry out work that is too hectic or fast relative to those who never carry out such work the reference category. The odds ratio for low back pain increases to 2.6 for employees carrying out hectic work half the time and to 3.09 for employees who always report their work as being too hectic or fast, relative to those who never report such work. However, we can see from the table that only the association between low back pain and the third dummy variable (work is hectic always compared to never ) has achieved statistical significance (p < 0.05). This is confirmed when looking at the 95% confidence intervals. For employees reporting carrying out hectic work occasionally and for half the time the confidence intervals span unity; hectic work ( occasionally and half the time ) can be associated with both a decrease and an

15 15 increase in the risk of back pain relative to never carrying out hectic work. We can therefore conclude that hectic work is univariately associated with low back pain leading to an increasing risk with the length of time spent carrying out hectic work. The association between low back pain and work that is always hectic relative to never hectic is statistically significant Relationship Between Monotonous Work and Low Back Pain Now repeat the univariate logistic regression for the variable relating to monotonous work: 1. Click on Analyze 2. Select Regression 3. Select Binary Logistic 4. Click on Reset to clear the previous analysis 5. Enter back pain into the Dependent variable box 6. Enter Is work monotonous? into the Covariates box The next step is to convert the four categories of monotonous work (never, occasionally, half the time, and always) into dummy variables. 7. Click on in the dialogue box 8. Enter Is work monotonous? into the Categorical Covariates box 9. Check the First box under Reference Category 10. Click on the box 11. Click on Continue 12. Click on Options 13. Ensure the CI for exp(b) box has been checked 14. Click on Continue

16 Click on OK to run the analysis Interpreting the SPSS Output From the Case Processing Summary Table: How many people were excluded from the analysis (number of individuals without information about monotonous work)? From the Categorical Variables Coding Table: What was the number of individuals in each category of monotonous work? Never: Occasionally: Half the time: Always: Looking at the OTMC Table: What is the Chi-square value? What does this value tell us about the predictive properties of the model (including the predictor monotonous work)? How does this compare to the model incorporating hectic work? Looking at the Variables in the Equation Table: What are the odds ratios (and 95% confidence intervals) for the risk of low back pain according to the amount of time spent carrying out monotonous work? Never: OR = 1.0 Occasionally: OR = 95% CI = Half the time: OR = 95% CI = Always: OR = 95% CI = What does this tell us about the association between monotonous

17 17 work and low back pain? Conclusion: Is monotonous work univariately significantly associated with low back pain? Relationship Between Stressful Work and Low Back Pain Now repeat the univariate logistic regression for the variable relating to stressful work: 1. Click on Analyze 2. Select Regression 3. Select Binary Logistic 4. Click on Reset to clear the previous analysis 5. Enter back pain into the Dependent variable box 6. Enter Is work stressful? into the Covariates box 7. Click on in the dialogue box 8. Enter Is work stressful? into the Categorical Covariates box 9. Check the First box under Reference Category 10. Click on the box 11. Click on Continue 12. Click on Options 13. Ensure the CI for exp(b) box has been checked 14. Click on Continue

18 Click on OK to run the analysis Interpreting the SPSS Output From the Categorical Variables Coding Table: What was the number of individuals in each category of stressful work? Never: Occasionally: Half the time: Always: Looking at the OTMC Table: What is the Chi-square value? What does this value tell us about the predictive properties of the model (including the predictor stressful work)? How does this compare to the other models (hectic and monotonous work)? Looking at the Variables in the Equation Table: What are the odds ratios (and 95% confidence intervals) for the risk of low back pain according to the amount of time spent carrying out stressful work? Never: OR = 1.0 Occasionally: OR = 95% CI = Half the time: OR = 95% CI = Always: OR = 95% CI = What does this tell us about the association between stressful work and low back pain? Conclusion: Is stressful work univariately significantly associated

19 19 with low back pain? 3.4 Are There Univariate Associations Between the Potential Confounders and Low Back Pain? We also need to identify whether the potential confounders of interest are associated with low back pain in our dataset, before adjusting for them in the multivariate logistic regression. The potential confounders include: Psychological distress: Employees who have high levels of psychological distress might be more likely to report a poor psychosocial working environment and such distress has been observed to be related to the experience of pain. Sex: Age: Females have been found to report a greater amount of pain than males and might be more likely to report working in a poor psychosocial environment. As people get older they experience a greater amount of musculoskeletal pain. In addition, older people might be more likely to report working in a poor psychosocial environment than younger people.

20 Relationship Between Psychological Distress and Low Back Pain The commands for univariate logistic regression will be the same except that Psychological Distress is represented by a continuous variable (there is no need to specify that dummy variables are to be created). 1. Click on Analyze 2. Select Regression 3. Select Binary Logistic 4. Enter back pain into the Dependent variable box 5. Enter Psychological distress score into the Covariates box 6. Click on Options 7. Check the CI for exp(b) box 8. Click on Continue 9. Click on OK to run the analysis Interpreting the SPSS Output The tables are interpreted in the same was as shown in Section You will notice a third table representing dummy variables has not been created. From the Case Processing Summary Table we can see that four people were excluded from the analysis due to missing psychological distress data.

21 21 The OTMC Table shows us that the Chi-square value for this univariate model is statistically significant (p < ). Therefore, there is less than a 0.001% chance that a Chi-square value this large would happen by chance alone. The logistic regression model with psychological distress predicts low back pain significantly well. We can also see that psychological distress has a larger Chi-square value (27.092) than the three models containing the independent variables representing psychosocial working environment (more than three times greater than that obtained for hectic work). If we consider the final table (Variables in the Equation Table), we can see that the association between psychological distress and low back pain is significant (p < ). We can also see that the Exp(B) or odds ratio for this association is For each unit increase in psychological distress score the increased risk of having low back pain is 9.8%. We can also see that the 95% confidence interval for this odds ratio does not span unity, consistent with the hypothesis that psychological distress is significantly associated with low back pain. We can therefore conclude that psychological distress is univariately associated with low back pain.

22 Relationship Between Age and Low Back Pain Now repeat the univariate logistic regression for the variable relating to age: 1. Click on Analyze 2. Select Regression 3. Select Binary Logistic 4. Click on Reset to clear the previous analysis 5. Enter back pain into the Dependent variable box 6. Enter Age into the Covariates box 7. Click on Options 8. Ensure the CI for exp(b) box has been checked 9. Click on Continue 10.Click on OK to run the analysis Interpreting the SPSS Output How many people were excluded from the analysis due to missing information about age? What is the Chi-square value for the goodness-of-fit of the model including age? What does this value tell us about the predictive properties of the model (including age as an independent variable)?

23 23 What are the odds ratio and 95% confidence interval for the association between age and low back? What does this tell us about the association between age and low back pain? Conclusion: Is age univariately significantly associated with low back pain? Relationship Between Sex and Low Back Pain Now repeat the univariate logistic regression for the variable relating to sex: 1. Click on Analyze 2. Select Regression 3. Select Binary Logistic 4. Click on Reset to clear the previous analysis 5. Enter back pain into the Dependent variable box 6. Enter Sex into the Covariates box 7. Click on Options 8. Ensure the CI for exp(b) box has been checked 9. Click on Continue 10. Click on OK to run the analysis

24 24 Note: Because sex is a dichotomous categorical variable it is not necessary to specify dummy variables in SPSS Interpreting the SPSS Output How many people were excluded from the analysis due to missing information about sex? What is the Chi-square value for the goodness-of-fit of the model including sex? What does this value tell us about the predictive properties of the model (including sex as an independent variable)? What are the odds ratio and 95% confidence interval for the association between sex and low back (Note that SPSS will always take the smaller category as the referent. Hence males (1) form the referent group and the excess risk is assessed in females (2))? What does this tell us about the association between sex and low back pain? Conclusion: Is sex univariately significantly associated with low back pain?

25 What Are the Independent Effects of the Predictor Variables (Psychological Working Environment) on Low Back Pain? The full model will measure the independent effects of each of the predictor variables representing psychological working environment (hectic work, monotonous work, and stressful work) on the outcome (low back pain) after adjusting for the potential confounders (psychological distress and age). The model will be constructed using an unconditional multivariate logistic regression in SPSS Multivariate Logistic Regression in SPSS 1. Click on Analyze 2. Select Regression 3. Select Binary Logistic 4. Click on Reset to clear the previous analysis 5. Enter back pain into the Dependent variable box Now we must enter all the independent variables into the Covariates box. 6. Enter Is work hectic?, Is work monotonous?, Is work stressful?, Psychological distress; and age into the Covariates box It will be necessary to specify that hectic work, monotonous work, and stressful work are all categorical variables requiring dummy variables: 7. Click on the Categorical box 8. Enter Is work hectic?, Is work monotonous? and Is work stressful? into the Categorical Covariates box 9. For each variable: Check the First box under Reference Category and press Change

26 26 You should now have the following dialogue box: 10. Click on Continue to return to the main dialogue box We will also want to display the confidence limits for the Exp(B) coefficients (odds ratios): 11. Click on Options 12. Ensure the CI for exp(b) box has been checked 13. Click on Continue Before running the multivariate logistic regression it is important to discuss the method of carrying out the regression. The default method of conducting the regression is the Forced Entry Method. This method of regression places all the covariates (predictors) into the regression model in one block. An alternative method of regression (this is also true for multiple linear regression) is to use stepwise procedures (either forward or backward). These procedures are described in detail in Discovering Statistics using SPSS for Windows by Andy Field (Sage Publications, 2001 Chapter 5).

27 27 When forward stepwise regression is applied, the computer begins with a model that only includes the constant and then adds single predictors into the model based on a specified criterion. This criterion is the value of a score statistic: The variable with the most significant score statistic is added to the model. The computer proceeds until none of the remaining predictors have a significant score statistic (significance = p <0.05). At each step, the computer examines the variables entered into the model to see whether any should be removed (there are three removal criteria). Predictors in the model that have significance values above the default removal criterion of 0.1 will be removed. When backward stepwise regression is applied the same removal criteria are used, but instead of starting the model with only the constant, it begins the model with all the predictors included. The computer then tests whether any of these predictors can be removed from the model without having a substantial effect on how well the model fits the observed data. The first predictor to be removed will be the one that has the least impact on how the model fits the data. The main consideration of the choice of method of regression is whether you are testing a theory or merely carrying out exploratory work. Stepwise procedures are most appropriate when carrying out exploratory work (where no previous research exists on which to base hypotheses for testing and in situations where causality is not of interest and you merely wish to find a model to fit your data). Because the current example has a clear hypothesis (a poor psychosocial working environment is associated with back pain) that is supported by previous research, we will use the Forced Entry method of logistic regression. 14. Click on OK to run the analysis

28 Interpreting the SPSS Output The tables are interpreted in the same was as shown in Section From the Case Processing Summary Table: How many people were excluded from the analysis due to missing information about one or more of the predictor variables? From the OTMC Table: Does the model including the five explanatory variables significantly predict the variation in low back pain? Is the model any better at predicting the variation in low back pain than any single predictor (from the univariate analysis)?

29 29 From the Variables in the Equation Table: What has been the effect on the association between hectic work and low back pain of adjusting for the confounders and the other predictors? What does this tell us about the relationship between hectic work and low back pain? How is finding work stressful all the time associated with low back pain? How is finding work monotonous all the time associated with low back pain?

30 Are the two confounders still significantly associated with low back pain? 30

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Logistic Regression SPSS procedure of LR Interpretation of SPSS output Presenting results from LR Logistic regression is

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multinominal Logistic Regression SPSS procedure of MLR Example based on prison data Interpretation of SPSS output Presenting

More information

Simple Linear Regression One Categorical Independent Variable with Several Categories

Simple Linear Regression One Categorical Independent Variable with Several Categories Simple Linear Regression One Categorical Independent Variable with Several Categories Does ethnicity influence total GCSE score? We ve learned that variables with just two categories are called binary

More information

Basic Biostatistics. Chapter 1. Content

Basic Biostatistics. Chapter 1. Content Chapter 1 Basic Biostatistics Jamalludin Ab Rahman MD MPH Department of Community Medicine Kulliyyah of Medicine Content 2 Basic premises variables, level of measurements, probability distribution Descriptive

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multiple Regression (MR) Types of MR Assumptions of MR SPSS procedure of MR Example based on prison data Interpretation of

More information

POL 242Y Final Test (Take Home) Name

POL 242Y Final Test (Take Home) Name POL 242Y Final Test (Take Home) Name_ Due August 6, 2008 The take-home final test should be returned in the classroom (FE 36) by the end of the class on August 6. Students who fail to submit the final

More information

Statistical questions for statistical methods

Statistical questions for statistical methods Statistical questions for statistical methods Unpaired (two-sample) t-test DECIDE: Does the numerical outcome have a relationship with the categorical explanatory variable? Is the mean of the outcome the

More information

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES 24 MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES In the previous chapter, simple linear regression was used when you have one independent variable and one dependent variable. This chapter

More information

Correlation and Regression

Correlation and Regression Dublin Institute of Technology ARROW@DIT Books/Book Chapters School of Management 2012-10 Correlation and Regression Donal O'Brien Dublin Institute of Technology, donal.obrien@dit.ie Pamela Sharkey Scott

More information

One-Way Independent ANOVA

One-Way Independent ANOVA One-Way Independent ANOVA Analysis of Variance (ANOVA) is a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment.

More information

Step 3 Tutorial #3: Obtaining equations for scoring new cases in an advanced example with quadratic term

Step 3 Tutorial #3: Obtaining equations for scoring new cases in an advanced example with quadratic term Step 3 Tutorial #3: Obtaining equations for scoring new cases in an advanced example with quadratic term DemoData = diabetes.lgf, diabetes.dat, data5.dat We begin by opening a saved 3-class latent class

More information

Template 1 for summarising studies addressing prognostic questions

Template 1 for summarising studies addressing prognostic questions Template 1 for summarising studies addressing prognostic questions Instructions to fill the table: When no element can be added under one or more heading, include the mention: O Not applicable when an

More information

Chapter Eight: Multivariate Analysis

Chapter Eight: Multivariate Analysis Chapter Eight: Multivariate Analysis Up until now, we have covered univariate ( one variable ) analysis and bivariate ( two variables ) analysis. We can also measure the simultaneous effects of two or

More information

Chapter Eight: Multivariate Analysis

Chapter Eight: Multivariate Analysis Chapter Eight: Multivariate Analysis Up until now, we have covered univariate ( one variable ) analysis and bivariate ( two variables ) analysis. We can also measure the simultaneous effects of two or

More information

SPSS Correlation/Regression

SPSS Correlation/Regression SPSS Correlation/Regression Experimental Psychology Lab Session Week 6 10/02/13 (or 10/03/13) Due at the Start of Lab: Lab 3 Rationale for Today s Lab Session This tutorial is designed to ensure that you

More information

Intro to SPSS. Using SPSS through WebFAS

Intro to SPSS. Using SPSS through WebFAS Intro to SPSS Using SPSS through WebFAS http://www.yorku.ca/computing/students/labs/webfas/ Try it early (make sure it works from your computer) If you need help contact UIT Client Services Voice: 416-736-5800

More information

MAKING THE NSQIP PARTICIPANT USE DATA FILE (PUF) WORK FOR YOU

MAKING THE NSQIP PARTICIPANT USE DATA FILE (PUF) WORK FOR YOU MAKING THE NSQIP PARTICIPANT USE DATA FILE (PUF) WORK FOR YOU Hani Tamim, PhD Clinical Research Institute Department of Internal Medicine American University of Beirut Medical Center Beirut - Lebanon Participant

More information

SPSS Portfolio. Brittany Murray BUSA MWF 1:00pm-1:50pm

SPSS Portfolio. Brittany Murray BUSA MWF 1:00pm-1:50pm SPSS Portfolio Brittany Murray BUSA 2182 MWF 1:00pm-1:50pm Table Of Contents I) SPSS Computer Lab Assignment # 1 Frequency Distribution a) Cover Page b) Explanatory Paragraph c) Appendix II) SPSS Computer

More information

Chapter 9: Comparing two means

Chapter 9: Comparing two means Chapter 9: Comparing two means Smart Alex s Solutions Task 1 Is arachnophobia (fear of spiders) specific to real spiders or will pictures of spiders evoke similar levels of anxiety? Twelve arachnophobes

More information

MODEL SELECTION STRATEGIES. Tony Panzarella

MODEL SELECTION STRATEGIES. Tony Panzarella MODEL SELECTION STRATEGIES Tony Panzarella Lab Course March 20, 2014 2 Preamble Although focus will be on time-to-event data the same principles apply to other outcome data Lab Course March 20, 2014 3

More information

Section 6: Analysing Relationships Between Variables

Section 6: Analysing Relationships Between Variables 6. 1 Analysing Relationships Between Variables Section 6: Analysing Relationships Between Variables Choosing a Technique The Crosstabs Procedure The Chi Square Test The Means Procedure The Correlations

More information

WELCOME! Lecture 11 Thommy Perlinger

WELCOME! Lecture 11 Thommy Perlinger Quantitative Methods II WELCOME! Lecture 11 Thommy Perlinger Regression based on violated assumptions If any of the assumptions are violated, potential inaccuracies may be present in the estimated regression

More information

Binary Diagnostic Tests Two Independent Samples

Binary Diagnostic Tests Two Independent Samples Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary

More information

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE Spring 20 11, Volume 1, Issue 1 THE STATSWHISPERER The StatsWhisperer Newsletter is published by staff at StatsWhisperer. Visit us at: www.statswhisperer.com Introduction to this Issue The current issue

More information

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship

More information

Lesson: A Ten Minute Course in Epidemiology

Lesson: A Ten Minute Course in Epidemiology Lesson: A Ten Minute Course in Epidemiology This lesson investigates whether childhood circumcision reduces the risk of acquiring genital herpes in men. 1. To open the data we click on File>Example Data

More information

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA ingle factor or One - Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group

More information

DOE Wizard Definitive Screening Designs

DOE Wizard Definitive Screening Designs DOE Wizard Definitive Screening Designs Revised: 10/10/2017 Summary... 1 Designs Containing Only Continuous Factors... 2 Designs Containing Categorical Factors... 11 References... 16 Summary The DOE Wizard

More information

Prediction Model For Risk Of Breast Cancer Considering Interaction Between The Risk Factors

Prediction Model For Risk Of Breast Cancer Considering Interaction Between The Risk Factors INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME, ISSUE 0, SEPTEMBER 01 ISSN 81 Prediction Model For Risk Of Breast Cancer Considering Interaction Between The Risk Factors Nabila Al Balushi

More information

Multiple Linear Regression Analysis

Multiple Linear Regression Analysis Revised July 2018 Multiple Linear Regression Analysis This set of notes shows how to use Stata in multiple regression analysis. It assumes that you have set Stata up on your computer (see the Getting Started

More information

bivariate analysis: The statistical analysis of the relationship between two variables.

bivariate analysis: The statistical analysis of the relationship between two variables. bivariate analysis: The statistical analysis of the relationship between two variables. cell frequency: The number of cases in a cell of a cross-tabulation (contingency table). chi-square (χ 2 ) test for

More information

Logistic Regression. The right choices over time greatly improve your odds of a long and healthy life.

Logistic Regression. The right choices over time greatly improve your odds of a long and healthy life. Learning Objectives C H A P T E R 1 3 Logistic Regression The right choices over time greatly improve your odds of a long and healthy life. Upon completing this chapter, you will be able to do the following:

More information

isc ove ring i Statistics sing SPSS

isc ove ring i Statistics sing SPSS isc ove ring i Statistics sing SPSS S E C O N D! E D I T I O N (and sex, drugs and rock V roll) A N D Y F I E L D Publications London o Thousand Oaks New Delhi CONTENTS Preface How To Use This Book Acknowledgements

More information

Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies

Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Prepared by: Assoc. Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti Putra Malaysia Serdang At the end of this session,

More information

Logistic Regression Predicting the Chances of Coronary Heart Disease. Multivariate Solutions

Logistic Regression Predicting the Chances of Coronary Heart Disease. Multivariate Solutions Logistic Regression Predicting the Chances of Coronary Heart Disease Multivariate Solutions What is Logistic Regression? Logistic regression in a nutshell: Logistic regression is used for prediction of

More information

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India 20th International Congress on Modelling and Simulation, Adelaide, Australia, 1 6 December 2013 www.mssanz.org.au/modsim2013 Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision

More information

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research

Chapter 11 Nonexperimental Quantitative Research Steps in Nonexperimental Research Chapter 11 Nonexperimental Quantitative Research (Reminder: Don t forget to utilize the concept maps and study questions as you study this and the other chapters.) Nonexperimental research is needed because

More information

Examining differences between two sets of scores

Examining differences between two sets of scores 6 Examining differences between two sets of scores In this chapter you will learn about tests which tell us if there is a statistically significant difference between two sets of scores. In so doing you

More information

Introduction to SPSS S0

Introduction to SPSS S0 Basic medical statistics for clinical and experimental research Introduction to SPSS S0 Katarzyna Jóźwiak k.jozwiak@nki.nl November 10, 2017 1/55 Introduction SPSS = Statistical Package for the Social

More information

Anticoagulation Manager - Getting Started

Anticoagulation Manager - Getting Started Vision 3 Anticoagulation Manager - Getting Started Copyright INPS Ltd 2014 The Bread Factory, 1A Broughton Street, Battersea, London, SW8 3QJ T: +44 (0) 207 501700 F:+44 (0) 207 5017100 W: www.inps.co.uk

More information

Variables and Data. Gbenga Ogunfowokan Lead, Nigerian Regional Faculty The Global Health Network 19 th May 2017

Variables and Data. Gbenga Ogunfowokan Lead, Nigerian Regional Faculty The Global Health Network 19 th May 2017 Variables and Data Gbenga Ogunfowokan Lead, Nigerian Regional The Global Health Network 19 th May 2017 Objectives At the end of this presentation you should be able to 1) Define a variable 2) Classify

More information

Two-Way Independent ANOVA

Two-Way Independent ANOVA Two-Way Independent ANOVA Analysis of Variance (ANOVA) a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment. There

More information

Survey of Smoking, Drinking and Drug Use (SDD) among young people in England, Andrew Bryant

Survey of Smoking, Drinking and Drug Use (SDD) among young people in England, Andrew Bryant Survey of Smoking, Drinking and Drug Use (SDD) among young people in England, 2010 Andrew Bryant Newcastle University Institute of Health and Society Background Background Young people s drinking behaviour

More information

NIH Public Access Author Manuscript Parkinsonism Relat Disord. Author manuscript; available in PMC 2009 August 1.

NIH Public Access Author Manuscript Parkinsonism Relat Disord. Author manuscript; available in PMC 2009 August 1. NIH Public Access Author Manuscript Published in final edited form as: Parkinsonism Relat Disord. 2009 August ; 15(7): 535 538. doi:10.1016/j.parkreldis.2008.10.006. Embarrassment in Essential Tremor:

More information

STATISTICAL MODELING OF THE INCIDENCE OF BREAST CANCER IN NWFP, PAKISTAN

STATISTICAL MODELING OF THE INCIDENCE OF BREAST CANCER IN NWFP, PAKISTAN STATISTICAL MODELING OF THE INCIDENCE OF BREAST CANCER IN NWFP, PAKISTAN Salah UDDIN PhD, University Professor, Chairman, Department of Statistics, University of Peshawar, Peshawar, NWFP, Pakistan E-mail:

More information

CHAPTER VI RESEARCH METHODOLOGY

CHAPTER VI RESEARCH METHODOLOGY CHAPTER VI RESEARCH METHODOLOGY 6.1 Research Design Research is an organized, systematic, data based, critical, objective, scientific inquiry or investigation into a specific problem, undertaken with the

More information

Binary Diagnostic Tests Paired Samples

Binary Diagnostic Tests Paired Samples Chapter 536 Binary Diagnostic Tests Paired Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary measures

More information

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto

Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling. Olli-Pekka Kauppila Daria Kautto Doing Quantitative Research 26E02900, 6 ECTS Lecture 6: Structural Equations Modeling Olli-Pekka Kauppila Daria Kautto Session VI, September 20 2017 Learning objectives 1. Get familiar with the basic idea

More information

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale. Model Based Statistics in Biology. Part V. The Generalized Linear Model. Single Explanatory Variable on an Ordinal Scale ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10,

More information

Biostats Final Project Fall 2002 Dr. Chang Claire Pothier, Michael O'Connor, Carrie Longano, Jodi Zimmerman - CSU

Biostats Final Project Fall 2002 Dr. Chang Claire Pothier, Michael O'Connor, Carrie Longano, Jodi Zimmerman - CSU Biostats Final Project Fall 2002 Dr. Chang Claire Pothier, Michael O'Connor, Carrie Longano, Jodi Zimmerman - CSU Prevalence and Probability of Diabetes in Patients Referred for Stress Testing in Northeast

More information

Regression Including the Interaction Between Quantitative Variables

Regression Including the Interaction Between Quantitative Variables Regression Including the Interaction Between Quantitative Variables The purpose of the study was to examine the inter-relationships among social skills, the complexity of the social situation, and performance

More information

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality

Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Week 9 Hour 3 Stepwise method Modern Model Selection Methods Quantile-Quantile plot and tests for normality Stat 302 Notes. Week 9, Hour 3, Page 1 / 39 Stepwise Now that we've introduced interactions,

More information

Ordinary Least Squares Regression

Ordinary Least Squares Regression Ordinary Least Squares Regression March 2013 Nancy Burns (nburns@isr.umich.edu) - University of Michigan From description to cause Group Sample Size Mean Health Status Standard Error Hospital 7,774 3.21.014

More information

Survey research (Lecture 1) Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.

Survey research (Lecture 1) Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4. Summary & Conclusion Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.0 Overview 1. Survey research 2. Survey design 3. Descriptives & graphing 4. Correlation

More information

Survey research (Lecture 1)

Survey research (Lecture 1) Summary & Conclusion Lecture 10 Survey Research & Design in Psychology James Neill, 2015 Creative Commons Attribution 4.0 Overview 1. Survey research 2. Survey design 3. Descriptives & graphing 4. Correlation

More information

Chapter 13 Estimating the Modified Odds Ratio

Chapter 13 Estimating the Modified Odds Ratio Chapter 13 Estimating the Modified Odds Ratio Modified odds ratio vis-à-vis modified mean difference To a large extent, this chapter replicates the content of Chapter 10 (Estimating the modified mean difference),

More information

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations)

Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) Preliminary Report on Simple Statistical Tests (t-tests and bivariate correlations) After receiving my comments on the preliminary reports of your datasets, the next step for the groups is to complete

More information

The University of North Carolina at Chapel Hill School of Social Work

The University of North Carolina at Chapel Hill School of Social Work The University of North Carolina at Chapel Hill School of Social Work SOWO 918: Applied Regression Analysis and Generalized Linear Models Spring Semester, 2014 Instructor Shenyang Guo, Ph.D., Room 524j,

More information

CHAPTER ONE CORRELATION

CHAPTER ONE CORRELATION CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to

More information

Using SPSS for Correlation

Using SPSS for Correlation Using SPSS for Correlation This tutorial will show you how to use SPSS version 12.0 to perform bivariate correlations. You will use SPSS to calculate Pearson's r. This tutorial assumes that you have: Downloaded

More information

General practice. Role of mechanical and psychosocial factors in the onset of forearm pain: prospective population based study.

General practice. Role of mechanical and psychosocial factors in the onset of forearm pain: prospective population based study. Role of mechanical and psychosocial factors in the onset of forearm pain: prospective population based study Gary J Macfarlane, Isabelle M Hunt, Alan J Silman Unit of Chronic Disease Epidemiology, School

More information

Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution 4.0

Summary & Conclusion. Lecture 10 Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution 4.0 Summary & Conclusion Lecture 10 Survey Research & Design in Psychology James Neill, 2016 Creative Commons Attribution 4.0 Overview 1. Survey research and design 1. Survey research 2. Survey design 2. Univariate

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Correlation SPSS procedure for Pearson r Interpretation of SPSS output Presenting results Partial Correlation Correlation

More information

Charts Worksheet using Excel Obesity Can a New Drug Help?

Charts Worksheet using Excel Obesity Can a New Drug Help? Worksheet using Excel 2000 Obesity Can a New Drug Help? Introduction Obesity is known to be a major health risk. The data here arise from a study which aimed to investigate whether or not a new drug, used

More information

(C) Jamalludin Ab Rahman

(C) Jamalludin Ab Rahman SPSS Note The GLM Multivariate procedure is based on the General Linear Model procedure, in which factors and covariates are assumed to have a linear relationship to the dependent variable. Factors. Categorical

More information

Chapter 14: More Powerful Statistical Methods

Chapter 14: More Powerful Statistical Methods Chapter 14: More Powerful Statistical Methods Most questions will be on correlation and regression analysis, but I would like you to know just basically what cluster analysis, factor analysis, and conjoint

More information

MULTIPLE OLS REGRESSION RESEARCH QUESTION ONE:

MULTIPLE OLS REGRESSION RESEARCH QUESTION ONE: 1 MULTIPLE OLS REGRESSION RESEARCH QUESTION ONE: Predicting State Rates of Robbery per 100K We know that robbery rates vary significantly from state-to-state in the United States. In any given state, we

More information

Influence of Hypertension and Diabetes Mellitus on. Family History of Heart Attack in Male Patients

Influence of Hypertension and Diabetes Mellitus on. Family History of Heart Attack in Male Patients Applied Mathematical Sciences, Vol. 6, 01, no. 66, 359-366 Influence of Hypertension and Diabetes Mellitus on Family History of Heart Attack in Male Patients Wan Muhamad Amir W Ahmad 1, Norizan Mohamed,

More information

IAPT: Regression. Regression analyses

IAPT: Regression. Regression analyses Regression analyses IAPT: Regression Regression is the rather strange name given to a set of methods for predicting one variable from another. The data shown in Table 1 and come from a student project

More information

Social Inequalities in Self-Reported Health in the Ukrainian Working-age Population: Finding from the ESS

Social Inequalities in Self-Reported Health in the Ukrainian Working-age Population: Finding from the ESS Social Inequalities in Self-Reported Health in the Ukrainian Working-age Population: Finding from the ESS Iryna Mazhak, PhD., a fellow at Aarhus Institute of Advanced Studies Contact: irynamazhak@aias.au.dk

More information

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers SOCY5061 RELATIVE RISKS, RELATIVE ODDS, LOGISTIC REGRESSION RELATIVE RISKS: Suppose we are interested in the association between lung cancer and smoking. Consider the following table for the whole population:

More information

Title: Socioeconomic conditions and number of pain sites in women

Title: Socioeconomic conditions and number of pain sites in women Author's response to reviews Title: Socioeconomic conditions and number of pain sites in women Authors: Finn E Skjeldestad (fisk@fhi.no) Toril Rannestad (Toril.Rannestad@hist.no) Version: 2 Date: 17 January

More information

Bangor University Laboratory Exercise 1, June 2008

Bangor University Laboratory Exercise 1, June 2008 Laboratory Exercise, June 2008 Classroom Exercise A forest land owner measures the outside bark diameters at.30 m above ground (called diameter at breast height or dbh) and total tree height from ground

More information

Line Murtnes Hagestande

Line Murtnes Hagestande Line Murtnes Hagestande The relation between leisure time physical exercise, physical and psychosocial work demands, and risk of fibromyalgia in working women; The Nord-Trøndelag Health Study BEV3901,

More information

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Ho (null hypothesis) Ha (alternative hypothesis) Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Hypothesis: Ho:

More information

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics

Lecture Outline. Biost 517 Applied Biostatistics I. Purpose of Descriptive Statistics. Purpose of Descriptive Statistics Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 3: Overview of Descriptive Statistics October 3, 2005 Lecture Outline Purpose

More information

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2

12/30/2017. PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 PSY 5102: Advanced Statistics for Psychological and Behavioral Research 2 Selecting a statistical test Relationships among major statistical methods General Linear Model and multiple regression Special

More information

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations.

Statistics as a Tool. A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Statistics as a Tool A set of tools for collecting, organizing, presenting and analyzing numerical facts or observations. Descriptive Statistics Numerical facts or observations that are organized describe

More information

3 CONCEPTUAL FOUNDATIONS OF STATISTICS

3 CONCEPTUAL FOUNDATIONS OF STATISTICS 3 CONCEPTUAL FOUNDATIONS OF STATISTICS In this chapter, we examine the conceptual foundations of statistics. The goal is to give you an appreciation and conceptual understanding of some basic statistical

More information

Manual Handling/Manual Tasks Checklist

Manual Handling/Manual Tasks Checklist Manual Handling/Manual Instruction: Complete for all manual tasks in consultation with workers undertaking the task. Date checklist completed Date for review Name of person completing checklist Position

More information

Hour 2: lm (regression), plot (scatterplots), cooks.distance and resid (diagnostics) Stat 302, Winter 2016 SFU, Week 3, Hour 1, Page 1

Hour 2: lm (regression), plot (scatterplots), cooks.distance and resid (diagnostics) Stat 302, Winter 2016 SFU, Week 3, Hour 1, Page 1 Agenda for Week 3, Hr 1 (Tuesday, Jan 19) Hour 1: - Installing R and inputting data. - Different tools for R: Notepad++ and RStudio. - Basic commands:?,??, mean(), sd(), t.test(), lm(), plot() - t.test()

More information

WORK FITNESS ASSESSMENT

WORK FITNESS ASSESSMENT WORK FITNESS ASSESSMENT Name Date of Birth Job Applying For Date of Assessment Consent The work fitness assessment is a test of your capacity to undertake work postures and work activities. The assessment

More information

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results.

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results. 10 Learning Objectives Testing Means After reading this chapter, you should be able to: Related-Samples t Test With Confidence Intervals 1. Describe two types of research designs used when we select related

More information

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography

More information

Analysis of Covariance (ANCOVA)

Analysis of Covariance (ANCOVA) Analysis of Covariance (ANCOVA) Some background ANOVA can be extended to include one or more continuous variables that predict the outcome (or dependent variable). Continuous variables such as these, that

More information

02a: Test-Retest and Parallel Forms Reliability

02a: Test-Retest and Parallel Forms Reliability 1 02a: Test-Retest and Parallel Forms Reliability Quantitative Variables 1. Classic Test Theory (CTT) 2. Correlation for Test-retest (or Parallel Forms): Stability and Equivalence for Quantitative Measures

More information

Lecture 21. RNA-seq: Advanced analysis

Lecture 21. RNA-seq: Advanced analysis Lecture 21 RNA-seq: Advanced analysis Experimental design Introduction An experiment is a process or study that results in the collection of data. Statistical experiments are conducted in situations in

More information

Speed & Intensity risk factors in Wellnomics Risk Management. Wellnomics White Paper

Speed & Intensity risk factors in Wellnomics Risk Management. Wellnomics White Paper Speed & Intensity risk factors in Wellnomics Risk Management Wellnomics White Paper Dr Kevin Taylor, BE, PhD Wellnomics Limited www.wellnomics.com research@wellnomics.com 2010 Wellnomics Limited Ref 15/10/2010

More information

Ergonomics Software User s Manual v 4.1 BAKPAK. An Integrated Software Package for the Ergonomic Assessment of Lifting and Lowering Tasks

Ergonomics Software User s Manual v 4.1 BAKPAK. An Integrated Software Package for the Ergonomic Assessment of Lifting and Lowering Tasks Ergonomics Software User s Manual v 4.1 BAKPAK An Integrated Software Package for the Ergonomic Assessment of Lifting and Lowering Tasks 120 205 Hayes - 401 Grand Ave, Tecumseh, Marais Rd W, ON, Windsor,

More information

Linear Regression in SAS

Linear Regression in SAS 1 Suppose we wish to examine factors that predict patient s hemoglobin levels. Simulated data for six patients is used throughout this tutorial. data hgb_data; input id age race $ bmi hgb; cards; 21 25

More information

MULTIPLE REGRESSION OF CPS DATA

MULTIPLE REGRESSION OF CPS DATA MULTIPLE REGRESSION OF CPS DATA A further inspection of the relationship between hourly wages and education level can show whether other factors, such as gender and work experience, influence wages. Linear

More information

Biostatistics II

Biostatistics II Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,

More information

Overview of Lecture. Survey Methods & Design in Psychology. Correlational statistics vs tests of differences between groups

Overview of Lecture. Survey Methods & Design in Psychology. Correlational statistics vs tests of differences between groups Survey Methods & Design in Psychology Lecture 10 ANOVA (2007) Lecturer: James Neill Overview of Lecture Testing mean differences ANOVA models Interactions Follow-up tests Effect sizes Parametric Tests

More information

9 research designs likely for PSYC 2100

9 research designs likely for PSYC 2100 9 research designs likely for PSYC 2100 1) 1 factor, 2 levels, 1 group (one group gets both treatment levels) related samples t-test (compare means of 2 levels only) 2) 1 factor, 2 levels, 2 groups (one

More information

CHAPTER III METHODOLOGY

CHAPTER III METHODOLOGY CHAPTER III METHODOLOGY This chapter describes the procedures followed in the selection of subjects, selection of variables, selection of tests, reliability of the data, competency of the tester, orientation

More information

3D SSPP Version 6. ANALYSIS & USE GUIDE For Reactive & Proactive Use

3D SSPP Version 6. ANALYSIS & USE GUIDE For Reactive & Proactive Use 3D SSPP Version 6 ANALYSIS & USE GUIDE For Reactive & Proactive Use REQUIREMENTS The user must complete the UAW-GM 3D SSPP training course offered through the UAW- GM Center for Human Resources for the

More information

PreTect Software Documentation

PreTect Software Documentation PreTect Software Documentation Table of Contents 1 Introduction 2 1.1 Overview 2 1.2 Author and Copyright 2 2 Disclaimer and Warning 3 3 Program Concepts 4 3.1 Program Purpose 4 3.2 Exercise / Stretch

More information

4Stat Wk 10: Regression

4Stat Wk 10: Regression 4Stat 342 - Wk 10: Regression Loading data with datalines Regression (Proc glm) - with interactions - with polynomial terms - with categorical variables (Proc glmselect) - with model selection (this is

More information

Media, Discussion and Attitudes Technical Appendix. 6 October 2015 BBC Media Action Andrea Scavo and Hana Rohan

Media, Discussion and Attitudes Technical Appendix. 6 October 2015 BBC Media Action Andrea Scavo and Hana Rohan Media, Discussion and Attitudes Technical Appendix 6 October 2015 BBC Media Action Andrea Scavo and Hana Rohan 1 Contents 1 BBC Media Action Programming and Conflict-Related Attitudes (Part 5a: Media and

More information