Choosing the Correct Statistical Test


Tracie O. Afifi, PhD
Departments of Community Health Sciences & Psychiatry, University of Manitoba
Department of Community Health Sciences, College of Medicine, Faculty of Health Sciences

What do you need to know to pick the right statistical test?

To pick the correct statistical test you need to know:
What your research question is asking
The level of measurement of the variables
The distribution of the data

Common Statistical Tests
T-test
ANOVA
Pearson's Correlation
Linear Regression
Logistic Regression
Mann-Whitney U
Kruskal-Wallis Test
Chi-Square Test
Spearman's Correlation

What is your research question asking?

Choosing a Statistical Test
What is your research question asking?
Is there a difference?
Is there a relationship?

Is there a difference? Is there a difference in depression among adolescents who are sexually abused compared to adolescents who are not sexually abused?

Is there a difference?
Candidate tests: T-test, ANOVA, Mann-Whitney U, Kruskal-Wallis Test, Chi-Square Test
But how do you know which one to choose?
What are the variables?
How are the variables measured?
What is the distribution of the data?

What are the Variables?
Is there a difference in depression among adolescents who are sexually abused compared to adolescents who are not sexually abused?
One variable is sexual abuse.
One variable is depression.

How are the Variables Measured? Sexual Abuse Depression Categories (yes or no) Categories (none, minor, moderate, severe) Scores (e.g., 0-10)

How are the Variables Measured? Level of Measurement

Level of Measurement
Nominal: Named categories with no order.
Ordinal: Categories with a logical order or rank order.
Interval: Rank order AND the distance between intervals of measurement has meaning (the zero value is arbitrary).
Ratio: Same properties as interval data AND the distance and ratio between two measurements are defined, with an empirical (not arbitrary) zero value. You can say a score of 20 is twice as much as 10.
Liamputtong 2013

Level of Measurement
Type: Description (examples)
Nominal: Classes or categories without numerical order (male, female; Jewish, Catholic, Muslim)
Ordinal (ranked): Ordered categories (mild pain, moderate pain, and severe pain; high school, undergraduate, graduate)
Interval: The distance or interval between two measurements has meaning (temperature in Celsius, where zero = 273.15 Kelvin)
Ratio: The distance and ratio between two measurements are defined and zero has a meaning of zero, so you can say twice as much (weight; age in years; temperature in Kelvin, which has an absolute zero)

What is the Distribution of the Data?

Central Tendency and Dispersion
Central tendency: Where the bulk of the data lie. Mode, median, mean, etc.
Dispersion: How widely or narrowly the data are spread out. Number of categories, range, standard deviation, etc.
Health Research Methods: A Canadian Perspective (2014) Edited by K. Bassil & D. Zabkiewicz; Chapter 7, pp. 119-142

Central Tendency
Mode: The value that appears most often. (3, 4, 5, 6, 8, 8, 15) Mode = 8
Mean: The arithmetic average of the observations. (3, 4, 5, 6, 8, 8, 15) Mean = 7
Median: The middle value. (3, 4, 5, 6, 8, 8, 15) Median = 6
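The three measures on this slide can be checked with a few lines of Python. This is only an illustrative sketch using the standard library `statistics` module; the variable names are assumptions, and the data are the seven observations shown on the slide.

```python
import statistics

# The seven observations from the slide.
observations = [3, 4, 5, 6, 8, 8, 15]

mode_value = statistics.mode(observations)      # value that appears most often
mean_value = statistics.mean(observations)      # sum of the values divided by their count
median_value = statistics.median(observations)  # middle value of the sorted data

print("mode:", mode_value)
print("mean:", mean_value)
print("median:", median_value)
```

Note that the single large value (15) pulls the mean above the median, which is one reason the median is preferred for skewed data.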

Level of Measurement, Central Tendency, and Dispersion
Nominal (NON-PARAMETRIC TESTS): Central tendency = mode (most frequent category). Dispersion = number of categories.
Ordinal (NON-PARAMETRIC TESTS): Central tendency = median (data are ranked; the middle value, with half above and half below). Dispersion = range and interquartile range (take the median of the upper half and the median of the lower half; the IQR is the difference between the two).
Interval (PARAMETRIC TESTS): Central tendency = mean (summed and divided by number). Dispersion = standard deviation (how much each data point deviates from the mean).
Ratio (PARAMETRIC TESTS): Central tendency = mean (summed and divided by number). Dispersion = standard deviation.
Health Research Methods: A Canadian Perspective (2014) Edited by K. Bassil & D. Zabkiewicz; Chapter 7, pp. 119-142
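As a rough illustration of the dispersion measures named above, the sketch below computes the range, interquartile range, and standard deviation with Python's standard library. The data are an assumption: they reuse the seven observations from the central tendency slide. Quartile conventions differ between textbooks and software, so the IQR here reflects the default method of `statistics.quantiles`, which for these data agrees with the "median of each half" description.

```python
import statistics

# Assumed example data: the observations from the central tendency slide.
observations = [3, 4, 5, 6, 8, 8, 15]

# Range: distance between the largest and smallest value.
data_range = max(observations) - min(observations)

# Quartile cut points; the IQR is the distance between the first and third.
q1, q2, q3 = statistics.quantiles(observations, n=4)
iqr = q3 - q1

# Sample standard deviation (divides by n - 1).
std_dev = statistics.stdev(observations)

print("range:", data_range)
print("IQR:", iqr)
print("standard deviation:", std_dev)
```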

What is the Distribution of the Data? Normal Distribution Or Non-Normal Distribution

Normal Distribution
[Figure: histogram of average hours of sleep; x-axis: hours of sleep, y-axis: frequency. Mean = 7.92, Std Error = 0.13, 95% CI = 7.68 to 8.18]

Non-Normal Distribution
[Figure: histogram of hours of sleep among respondents with babies; x-axis: hours of sleep, y-axis: frequency. Mean = 5.88, Std Error = 0.30, 95% CI = 5.27 to 6.49]

Distribution of the Data
Parametric test: Interval or ratio level data with a NORMAL DISTRIBUTION
Non-parametric test: Nominal or ordinal level data, or interval or ratio level data with a NON-NORMAL DISTRIBUTION

Common Statistical Tests

Is there a difference?
Parametric: T-test, ANOVA
Non-Parametric: Mann-Whitney U, Kruskal-Wallis Test, Chi-Square Test

T-test
Tests whether two means are statistically different.
One variable is continuous (interval or ratio level)
One variable is dichotomous (two categories)
Distribution of the continuous variable is NORMAL (bell curve)

T-test
Is the mean depression score different for adolescents who are sexually abused compared to adolescents who are not sexually abused?
Sexual abuse = Yes or No (nominal, dichotomous)
Depression = 1 to 10 (interval, with higher scores indicating worse depression)
Depression (mean):
Total sample: 4
No sexual abuse: 2
Sexual abuse: 8
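To make the comparison of two means concrete, here is a minimal sketch of the pooled (Student's) t statistic for two independent groups, written with the standard library only. The scores are hypothetical, invented solely so that the group means equal the 2 and 8 shown on the slide; the function name is also an assumption, and equal variances are assumed.

```python
import math
import statistics

# Hypothetical depression scores, chosen only so the group means are 2 and 8.
no_abuse = [1, 2, 2, 3, 2]
abuse = [7, 8, 9, 8, 8]


def pooled_t_statistic(group_a, group_b):
    """Return (t, degrees of freedom) for two independent groups,
    assuming the groups share a common variance."""
    n_a, n_b = len(group_a), len(group_b)
    mean_a, mean_b = statistics.mean(group_a), statistics.mean(group_b)
    var_a, var_b = statistics.variance(group_a), statistics.variance(group_b)
    # Weighted average of the two sample variances.
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2)
    # Standard error of the difference between the two means.
    std_error = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean_b - mean_a) / std_error, n_a + n_b - 2


t_stat, degrees_of_freedom = pooled_t_statistic(no_abuse, abuse)
print("t =", round(t_stat, 3), "df =", degrees_of_freedom)
```

Turning the statistic into a p-value requires the t distribution, which the standard library does not provide; a statistics package such as SciPy (`scipy.stats.ttest_ind`) would normally be used for that step.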

What if the Distribution was NON-NORMAL?
One variable is continuous (interval or ratio level) with a NON-NORMAL DISTRIBUTION
One variable is dichotomous (two categories)

Mann-Whitney U test
A non-parametric test for comparing ordinal or non-normal continuous level data for two independent groups (non-normal distribution).
One variable: ordinal or non-normal continuous level
One variable: two-level categorical (dichotomous)
Bruce, 2008 Quantitative Methods for Health Research, pp. 491-495
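The U statistic itself is simple to compute: for every pair made of one observation from each group, count which group's value is larger, with ties counting as one half. The sketch below does this directly. The data and the function name are hypothetical, and the code stops at the statistic; it does not compute a p-value.

```python
def mann_whitney_u(group_a, group_b):
    """Return (U for group_a, U for group_b) by direct pairwise comparison."""
    u_a = 0.0
    for a in group_a:
        for b in group_b:
            if a > b:
                u_a += 1.0   # group_a's value is larger in this pair
            elif a == b:
                u_a += 0.5   # a tie counts as half
    # Every pair contributes exactly 1 in total, so the two U values sum to n_a * n_b.
    u_b = len(group_a) * len(group_b) - u_a
    return u_a, u_b


# Hypothetical ordinal scores for two independent groups.
group_a = [1, 3, 4, 6]
group_b = [2, 5, 7, 8, 9]

u_a, u_b = mann_whitney_u(group_a, group_b)
print("U_a =", u_a, "U_b =", u_b)
```

Because only the ordering of the values matters, the test can be used with ordinal data and is not sensitive to the shape of the distribution.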

Is there a difference?
Parametric: T-test (difference in means in two groups)
Non-Parametric: Mann-Whitney U (difference in medians in two groups)

Is there a difference? What if you have three groups or more? No sexual abuse, minor sexual abuse, moderate sexual abuse, severe sexual abuse?

ANOVA (Analysis of Variance)
Used to test for statistical differences between three or more group means.
ANOVA compares differences across all means at the same time.
The distribution of the sample means is normal (parametric).
Dependent variable: continuous (one variable)
Independent variable: categorical (one variable with more than two levels or groups)
Bruce (2008); Tabachnick & Fidell (2007); Winston (1999); Liamputtong (2013)

ANOVA
Are the mean depression scores different for adolescents who experience mild sexual abuse, moderate sexual abuse, or severe sexual abuse?
Distribution of depression scores is NORMAL
Sexual abuse (ordinal: none, minor, moderate, severe)
Depression (interval, ranging 0 to 10)
Depression (mean):
Total sample: 4
No sexual abuse: 2
Minor sexual abuse: 4
Moderate sexual abuse: 7
Severe sexual abuse: 9

ANOVA
Tests whether three or more means are statistically different.
One variable is continuous (interval or ratio level) with a NORMAL DISTRIBUTION
One variable is categorical (three or more categories)
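A one-way ANOVA boils down to the ratio of between-group variability to within-group variability (the F statistic). The following is a sketch under stated assumptions: the scores are hypothetical, made up so that the four group means are 2, 4, 7, and 9 as in the depression example, and the function name is illustrative.

```python
import statistics


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of independent groups."""
    all_values = [value for group in groups for value in group]
    grand_mean = statistics.mean(all_values)

    # Between-group sum of squares: how far each group mean is from the grand mean.
    ss_between = sum(
        len(group) * (statistics.mean(group) - grand_mean) ** 2 for group in groups
    )
    # Within-group sum of squares: how far each value is from its own group mean.
    ss_within = sum(
        (value - statistics.mean(group)) ** 2 for group in groups for value in group
    )

    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical scores with group means 2, 4, 7 and 9.
none, minor, moderate, severe = [1, 2, 3], [3, 4, 5], [6, 7, 8], [8, 9, 10]

f_stat, df_between, df_within = one_way_anova_f([none, minor, moderate, severe])
print("F =", f_stat, "df =", (df_between, df_within))
```

A large F means the group means differ by more than the spread inside the groups would suggest; judging significance still requires the F distribution from a statistics package.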

What if the Distribution was NON-NORMAL? One variable is ordinal OR continuous (interval or ratio level) with a NON-NORMAL DISTRIBUTION One variable is Categorical (three or more categories)

Kruskal-Wallis Test
Compares median scores from three or more groups.
One variable = continuous (non-normal) or ordinal
One variable = categorical with 3 levels or more
An extension of the Mann-Whitney U test and the non-parametric equivalent to ANOVA.
Liamputtong, 2013
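The Kruskal-Wallis H statistic replaces the raw values with their ranks in the combined sample and then compares the rank sums of the groups. Below is a minimal sketch with hypothetical data and assumed function names. It gives tied values their average rank but does not apply the tie correction factor that statistical packages usually add.

```python
def average_ranks(values):
    """Map each distinct value to its rank in the sorted data (ties share the average rank)."""
    ordered = sorted(values)
    ranks = {}
    i = 0
    while i < len(ordered):
        j = i
        while j < len(ordered) and ordered[j] == ordered[i]:
            j += 1
        # Sorted positions i+1 .. j (1-based) hold the same value.
        ranks[ordered[i]] = (i + 1 + j) / 2
        i = j
    return ranks


def kruskal_wallis_h(groups):
    """Return the H statistic (without tie correction) for a list of groups."""
    all_values = [value for group in groups for value in group]
    n_total = len(all_values)
    ranks = average_ranks(all_values)
    # Sum of (rank sum squared / group size) over the groups.
    weighted = sum(
        sum(ranks[value] for value in group) ** 2 / len(group) for group in groups
    )
    return 12 / (n_total * (n_total + 1)) * weighted - 3 * (n_total + 1)


# Hypothetical ordinal scores for three groups.
h_stat = kruskal_wallis_h([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
print("H =", round(h_stat, 3))
```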

Chi-Square Test of Significance (χ²)
Non-parametric test (non-normal distribution)
One variable: categorical with 2 or more levels
One variable: categorical with 2 or more levels
Bruce (2007); Tabachnick & Fidell (2007); Winston (1999)
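The chi-square statistic compares the observed counts in a cross-tabulation with the counts expected if the two categorical variables were unrelated. The sketch below uses a hypothetical 2 x 2 table (the counts are invented for illustration) and an assumed function name. It computes the plain Pearson statistic with no continuity correction.

```python
def chi_square_statistic(table):
    """Return (chi-square statistic, degrees of freedom) for a table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the row and column variables were independent.
            expected = row_totals[i] * col_totals[j] / total
            statistic += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof


# Hypothetical counts. Rows: sexual abuse yes / no. Columns: depressed yes / no.
observed_counts = [[30, 10],
                   [20, 40]]

chi2, dof = chi_square_statistic(observed_counts)
print("chi-square =", round(chi2, 3), "df =", dof)
```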


Is there a relationship?
Is there a positive correlation between sexual abuse and depression? (Correlation)
Is sexual abuse severity associated with increased severity of depression? (Linear Regression)
Is sexual abuse associated with increased odds of depression? (Logistic Regression)

Is there a relationship?
Parametric: Pearson's Correlation, Linear Regression, Logistic Regression
Non-Parametric: Spearman's Correlation

Correlation
Measures the strength of a linear relationship.
Pearson: The distributions of the variables are normal (parametric test). One variable continuous; one variable continuous.
Spearman: The distributions of the variables are non-normal (non-parametric test) OR one or more variables are ordinal. One variable continuous/categorical; one variable continuous/categorical.
Bruce, 2008 Quantitative Methods for Health Research, pp. 74-78
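The difference between the two coefficients is easiest to see in code: Spearman's correlation is Pearson's correlation applied to the ranks of the data. The sketch below is illustrative only; the data are hypothetical, the function names are assumptions, and the simple ranking helper assumes there are no tied values.

```python
import math


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)


def ranks_without_ties(values):
    """Rank values from 1 (smallest) upward; assumes all values are distinct."""
    order = sorted(values)
    return [order.index(v) + 1 for v in values]


def spearman_rho(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson_r(ranks_without_ties(x), ranks_without_ties(y))


# Hypothetical data: y always increases with x, but not in a straight line.
x = [1, 2, 3, 4, 5]
y = [1, 4, 9, 16, 25]

r = pearson_r(x, y)
rho = spearman_rho(x, y)
print("Pearson r =", round(r, 4), "Spearman rho =", round(rho, 4))
```

Here Spearman's coefficient is 1 because the ordering is perfectly preserved, while Pearson's coefficient is slightly lower because the relationship is curved rather than linear.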

Linear Regression
Describes how one variable (the dependent variable, DV) depends on the other variable (the independent variable, IV).
Regression estimates the relationship between two variables.
One dependent variable: continuous
One or more independent variables: any level of measurement
Bruce, 2008 Quantitative Methods for Health Research, pp. 232-255
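For a single independent variable, the regression line can be estimated by ordinary least squares in a few lines. The sketch below is an illustration under assumptions: abuse severity is coded 0 to 3 (none, minor, moderate, severe), and the four group means from the ANOVA example (2, 4, 7, 9) stand in for individual depression scores. Real analyses would use the individual observations.

```python
def least_squares_line(x, y):
    """Return (slope, intercept) of the ordinary least squares line y = intercept + slope * x."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    slope = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / sum(
        (a - mean_x) ** 2 for a in x
    )
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Assumed coding: 0 = none, 1 = minor, 2 = moderate, 3 = severe.
severity = [0, 1, 2, 3]
# Illustrative outcome values taken from the group means in the ANOVA example.
depression = [2, 4, 7, 9]

slope, intercept = least_squares_line(severity, depression)
print("depression is approximately", round(intercept, 2), "+", round(slope, 2), "* severity")
```

The slope is read as the expected change in the dependent variable for a one-unit increase in the independent variable.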

Logistic Regression
Predicts a dichotomous outcome from one or more independent variables (reported as an odds ratio).
Parametric test (some distribution assumptions apply)
One dependent variable: dichotomous (two categories)
One or more independent variables: any level
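When the only independent variable is itself dichotomous, the odds ratio that a logistic regression reports equals the odds ratio computed directly from the 2 x 2 table, so the idea can be sketched without fitting a model. The counts and names below are hypothetical; models with several or continuous predictors need a statistics package to fit.

```python
import math


def odds_ratio(exposed_yes, exposed_no, unexposed_yes, unexposed_no):
    """Odds of the outcome among the exposed divided by the odds among the unexposed."""
    odds_exposed = exposed_yes / exposed_no
    odds_unexposed = unexposed_yes / unexposed_no
    return odds_exposed / odds_unexposed


# Hypothetical counts: depressed / not depressed, by sexual abuse status.
abused_depressed, abused_not_depressed = 30, 10
not_abused_depressed, not_abused_not_depressed = 20, 40

or_estimate = odds_ratio(
    abused_depressed, abused_not_depressed,
    not_abused_depressed, not_abused_not_depressed,
)
# A logistic regression coefficient is the natural log of the odds ratio.
log_odds_ratio = math.log(or_estimate)

print("odds ratio =", or_estimate, "log odds ratio =", round(log_odds_ratio, 3))
```

An odds ratio above 1 indicates higher odds of the outcome in the exposed group; a value of 1 indicates no association.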

Is there a relationship?
Parametric Test (Normal Distribution)
Pearson's Correlation: One variable = continuous; one variable = continuous
Linear Regression: Dependent variable = continuous (1 variable); independent variable = any level (1 or more)
Logistic Regression: Dependent variable = dichotomous (1 variable); independent variable = any level (1 or more)
Non-Parametric Test (Non-Normal Distribution)
Spearman's Correlation: One variable = continuous or categorical; one variable = continuous or categorical
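The relationship summary can be read as a small decision rule. The function below is only a sketch of that reading: the function name, argument names, and string labels are assumptions introduced for illustration, and it covers only the four tests listed on this slide.

```python
def choose_relationship_test(goal, outcome="continuous", normal=True):
    """Suggest a test for an 'is there a relationship?' question.

    goal:    "correlate" (strength of association) or "predict" (regression)
    outcome: "continuous" or "dichotomous" (used when goal is "predict")
    normal:  whether the variables are normally distributed (used when goal is "correlate")
    """
    if goal == "correlate":
        return "Pearson's correlation" if normal else "Spearman's correlation"
    if goal == "predict":
        if outcome == "continuous":
            return "Linear regression"
        if outcome == "dichotomous":
            return "Logistic regression"
    raise ValueError("unsupported combination of goal and outcome")


print(choose_relationship_test("correlate", normal=False))
print(choose_relationship_test("predict", outcome="dichotomous"))
```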

Is there a difference?
Parametric Test (Normal Distribution)
T-test (difference in means): One variable = continuous; one variable = dichotomous
ANOVA: One variable = continuous; one variable = 3 or more categories
Non-Parametric Test (Non-Normal Distribution)
Mann-Whitney U (difference in medians): One variable = continuous or ordinal; one variable = dichotomous
Kruskal-Wallis Test: One variable = continuous (non-normal) or ordinal; one variable = 3 categories or more
Chi-Square Test: One variable = 2 or more categories; one variable = 2 or more categories
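The difference summary can likewise be written as a decision rule driven by the three questions in this talk: how the outcome is measured, how many groups are compared, and whether the data are normally distributed. This is a hedged sketch, not a complete guide. The function and argument names are assumptions, and it treats a nominal outcome as the chi-square case.

```python
def choose_difference_test(outcome_level, n_groups, normal):
    """Suggest a test for an 'is there a difference?' question.

    outcome_level: "nominal", "ordinal", "interval" or "ratio"
    n_groups:      number of categories in the grouping variable (2 or more)
    normal:        whether the outcome is normally distributed
    """
    if n_groups < 2:
        raise ValueError("need at least two groups to test for a difference")
    if outcome_level == "nominal":
        # Both variables are categorical.
        return "Chi-square test"
    # Parametric tests need interval or ratio data with a normal distribution.
    parametric = outcome_level in ("interval", "ratio") and normal
    if n_groups == 2:
        return "T-test" if parametric else "Mann-Whitney U"
    return "ANOVA" if parametric else "Kruskal-Wallis test"


# Depression score (interval, normal) by sexual abuse yes / no.
print(choose_difference_test("interval", 2, normal=True))
# Depression category (ordinal) by four levels of abuse severity.
print(choose_difference_test("ordinal", 4, normal=False))
```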

To pick the correct statistical test you need to know:
What your research question is asking
The level of measurement of the variables
The distribution of the data