Chi-Square Goodness-of-Fit Test

Similar documents
CAPL 2 Questionnaire

Inferential Statistics

Comparing multiple proportions

appstats26.notebook April 17, 2015

Saskatchewan HIV Strategy: Social Network Approach

Demonstrating Client Improvement to Yourself and Others

FAQs about Provider Profiles on Breast Cancer Screenings (Mammography) Q: Who receives a profile on breast cancer screenings (mammograms)?

FGSZ Zrt. from 28 February 2019 till 29 February 2020 AUCTION CALENDAR: YEARLY YEARLY BUNDLED AT CROSS BORDER POINTS

Barriers to Transplantation

GSK Medicine: Study No.: Title: Rationale: Study Period Objectives: Indication: Study Investigators/Centers: Research Methods: Data Source:

McLean ebasis plus TM

Analysis of Meter Reading Validation Tolerances proposed by Project Nexus

IMPLEMENTING RECOVERY ORIENTED CLINICAL SERVICES IN OPIOID TREATMENT PROGRAMS PILOT UPDATE. A Clinical Quality Improvement Program

Education around PML risk and monitoring at NHNN Queen Square MS Centre

Global and National Trends in Vaccine Preventable Diseases. Dr Brenda Corcoran National Immunisation Office.

Applied Statistical Analysis EDUC 6050 Week 4

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /peds.

H1N1 FEARS EXAGGERATED SAY MANY CANADIANS LIBS TEN PERCENTAGE POINTS BEHIND TORIES

Data Visualization - Basics

Exam 4 Review Exercises

Flu Watch. MMWR Week 3: January 14 to January 20, and Deaths. Virologic Surveillance. Influenza-Like Illness Surveillance

Durham Region Influenza Bulletin: 2017/18 Influenza Season

STA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID #

Supplementary Online Content

GREENWOOD PUBLIC SCHOOL DISTRICT PHYSICAL EDUCATION

Flu Watch. MMWR Week 4: January 21 to January 27, and Deaths. Virologic Surveillance. Influenza-Like Illness Surveillance

Breast Test Wales Screening Division Public Health Wales

Improving care of HIV-infected breastfeeding

The Confidence Interval. Finally, we can start making decisions!

Development and use of the Eating, Sleeping, Consoling (ESC) Care. for opioid-exposed newborns and their families in Northern New England

18 Week 92% Open Pathway Recovery Plan and Backlog Clearance

Date : September Permit/License or Registration Application. Permit/License/ Notification/ Registration Description. Remark

Lauren DiBiase, MS, CIC Associate Director Public Health Epidemiologist Hospital Epidemiology UNC Hospitals

BREATH AND BLOOD ALCOHOL STATISTICS

From Analytics to Action

Published by the Pharmaceutical Services Division to provide information for British Columbia s health care providers

The PROMs Programme in the NHS in England

Consultant-led Referral to Treatment (RTT) waiting times collection timetable: outcome of consultation

15.301/310, Managerial Psychology Prof. Dan Ariely Recitation 8: T test and ANOVA

Winter Holiday Suicide Myth Continues to be Reinforced in Press Annenberg Public Policy Center Study Finds

Emergency Department Boarding of Psychiatric Patients in Oregon

Overview of the Radiation Exposure Doses of the Workers at Fukushima Daiichi Nuclear Power Station

Shigella Infections in Maryland

Experimental Psychology Arlo Clark Foos

Has the UK had a double epidemic?

Crisis Connections Crisis Line Phone Worker Training (Online/Onsite) Winter 2019

Avian influenza in poultry, wild and captive birds (AI)

HRS 2010: Module 2 Health Literacy V000 BRANCHPOINT: IF THIS IS NOT A SELF-RESPONDENT (A009/A155 NOT 1), GO TO END OF MODULES

Quit Rates of New York State Smokers

City of Vancouver s Response to the Opioid Crisis Fire Chief Darrell Reid Vancouver Fire & Rescue Services (VF&RS)

Something to think about. What happens, however, when we have a sample with less than 30 items?

APPENDIX N. Summary Statistics: The "Big 5" Statistical Tools for School Counselors

Monitoring Protocol for Clozapine-induced Myocarditis. Copyright 2017, CAMH

Sleep Market Panel. Results for June 2015

Successful Falls Prevention in Aged Persons Mental Health. Reducing the risk and decreasing severity of outcome

An Updated Approach to Colon Cancer Screening and Prevention

Cancer Immunotherapy Pilot Program/ Patents 4 Patients. USPTO BCP Customer Partnership Meeting Alexandria, VA August 2, 2017

Placing the United States on the Path Toward the Elimination of Hepatitis C as a Public Health Threat

Kansas EMS Naloxone (Narcan) Administration

Seasonality of influenza activity in Hong Kong and its association with meteorological variations

Alberta Health. Seasonal Influenza in Alberta Season. Analytics and Performance Reporting Branch

Blood Alcohol Levels for Fatally Injured Drivers

Community Grants Strategy 2010 Evaluation Findings

Interpretability of Sudden Concept Drift in Medical Informatics Domain

Clostridium difficile (C. difficile) and Staphylococcus aureus bacteraemia (MRSA and MSSA) Bi-annual Report. Surveillance: Report:

Weekly Influenza News 2016/17 Season. Communicable Disease Surveillance Unit. Summary of Influenza Activity in Toronto for Week 43

TB Outbreak in a Homeless Shelter

Alcohol & Drugs. Contents:

Sheila Barron Statistics Outreach Center 2/8/2011

Curators of the University of Missouri - Combined January 1, 2016 through December 31, 2016

Quit for new life Capacity building staff to enhance successful quitting among pregnant Aboriginal women

BLOOD ALCOHOL LEVELS FOR FATALLY INJURED DRIVERS

BJA Performance Measures

Zero Suicide in Texas (ZEST)

CONTROL CHART METHODOLOGY

Timing of vaccination campaigns against pandemic influenza in a population dynamical model of Vancouver, Canada

Lessons in biostatistics

Dementia Content Report January Produced By The NHS Choices Reporting Team

Chapter 23. Inference About Means. Copyright 2010 Pearson Education, Inc.

Tri-County Opioid Safety Coalition Data Brief December 2017 Clackamas, Multnomah, and Washington Counties

Dementia Content Report May Produced By The NHS Choices Reporting Team

ANOVA. Thomas Elliott. January 29, 2013

Reflection Questions for Math 58B

Forecasting Dengue Hemorrhagic Fever Cases Using Time Series in East Java Province, Indonesia

Implementing Rapid Response Teams (RRT) National Call September 13, 2007

California 2010 Pertussis Epidemic. Kathleen Winter, MPH Immunization Branch California Department of Public Health

Assessing Change with IIS. Steve Robison Oregon Immunization Program

Health Equity Workgroup. January 18, 2018

Statistics Assignment 11 - Solutions

FORM C Dr. Sanocki, PSY 3204 EXAM 1 NAME

Step 10 Visualisation Carlos Moura

SUICIDE IN SAN DIEGO COUNTY:

Magellan s Transport Route Lead Monitoring Program

Name Student Response sheet Period. Cell Division Mitosis Lab

Online Supplement to A Data-Driven Model of an Appointment-Generated Arrival Process at an Outpatient Clinic

Abs, Butt & Thigh. Zumba Gymnasium WEIGHT ROOM Monday Tuesday Wednesday Thursday Friday Saturday Sunday

We empower women toward

HIV Testing. Susan Tusher, LMSW Program Coordinator The Kansas AIDS Education and Training Center

Health impact assessment of particulate matter exposure in Pearl River Delta (PRD), China

Transcription:

STAT 250 Dr. Kari Lock Morgan Chi-Square Goodness-of-Fit Test SECTION 7.1 Testing the distribution of a single categorical variable : χ 2 goodness of fit (7.1) Multiple Categories So far, we ve learned how to do inference for categorical variables with only two categories Today, we ll learn how to do hypothesis tests for categorical variables with multiple categories Question of the Day Are children with ADHD younger than their peers? ADHD or Just Young? In British Columbia, Canada, the cutoff date for entering school in any year is December 31 st, so those born late in the year are almost a year younger than those born early in the year Is it possible that younger students are being overdiagnosed with ADHD? Are children diagnosed with ADHD more likely to be born late in the year, and so be younger than their peers? Data: Sample of kids aged 6-12 in British Columbia who have been diagnosed with ADHD Morrow, R., et. al., Influence of relative age on diagnosis and treatment of attentiondeficit/hyperactivity disorder in children, Canadian Medical Association Journal, April 17, 2012; 184(7): 755-762. Categories For simplicity, we ll break the year into four categories: January March April June July September October December Are birthdays for kids with ADHD evenly distributed among these four time periods? 1) State Hypotheses Hypothesis Testing 2) Calculate a statistic, based on your sample data 3) Create a distribution of this statistic, as it would be observed if the null hypothesis were true 4) Measure how extreme your test statistic from (2) is, as compared to the distribution generated in (3) 1

Hypotheses Define parameters as p 1 = Proportion of ADHD births in January March p 2 = Proportion of ADHD births in April June p 3 = Proportion of ADHD births in July Sep p 4 = Proportion of ADHD births in Oct Dec H 0 : p 1 = p 2 = p 3 = p 4 = 1/4 H a : At least one p i 1/4 Observed Counts The observed counts are the actual counts observed in the study Jan-Mar Apr-Jun July-Sep Oct-Dec Observed 6880 7982 9161 8945 Test Statistic Why can t we use the familiar formula sample statistic null value SE to get the test statistic? Expected Counts The expected counts are the expected counts if the null hypothesis were true For each cell, the expected count is the sample size (n) times the null proportion, p i expected = npi We need something a bit more complicated Expected Counts n = 32,968 boys with ADHD H 0 : p 1 = p 2 = p 3 = p 4 = ¼ Expected count for each category, if the null hypothesis were true: 32,968(1/4) = 8242 Jan-Mar Apr-Jun July-Sep Oct-Dec Observed 6880 7982 9161 8945 Expected 8242 8242 8242 8242 Chi-Square Statistic Jan-Mar Apr-Jun July-Sep Oct-Dec Observed 6880 7982 9161 8945 Expected 8242 8242 8242 8242 Need a way to measure how far the observed counts are from the expected counts Use the chi-square statistic : ( ) 2 2 observed - expected χ = expected 2

Chi-Square Statistic Observed Expected Contribution to χ 2 Jan-Mar 6880 8242 StatKey More Advanced Randomization Tests: χ 2 Goodness-of-Fit Apr-Jun 7982 8242 Jul-Sep 9161 8242 Oct-Dec 8945 8242 χ 2 = ( observed - expected) 2 expected Minitab Stat > Tables > Chi-Square Goodness-of-Fit Test What Next? We have a test statistic. What else do we need to perform the hypothesis test? Upper-Tail p-value To calculate the p-value for a chi-square test, we always look in the upper tail Why? Values of the χ 2 statistic are always positive The higher the χ 2 statistic is, the farther the observed counts are from the expected counts, and the stronger the evidence against the null Randomization Distribution Can we reject the null? a) Yes b) No 3

Chi-Square (χ 2 ) Distribution If each of the expected counts are at least 5, AND if the null hypothesis is true, then the χ 2 statistic follows a χ 2 distribution, with degrees of freedom equal to Chi-Square Distribution df = number of categories 1 ADHD Birth Month: df = 4 1 = 3 χ 2 Null Distribution p-value This is a TINY p-value Conclusion We have incredibly strong evidence that birthdays of boys with ADHD are not evenly distributed throughout the year. Chi-Square Test for Goodness of Fit 1. State null hypothesized proportions for each category, p i. Alternative is that at least one of the proportions is different than specified in the null. 2. Calculate the expected counts for each cell as np i. ( ) 2 3. Calculate the χ 2 statistic: 2 observed - expected χ = expected 4. Compute the p-value as the proportion above the χ 2 statistic for either a randomization distribution or a χ 2 distribution with df = (# of categories 1) if expected counts all > 5 5. Interpret the p-value in context. 4

Births Equally Likely? Maybe all birthdays (not just for boys with ADHD) are not evenly distributed throughout the year We have the overall proportion of births for all boys during each time period in British Columbia: January March: 0.244 April June: 0.258 July September: 0.257 October December: 0.241 Hypotheses Define parameters as p 1 = Proportion of ADHD births in January March p 2 = Proportion of ADHD births in April June p 3 = Proportion of ADHD births in July Sep p 4 = Proportion of ADHD births in Oct Dec H 0 : p 1 = 0.244, p 2 = 0.258, p 3 = 0.257, p 4 = 0.241 H a : At least one p i is not as specified in H 0 Birthday Proportion ADHD Expected Counts Jan-Mar 0.244 6880 Apr-Jun 0.258 7982 Jul-Sep 0.257 9161 Oct-Dec 0.241 8945 n = 32,968 The expected count for the Jan-Mar cell is a) 8044.2 b) 8505.7 c) 8472.8 d) 7945.3 Birthday Proportion ADHD Expected Contribution to χ 2 Counts Jan-Mar 0.244 6880 Apr-Jun 0.258 7982 Jul-Sep 0.257 9161 Oct-Dec 0.241 8945 7945.3 The contribution to χ 2 for the Oct-Dec cell is a) 32.2 b) 55.9 c) 125.8 d) 168.5 Birth Date Proportion ADHD Diagnoses Jan-Mar 0.244 6880 Apr-Jun 0.258 7982 Jul-Sep 0.257 9161 Oct-Dec 0.241 8945 Expected Counts Contribution to χ 2 ADHD or Just Young (Boys) p-value 0 We have VERY (!!!) strong evidence that boys with ADHD do not have the same distribution of birthdays as all boys in British Columbia. 5

A Deeper Understanding The p-value only tells you that the counts provide evidence against the null distribution If you want to know more, look at Which cells contribute most to the χ 2 statistic? Are the observed or expected counts higher? Birth Date Proportion ADHD Diagnoses Expected Counts Contribution to χ 2 Jan-Mar 0.244 6880 8044.2 168.5 Apr-Jun 0.258 7982 8505.7 32.2 Jul-Sep 0.257 9161 8472.8 55.9 Oct-Dec 0.241 8945 7945.3 125.8 Big contributions to χ 2 : beginning and end of the year Jan-Feb: Many fewer ADHD than would be expected Oct-Dec: Many more ADHD cases than would be expected ADHD and Birth Year Stop and think about the implications of this! In British Columbia, 6.9% of all boys 6-12 are diagnosed with ADHD 5.5% of boys 6-12 receive medication for ADHD How many of these diagnoses are simply due to the fact that these kids are younger than their peers??? ARE YOU CONCERNED? ADHD or Just Young? (Girls) Want more practice? Here is the data for girls. (χ 2 = 236.8) Birth Date Proportion ADHD Diagnoses Jan-Mar 0.243 1960 Apr-Jun 0.258 2358 Jul-Sep 0.257 2859 Oct-Dec 0.242 2904 Read Section 7.1 To Do Do HW 7.1 (due Friday, 11/20) 6