Packet'6:'Chi-Square'Test'for'Independence'!!

Similar documents
MATH 183 Test 2 Review Problems

Introduction. Lecture 1. What is Statistics?

NORTH SOUTH UNIVERSITY TUTORIAL 1

STATISTICS & PROBABILITY

AP STATISTICS 2009 SCORING GUIDELINES

Distributions and Samples. Clicker Question. Review

Understandable Statistics

Algebra 2 P Experimental Design 11 5 Margin of Error

5.3: Associations in Categorical Variables

Statistics are commonly used in most fields of study and are regularly seen in newspapers, on television, and in professional work.

Chi-square test. Wenli Lu Dept. Health statistics School of public health Tianjin medical university. Chi-square

STA 291 Lecture 4 Jan 26, 2010

Chapter 1: Exploring Data

Table 1: One Year Net Survival Rates for All Cancers Excluding Non-Melanoma Skin Cancer:

Majority approve of legal marijuana

Announcement. Homework #2 due next Friday at 5pm. Midterm is in 2 weeks. It will cover everything through the end of next week (week 5).

THE DIVERSITY OF SAMPLES FROM THE SAME POPULATION

Section I: Multiple Choice Select the best answer for each question. a) 8 b) 9 c) 10 d) 99 e) None of these

Unit 7 Comparisons and Relationships

In 1987, Vermont introduced a 21-year-old drinking law which. prohibited alcohol use by those born on or after July 1, 1969, but allowed

+/ 4.0%. Strong oppose legalization. Value % 13.7% 6.8% 41.7% 36.3% 12.9% 5.9% 44.9%

Lecture 10: Chapter 5, Section 2 Relationships (Two Categorical Variables)

Omnibus Poll April 11-12, 2013

1 Correlates of Motor Vehicle Injuries: Analyses of the National Population Health Survey

Why nursing students should understand statistics. Objectives of lecture. Why Statistics? Not to put students off statistics!

I. Identifying the question Define Research Hypothesis and Questions

Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H

Lesson: A Ten Minute Course in Epidemiology

Math for Liberal Arts MAT 110: Chapter 5 Notes

Samples, Sample Size And Sample Error. Research Methodology. How Big Is Big? Estimating Sample Size. Variables. Variables 2/25/2018

Unit 1 Exploring and Understanding Data

Comparing multiple proportions

Stat Quiz 6 11/30/2012

Analysis of Categorical Data from the Ashe Center Student Wellness Survey

Test Bank for Privitera, Statistics for the Behavioral Sciences

More than Half Approve of the Sale of Marijuana Edibles

full file at

The random variable must be a numeric measure resulting from the outcome of a random experiment.

Driving While High: Facts and Public Attitudes

HOMEWORK 4 Due: next class 2/8

Exam 4 Review Exercises

Medical Statistics 1. Basic Concepts Farhad Pishgar. Defining the data. Alive after 6 months?

Assessment of the Safe Streets Treatment Options Program (SSTOP)

8.2 Relative Frequency

Chi-Square Goodness-of-Fit Test

Survey of U.S. Drivers about Marijuana, Alcohol, and Driving

Marijuana Users Think $10 per Gram is Reasonable

Constructing a Bivariate Table:

[POLS 4150] R, Randomization and Sampling

Demonstrating Client Improvement to Yourself and Others

Marijuana and driving in the United States: prevalence, risks, and laws

Drug-Impaired Driving in the United States

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion

Designing Psychology Experiments: Data Analysis and Presentation

Population. Sample. AP Statistics Notes for Chapter 1 Section 1.0 Making Sense of Data. Statistics: Data Analysis:

Age of Drinking Onset, Driving After Drinking, and Involvement in Alcohol Related Motor Vehicle Crashes

Reaction Times: A POGIL Introduction to the Nervous System

Chapter 01 What Is Statistics?

SPRING GROVE AREA SCHOOL DISTRICT. Course Description. Instructional Strategies, Learning Practices, Activities, and Experiences.

Statistical Methods Exam I Review

Table of Contents. Plots. Essential Statistics for Nursing Research 1/12/2017

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 1.1-1

Part 1. For each of the following questions fill-in the blanks. Each question is worth 2 points.

Popper If data follows a trend that is not linear, we cannot make a prediction about it.

V. Gathering and Exploring Data

Statistics and Probability

Statistics Assignment 11 - Solutions

Practice First Midterm Exam

REVIEW from Chapter 1 : Key Elements of a Statistical Problem

Recreational marijuana and collision claim frequencies

Interpreting Main Effects & Interactions with 2 x 2 Designs

The British Columbia Coroners Service is committed to conducting a thorough, independent examination of the factors contributing to death in order to

Section 3.2: Understanding and Interpreting Confidence Intervals Objective: estimating population parameters with sample statistics

Standard Deviation and Standard Error Tutorial. This is significantly important. Get your AP Equations and Formulas sheet

Biostatistics 513 Spring Homework 1 Key solution. See the Appendix below for Stata code and output related to this assignment.

Chapter 1. Picturing Distributions with Graphs

Team: Seat #: Name: Statistics Team Quiz 1 Explain each answer in one or more complete sentences.

Making Inferences from Experiments

Survey of Pennsylvanians on the Issue of the Swine Flu KEY FINDINGS REPORT

7. Bivariate Graphing

How to describe bivariate data

Popper If data follows a trend that is not linear, we cannot make a prediction about it. a. True b. False

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables

What type of graph should I use?

Survey of U.S. Adult Cigarette Smokers

Midterm Exam MMI 409 Spring 2009 Gordon Bleil

Chapter 8 Estimating with Confidence. Lesson 2: Estimating a Population Proportion

Confidence Intervals and Sampling Design. Lecture Notes VI

1 Version SP.A Investigate patterns of association in bivariate data

Topic 5 Day 3. Today's Agenda:

Statistics Success Stories and Cautionary Tales

MTH 225: Introductory Statistics

10. Introduction to Multivariate Relationships

Effect of Saxagliptin on Renal Outcomes in the SAVOR TIMI- 53 study- Appendixes:

Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA

Raid Preparedness for Medical Marijuana Dispensaries. Written by Cannabis Training University (CTU) All rights reserved

Bios 6648: Design & conduct of clinical research

In This Section An Introductory Example Obesity in America

A) I only B) II only C) III only D) II and III only E) I, II, and III

Page 1 PFR Note client s current age.

Transcription:

Packet'6:'Chi-Square'Test'for'Independence' Aftercompletingthismaterial,youshouldbeableto: calculatetheexpectedcountforanycellofacontingencytable. Textbookpages:18 24;624 631 findmarginaldistributionsforacontingencytableandusethosetoconstructasidedbydsidebargraph. computechidsquarecontributionsforanycellofacontingencytable. conductthechisdsquaretest(withtheaidofstatcrunchoutput)todetermineiftwocategoricalvariablesare related. The legalization of medicinal marijuana has been a hotly contested subject. A survey conducted in April 2015 was undertaken to investigate whether a relationship exists between feelings on the legalization of medicinal marijuana (for/against)andpoliticalparty(republican/democrat/independent). During this study, a total of 500 individuals were surveyed. For each American adult, what variables were recorded?arethesevariablescategoricalorquantitative? Becausetwovariableswererecordedforeachindividual,weneedawaytoorganizethisdata.Acontingency'table categorizes counts on two (or more) categorical variables in other words, this table summarizes the number of individualsinallpossiblecombinationsofcategories.wecansummarizetheresponsestothesurveyinthetablebelow: Legalization'of'Marijuana' Supports' Does'not'support' Political'Party' Democrat' Republican' Independent' Instead of looking at the counts, let s split the table into marginal( distributions that is, what percentage of each politicalpartysurveyedgaveeachofthetworesponses? '

page2 Inordertodetermineifthedifferencesaresignificant,weneedtoconductahypothesistest.One'can'never'simply' examine' sample' data' and' draw' some' conclusion' about' the' population' ' we' need' to' conduct' a' hypothesis' test' in' order'to'determine'if'the'results'are'significant. Whatisthegoalofthechi-square'test'for'independence? Ifwewantedtoconductthistestonthelegalizationofmarijuanadata,whathypotheses'wouldbetested? Inordertoconductahypothesistest,weneedsomequantitytocomparetheobservedcountsfromthesurveyto.This isreferredtoastheexpected'count(inotherwords,whatshouldwehaveobservedifthenullhypothesisweretrue). Fillinthetablebelowwiththeexpectedcounts. Political'Party' Observed'' Counts' Legalization'of'Marijuana' Supports' Does'not'support' Democrat' 116 84 Republican' 74 126 Independent' 59 41 Expected' Counts' Legalization'of'Marijuana' Supports' Does'not'support' Democrat' Political'Party' Republican' Independent' Whatdoyounoticewhentheobserved(toptable)andexpectedcounts(bottomtable)arecompared? ' STA205Notes Buckley Fall2016

page3 Thetest'statisticforthechiDsquaretestforindependencecomparestheobservedandexpectedcounts.Itsformulais thefollowing: Let slookathowthisteststatisticiscalculatedbygoingbacktothemarijuanaexample: What'is'the'Chi-Square'distribution?' HowisthechiDsquaredistributionusedtofindaprobability? In statistical inference, there are several common distributions used for inference. In addition to the normal distribution (which we have already used), the chidsquare distribution is also a common distribution used for inference. Thisformulawillbegivenon theformulasheet. Formula'Alert' Chi-Square Distn, df=3 0 2 4 6 8 10 12 14 Ingeneral,wewon tcalculatethechidsquareteststatistic(thecalculationcanbetedious)orthepdvalueassociatedwith thetest.instead,wewillrelyonstatcrunchoutputforourcalculations.let slookatthestatcrunchoutputforthe legalizationofmarijuanaexample: STA205Notes Buckley Fall2016

page4 Completetheappropriatehypothesistestusingasignificancelevelof0.05todetermineifpoliticalpartyandsupportof legalizationofmarijuanaarerelated. Example:All new drugs must go through a drug study before being approved by the FDA. A drug study typically includesclinicaltrialswherebyparticipantsarerandomizedtoreceivedifferentdosagesaswellasaplacebo.tocontrol asmanyfactorsaspossible,itisbesttoassignparticipantsrandomlyacrossthetreatments.arecentstudyforanew drugconsistedoftwodosages(10mg,20mg)andaplacebo.thosewhodesignedthestudywouldliketoknowifthe dosageassignedwasrelatedtotheparticipants gender.theresponsesaresummarizedinthestatcrunchoutputbelow: Findthemarginaldistributionforeachgenderbyfillinginthetablesbelow. Computetheexpectednumberoffemalesreceivingtheplacebo.Whatdoes thisquantitymean? Dosage' 10mg' 20mg' Placebo' Female' ' ' ' Dosage' 10mg' 20mg' Placebo' Male' ' ' ' Basedonthesedistributions,doyoubelievegender issomehowrelatedtodosage?explain. STA205Notes Buckley Fall2016

page5 Computethechi-square'contributionformaleparticipantswhoweregiven10mgofthedrug. UsingtheStatCrunchoutputbelow,conducttheappropriatetesttodetermineifthereisarelationshipbetweengender andthedosagereceived.useasignificancelevelof0.01. WhatassumptionsmustbesatisfiedforthechiDsquaretestofindependencetobevalid? STA205Notes Buckley Fall2016

page6 Example:' A sample of 1000 traffic crashes occurring in either Kentucky or OhiowasselectedfromtheNationalHighwaySafetyTrafficAdministration database.foreachcrash,itwasnotedwhetherornotalcoholwasinvolved in the accident. A reporter has questioned whether there is a relationship betweenalcoholinvolvementandthestateinwhichtheaccidentoccurred. TheinformationgatheredissummarizedintheStatCrunchoutputprovided. Computethenumberofaccidentsonewouldexpecttoinvolvealcoholin KYifthereisnorelationship. Fill in the tables below with the marginal distributions for each state. Then create a sidedbydside bar graph comparingthepercentagesforthetwostates. ' Alcohol yes Alcohol no KY ' ' ' Alcohol yes Alcohol no OH ' ' ' Conducttheappropriatetesttoaddresstheconjecturemadebythereporter.Useasignificancelevelof0.05. STA205Notes Buckley Fall2016