Outlier Exercises (Section 2.13) 16(b), 20(b), 23(b), 33


Outlier Exercises

Suggested Reading: Pages n/a (not much reading here; the phrase "outlier" was thrown at you a few times already, but we delved into it in class more thoroughly)
Problems: Pages 90-103 (Section 2.13) 16(b), 20(b), 23(b), 33

Scores of countries involved in the 8th grade 2007 TIMSS (Trends in International Mathematics and Science Study) testing:

Algeria    387   Colombia     380   Hong Kong  572   Korea      597   Palestine     367   Singapore  593
Armenia    499   Cyprus       465   Hungary    517   Kuwait     354   Qatar         307   Slovenia   501
Australia  496   Czech        504   Indonesia  397   Lebanon    449   Romania       461   Sweden     491
Bahrain    398   Egypt        391   Iran       403   Lithuania  506   Russia        512   Thailand   441
Bosnia     456   El Salvador  340   Israel     463   Malaysia   474   SAR           395   Tunisia    420
Botswana   364   England      513   Italy      480   Malta      488   Saudi Arabia  329   Turkey     432
Bulgaria   464   Georgia      410   Japan      570   Norway     469   Scotland      487   Ukraine    462
China      610   Ghana        309   Jordan     427   Oman       372   Serbia        486   US         508

E1. Draw (use your TI; then just sketch it) a histogram of the data, and comment on its shape.

Identify any outliers in the data if
E2. you use the "outside of two standard deviations" rule.
E3. you use the IQR rule.

E4. Does it worry you that you got two different answers? Why or why not?

E5. Draw a boxplot of the data. Does it need to be modified? Why or why not?

Consider the following graphic:

E6. Do any states use an unusual amount of gas per capita (that is, on average per resident)? How did you decide?

Remember our discussion about margins of error from a few lessons back? Good! They can also be used to identify outliers, as per the "outside of two standard deviations" rule. We'll study this in great depth in MTH 244, but, for those of you not making that journey, I felt it important enough to tackle the idea of margins of error a couple of times in MTH 243.

Many of you have heard about clinical trials such as the one referenced in the following headline:

In the study, it is stated: "For each additional two hours of TV viewing per day, the risk of type 2 diabetes, cardiovascular disease, and premature mortality increased by 20, 15, and 13 percent respectively." From my own digging I found the risk of these three outcomes in the general population: 7.8%, 4.1%, and 0.2%, respectively (from the CDC).

Let's look at the type 2 diabetes number, and see just why it's so staggeringly high. For those who watch 2 hours of TV per day, it's claimed that their type 2 diabetes rate is around 28% (the general population's 7.8% plus 20 percentage points). Now, since the study was based on a random sample, their 28% must carry a margin of error (MOE), correct? I couldn't track the study down, but let's assign it a generous 5% margin of error (much larger than it most likely actually is). We'll assign the same MOE to the CDC's statistics, even though I know, for sure, they're much smaller, and then we'll create confidence intervals by adding said MOE to, and subtracting it from, said statistics: 2.8% to 12.8% for the general population, and 23% to 33% for the 2 hours of TV per day group.

Looking at these confidence intervals, you can see, quite clearly, that even the lowest value of the 2 hours of TV per day interval is higher than the largest value of the general population's interval. Remember, too, that these intervals represent a spread of two standard deviations. Thus, we can safely say that, since the two hours per day interval is well outside the range of the general population's, these rates are outliers when viewed in comparison, and thus deserve another look as to why they are the way they are. (However, one must be careful about saying things like, "If I watch 2 more hours of TV, then my rate goes up blah, blah percent." These are averages and, more importantly, correlations. Chances are, those who watch more TV per day are also, for example, more sedentary than those who don't, and that, as far as I can tell, isn't cross-referenced in the study.)

E7. Verify that the rates for cardiovascular disease and premature death (quoted in the article) are outliers when compared to the general population's rates. Use a 5% margin of error like we did above.

E8. Comment on the following statement, made by the senior author of the study: "The message is simple. Cutting back on TV watching can significantly reduce risk of type 2 diabetes, heart disease, and premature mortality."
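For anyone who would rather check the type 2 diabetes comparison above with a few lines of code than on a number line, here is a minimal Python sketch. The function names interval and clearly_separated are made up for illustration, the 5-point MOE is the generous assumption from the discussion above, and 7.8 and 28 are the rates quoted there. It only tests whether the two intervals overlap; it says nothing about why.

```python
def interval(rate, moe=5.0):
    """Return (low, high) for rate +/- moe, both in percentage points.

    The default moe of 5.0 is the assumed margin of error from the text,
    not a value reported by the study or the CDC.
    """
    return (rate - moe, rate + moe)


def clearly_separated(a, b):
    """Return True when intervals a and b do not overlap at all."""
    return a[1] < b[0] or b[1] < a[0]


# Rates quoted in the discussion above (percent).
general = interval(7.8)    # general population, type 2 diabetes
tv_group = interval(28.0)  # 2 hours of TV per day group

print("General population:", general)
print("2 hours of TV per day:", tv_group)
print("No overlap:", clearly_separated(general, tv_group))
```

The same two functions can be reused for E7 by swapping in the cardiovascular disease and premature death rates.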

Answers.

E1. Slightly skewed, but overall approximately bell shaped, unimodal, and symmetric (yours might look slightly different; I couldn't find my TI, so I used another program to generate it).

[Histogram: frequency (0 to 9) versus data (291.850 to 625.150)]

E2. For this data, the range within two standard deviations of the mean runs from about 304 to 600. Using this range, China's score of 610 is a high outlier. There are no low outliers.

E3. The IQR for this data is 103, so 1.5 × IQR is 154.5, so the high fence is 654, and the low fence is 242. No data points are outliers when viewed in this way.

E4. Well, since there's no universal definition of what an outlier is: no, I'm not worried.

E5. No need to modify, as the IQR method doesn't yield outliers.

[Boxplot: minimum 307, Q1 396, median 462, Q3 500, maximum 610]

E6. Can't wait to see what you come up with!

E7. Well, let's see them!

E8. Check out the message in parentheses above E7.
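If you want to double-check the E2 and E3 answers without a TI, the sketch below applies both rules to the TIMSS scores using Python's standard statistics module. Two choices here are assumptions, not something stated in the handout: the sample standard deviation is used for the two-standard-deviations rule, and quartiles are computed with method="inclusive", which happens to reproduce the IQR of 103 quoted above. Other quartile conventions can give slightly different fences.

```python
import statistics

# 8th grade 2007 TIMSS scores, copied from the exercise table.
scores = {
    "Algeria": 387, "Colombia": 380, "Hong Kong": 572, "Korea": 597,
    "Palestine": 367, "Singapore": 593, "Armenia": 499, "Cyprus": 465,
    "Hungary": 517, "Kuwait": 354, "Qatar": 307, "Slovenia": 501,
    "Australia": 496, "Czech": 504, "Indonesia": 397, "Lebanon": 449,
    "Romania": 461, "Sweden": 491, "Bahrain": 398, "Egypt": 391,
    "Iran": 403, "Lithuania": 506, "Russia": 512, "Thailand": 441,
    "Bosnia": 456, "El Salvador": 340, "Israel": 463, "Malaysia": 474,
    "SAR": 395, "Tunisia": 420, "Botswana": 364, "England": 513,
    "Italy": 480, "Malta": 488, "Saudi Arabia": 329, "Turkey": 432,
    "Bulgaria": 464, "Georgia": 410, "Japan": 570, "Norway": 469,
    "Scotland": 487, "Ukraine": 462, "China": 610, "Ghana": 309,
    "Jordan": 427, "Oman": 372, "Serbia": 486, "US": 508,
}
values = sorted(scores.values())


def outside(low, high):
    """Return the countries whose score falls below low or above high."""
    return {c: s for c, s in scores.items() if s < low or s > high}


# Two-standard-deviations rule (sample standard deviation assumed).
mean = statistics.mean(values)
sd = statistics.stdev(values)
sd_low, sd_high = mean - 2 * sd, mean + 2 * sd
sd_outliers = outside(sd_low, sd_high)

# IQR rule (inclusive quartile method assumed).
q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1
low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
iqr_outliers = outside(low_fence, high_fence)

print("Two-SD range:", round(sd_low, 1), "to", round(sd_high, 1))
print("Two-SD outliers:", sd_outliers)
print("IQR:", iqr, "fences:", low_fence, "to", high_fence)
print("IQR outliers:", iqr_outliers)
```

The two rules disagree about China because the two-standard-deviations range is narrower than the fences for this data set, which is the point of E4.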

Outlier Quizzes

Quiz 1. Refer back to E7 and E8 above.

1. (8 points) Answer E7 using correctly constructed confidence intervals (2 points each) drawn correctly on a number line (2 points each), as was done for type 2 diabetes, for each of cardiovascular disease and premature death. Be sure to label each interval so I can tell them apart (one efficient way of doing this is to take a snip of mine and edit it in MS Paint; here's a video). Use a 5% MOE for each.
2. (2 points) Anything wrong with what the author stated?

Quiz 2. Consider the following dataset:

30 171 184 201 212 250 265 270 272 289
305 306 322 322 336 346 351 370 390 404
409 411 436 437 439 441 444 448 451 453
470 480 482 487 494 495 499 503 514 521
522 527 548 550 559 560 570 572 574 578
585 592 592 607 616 618 621 629 637 638
640 656 668 707 709 719 737 739 752 758
766 792 792 794 802 818 830 832 843 858
860 869 918 925 953 991 1000 1005 1068 1441

1. (2 points) Begin by creating a histogram of these data (use the Excel Calculator, for sure!). Take a screenshot and include it!
2. (4 points: 1 point for telling me which method, 3 points for identifying outliers) We learned different methods for finding outliers in class. Using one of these methods, find any outliers in this data set, if there are any (if there aren't, state so). Make sure I can follow what you do: show me your "jazz hands" ranges!
3. (4 points) Justify the choice of the method you used in number 2.