To review probability concepts, you should read Chapter 3 of your text. This handout will focus on Section 3.6 and also some elements of Section 13.3.

Size: px
Start display at page:

Download "To review probability concepts, you should read Chapter 3 of your text. This handout will focus on Section 3.6 and also some elements of Section 13.3."

Transcription

1 To review probability concepts, you should read Chapter 3 of your text. This handout will focus on Section 3.6 and also some elements of Section EXAMPLE: The table below contains the results of treatment for patients with different types of Hodgkin s disease. The data was collected by taking a random sample of 538 patients diagnosed with some form of Hodgkin s disease; therefore, both Type of Hodgkin s Disease and Response to Treatment are random variables. Type of Hodgkin s Disease Response to Treatment None Partial Positive ROW TOTALS Lymphocyte depletion Lymphocyte predominant Mixed cellularity Nodular schlerosis COLUMN TOTALS n 538 Questions: For a patient selected at random from these 538 Hodgkin s patients, find the probability that the patient 1. had a positive response. 2. had at least some response to the treatment. 3. had lymphocyte predominant Hodgkin s disease and had a positive response to treatment. 4. Had lymphocyte predominant or nodular schlerosis Hodgkin s disease. 1

2 Conditional Probability The probability of event A occurring given that event B has already occurred is usually denoted by P(A B). Recall the following formula: P(A B) P(A B) P(B) Once again, consider the Hodgkin s data. Type of Hodgkin s Disease Response to Treatment None Partial Positive ROW TOTALS Lymphocyte depletion Lymphocyte predominant Mixed cellularity Nodular schlerosis Questions: Find the following conditional probabilities: 1. P(Positive Response LD) 2. P(Partial Response LD) 3. P(No Response LD) 2

3 These conditional probabilities are easily seen using JMP. First, enter the data from the contingency table into JMP as shown in the data table below. Note: The frequencies/counts in the Freq column are interpreted by JMP as frequencies associated with the histological type/response categories. To obtain the contingency table and mosaic plot in JMP select Analyze > Fit Y by X and place Histological type in the X box and Response in the Y box. 3

4 EXAMPLE: A study was conducted in 1991 by the University of Wisconsin and the Wisconsin Department of Transportation in which linked police reports and hospital discharge records were used to assess, among other things, the risk for head injury for motorcyclists in motor-vehicle crashes. The data shown below can be used to examine the relationship between helmet use and whether brain injury was sustained in the accident. Brain Injury No Brain Injury Row Totals Helmet Worn No Helmet Column Totals Questions: 1. What is the probability that a motorcycle accident victim in Wisconsin suffered brain injury? 2. What is the probability that a motorcyclist involved in an accident was wearing a helmet? Can this be used to estimate the probability that a randomly sampled motorcyclist in WI wears a helmet? 3. What is the probability that a motorcyclist suffered brain injury given that they were wearing a helmet? 4. What is the probability that a motorcyclist not wearing a helmet suffered brain injury? 5. How many times more likely is a motorcyclist not wearing a helmet to sustain a brain injury? 4

5 RELATIVE RISK AND ODDS RATIOS Relative Risk: This is a measure of how much a particular risk factor influences the risk of a specified outcome. For the motorcycle helmet data, we calculate the relative risk as follows: P(Brain Injury NoHelmet Worn) Relative Risk P(Brain Injury Helmet Worn) Comments: 1. We interpret this number by saying that the risk of brain injury is times as large for those that do not wear helmets as for those that do wear helmets. 2. A relative risk value of 1.0 is the reference value for making comparisons. That is, a relative risk of 1.0 implies that there is no difference in the two probabilities (or risks). 3. The relative risk ratio is easily displayed in the mosaic plot: 4. If we had alternatively calculated the relative risk ratio as P(Brain Injury Helmet Worn) Relative Risk P(Brain Injury NoHelmet Worn) then the interpretation changes. Now, we say the risk of brain injury for those that do wear helmets is times as large as the risk of brain injury for those that do not wear helmets. 5

6 Odds Ratios The relative risk ratio is frequently used when investigating the relationship between two categorical variables. Although this quantity is relatively easy to calculate and interpret, statisticians often use another quantity known as an odds ratio in this situation. Before computing an odds ratio, we need to compute the odds: Odds: With counts given for two distinct response categories (Helmet Use or Non-use), the odds of a Yes versus a No is computed as the number of Yes events versus the number of No events for each group. You can also think of this as the probability that something is true divided by the probability that something is not true. Once again, consider our data. Brain Injury No Brain Injury Row Totals Helmet Worn No Helmet Column Totals Find the odds of sustaining a brain injury for both groups: Odds of Brain Injury for Wearing Helmet Number with Brain Injury in Helmet group Number with No Brain Injury in Helmet group Odds of Brain Injury for No Helmet Number with Brain Injury in No Helmet group Number with No Brain Injury in No Helmet group The odds ratio is simply the ratio of the odds for the two groups: Odds Ratio Odds of Brain Injury for No Helmet Odds of Brain Injury for Helmet The interpretation is that the odds of brain injury in the No Helmet group are times as high as the odds of brain injury in the Helmet group. 6

7 We could also have calculated the odds ratio as follows: Odds Ratio Odds of Brain Injury for Helmet Odds of Brain Injury for No Helmet The interpretation is that the odds of brain injury in the Helmet group are as large as those in the No Helmet group. Comments: 1. An odds ratio of 1.0 implies that there is no observable difference between the two odds. 2. The odds can also be visualized in the mosaic plot: 7

8 EXAMPLE: A case-control study was conducted to determine whether there was an increased risk of cervical cancer amongst women who had their first child before age 25. A sample of 49 women with cervical cancer was taken of which 42 had their first child before the age of 25. From a sample of 317 similar women without cervical cancer it was found that 203 of them had their first child before age 25. Do these data suggest that having a child at or before age 25 increases risk of cervical cancer? Cancer (Case) No cancer (Control) Row Totals Age Age > Column Totals Questions: 1. Why can t we meaningfully calculate P(disease risk factor status)? 2. Even though it is not appropriate to do so, calculate P(disease risk factor status) for both risk factor groups and compute the relative risk ratio. 3. Find the odds for disease for both risk factor groups, and find the odds ratio for having the disease associated with the risk factor being present. 8

9 Now, suppose that the data from the study were slightly different from what was observed in the above table. Note that we have simply doubled the number of controls, but kept the percent of controls falling in each age group constant. Cancer (Case) No cancer (Control) Row Totals Age Age > Column Totals Questions: 4. Even though it is not appropriate to do so, calculate P(disease risk factor status) for both risk factor groups and compute the relative risk ratio using the revised data. 5. With the revised data, find the odds for disease for both risk factor groups, and find the odds ratio for having the disease associated with the risk factor being present. 6. What do you notice? Why do you suppose the odds ratio is much more commonly used that the relative risk? 9

10 Comments on Odds Ratios and Relative Risk 1. When the disease is rare in the population being studied then there is little difference between the RR and OR, with the difference getting smaller the rarer the disease is. Thus, for many diseases RR OR, which makes it easier to discuss and interpret odds ratios because we can state the results in terms of how many times more likely something is rather than using a multiplicative statement in terms of the odds. 2. The most commonly cited advantage of the relative risk over the odds ratio is that the former has the more natural interpretation. The relative risk comes closer to what most people think of when they compare the relative likelihood of events. 10

INJURIES, DEATHS AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2013

INJURIES, DEATHS AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2013 Crash Outcome Data Evaluation System INJURIES, DEATHS AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2013 Wayne Bigelow Center for Health Systems Research and Analysis

More information

INJURIES, DEATHS AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2011

INJURIES, DEATHS AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2011 Crash Outcome Data Evaluation System INJURIES, DEATHS AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2011 Wayne Bigelow Center for Health Systems Research and Analysis

More information

INCIDENCE, HEALTH OUTCOMES AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2006

INCIDENCE, HEALTH OUTCOMES AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2006 Crash Outcome Data Evaluation System INCIDENCE, HEALTH OUTCOMES AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2006 Wayne Bigelow Center for Health Systems Research

More information

MORBIDITY, MORTALITY AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2010

MORBIDITY, MORTALITY AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2010 Crash Outcome Data Evaluation System MORBIDITY, MORTALITY AND COSTS RELATED TO MOTOR VEHICLE CRASHES IN WHICH ALCOHOL WAS A FACTOR, WISCONSIN, 2010 Wayne Bigelow Center for Health Systems Research and

More information

The Impact of Helmet Use and Alcohol on Traumatic Brain Injury Related Hospitalizations for Motorcycle Crashes in Wisconsin,

The Impact of Helmet Use and Alcohol on Traumatic Brain Injury Related Hospitalizations for Motorcycle Crashes in Wisconsin, The Impact of Helmet Use and Alcohol on Traumatic Brain Injury Related Hospitalizations for Motorcycle Crashes in Wisconsin, 1991-1998 Wayne Bigelow, M.S. Center for Heatlh Systems Research and Analysis

More information

5.3: Associations in Categorical Variables

5.3: Associations in Categorical Variables 5.3: Associations in Categorical Variables Now we will consider how to use probability to determine if two categorical variables are associated. Conditional Probabilities Consider the next example, where

More information

MULTIFACTOR DESIGNS Page Factorial experiments are more desirable because the researcher can investigate

MULTIFACTOR DESIGNS Page Factorial experiments are more desirable because the researcher can investigate MULTIFACTOR DESIGNS Page 1 I. Factorial Designs 1. Factorial experiments are more desirable because the researcher can investigate simultaneously two or more variables and can also determine whether there

More information

Unit 7 Comparisons and Relationships

Unit 7 Comparisons and Relationships Unit 7 Comparisons and Relationships Objectives: To understand the distinction between making a comparison and describing a relationship To select appropriate graphical displays for making comparisons

More information

INVESTIGATION SLEEPLESS DRIVERS

INVESTIGATION SLEEPLESS DRIVERS 50644_05_ch5_p407-482.qxd 5/10/05 12:08 PM Page 420 420 5.1 COMPARING TWO SAMPLES ON A CATEGORICAL RESPONSE retain the penny and the proportion of female students at this university who would vote to retain

More information

STATISTICS INFORMED DECISIONS USING DATA

STATISTICS INFORMED DECISIONS USING DATA STATISTICS INFORMED DECISIONS USING DATA Fifth Edition Chapter 4 Describing the Relation between Two Variables 4.1 Scatter Diagrams and Correlation Learning Objectives 1. Draw and interpret scatter diagrams

More information

Two-sample Categorical data: Measuring association

Two-sample Categorical data: Measuring association Two-sample Categorical data: Measuring association Patrick Breheny October 27 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 40 Introduction Study designs leading to contingency

More information

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics Psy201 Module 3 Study and Assignment Guide Using Excel to Calculate Descriptive and Inferential Statistics What is Excel? Excel is a spreadsheet program that allows one to enter numerical values or data

More information

Data that can be classified as belonging to a distinct number of categories >>result in categorical responses. And this includes:

Data that can be classified as belonging to a distinct number of categories >>result in categorical responses. And this includes: This sheets starts from slide #83 to the end ofslide #4. If u read this sheet you don`t have to return back to the slides at all, they are included here. Categorical Data (Qualitative data): Data that

More information

Making comparisons. Previous sessions looked at how to describe a single group of subjects However, we are often interested in comparing two groups

Making comparisons. Previous sessions looked at how to describe a single group of subjects However, we are often interested in comparing two groups Making comparisons Previous sessions looked at how to describe a single group of subjects However, we are often interested in comparing two groups Data can be interpreted using the following fundamental

More information

Unit 4 Probabilities in Epidemiology

Unit 4 Probabilities in Epidemiology BIOSTATS 540 Fall 2018 4. Probabilities in Epidemiology Page 1 of 23 Unit 4 Probabilities in Epidemiology All knowledge degenerates into probability - David Hume With just a little bit of thinking, we

More information

Binary Diagnostic Tests Paired Samples

Binary Diagnostic Tests Paired Samples Chapter 536 Binary Diagnostic Tests Paired Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary measures

More information

PubHlth Introductory Biostatistics Fall 2012 Examination II Choice A Unit 2 (Introduction to Probability) Due Monday October 8, 2012

PubHlth Introductory Biostatistics Fall 2012 Examination II Choice A Unit 2 (Introduction to Probability) Due Monday October 8, 2012 PubHlth 540 Fall 2012 Exam II Choice A Page 1 of 9 PubHlth 540 - Introductory Biostatistics Fall 2012 Examination II Choice A Unit 2 (Introduction to Probability) Due Monday October 8, 2012 This is Choice

More information

Simple Sensitivity Analyses for Matched Samples Thomas E. Love, Ph.D. ASA Course Atlanta Georgia https://goo.

Simple Sensitivity Analyses for Matched Samples Thomas E. Love, Ph.D. ASA Course Atlanta Georgia https://goo. Goal of a Formal Sensitivity Analysis To replace a general qualitative statement that applies in all observational studies the association we observe between treatment and outcome does not imply causation

More information

Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA

Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA PharmaSUG 2014 - Paper SP08 Methodology for Non-Randomized Clinical Trials: Propensity Score Analysis Dan Conroy, Ph.D., inventiv Health, Burlington, MA ABSTRACT Randomized clinical trials serve as the

More information

STAT 201 Chapter 3. Association and Regression

STAT 201 Chapter 3. Association and Regression STAT 201 Chapter 3 Association and Regression 1 Association of Variables Two Categorical Variables Response Variable (dependent variable): the outcome variable whose variation is being studied Explanatory

More information

6 Relationships between

6 Relationships between CHAPTER 6 Relationships between Categorical Variables Chapter Outline 6.1 CONTINGENCY TABLES 6.2 BASIC RULES OF PROBABILITY WE NEED TO KNOW 6.3 CONDITIONAL PROBABILITY 6.4 EXAMINING INDEPENDENCE OF CATEGORICAL

More information

The Severity of Pedestrian Injuries in Alcohol-Related Collisions

The Severity of Pedestrian Injuries in Alcohol-Related Collisions The Severity of Pedestrian Injuries in -Related Collisions AUTHORS: Stanley Sciortino, PhD Elyse Chiapello San Francisco Department of Public Health Community Health Education Section The California Statewide

More information

observational studies Descriptive studies

observational studies Descriptive studies form one stage within this broader sequence, which begins with laboratory studies using animal models, thence to human testing: Phase I: The new drug or treatment is tested in a small group of people for

More information

Measuring association in contingency tables

Measuring association in contingency tables Measuring association in contingency tables Patrick Breheny April 3 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 1 / 28 Hypothesis tests and confidence intervals Fisher

More information

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010

OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010 OCW Epidemiology and Biostatistics, 2010 David Tybor, MS, MPH and Kenneth Chui, PhD Tufts University School of Medicine October 27, 2010 SAMPLING AND CONFIDENCE INTERVALS Learning objectives for this session:

More information

Chapter 2 Planning Experiments

Chapter 2 Planning Experiments Some Standard Designs Page 1 of 9 Chapter 2 Planning Experiments Careful planning of an experiment is crucial for good analysis with required precision. 2.2 A Checklist for Planning Experiments A. Define

More information

Binary Diagnostic Tests Two Independent Samples

Binary Diagnostic Tests Two Independent Samples Chapter 537 Binary Diagnostic Tests Two Independent Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary

More information

Rhode Island. Data Sources:

Rhode Island. Data Sources: Data Sources: Multiple Cause of Death (MCOD) Files, 2009-2013, National Center for Health Statistics. The MCOD file is a census of all deaths in the U.S. and some territories. Five years data were combined

More information

1. To review research methods and the principles of experimental design that are typically used in an experiment.

1. To review research methods and the principles of experimental design that are typically used in an experiment. Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab Exercise Lab #7 (there was no Lab #6) Treatment for Depression: A Randomized Controlled Clinical Trial Objectives: 1. To review

More information

Lesson 11: Conditional Relative Frequencies and Association

Lesson 11: Conditional Relative Frequencies and Association Name date per Lesson 11: Conditional Relative Frequencies and Association Classwork After further discussion, the students involved in designing the superhero comic strip decided that before any decision

More information

Introduction. Lecture 1. What is Statistics?

Introduction. Lecture 1. What is Statistics? Lecture 1 Introduction What is Statistics? Statistics is the science of collecting, organizing and interpreting data. The goal of statistics is to gain information and understanding from data. A statistic

More information

Author's response to reviews

Author's response to reviews Author's response to reviews Title:Differences upon admission and in hospital course of children hospitalized pneumonia: a retrospective cohort study Authors: Raquel Simbalista (r.simbalista@terra.com.br)

More information

BMI 541/699 Lecture 16

BMI 541/699 Lecture 16 BMI 541/699 Lecture 16 Where we are: 1. Introduction and Experimental Design 2. Exploratory Data Analysis 3. Probability 4. T-based methods for continous variables 5. Proportions & contingency tables -

More information

2. METHODS OF DATA COLLECTION

2. METHODS OF DATA COLLECTION 2. METHODS OF DATA COLLECTION Proper data collection is important. Even sophisticated statistical analyses can t compensate for data with bias, ambiguity or errors. Some examples from Wainer, Palmer and

More information

Neuro-Oncology Practice

Neuro-Oncology Practice Neuro-Oncology Practice Neuro-Oncology Practice 2(4), 162 166, 2015 doi:10.1093/nop/npv030 Advance Access date 7 September 2015 Diagnostic tests: how to estimate the positive predictive value Annette M.

More information

Measuring association in contingency tables

Measuring association in contingency tables Measuring association in contingency tables Patrick Breheny April 8 Patrick Breheny Introduction to Biostatistics (171:161) 1/25 Hypothesis tests and confidence intervals Fisher s exact test and the χ

More information

Take a look at the three adult bears shown in these photographs:

Take a look at the three adult bears shown in these photographs: Take a look at the three adult bears shown in these photographs: Which of these adult bears do you think is most likely to be the parent of the bear cubs shown in the photograph on the right? How did you

More information

9.0 L '- ---'- ---'- --' X

9.0 L '- ---'- ---'- --' X 352 C hap te r Ten 11.0 10.5 Y 10.0 9.5 9.0 L...- ----'- ---'- ---'- --' 0.0 0.5 1.0 X 1.5 2.0 FIGURE 10.23 Interpreting r = 0 for curvilinear data. Establishing causation requires solid scientific understanding.

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multinominal Logistic Regression SPSS procedure of MLR Example based on prison data Interpretation of SPSS output Presenting

More information

Road safety. Tool 1 COMMUNITY TOOLS

Road safety. Tool 1 COMMUNITY TOOLS COMMUNITY TOOLS Tool 1 Road safety What do you see in these pictures? 1. A person on a bicycle wearing a helmet with the chinstrap fastened. 2. A woman in a car wearing her seatbelt while driving. 3. An

More information

THIS PROBLEM HAS BEEN SOLVED BY USING THE CALCULATOR. A 90% CONFIDENCE INTERVAL IS ALSO SHOWN. ALL QUESTIONS ARE LISTED BELOW THE RESULTS.

THIS PROBLEM HAS BEEN SOLVED BY USING THE CALCULATOR. A 90% CONFIDENCE INTERVAL IS ALSO SHOWN. ALL QUESTIONS ARE LISTED BELOW THE RESULTS. Math 117 Confidence Intervals and Hypothesis Testing Interpreting Results SOLUTIONS The results are given. Interpret the results and write the conclusion within context. Clearly indicate what leads to

More information

Statistics and Probability

Statistics and Probability Statistics and a single count or measurement variable. S.ID.1: Represent data with plots on the real number line (dot plots, histograms, and box plots). S.ID.2: Use statistics appropriate to the shape

More information

Unit 4 Probabilities in Epidemiology

Unit 4 Probabilities in Epidemiology BIOSTATS 540 Fall 2017 4. Probabilities in Epidemiology Page 1 of 19 Unit 4 Probabilities in Epidemiology All knowledge degenerates into probability - David Hume A weather report statement such as the

More information

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship

More information

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests

Objectives. Quantifying the quality of hypothesis tests. Type I and II errors. Power of a test. Cautions about significance tests Objectives Quantifying the quality of hypothesis tests Type I and II errors Power of a test Cautions about significance tests Designing Experiments based on power Evaluating a testing procedure The testing

More information

C-1: Variables which are measured on a continuous scale are described in terms of three key characteristics central tendency, variability, and shape.

C-1: Variables which are measured on a continuous scale are described in terms of three key characteristics central tendency, variability, and shape. MODULE 02: DESCRIBING DT SECTION C: KEY POINTS C-1: Variables which are measured on a continuous scale are described in terms of three key characteristics central tendency, variability, and shape. C-2:

More information

Chapter 15: Continuation of probability rules

Chapter 15: Continuation of probability rules Chapter 15: Continuation of probability rules Example: HIV-infected women attending either an infectious disease clinic in Bangkok were screened for high-risk HPV and received a Pap test; those with abnormal

More information

UNIT 5 - Association Causation, Effect Modification and Validity

UNIT 5 - Association Causation, Effect Modification and Validity 5 UNIT 5 - Association Causation, Effect Modification and Validity Introduction In Unit 1 we introduced the concept of causality in epidemiology and presented different ways in which causes can be understood

More information

Computer Science 101 Project 2: Predator Prey Model

Computer Science 101 Project 2: Predator Prey Model Computer Science 101 Project 2: Predator Prey Model Real-life situations usually are complicated and difficult to model exactly because of the large number of variables present in real systems. Computer

More information

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE

THE STATSWHISPERER. Introduction to this Issue. Doing Your Data Analysis INSIDE THIS ISSUE Spring 20 11, Volume 1, Issue 1 THE STATSWHISPERER The StatsWhisperer Newsletter is published by staff at StatsWhisperer. Visit us at: www.statswhisperer.com Introduction to this Issue The current issue

More information

Market Research on Caffeinated Products

Market Research on Caffeinated Products Market Research on Caffeinated Products A start up company in Boulder has an idea for a new energy product: caffeinated chocolate. Many details about the exact product concept are yet to be decided, however,

More information

8.2 Relative Frequency

8.2 Relative Frequency 8.2 Relative Frequency Essential Question: How can you recognize possible associations and trends between two categories of categorical data? Explore Relative Frequencies To show what portion of a data

More information

Drinking and Driving Laws

Drinking and Driving Laws Drinking and Driving Laws What Motorcyclists Should Know Ride Sober. 1-888-THE-TABC www.tabc.state.tx.us www.2young2drink.com 2 This booklet was designed to inform you about some of the penalties you could

More information

Analyzing Discrete Data

Analyzing Discrete Data Analyzing Discrete Data Kwang Woo Ahn, PhD Sponsored by the Clinical and Translational Science Institute (CTSI) and the Department of Population Health / Division of Biostatistics Speaker disclosure In

More information

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables

Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Chapter 3: Investigating associations between two variables Further Mathematics 2018 CORE: Data analysis Chapter 3 Investigating associations between two variables Extract from Study Design Key knowledge

More information

Proportions, risk ratios and odds ratios

Proportions, risk ratios and odds ratios Applied Biostatistics Proportions, risk ratios and odds ratios Martin Bland Professor of Health Statistics University of York http://www-users.york.ac.uk/~mb55/ Risk difference Difference between proportions:

More information

Bayes theorem describes the probability of an event, based on prior knowledge of conditions that might be related to the event.

Bayes theorem describes the probability of an event, based on prior knowledge of conditions that might be related to the event. Bayes theorem Bayes' Theorem is a theorem of probability theory originally stated by the Reverend Thomas Bayes. It can be seen as a way of understanding how the probability that a theory is true is affected

More information

Something to think about. What happens, however, when we have a sample with less than 30 items?

Something to think about. What happens, however, when we have a sample with less than 30 items? One-Sample t-test Remember In the last chapter, we learned to use a statistic from a large sample of data to test a hypothesis about a population parameter. In our case, using a z-test, we tested a hypothesis

More information

First Problem Set: Answers, Discussion and Background

First Problem Set: Answers, Discussion and Background First Problem Set: Answers, Discussion and Background Part I. Intuition Concerning Probability Do these problems individually Answer the following questions based upon your intuitive understanding about

More information

Sections 10.7 and 10.9

Sections 10.7 and 10.9 Sections 10.7 and 10.9 Timothy Hanson Department of Statistics, University of South Carolina Stat 205: Elementary Statistics for the Biological and Life Sciences 1 / 24 10.7 confidence interval for p 1

More information

Chapter 18: Categorical data

Chapter 18: Categorical data Chapter 18: Categorical data Labcoat Leni s Real Research The impact of sexualized images on women s self-evaluations Problem Daniels, E., A. (2012). Journal of Applied Developmental Psychology, 33, 79

More information

Lesson 9: Two Factor ANOVAS

Lesson 9: Two Factor ANOVAS Published on Agron 513 (https://courses.agron.iastate.edu/agron513) Home > Lesson 9 Lesson 9: Two Factor ANOVAS Developed by: Ron Mowers, Marin Harbur, and Ken Moore Completion Time: 1 week Introduction

More information

How to Conduct On-Farm Trials. Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona

How to Conduct On-Farm Trials. Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona How to Conduct On-Farm Trials Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona How can you determine whether a treatment (this might be an additive, a fertilizer, snake

More information

Chapter 19: Categorical outcomes: chi-square and loglinear analysis

Chapter 19: Categorical outcomes: chi-square and loglinear analysis Chapter 19: Categorical outcomes: chi-square and loglinear analysis Labcoat Leni s Real Research The impact of sexualized images on women s self-evaluations Problem Daniels, E., A. (2012). Journal of Applied

More information

CHAPTER ONE CORRELATION

CHAPTER ONE CORRELATION CHAPTER ONE CORRELATION 1.0 Introduction The first chapter focuses on the nature of statistical data of correlation. The aim of the series of exercises is to ensure the students are able to use SPSS to

More information

MIDTERM EXAM. Total 150. Problem Points Grade. STAT 541 Introduction to Biostatistics. Name. Spring 2008 Ismor Fischer

MIDTERM EXAM. Total 150. Problem Points Grade. STAT 541 Introduction to Biostatistics. Name. Spring 2008 Ismor Fischer STAT 541 Introduction to Biostatistics Spring 2008 Ismor Fischer Name MIDTERM EXAM Instructions: This exam is worth 150 points ( 1/3 of your course grade). Answer Problem 1, and any two of the remaining

More information

Population-based Patterns of HIV-related Hodgkin Lymphoma in the Greater San Francisco Bay Area:

Population-based Patterns of HIV-related Hodgkin Lymphoma in the Greater San Francisco Bay Area: Population-based Patterns of HIV-related Hodgkin Lymphoma in the Greater San Francisco Bay Area: 1988-98 Sally Glaser, Ph.D. Christina Clarke, Ph.D. Northern California Cancer Center Margaret Gulley, M.D.

More information

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics.

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics. Dae-Jin Lee dlee@bcamath.org Basque Center for Applied Mathematics http://idaejin.github.io/bcam-courses/ D.-J. Lee (BCAM) Intro to GLM s with R GitHub: idaejin 1/40 Modeling count data Introduction Response

More information

CHAPTER 8 EXPERIMENTAL DESIGN

CHAPTER 8 EXPERIMENTAL DESIGN CHAPTER 8 1 EXPERIMENTAL DESIGN LEARNING OBJECTIVES 2 Define confounding variable, and describe how confounding variables are related to internal validity Describe the posttest-only design and the pretestposttest

More information

Lessons in biostatistics

Lessons in biostatistics Lessons in biostatistics The test of independence Mary L. McHugh Department of Nursing, School of Health and Human Services, National University, Aero Court, San Diego, California, USA Corresponding author:

More information

Chapter 12: Introduction to Analysis of Variance

Chapter 12: Introduction to Analysis of Variance Chapter 12: Introduction to Analysis of Variance of Variance Chapter 12 presents the general logic and basic formulas for the hypothesis testing procedure known as analysis of variance (ANOVA). The purpose

More information

DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Research Methods Posc 302 ANALYSIS OF SURVEY DATA

DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Research Methods Posc 302 ANALYSIS OF SURVEY DATA DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Research Methods Posc 302 ANALYSIS OF SURVEY DATA I. TODAY S SESSION: A. Second steps in data analysis and interpretation 1. Examples and explanation

More information

Section 3.2 Least-Squares Regression

Section 3.2 Least-Squares Regression Section 3.2 Least-Squares Regression Linear relationships between two quantitative variables are pretty common and easy to understand. Correlation measures the direction and strength of these relationships.

More information

Section 6: Analysing Relationships Between Variables

Section 6: Analysing Relationships Between Variables 6. 1 Analysing Relationships Between Variables Section 6: Analysing Relationships Between Variables Choosing a Technique The Crosstabs Procedure The Chi Square Test The Means Procedure The Correlations

More information

STP226 Brief Class Notes Instructor: Ela Jackiewicz

STP226 Brief Class Notes Instructor: Ela Jackiewicz CHAPTER 2 Organizing Data Statistics=science of analyzing data. Information collected (data) is gathered in terms of variables (characteristics of a subject that can be assigned a numerical value or nonnumerical

More information

Biostatistics 2 - Correlation and Risk

Biostatistics 2 - Correlation and Risk BROUGHT TO YOU BY Biostatistics 2 - Correlation and Risk Developed by Pfizer January 2018 This learning module is intended for UK healthcare professionals only. PP-GEP-GBR-0957 Date of preparation Jan

More information

SHARED DECISION MAKING WORKSHOP SMALL GROUP ACTIVITY LUNG CANCER SCREENING ROLE PLAY

SHARED DECISION MAKING WORKSHOP SMALL GROUP ACTIVITY LUNG CANCER SCREENING ROLE PLAY SHARED DECISION MAKING WORKSHOP LUNG CANCER SCREENING ROLE PLAY Instructions Your group will role play a Shared Decision Making (SDM) conversation around lung cancer screening using the provided scenario.

More information

Comparing Proportions between Two Independent Populations. John McGready Johns Hopkins University

Comparing Proportions between Two Independent Populations. John McGready Johns Hopkins University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Geometry 13-4 Study Guide: Two-Way Tables (p )! Page 1 of 7

Geometry 13-4 Study Guide: Two-Way Tables (p )! Page 1 of 7 Page 1 of 7 Attendance Problems. A bag contains 4 red and 2 yellow marbles. A marble is selected, kept out of the bag, and another marble is selected. Find each conditional probability of selecting the

More information

Age of Drinking Onset, Driving After Drinking, and Involvement in Alcohol Related Motor Vehicle Crashes

Age of Drinking Onset, Driving After Drinking, and Involvement in Alcohol Related Motor Vehicle Crashes Title: Age of Drinking Onset, Driving After Drinking, and Involvement in Alcohol Related Motor Vehicle Crashes Author(s): Affiliation: Hingson, R., Heeren, T., Levenson, S., Jamanka, A., Voas, R. Boston

More information

Statistics Mathematics 243

Statistics Mathematics 243 Statistics Mathematics 243 Michael Stob February 2, 2005 These notes are supplementary material for Mathematics 243 and are not intended to stand alone. They should be used in conjunction with the textbook

More information

Getting the Design Right Daniel Luna, Mackenzie Miller, Saloni Parikh, Ben Tebbs

Getting the Design Right Daniel Luna, Mackenzie Miller, Saloni Parikh, Ben Tebbs Meet the Team Getting the Design Right Daniel Luna, Mackenzie Miller, Saloni Parikh, Ben Tebbs Mackenzie Miller: Project Manager Daniel Luna: Research Coordinator Saloni Parikh: User Interface Designer

More information

Errors and Corrections. to Behavioral Risk Management. by Hersh Shefrin

Errors and Corrections. to Behavioral Risk Management. by Hersh Shefrin Errors and Corrections to Behavioral Risk Management by Hersh Shefrin This file contains corrections and edits to Behavioral Risk Management which were identified after the book went to press. Corrections

More information

Migraine Dataset. Exercise 1

Migraine Dataset. Exercise 1 Migraine Dataset In December 2016 the company BM launched the app MigraineTracker. This app was developed to collect data from people suffering from migraine. Users are able to create a record in this

More information

Errors and Corrections. to Behavioral Risk Management. by Hersh Shefrin

Errors and Corrections. to Behavioral Risk Management. by Hersh Shefrin Errors and Corrections to Behavioral Risk Management by Hersh Shefrin This file contains corrections and edits to Behavioral Risk Management which were identified after the book went to press. Corrections

More information

Stat 13, Intro. to Statistical Methods for the Life and Health Sciences.

Stat 13, Intro. to Statistical Methods for the Life and Health Sciences. Stat 13, Intro. to Statistical Methods for the Life and Health Sciences. 0. SEs for percentages when testing and for CIs. 1. More about SEs and confidence intervals. 2. Clinton versus Obama and the Bradley

More information

Selection at one locus with many alleles, fertility selection, and sexual selection

Selection at one locus with many alleles, fertility selection, and sexual selection Selection at one locus with many alleles, fertility selection, and sexual selection Introduction It s easy to extend the Hardy-Weinberg principle to multiple alleles at a single locus. In fact, we already

More information

Delaware. Data Sources:

Delaware. Data Sources: Data Sources: Multiple Cause of Death (MCOD) Files, 2009-2013, National Center for Health Statistics. The MCOD file is a census of all deaths in the U.S. and some territories. Five years data were combined

More information

Basic Statistics and Data Analysis in Work psychology: Statistical Examples

Basic Statistics and Data Analysis in Work psychology: Statistical Examples Basic Statistics and Data Analysis in Work psychology: Statistical Examples WORK PSYCHOLOGY INTRODUCTION In this chapter we examine a topic which is given too little coverage in most texts of this kind,

More information

ADENIYI MOFOLUWAKE MPH APPLIED EPIDEMIOLOGY WEEK 5 CASE STUDY ASSIGNMENT APRIL

ADENIYI MOFOLUWAKE MPH APPLIED EPIDEMIOLOGY WEEK 5 CASE STUDY ASSIGNMENT APRIL ADENIYI MOFOLUWAKE MPH 510 - APPLIED EPIDEMIOLOGY WEEK 5 CASE STUDY ASSIGNMENT APRIL 4 2013 Question 1: What makes the first study a case-control study? The first case study is a case-control study because

More information

Stratified Tables. Example: Effect of seat belt use on accident fatality

Stratified Tables. Example: Effect of seat belt use on accident fatality Stratified Tables Often, a third measure influences the relationship between the two primary measures (i.e. disease and exposure). How do we remove or control for the effect of the third measure? Issues

More information

Address : S. 27th Street, Suite 201 Franklin, Wisconsin, WebSite : https://www.patienttrak.net/review-site-monitoring

Address : S. 27th Street, Suite 201 Franklin, Wisconsin, WebSite : https://www.patienttrak.net/review-site-monitoring Address : - 7441 S. 27th Street, Suite 201 Franklin, Wisconsin,53132 Email ID : info@patienttra.net Phone No : 888-766-2862 WebSite : https://www.patienttrak.net/review-site-monitoring There are so many

More information

Sanjay P. Zodpey Clinical Epidemiology Unit, Department of Preventive and Social Medicine, Government Medical College, Nagpur, Maharashtra, India.

Sanjay P. Zodpey Clinical Epidemiology Unit, Department of Preventive and Social Medicine, Government Medical College, Nagpur, Maharashtra, India. Research Methodology Sample size and power analysis in medical research Sanjay P. Zodpey Clinical Epidemiology Unit, Department of Preventive and Social Medicine, Government Medical College, Nagpur, Maharashtra,

More information

Statistics for Psychology

Statistics for Psychology Statistics for Psychology SIXTH EDITION CHAPTER 3 Some Key Ingredients for Inferential Statistics Some Key Ingredients for Inferential Statistics Psychologists conduct research to test a theoretical principle

More information

5 14.notebook May 14, 2015

5 14.notebook May 14, 2015 Objective: I can represent categorical data using a two way frequency table Entry: A marketing company is trying to determine how much diversity there is in the age of people who drink different soft drinks.

More information

Things you need to know about the Normal Distribution. How to use your statistical calculator to calculate The mean The SD of a set of data points.

Things you need to know about the Normal Distribution. How to use your statistical calculator to calculate The mean The SD of a set of data points. Things you need to know about the Normal Distribution How to use your statistical calculator to calculate The mean The SD of a set of data points. The formula for the Variance (SD 2 ) The formula for the

More information

WHATCOM COUNTY MEDICAL EXAMINER 1500 NORTH STATE STREET BELLINGHAM, WA ANNUAL REPORT

WHATCOM COUNTY MEDICAL EXAMINER 1500 NORTH STATE STREET BELLINGHAM, WA ANNUAL REPORT WHATCOM COUNTY MEDICAL EXAMINER NORTH STATE STREET BELLINGHAM, WA 9 ANNUAL REPORT During the Whatcom County Medical Examiner s office assumed jurisdiction in death investigations that involved forensic

More information

(true) Disease Condition Test + Total + a. a + b True Positive False Positive c. c + d False Negative True Negative Total a + c b + d a + b + c + d

(true) Disease Condition Test + Total + a. a + b True Positive False Positive c. c + d False Negative True Negative Total a + c b + d a + b + c + d Biostatistics and Research Design in Dentistry Reading Assignment Measuring the accuracy of diagnostic procedures and Using sensitivity and specificity to revise probabilities, in Chapter 12 of Dawson

More information

MATH Homework #3 Solutions. a) P(E F) b) P(E'UF') c) P(E' F') d) P(E F) a) P(E F) = P(E) * P(F E) P(E F) =.55(.20) =.11

MATH Homework #3 Solutions. a) P(E F) b) P(E'UF') c) P(E' F') d) P(E F) a) P(E F) = P(E) * P(F E) P(E F) =.55(.20) =.11 p. 04 3.8 Suppose P(E).55, P(F).40 and P(F E).20. Find a) P(E F) b) P(E'UF') c) P(E' F') d) P(E F) a) P(E F) P(E) * P(F E) P(E F).55(.20). b) P(E' F') P((E F)') P(E' F') P(E F) P(E' F')..89 c) P(E' F')

More information

Lesson 14: Association Between Categorical Variables

Lesson 14: Association Between Categorical Variables Lesson 14: Association Between Categorical Variables Classwork Example 1 Suppose a random group of people are surveyed about their use of smartphones. The results of the survey are summarized in the tables

More information