Lecture 1: Measuring Disease Occurrence: Prevalence, incidence, incidence density

Similar documents
Introduction. We can make a prediction about Y i based on X i by setting a threshold value T, and predicting Y i = 1 when X i > T.

Incorporating External Data into Existing Longitudinal Studies

Reflection Questions for Math 58B

Name: (First) (Middle) ( Last)

NUTRITION & MORTALITY INDICATORS IN THE CADRE HARMONISÉ. Olutayo Adeyemi

Person-years; number of study participants (number of cases) HR (95% CI) P for trend

DATA UPDATE: CANCER INCIDENCE IN DAKOTA AND WASHINGTON COUNTIES

DATA UPDATE: CANCER INCIDENCE IN DAKOTA AND WASHINGTON COUNTIES

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results.

PubHlth Introductory Biostatistics Practice Test I (Without Unit 3 Questions)

Kitchen Assistant III Curriculum

Comparison And Application Of Methods To Address Confounding By Indication In Non- Randomized Clinical Studies

Measures of association: comparing disease frequencies. Outline. Repetition measures of disease occurrence. Gustaf Edgren, PhD Karolinska Institutet

Immunization Safety Office: Overview and Considerations on Safety of Alternative Immunization Schedules

One-Way ANOVAs t-test two statistically significant Type I error alpha null hypothesis dependant variable Independent variable three levels;

Cohort studies. Hans Wolff. Service d épidémiologie Clinique, Département de médecine communautaire

Hypothesis Testing. Richard S. Balkin, Ph.D., LPC-S, NCC

BMI 541/699 Lecture 16

Estimating HIV incidence in the United States from HIV/AIDS surveillance data and biomarker HIV test results

Epidemiology: Overview of Key Concepts and Study Design. Polly Marchbanks

Development of criteria for the selection of markers for use in nutrition research: Follow-up of the ILSI Europe Marker Validation Initiative

ASSOCIATION & CAUSATION IN EPIDEMIOLOGICAL STUDIES. Dr. Sireen Alkhaldi Community Medicine, 2016/ 2017 The University of Jordan

Slide 1. Slide 2. Slide 3. Behavioral Research Chapter 10. Simple designs. Factorial design. Complex Experimental Designs

Biostatistics for Med Students. Lecture 1

EPIDEMIOLOGY STUDY DESIGN AND DATA ANALYSIS

Confidence Intervals On Subsets May Be Misleading

LA SIERRA UNIVERSITY Department of Health and Exercise Science Winter 2016

Lecture 12A: Chapter 9, Section 1 Inference for Categorical Variable: Confidence Intervals

Probability Models for Sampling

Executive summary. 9 Executive summary

Maintenance of weight loss and behaviour. dietary intervention: 1 year follow up

LA SIERRA UNIVERSITY Department of Health and Exercise Science Fall 2011

Chapter 8 Experimental Design

Modifying effects of dietary polyunsaturated fatty acid (PUFA) on levels of cholesterol and their implications for heart health

Please refer to the specific section of the ecqm to identify the QDM data elements and associated value sets for use in reporting this ecqm.

Poisson regression. Dae-Jin Lee Basque Center for Applied Mathematics.

Dual-energy X-ray absorptiometry (DXA), body composition assessment 62

Appendix 1 (as supplied by the authors): Supplementary data

DEPARTMENT OF HEALTH AND HUMAN SERVICES DIVISION OF PUBLIC HEALTH MANDY COHEN, MD, MPH SECRETARY

BAYESIAN JOINT LONGITUDINAL-DISCRETE TIME SURVIVAL MODELS: EVALUATING BIOPSY PROTOCOLS IN ACTIVE-SURVEILLANCE STUDIES

Behavioral Data Mining. Lecture 4 Measurement

Epidemiology of weak associations The case of nutrition and cancer. Paolo Boffetta Icahn School of Medicine at Mount Sinai New York NY

A point estimate is a single value that has been calculated from sample data to estimate the unknown population parameter. s Sample Standard Deviation

Comparing Means among Two (or More) Independent Populations. John McGready Johns Hopkins University

A novel approach to estimation of the time to biomarker threshold: Applications to HIV

Confidence interval and hypothesis testing examples

Trial Designs. Professor Peter Cameron

Lecture Outline Biost 517 Applied Biostatistics I. Statistical Goals of Studies Role of Statistical Inference

Chapter 19. Confidence Intervals for Proportions. Copyright 2010 Pearson Education, Inc.

Math 243 Sections , 6.1 Confidence Intervals for ˆp

Cancer incidence: life table risk versus

Creative Commons Attribution-NonCommercial-Share Alike License

A point estimate is a single value that has been calculated from sample data to estimate the unknown population parameter. s Sample Standard Deviation

Cancer Data Review ( ) Selected Zip Codes of Warminster, Warrington, and Horsham, PA

Health Information Technology: Comprehensive Telehealth Interventions to Improve Diet Among Patients with Chronic Diseases

GATE: Graphic Appraisal Tool for Epidemiology picture, 2 formulas & 3 acronyms

Biostatistics 2 - Correlation and Risk

Chapter 11: Advanced Remedial Measures. Weighted Least Squares (WLS)

Supplementary Table 1. Association of rs with risk of obesity among participants in NHS and HPFS

Cohort studies. Alfredo Morabia

Is dairy good for you?

SUPPLEMENTARY MATERIAL

Risk Ratio and Odds Ratio

Nutrition and MND. Dr Chris McDermott

Estimating U.S. national vaccination coverage using IIS Sentinel Site data

Design and Analysis of a Cancer Prevention Trial: Plans and Results. Matthew Somerville 09 November 2009

Advanced ANOVA Procedures

Georgina Salas. Topics EDCI Intro to Research Dr. A.J. Herrera

What is Multilevel Modelling Vs Fixed Effects. Will Cook Social Statistics

Executive Summary. This report provides the findings of a ten-month study requested by the Maryland

Estimating the impact of community-level interventions: The SEARCH Trial and HIV Prevention in Sub-Saharan Africa

ADVANCED ANOVA APPLICATIONS

The following are questions that students had difficulty with on the first three exams.

Title: A Prospective Study of Dietary Selenium Intake and Risk of Type 2 Diabetes

DETERMINANTS OF HEALTH HHD Unit 3 AOS 1 Pg 61-76

The 2015 Dutch food-based dietary guidelines:

Lecture 2 Carbohydrates

STA Learning Objectives. What is Population Proportion? Module 7 Confidence Intervals for Proportions

STA Module 7 Confidence Intervals for Proportions

Chapter 19. Confidence Intervals for Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

A Critique of Two Methods for Assessing the Nutrient Adequacy of Diets

Individual Lab Report Ci-Trol Nov,2016. Abnormal Fibrinogen (mg/dl) Abnormal Fbg Control - Lot# LFC Your Lab

Some Comments on the Relation Between Reliability and Statistical Power

Mary Emmett, PhD, FACHE Center for Health Services and Outcomes Research

Types of Statistics. Censored data. Files for today (June 27) Lecture and Homework INTRODUCTION TO BIOSTATISTICS. Today s Outline

SECTION 4. Nutrition and Disease in the Japan Collaborative Cohort Study for Evaluation of Cancer (JACC) Hiroyasu Iso, Yoshimi Kubota.

Chapter 8 Estimating with Confidence

DIETARY REFERENCE INTAKES (DRIS) FOR MONGOLIANS

Epidemiology of Lassa Fever

Age 18 years and older BMI 18.5 and < 25 kg/m 2

Section Editor Steven T DeKosky, MD, FAAN Kenneth E Schmader, MD

9. Interpret a Confidence level: "To say that we are 95% confident is shorthand for..

How many Cases Are Missed When Screening Human Populations for Disease?

DIETARY ASSESSMENT OF PREGNANT SICKLERS. Kalpana Jadhav* & Saroj Zanjhal**

Epidemiologic study designs

MIDTERM EXAM. Total 150. Problem Points Grade. STAT 541 Introduction to Biostatistics. Name. Spring 2008 Ismor Fischer

L2, Important properties of epidemics and endemic situations

Announcement. Homework #2 due next Friday at 5pm. Midterm is in 2 weeks. It will cover everything through the end of next week (week 5).

Age 18 years and older BMI 18.5 and < 25 kg/m 2

Epidemiologic Methods I & II Epidem 201AB Winter & Spring 2002

Transcription:

Lecture 1: Measuring Disease Occurrence: Prevalence, incidence, incidence density Dankmar Böhning Southampton Statistical Sciences Reserch Institute University of Southampton, UK 2-4 March 2015

Outline Purpose The purpose of this material is to provide an overview on the most important measures of disease occurrence: prevalence incidence (cumulative incidence or risk) incidence density Examples The concepts will be illustrated with examples and practicals.

Outline Measuring Disease Occurrence: Prevalence Measuring Disease Occurrence: Incidence Measuring Disease Occurrence: Incidence Density

Measuring Disease Occurrence: Prevalence Measuring Disease Occurrence: Prevalence Prevalence: is the proportion (denoted as p) of a specific population having a particular disease. p is a number between 0 and 1. If multiplied by 100 it is percentage. Examples In a population of 1000 there are two cases of malaria: p = 2/1000 = 0.002 or 0.2%. In a population of 10,000 there are 4 cases of skin cancer: p = 4/10, 000 = 0.0004 or 0.04%.

Measuring Disease Occurrence: Prevalence Measuring Disease Occurrence: Prevalence epidemiological terminology In epidemiology, disease occurrence is frequently small relative to the population size. Therefore, the proportion figures are multiplied by an appropriate number such as 10,000. In the above second example, we have a prevalence of 4 per 10,000 persons. Exercise In a county with 2300 inhabitant there have occurred 2 cases of leukemia. Prevalence?

Measuring Disease Occurrence: Prevalence Quantitative Aspects: What is Variance and Confidence Interval for the Prevalence! sample: sample (population survey) of size n provides for disease status for each unit of the sample: X i = 1, disease present X i = 0, disease not present consequently, ˆp = X 1 + X 2 +... + X n n n i=1 = X i n plausible estimator of prevalence.

Measuring Disease Occurrence: Prevalence Computing Variance of Prevalence of ˆp: consequently, p(1 p) Var (ˆp) = n p(1 p) SD(ˆp) = n

Measuring Disease Occurrence: Prevalence ˆp is approx. normal

Measuring Disease Occurrence: Prevalence using the normal distribution for ˆp: with 95% probability 2 ˆp p SD(ˆp) +2 ˆp 2SD(ˆp) p ˆp + 2SD(ˆp) 95%CI : ˆp ± 2SD(ˆp) = ˆp ± 2 ˆp(1 ˆp)/ n

Measuring Disease Occurrence: Prevalence Examples In a population of 1000 there are two cases of malaria: p = 2/1000 = 0.002 or 0.2%. Var(ˆp) = 0.002(1 0.002)/1000 = (0.00141280) 2, SD(ˆp) = 0.00141280 95%CI : ˆp ± 2 ˆp(1 ˆp)/ n = 0.002 ± 2 0.0014 = (0 0.0048)

Measuring Disease Occurrence: Prevalence Exercise In a county with 2300 inhabitants there have occurred 2 cases of leukemia. Prevalence with CI?

Measuring Disease Occurrence: Incidence Measuring Disease Occurrence: Incidence Incidence: is the proportion (denoted as I ) of a specific, disease-free population developing a particular disease in a specific study period. I is a number between 0 and 1. If multiplied by 100 it is percentage. Examples In a malaria-free population of 1000 there are four new cases of malaria within one year : I = 4/1000 = 0.004 or 0.4%. In a skin-cancer free population of 10,000 there are 11 new cases of skin cancer: I = 11/10, 000 = 0.0011 or 0.11%.

Measuring Disease Occurrence: Incidence Measuring Disease Occurrence: Incidence Exercise In a rural county with 2000 children within pre-school age there have occurred 15 new cases of leukemia within 10 years. Incidence?

Measuring Disease Occurrence: Incidence Quantitative Aspects: How to determine Variance and Confidence Interval for the Incidence? sample (population cohort - longitudinal) of size n, which is initially disease-free, provides the disease status for each unit of the sample at the end of study period: consequently, X i = 1, new case X i = 0, disease not present Î = X 1 + X 2 +... + X n n plausible estimator of incidence. = n i=1 X i n

Measuring Disease Occurrence: Incidence Computing Variance of Incidence ) Var (Î = I (1 I ) n I (1 I ) SD(Î ) = n

Measuring Disease Occurrence: Incidence Î is approx. normal

Measuring Disease Occurrence: Incidence 95% confidence interval for the incidence density with 95% probability 2 Î I SD(Î ) +2 Î 2SD(Î ) I Î + 2SD(Î ) 95%CI : Î ± 2SD(Î ) = Î ± 2 Î (1 Î )/ n

Measuring Disease Occurrence: Incidence Examples In a malaria-free population of 1000 there are four new cases of malaria within one year : I = 4/1000 = 0.004 or.4%. Var(Î ) = 0.004(1 0.004)/1000 = (0.001996)2, SD(Î ) = 0.001996 95%CI : Î ± 2 Î (1 Î )/ n = 0.004 ± 2 0.001996 = (0.000008 0.0080)

Measuring Disease Occurrence: Incidence Exercise In a rural county with 2000 children within pre-school age there have occurred 15 new cases of leukemia within 10 years. Incidence with 95% CI?

Measuring Disease Occurrence: Incidence Density Measuring Disease Occurrence: Incidence Density Incidence Density: is the rate (denoted as ID) of a specific, disease-free population developing a particular disease w. r. t. a specific study period of length T. ID is a positive number, but not necessarily between 0 and 1. estimating incidence density suppose a disease-free population of size n is under risk for a time period T. Then a plausible estimator of ID is given as ÎD = n i=1 X i n T count of events = person time where X i = 1 if for person i disease occurs and 0 otherwise.

Measuring Disease Occurrence: Incidence Density Examples A cohort study is conducted to evaluate the relationship between dietary fat intake and the development in prostate cancer in men. In the study, 100 men with high fat diet are compared with 100 men who are on low fat diet. Both groups start at age 65 and are followed for 10 years. During the follow-up period, 10 men in the high fat intake group are diagnosed with prostate cancer and 5 men in the low fat intake group develop prostate cancer. The incidence density is ÎD = 10/(1, 000) = 0.01 in the high fat intake group and ÎD = 5/(1, 000) = 0.005 in the low fat intake group.

Measuring Disease Occurrence: Incidence Density most useful generalization occurs if persons are different times under risk and hence contributing differently to the person time denominator estimating incidence density with different risk-times suppose a disease-free population of size n is under risk for a time periods T 1, T 2,..., T n, respectively. Then a plausible estimator of ID is given as ÎD = n i=1 X i n i=1 T i = count of events person time where X i = 1 if for person i disease occurs and 0 otherwise, and T i represents the person-time of person i in the study period.

Measuring Disease Occurrence: Incidence Density Examples Consider a population of n = 5 factory workers with X 2 = 1 and all other X i = 0(here the disease incidence might be a lung disease). We have also T 1 = 12, T 2 = 2,T 3 = 6,T 4 = 12,T 5 = 5, so that ÎD = 1 12 + 2 + 6 + 12 + 5 = 1/37.

Measuring Disease Occurrence: Incidence Density interpretation of incidence density: In the above example of diet-cancer study: ÎD = 0.01 means what? There is no longer the interpretation of 1 case per 100 men, but 1 case per 100 men-years! The interpretation is now number of events per person time!

Measuring Disease Occurrence: Incidence Density Quantitative Aspects for the Incidence Density sample (population cohort - longitudinal) of size n available: event indicators: X 1,..., X n estimate of incidence density person times: T 1,..., T n ÎD = X 1 + X 2 +... + X n T 1 + T 2 +... + T n = X T a variance estimate can be found as Var(ÎD) = ÎD T = X T 2

Measuring Disease Occurrence: Incidence Density Quantitative Aspects for the Incidence Density variance estimate can be found as Var(ÎD) = ÎD T = X T 2 so that a 95% confidence interval is given as ÎD ÎD ± 2 T

Measuring Disease Occurrence: Incidence Density Example Consider the population of n = 5 factory workers with X 2 = 1 and all other X i = 0 (here the disease incidence might be a lung disease). We have X = 1 and T = 37, so that ÎD = 1/37 = 0.027. The variance is ÎD T = 0.0007 and standard deviation 0.027. This leads to a 95% CI ÎD ÎD ± 2 = 0.027 ± 2 0.027 = (0, 0.081). T

Measuring Disease Occurrence: Incidence Density Exercise We return to the cohort study mentioned before. It had been conducted to evaluate the relationship between dietary fat intake and the development in prostate cancer in men. In the study, 100 men with high fat diet are compared with 100 men who are on low fat diet. Both groups start at age 65 and are followed for 10 years. During the follow-up period, 10 men in the high fat intake group are diagnosed with prostate cancer and 5 men in the low fat intake group develop prostate cancer. Compute 95% CI for incidence densities: high fat intake group: ÎD = 10/(1, 000) = 0.01 low fat intake group: ÎD = 5/(1, 000) = 0.005