BMI 541/699 Lecture 16
|
|
- Allen Hopkins
- 6 years ago
- Views:
Transcription
1 BMI 541/699 Lecture 16 Where we are: 1. Introduction and Experimental Design 2. Exploratory Data Analysis 3. Probability 4. T-based methods for continous variables 5. Proportions & contingency tables - hypothesis test of a proportion - estimate and confidence interval for a proportion - χ 2 (Chi-square) goodness of fit test. - contingency tables (testing two proportions). - Mcnemar s test for paired binary data - odds ratios and relative risk - sensitivity and specificity - positive predictive value and negative predictive value - ROC curves 1 / 38
2 Relative Risk & Odd ratios Example An experiment is conducted to compare the proportion of low birth weight babies in mothers who smoke to those who don t. n 1 = 105 babies from smoking mothers are weighed at birth n 2 = 87 babies from non-smoking mothers are weighed at birth X 1 = number of low birth weight babies from mothers who smoke X 1 Binomial(n 1, p 1 ) X 2 = number of low birth weight babies from mothers who don t smoke We observe x 1 = 21 and x 2 = 9 X 2 Binomial(n 2, p 2 ) Mother Outcome smokes doesn t smoke low birth weight x 1 = 21 x 2 = 9 normal birth weight n 1 x 1 = 84 n 2 x 2 = 78 Total n 1 = 105 n 2 = 87 2 / 38
3 Mother Outcome smokes doesn t smoke low birth weight 21 9 normal birth weight Total Note that here the column sums are fixed by the experimental design but the row totals are not. ˆp 1 = 21/105 = ˆp 2 = 9/87 = We could estimate the difference between p 1 and p 2 but sometimes we are interested in estimating the ratio of proportions rather than the difference. 3 / 38
4 The Relative Risk (RR) is a ratio of two probabilities, both of the same event, but under different conditions. The relative risk of low birth weight for mothers who smoke relative to those who don t is RR = Pr(low birth weight mother smoked) Pr(low birth weight mother didn t smoke) RR = ˆp 1 /ˆp 2 =.2000/.1034 = So the risk of low birth weight babies is times as high for smoking mothers as for non-smoking. 4 / 38
5 The Odds Ratio The odds ratio is another way to compare probabilities. If E 1 = is the event that a low birth weight baby is born to a mother who smokes and Pr(E 1 ) = p 1. then the odds of event E 1 = Pr(E 1) Pr(!E 1 ) = Pr(E 1) 1 Pr(E 1 ) = p 1 1 p 1 If E 2 = the event that a low birth weight baby is born to a mother who doesn t smoke and Pr(E 2 ) = p 2 then the odds of event E 2 = Pr(E 2) Pr(!E 2 ) = p 2 1 p 2 5 / 38
6 The odds ratio of two events is the ratio of the odds for the events and is often denoted θ. The odds ratio for events with probabilities p 1 and p 2 is θ = p 1/(1 p 1 ) p 2 /(1 p 2 ) = p 1(1 p 2 ) p 2 (1 p 1 ) For our example ˆp 1 = 21/105 = ˆp 2 = 9/87 = and the estimate of the odds ratio for low birth weight babies born to smokers vs non smokers is: θ = ˆp 1(1 ˆp 2 ) ( ) = ˆp 2 (1 ˆp 1 ) ( ) = / 38
7 Comparing Relative Risk and Odds Ratios Relative risk and odds ratios are not identical, but are similar to one another. The exact relationship is: odds ratio = p 1 p 2 1 p 2 1 p 1 = relative risk 1 p 2 1 p 1 The odds ratio will be very close to the relative risk when p 1 and p 2 are both small. 7 / 38
8 The relative risk p 1 /p 2 is easier to interpret than the odds ratio. So why do we use the odds ratio? 1. The range of the RR is dependent on p 2. 0 RR 1/p 2 The range of the OR is always 0 to. 0 OR 2. There are situations (case-control studies) where an odds ratio can be calculated from data where relative risk cannot. 3. Logistic regression allows us to model the probability of an event as a function of multiple variables. The results can be summarized as odds ratios. 8 / 38
9 OR for case-control studies Turn the birth weight experiment around Let For the original experiment we sampled smoking mothers and non-smoking mothers Imagine a second experiment where we sample underweight babies and a matching control group of normal weight babies. (a case control study) n 3 = 30 low birth weight babies n 4 = 162 matched normal birth weight babies X 3 = # of mother s who smoke in the low birth weight baby group X 3 Binomial(n 3, p 3 ) X 4 = # of mother s who smoke in the normal birth weight baby group X 4 Binomial(n 4, p 4 ) 9 / 38
10 We observe x 3 = 21 and x 4 = 84 Mother Group smokes doesn t smoke total low birth weight x 3 = 21 9 n 3 = 30 normal birth weight x 4 = n 4 = 162 ˆp 3 = 21/30 = 0.70 ˆp 4 = 84/162 = The data from a case control study do not give us any information about the values of p 1 = the probability of a low birth weight baby for mothers who smoke. OR p 2 = the probability of a low birth weight baby for mothers who don t smoke. We also can t estimate p 1 /p 2 (the relative risk). 10 / 38
11 However we can estimate the odds ratio from a case control study since the odds ratio for the rows is the same as the odds ratio for the columns. Odds ratio for the rows is (odds ratio of smoking mothers for the two groups of babies) θ = ˆp 3(1 ˆp 4 ) 0.70( ) = (1 ˆp 3 )ˆp 4 (1 0.70)0.518 = Our estimate for the column odds ratio (odds ratio of low birth weight babies for the two groups of mothers) θ = ˆp 1(1 ˆp 2 ) ( ) = ˆp 2 (1 ˆp 1 ) ( ) = We get the same thing even though we can t estimate p 1 or p / 38
12 The distribution of the Relative Risk An estimate for RR is RR = ˆp 1 /ˆp 2 where ˆp i = x i /n i The distribution of log( RR) is approximately normal with mean = log(rr) variance = 1 p 1 n 1 p p 2 n 2 p 2 standard deviation = 1 p 1 n 1 p p 2 n 2 p 2 12 / 38
13 A confidence interval for the relative risk A 95% CI for log(rr): log( RR) ± ˆp 1 n 1 ˆp ˆp 2 n 2 ˆp 2 transform back to get corresponding CI for RR. 13 / 38
14 Odds ratio: Recall the odds ratio is the ratio of the odds in the two groups. OR = p 1/(1 p 1 ) p 2 /(1 p 2 ) Note that ˆp 1 (1 ˆp 1 ) = x 1/n 1 1 x 1 /n 1 = x 1 n 1 x 1 The estimate for the OR is ÔR = ˆp 1/(1 ˆp 1 ) ˆp 2 /(1 ˆp 2 ) = x 1/(n 1 x 1 ) x 2 /(n 2 x 2 ) = x 1(n 2 x 2 ) x 2 (n 1 x 1 ) 14 / 38
15 Distribution of the Odds Ratio log(ôr) is approximately normally distributed. The expected value of log(ôr) is mean(log(ôr)) = log(or) The standard deviation of log(ôr) Var(log(ÔR)) = ( )) ( ( )) (log Var ˆp1 ˆp2 + Var log 1 ˆp 1 1 ˆp 2 = 1 n 1 p 1 (1 p 1 ) + 1 n 2 p 2 (1 p 2 ) = n 1 p 1 n 1 (1 p 1 ) n 2 p 2 n 2 (1 p 2 ) 15 / 38
16 If we estimate p 1 and p 2 using ˆp 1 = x 1 n 1 and ˆp 2 = x 2 we get: Var(log(ÔR)) = 1 x 1 + a 95% CI for log(or) is log(ôr) ± x n 1 x x n 1 x x 2 + n 2, 1 n 2 x 2 1 n 2 x 2 apply the inverse log to the endpoints of the CI to obtain the CI for OR. 16 / 38
17 Example: An experiment is conducted to compare the effectiveness of two anti-hypertensive medications. n 1 = 105 subjects are given drug 1 n 2 = 87 subjects are given drug 2. X 1 = number who improve on drug 1 Binomial(n 1, p 1 ) X 2 = number who improve on drug 2 Binomial(n 2, p 2 ) We observe x 1 = 71 and x 2 = 45 treatment group outcome drug 1 drug 2 improvement no improvement Total ˆp 1 = 71/( ) =.676 ˆp 2 = 45/( ) = / 38
18 treatment group outcome drug 1 drug 2 improvement a = 71 c = 45 no improvement b = 34 d = 42 ÔR = 71(42) 45(34) = 1.95 The 99% CI for log(or) is 1 log(1.95) ± = (.103, 1.438) 45 And the 99% CI for the odds ratio OR is ( e.103, e 1.438) = (0.90, 4.21) 18 / 38
19 For the example the estimate for relative risk is RR = ˆp 1 /ˆp 2 =.676/.517 = 1.31 and a 99% CI for log(rr) is.324 log(1.31) ± (.676) = (.051,.587) 87(.517) 99% CI for RR: (e.051, e.587 ) = (0.95, 1.80) 19 / 38
20 Summary of relative risk & odds ratio relative risk = (p 1 /p 2 ) odds ratio = p 1/(1 p 1 ) p 2 /(1 p 2 ) RR is easier to interpret but OR is used more often because: 1. The range of the RR is dependent on p 2. The range of the OR is always 0 to. 2. There are situations (case-control studies) where the odds ratio can be calculated from data but the relative risk cannot. 3. Logistic regression produces odds ratios Our estimate for the odds ratio is ÔR = ˆp 1(1 ˆp 2 ) (1 ˆp 1 )ˆp 2 where ˆp 1 = x 1 /n 1 ; ˆp 2 = x 2 /n 2 ; The 95% confidence interval for the log odds ratio is log(ôr) ± x n 1 x x n 2 x 2 transformed back to the OR scale using the inverse log. 20 / 38
21 Diagnostic Tests and 2 by 2 tables Example: Angiograms (the gold standard test for diagnosing stroke) have a slight risk of mortality (< 1%). Some investigators have attempted to use a PET scanner to detect stroke non-invasively. A simple random sample of n = 64 was taken from the group of patients who were thought to have had a stroke. Stroke (by angiogram) PET Frequency negative negative 21 negative positive 8 positive negative 3 positive positive / 38
22 The data in a 2 by 2 table: Stroke (by angiogram) positive negative Total PET test: positive negative Total There are a number of measures of the quality of the new diagnostic test. The sensitivity or true positive rate of a diagnostic test is the probability of a positive test result when the patient has the disease. That is, how sensitive is the test in detecting the disease. How likely is the test to be positive if you have the disease. Sensitivity = Pr(positive diagnostic test disease) 22 / 38
23 Recall: For any events E and F, Pr(E F ) = Pr(E&F )/Pr(F ) So: Sensitivity = Pr(positive diagnostic test disease) Pr(positive PET & had stroke) = Pr(had stroke) Data again: Stroke (by angiogram) positive negative Total PET test: positive negative Total Our best estimate for Pr(positive PET & had stroke) = 32/64. Our best estimate for Pr(had stroke) = 35/64. Sensitivity = 32/64 35/64 = 32/35 = 91% 23 / 38
24 Stroke (by angiogram) positive negative Total PET test: positive negative Total Another measure of the quality of a diagnostic test is the specificity or true negative rate. Specificity is the probability of a negative test result when the patient doesn t have the disease. That is, how specific is the test in detecting the disease. How likely is the test to be negative if you don t have the disease. Specificity = Pr(negative diagnostic test no disease) 24 / 38
25 Stroke (by angiogram) positive negative Total PET test: positive negative Total Specificity = Pr(negative PET didn t have stroke) = Pr(negative PET & no stroke)/pr(no stroke) Our best estimate for Pr(negative PET & no stroke) = 21/64. Our best estimate for Pr(no stroke) = 29/64. So Specificity = 21/64 = 21/29 = 72% 29/64 25 / 38
26 If we name the cells in the table: have don t have disease disease total test is positive TP = true positives FP = false positives total positives test is negative FN = false negatives TN = true negatives total negatives total total with total without disease disease Our estimates are: Sensitivity = true positive rate = number of true positives / total with disease = TP/(TP+FN) Specificity = true negative rate = number of true negatives / total without disease = TN/(TN+FP) 26 / 38
27 HIV test example: HIV present by Blood test positive negative Home test positive Home test negative Sensitivity = 72/(72+2) = 94%; Specificity = 157/(157+50) = 76% This could be a good screening test. There is only a 6% chance of a false negative when HIV is present (1-sensitivity). 27 / 38
28 Sensitivity/Specificity trade off For many diagnostic tests there is a cutoff value. The test is positive if the measurement is above the cutoff negative if the measurement is below the cutoff If the cutoff level is set too high there will be few false positives but many false negatives. Such a test would miss many individuals with the disease. (low sensitivity) If the cutoff level is set too low there will be few false negatives but many false positives. Such a test would indicate that many healthy individuals have the disease (low specificity). A good diagnostic test will discriminate well between those with and without the disease and a cutoff can be found that provides good sensitivity and good specificity. 28 / 38
29 Use in the clinical setting From a perspective of putting the diagnostic test to use in a clinical setting the predictive value measures can be more useful: The positive predictive value (PPV) of a diagnostic test is the probability of the patient having the disease given the diagnostic test is positive PPV = Pr(disease positive diagnostic test) The negative predictive value (NPV) of a diagnostic test is the probability of the patient not having the disease given the diagnostic test is negative NPV = Pr( no disease negative diagnostic test ) 29 / 38
30 Example: Pet stroke test We can calculate PPV and NPV from the same 2 by 2 table: Stroke (by angiogram) positive negative Total PET test: positive negative Total Positive predictive value = Pr(had stroke positive PET) = Pr(had stroke & positive PET)/Pr( positive PET) PPV = 32/64 = 32/40 = 80% 40/64 Negative predictive value = Pr(didn t have stroke negative PET) = Pr(no stroke & negative PET)/Pr(negative PET) NPV = 21/64 24/64 = 21/24 = 87% 30 / 38
31 Summary of calculations have don t have disease disease total test is positive TP = true positives FP = false positives total positives test is negative FN = false negatives TN = true negatives total negatives total total with total without disease disease Our estimates are: Sensitivity = true positive rate = number of true positives / total with disease = TP/(TP+FN) Specificity = true negative rate = number of true negatives / total without disease = TN/(TN+FP) Positive Predictive Value (PPV) = number of true positives / total positives = TP/(TP+FP) Negative Predictive Value (NPV) = number of true negatives / total negatives = TN/(TN+FN) 31 / 38
32 One sample vs. two PPV & NPV change when the probability of disease changes. Sensitivity & Specificity don t. If I increase the total number of negative angiograms without changing the proportion of positive and negative PET tests, PPV will go down and NPV will go up. Stroke (by Angiogram) Positive Negative Total PET test: positive negative Total PPV is now 32/112 = 29% (was 80%) NPP is now 210/213 = 99% (was 87%) 32 / 38
33 Sensitivity and specificity do not change because they condition on disease status. Stroke (by Angiogram) Positive Negative Total PET test: positive negative Total Sensitivity = 32/35 = 91% (same as before) Specificity = 210/290 = 72% (same as before) Sensitivity and Specificity can be estimated from two independent samples, one of disease positive individuals and one of disease negative individuals. PPV and NPV can only be estimated when the table is from a simple random sample of the population of interest (patients who may have had a stroke). 33 / 38
34 Comments on Diagnostic Testing A diagnostic test with low PPV may still be useful as an initial screening tool to be followed up with a test with higher PPV. PPV and NPV can only be estimated when the table is from a simple random sample of the population of interest. Sensitivity and Specificity can be estimated from two independent samples, one of individuals with the disease and one of individuals without the disease. 34 / 38
35 Sensitivity and Specificity trade off: For a diagnostic test based on a continuous variable, - Decreasing the cutoff point will increase sensitivity and decrease specificity. - Increasing the cutoff point will decrease sensitivity and increase specificity. A high false positive rate (low specificity) can be alarming and costly. Example: false positive mammography findings leads to unnecessary biopsy. Some diagnostic testing strategies aim to maximize sensitivity (minimize false negatives) at a cost of false positives in order to rule out disease with the highest possible probability. Example: screening of donated blood for Hepatitis. 35 / 38
36 Sensitivity, Specificity, PPV & NPV in R Commander. You will need the Rcmdr plugin EZR. To load EZR Install the package RcmdrPlugin.EZR into R if you have not already done so. You only need to do this once. See the instructions on the home page. Load EZR into R (must do this each time you need EZR): - Choose the Menu Tools Load Rcmdr plug-ins(s)... - Choose EZR and say yes when you are asked if you want to restart R commander. 36 / 38
37 Example: Stroke (by angiogram) positive negative Total PET test: positive negative Total R commander (EZR) menu: Statistical Analysis Accuracy of diagnostic test Accuracy of qualitative test. Enter the table counts and click OK. 37 / 38
38 The output: > epi.tests(.table, conf.level = 0.95) Disease positive Disease negative Total Test positive Test negative Total Point estimates and 95 % CIs: Estimation Lower CI Upper CI Apparent prevalence True prevalence Sensitivity Specificity Positive predictive value Negative predictive value Diagnstic accuracy Likelihood ratio of a positive test Likelihood ratio of a negative test / 38
Screening (Diagnostic Tests) Shaker Salarilak
Screening (Diagnostic Tests) Shaker Salarilak Outline Screening basics Evaluation of screening programs Where we are? Definition of screening? Whether it is always beneficial? Types of bias in screening?
More informationVarious performance measures in Binary classification An Overview of ROC study
Various performance measures in Binary classification An Overview of ROC study Suresh Babu. Nellore Department of Statistics, S.V. University, Tirupati, India E-mail: sureshbabu.nellore@gmail.com Abstract
More informationDiagnostic screening. Department of Statistics, University of South Carolina. Stat 506: Introduction to Experimental Design
Diagnostic screening Department of Statistics, University of South Carolina Stat 506: Introduction to Experimental Design 1 / 27 Ties together several things we ve discussed already... The consideration
More informationROC (Receiver Operating Characteristic) Curve Analysis
ROC (Receiver Operating Characteristic) Curve Analysis Julie Xu 17 th November 2017 Agenda Introduction Definition Accuracy Application Conclusion Reference 2017 All Rights Reserved Confidential for INC
More informationComparing Proportions between Two Independent Populations. John McGready Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationStatistics, Probability and Diagnostic Medicine
Statistics, Probability and Diagnostic Medicine Jennifer Le-Rademacher, PhD Sponsored by the Clinical and Translational Science Institute (CTSI) and the Department of Population Health / Division of Biostatistics
More informationMeasuring association in contingency tables
Measuring association in contingency tables Patrick Breheny April 3 Patrick Breheny University of Iowa Introduction to Biostatistics (BIOS 4120) 1 / 28 Hypothesis tests and confidence intervals Fisher
More informationMETHODS FOR DETECTING CERVICAL CANCER
Chapter III METHODS FOR DETECTING CERVICAL CANCER 3.1 INTRODUCTION The successful detection of cervical cancer in a variety of tissues has been reported by many researchers and baseline figures for the
More informationMeasuring association in contingency tables
Measuring association in contingency tables Patrick Breheny April 8 Patrick Breheny Introduction to Biostatistics (171:161) 1/25 Hypothesis tests and confidence intervals Fisher s exact test and the χ
More informationData that can be classified as belonging to a distinct number of categories >>result in categorical responses. And this includes:
This sheets starts from slide #83 to the end ofslide #4. If u read this sheet you don`t have to return back to the slides at all, they are included here. Categorical Data (Qualitative data): Data that
More informationReview. Imagine the following table being obtained as a random. Decision Test Diseased Not Diseased Positive TP FP Negative FN TN
Outline 1. Review sensitivity and specificity 2. Define an ROC curve 3. Define AUC 4. Non-parametric tests for whether or not the test is informative 5. Introduce the binormal ROC model 6. Discuss non-parametric
More informationDaniel Boduszek University of Huddersfield
Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Logistic Regression SPSS procedure of LR Interpretation of SPSS output Presenting results from LR Logistic regression is
More informationIntroduction to screening tests. Tim Hanson Department of Statistics University of South Carolina April, 2011
Introduction to screening tests Tim Hanson Department of Statistics University of South Carolina April, 2011 1 Overview: 1. Estimating test accuracy: dichotomous tests. 2. Estimating test accuracy: continuous
More information7/17/2013. Evaluation of Diagnostic Tests July 22, 2013 Introduction to Clinical Research: A Two week Intensive Course
Evaluation of Diagnostic Tests July 22, 2013 Introduction to Clinical Research: A Two week Intensive Course David W. Dowdy, MD, PhD Department of Epidemiology Johns Hopkins Bloomberg School of Public Health
More informationReflection Questions for Math 58B
Reflection Questions for Math 58B Johanna Hardin Spring 2017 Chapter 1, Section 1 binomial probabilities 1. What is a p-value? 2. What is the difference between a one- and two-sided hypothesis? 3. What
More informationTwo-sample Categorical data: Measuring association
Two-sample Categorical data: Measuring association Patrick Breheny October 27 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 40 Introduction Study designs leading to contingency
More informationINTRODUCTION TO MACHINE LEARNING. Decision tree learning
INTRODUCTION TO MACHINE LEARNING Decision tree learning Task of classification Automatically assign class to observations with features Observation: vector of features, with a class Automatically assign
More informationKnowledge Discovery and Data Mining. Testing. Performance Measures. Notes. Lecture 15 - ROC, AUC & Lift. Tom Kelsey. Notes
Knowledge Discovery and Data Mining Lecture 15 - ROC, AUC & Lift Tom Kelsey School of Computer Science University of St Andrews http://tom.home.cs.st-andrews.ac.uk twk@st-andrews.ac.uk Tom Kelsey ID5059-17-AUC
More informationComputer Models for Medical Diagnosis and Prognostication
Computer Models for Medical Diagnosis and Prognostication Lucila Ohno-Machado, MD, PhD Division of Biomedical Informatics Clinical pattern recognition and predictive models Evaluation of binary classifiers
More informationI got it from Agnes- Tom Lehrer
Epidemiology I got it from Agnes- Tom Lehrer I love my friends and they love me We're just as close as we can be And just because we really care Whatever we get, we share! I got it from Agnes She got it
More informationMeta-analysis of diagnostic research. Karen R Steingart, MD, MPH Chennai, 15 December Overview
Meta-analysis of diagnostic research Karen R Steingart, MD, MPH karenst@uw.edu Chennai, 15 December 2010 Overview Describe key steps in a systematic review/ meta-analysis of diagnostic test accuracy studies
More informationTypes of data and how they can be analysed
1. Types of data British Standards Institution Study Day Types of data and how they can be analysed Martin Bland Prof. of Health Statistics University of York http://martinbland.co.uk In this lecture we
More informationLecture 15 Chapters 12&13 Relationships between Two Categorical Variables
Lecture 15 Chapters 12&13 Relationships between wo Categorical ariables abulating and Summarizing able of Expected Counts Statistical Significance for wo-way ables Constructing & Assessing a wo-way able
More informationFinding Good Diagnosis Studies
Finding Good Diagnosis Studies n MESH Term Sensitivity and Specificity n Sensitivity prob that someone with disease will test positive (Pr [+ D]) true positive n Specificity prob that someone without the
More informationDaniel Boduszek University of Huddersfield
Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to Multinominal Logistic Regression SPSS procedure of MLR Example based on prison data Interpretation of SPSS output Presenting
More informationSections 10.7 and 10.9
Sections 10.7 and 10.9 Timothy Hanson Department of Statistics, University of South Carolina Stat 205: Elementary Statistics for the Biological and Life Sciences 1 / 24 10.7 confidence interval for p 1
More informationBiostatistics 513 Spring Homework 1 Key solution. See the Appendix below for Stata code and output related to this assignment.
Biostatistics 513 Spring 2010 Homework 1 Key solution * Question will be graded for points, the remaining question is credit/no credit. See the Appendix below for Stata code and output related to this
More informationCritical reading of diagnostic imaging studies. Lecture Goals. Constantine Gatsonis, PhD. Brown University
Critical reading of diagnostic imaging studies Constantine Gatsonis Center for Statistical Sciences Brown University Annual Meeting Lecture Goals 1. Review diagnostic imaging evaluation goals and endpoints.
More informationWeek 2 Video 3. Diagnostic Metrics
Week 2 Video 3 Diagnostic Metrics Different Methods, Different Measures Today we ll continue our focus on classifiers Later this week we ll discuss regressors And other methods will get worked in later
More informationProportions, risk ratios and odds ratios
Applied Biostatistics Proportions, risk ratios and odds ratios Martin Bland Professor of Health Statistics University of York http://www-users.york.ac.uk/~mb55/ Risk difference Difference between proportions:
More informationControlling Bias & Confounding
Controlling Bias & Confounding Chihaya Koriyama August 5 th, 2015 QUESTIONS FOR BIAS Key concepts Bias Should be minimized at the designing stage. Random errors We can do nothing at Is the nature the of
More informationSensitivity, Specificity, and Relatives
Sensitivity, Specificity, and Relatives Brani Vidakovic ISyE 6421/ BMED 6700 Vidakovic, B. Se Sp and Relatives January 17, 2017 1 / 26 Overview Today: Vidakovic, B. Se Sp and Relatives January 17, 2017
More informationSimple Sensitivity Analyses for Matched Samples Thomas E. Love, Ph.D. ASA Course Atlanta Georgia https://goo.
Goal of a Formal Sensitivity Analysis To replace a general qualitative statement that applies in all observational studies the association we observe between treatment and outcome does not imply causation
More informationROC Curves (Old Version)
Chapter 545 ROC Curves (Old Version) Introduction This procedure generates both binormal and empirical (nonparametric) ROC curves. It computes comparative measures such as the whole, and partial, area
More informationthe standard deviation (SD) is a measure of how much dispersion exists from the mean SD = square root (variance)
Normal distribution The normal distribution is also known as the Gaussian distribution or 'bell-shaped' distribution. It describes the spread of many biological and clinical measurements Properties of
More informationROC Curves. I wrote, from SAS, the relevant data to a plain text file which I imported to SPSS. The ROC analysis was conducted this way:
ROC Curves We developed a method to make diagnoses of anxiety using criteria provided by Phillip. Would it also be possible to make such diagnoses based on a much more simple scheme, a simple cutoff point
More informationMaking comparisons. Previous sessions looked at how to describe a single group of subjects However, we are often interested in comparing two groups
Making comparisons Previous sessions looked at how to describe a single group of subjects However, we are often interested in comparing two groups Data can be interpreted using the following fundamental
More informationContingency Tables Summer 2017 Summer Institutes 187
Contingency Tables 87 Overview ) Types of Variables ) Comparing () Categorical Variables Contingency (two-way) tables Tests 3) x Tables Sampling designs Testing for association Estimation of effects Paired
More informationMidterm Exam ANSWERS Categorical Data Analysis, CHL5407H
Midterm Exam ANSWERS Categorical Data Analysis, CHL5407H 1. Data from a survey of women s attitudes towards mammography are provided in Table 1. Women were classified by their experience with mammography
More information4. Model evaluation & selection
Foundations of Machine Learning CentraleSupélec Fall 2017 4. Model evaluation & selection Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr
More informationIntroduction to diagnostic accuracy meta-analysis. Yemisi Takwoingi October 2015
Introduction to diagnostic accuracy meta-analysis Yemisi Takwoingi October 2015 Learning objectives To appreciate the concept underlying DTA meta-analytic approaches To know the Moses-Littenberg SROC method
More informationGeneral Biostatistics Concepts
General Biostatistics Concepts Dongmei Li Department of Public Health Sciences Office of Public Health Studies University of Hawai i at Mānoa Outline 1. What is Biostatistics? 2. Types of Measurements
More informationSTATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012
STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION by XIN SUN PhD, Kansas State University, 2012 A THESIS Submitted in partial fulfillment of the requirements
More informationWelcome to this third module in a three-part series focused on epidemiologic measures of association and impact.
Welcome to this third module in a three-part series focused on epidemiologic measures of association and impact. 1 This three-part series focuses on the estimation of the association between exposures
More informationSupplemental Information
ARTICLE Supplemental Information This section contains additional detail about alternative scores that were considered for the M-CHAT-R/F (see Supplemental Appendix), as well as additional information
More informationStatistical questions for statistical methods
Statistical questions for statistical methods Unpaired (two-sample) t-test DECIDE: Does the numerical outcome have a relationship with the categorical explanatory variable? Is the mean of the outcome the
More informationSection 6: Analysing Relationships Between Variables
6. 1 Analysing Relationships Between Variables Section 6: Analysing Relationships Between Variables Choosing a Technique The Crosstabs Procedure The Chi Square Test The Means Procedure The Correlations
More informationsickness, disease, [toxicity] Hard to quantify
BE.104 Spring Epidemiology: Test Development and Relative Risk J. L. Sherley Agent X? Cause Health First, Some definitions Morbidity = Mortality = sickness, disease, [toxicity] Hard to quantify death Easy
More informationOverview. Goals of Interpretation. Methodology. Reasons to Read and Evaluate
Overview Critical Literature Evaluation and Biostatistics Ahl Ashley N. Lewis, PharmD, BCPS Clinical Specialist, Drug Information UNC Hospitals Background Review of basic statistics Statistical tests Clinical
More informationBiostatistics and Epidemiology Step 1 Sample Questions Set 2. Diagnostic and Screening Tests
Biostatistics and Epidemiology Step 1 Sample Questions Set 2 Diagnostic and Screening Tests 1. A rare disorder of amino acid metabolism causes severe mental retardation if left untreated. If the disease
More informationEvidence-Based Medicine: Diagnostic study
Evidence-Based Medicine: Diagnostic study What is Evidence-Based Medicine (EBM)? Expertise in integrating 1. Best research evidence 2. Clinical Circumstance 3. Patient values in clinical decisions Haynes,
More informationProbability Revision. MED INF 406 Assignment 5. Golkonda, Jyothi 11/4/2012
Probability Revision MED INF 406 Assignment 5 Golkonda, Jyothi 11/4/2012 Problem Statement Assume that the incidence for Lyme disease in the state of Connecticut is 78 cases per 100,000. A diagnostic test
More informationSurvey of Smoking, Drinking and Drug Use (SDD) among young people in England, Andrew Bryant
Survey of Smoking, Drinking and Drug Use (SDD) among young people in England, 2010 Andrew Bryant Newcastle University Institute of Health and Society Background Background Young people s drinking behaviour
More information(true) Disease Condition Test + Total + a. a + b True Positive False Positive c. c + d False Negative True Negative Total a + c b + d a + b + c + d
Biostatistics and Research Design in Dentistry Reading Assignment Measuring the accuracy of diagnostic procedures and Using sensitivity and specificity to revise probabilities, in Chapter 12 of Dawson
More informationSISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers
SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington
More informationPredictive Models for Healthcare Analytics
Predictive Models for Healthcare Analytics A Case on Retrospective Clinical Study Mengling Mornin Feng mfeng@mit.edu mornin@gmail.com 1 Learning Objectives After the lecture, students should be able to:
More informationANOVA. Thomas Elliott. January 29, 2013
ANOVA Thomas Elliott January 29, 2013 ANOVA stands for analysis of variance and is one of the basic statistical tests we can use to find relationships between two or more variables. ANOVA compares the
More informationDiagnostic Test. H. Risanto Siswosudarmo Department of Obstetrics and Gynecology Faculty of Medicine, UGM Jogjakarta. RS Sardjito
ب س م الل ه الر ح م ن الر ح يم RS Sardjito Diagnostic Test Gold standard New (test Disease No Disease Column Total Posi*ve a b a+b Nega*ve c d c+d Row Total a+c b+d N H. Risanto Siswosudarmo Department
More informationComparing Two ROC Curves Independent Groups Design
Chapter 548 Comparing Two ROC Curves Independent Groups Design Introduction This procedure is used to compare two ROC curves generated from data from two independent groups. In addition to producing a
More informationUnit 4 Probabilities in Epidemiology
BIOSTATS 540 Fall 2018 4. Probabilities in Epidemiology Page 1 of 23 Unit 4 Probabilities in Epidemiology All knowledge degenerates into probability - David Hume With just a little bit of thinking, we
More informationHayden Smith, PhD, MPH /\ v._
Hayden Smith, PhD, MPH.. + /\ v._ Information and clinical examples provided in presentation are strictly for educational purposes, and should not be substituted for clinical guidelines or up-to-date medical
More informationComparing Proportions between Two Independent Populations. John McGready Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationCHL 5225 H Advanced Statistical Methods for Clinical Trials. CHL 5225 H The Language of Clinical Trials
CHL 5225 H Advanced Statistical Methods for Clinical Trials Two sources for course material 1. Electronic blackboard required readings 2. www.andywillan.com/chl5225h code of conduct course outline schedule
More informationNORTH SOUTH UNIVERSITY TUTORIAL 1
NORTH SOUTH UNIVERSITY TUTORIAL 1 REVIEW FROM BIOSTATISTICS I AHMED HOSSAIN,PhD Data Management and Analysis AHMED HOSSAIN,PhD - Data Management and Analysis 1 DATA TYPES/ MEASUREMENT SCALES Categorical:
More informationAppendix: Instructions for Treatment Index B (Human Opponents, With Recommendations)
Appendix: Instructions for Treatment Index B (Human Opponents, With Recommendations) This is an experiment in the economics of strategic decision making. Various agencies have provided funds for this research.
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 10: Introduction to inference (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 17 What is inference? 2 / 17 Where did our data come from? Recall our sample is: Y, the vector
More informationExample - Birdkeeping and Lung Cancer - Interpretation. Lecture 20 - Sensitivity, Specificity, and Decisions. What do the numbers not mean...
Odds Ratios Example - Birdkeeping and Lung Cancer - Interpretation Lecture 20 - Sensitivity, Specificity, and Decisions Sta102 / BME102 Colin Rundel April 16, 2014 Estimate Std. Error z value Pr(> z )
More informationIntroduction. We can make a prediction about Y i based on X i by setting a threshold value T, and predicting Y i = 1 when X i > T.
Diagnostic Tests 1 Introduction Suppose we have a quantitative measurement X i on experimental or observed units i = 1,..., n, and a characteristic Y i = 0 or Y i = 1 (e.g. case/control status). The measurement
More informationBinary Diagnostic Tests Paired Samples
Chapter 536 Binary Diagnostic Tests Paired Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary measures
More informationSEM: the precision of the mean of the sample in predicting the population parameter and is a way of relating the sample mean to population mean
1999b(9)/1997a(14)/1995b(17): What is meant by 95% confidence interval? Explain the practical applications of CIs and indicate why they may be preferred to P values General: 95% CI defines the range of
More informationIntroduction. Lecture 1. What is Statistics?
Lecture 1 Introduction What is Statistics? Statistics is the science of collecting, organizing and interpreting data. The goal of statistics is to gain information and understanding from data. A statistic
More information3. Model evaluation & selection
Foundations of Machine Learning CentraleSupélec Fall 2016 3. Model evaluation & selection Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr
More informationBIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA
BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA PART 1: Introduction to Factorial ANOVA ingle factor or One - Way Analysis of Variance can be used to test the null hypothesis that k or more treatment or group
More informationSample Size Considerations. Todd Alonzo, PhD
Sample Size Considerations Todd Alonzo, PhD 1 Thanks to Nancy Obuchowski for the original version of this presentation. 2 Why do Sample Size Calculations? 1. To minimize the risk of making the wrong conclusion
More informationPsy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics
Psy201 Module 3 Study and Assignment Guide Using Excel to Calculate Descriptive and Inferential Statistics What is Excel? Excel is a spreadsheet program that allows one to enter numerical values or data
More informationLesson: A Ten Minute Course in Epidemiology
Lesson: A Ten Minute Course in Epidemiology This lesson investigates whether childhood circumcision reduces the risk of acquiring genital herpes in men. 1. To open the data we click on File>Example Data
More informationLecture 5. Contingency /incidence tables Sensibility, specificity Relative Risk Odds Ratio CHI SQUARE test
Lecture 5 Contingency /incidence tables Sensibility, specificity Relative Risk Odds Ratio CHI SQUARE test Contingency tables - example Factor 2 Present + Absent - Total Factor 1 Present + a b a+b Absent
More informationLearning Objectives 9/9/2013. Hypothesis Testing. Conflicts of Interest. Descriptive statistics: Numerical methods Measures of Central Tendency
Conflicts of Interest I have no conflict of interest to disclose Biostatistics Kevin M. Sowinski, Pharm.D., FCCP Last-Chance Ambulatory Care Webinar Thursday, September 5, 2013 Learning Objectives For
More information9/4/2013. Decision Errors. Hypothesis Testing. Conflicts of Interest. Descriptive statistics: Numerical methods Measures of Central Tendency
Conflicts of Interest I have no conflict of interest to disclose Biostatistics Kevin M. Sowinski, Pharm.D., FCCP Pharmacotherapy Webinar Review Course Tuesday, September 3, 2013 Descriptive statistics:
More informationBioengineering and World Health. Lecture Twelve
Bioengineering and World Health Lecture Twelve Four Questions What are the major health problems worldwide? Who pays to solve problems in health care? How can technology solve health care problems? How
More informationScreening for Disease
Screening for Disease An Ounce of Prevention is Worth a Pound of Cure. Actually, an ounce of prevention is better than a pound of cure, but if prevention hasn t been effective, perhaps early identification
More informationModeling Binary outcome
Statistics April 4, 2013 Debdeep Pati Modeling Binary outcome Test of hypothesis 1. Is the effect observed statistically significant or attributable to chance? 2. Three types of hypothesis: a) tests of
More informationWeek 2 Video 2. Diagnostic Metrics, Part 1
Week 2 Video 2 Diagnostic Metrics, Part 1 Different Methods, Different Measures Today we ll focus on metrics for classifiers Later this week we ll discuss metrics for regressors And metrics for other methods
More informationMeta Analysis. David R Urbach MD MSc Outcomes Research Course December 4, 2014
Meta Analysis David R Urbach MD MSc Outcomes Research Course December 4, 2014 Overview Definitions Identifying studies Appraising studies Quantitative synthesis Presentation of results Examining heterogeneity
More informationMethodologicOverview of Screening Studies
MethodologicOverview of Screening Studies Diana L. Miglioretti, PhD University of California Davis Thanks to William Black, MD for many of these slides! 1/11/17 RSNA CTMW 2017 1 Learning Objectives Understand
More informationModule Overview. What is a Marker? Part 1 Overview
SISCR Module 7 Part I: Introduction Basic Concepts for Binary Classification Tools and Continuous Biomarkers Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington
More informationSTA 3024 Spring 2013 EXAM 3 Test Form Code A UF ID #
STA 3024 Spring 2013 Name EXAM 3 Test Form Code A UF ID # Instructions: This exam contains 34 Multiple Choice questions. Each question is worth 3 points, for a total of 102 points (there are TWO bonus
More informationImproved Hepatic Fibrosis Grading Using Point Shear Wave Elastography and Machine Learning
Improved Hepatic Fibrosis Grading Using Point Shear Wave Elastography and Machine Learning Presenter: Hersh Sagreiya 1, M.D. Authors: Alireza Akhbardeh 1, Ph.D., Isabelle Durot 1, M.D., Carlo Filice 2,
More informationAnalysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach
University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School November 2015 Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach Wei Chen
More informationPostanalytical phase
Postanalytical phase Test request POSTANALYTICAL Result interpretation PHASE Result Sampling Black box: the lab And the RESULT is created The technician approves the result; it is transferred to the lab
More informationCreative Commons Attribution-NonCommercial-Share Alike License
Author: Brenda Gunderson, Ph.D., 05 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution- NonCommercial-Share Alike 3.0 Unported License:
More informationFigure 1: Design and outcomes of an independent blind study with gold/reference standard comparison. Adapted from DCEB (1981b)
Page 1 of 1 Diagnostic test investigated indicates the patient has the Diagnostic test investigated indicates the patient does not have the Gold/reference standard indicates the patient has the True positive
More informationPackage CompareTests
Type Package Package CompareTests February 6, 2017 Title Correct for Verification Bias in Diagnostic Accuracy & Agreement Version 1.2 Date 2016-2-6 Author Hormuzd A. Katki and David W. Edelstein Maintainer
More informationZheng Yao Sr. Statistical Programmer
ROC CURVE ANALYSIS USING SAS Zheng Yao Sr. Statistical Programmer Outline Background Examples: Accuracy assessment Compare ROC curves Cut-off point selection Summary 2 Outline Background Examples: Accuracy
More informationMAESTRO TRIAL FINAL RESULTS. Gisela L.G. Menezes, MD, PhD
MAESTRO TRIAL FINAL RESULTS Gisela L.G. Menezes, MD, PhD MAESTRO Angiogenesis Metastasis How does OA work? Malignant Benign How does OA work? D O How does OA work? Optoacoustic Imaging Fibroadenoma IDC
More informationTwo-Way Independent ANOVA
Two-Way Independent ANOVA Analysis of Variance (ANOVA) a common and robust statistical test that you can use to compare the mean scores collected from different conditions or groups in an experiment. There
More informationBias and confounding. Mads Kamper-Jørgensen, associate professor, Section of Social Medicine
Bias and confounding Mads Kamper-Jørgensen, associate professor, maka@sund.ku.dk PhD-course in Epidemiology l 7 February 2017 l Slide number 1 The world according to an epidemiologist Exposure Outcome
More informationSection D. Another Non-Randomized Study Design: The Case-Control Design
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationBiosta's'cs Board Review. Parul Chaudhri, DO Family Medicine Faculty Development Fellow, UPMC St Margaret March 5, 2016
Biosta's'cs Board Review Parul Chaudhri, DO Family Medicine Faculty Development Fellow, UPMC St Margaret March 5, 2016 Review key biosta's'cs concepts Understand 2 X 2 tables Objec'ves By the end of this
More information