Diagnostic Test. H. Risanto Siswosudarmo Department of Obstetrics and Gynecology Faculty of Medicine, UGM Jogjakarta. RS Sardjito

Similar documents
Statistics, Probability and Diagnostic Medicine

Screening for Disease

7/17/2013. Evaluation of Diagnostic Tests July 22, 2013 Introduction to Clinical Research: A Two week Intensive Course

ROC (Receiver Operating Characteristic) Curve Analysis

Screening (Diagnostic Tests) Shaker Salarilak

Questionnaire design. Questionnaire Design: Content. Questionnaire Design. Questionnaire Design: Wording. Questionnaire Design: Wording OUTLINE

Finding Good Diagnosis Studies

Predictive Models for Healthcare Analytics

Evidence-Based Medicine: Diagnostic study

BMI 541/699 Lecture 16

The cross sectional study design. Population and pre-test. Probability (participants). Index test. Target condition. Reference Standard

METHODS FOR DETECTING CERVICAL CANCER

Introduction to Epidemiology Screening for diseases

Biosta's'cs Board Review. Parul Chaudhri, DO Family Medicine Faculty Development Fellow, UPMC St Margaret March 5, 2016

OCW Epidemiology and Biostatistics, 2010 Michael D. Kneeland, MD November 18, 2010 SCREENING. Learning Objectives for this session:

Week 2 Video 3. Diagnostic Metrics

Review. Imagine the following table being obtained as a random. Decision Test Diseased Not Diseased Positive TP FP Negative FN TN

Knowledge Discovery and Data Mining. Testing. Performance Measures. Notes. Lecture 15 - ROC, AUC & Lift. Tom Kelsey. Notes

Sensitivity, Specificity, and Relatives

Critical reading of diagnostic imaging studies. Lecture Goals. Constantine Gatsonis, PhD. Brown University

Diagnostic tests, Laboratory tests

Clinical Decision Analysis

For more information about how to cite these materials visit

Introduction to screening tests. Tim Hanson Department of Statistics University of South Carolina April, 2011

Welcome to this four part series focused on epidemiologic and biostatistical methods related to disease screening. In this first segment, we will

Worksheet for Structured Review of Physical Exam or Diagnostic Test Study

Lecture 14 Screening tests and result interpretation

Christina Martin Kazi Russell MED INF 406 INFERENCING Session 8 Group Project November 15, 2014

Assessment of performance and decision curve analysis

Supplemental Information

ROC Curves (Old Version)

VU Biostatistics and Experimental Design PLA.216

Introduction to ROC analysis

About OMICS International

Hayden Smith, PhD, MPH /\ v._

Systematic Reviews and meta-analyses of Diagnostic Test Accuracy. Mariska Leeflang

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

ROC Curve. Brawijaya Professional Statistical Analysis BPSA MALANG Jl. Kertoasri 66 Malang (0341)

EBM, Study Design and Numbers. David Frankfurter, MD Professor OB/GYN The George Washington University

ROC Curves. I wrote, from SAS, the relevant data to a plain text file which I imported to SPSS. The ROC analysis was conducted this way:

(true) Disease Condition Test + Total + a. a + b True Positive False Positive c. c + d False Negative True Negative Total a + c b + d a + b + c + d

Behavioral Data Mining. Lecture 4 Measurement

The recommended method for diagnosing sleep

MAESTRO TRIAL FINAL RESULTS. Gisela L.G. Menezes, MD, PhD

RELIABILITY OF OPERATORS DURING THE VISUAL INSPECTION OF PRODUCED PARENTERAL DRUGS

Nicolaus Copernicus University in Torun Medical College in Bydgoszcz Family Doctor Department CANCER PREVENTION IN GENERAL PRACTICE

Lecture 5. Contingency /incidence tables Sensibility, specificity Relative Risk Odds Ratio CHI SQUARE test

An Improved Patient-Specific Mortality Risk Prediction in ICU in a Random Forest Classification Framework

Diagnostic screening. Department of Statistics, University of South Carolina. Stat 506: Introduction to Experimental Design

Chapter 10. Screening for Disease

Jenny Toonstra, PhD, ATC Mid-Atlantic Athletic Trainers Association (MAATA) Student Educational Program May 20, 2017

Various performance measures in Binary classification An Overview of ROC study

Understanding diagnostic tests. P.J. Devereaux, MD, PhD McMaster University

An Introduction to ROC curves. Mark Whitehorn. Mark Whitehorn

MethodologicOverview of Screening Studies

Meta-analysis of diagnostic research. Karen R Steingart, MD, MPH Chennai, 15 December Overview

Evaluating Diagnostic Procedures. Einas Al-Eisa, MSc, PhD

Evaluation of a rapid lateral flow immunoassay (STic Expert HIT) for the diagnosis of heparin-induced thrombocytopenia (HIT)

Types of Biomedical Research

Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy

THE USE OF PLEURAL FLUID CHOLESTEROL IN IDENTIFYING THE TYPE OF PLEURAL EFFUSION

INTRODUCTION TO MACHINE LEARNING. Decision tree learning

12/26/2013. Types of Biomedical Research. Clinical Research. 7Steps to do research INTRODUCTION & MEASUREMENT IN CLINICAL RESEARCH S T A T I S T I C

Evaluation of diagnostic tests

PERFORMANCE MEASURES

Evidence-based Imaging: Critically Appraising Studies of Diagnostic Tests

Evalua&ng Methods. Tandy Warnow

Prostate Cancer. Biomedical Engineering for Global Health. Lecture Fourteen. Early Detection. Prostate Cancer: Statistics

Cognitive Screening in Risk Assessment. Geoffrey Tremont, Ph.D. Rhode Island Hospital & Alpert Medical School of Brown University.

4.5% 4.0% 3.5% 3.0% 2.5% 2.0% 1.5% 1.0% 0.5% 0.0% < >80 Current Age (Yrs)

Module Overview. What is a Marker? Part 1 Overview

Data that can be classified as belonging to a distinct number of categories >>result in categorical responses. And this includes:

Let s look a minute at the evidence supporting current cancer screening recommendations.

COMP90049 Knowledge Technologies

Glossary of Practical Epidemiology Concepts

Comparing Two ROC Curves Independent Groups Design

Appraising Diagnostic Test Studies

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers

sickness, disease, [toxicity] Hard to quantify

Estimation of Area under the ROC Curve Using Exponential and Weibull Distributions

Protein Structure & Function. University, Indianapolis, USA 3 Department of Molecular Medicine, University of South Florida, Tampa, USA

Diagnostic accuracy of range of motion measurements in early symptomatic hip and/or knee osteoarthritis

Dr. Ruby Senie is Professor of Clinical Epidemiology and Sociomedical Sciences in the Mailman School of Public Health.

2011 ASCP Annual Meeting

Bayes theorem, the ROC diagram and reference values: Definition and use in clinical diagnosis

STUDIES OF THE ACCURACY OF DIAGNOSTIC TESTS: (Relevant JAMA Users Guide Numbers IIIA & B: references (5,6))

Meta-analysis of Diagnostic Test Accuracy Studies

Introduction. We can make a prediction about Y i based on X i by setting a threshold value T, and predicting Y i = 1 when X i > T.

4 Diagnostic Tests and Measures of Agreement

SYSTEMATIC REVIEWS OF TEST ACCURACY STUDIES

Receiver operating characteristic

Bioengineering and World Health. Lecture Twelve

Ultrasound: Improving Breast Cancer Detection

Pat Quigley,PhD,MPH,ARNP,CRRN,FAAN,FAANP. Associate Director, VISN 8 Patient Safety Center Associate Chief for Nursing Service/Research

1 Diagnostic Test Evaluation

ABSTRAK Korelasi anatara Index Massa Tubuh(IMT) dan tekanan darah pada penduduk di Desa Lebih Kecamatan Gianyar

Biomedical Engineering for Global Health

«àπ π â Õ μ «å «π Áß μâ π π ßæ π ª

isk ncer 15 Weeks ical C tion c 1,000 Years! Lifetime Screening Cost

BME 301. Lecture Twelve

Transcription:

ب س م الل ه الر ح م ن الر ح يم RS Sardjito Diagnostic Test Gold standard New (test Disease No Disease Column Total Posi*ve a b a+b Nega*ve c d c+d Row Total a+c b+d N H. Risanto Siswosudarmo Department of Obstetrics and Gynecology Faculty of Medicine, UGM Jogjakarta

Diagnostic Test 1. Objective 1. Recognizing DT and screening. 2. Identifying diseases appropriate for screening. 3. Defining, calculating, and interpreting the sensitivity, specificity, positive predictive value and negative predictive value, false positive and false negative, SnNOUT and SpIN, LR + and LR- of a screening test. 4. Explaining the influence of prevalence of disease to predictive value. 5. Understanding side effects of a screening test. 6. Understanding an ROC curve. 2. Method 1. Lecture 2. Discussion 3. Computer computation

When screening is beneficial The disease is serious (e.g., cervical cancer). When treatment before symptoms occur is more effective than treatment that is delayed until symptoms appear. When the prevalence of disease in the DPCP is relatively high. The time interval between possible detection by screening and later detection after symptoms is the "detectable preclinical phase" or DPCP.

Characteristics of a Good Screening Test Inexpensive Easy to administer (easy to do, not complicated) Minimal discomfort (convenient) Reliable (consistent, reproducible) Valid (distinguishes diseased & non-diseased people) Characteristic of gold standard test The best test available Widely accepted (by all) Often expensive, or invasive

Basic Table: Two by two table Tes baru (tes diagnos5k) Sakit Gold standard Tidak sakit Total Posi*f a b a+b Nega*f c d c+d Total a+c b+d N Sensitivity: a/(a+c) PPV: a/(a+b) TP : a ; FP: b Specificity: d/(b+d) NPV: d/(c+d) Accuracy: (a+d)/n TN : d ; FN: c

Test validity Test validity is the ability of a screening test to accurately identify diseased and non-disease individuals. An ideal screening test is exquisitely sensitive (high probability of detecting disease) and extremely specific (high probability that those without the disease will screen negative). The validity of a screening test is based on its accuracy in identifying diseased and non-diseased persons. This can only be determined if the accuracy of the screening test is compared to some "gold standard" test. Gold standard test is a test that establishes the true disease status. The gold standard should be very accurate, but may be expensive and invasive

1. Sensitivity: The accurate the screening test in identifying disease in people who truly have the disease. The ability of the test to identify correctly those who have the disease. Proportion of people with disease who have a pos. test 2. Specificity The accuracy of the screening test in correctly classifying truly non-diseased people. The ability of the test to identify correctly those who DO NOT HAVE the disease Proportion of people without disease who have a neg. test

1. Positive predictive value (PPV): The proportion of people with positive test who actually HAVE the disease. If a person test is positive, what is the probability that she/he really has the disesae? PPV= a/(a+b) 2. Negative predictive value (NPV) The proportion of people with negative test who actually DO NOT HAVE the disease If a person test is negative, what is the probability that she/he really DOESN T HAVE the disease? NPV = d/(c+d)

1. SnNOUT When a test with a high SENsitivity is NEGative, it effectively rules OUT the disease 2. SpPIN When a test with high SPEcificity is POSitive, it effectively rules IN the disease.

Likelihood ratio 1. Likelihood ratio positive (LR+) The probability of a positive test in a person with the disease divided by the probability of a positive test in a person without the disease. LR(+) = Sensitivity / (1 Specificity) 2. Likelihood ratio negative (LR-) The probability of a negative test in a person with the disease, divided by the probability of a negative test in a person without the disease. LR (-) = (1 Sensitivity) / Specificity

Impact of likelihood ratio LR(+) LR( ) Impact on likelihood 10 0.1 Excellent 6 0.2 Very good 2 0.5 Fair 1 1 Useless

Two important down sides to screening: 1. False positive (Exp: Breast ca on a young women). The probability is extremely low The sensitivity of mammography is low because younger women have denser breast tissue. Will be subjected to extreme anxiety and worry. May also undergo invasive diagnostic tests such as needle biopsy and surgical biopsy unnecessarily. Time consuming and costy

Two important down sides to screening: 2. False negative. They will be reassured that they don't have disease, when they really do. Delayed diagnosis Delayed treatment Increased morbidity and mortality Increased cost

Effect of prevalence on PPV Skrining pada donor darah perempuan yang prevalensi HIV (+) adalah 0,01% (hanya 10 HIV positif dari 100.000 donor perempuan). Tes skrining memiliki sensitivitas 100% dan spesifisitas 95%. Berapa PPV nya? Perhatikan tabel berikut: Screening test HIV (+) Gold Standard HIV (- ) Total Column Tes (+) 10 510 520 Tes (- ) 0 99.480 99.480 Total Row 10 99.990 100.000 PPV = 10/520 = 1,9%

Effect of prevalence on PPV, scenario II Skrining dilakukan pada laki-laki yang datang di klinik penyakit menular seksual yang prevalensi HIV (+) adalah 4 %. Tes skrining memiliki sensitivitas dan spesifisitas yang sama yakni masingmasing 100% dan 95%. Berapa PPV nya? Screening Gold Standard Total Column test HIV (+) HIV (- ) Tes (+) 4.000 480 4.480 Tes (- ) 0 95.520 95.520 Total Row 4.000 96.000 100.000 PPV = 4000/4480 = 89%

Effect of prevalence on PPV, scenario II Skrining dilakukan pada pengguna obat suntik intravena yang prevalensi HIV (+) nya adalah 20%. Berapa PPV nya? Screening Gold Standard Total Column test HIV (+) HIV (- ) Tes (+) 20.000 400 20.400 Tes (- ) 0 79.600 95.520 Total Row 20.000 80.000 100.000 PPV = 20.000/20.400 = 98.%

Receiver Operating Characteristic (ROC) Curve ROC curve is a plot of sensitivity (Y axis) versus 1- specificity ( X axis) The position of the ROC on the graph reflects the accuracy of the diagnostic test. It covers all possible thresholds (cut-off points). The ROC of random guessing lies on the diagonal line. It means that the test is worthless The ROC of a perfect diagnostic technique is a point at the upper left corner of the graph, where the TP proportion is 1.0 and the FP proportion is 0.

The Area Under the Curve (AUC), also referred to as index of accuracy (A), and it is an accepted traditional performance metric for an ROC curve. The higher the area under the curve the better prediction power the model has. AUC = 0.8 indicates good predictive power of the model.90-1 = excellent.80-.90 = good.70-.80 = fair.60-.70 = poor.50-.60 = fail