METHODS FOR DETECTING CERVICAL CANCER

Size: px
Start display at page:

Download "METHODS FOR DETECTING CERVICAL CANCER"

Transcription

1 Chapter III METHODS FOR DETECTING CERVICAL CANCER 3.1 INTRODUCTION The successful detection of cervical cancer in a variety of tissues has been reported by many researchers and baseline figures for the sensitivity and specificity of their methods can be derived from their publications. In order to establish whether our laboratory could produce results similar to those reported in the literature. Several practical considerations are common to all hands-on cytological studies. This chapter discusses these practical issues and explains the reasoning behind the implementation decisions we made. The following chapter then describes the pilot studies and discusses the implications of their results in the context of previous findings in this field. 3.2 REPORTING DIAGNOSTIC TEST RESULTS The results of diagnostic tests are often reported as a single figure, usually the percentage of correct. This approach has the advantage of being simple to comprehend, and it makes comparisons between results easy to perform; but this single figure is influenced by a multitude of underlying factors, of which the most obvious is the cutoff threshold used for classification. A diagnostic test returns a continuously distributed measurement, or score, for each case. In order to actually classify the case, a threshold value of the score must be established, above (or below) which a case is considered 19

2 positive. The error proportion of the test depends upon the classification threshold chosen, and can be adjusted by moving this threshold. Figure 3.1 Dependence of summary measures of classifier performance on classification threshold selected To fully assess the performance of a diagnostic test it is necessary to understand its performance over a range of threshold values. Undoubtedly the most widely-used technique for this purpose is Receiver Operating Characteristic (ROC ) curve analysis (Henderson, 1993; Dwyer, 1996). ROC curves were developed for use in signal detection in radar returns in the 1950 s Swets (1986), in an excellent overview article, mentions that they were invented by Theodore G. Birdsall, of the Electrical Engineering Department of the University of Michigan, who taught the technique to him. The use of ROC curves has since been generalized to many problem domains, and is particularly widespread in medical decision making; Swets (1988) mentions at least 100 studies in the field of medical imaging which use ROC curve analysis. Example ROC curves are shown in Figure. 20

3 3.3 SCREENING TESTS Figure 3.2 Examples of ROC curves There are three possible purposes for a clinical diagnostic test discovery, confirmation and exclusion. A discovery test aims to detect the presence of a disease in a population; screening tests such as Pap smear screening are discovery tests. Confirmation tests are used to confirm the presence of disease in an individual with other symptoms, and exclusion tests allow the presence of disease in an individual to be ruled out. The three types of test have different purposes and a different decision threshold may be appropriate for each purpose. A rule-in threshold is the threshold used to confirm the presence of disease, while a rule-out threshold is used to exclude it. The two thresholds need not be the same. The performance of a diagnostic test is assessed in comparison with a gold standard indicator for the disease in question. Henderson (1993) defines a gold standard as A test constituting definitive diagnostic evidence [which] uniquely defines the disease 21

4 in the presence of specific symptomatology. For cervical neoplasia, the gold standard is biopsy of the lesion; however, in many studies biopsy is not available for all patients. Individuals diagnosed as normal are not usually biopsied, and therefore may include missed positives, while some women diagnosed as having neoplasia refuse treatment for personal reasons, and therefore cannot be biopsy-confirmed. A surrogate test, such as examination of a Pap smear by several cytologists, must be used instead in these cases. The possibility of errors in the gold standard must be kept in mind when assessing test performance. With respect to the gold standard diagnosis, a given test may provide one of four outcomes (Table 3.1). A positive test result in an individual with the disease is known as a true positive (TP); a positive result in a disease-free individual is a false positive (FP); a negative result in a disease-free individual is a true negative (TN) and a negative result in a patient with the disease is a false negative (FN). A table such as Table 3.1 is often referred to as the confusion matrix for a classifier. Table 3.1 The table of classification outcomes True Diagnosis Positive Classification Negative Total Positive True Positive (TP) False Negative (FN) Positive Population (PP) Negative False Positive (FP) True Negative (TN) Negative Population (NP) Total Class d Positive (CP) Class d Negative (CN) Total Population 22

5 From these outcomes, several related measures of test performance can be calculated (Bradley, 1997): Accuracy = (1 Error) = (TP + TN)/(PP + NP) = Pr(C), the probability of a correct classification. Sensitivity = TP/(TP + FN) = TP/PP = Pr(disease detected disease present), the ability of the test to detect disease in a population of diseased individuals. Specificity = TN/(TN + FP) = TN / NP = Pr(negative result no disease present), the ability of the test to correctly rule out the disease in a disease-free population. 3.4 PREDICTIVE VALUE OF A TEST Measures of sensitivity and specificity do not indicate how a test will perform in clinical practice. To do this, it is necessary to calculate the predictive values of that test. The positive predictive value (PPV) of the test is the proportion of all positive tests which correctly indicate the presence of disease; that is PPV = TP / (TP + FP) = Pr (disease present positive test) 3.5 RECEIVER OPERATING CHARACTERISTIC CURVE An ROC curve is constructed by first classifying the data set of interest. The classification result can be a single real number or an ordinal number, permitting a sensible ranking of cases (Dwyer, 1996; van Erkel & Pattynama, 1998). The true positive and false positive rates will depend upon the classification threshold chosen; to plot a ROC curve this threshold is varied over all possible output values and the true positive proportion is plotted against the false positive proportion for each threshold. The resulting curve will follow the diagonal from 0,0 to 1,1 if the classifier has no power (area 23

6 under the curve is 0.5), and will hug the top left corner of the plot for a perfect classifier (area under the curve is 1.0). The prevalence of the disease, the proportion of individuals with the disease in a given population at a specified point in time, will affect these values. The predictive value of a given test decreases with decreasing prevalence of the disease, so that even a relatively good test will perform poorly for a disease of very low prevalence. 3.6 ROC CURVE ANALYSIS ROC curves are usually plotted assuming a binormal distribution of the data that is, it is assumed that data from both the diseased and the non-diseased groups are distributed normally and the mean and variance of each distribution is estimated separately (Swets, 1986; Metz, Herman & Roe 1998). These distributions are then used to find the area under the curve. A number of nonparametric methods have also been developed for finding the area under the curve, to handle data which is highly non- Gaussian, although Hajian-Tilaki, Hanley, Joseph & Collet (1997) demonstrate that both parametric and non-parametric methods yield very similar estimates of the area under the curve for the same data, and these authors conclude that for a wide range of distributions ROC curves and their associated methodology are relatively robust. For statistical details of these approaches see (Swets, 1986). The binormal assumption is widely used because much of the original work on ROC curves was performed by humans ranking images (radar or radiological). A human can only handle a limited number of categories; five or seven are the numbers frequently cited in the literature (Hanley & McNeil, 1982; Swets, 1986). The area under a ROC curve empirically plotted from five categories can be estimated using the trapezoidal rule, but this approach is 24

7 known to underestimate the area. The binormal assumption allowed a smooth curve to be drawn, and the area underneath it to be calculated accurately (Hanley & McNeil, 1982). Using more recent computer techniques and a continuous decision variable, the binormal assumption is unnecessary and an empirical approach can be taken. 3.7 MEASURES FROM ROC CURVES The area under the ROC curve is commonly used as a measure of the overall performance of the classifier. Bamber (1975) demonstrated that the area under the ROC curve is equivalent to the value of the non-parametric Mann-Whitney U statistic, which in turn is equivalent to the Wilcoxon statistic. The Wilcoxon statistic is also a nonparametric statistic, usually used to test the hypothesis that the distribution of a variable, x, from one population, p, is equal to that from a second population, n. If the null hypothesis, H0: xp = xn is rejected, the probability, p can be calculated such that either xp > xn,, xp < xn or xp xn. The Wilcoxon test makes no assumptions about the distributions of the underlying variables. The area under the ROC curve effectively measures P(xp > xn), and so represents the probability that a random chosen positive example from a data set of interest is correctly ranked with respect to a randomly chosen negative example (Hanley & McNeil, 1982). In order to compare classifiers, it is necessary to estimate the standard error of the area under the curve, SE(AUC). The method for doing this varies with the method used to estimate the AUC, but one easily applied method, which is applicable to an empirically derived curve, is to use the standard error of the Wilcoxon statistic, SE (W): 25

8 .3.1 where θ is the area under the curve, C p and C n are the number of positive and negative examples respectively, and Q 1 is the probability that two randomly chosen abnormal images will both be ranked with greater suspicion than a randomly chosen normal image and Q2 is the probability that one randomly chosen abnormal image will be ranked with greater suspicion than two randomly chosen normal images (Hanley & McNeil, 1982; Henderson, 1993). SE(W) decreases as the number of samples on which it is estimated, N, increases, the decrease being proportional to N. SE(W) is also inversely proportional to the area under the curve, with SE(W) approaching 0 as the area under the curve approaches 1. To overcome these problems, Dwyer (1996) has suggested analysis of the area of that portion of the ROC curve which corresponds to a desirable range of false positive values. This allows the comparison of only those parts of the curve which are in the area of clinical interest. Another use of the ROC curve is to select a single threshold for optimum discrimination. Intuitively, this point is the point closest to the upper left corner of the graph; however, cost/benefit considerations may alter this optimum. Van Erkel & 26

9 Pattynama (1998) recommend combining ROC analysis with formal cost-effectiveness analysis in order to determine the optimal threshold. Returning to the use of a single point would appear to invalidate the use of ROC curves, since the same point could potentially be selected on the basis of the sensitivity/specificity values on which the curve is based. This is not, in fact, the case. A single pair of values may represent a point where sensitivity may be increased with little loss of specificity and vice versa, or it may not. Inspection of the ROC curve allows clarification of such issues (Bradley, 1997). 3.8 SLOPE OF THE ROC CURVE The area under the ROC curve provides an overall measure of the behaviour of the classifier, but does not measure performance at specific points. The diagnostic utility of a particular point on the ROC curve can be calculated using the odds-likelihood ratio form of Bayes Rule (Henderson, 1993). In its simplest form, Bayes Rule is usually written as: where Pr(D R) is the probability of disease (D) given the test result (R), Pr(R D) is the probability of the result given that the disease is present, Pr(D) is the probability of disease (D) in the population, and Pr(R) is the probability of the test result in the population as a whole. This equation can also be expressed in terms of the probability of non-disease (D ), and the two expressions combined as a ratio:

10 Or (posterior odds of disease) = (positive likelihood ratio) x (prior odds of disease) The positive likelihood ratio (LR+) is the ratio between the probability of a positive test result given the presence of a disease and the probability of the same test result given the absence of a disease (Choi, 1998). The negative likelihood ratio (LR-) can be defined analogously. These likelihood ratios can be calculated from the slope of the ROC curve. The tangent to the curve at the point representing the threshold of interest corresponds to the likelihood ratio for a single test value corresponding to that point on the ROC curve for a continuous test (Henderson, 1993; Dwyer, 1996; Choi, 1998). The likelihood ratio of a positive test result for a dichotomous test is given by LR+ = TPR / FPR 3.6 where TPR is the True Positive Proportion, and FPR is the False Positive Proportion, both of which lie between 0.0% and 100.0%. The likelihood ratio is equivalent to the tangent of the angle formed by that point, 0,0 and 1,0. The likelihood ratio of a negative test result is given by LR- = FNR / TNR 3.7. which is equivalent to the slope of a line from that point to 1,1 (Choi, 1998). Before performing any tests at all, the prior probability of a patient having the disease is equal to the prevalence of the disease in the population of interest (Moons et al., 1997). The posterior odds required for treatment depend upon clinical issues such as the cost/benefit tradeoff discussed above, and will vary greatly from test to test and disease to disease. Further discussion of these issues can be found in (Moons et al., 1997). 28

11 3.9 DISCUSSION OF ROC CURVE ANALYSIS ROC curve analysis has not been used in the investigation of cervical cancer earlier. Early studies tended to be subjective and qualitative, and did not yield the type of data suitable for ROC curve generation. The first workers to use ROC analysis in this context were Burger et al. (1981), who present ROC curves for their cervical cell classifiers, but do not extract any summary measures, such as the area under the curve. Haroske et al. (1990) also take this approach in demonstrating the performance of their hierarchical classifier, as do Garner et al. (1994b), who merely mention that the closer the curves are to the axes, the better the system performance (Garner et al., 1994b, p. 8). The same is true for Palcic & MacAulay (1994). Payne et al. (1997) do, however, present a ROC curve, together with a discussion of the implications of such a curve for the underlying classifier, and a discussion of the sensitivity, specificity and positive predictive value of the classifier at selected operating points. Given the amount of work which has gone into the development of ROC curve analysis for the characterization, optimization and comparison of classifiers, it would appear that these techniques have to date been underutilized by cervical cancer researchers. 29

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp

1 Introduction. st0020. The Stata Journal (2002) 2, Number 3, pp The Stata Journal (22) 2, Number 3, pp. 28 289 Comparative assessment of three common algorithms for estimating the variance of the area under the nonparametric receiver operating characteristic curve

More information

Various performance measures in Binary classification An Overview of ROC study

Various performance measures in Binary classification An Overview of ROC study Various performance measures in Binary classification An Overview of ROC study Suresh Babu. Nellore Department of Statistics, S.V. University, Tirupati, India E-mail: sureshbabu.nellore@gmail.com Abstract

More information

Comparing Two ROC Curves Independent Groups Design

Comparing Two ROC Curves Independent Groups Design Chapter 548 Comparing Two ROC Curves Independent Groups Design Introduction This procedure is used to compare two ROC curves generated from data from two independent groups. In addition to producing a

More information

Statistics, Probability and Diagnostic Medicine

Statistics, Probability and Diagnostic Medicine Statistics, Probability and Diagnostic Medicine Jennifer Le-Rademacher, PhD Sponsored by the Clinical and Translational Science Institute (CTSI) and the Department of Population Health / Division of Biostatistics

More information

Review. Imagine the following table being obtained as a random. Decision Test Diseased Not Diseased Positive TP FP Negative FN TN

Review. Imagine the following table being obtained as a random. Decision Test Diseased Not Diseased Positive TP FP Negative FN TN Outline 1. Review sensitivity and specificity 2. Define an ROC curve 3. Define AUC 4. Non-parametric tests for whether or not the test is informative 5. Introduce the binormal ROC model 6. Discuss non-parametric

More information

Receiver operating characteristic

Receiver operating characteristic Receiver operating characteristic From Wikipedia, the free encyclopedia In signal detection theory, a receiver operating characteristic (ROC), or simply ROC curve, is a graphical plot of the sensitivity,

More information

Meta-analyses evaluating diagnostic test accuracy

Meta-analyses evaluating diagnostic test accuracy THE STATISTICIAN S PAGE Summary Receiver Operating Characteristic Curve Analysis Techniques in the Evaluation of Diagnostic Tests Catherine M. Jones, MBBS, BSc(Stat), and Thanos Athanasiou, MD, PhD, FETCS

More information

A Learning Method of Directly Optimizing Classifier Performance at Local Operating Range

A Learning Method of Directly Optimizing Classifier Performance at Local Operating Range A Learning Method of Directly Optimizing Classifier Performance at Local Operating Range Lae-Jeong Park and Jung-Ho Moon Department of Electrical Engineering, Kangnung National University Kangnung, Gangwon-Do,

More information

ROC Curves (Old Version)

ROC Curves (Old Version) Chapter 545 ROC Curves (Old Version) Introduction This procedure generates both binormal and empirical (nonparametric) ROC curves. It computes comparative measures such as the whole, and partial, area

More information

EVALUATION AND COMPUTATION OF DIAGNOSTIC TESTS: A SIMPLE ALTERNATIVE

EVALUATION AND COMPUTATION OF DIAGNOSTIC TESTS: A SIMPLE ALTERNATIVE EVALUATION AND COMPUTATION OF DIAGNOSTIC TESTS: A SIMPLE ALTERNATIVE NAHID SULTANA SUMI, M. ATAHARUL ISLAM, AND MD. AKHTAR HOSSAIN Abstract. Methods of evaluating and comparing the performance of diagnostic

More information

BMI 541/699 Lecture 16

BMI 541/699 Lecture 16 BMI 541/699 Lecture 16 Where we are: 1. Introduction and Experimental Design 2. Exploratory Data Analysis 3. Probability 4. T-based methods for continous variables 5. Proportions & contingency tables -

More information

Estimation of Area under the ROC Curve Using Exponential and Weibull Distributions

Estimation of Area under the ROC Curve Using Exponential and Weibull Distributions XI Biennial Conference of the International Biometric Society (Indian Region) on Computational Statistics and Bio-Sciences, March 8-9, 22 43 Estimation of Area under the ROC Curve Using Exponential and

More information

Week 2 Video 3. Diagnostic Metrics

Week 2 Video 3. Diagnostic Metrics Week 2 Video 3 Diagnostic Metrics Different Methods, Different Measures Today we ll continue our focus on classifiers Later this week we ll discuss regressors And other methods will get worked in later

More information

Introduction to diagnostic accuracy meta-analysis. Yemisi Takwoingi October 2015

Introduction to diagnostic accuracy meta-analysis. Yemisi Takwoingi October 2015 Introduction to diagnostic accuracy meta-analysis Yemisi Takwoingi October 2015 Learning objectives To appreciate the concept underlying DTA meta-analytic approaches To know the Moses-Littenberg SROC method

More information

4. Model evaluation & selection

4. Model evaluation & selection Foundations of Machine Learning CentraleSupélec Fall 2017 4. Model evaluation & selection Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr

More information

Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta GA, USA.

Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta GA, USA. A More Intuitive Interpretation of the Area Under the ROC Curve A. Cecile J.W. Janssens, PhD Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta GA, USA. Corresponding

More information

Introduction. We can make a prediction about Y i based on X i by setting a threshold value T, and predicting Y i = 1 when X i > T.

Introduction. We can make a prediction about Y i based on X i by setting a threshold value T, and predicting Y i = 1 when X i > T. Diagnostic Tests 1 Introduction Suppose we have a quantitative measurement X i on experimental or observed units i = 1,..., n, and a characteristic Y i = 0 or Y i = 1 (e.g. case/control status). The measurement

More information

Introduction to ROC analysis

Introduction to ROC analysis Introduction to ROC analysis Andriy I. Bandos Department of Biostatistics University of Pittsburgh Acknowledgements Many thanks to Sam Wieand, Nancy Obuchowski, Brenda Kurland, and Todd Alonzo for previous

More information

Sensitivity, Specificity, and Relatives

Sensitivity, Specificity, and Relatives Sensitivity, Specificity, and Relatives Brani Vidakovic ISyE 6421/ BMED 6700 Vidakovic, B. Se Sp and Relatives January 17, 2017 1 / 26 Overview Today: Vidakovic, B. Se Sp and Relatives January 17, 2017

More information

INTRODUCTION TO MACHINE LEARNING. Decision tree learning

INTRODUCTION TO MACHINE LEARNING. Decision tree learning INTRODUCTION TO MACHINE LEARNING Decision tree learning Task of classification Automatically assign class to observations with features Observation: vector of features, with a class Automatically assign

More information

Bayes theorem, the ROC diagram and reference values: Definition and use in clinical diagnosis

Bayes theorem, the ROC diagram and reference values: Definition and use in clinical diagnosis Special Lessons issue: in biostatistics Responsible writing in science Bayes theorem, the ROC diagram and reference values: efinition and use in clinical diagnosis Anders Kallner* epartment of clinical

More information

3. Model evaluation & selection

3. Model evaluation & selection Foundations of Machine Learning CentraleSupélec Fall 2016 3. Model evaluation & selection Chloé-Agathe Azencot Centre for Computational Biology, Mines ParisTech chloe-agathe.azencott@mines-paristech.fr

More information

Knowledge Discovery and Data Mining. Testing. Performance Measures. Notes. Lecture 15 - ROC, AUC & Lift. Tom Kelsey. Notes

Knowledge Discovery and Data Mining. Testing. Performance Measures. Notes. Lecture 15 - ROC, AUC & Lift. Tom Kelsey. Notes Knowledge Discovery and Data Mining Lecture 15 - ROC, AUC & Lift Tom Kelsey School of Computer Science University of St Andrews http://tom.home.cs.st-andrews.ac.uk twk@st-andrews.ac.uk Tom Kelsey ID5059-17-AUC

More information

Derivative-Free Optimization for Hyper-Parameter Tuning in Machine Learning Problems

Derivative-Free Optimization for Hyper-Parameter Tuning in Machine Learning Problems Derivative-Free Optimization for Hyper-Parameter Tuning in Machine Learning Problems Hiva Ghanbari Jointed work with Prof. Katya Scheinberg Industrial and Systems Engineering Department Lehigh University

More information

PII: S (96) THE USE OF THE AREA UNDER THE ROC CURVE IN THE EVALUATION OF MACHINE LEARNING ALGORITHMS

PII: S (96) THE USE OF THE AREA UNDER THE ROC CURVE IN THE EVALUATION OF MACHINE LEARNING ALGORITHMS Pergamon Pattern Recognition, Vol. 30, No. 7, pp. 1145-1159, 1997 1997 Pattern Recognition Society. Published by Elsevier Science Ltd Printed in Great Britain. All rights reserved 0031-3203/97 $17.00+.00

More information

An Introduction to ROC curves. Mark Whitehorn. Mark Whitehorn

An Introduction to ROC curves. Mark Whitehorn. Mark Whitehorn An Introduction to ROC curves Mark Whitehorn Mark Whitehorn It s all about me Prof. Mark Whitehorn Emeritus Professor of Analytics Computing University of Dundee Consultant Writer (author) m.a.f.whitehorn@dundee.ac.uk

More information

Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy

Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy Chapter 10 Analysing and Presenting Results Petra Macaskill, Constantine Gatsonis, Jonathan Deeks, Roger Harbord, Yemisi Takwoingi.

More information

Meta-analysis of Diagnostic Test Accuracy Studies

Meta-analysis of Diagnostic Test Accuracy Studies GUIDELINE Meta-analysis of Diagnostic Test Accuracy Studies November 2014 Copyright EUnetHTA 2013. All Rights Reserved. No part of this document may be reproduced without an explicit acknowledgement of

More information

Critical reading of diagnostic imaging studies. Lecture Goals. Constantine Gatsonis, PhD. Brown University

Critical reading of diagnostic imaging studies. Lecture Goals. Constantine Gatsonis, PhD. Brown University Critical reading of diagnostic imaging studies Constantine Gatsonis Center for Statistical Sciences Brown University Annual Meeting Lecture Goals 1. Review diagnostic imaging evaluation goals and endpoints.

More information

A scored AUC Metric for Classifier Evaluation and Selection

A scored AUC Metric for Classifier Evaluation and Selection A scored AUC Metric for Classifier Evaluation and Selection Shaomin Wu SHAOMIN.WU@READING.AC.UK School of Construction Management and Engineering, The University of Reading, Reading RG6 6AW, UK Peter Flach

More information

Overview. Goals of Interpretation. Methodology. Reasons to Read and Evaluate

Overview. Goals of Interpretation. Methodology. Reasons to Read and Evaluate Overview Critical Literature Evaluation and Biostatistics Ahl Ashley N. Lewis, PharmD, BCPS Clinical Specialist, Drug Information UNC Hospitals Background Review of basic statistics Statistical tests Clinical

More information

SYSTEMATIC REVIEWS OF TEST ACCURACY STUDIES

SYSTEMATIC REVIEWS OF TEST ACCURACY STUDIES Biomarker & Test Evaluation Program SYSTEMATIC REVIEWS OF TEST ACCURACY STUDIES Patrick MM Bossuyt Structure 1. Clinical Scenarios 2. Test Accuracy Studies 3. Systematic Reviews 4. Meta-Analysis 5.

More information

7/17/2013. Evaluation of Diagnostic Tests July 22, 2013 Introduction to Clinical Research: A Two week Intensive Course

7/17/2013. Evaluation of Diagnostic Tests July 22, 2013 Introduction to Clinical Research: A Two week Intensive Course Evaluation of Diagnostic Tests July 22, 2013 Introduction to Clinical Research: A Two week Intensive Course David W. Dowdy, MD, PhD Department of Epidemiology Johns Hopkins Bloomberg School of Public Health

More information

Clinical Decision Analysis

Clinical Decision Analysis Clinical Decision Analysis Terminology Sensitivity (Hit True Positive) Specificity (Correct rejection True Negative) Positive predictive value Negative predictive value The fraction of those with the disease

More information

1 Diagnostic Test Evaluation

1 Diagnostic Test Evaluation 1 Diagnostic Test Evaluation The Receiver Operating Characteristic (ROC) curve of a diagnostic test is a plot of test sensitivity (the probability of a true positive) against 1.0 minus test specificity

More information

Evaluation of diagnostic tests

Evaluation of diagnostic tests Evaluation of diagnostic tests Biostatistics and informatics Miklós Kellermayer Overlapping distributions Assumption: A classifier value (e.g., diagnostic parameter, a measurable quantity, e.g., serum

More information

Examining differences between two sets of scores

Examining differences between two sets of scores 6 Examining differences between two sets of scores In this chapter you will learn about tests which tell us if there is a statistically significant difference between two sets of scores. In so doing you

More information

Modifying ROC Curves to Incorporate Predicted Probabilities

Modifying ROC Curves to Incorporate Predicted Probabilities Modifying ROC Curves to Incorporate Predicted Probabilities C. Ferri, P. Flach 2, J. Hernández-Orallo, A. Senad Departament de Sistemes Informàtics i Computació Universitat Politècnica de València Spain

More information

Module Overview. What is a Marker? Part 1 Overview

Module Overview. What is a Marker? Part 1 Overview SISCR Module 7 Part I: Introduction Basic Concepts for Binary Classification Tools and Continuous Biomarkers Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington

More information

Quantitative Methods in Computing Education Research (A brief overview tips and techniques)

Quantitative Methods in Computing Education Research (A brief overview tips and techniques) Quantitative Methods in Computing Education Research (A brief overview tips and techniques) Dr Judy Sheard Senior Lecturer Co-Director, Computing Education Research Group Monash University judy.sheard@monash.edu

More information

Fundamentals of Clinical Research for Radiologists. ROC Analysis. - Research Obuchowski ROC Analysis. Nancy A. Obuchowski 1

Fundamentals of Clinical Research for Radiologists. ROC Analysis. - Research Obuchowski ROC Analysis. Nancy A. Obuchowski 1 - Research Nancy A. 1 Received October 28, 2004; accepted after revision November 3, 2004. Series editors: Nancy, C. Craig Blackmore, Steven Karlik, and Caroline Reinhold. This is the 14th in the series

More information

Meta-analysis of diagnostic research. Karen R Steingart, MD, MPH Chennai, 15 December Overview

Meta-analysis of diagnostic research. Karen R Steingart, MD, MPH Chennai, 15 December Overview Meta-analysis of diagnostic research Karen R Steingart, MD, MPH karenst@uw.edu Chennai, 15 December 2010 Overview Describe key steps in a systematic review/ meta-analysis of diagnostic test accuracy studies

More information

Probability Revision. MED INF 406 Assignment 5. Golkonda, Jyothi 11/4/2012

Probability Revision. MED INF 406 Assignment 5. Golkonda, Jyothi 11/4/2012 Probability Revision MED INF 406 Assignment 5 Golkonda, Jyothi 11/4/2012 Problem Statement Assume that the incidence for Lyme disease in the state of Connecticut is 78 cases per 100,000. A diagnostic test

More information

An Empirical Assessment of Bivariate Methods for Meta-analysis of Test Accuracy

An Empirical Assessment of Bivariate Methods for Meta-analysis of Test Accuracy Number XX An Empirical Assessment of Bivariate Methods for Meta-analysis of Test Accuracy Prepared for: Agency for Healthcare Research and Quality U.S. Department of Health and Human Services 54 Gaither

More information

VU Biostatistics and Experimental Design PLA.216

VU Biostatistics and Experimental Design PLA.216 VU Biostatistics and Experimental Design PLA.216 Julia Feichtinger Postdoctoral Researcher Institute of Computational Biotechnology Graz University of Technology Outline for Today About this course Background

More information

Worksheet for Structured Review of Physical Exam or Diagnostic Test Study

Worksheet for Structured Review of Physical Exam or Diagnostic Test Study Worksheet for Structured Review of Physical Exam or Diagnostic Study Title of Manuscript: Authors of Manuscript: Journal and Citation: Identify and State the Hypothesis Primary Hypothesis: Secondary Hypothesis:

More information

Introduction to screening tests. Tim Hanson Department of Statistics University of South Carolina April, 2011

Introduction to screening tests. Tim Hanson Department of Statistics University of South Carolina April, 2011 Introduction to screening tests Tim Hanson Department of Statistics University of South Carolina April, 2011 1 Overview: 1. Estimating test accuracy: dichotomous tests. 2. Estimating test accuracy: continuous

More information

Introduction to Meta-analysis of Accuracy Data

Introduction to Meta-analysis of Accuracy Data Introduction to Meta-analysis of Accuracy Data Hans Reitsma MD, PhD Dept. of Clinical Epidemiology, Biostatistics & Bioinformatics Academic Medical Center - Amsterdam Continental European Support Unit

More information

Machine learning II. Juhan Ernits ITI8600

Machine learning II. Juhan Ernits ITI8600 Machine learning II Juhan Ernits ITI8600 Hand written digit recognition 64 Example 2: Face recogition Classification, regression or unsupervised? How many classes? Example 2: Face recognition Classification,

More information

Biomarker adaptive designs in clinical trials

Biomarker adaptive designs in clinical trials Review Article Biomarker adaptive designs in clinical trials James J. Chen 1, Tzu-Pin Lu 1,2, Dung-Tsa Chen 3, Sue-Jane Wang 4 1 Division of Bioinformatics and Biostatistics, National Center for Toxicological

More information

Diagnostic Test. H. Risanto Siswosudarmo Department of Obstetrics and Gynecology Faculty of Medicine, UGM Jogjakarta. RS Sardjito

Diagnostic Test. H. Risanto Siswosudarmo Department of Obstetrics and Gynecology Faculty of Medicine, UGM Jogjakarta. RS Sardjito ب س م الل ه الر ح م ن الر ح يم RS Sardjito Diagnostic Test Gold standard New (test Disease No Disease Column Total Posi*ve a b a+b Nega*ve c d c+d Row Total a+c b+d N H. Risanto Siswosudarmo Department

More information

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington

More information

ROC Curve. Brawijaya Professional Statistical Analysis BPSA MALANG Jl. Kertoasri 66 Malang (0341)

ROC Curve. Brawijaya Professional Statistical Analysis BPSA MALANG Jl. Kertoasri 66 Malang (0341) ROC Curve Brawijaya Professional Statistical Analysis BPSA MALANG Jl. Kertoasri 66 Malang (0341) 580342 ROC Curve The ROC Curve procedure provides a useful way to evaluate the performance of classification

More information

Diagnostic tests, Laboratory tests

Diagnostic tests, Laboratory tests Diagnostic tests, Laboratory tests I. Introduction II. III. IV. Informational values of a test Consequences of the prevalence rate Sequential use of 2 tests V. Selection of a threshold: the ROC curve VI.

More information

Chapter 11. Experimental Design: One-Way Independent Samples Design

Chapter 11. Experimental Design: One-Way Independent Samples Design 11-1 Chapter 11. Experimental Design: One-Way Independent Samples Design Advantages and Limitations Comparing Two Groups Comparing t Test to ANOVA Independent Samples t Test Independent Samples ANOVA Comparing

More information

STATISTICS AND RESEARCH DESIGN

STATISTICS AND RESEARCH DESIGN Statistics 1 STATISTICS AND RESEARCH DESIGN These are subjects that are frequently confused. Both subjects often evoke student anxiety and avoidance. To further complicate matters, both areas appear have

More information

Bayesian meta-analysis of Papanicolaou smear accuracy

Bayesian meta-analysis of Papanicolaou smear accuracy Gynecologic Oncology 107 (2007) S133 S137 www.elsevier.com/locate/ygyno Bayesian meta-analysis of Papanicolaou smear accuracy Xiuyu Cong a, Dennis D. Cox b, Scott B. Cantor c, a Biometrics and Data Management,

More information

sickness, disease, [toxicity] Hard to quantify

sickness, disease, [toxicity] Hard to quantify BE.104 Spring Epidemiology: Test Development and Relative Risk J. L. Sherley Agent X? Cause Health First, Some definitions Morbidity = Mortality = sickness, disease, [toxicity] Hard to quantify death Easy

More information

Systematic Reviews and meta-analyses of Diagnostic Test Accuracy. Mariska Leeflang

Systematic Reviews and meta-analyses of Diagnostic Test Accuracy. Mariska Leeflang Systematic Reviews and meta-analyses of Diagnostic Test Accuracy Mariska Leeflang m.m.leeflang@amc.uva.nl This presentation 1. Introduction: accuracy? 2. QUADAS-2 exercise 3. Meta-analysis of diagnostic

More information

Diagnostic Reasoning: Approach to Clinical Diagnosis Based on Bayes Theorem

Diagnostic Reasoning: Approach to Clinical Diagnosis Based on Bayes Theorem CHAPTER 75 Diagnostic Reasoning: Approach to Clinical Diagnosis Based on Bayes Theorem A. Mohan, K. Srihasam, S.K. Sharma Introduction Doctors caring for patients in their everyday clinical practice are

More information

Predicting Breast Cancer Survivability Rates

Predicting Breast Cancer Survivability Rates Predicting Breast Cancer Survivability Rates For data collected from Saudi Arabia Registries Ghofran Othoum 1 and Wadee Al-Halabi 2 1 Computer Science, Effat University, Jeddah, Saudi Arabia 2 Computer

More information

An Empirical and Formal Analysis of Decision Trees for Ranking

An Empirical and Formal Analysis of Decision Trees for Ranking An Empirical and Formal Analysis of Decision Trees for Ranking Eyke Hüllermeier Department of Mathematics and Computer Science Marburg University 35032 Marburg, Germany eyke@mathematik.uni-marburg.de Stijn

More information

Detection Theory: Sensitivity and Response Bias

Detection Theory: Sensitivity and Response Bias Detection Theory: Sensitivity and Response Bias Lewis O. Harvey, Jr. Department of Psychology University of Colorado Boulder, Colorado The Brain (Observable) Stimulus System (Observable) Response System

More information

Screening (Diagnostic Tests) Shaker Salarilak

Screening (Diagnostic Tests) Shaker Salarilak Screening (Diagnostic Tests) Shaker Salarilak Outline Screening basics Evaluation of screening programs Where we are? Definition of screening? Whether it is always beneficial? Types of bias in screening?

More information

Zheng Yao Sr. Statistical Programmer

Zheng Yao Sr. Statistical Programmer ROC CURVE ANALYSIS USING SAS Zheng Yao Sr. Statistical Programmer Outline Background Examples: Accuracy assessment Compare ROC curves Cut-off point selection Summary 2 Outline Background Examples: Accuracy

More information

Diagnostic imaging evaluating image quality using visual grading characteristic (VGC) analysis

Diagnostic imaging evaluating image quality using visual grading characteristic (VGC) analysis Vet Res Commun (2010) 34:473 479 DOI 10.1007/s11259-010-9413-2 SHORT COMMUNICATION Diagnostic imaging evaluating image quality using visual grading characteristic (VGC) analysis Eberhard Ludewig & Andreas

More information

Hayden Smith, PhD, MPH /\ v._

Hayden Smith, PhD, MPH /\ v._ Hayden Smith, PhD, MPH.. + /\ v._ Information and clinical examples provided in presentation are strictly for educational purposes, and should not be substituted for clinical guidelines or up-to-date medical

More information

Clinical Utility of Likelihood Ratios

Clinical Utility of Likelihood Ratios CONCEPTS Clinical Utility of Likelihood Ratios From the Departments of Emergency Medicine, Medicine, Epidemiology, and Social Medicine, Albert Einstein College of Medicine, Bronx, NY. Received for publication

More information

Multivariate Mixed-Effects Meta-Analysis of Paired-Comparison Studies of Diagnostic Test Accuracy

Multivariate Mixed-Effects Meta-Analysis of Paired-Comparison Studies of Diagnostic Test Accuracy Multivariate Mixed-Effects Meta-Analysis of Paired-Comparison Studies of Diagnostic Test Accuracy Ben A. Dwamena, MD The University of Michigan & VA Medical Centers, Ann Arbor SNASUG - July 24, 2008 Diagnostic

More information

Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 1:

Research Methods 1 Handouts, Graham Hole,COGS - version 1.0, September 2000: Page 1: Research Methods 1 Handouts, Graham Hole,COGS - version 10, September 000: Page 1: T-TESTS: When to use a t-test: The simplest experimental design is to have two conditions: an "experimental" condition

More information

OBSERVER PERFORMANCE METHODS FOR DIAGNOSTIC IMAGING Foundations, Modeling and Applications with R-Examples. Dev P.

OBSERVER PERFORMANCE METHODS FOR DIAGNOSTIC IMAGING Foundations, Modeling and Applications with R-Examples. Dev P. OBSERVER PERFORMANCE METHODS FOR DIAGNOSTIC IMAGING Foundations, Modeling and Applications with R-Examples Dev P. Chakraborty, PhD Chapter 01: Preliminaries This chapter provides background material, starting

More information

Theme 14 Ranking tests

Theme 14 Ranking tests Slide 14.1 Theme 14 Ranking tests Slide 14.2 Overview The tests which we have considered so far assumed that the data on our variables is normally distributed What do we do if this assumption has been

More information

ROC Curves. I wrote, from SAS, the relevant data to a plain text file which I imported to SPSS. The ROC analysis was conducted this way:

ROC Curves. I wrote, from SAS, the relevant data to a plain text file which I imported to SPSS. The ROC analysis was conducted this way: ROC Curves We developed a method to make diagnoses of anxiety using criteria provided by Phillip. Would it also be possible to make such diagnoses based on a much more simple scheme, a simple cutoff point

More information

Behavioral Data Mining. Lecture 4 Measurement

Behavioral Data Mining. Lecture 4 Measurement Behavioral Data Mining Lecture 4 Measurement Outline Hypothesis testing Parametric statistical tests Non-parametric tests Precision-Recall plots ROC plots Hardware update Icluster machines are ready for

More information

Overview of Non-Parametric Statistics

Overview of Non-Parametric Statistics Overview of Non-Parametric Statistics LISA Short Course Series Mark Seiss, Dept. of Statistics April 7, 2009 Presentation Outline 1. Homework 2. Review of Parametric Statistics 3. Overview Non-Parametric

More information

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol.

Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Ho (null hypothesis) Ha (alternative hypothesis) Problem #1 Neurological signs and symptoms of ciguatera poisoning as the start of treatment and 2.5 hours after treatment with mannitol. Hypothesis: Ho:

More information

! Mainly going to ignore issues of correlation among tests

! Mainly going to ignore issues of correlation among tests 2x2 and Stratum Specific Likelihood Ratio Approaches to Interpreting Diagnostic Tests. How Different Are They? Henry Glick and Seema Sonnad University of Pennsylvania Society for Medical Decision Making

More information

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES Correlational Research Correlational Designs Correlational research is used to describe the relationship between two or more naturally occurring variables. Is age related to political conservativism? Are

More information

Analysis of Diabetic Dataset and Developing Prediction Model by using Hive and R

Analysis of Diabetic Dataset and Developing Prediction Model by using Hive and R Indian Journal of Science and Technology, Vol 9(47), DOI: 10.17485/ijst/2016/v9i47/106496, December 2016 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Analysis of Diabetic Dataset and Developing Prediction

More information

Studies reporting ROC curves of diagnostic and prediction data can be incorporated into meta-analyses using corresponding odds ratios

Studies reporting ROC curves of diagnostic and prediction data can be incorporated into meta-analyses using corresponding odds ratios Journal of Clinical Epidemiology 60 (2007) 530e534 BRIEF REPORT Studies reporting ROC curves of diagnostic and prediction data can be incorporated into meta-analyses using corresponding odds ratios S.D.

More information

Multivariate Mixed-Effects Meta-Analysis of Paired-Comparison Studies of Diagnostic Test Accuracy

Multivariate Mixed-Effects Meta-Analysis of Paired-Comparison Studies of Diagnostic Test Accuracy Multivariate Mixed-Effects Meta-Analysis of Paired-Comparison Studies of Diagnostic Test Accuracy Ben A. Dwamena, MD The University of Michigan & VA Medical Centers, Ann Arbor SNASUG - July 24, 2008 B.A.

More information

When Overlapping Unexpectedly Alters the Class Imbalance Effects

When Overlapping Unexpectedly Alters the Class Imbalance Effects When Overlapping Unexpectedly Alters the Class Imbalance Effects V. García 1,2, R.A. Mollineda 2,J.S.Sánchez 2,R.Alejo 1,2, and J.M. Sotoca 2 1 Lab. Reconocimiento de Patrones, Instituto Tecnológico de

More information

SEED HAEMATOLOGY. Medical statistics your support when interpreting results SYSMEX EDUCATIONAL ENHANCEMENT AND DEVELOPMENT APRIL 2015

SEED HAEMATOLOGY. Medical statistics your support when interpreting results SYSMEX EDUCATIONAL ENHANCEMENT AND DEVELOPMENT APRIL 2015 SYSMEX EDUCATIONAL ENHANCEMENT AND DEVELOPMENT APRIL 2015 SEED HAEMATOLOGY Medical statistics your support when interpreting results The importance of statistical investigations Modern medicine is often

More information

Net Reclassification Risk: a graph to clarify the potential prognostic utility of new markers

Net Reclassification Risk: a graph to clarify the potential prognostic utility of new markers Net Reclassification Risk: a graph to clarify the potential prognostic utility of new markers Ewout Steyerberg Professor of Medical Decision Making Dept of Public Health, Erasmus MC Birmingham July, 2013

More information

Receiver operating characteristics curves and related decision measures: A tutorial

Receiver operating characteristics curves and related decision measures: A tutorial Chemometrics and Intelligent Laboratory Systems 8 (26) 24 38 www.elsevier.com/locate/chemolab Receiver operating characteristics curves and related decision measures: A tutorial Christopher D. Brown a,

More information

Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes

Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes Performance Evaluation of Machine Learning Algorithms in the Classification of Parkinson Disease Using Voice Attributes J. Sujatha Research Scholar, Vels University, Assistant Professor, Post Graduate

More information

OCW Epidemiology and Biostatistics, 2010 Michael D. Kneeland, MD November 18, 2010 SCREENING. Learning Objectives for this session:

OCW Epidemiology and Biostatistics, 2010 Michael D. Kneeland, MD November 18, 2010 SCREENING. Learning Objectives for this session: OCW Epidemiology and Biostatistics, 2010 Michael D. Kneeland, MD November 18, 2010 SCREENING Learning Objectives for this session: 1) Know the objectives of a screening program 2) Define and calculate

More information

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections New: Bias-variance decomposition, biasvariance tradeoff, overfitting, regularization, and feature selection Yi

More information

Technical Specifications

Technical Specifications Technical Specifications In order to provide summary information across a set of exercises, all tests must employ some form of scoring models. The most familiar of these scoring models is the one typically

More information

The recommended method for diagnosing sleep

The recommended method for diagnosing sleep reviews Measuring Agreement Between Diagnostic Devices* W. Ward Flemons, MD; and Michael R. Littner, MD, FCCP There is growing interest in using portable monitoring for investigating patients with suspected

More information

COMPARATIVE STUDY ON FEATURE EXTRACTION METHOD FOR BREAST CANCER CLASSIFICATION

COMPARATIVE STUDY ON FEATURE EXTRACTION METHOD FOR BREAST CANCER CLASSIFICATION COMPARATIVE STUDY ON FEATURE EXTRACTION METHOD FOR BREAST CANCER CLASSIFICATION 1 R.NITHYA, 2 B.SANTHI 1 Asstt Prof., School of Computing, SASTRA University, Thanjavur, Tamilnadu, India-613402 2 Prof.,

More information

SEM: the precision of the mean of the sample in predicting the population parameter and is a way of relating the sample mean to population mean

SEM: the precision of the mean of the sample in predicting the population parameter and is a way of relating the sample mean to population mean 1999b(9)/1997a(14)/1995b(17): What is meant by 95% confidence interval? Explain the practical applications of CIs and indicate why they may be preferred to P values General: 95% CI defines the range of

More information

Investigating the robustness of the nonparametric Levene test with more than two groups

Investigating the robustness of the nonparametric Levene test with more than two groups Psicológica (2014), 35, 361-383. Investigating the robustness of the nonparametric Levene test with more than two groups David W. Nordstokke * and S. Mitchell Colp University of Calgary, Canada Testing

More information

Comparing disease screening tests when true disease status is ascertained only for screen positives

Comparing disease screening tests when true disease status is ascertained only for screen positives Biostatistics (2001), 2, 3,pp. 249 260 Printed in Great Britain Comparing disease screening tests when true disease status is ascertained only for screen positives MARGARET SULLIVAN PEPE, TODD A. ALONZO

More information

Diagnostic methods 2: receiver operating characteristic (ROC) curves

Diagnostic methods 2: receiver operating characteristic (ROC) curves abc of epidemiology http://www.kidney-international.org & 29 International Society of Nephrology Diagnostic methods 2: receiver operating characteristic (ROC) curves Giovanni Tripepi 1, Kitty J. Jager

More information

Assessment of performance and decision curve analysis

Assessment of performance and decision curve analysis Assessment of performance and decision curve analysis Ewout Steyerberg, Andrew Vickers Dept of Public Health, Erasmus MC, Rotterdam, the Netherlands Dept of Epidemiology and Biostatistics, Memorial Sloan-Kettering

More information

International Journal of Engineering Trends and Applications (IJETA) Volume 4 Issue 2, Mar-Apr 2017

International Journal of Engineering Trends and Applications (IJETA) Volume 4 Issue 2, Mar-Apr 2017 RESEARCH ARTICLE OPEN ACCESS Knowledge Based Brain Tumor Segmentation using Local Maxima and Local Minima T. Kalaiselvi [1], P. Sriramakrishnan [2] Department of Computer Science and Applications The Gandhigram

More information

Choosing the Correct Statistical Test

Choosing the Correct Statistical Test Choosing the Correct Statistical Test T racie O. Afifi, PhD Departments of Community Health Sciences & Psychiatry University of Manitoba Department of Community Health Sciences COLLEGE OF MEDICINE, FACULTY

More information

Methods Research Report. An Empirical Assessment of Bivariate Methods for Meta-Analysis of Test Accuracy

Methods Research Report. An Empirical Assessment of Bivariate Methods for Meta-Analysis of Test Accuracy Methods Research Report An Empirical Assessment of Bivariate Methods for Meta-Analysis of Test Accuracy Methods Research Report An Empirical Assessment of Bivariate Methods for Meta-Analysis of Test Accuracy

More information