Binary Diagnostic Tests Paired Samples

Similar documents
Binary Diagnostic Tests Two Independent Samples

Comparing Two ROC Curves Independent Groups Design

Contingency Tables Summer 2017 Summer Institutes 187

Psy201 Module 3 Study and Assignment Guide. Using Excel to Calculate Descriptive and Inferential Statistics

MULTIPLE LINEAR REGRESSION 24.1 INTRODUCTION AND OBJECTIVES OBJECTIVES

Lesson: A Ten Minute Course in Epidemiology

Multiple Linear Regression Analysis

Testing Means. Related-Samples t Test With Confidence Intervals. 6. Compute a related-samples t test and interpret the results.

Using SPSS for Correlation

Appendix: Instructions for Treatment Index B (Human Opponents, With Recommendations)

THIS PROBLEM HAS BEEN SOLVED BY USING THE CALCULATOR. A 90% CONFIDENCE INTERVAL IS ALSO SHOWN. ALL QUESTIONS ARE LISTED BELOW THE RESULTS.

BlueBayCT - Warfarin User Guide

4 Diagnostic Tests and Measures of Agreement

ROC Curves (Old Version)

ANOVA in SPSS (Practical)

BIOL 458 BIOMETRY Lab 7 Multi-Factor ANOVA

Sample Size Considerations. Todd Alonzo, PhD

Examining differences between two sets of scores

Daniel Boduszek University of Huddersfield

IBRIDGE 1.0 USER MANUAL

Fundamental Clinical Trial Design

Unit 1 Exploring and Understanding Data

Simple Sensitivity Analyses for Matched Samples Thomas E. Love, Ph.D. ASA Course Atlanta Georgia

Constructing a Bivariate Table:

Reflection Questions for Math 58B

Section 6: Analysing Relationships Between Variables

1. To review research methods and the principles of experimental design that are typically used in an experiment.

Charts Worksheet using Excel Obesity Can a New Drug Help?

Chapter 2 Organizing and Summarizing Data. Chapter 3 Numerically Summarizing Data. Chapter 4 Describing the Relation between Two Variables

Anticoagulation Manager - Getting Started

Warfarin Help Documentation

Part 8 Logistic Regression

Intro to SPSS. Using SPSS through WebFAS

PBSI-EHR Off the Charts!

One-Way Independent ANOVA

Chapter 13 Estimating the Modified Odds Ratio

Risk Ratio and Odds Ratio

To begin using the Nutrients feature, visibility of the Modules must be turned on by a MICROS Account Manager.

Excel Solver. Table of Contents. Introduction to Excel Solver slides 3-4. Example 1: Diet Problem, Set-Up slides 5-11

POPULATION TRACKER MIDS USER GUIDE

INRODUCTION TO TREEAGE PRO

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized)

CHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to

Managing Immunizations

The Clinical Information Data Entry Screen is the main screen in the DQCMS application.

To open a CMA file > Download and Save file Start CMA Open file from within CMA

Meta-analysis using RevMan. Yemisi Takwoingi October 2015

Hour 2: lm (regression), plot (scatterplots), cooks.distance and resid (diagnostics) Stat 302, Winter 2016 SFU, Week 3, Hour 1, Page 1

Day 11: Measures of Association and ANOVA

Dementia Direct Enhanced Service

PSYCH-GA.2211/NEURL-GA.2201 Fall 2016 Mathematical Tools for Cognitive and Neural Science. Homework 5

EPIDEMIOLOGY. Training module

Two-Way Independent ANOVA

Missy Wittenzellner Big Brother Big Sister Project

STP 231 Example FINAL

The Hospital Anxiety and Depression Scale Guidance and Information

Lessons in biostatistics

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

Exam 4 Review Exercises

Using the New Nutrition Facts Label Formats in TechWizard Version 5

GST: Step by step Build Diary page

Sheila Barron Statistics Outreach Center 2/8/2011

Tech Talk: Using the Lafayette ESS Report Generator

User Guide for Classification of Diabetes: A search tool for identifying miscoded, misclassified or misdiagnosed patients

25. Two-way ANOVA. 25. Two-way ANOVA 371

SAMPLE SIZE AND POWER

Q: How do I get the protein concentration in mg/ml from the standard curve if the X-axis is in units of µg.

To open a CMA file > Download and Save file Start CMA Open file from within CMA

Two-sample Categorical data: Measuring association

Basic Biostatistics. Chapter 1. Content

Appendix B. Nodulus Observer XT Instructional Guide. 1. Setting up your project p. 2. a. Observation p. 2. b. Subjects, behaviors and coding p.

BMI 541/699 Lecture 16

OneTouch Reveal Web Application. User Manual for Healthcare Professionals Instructions for Use

To review probability concepts, you should read Chapter 3 of your text. This handout will focus on Section 3.6 and also some elements of Section 13.3.

Benchmark Dose Modeling Cancer Models. Allen Davis, MSPH Jeff Gift, Ph.D. Jay Zhao, Ph.D. National Center for Environmental Assessment, U.S.

Determining the size of a vaccine trial

Name: emergency please discuss this with the exam proctor. 6. Vanderbilt s academic honor code applies.

Sanjay P. Zodpey Clinical Epidemiology Unit, Department of Preventive and Social Medicine, Government Medical College, Nagpur, Maharashtra, India.

GLOOKO FOR ios MIDS USER GUIDE

Midterm Exam MMI 409 Spring 2009 Gordon Bleil

Chapter 13: Introduction to Analysis of Variance

Biostatistics & SAS programming

Impact of Group Rejections on Self Esteem

- Decide on an estimator for the parameter. - Calculate distribution of estimator; usually involves unknown parameter

Problem set 2: understanding ordinary least squares regressions

Survival Skills for Researchers. Study Design

MS/MS Library Creation of Q-TOF LC/MS Data for MassHunter PCDL Manager

Estimating national adult prevalence of HIV-1 in Generalized Epidemics

POLS 5377 Scope & Method of Political Science. Correlation within SPSS. Key Questions: How to compute and interpret the following measures in SPSS

Decimal degree. 1:1,500,000 Area of collection

Market Research on Caffeinated Products

Measuring association in contingency tables


Step 3 Tutorial #3: Obtaining equations for scoring new cases in an advanced example with quadratic term

How to Conduct On-Farm Trials. Dr. Jim Walworth Dept. of Soil, Water & Environmental Sci. University of Arizona

NAÏVE BAYES CLASSIFIER AND FUZZY LOGIC SYSTEM FOR COMPUTER AIDED DETECTION AND CLASSIFICATION OF MAMMAMOGRAPHIC ABNORMALITIES

The Loss of Heterozygosity (LOH) Algorithm in Genotyping Console 2.0

Practical Bayesian Design and Analysis for Drug and Device Clinical Trials

UN Handbook Ch. 7 'Managing sources of non-sampling error': recommendations on response rates

WFS. User Guide. thinkwhere Glendevon House Castle Business Park Stirling FK9 4TZ Tel +44 (0) Fax +44 (0)

Transcription:

Chapter 536 Binary Diagnostic Tests Paired Samples Introduction An important task in diagnostic medicine is to measure the accuracy of two diagnostic tests. This can be done by comparing summary measures of diagnostic accuracy such as sensitivity or specificity using a statistical test. Often, you want to show that a new test is similar to another test, in which case you use an equivalence test. Or, you may wish to show that a new diagnostic test is not inferior to the existing test, so you use a noninferiority test. All of these hypothesis tests are available in this procedure for the important case when the diagnostic tests provide a binary (yes or no) result. Experimental Design Suppose you are interested in comparing the sensitivities of two diagnostic tests for a particular disease (or condition). Each test provides a binary (yes or no) response. Further suppose you draw a random sample of subjects from the population with the disease and administered both diagnostic tests to each subject in random order. Assume that Test 1 is a new (experimental or treatment) test that will replace Test 2, the existing (standard or reference) test, if it is found to be better. The results of such a study can be displayed in a 2-by-2 table in which the Test 1 result is shown as the rows and the Test 2 result is shown as the columns. Test 2 Result Test 1 Result Positive Negative Total Positive X11 X10 m1 Negative X01 X00 m0 Total n1 n0 N Data such as this can be analyzed using standard techniques for comparing two correlated proportions which are presented in the chapter on Two Correlated Proportions. Such a table was originally analyzed using McNemar s Test. However, procedures with better statistical properties have recently been proposed. See for example Nam and Blackwelder (2002). Sensitivity Sensitivity is the proportion of those that have the condition for which the diagnostic test is positive. Since this design assumes that the subjects come from the population of individuals with the disease, the sensitivity can be calculated. 536-1

Specificity Specificity is the proportion of those that do not have the condition for which the diagnostic test is negative. To study specificity, a separate study would have to be conducted in which subjects were drawn from the population of individuals without the disease. The data from a such a study could be analyzed with this procedure by changing the meaning of positive and negative. Instead of positive meaning that the person had the disease, positive would mean that the diagnostic test result matched the true condition of the subject. Likewise, negative would mean that the diagnostic test result did not match the true condition. In the procedure printouts, you would substitute specificity for sensitivity. Comparing Sensitivity and Specificity Suppose you arrange the results of two diagnostic tests into two 2-by-2 tables as follows: Test 2 Result Test 1 Result Positive Negative Total Positive X11 X10 m1 Negative X01 X00 m0 Total n1 n0 N Hence, the study design include N = N1 + N0 patients. The hypotheses of interest when comparing the sensitivities (Se) of two diagnostic tests are either the difference hypotheses HO: Se1 Se2 = 0 versus H A: Se1 Se2 0 or the ratio hypothesis H : O Se 1/ Se 2 = 1 versus H : A Se 1/ Se 2 1 Similar sets of hypotheses may be defined for the difference or ratio of the specificities (Sp) as H : O Sp 1 Sp 2 = 0 versus H : A Sp 1 Sp 2 0 and H : O Sp 1/ Sp 2 = 1 versus H : A Sp 1/ Sp 2 1 Note that the difference hypotheses usually require a smaller sample size for comparable statistical power, but the ratio hypotheses may be more convenient. The sensitivities are estimated as m Se1 ˆ 1 = and N n Se2 ˆ 1 = N The sensitivities of the two diagnostic tests may be compared using either their differences or their ratios. Hence, the comparison of the sensitivity reduces to the problem of comparing two correlated binomial proportions. The formulas used for hypothesis testing and confidence intervals are the same as presented in the chapter on testing two correlated proportions. We refer you to that chapter for further details. 536-2

Data Structure This procedure does not use data from a dataset. Instead, you enter the values directly into the 2-by-2 table on the panel. Procedure Options This section describes the options available in this procedure. Data Tab Enter the data values directly on this panel. Data Values X11 This is the number of patients that responded positively to both diagnostic tests. The value entered must be a nonnegative number. X10 This is the number of patients that tested positive using Test 1, but negative using Test 2. The value entered must be a non-negative number. X01 This is the number of patients that tested negative using Test 1, but positive using Test 2. The value entered must be a non-negative number. X00 This is the number of patients that responded negatively to both diagnostic tests. The value entered must be a nonnegative number. Confidence Interval Method Difference C.I. Method This option specifies the method used to calculate the confidence intervals of the sensitivity differences. These methods are documented in detail in the Two Correlated Proportions chapter. We recommend the score method proposed by Nam (1990). Ratio C.I. Method This option specifies the method used to calculate the confidence intervals of the sensitivity ratios. These methods are documented in detail in the Two Correlated Proportions chapter. The recommended method is score method proposed by Nam and Blackwelder (2002). Report Options Alpha Confidence Intervals The confidence coefficient to use for calculating the confidence limits in proportions. 100 x (1 - alpha)% confidence limits will be calculated. This must be a value between 0 and 0.5. The most common choice is 0.05. 536-3

Alpha Hypothesis Tests This is the significance level of the hypothesis tests, including the equivalence and noninferiority tests. Typical values are between 0.01 and 0.10. The most common choice is 0.05. Proportion Decimals The number of digits to the right of the decimal place to display when showing proportions on the reports. Probability Decimals The number of digits to the right of the decimal place to display when showing probabilities on the reports. Equivalence or Non-Inferiority Settings Max Equivalence Difference This is the largest value of the difference between the two sensitivities that will still result in the conclusion of equivalence. When running equivalence tests, this value is crucial since it defines the interval of equivalence. Usually, this value is between 0.01 and 0.20. Note that this value must be a positive number. Max Equivalence Ratio This is the largest value of the ratio of the two sensitivities that will still result in the conclusion of diagnostic equivalence. When running equivalence tests, this value is crucial since it defines the interval of equivalence. Usually, this value is between 1.05 and 2.0. Note that this value must be greater than one. Example 1 Binary Diagnostic Test of Paired Samples This section presents an example of how to enter data and run an analysis. In this example, a sample of 50 individuals known to have a certain disease was selected. For this study, Test 1 refers to a new, cheaper, lessinvasive diagnostic test and Test 2 refers to the standard diagnostic test that is currently being used. The results are summarized into the following table: Test 2 Result Test 1 Result Positive Negative Total Positive 31 5 36 Negative 4 10 14 Total 35 15 50 You may follow along here by making the appropriate entries or load the completed template Example 1 by clicking on Open Example Template from the File menu of the window. 1 Open the window. Using the Analysis menu or the Procedure Navigator, find and select the Binary Diagnostic Tests - Paired Samples procedure. On the menus, select File, then New Template. This will fill the procedure with the default template. 536-4

2 Enter the data. Select the Data tab. In the X11 box, enter 31. In the X10 box, enter 5. In the X01 box, enter 4. In the X00 box, enter 10. 3 Set the other options. Set the Difference C.I. Method to Score (Nam RMLE). Set the Ratio C.I. Method to Score (Nam Blackwelder) Set the Max Equivalence Difference to 0.2. Set the Max Equivalence Ratio to 1.25. 4 Run the procedure. From the Run menu, select Run Procedure. Alternatively, just click the green Run button. Data and Proportions Counts Table Proportions Test 1 (New) Test 2 (Standard) Result Test 2 (Standard) Result Result Positive Negative Total Positive Negative Total Positive 31 5 36 0.6200 0.1000 0.7200 Negative 4 10 14 0.0800 0.2000 0.2800 Total 35 15 50 0.7000 0.3000 1.0000 Row Proportions Column Proportions Test 1 (New) Test 2 (Standard) Result Test 2 (Standard) Result Result Positive Negative Total Positive Negative Total Positive 0.8611 0.1389 1.0000 0.8857 0.3333 0.7200 Negative 0.2857 0.7143 1.0000 0.1143 0.6667 0.2800 Total 0.7000 0.3000 1.0000 1.0000 1.0000 1.0000 These reports display the counts that were entered along with various proportions that make interpreting the table easier. Note that Test 1 s sensitivity of 0.7200 and Test 2 s sensitivity of 0.7000 are displayed in the margins of the Table Proportions table. Sensitivity Confidence Intervals Lower 95.0% Upper 95.0% Statistic Test Value Conf. Limit Conf. Limit Sensitivity (Se1) 1 0.7200 0.5833 0.8253 Sensitivity (Se2) 2 0.7000 0.5625 0.8090 Difference (Se1-Se2) 0.0200-0.1094 0.1511 Ratio (Se1/Se2) 1.0286 0.8524 1.2491 Sensitivity: proportion of those that actually have the condition for which the diagnostic test is positive. Difference confidence limits based on Nam's RMLE method. Ratio confidence limits based on Blackwelder and Nam's method. This report displays the sensitivity for each test as well as corresponding confidence interval. It also shows the value and confidence interval for the difference and ratio of the sensitivity. Note that for a perfect diagnostic test, the sensitivity would be one. Hence, the larger the values the better. Note that the type of confidence interval for the difference and ratio is specified on the Data panel. 536-5

Confidence Intervals for the Odds Ratio Lower 95.0% Upper 95.0% Statistic Value Conf. Limit Conf. Limit Exact Conditional Binomial 1.2500 0.2690 6.2995 Maximum Likelihood 1.2500 0.3357 4.6549 Odds Ratio = Odds(True Condition = +) / Odds(True Condition = -) where Odds(Condition) = P(Positive Test Condition) / P(Negative Test Condition) This report displays estimates of the odds ratio as well as its confidence interval. Hypothesis Tests about Sensitivity Difference Null Test Conclusion Test Test Hypothesis Statistic Prob at the 5.0% Name Sides (H0) Value Level Level Nam 2 Se1-Se2=0 0.1111 0.7389 Cannot Reject H0 Nam Lower 1 Se1-Se2<=0 0.3333 0.3694 Cannot Reject H0 Nam Upper 1 Se1-Se2>=0 0.3333 0.6306 Cannot Reject H0 This report displays the results of hypothesis tests comparing the sensitivities of the two diagnostic tests using Nam s test. Note that for this test, identical test results are obtained from either the test of differences or test of ratios. Tests of Equivalence Reject H0 Lower Upper and Conclude 90.0% 90.0% Lower Upper Equivalence Prob Conf. Conf. Equiv. Equiv. at the 5.0% Statistic Level Limit Limit Bound Bound Significance Level Difference (Se1-Se2) 0.0051-0.0859 0.1274-0.2000 0.2000 Yes Ratio (Se1/Se2) 0.0247 0.8833 1.2039 0.8000 1.2500 Yes Equivalence is concluded when the confidence limits fall completely inside the equivalence bounds. Difference confidence limits based on Nam's RMLE method. Ratio confidence limits based on Blackwelder and Nam's method. This report displays the results of the equivalence tests of sensitivity, one based on the difference and the other based on the ratio. Equivalence is concluded if the confidence limits are inside the equivalence bounds. Prob Level The probability level is the smallest value of alpha that would result in rejection of the null hypothesis. It is interpreted as any other significance level. That is, reject the null hypothesis when this value is less than the desired significance level. Note that for many types of confidence limits, a closed form solution for this value does not exist and it must be searched for. Confidence Limits These are the lower and upper confidence limits calculated using the method you specified. Note that for equivalence tests, these intervals use twice the alpha. Hence, for a 5% equivalence test, the confidence coefficient is 0.90, not 0.95. 536-6

Lower and Upper Bounds These are the equivalence bounds. Values of the difference (ratio) inside these bounds are defined as being equivalent. Note that this value does not come from the data. Rather, you have to set it. These bounds are crucial to the equivalence test and they should be chosen carefully. Reject H0 and Conclude Equivalence at the 5% Significance Level This column gives the result of the equivalence test at the stated level of significance. Note that when you reject H0, you can conclude equivalence. However, when you do not reject H0, you cannot conclude nonequivalence. Instead, you conclude that there was not enough evidence in the study to reject the null hypothesis. Tests Showing the Sensitivity Non-inferiority of Test 1 Compared to Test 2 Reject H0 Lower Upper and Conclude 90.0% 90.0% Lower Upper Non-inferiority Prob Conf. Conf. Equiv. Equiv. at the 5.0% Statistic Level Limit Limit Bound Bound Significance Level Diff. (Se1-Se2) 0.0011-0.0859 0.1274-0.2000 0.2000 Yes Ratio (Se1/Se2) 0.0067 0.8833 1.2039 0.8000 1.2500 Yes H0: The Sensitivity of Test 1 is inferior to Test 2. Ha: The Sensitivity of Test 1 is non-inferior to Test 2. The non-inferiority of Test 1 compared to Test 2 is concluded when the lower c.l. > lower bound. Difference confidence limits based on Nam's RMLE method. Ratio confidence limits based on Blackwelder and Nam's method. This report displays the results of two non-inferiority tests of sensitivity, one based on the difference and the other based on the ratio. Report definitions are identical with those above for equivalence. 536-7