Week 17 and 21 Comparing two assays and Measurement of Uncertainty Explain tools used to compare the performance of two assays, including

Similar documents
Business Statistics Probability

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

It is a requirement of ISO that laboratories shall determine the uncertainty of measurement of results, where relevant and possible. (5.6.

Guide to Fulfillment of Measurement Uncertainty

Still important ideas

METHOD VALIDATION: WHY, HOW AND WHEN?

Correlation and regression

Assessing Agreement Between Methods Of Clinical Measurement

Readings: Textbook readings: OpenStax - Chapters 1 13 (emphasis on Chapter 12) Online readings: Appendix D, E & F

Unit 1 Exploring and Understanding Data

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Simple Linear Regression the model, estimation and testing

IAPT: Regression. Regression analyses

Farouk AL Quadan lecture 3 Implentation of

bivariate analysis: The statistical analysis of the relationship between two variables.

Introduction On Assessing Agreement With Continuous Measurement

10. LINEAR REGRESSION AND CORRELATION

Method Comparison Report Semi-Annual 1/5/2018

Chapter 3 CORRELATION AND REGRESSION

WDHS Curriculum Map Probability and Statistics. What is Statistics and how does it relate to you?

Chapter 1: Exploring Data

Online Supplementary Appendix

10 Intraclass Correlations under the Mixed Factorial Design

Statistical Considerations: Study Designs and Challenges in the Development and Validation of Cancer Biomarkers

Political Science 15, Winter 2014 Final Review

Statistical techniques to evaluate the agreement degree of medicine measurements

An update on the analysis of agreement for orthodontic indices

CHAPTER 3 RESEARCH METHODOLOGY

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

Measuring the User Experience

Understandable Statistics

UNIVERSITY OF THE FREE STATE DEPARTMENT OF COMPUTER SCIENCE AND INFORMATICS CSIS6813 MODULE TEST 2

EPIDEMIOLOGY. Training module

GUIDE TO THE EVALUATION OF COMMUTABILITY OF CONTROL MATERIALS

Regression CHAPTER SIXTEEN NOTE TO INSTRUCTORS OUTLINE OF RESOURCES

Addressing error in laboratory biomarker studies

Evidence-Based Medicine Journal Club. A Primer in Statistics, Study Design, and Epidemiology. August, 2013

3 CONCEPTUAL FOUNDATIONS OF STATISTICS

QUANTIFYING THE EFFECT OF SETTING QUALITY CONTROL STANDARD DEVIATIONS GREATER THAN ACTUAL STANDARD DEVIATIONS ON WESTGARD RULES

A validation study of after reconstitution stability of diabetes: level 1 and diabetes level 2 controls

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Forum for Collaborative HIV Research External Validation of CD4 and Viral Load Assays Paris, France June 29, 2007

Psychology, 2010, 1: doi: /psych Published Online August 2010 (

STATISTICS AND RESEARCH DESIGN

Examining Relationships Least-squares regression. Sections 2.3

Performance Characteristics of a New Single-Sample Freezing Point Depression Osmometer

On the purpose of testing:

11/24/2017. Do not imply a cause-and-effect relationship

Still important ideas

Annual Report ERNDIM-EQAS Quantitative Amino Acids 2004

1.4 - Linear Regression and MS Excel

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

Performance Guide. icheck Iodine The test kit to measure iodine in salt

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions

Clincial Biostatistics. Regression

Understanding Quality Control for Infectious Disease Testing. Wayne Dimech ASLM Quality Control Workshop Abuja, Nigeria 9 th December 2018

STATISTICS 8 CHAPTERS 1 TO 6, SAMPLE MULTIPLE CHOICE QUESTIONS

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

Section 6: Analysing Relationships Between Variables

Study Guide for the Final Exam

1 PROTOCOL. Comparison Study Summary

(a) y = 1.0x + 0.0; r = ; N = 60 (b) y = 1.0x + 0.0; r = ; N = Lot 1, Li-heparin whole blood, HbA1c (%)

Annual Report ERNDIM-EQAS Quantitative Amino Acids 2002

INTERNAL QUALITY CONTROL ( Q I C) QC)

Four rules to remember: 1) Addition, subtraction, multiplication and division of SD and/or CV.

CHAPTER ONE CORRELATION

appstats26.notebook April 17, 2015

ExperimentalPhysiology

Differential Item Functioning

REVIEW ARTICLE. A Review of Inferential Statistical Methods Commonly Used in Medicine

Regression Including the Interaction Between Quantitative Variables

Chapter 1: Introduction to Statistics

UNIT V: Analysis of Non-numerical and Numerical Data SWK 330 Kimberly Baker-Abrams. In qualitative research: Grounded Theory

SCATTER PLOTS AND TREND LINES

The analytical phase

Readings: Textbook readings: OpenStax - Chapters 1 11 Online readings: Appendix D, E & F Plous Chapters 10, 11, 12 and 14

Statistical Tools in Biology

Research Designs and Potential Interpretation of Data: Introduction to Statistics. Let s Take it Step by Step... Confused by Statistics?

Results. NeuRA Hypnosis June 2016

Statistical analysis DIANA SAPLACAN 2017 * SLIDES ADAPTED BASED ON LECTURE NOTES BY ALMA LEORA CULEN

Before we get started:

Choosing a Significance Test. Student Resource Sheet

1 PROTOCOL. Comparison Study Summary. Springs Memorial Hospital. Springs Memorial Hospital 800 W. Meeting St. Lancaster, SC (803)

Statistical Significance, Effect Size, and Practical Significance Eva Lawrence Guilford College October, 2017

Pitfalls in Linear Regression Analysis

12/31/2016. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective decisions

Overview of Non-Parametric Statistics

Using WRT and control charts to evaluate analytical precision

SEM: the precision of the mean of the sample in predicting the population parameter and is a way of relating the sample mean to population mean

What information on measurement uncertainty should be communicated to clinicians, and how? Mario Plebani

NORTH SOUTH UNIVERSITY TUTORIAL 2

PTHP 7101 Research 1 Chapter Assignments

MODULE S1 DESCRIPTIVE STATISTICS

Evaluation of the analytical performances of six measurands for thyroid functions of Mindray CL-2000i system


CHAPTER - 6 STATISTICAL ANALYSIS. This chapter discusses inferential statistics, which use sample data to

Statistical questions for statistical methods

BIOSTATISTICS. Dr. Hamza Aduraidi

Transcription:

Week 17 and 21 Comparing two assays and Measurement of Uncertainty 2.4.1.4. Explain tools used to compare the performance of two assays, including 2.4.1.4.1. Linear regression 2.4.1.4.2. Bland-Altman plots 2.4.1.4.3. Kappa statistic 2.4.1.4.4. Chi-squared analyses Measurement of Uncertainty (MOU) 2.4.3.1. Explain the principles behind themou 2.4.3.2. Determination the MOU in appropriate assays 2.4.3.3. Explain the clinical relevance of MOU in specific assays with regard to patient monitoring Questions: 1) You are planning to replace your dsdna assay (currently RIA) with an ALBIA based method (you have been given a second hand instrument from another lab). You run 10 samples in parallel. How would you compare these assays? What do you need to consider about your data before choosing an appropriate statistical method? There are a number of ways to compare the assays. Visually using a scatter plot or a difference plot (Bland Altman is a type of difference plot) You can compare them statistically using linear regression, kappa statisitcs, percentage agreement or chi-squared. You need to determine the nature of your data to determine which method you can apply to your results eg are the variables continuous, ordinal or nominal? Linear regression Plot variable X against variable y. The line of best fit is drawn which minimizes the sum of the squared distance between the points and the line

The equation of the line is given as y = ax +b (a = slope, b = intercept) Coefficient of determination (r 2 ; r = correlation coefficient) Determines the strength of the relationship between the two variables Example:

In this example the sum of the squared deviations of the predicted Y (using line of best fit is 10.8 which is 10% of 108 therefore 90% of the variation of y is accounted for by x. Therefore r 2 = is 0.9 Chi squared test The Chi squared test for independence sets up a null hypothesis, being that there is no relationship between two assays. Under this assumption, there should be no clear relationship between categorisation of samples as being positive or negative. This is used to generate an expected value for each cell of a 2x2 contingency table. This value is subtracted from the observed value and the square of the difference used to generate a test statistic which is then compared to the Chi squared distribution. If the observed values deviate sufficiently from the expected values, then the null hypothesis is rejected and the alternative hypothesis is accepted i.e. that the classification of samples is not independent. This suggests an agreement between two different assays. Kappa statistic See question 4 2) You plot your data using a scatter plot. You are very pleased with the results!! What is the difference between correlation and agreement? How could a bias be relevant in this context? Correlation tells you whether there is a relationship between the two results (ie they increase/decrease proportionally) however they may not agree (ie give the same result) there may be a bias. Perfect correlation occurs if all points are along a straight line but perfect agreement if all points along the line of unity Perfect agreement suggests that they give the same result and they are interchangeable. If the bias is constant between the two assays, it may be reasonable to adjust for it, to provide agreement

RIA ALBIA 3) What is a Bland Altman plot? Explain how you would construct this for your dsdna data. What are the advantages of plotting your data this way (compared to the method above) Bland Altman plots chart provide useful information on assay bias. If x represents the values obtained on one test, and y represents the values obtained by another test, Bland Altman plots can be represented by (x+y/2, x-y). From these values the following can be calculated. a) The mean of the differences between each assay (i.e. bias). b) The 95% confidence limits of the differences (i.e. where 95% of the differences will lie) By the Bland Altman method, the two tests are then interchangeable if the following conditions are satisfied. a) There is no obvious relation between the difference and the mean of the two assays (i.e. the y value stays relatively constant for any x value). b) The magnitude of the variability in the difference between the two assays is not clinically important NB. The presence of systematic bias is not problematic, as this value may be subtracted from the results provided by the new method if required.

4) Comment on the following plots a) RANDOM BIAS b) Proportional bias

C) Constant Which of these would you be willing to tolerate if your were changing your assay? Constant bias 4) Calculate (or explain how you would calculate) the kappa statistic for the following data: The kappa statistic quantifies percentage agreement but corrects for the agreement that would be expected due to chance. It is used for nominal/categorical variables

The kappa statistic is typically used to quantitate agreement between raters on a categorical variable. Based on chance alone, concordance would between two raters or tests on a categorical variable would be 50%. The kappa statistic attempts to provide a less biased quantitation of agreement. The formula for the kappa statistic is given by: k = po-pe/1-pe where po is the observed agreement (as a proportion) and pe is the expected agreement. This can be defined as the probability that raters will agree on ratings purely by chance. In this example, po is given by the (number of ratings that agree)/(total ratings) = (20+15)/50 = 0.7. pe is given by the probability that the assays will both provide a Pos rating by chance + the probability that the assays will provide a Neg rating by chance = 0.5 x 0.6 + 0.5 x 0.4 = 0.5. Therefore, the kappa statistic is 0.7-0.5/1-0.5 = 0.4. The kappa statistic varies between 0 & 1, where 1 is considered to indicate perfect agreement., 5) (Aspects taken shamelessly from the RCPA case studies) Jessica a 33 year old female presents to her immunologist for follow up of her SLE. She has been on plaquenil for the last 12 months. Recently she has been feeling fatigued and reporting increased joint pain but this occurs in the context of a recent viral illness. Her last dsdna performed 1 month prior was 5 IU/mL (RR <7). A repeat test taken 3 days ago shows a value of 9 IU/mL. Both samples were performed in the same laboratory using the same methodology. a) Are these values different from each other? What is meant by measurement of uncertainty? Ideally, how should this be calculated? No this variation may be accunted for the level of uncertainty in the assay. MoU is a quantitative estimate of a range where the true value may lie (usually 95% CI) based on the numerous sources of uncertainty in performance of the assay. Calculated MoU is a four step process 1) defining what will be measured

2) identifying all sources of uncertainty (pre analytical, analytical and post analytical) 3) quantifying the uncertainties of each of these sources 4) calculating the combined uncertainty b) What do we do in reality? We perform a best estimate of MoU using the CV of QC material (usually at least 30 runs). If these are done over a sufficient period of time, with different operators, under different conditions, this will usually encompass the different sources of uncertainty. The CV is multiplied by a coverage factor (2 or 2.77 depending on what you are trying to determine) to give MoU c) Which of the following assays would we be expected to calculate an MU? i) ANA pattern ii) total IgE levels iii) paraprotein quantitation iv) oligoclonal bands/isoelectric focussing d) what are sources of uncertainty that may affect this result? Pre analytical Differences in patient preparation, specimen collection, transport/storage, preparation of primary sample Analytical Uncertainty of the calibrator value and dispensed volumes, reagent and calibrator batch variations, equipment aging and maintenance, changing operators, environmental fluctuations Post analytical Reporting with appropriate number of significant figures etc e) Your MU calculations show a CV of 20% at low level 6-12 IU/ml and a CV of 35% at >150 Are her initial values different? No 5-9 =4/9 = 44% (cf 2.77x20= >55%) If Jessica was receiving rituximab for her disease and her clinician wanted to measure her disease activity, are results of 200 IU/mL and 400 IU/mL different? How much would they have to differ at this level for there to be 95% confidence that they are different? 200-400/400 = 50% (cf 2.77 x 35 = 96.95)

Measurement of Uncertainty Definition: ISO15189 (3.17): The uncertainty of measurement is a parameter associated with the result of a measurement that characterises the dispersion of the values that could be reasonably attributed to the measurement. All measurements have some inaccuracy due to imprecision and bias, and therefore measurement results can only be estimates of the values of the quantities being measured. In accordance with ISO 15189 the MoU must be calculated where relevant and possible: MoU will be calculated for all quantitative tests, including qualitative results derived from a numerical value. This standard does not apply to qualitative tests not derived from a numerical value. If it is decided that MoU is not relevant or impossible to estimate, then the reasoning behind this decision will be documented. The MoU will be provided to clinical users upon request, in accordance with ISO 15189. There are a couple of key documents (released by NPAAC and RCPA) which are useful to have a look over. Potential sources of error may occur at any step of the assay eg preanalytical, analytical and post analytical. Ideally MoU estimates identify and sum all sources of error to arrive at the cumulative MoU. In practice this is impossible! For the vast majority of procedures, random effects are the major contributors of MoU so a reasonable surrogate for combined MoU is imprecision (CV) using QC data. Estimates of MoU will include levels of the measurand at or near clinical decision values. There may be different MoUs at different ranges. It is also important (where possible) to incorporate bias data. This is usually obtained with comparison to a CRM. Unfortunately, this is not available for most immunoassays so a reasonable approximation are results from EQAP using an appropriate group mean as an assigned conventional reference value. Calculation of Measurement of Uncertainty: 1. Choose an appropriate quality control material with a measurable value at or near the clinical decision value. 2. Determine the coefficient of variation (CV)% for the measurand using data points from separate runs for each internal control:

CV (%) = Standard deviation (SD) / mean Measurement of Uncertainty = CV (%) x 2.77 (2 is also used as a coverage factor this is used to give 95% CI of the true value of a test; 2.77 is used to determine 95% CI that 2 different values are statistically different) There is a 95% chance that the true result lies within a range covered by result number ± the uncertainty of measurement For frequently performed tests (eg nephelometry, at least 30 data points from consecutive runs should be used; for less frequently performed tests, the number of data points may be limited by the stability of the controls used. In these circumstances, 20 data points is acceptable. These results should be collected across all routinely encountered events that are reasonably expected to have a detectable influence on the results produced (eg, calibrator and reagent batch changes, different operators, equipment maintenance, environmental fluctuations). Measurement of Uncertainty Goals: Acceptable levels of MoU will depend upon the clinical use of the test results, and may vary between tests. Basically, it is important to determine whether the test is fit for purpose for example if it is clinically important to be able to differentiate a value of 7.1 from 6.9, a MoU of 55% at this range is not acceptable and your assay is not fit for purpose. For certain analytes acceptable limits are available on the westgard website. A widely used and internationally recommended concept is to define the upper acceptable limit for imprecision as a proportion of the intra-individual biological variation of the analyte. This is not available for most immunoassays. Therefore, this can be calculated using EQAP data * Minimal: <0.75CVw 90th percentile EQAP ** Desirable: <0.5CVw or 50th percentile EQAP ***Optimal: <0.25 CVw or 25 percentile EQAP CVw = within subject biological variation

If MoU goals are not met major contributors to the combined source of uncertainty should be identified (eg sampling, sample preparation, calibrators, reference material, equipment, environmental conditions, operators etc)