Clinical experience and pre-test probability scores in the diagnosis of pulmonary embolism

Similar documents
From the Departments of Medicine, University of Ottawa, Ottawa, Canada, McMaster University, Hamilton, Canada, Dalhousie University, Halifax, Canada

Citation for published version (APA): Douma, R. A. (2010). Pulmonary embolism: advances in diagnosis and prognosis

ORIGINAL INVESTIGATION. Assessing Clinical Probability of Pulmonary Embolism in the Emergency Ward. tools (leg vein ultrasonography 1-4 and plasma

Citation for published version (APA): Mac Gillavry, M. R. (2001). Some understanding of diagnostic tests for pulmonary embolism

Comparison of a Clinical Probability Estimate and Two Clinical Models in Patients with Suspected Pulmonary Embolism

ORIGINAL INVESTIGATION

ORIGINAL INVESTIGATION

Chapter 3. Simplification of the revised Geneva score for assessing clinical probability of pulmonary embolism

Usefulness of D-dimer, blood gas, and respiratory rate measurements for excluding pulmonary embolism

Citation for published version (APA): Mac Gillavry, M. R. (2001). Some understanding of diagnostic tests for pulmonary embolism

The diagnosis and prognosis of venous thromboembolism : variations on a theme Gibson, N.S.

Clinical Guide - Suspected PE (Reviewed 2006)

Differences in clinical presentation of pulmonary embolism in women and men

Predictive Accuracy of Revised Geneva Score in the Diagnosis of Pulmonary Embolism

Cover Page. The handle holds various files of this Leiden University dissertation.

The Modified Wells Score Accurately Excludes Pulmonary Embolus in Hospitalized Patients Receiving Heparin Prophylaxis

Citation for published version (APA): Douma, R. A. (2010). Pulmonary embolism: advances in diagnosis and prognosis

SAFETY OF A PULMONARY EMBOLISM AMBULATORY TREATMENT PROGRAM

Hayden Smith, PhD, MPH /\ v._

The value of clinical features in the diagnosis of acute pulmonary embolism: systematic review and meta-analysis

C-reactive protein and D-dimer with clinical probability score in the exclusion of pulmonary embolism

Simplified approach to investigation of suspected VTE

M ortality from pulmonary embolic disease has

Citation for published version (APA): Douma, R. A. (2010). Pulmonary embolism: advances in diagnosis and prognosis

ACR Appropriateness Criteria Suspected Lower Extremity Deep Vein Thrombosis EVIDENCE TABLE

Pulmonary Embolism. Pulmonary Embolism. Pulmonary Embolism. PE - Clinical

Diagnostic Algorithms in VTE

Department of Internal medicine, Sahukat Khanum Memorial Cancer Hospital and Research Center, Lahore, Pakistan 2

Computed tomography pulmonary angiogram as a result of medical emergency team calls: a 5-year retrospective audit

BACKGROUND METHODS RESULTS CONCLUSIONS

Deep Vein Thrombosis: Can a Second Sonographic Examination Be Avoided?

Usefulness of Clinical Pre-test Scores for a Correct Diagnostic Pathway in Patients with Suspected Pulmonary Embolism in Emergency Room

Chapter 1. Introduction

suspected deep-vein thrombosis is a common condition, with a lifetime cumulative incidence of 2 to 5 percent. Untreated deep-vein thrombosis can resul

Pulmonary embolism (PE) remains a diagnostic challenge,

Pulmonary embolism: assessment and imaging

Scanning the Literature

ORIGINAL INVESTIGATION. The Impact of the Introduction of a Rapid D-Dimer Assay on the Diagnostic Evaluation of Suspected Pulmonary Embolism

Proper Diagnosis of Venous Thromboembolism (VTE)

Pulmonary embolism (PE) can be considered in patients with a wide

Radiation Exposure in Pregnancy. John R. Mayo UNIVERSITY OF BRITISH COLUMBIA

Epidermiology Early pulmonary embolism

Research Article The Use of Computed Tomography of Pulmonary Angiogram in a District Hospital

Pulmonary embolism? A rapid disposition can be a matter of life or death.

The Pulmonary Embolism Severity Index in Predicting the Prognosis of Patients With Pulmonary Embolism

Audit of CT Pulmonary Angiogram in suspected pulmonary embolism patients

Prospective Validation of Wells Criteria in the Evaluation of Patients With Suspected Pulmonary Embolism

Audit of CT Pulmonary Angiogram in suspected pulmonary embolism patients

FOR CMS (MEDICARE) MEMBERS ONLY NATIONAL COVERAGE DETERMINATION (NCD) FOR COMPUTED TOMOGRAPHY:

ORIGINAL INVESTIGATION

Does This Patient Have Pulmonary Embolism?

Clinically Suspected Acute Recurrent Pulmonary Embolism: A Diagnostic Challenge

Diagnosis of Deep Venous Thrombosis and Pulmonary Embolism

The new ARDS definitions: what does it mean?

Supplementary Online Content

The diagnosis and prognosis of venous thromboembolism : variations on a theme Gibson, N.S.

Triage D-Dimer Test. Pulmonary embolism? A rapid disposition can be a matter of life or death.

VTE General Background

Ultrasonography and Diagnosis of Venous Thromboembolism

New Criteria for Ventilation-Perfusion Lung Scan Interpretation: A Basis for Optimal Interaction with Helical CT Angiography 1

University of Groningen

Citation for published version (APA): Mac Gillavry, M. R. (2001). Some understanding of diagnostic tests for pulmonary embolism

Underuse of risk assessment and overuse of CTPA in patients with suspected pulmonary thromboembolism

Too much medicine and venous thromboembolism: How can we make things Well again?

present in some 20% of the patients with angiographically documented PE. On the contrary, McFarlane and Imperiale [4] found a normal P(A a)o 2

In recent years, extensive research has been devoted to

Pulmonary embolism. Pulmonary embolism (PE) is an uncommon but. Assessment and management. Pathophysiology. Aetiology. Shortness of breath THEME

Cover Page. The handle holds various files of this Leiden University dissertation

Author(s): Rockefeller A. Oteng, M.D., University of Michigan

The Role of D-Dimer in the Diagnosis of Venous Thromboembolism

Scintigraphic Lung Scans and Clinical Assessment in Critically Ill Patients With Suspected Acute Pulmonary Embolism*

Pulmonary embolism rule-out criteria (PERC) in pulmonary embolism revisited: A systematic review and meta-analysis

MATERIALS AND METHODS

CURRENT & FUTURE THERAPEUTIC MANAGEMENT OF VENOUS THROMBOEMBOLISM. Gordon Lowe Professor of Vascular Medicine University of Glasgow

In practice, we routinely rely on our clinical skills

The Evaluation of Suspected Pulmonary Embolism

Provider Led Entity. CDI Quality Institute PLE Chest / Pulmonary Embolus AUC 07/31/2018

Deep vein thrombosis and its prevention in critically ill adults Attia J, Ray J G, Cook D J, Douketis J, Ginsberg J S, Geerts W H

Diagnosis and Treatment of Deep Venous Thrombosis and Pulmonary Embolism

ORIGINAL INVESTIGATION. The Bedside Investigation of Pulmonary Embolism Diagnosis Study

Most primary care patients with suspected

Cover Page. The handle holds various files of this Leiden University dissertation.

Clinical probability score and D-dimer estimation lack utility in the diagnosis of childhood pulmonary embolism

Alveolar-Arterial Oxygen Gradient in the Assessment of Acute Pulmonary Embolism*

Appendix to Gibson et al. Application of a decision rule and a D-dimer assay in the

Diagnosis of Venous Thromboembolism

Pulmonary Embolism Rule-out Criteria - Is it good enough?

Assessment of the safety using age adjusted D-dimer to rule out venous thromboembolism in a Swedish Emergency Department

Adjustments in the diagnostic work-up, treatment and prognosis of pulmonary embolism van Es, Josien

Evidence Based Medicine: Articles of Diagnosis

Cover Page. The handle holds various files of this Leiden University dissertation

What You Should Know

Clinical prediction rules for pulmonary embolism: a systematic review and meta-analysis

A low probability interpretation of a ventilation/

The clinical diagnosis of pulmonary embolism is

Qualitative point-of-care D-dimer testing compared with quantitative D-dimer testing in excluding pulmonary embolism in primary care

PTHP 7101 Research 1 Chapter Assignments

Transcription:

Q J Med 2003; 96:211 215 doi:10.1093/qjmed/hcg027 Clinical experience and pre-test probability scores in the diagnosis of pulmonary embolism S. ILES, A.M. HODGES, J.R. DARLEY, C. FRAMPTON 1,M.EPTON,L.E.L.BECKERT and G.I. TOWN From the Canterbury Respiratory Research Group and 1 Department of Medicine, Christchurch School of Medicine and Health Sciences, University of Otago, New Zealand Received 16 October 2002 and in revised form 9 December 2002 Summary Background: The Geneva and Wells pre-test probability scores are intended to replace empirical assessment of patients with suspected pulmonary embolism (PE). The effect of clinical experience on the inter-rater variability of these scores, and on empirical judgement, is unknown. Aim: To determine whether medical staff appointment grade affects the inter-rater variability of these pre-test probability scores, or empirical assessment, in patients with suspected PE. Design: Questionnaire survey. Methods: Doctors were grouped by grade (mean number of years since graduation"sem): house officers 0.7"0.2, registrars 6.3"0.6, consultants 25"4 and applied pre-test probability scores to actual case scenarios. Introduction The use of history and clinical signs in the diagnosis of pulmonary embolism (PE) is notoriously inaccurate, and clinical judgement alone can rarely confirm or refute the diagnosis of PE with certainty. 1 However, initial clinical assessment determines the need for imaging to objectively confirm or exclude PE. 2 5 The pre-test probability (PTP) of disease represents a formal assessment of the likelihood of disease before a confirmatory test, or investigation, is performed. The Prospective Investigation Of Pulmonary Embolism Diagnosis (PIOPED) investigators Results: The Geneva score was the most consistent method of determining pre-test probability and was unaffected by clinical experience (Geneva k = 0.73, Wells k = 0.38, empirical k = 0.23, p-0.001 ). With empirical judgement, inter-rater variability was inversely proportional to clinical experience (house officers k = 0.37, registrars k = 0.24, consultants k = 0.16, p-0.05). Discussion: The Geneva score was the least variable method and can be applied by junior or senior doctors. Using empirical judgement, junior doctors were more likely to agree on the pre-test probability of PE than were their more senior colleagues. This may imply that as physicians gain experience, they recognize that the diagnosis of PE can be difficult to assess and are reluctant to exclude it on clinical grounds. demonstrated that for any given Ventilation Perfusion (VQ) scan result, the higher the PTP of PE, the more likely it was that PE would be confirmed as being present. 6 Bayes theorem states that the probability of disease following the interpretation of a diagnostic investigation (the post-test probability) is determined by two factors. One, the probability of disease prior to carrying out the investigation (the pre-test probability) based on the information already available. Two, the characteristics of the test. For example, if in a patient with suspected PE, Address correspondence to Dr S. Iles, Canterbury Respiratory Research Group, Hagley Hostel, Christchurch Hospital, Private Bag 4710, New Zealand. e-mail: stephen.iles@chmeds.ac.nz ß Association of Physicians 2003

212 S. Iles et al. the PTP of PE is low, estimated using either of the three methods studied, and the D-dimer is not elevated, the post-test probability of PE is very low and PE is thus highly unlikely. Furthermore, subsequent investigation(s) are unlikely to change the management of this patient. The Geneva 7 and Wells 8 scores are clinical pretest probability scores derived from large trials that sought to determine the clinical signs and symptoms that reliably predict the diagnosis of PE. The variables used in the calculation of these two scores are presented in Tables 1 and 2, respectively. These scores may be used to define the PTP of PE as low, moderate, or high, 7,8 with the prevalence of PE increasing across the three groups. While similar in Table 1 Criterion Criteria for the calculation of the Geneva score Age 60 79 years 080 years q2 Previous DVT or PE Recent surgery (-4 weeks ago) Heart rate)100 bpm Points value q2 q3 PaCO 2-35 mmhg q2 35 39 mmhg PaO 2-49 mmhg q4 49 59 mmhg q3 60 71 mmhg q2 72 82 mmhg Chest X-ray Band atelectasis Elevation of hemidiaphragm Total score: -5 indicates a low probability of PE; 5 8 indicates a moderate probability of PE; )8 indicates a high probability of PE. Table 2 Criterion Criteria for the calculation of the Wells score Points value Clinical signs of DVT q3 Alternative diagnosis less probable than PE q3 Heart rate)100 bpm.5 Immobilization or surgery-4 weeks ago.5 Previous DVT or PE.5 Haemoptysis Cancer Total score -2 indicates a low probability of PE; 2 6 indicates a moderate probability of PE; )6 indicates a high probability of PE. concept, some criteria used to calculate the Wells 8 score (e.g. Alternative diagnosis less probable than PE ) are arguably more subjective than those used to calculate the Geneva 7 score. Although the determination of the PTP alone would never be enough to exclude or confirm PE, in combination with bedside tests such as D-dimer, they may help triage those who do not need further testing or imaging, conferring considerable resource savings. In order for such a strategy to be used in clinical practice, both the assessment of the PTP of PE and the measurement of D-dimer need to fulfil two criteria. Firstly, they must be accurate, and secondly, they must be reliable. One measure of reliability is inter-rater variability. The need to define the inter-rater variability of clinical scoring methods, such as the Geneva and the Wells scores, before they are applied by large numbers of clinicians, has been previously identified. 8 Also, it is often assumed that less experienced doctors are not as capable of determining the pretest probability of PE 9 We sought to determine whether the clinical experience of the assessing doctor affects the inter-rater variability of these clinical scores and empirical assessment. Methods Study setting and design This study was performed at Christchurch Hospital, a 600-bed tertiary institution serving a population of 375 000. The first thirty case scenarios were selected from a database of all consecutive patients investigated for PE by VQ scan or Spiral Computer Tomography (Spiral CT) between August 2000 and August 2001 that were able to have PE diagnosed or excluded by objective criteria. Cases with insufficient data to calculate the PTP scores were excluded. PE was judged to have been objectively diagnosed with a high probability VQ scan according to PIOPED criteria 6 (11 cases), and/or a vascular filling defect on Spiral CT (3 cases). PE was excluded by a normal VQ scan (14 cases). PE also was considered excluded by a normal CT scan accompanied by a normal bilateral leg ultrasound (1 case), or normal D-dimer (1 case), and absence of a diagnosed thromboembolic event over a minimum of 3 months (both cases). The prevalence of PE in this cohort was 47%. Of the 30, 16 were female and the mean"sd age was 58"15.0 years. Thirty case scenarios were constructed from the information contained in the medical record for that admission. Each case scenario consisted of a brief history, examination findings, results of basic initial

Diagnosing pulmonary embolism 213 investigations (including full blood count, biochemistry, and arterial blood gases), and included a copy of the formal chest X-ray report. If performed at admission, the electrocardiogram (28/30 cases) and chest X-ray films (23/30 cases) were available for inspection. Study participants We invited 20 consultant physicians, 31 medical registrars, and 33 medical house officers who participated in the acute medical rota to take part. We received positive replies from 12, 14, and 13 of them, respectively. The first ten from each medical staff grade who completed the questionnaire within the required timeframe were included. Format of questionnaire Participants were blinded to the results of the tests used to diagnose PE. After reviewing each case scenario, participants completed the Wells, and Geneva scores (Tables 1 and 2), and their empirical assessment of pre-test probability. The participants were unaware of what raw values for the respective scoring systems defined a patient s PTP of PE as low, medium or high, so as not to influence their empirical assessment unduly. They indicated their empirical assessment of the likelihood of PE, similar to that used in the PIOPED study, by placing a cross on a Visual Analogue scale (VAS), where 0 = PE is definitely not the cause of this patient s symptoms and 100 = PE is definitely the cause of this patient s symptoms. All participants reviewed all 30 cases, and listed their professional qualifications with the number of years since they graduated. Statistical analysis Raw scores from questionnaires were converted to low, medium, and high pre-test probability, according to the methods of the original authors (Tables 1 and 2). The distance from the point of origin (0) of the visual analogue scale was measured. Scores -20% were defined as low pre-test probability. Scores from 20 80% were defined as moderate pretest probability. Scores )80% were defined as high pre-test probability. 10 Inter-rater variability was assessed by calculating an extended form of the kappa (k) statistic to allow for more than two observers 11. Kappa statistic scores vary from q 1.0 (perfect agreement) to 0.0 (no agreement) and test the null hypothesis of no agreement between raters. The standard errors calculated for each kappa were pooled to enable statistical comparisons between kappas. 11 The negative predictive value (NPV) of a low pre-test probability score was calculated by aggregating the probability assessment for each clinician through the 30 cases and calculating the prevalence of PE in those cases that were labelled as low probability for each scoring method. The NPV of the PTP method was used (rather than positive predictive value) as it represents the original authors most useful finding, i.e. by indicating when a patient can safely have PE excluded by noninvasive means. The NPV between categories of doctor using each method of determining the PTP of PE was compared by one-way ANOVA, having confirmed the normality of these measures for parametric testing. The NPV between each of the methods used to determine the PTP of PE was compared by repeated measures ANOVA. Results Study doctors Ten house officers, ten registrars, and ten consultants participated. The number of years since graduation was used as a surrogate for clinical experience (Table 3). Inter-observer variability The Geneva PTP score was the most consistent (i.e. the least variable) of the three methods used to determine the PTP of PE (k for all experience groups: Geneva = 0.73, Wells = 0.38, empirical = 0.23). This finding was consistent across all three categories of doctors studied (p-0.001, Figure 1). The more junior the clinician, the more likely they were to agree with their peers as to the PTP of Table 3 Clinical experience and characteristics of study participants Characteristic House officers (n = 10) Registrars (n = 10) Consultants (n = 10) Years since graduation (mean"sem) 0.7"0.2 6.3"0.6 25"4 Male/female 3/7 6/4 8/2 Number with postgraduate qualifications 0 2 10

214 S. Iles et al. Figure 1. The inter-rater variability (kappa) of PTP scores with clinical experience. PE when using their empirical judgement (house officers k = 0.37, registrars k = 0.24, consultants k = 0.16, p-0.05). Negative predictive value The seniority of the assessing clinician did not effect the NPV of a low PTP of PE, regardless of the method used. Empirical assessment of the PTP of PE had a significantly better NPV than either the Wells score (p = 0.004) or the Geneva score (p = 0.001) (Table 4). When deciding that a patient had a low PTP of PE using their empirical judgement, as opposed to one of the two clinical scoring methods studied, clinicians were more conservative (Figure 2). Discussion The Geneva score was the most consistent of the three methods studied at determining the pre-test probability of PE (Geneva k = 0.73, Wells k = 0.38, Figure 2. The frequency (% of all scores) with which the case scenarios were calculated to have a low PTP of PE according to the method of calculation. empirical k = 0.23, p-0.001, Figure 1). The interrater variabilities of both the Geneva and the Wells scores were not affected by the experience of the assessing clinician. With increasing experience, clinicians were less likely to agree with their peers when assessing the pre-test probability of PE empirically. This suggests that as their level of experience increases, clinicians are more aware that the diagnosis of PE is difficult to confirm or refute clinically. Empirical assessment was the most accurate of the three methods studied, as determined by the NPV of a low PTP of PE. However, it was also the most conservative and least consistent method of assessing the PTP of PE. The advantage of using one of these clinical scoring methods for calculating the PTP of PE, compared to empirical assessment, is that they standardize the expression of clinical findings in an explicit manner. Theoretically therefore they should lead to fewer misunderstandings between colleagues when they report their findings to each other. Wells et al. have shown that a low PTP in combination with a negative SimpliRED D-dimer test has a NPV when excluding PE of 98%. 8 The advantage of using either the Wells or Geneva scores to assess the PTP of PE, as compared to empirical assessment, is that more patients would Table 4 The negative predictive value* of low probability scores according to the method used to calculate the PTP and clinical experience Method House officers Registrars Consultants Wells 109/143 (76%) 113/139 (81%) 107/135 (79%) Geneva 96/117 (82%) 97/125 (78%) 91/119 (76%) Empirical (VAS) 58/68 (85%) 62/75 (83%) 50/55 (91%) *Calculated as the number of low probability scores when PE was excluded, divided by the total number of low probability scores.

Diagnosing pulmonary embolism 215 be classified as having a low PTP of PE. As outlined, when combined with measurement of D-dimer, this would allow PE to be confidently excluded. Hence fewer patients would be subjected to the morbidity and mortality associated with investigations such as VQ scanning, Spiral CT, and pulmonary angiography. The current study has several limitations. Firstly, the Geneva and Wells scores were designed to assess the PTP of PE before the lung scan was performed. In this study the case scenarios were presented retrospectively as written case scenarios, as opposed to the patient being in front of the clinician. However, the scenarios were consecutive actual cases, and care was taken to reproduce all the clinical information that would have been available at the time. Cases were selected only when enough information from the medical records could be obtained, and when PE could be ruled in or out by objective criteria. We do not believe a prospective study with so many observers from different groups of clinical experience would have been possible. Because all participants were presented with the same clinical information, our measures of inter-rater variability are greater than would be observed if the study were repeated prospectively. In addition although it could not be avoided, and participants were not given the Wells and Geneva scoring systems, the questions in the scores were asked before they marked their empirical assessment and this may have influenced their judgement of the likelihood of PE. Finally, the NPV of a low PTP score is likely to be underestimated due to the high prevalence of PE (47%) in this cohort. Nonetheless, results were similar in another study that prospectively tested the Wells PTP score. 10 The prevalence of PE in the database from which these cases were drawn is 24%. This is similar to most published series, and we believe that the high prevalence of PE in this sub-group was not attributable to any systematic selection bias; also the kappa values would not be influenced by the prevalence of PE. We agreed with the original authors of the scores that the most clinically relevant use of the PTP scoring methods were in order to identify patients that could potentially have PE excluded by non-invasive means, such as in combination with a negative D-dimer. The positive predictive value of clinical scores in PE have never been sufficient to diagnose PE, and this is illustrated by only 28% and 19% of PE positive patients scoring high probability in the original studies of the Wells and Geneva scoring systems, respectively. 7,8 In order for PTP scores to be widely adopted they need to be both accurate and reliable. It follows that they should be simple to use and objective. The Alternative diagnosis less probable than PE used in calculating the Wells score is subjective, and this is reflected in the Wells score s high inter-rater variability. In conclusion, PTP scores in combination with non-invasive diagnostic methods may be useful in helping to exclude PE. Of the three methods studied, the Geneva score gave the most consistent results across participants. The inter-rater variability and accuracy of the Geneva PTP score was unaffected by clinical experience. From these results, the Geneva score would thus be the preferred method for both junior and senior doctors, in combination with non-invasive diagnostic methods, to exclude PE. References 1. Hildner FJ, Ormond RS. Accuracy of the clinical diagnosis of pulmonary embolism. JAMA 1967; 202:115 18. 2. Perrier A, Desmarais S, Miron MJ, et al. Noninvasive diagnosis of venous thromboembolism. Lancet 1999; 353:775 81. 3. Wells PS, Ginsberg JS, Anderson DR, et al. Use of a clinical model for safe management of patients with suspected pulmonary embolism. Ann Intern Med 1998; 129:997 1005. 4. Minati M, Prediletto R, Formichi B, et al. Accuracy of clinical assessment in the diagnosis of pulmonary embolism. Am J Resp Crit Care Med 1999; 159:864 71. 5. Perrier A, Miron M-J, Desmarais S, et al. Combining clinical evaluation and lung scan to rule out suspected pulmonary embolism. Arch Intern Med 2000; 160:512 16. 6. The PIOPED Investigators. Value of the ventilation-perfusion scan in acute pulmonary embolism. JAMA 1990; 263:2753 9. 7. Wicki J, Perneger TV, Junod AF, Bounameaux H, Perrier A. Assessing clinical probability of pulmonary embolism in the emergency ward: a simple score. Arch Intern Med 2001; 161:92 7. 8. Wells PS, Anderson DR, Rodger M, Ginsberg JS, Kearon C, Gent M, Turpie AGG, Bormanis J, Weitz J, Chamberlain M, Bowie D, Barnes D, Hirsh J. Derivation of a simple model to categorize patients probability of pulmonary embolism: Increasing the model s utility with the SimpliRED D-dimer. Thromb Haemost 2000; 83:416 20. 9. Kelly J, Hunt BJ. Role of D-dimers in diagnosis of venous thromboembolism. Lancet 2002; 359:456 7. 10. Sanson BJ, Lijmer JG, MacGillavry MR, Turkstra F, Prins MH, Buller HR. Comparison of a clinical probability estimate and two clinical models in patients with suspected pulmonary embolism. Thromb Haemost 2000; 83:199 203. 11. Fleiss J. Statistical methods for rates and proportion, 2nd edn. New York, Wiley, 1981:225 32.