Determinants of quality: Factors that lower or increase the quality of evidence

Similar documents
ACR OA Guideline Development Process Knee and Hip

Introduzione al metodo GRADE

GRADE. Grading of Recommendations Assessment, Development and Evaluation. British Association of Dermatologists April 2014

Copyright GRADE ING THE QUALITY OF EVIDENCE AND STRENGTH OF RECOMMENDATIONS NANCY SANTESSO, RD, PHD

GRADE. Grading of Recommendations Assessment, Development and Evaluation. British Association of Dermatologists April 2018

SOF: Summary of Findings tables. Negrar, 7 aprile 2018

Why is ILCOR moving to GRADE?

Problem solving therapy

What is the Cochrane Collaboration? What is a systematic review?

JBI GRADE Research School Workshop. Presented by JBI Adelaide GRADE Centre Staff

GRADE Evidence Profiles on Long- and Rapid-Acting Insulin Analogues for the treatment of Diabetes Mellitus [DRAFT] October 2007

Critical Appraisal of a Meta-Analysis: Rosiglitazone and CV Death. Debra Moy Faculty of Pharmacy University of Toronto

How to Conduct a Meta-Analysis

Does the evidence directly answer my question?

Cochrane Pregnancy and Childbirth Group Methodological Guidelines

Controlled Trials. Spyros Kitsiou, PhD

NeuRA Sleep disturbance April 2016

GRADE, Summary of Findings and ConQual Workshop

Results. NeuRA Hypnosis June 2016

Traumatic brain injury

Results. NeuRA Treatments for dual diagnosis August 2016

Results. NeuRA Mindfulness and acceptance therapies August 2018

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis

Deep vein thrombosis and its prevention in critically ill adults Attia J, Ray J G, Cook D J, Douketis J, Ginsberg J S, Geerts W H

The comparison or control group may be allocated a placebo intervention, an alternative real intervention or no intervention at all.

Safinamide (Addendum to Commission A15-18) 1

PICC or peripheral intravenous catheter?

Critical Appraisal Practicum. Fabio Di Bello Medical Implementation Manager

Meta Analysis. David R Urbach MD MSc Outcomes Research Course December 4, 2014

Developing a research question The research question

8.0 Parenteral Nutrition vs. Standard care May 2015

Distraction techniques

Results. NeuRA Worldwide incidence April 2016

Results. NeuRA Herbal medicines August 2016

Robert M. Jacobson, M.D. Department of Pediatric and Adolescent Medicine Mayo Clinic, Rochester, Minnesota

Meta-Analysis. Zifei Liu. Biological and Agricultural Engineering

The role of Randomized Controlled Trials

Animal-assisted therapy

Q11: WHAT IS THE CURRENT STANDARD FIRST LINE TREATMENT FOR METASTATIC INOPERABLE COLORECTAL CANCER?

Results. NeuRA Treatments for internalised stigma December 2017

Assessing quality of evidence The GRADE approach

Systematic Review & Course outline. Lecture (20%) Class discussion & tutorial (30%)

Method. NeuRA Paliperidone August 2016

(A) (B) 2019 American Diabetes Association. Published online at

Washington, DC, November 9, 2009 Institute of Medicine

Systematic reviews and meta-analyses of observational studies (MOOSE): Checklist.

NeuRA Obsessive-compulsive disorders October 2017

EBM: Therapy. Thunyarat Anothaisintawee, M.D., Ph.D. Department of Family Medicine, Ramathibodi Hospital, Mahidol University

Influence of blinding on treatment effect size estimate in randomized controlled trials of oral health interventions

Results. NeuRA Forensic settings April 2016

Michiel H.F. Poorthuis*, Robin W.M. Vernooij*, R. Jeroen A. van Moorselaar and Theo M. de Reijke

Checklist for appraisal of study relevance (child sex offenses)

Online Annexes (2-4)

Evidence-Based Medicine and Publication Bias Desmond Thompson Merck & Co.

ARCHE Risk of Bias (ROB) Guidelines

Dose: Metoclopramide -0.1 mg/kg (max 10 mg) IV, over 15 minutes

Results. NeuRA Family relationships May 2017

Clinical research in AKI Timing of initiation of dialysis in AKI

Checklist for Randomized Controlled Trials. The Joanna Briggs Institute Critical Appraisal tools for use in JBI Systematic Reviews

Revman Plots: Glucose child up to 2 yrs Distress Acute

5.3 Strategies to Optimize Delivery and Minimize Risks of EN: Small Bowel Feeding vs. Gastric February 2014

DVT PROPHYLAXIS IN HOSPITALIZED MEDICAL PATIENTS SAURABH MAJI SR (PULMONARY,MEDICINE)

C2 Training: August 2010

MINI SYMPOSIUM - EUMASS - UEMASS European Union of Medicine in Assurance and Social Security

Workshop The GRADE Approach to Grading Quality of Evidence and Strength of Recommendations

NeuRA Decision making April 2016

GATE CAT Intervention RCT/Cohort Studies

Assessing risk of bias

Learning objectives. Examining the reliability of published research findings

Surveillance report Published: 6 April 2016 nice.org.uk. NICE All rights reserved.

Introduction to systematic reviews/metaanalysis

5.1 Strategies to Optimize Delivery and Minimize Risks of EN: Feeding Protocols March 2013

Transcranial Direct-Current Stimulation

Evidence Based Medicine

Method. NeuRA Biofeedback May 2016

Addressing multiple treatments II: multiple-treatments meta-analysis basic methods

Post-traumatic stress disorder

Evaluating the results of a Systematic Review/Meta- Analysis

Evaluate the efficacy of LMWH compared to UFH in patients with ESRD receiving outpatient, chronic, intermittent hemodialysis.

Evidence profile. Physical Activity. Background on the scoping question. Population/Intervention/Comparison/Outcome (PICO)

CEU screening programme: Overview of common errors & good practice in Cochrane intervention reviews

Results. NeuRA Essential fatty acids August 2016

Module 5. The Epidemiological Basis of Randomised Controlled Trials. Landon Myer School of Public Health & Family Medicine, University of Cape Town

Appendix Document A1: Search strategy for Medline (1960 November 2015)

Systematic reviews: From evidence to recommendation. Marcel Dijkers, PhD, FACRM Icahn School of Medicine at Mount Sinai

How to do a meta-analysis. Orestis Efthimiou Dpt. Of Hygiene and Epidemiology, School of Medicine University of Ioannina, Greece

Attention deficit hyperactivity disorder: diagnosis and management (update)

Alcohol interventions in secondary and further education

Pulmonary rehabilitation following exacerbations of chronic obstructive pulmonary disease(review)

Online Annexes (5-8)

Results. NeuRA Motor dysfunction April 2016

INTERNAL VALIDITY, BIAS AND CONFOUNDING

Less is more: Guidelines

Types of Data. Systematic Reviews: Data Synthesis Professor Jodie Dodd 4/12/2014. Acknowledgements: Emily Bain Australasian Cochrane Centre

11.1 Supplemental Antioxidant Nutrients: Combined Vitamins and Trace Elements April 2013

GRADE Tables and Summary of Findings for the recommendations of Rehabilitation in health systems

Outcomes and GRADE Summary of Findings Tables: old and new

TITLE: Acetylsalicylic Acid for Venous Thromboembolism Prophylaxis: A Review of Clinical Evidence, Benefits and Harms

Evidence profile. Brief Structured Psychological treatment.doc. Background on the scoping question

Objectives. Information proliferation. Guidelines: Evidence or Expert opinion or???? 21/01/2017. Evidence-based clinical decisions

Transcription:

Determinants of quality: Factors that lower or increase the quality of evidence GRADE Workshop CBO, NHG and Dutch Cochrane Centre CBO, April 17th, 2013 Outline The GRADE approach: step by step Factors that lower the quality of evidence Factors that increase the quality of evidence Examples + discussion For each factor: what information do you need and where do you find it 2 1

Step 1a Specify research question Patients Intervention Comparison Outcomes 3 Step 1b Specify outcomes for research question Importance of end points Mortality Myocardial infarction Fractures Pain Flatulence 9 8 7 6 5 4 3 2 1 Critical for decision making Important but not critical for decision making Not important for decision making of lower importance for patients 4 2

Step 2 Summarize all relevant evidence By outcome, across studies Use available systematic review(s) Or Asses quality of the review (AMSTAR) Needed: information on individual studies (e.g. risk of bias assessment) Develop systematic review Cochrane Handbook for Systematic Reviews of Interventions 5 Summary of Findings Table (Cochrane Review) Patients or population: Anyone taking a long flight (lasting more than 6 hours) Settings: International air travel Intervention: Compression stockings 1 Comparison: Without stockings Outcomes Illustrative comparative risks* (95% CI) Assumed risk Corresponding risk Relative effect (95% CI) Number of participants (studies) Quality of the evidence (GRADE) Comments Without stockings With stockings (95% CI) Symptomatic deep vein thrombosis (DVT) See comment See comment Not estimable 2821 (9 studies) See comment 0 participants developed symptomatic DVT in these studies. Symptomatic deep vein thrombosis surrogate, symptomless deep vein thrombosis Low risk population 2 15 per 1000 High risk population 2 1.5 per 1000 (0 to 3) RR 0.10 (0.04 to 0.26) 2637 (9 studies) Moderate 3 All of the events in the illustrative comparative risks column were asymptomatic and thus of questionable importance 25 per 1000 2.5 per 1000 (1 to 8) Superficial vein thrombosis 13 per 1000 6 per 1000 (2 to 15) RR 0.45 (0.18 to 1.13) 1804 (8 studies) Moderate 4 Oedema Post-flight values measured on a scale from 0, no oedema, to 10, maximum oedema. The mean oedema score ranged across control groups from 6 to 9. The mean oedema score in the intervention groups was on average 4.7 lower (95% CI -4.9 to -4.5). 1246 (6 studies) Low 5 6 Pulmonary embolus See comment See comment Not estimable 2821 See comment 0 participants developed 3

Step 3 Rate the evidence Across studies, by outcome RCTs: high quality evidence Observational studies: low quality evidence 5 factors can lower the quality 3 factors can increase the quality Use footnotes to explain your decisions on down/upgrading 7 Quality of evidence: 4 levels High Very confident that the true effect lies close to the estimate of the effect. Moderate Moderately confident in the effect estimate: true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different. Low Limited confidence in the effect estimate: true effect may be substantially different from the estimate of effect. Very low Very little confidence in the effect estimate: true effect is likely to be substantially different from the estimate of effect. 8 4

Levels of quality of evidence Underlying methodology RCTs Double-upgraded observational studies Downgraded RCTs Upgraded observational studies Double-downgraded RCTs Observational studies Triple-downgraded RCTs Downgraded observational studies Case series/reports Quality rating High Moderate Low Very low 9 Factors that can lower the quality of evidence 1. Study limitations 2. Inconsistency of results 3. Indirectness of evidence 4. Imprecision 5. Publication bias If factor is present: downgrade level of evidence by 1 ( serious ) or 2 levels ( very serious ) 10 5

Downgrading: Study limitations (RCTs) Inadequate randomization Lack of allocation concealment Lack of blinding (patient / clinician / outcome assessor) Loss to follow-up / selective follow-up Intention to treat principle violated Failure to report outcomes 11 Risk of bias summary 12 6

Risk of bias Across studies Interpretation Considerations GRADE assessment Low Most information is from studies at low risk of bias. Plausible bias unlikely to seriously alter the results. No apparent limitations. No serious limitations; do not downgrade Unclear Most information is from studies at low or unclear risk of bias. Plausible bias that raises some doubt about the results. Potential limitations are unlikely to lower confidence in the estimate of effect. No serious limitations; do not downgrade Potential limitations are likely to lower confidence in the estimate of effect. Serious limitations; downgrade one level High The proportion of information from studies at high risk of bias is sufficient to affect the interpretation of results. Plausible bias that seriously weakens confidence in the results. Crucial limitation for one criterion, or some limitations for multiple criteria, sufficient to lower confidence in the estimate of effect. Serious limitations; downgrade one level Crucial limitation for one or more criteria sufficient to substantially lower confidence in the estimate of effect. Very serious limitations; downgrade two levels 13 Another example (fictitious) 1 2 3 Concealment of allocation Adequate Adequate Adequate Blinding of outcome assessment Adequate Unclear (NR) Adequate Description of dropouts (%) Adequate (4) Adequate (50) Adequate (12) Analysis by ITT No Yes Yes Selective reporting of events No Yes No 14 7

Downgrading: inconsistency Explanations for inconsistency: Clinical heterogeneity Populations different effect in sicker populations Interventions larger effect with higher doses Outcomes diminishing effect with time Methodological heterogeneity (differences study design) 15 Inconsistency Unexplained heterogeneity Results of meta-analysis (forest plot) Eye-ball test: overlap of confidence intervals? I 2 statistic (>60% = substantial) 16 8

17 17 18 18 9

Study or Subgroup 1.1.1 Gamma nail Adams 2001 Hoffman 1996 Kukla 1997 O'Brien 1995 Ovesen 2006 Utrilla 2005 Subtotal (95% CI) Short femoral nail Sliding hip screw Mean Difference Mean Difference Mean SD Total Mean SD Total Weight IV, Random, 95% CI IV, Random, 95% CI 55.4 56.7 47.1 59 65 46 20 17 20.8 23.9 29 11 203 31 60 53 73 104 524 61.3 54.3 53.4 47 51 44 22.2 16.4 8.3 13.3 22 15 197 36 60 49 73 106 521 Heterogeneity: Tau² = 47.36; Chi² = 34.80, df = 5 (P < 0.00001); I² = 86% Test for overall effect: Z = 0.80 (P = 0.42) 11.9% 9.9% 11.2% 10.2% 9.7% 12.1% 65.0% -5.90 [-10.04, -1.76] 2.40 [-5.63, 10.43] -6.30 [-11.97, -0.63] 12.00 [4.57, 19.43] 14.00 [5.65, 22.35] 2.00 [-1.55, 5.55] 2.48 [-3.60, 8.56] 1.1.2 Intramedullary hip screw (IMHS) Baumgaertner 1998 Hardy 1998 Harrington 2002 Subtotal (95% CI) 72 71 108 33 28.9 26.8 67 50 50 167 80 57 88 35 24.8 27.5 68 50 52 170 Heterogeneity: Tau² = 175.30; Chi² = 13.45, df = 2 (P = 0.001); I² = 85% Test for overall effect: Z = 1.06 (P = 0.29) 8.0% 8.5% 8.5% 25.1% -8.00 [-19.47, 3.47] 14.00 [3.44, 24.56] 20.00 [9.46, 30.54] 8.81 [-7.43, 25.05] 1.1.3 Proximal femoral nail (PFN) Saudan 2002 Subtotal (95% CI) 64 33 Heterogeneity: Not applicable Test for overall effect: Z = 0.24 (P = 0.81) 100 100 65 26 106 106 9.9% 9.9% -1.00 [-9.14, 7.14] -1.00 [-9.14, 7.14] 1.1.4 Targon PF nail Subtotal (95% CI) Heterogeneity: Not applicable Test for overall effect: Not applicable 0 0 Not estimable Total (95% CI) 791 797 Heterogeneity: Tau² = 58.02; Chi² = 56.04, df = 9 (P < 0.00001); I² = 84% Test for overall effect: Z = 1.41 (P = 0.16) 100.0% 3.83 [-1.51, 9.17] -10-5 0 5 10 Favours nail Favours 19 19 SHS Fluoroquinolone prophylaxis in neutropenia: infection-related mortality 20 10

Downgrading: Indirectness of evidence Evidence comes from different research question Indirect comparison: drug A drug B A placebo and B placebo Population: oseltamivir prophylaxis for avian flu seasonal influenza Comparator: new drug flexible doses of haloperidol fixed doses of haloperidol Outcome (next slide) 21 Indirectness: surrogate outcomes Condition Diabetes mellitus Dementia Osteoporosis Chronic respiratory disease Cardiovascular disease/risk Patient important outcomes Symptoms, hospital admission, complications Patient function, behaviour, caregiver burden Fractures Quality of life, exacerbations, mortality Vascular events, mortality Surrogate outcome Blood glucose, HbA1c Bone density Pulmonary function, exercise capacity Serum lipids 22 11

Downgrading: Imprecision Basic idea: few patients and few events => wide confidence intervals => imprecise results SRs versus Guidelines Optimal information size (OIS): the number of patients required for adequate statistical power 23 M ethods/concepts 24 12

Imprecision For dichotomous outcomes consider downgrading if: 1. Sample size lower than optimal information size and/or number of events <300 2. OIS is met, but 95% confidence interval includes both no effect and appreciable benefit or harm* * Relative Risk Reduction (RRR) >0.25% (RR<0.75 of >1.25) 25 Table 1: Optimal information size implications from Figure 5 Total Number of Events Relative Risk Reduction Implications for meeting OIS threshold 100 or less < 30% Will almost never meet threshold whatever control event rate 200 30% Will meet threshold for control event rates for ~ 25% or greater 200 25% Will meet threshold for control event rates for ~ 50% or greater 200 20% Will meet threshold only for control event rates for ~ 80% or greater 300 > 30% Will meet threshold 300 25% Will meet threshold for control event rates ~ 25% or greater 300 20% Will meet threshold for control event rates ~ 60% or greater 400 or more > 25% Will meet threshold for any control event rate 400 or more 20% Will meet threshold for control event rates of ~ 40% or 26 greater 13

Imprecision For continuous outcomes consider downgrading if: 1. Sample size lower than optimal information size and/or population size <400 2. OIS is met, but 95% confidence interval includes no effect and one of the confidence limits crosses the minimal important difference (benefit or harm)* If SMD or effect size (ES) is calculated in the metaanalysis, consider downgrading if the upper or lower confidence limit crosses an ES of 0.5 in either direction. * If the MID is not known, you may calculate an MID by using the rule of thumb that MID is typically 0.5 standard deviations (Norman et al., 2003). 27 Example MID A meta-analysis found a pooled mean difference of -2.58 (-5.14 to -0.02) on a scale of 0 to 100 The pooled SD of the control groups was 17.2. The MID could be calculated as 0.5 X 17.2 = 8.6 M The confidence intervals around the pooled mean difference do not include 8.6 (despite the fact that there is an effect). Resulting point estimate is precise and the evidence is not downgraded. 28 14

M ethods/concepts 29 Figure 1, Rating down for imprecision in guidelines: Thresholds are key Mortality estimate and confidence interval Threshold if side effects, toxicity and cost minimal, NNT = 200. Entire confidence interval to left of threshold, do not rate down for imprecision Threshold if side effects, toxicity and Cost appreciable, NNT = 100. Confidence interval crosses threshold, rate down for imprecision 2.0 0.5 0 0.5 Favors Intervention Risk difference (%) Favors Control 30 15

Downgrading: Publication bias Evidence is incomplete Systematic under- or overestimate of the effect due to selective publication of studies Investigators fail to report studies (typically those that show no effect) Suspicion: evidence is limited to small number of trials, all funded by industry 31 How to identify? Funnel plot Publication bias 32 16

Publication bias How to identify? Funnel plot Trial registers Abstract books Experts 33 Upgrading: magnitude of effect Large effect (in the absence of plausible confounders in studies with no major threats to validity) Large: RR >2 or RR <0.5 (upgrade 1 level) Very large: RR >5 or RR <0.2 (upgrade 2 levels) Example Bicycle helmets to prevent head injuries in cyclists: OR 0.31, 95% CI 0.26 to 0.37 (Thompson 2000) 34 17

Upgrading: dose-response gradient Example Rofecoxib and cardiovascular events (McGettigan 2006): RR 1.33 (95% CI 1.00 to 1.79) for doses 25 mg/day RR 2.19 (95% CI 1.64 to 2.91) for doses >25 mg/day 35 Upgrading: residual bias All plausible biases may be working to underestimate an apparent intervention effect E.g. Sicker patients overrepresented in intervention group, but the intervention seems more effective than control group Observational studies have failed to demonstrate an association, but all plausible biases would have increased an intervention effect E.g. Exploration of apparent harmful effects Only valid studies can be upgraded! 36 18

Do s and don ts Do First, consider each factor separately Consider all factors together to make a balanced judgment (quality of evidence is a continuum) Do not Double downgrade for the same problem (e.g. inconsistency and imprecision can be related) 37 Footnotes: examples 1 Unclear concealment in one of five trials did not lead to downgrading the quality of evidence 2 The studies used different LMWHs but indirectness is not likely given the similarity in results across studies 3 The 95% CI includes both negligible effect and appreciable benefit or appreciable harm 4 Out of 5 included, only 2 reported DVT. We assumed that this was based in selective reporting of outcomes. The authors of the study did not provide further information. 38 19

Overall quality of evidence Consider the outcomes that are critical for decision making Overall quality of evidence = lowest rating of the critical outcomes Some exceptions 39 Using existing SRs: what information do you need? Review question = your research question Clear description of the in/exclusion criteria Characteristics of the included studies Detailed risk of bias assessment + reporting Estimate of effect: pooled result: results of the meta-analysis, forest plot with I 2, subgroup analyses no meta-analysis: effect estimates of the individual studies Assessment/discussion on publication bias 40 20

How to make a GRADE profile? 41 21