QUANTIFYING THE IMPACT OF DIFFERENT APPROACHES FOR HANDLING CONTINUOUS PREDICTORS ON THE PERFORMANCE OF A PROGNOSTIC MODEL

Size: px
Start display at page:

Download "QUANTIFYING THE IMPACT OF DIFFERENT APPROACHES FOR HANDLING CONTINUOUS PREDICTORS ON THE PERFORMANCE OF A PROGNOSTIC MODEL"

Transcription

1 QUANTIFYING THE IMPACT OF DIFFERENT APPROACHES FOR HANDLING CONTINUOUS PREDICTORS ON THE PERFORMANCE OF A PROGNOSTIC MODEL Gary Collins, Emmanuel Ogundimu, Jonathan Cook, Yannick Le Manach, Doug Altman Centre for Statistics in Medicine University of Oxford 20-July-2016 gary.collins@csm.ox.ac.uk

2 Outline 2 Existing guidance What s done in practice? Brief overview of the study sample & simulation set-up Findings & Discussion

3 Basis of this presentation 3

4 Not a new idea 4

5 It s all in the title ( ) 5 1. Problems in dichotomizing continuous variables (Altman 1994) 2. Dangers of using "optimal" cutpoints in the evaluation of prognostic factors. (Altman et al 1994) 3. How bad is categorization? (Weinberg; 1995) 4. Seven reasons why you should NOT categorize continuous data (Dinero; 1996) 5. Breaking Up is Hard to Do: The Heartbreak of Dichotomizing Continuous Data (Streiner; 2002) 6. Negative consequences of dichotomizing continuous predictor variables (Irwin & McClelland; 2003) 7. Why carve up your continuous data? (Owen 2005) 8. Chopped liver? OK. Chopped data? Not OK. Chopped liver? OK. Chopped data? Not OK (Butts & Ng 2005) 9. Categorizing continuous variables resulted in different predictors in a prognostic model for nonspecific neck pain (Schellingerhout et al 2006)

6 It s all in the title ( ) 10.Dichotomizing continuous predictors in multiple regression: a bad idea (Royston et el 2006) 11. The cost of dichotomising continuous variables (Altman & Royston; 2006) 12.Leave 'em alone - why continuous variables should be analyzed as such (van Walraven & Hart; 2008) 13.Dichotomization of continuous data--a pitfall in prognostic factor studies (Metze; 2008) 14. Analysis by categorizing or dichotomizing continuous variables is inadvisable: an example from the natural history of unruptured aneurysms (Naggara et al 2011) 15.Against quantiles: categorization of continuous variables in epidemiologic research, and its discontents (Bennette & Vickers; 2012) 16.Dichotomizing continuous variables in statistical analysis: a practice to avoid (Dawson & Weiss; 2012) 17. The danger of dichotomizing continuous variables: A visualization (Kuss 2013) 18. The anathema of arbitrary categorization of continuous predictors (Vintzileos et al; 2014) 19. Ophthalmic statistics note: the perils of dichotomising continuous variables (Cumberland et al 2014) 6

7 Biologically implausible Prognostic factor (PF) Convoluted Reasoning and PF not present (low risk) PF present (high risk) Anti-intellectual Pomposity C.R.A.P (Norman & Streiner; A B Biostatistics: the Bare Cut-point C Essentials, 2008) Slide adapted from Michael Babyak ( Modeling with Observational Data )

8 Still, what happens in practice? 8 Breast cancer models (Altman 2009) Categorised some/all - 34/53 (64%) Diabetes models (Collins et al 2011) Categorised some/all 21/43 (49%) General medical journals (Bouwmeester et al 2012) Categorised 30/64 (47%) Dichotomised 21/64 (21%) Cancer models (Mallett et al 2010) All categorised/dichotomised 24/47 (51%)

9 Aim of the study 9 Investigate the impact of different approaches for handling continuous predictors on the apparent performance (same data) validation performance (different data; geographical validation) Investigate the influence of sample size on the approach for handling continuous predictors

10 Sample characteristics (THIN) 10 80,800 CVD events 4688 CVD events 7721 hip fractures 565 hip fractures

11 Models 11 Cox models to predict 10-year risk of CVD (men & women) 10-year risk of hip fracture (women only) CVD model contained 7 predictors Age, sex, family history, cholesterol, SBP, BMI, hypertension Hip fracture model contained 5 predictors Age, BMI, Townsend score, asthma, antidepressants

12 Resampling strategy 12 MODEL DEVELOPMENT To ensure the number of events in each sample was fixed at 25, 50, 100, and 2000 events Sample were drawn from those with and without the event (separately) 200 samples randomly drawn (with replacement) MODEL VALIDATION All available data were used CVD: n=110,934 (4688 CVD events) Hip fracture: n=61,563 (565 hip fractures)

13 Approaches considered 13 Dichotomised at the Median predictor value optimal cut-point based on the logrank test Categorised into 3 groups (using tertile predictor values) 4 groups (using quartile predictor values) 5 groups (using quintile predictor values) 5-year age categories 10-year age categories Linear relationship Nonlinear relationship fractional polynomials (FP2; 4 degrees of freedom per predictor) restricted cubic splines (3 knots)

14 Performance measures calculated Calibration Calibration plot Harrell s val.surv function; hazard regression with linear splines 14 Discrimination Harrell s c-index Clinical utility Decision curve analysis (Vickers & Elkin 2006) Net benefit; weighted difference between true positives and false positives D-statistic; Brier Score; R-squared also examined Not reported here - but in the supplementary material of Collins et al Stat Med 2016.

15 Net benefit (recap) 15 p t is the probability threshold to denote high risk Used to weight the FP and FN results TP and FP calculated using Kaplan-Meier estimates of the percentage surviving at 10 years among those with predicted risks greater than p t Bottom line: model with highest NB wins

16 Age & CVD 16

17 Total serum cholesterol & CVD 17

18 Age, cholesterol, BMI, SBP & CVD 18

19 Age, BMI & Hip fracture 19

20 RESULTS: CVD 25 events 20

21 RESULTS: CVD 50 events 21

22 RESULTS: CVD 100 events 22

23 RESULTS: CVD 2000 events 23

24 RESULTS: Hip fracture 25 events 24

25 RESULTS: Hip fracture 50 events 25

26 RESULTS: Hip fracture 100 events 26

27 RESULTS: Hip fracture 2000 events 27

28 RESULTS: Discrimination CVD 28 At small sample sizes (25 events) Large difference in between apparent performance and validation performance for optimal dichotomisation 0.84 (apparent); 0.72 (validation) Smaller differences observed for FP/RCS/Linear 0.84 (apparent); 0.78 (validation) Observed difference between dichotomisation (at the median) and linear/fp/rcs Apparent performance: difference of 0.05 Validation performance: difference of 0.05 Observed over all 4 sample sizes examined Negligible differences between linear/fp/rcs

29 RESULTS: Discrimination Hip Fracture 29 At small sample sizes (25 events) Large difference in between apparent performance and validation performance for optimal dichotomisation 0.86 (apparent); 0.76 (validation) FP/RCS/Linear 0.90 (apparent); 0.87 (validation) Observed difference between dichotomisation (at the median) and linear/fp/rcs Apparent performance: difference of 0.1 Validation performance: difference of 0.1 Observed over all 4 sample sizes examined Negligible differences between linear/fp/rcs

30 RESULTS: Discrimination Hip Fracture 30

31 RESULTS: Decision Curve Analysis (CVD only) [higher NB better model] 31 FP/RCS dichotomisation

32 RESULTS: Net cases found per

33 Conclusions 33 Systematic reviews show dichotomising / categorising continuous predictors routinely done when developing a prediction model Dichotomising, either at the median or optimal predictor value leads to models with substantially poorer performance Poor discrimination; poor calibration; poor clinical utility Large discrepancies between apparent performance and validation performance observed for optimal split dichotomising The impact of dichotomising continuous predictors are handled is more pronounced at smaller sample sizes

Supplementary appendix

Supplementary appendix Supplementary appendix This appendix formed part of the original submission and has been peer reviewed. We post it as supplied by the authors. Supplement to: Callegaro D, Miceli R, Bonvalot S, et al. Development

More information

Four simple ways to increase power without increasing the sample size

Four simple ways to increase power without increasing the sample size Four simple ways to increase power without increasing the sample size Stanley E Lazic Quantitative Biology, Discovery Sciences, AstraZeneca, IMED Biotech Unit, Cambridge, CB4 0WG, UK stan.lazic@cantab.net

More information

Overview of Multivariable Prediction Modelling. Methodological Conduct & Reporting: Introducing TRIPOD guidelines

Overview of Multivariable Prediction Modelling. Methodological Conduct & Reporting: Introducing TRIPOD guidelines Overview of Multivariable Prediction Modelling Methodological Conduct & Reporting: Introducing TRIPOD guidelines Gary Collins Centre for Statistics in Medicine www.csm-oxford.org.uk University of Oxford

More information

AN INDEPENDENT VALIDATION OF QRISK ON THE THIN DATABASE

AN INDEPENDENT VALIDATION OF QRISK ON THE THIN DATABASE AN INDEPENDENT VALIDATION OF QRISK ON THE THIN DATABASE Dr Gary S. Collins Professor Douglas G. Altman Centre for Statistics in Medicine University of Oxford TABLE OF CONTENTS LIST OF TABLES... 3 LIST

More information

Net Reclassification Risk: a graph to clarify the potential prognostic utility of new markers

Net Reclassification Risk: a graph to clarify the potential prognostic utility of new markers Net Reclassification Risk: a graph to clarify the potential prognostic utility of new markers Ewout Steyerberg Professor of Medical Decision Making Dept of Public Health, Erasmus MC Birmingham July, 2013

More information

Survival Prediction Models for Estimating the Benefit of Post-Operative Radiation Therapy for Gallbladder Cancer and Lung Cancer

Survival Prediction Models for Estimating the Benefit of Post-Operative Radiation Therapy for Gallbladder Cancer and Lung Cancer Survival Prediction Models for Estimating the Benefit of Post-Operative Radiation Therapy for Gallbladder Cancer and Lung Cancer Jayashree Kalpathy-Cramer PhD 1, William Hersh, MD 1, Jong Song Kim, PhD

More information

Validation of QFracture. Analysis prepared for NICE 2011

Validation of QFracture. Analysis prepared for NICE 2011 Validation of QFracture compared with FRAX Analysis prepared for NICE 2011 Authors: Julia Hippisley-Cox & Carol Coupland Email: Julia.hippisley-cox@nottingham.ac.uk Julia Hipisley-Cox, University of Nottingham,

More information

Summary of main challenges and future directions

Summary of main challenges and future directions Summary of main challenges and future directions Martin Schumacher Institute of Medical Biometry and Medical Informatics, University Medical Center, Freiburg Workshop October 2008 - F1 Outline Some historical

More information

the standard deviation (SD) is a measure of how much dispersion exists from the mean SD = square root (variance)

the standard deviation (SD) is a measure of how much dispersion exists from the mean SD = square root (variance) Normal distribution The normal distribution is also known as the Gaussian distribution or 'bell-shaped' distribution. It describes the spread of many biological and clinical measurements Properties of

More information

Lifetime vs. 10-year risk to allocate treatments for the primary prevention of cardiovascular disease

Lifetime vs. 10-year risk to allocate treatments for the primary prevention of cardiovascular disease Lifetime vs. 10-year risk to allocate treatments for the primary prevention of cardiovascular disease Eleni Rapsomaniki Farr Institute for Health Informatics Research, Epidemiology & Public Health University

More information

That's a nice elliptical plot, for which the the "ordinary" Pearson r correlation is.609.

That's a nice elliptical plot, for which the the ordinary Pearson r correlation is.609. Dichotomization: How bad is it? Thomas R. Knapp 2013 Introduction I love percentages (Knapp, 2010). I love them so much I'm tempted to turn every statistical problem into an analysis of percentages. But

More information

It s hard to predict!

It s hard to predict! Statistical Methods for Prediction Steven Goodman, MD, PhD With thanks to: Ciprian M. Crainiceanu Associate Professor Department of Biostatistics JHSPH 1 It s hard to predict! People with no future: Marilyn

More information

Development and validation of QDiabetes-2018 risk prediction algorithm to estimate future risk of type 2 diabetes: cohort study

Development and validation of QDiabetes-2018 risk prediction algorithm to estimate future risk of type 2 diabetes: cohort study Development and validation of QDiabetes-2018 risk prediction algorithm to estimate future risk of type 2 diabetes: cohort study Julia Hippisley-Cox, 1,2 Carol Coupland 1 1 Division of Primary Care, University

More information

Genetic risk prediction for CHD: will we ever get there or are we already there?

Genetic risk prediction for CHD: will we ever get there or are we already there? Genetic risk prediction for CHD: will we ever get there or are we already there? Themistocles (Tim) Assimes, MD PhD Assistant Professor of Medicine Stanford University School of Medicine WHI Investigators

More information

Review. Imagine the following table being obtained as a random. Decision Test Diseased Not Diseased Positive TP FP Negative FN TN

Review. Imagine the following table being obtained as a random. Decision Test Diseased Not Diseased Positive TP FP Negative FN TN Outline 1. Review sensitivity and specificity 2. Define an ROC curve 3. Define AUC 4. Non-parametric tests for whether or not the test is informative 5. Introduce the binormal ROC model 6. Discuss non-parametric

More information

Template 1 for summarising studies addressing prognostic questions

Template 1 for summarising studies addressing prognostic questions Template 1 for summarising studies addressing prognostic questions Instructions to fill the table: When no element can be added under one or more heading, include the mention: O Not applicable when an

More information

Reporting and Methods in Clinical Prediction Research: A Systematic Review

Reporting and Methods in Clinical Prediction Research: A Systematic Review Reporting and Methods in Clinical Prediction Research: A Systematic Review Walter Bouwmeester 1., Nicolaas P. A. Zuithoff 1., Susan Mallett 2, Mirjam I. Geerlings 1, Yvonne Vergouwe 1,3, Ewout W. Steyerberg

More information

Supplement for: CD4 cell dynamics in untreated HIV-1 infection: overall rates, and effects of age, viral load, gender and calendar time.

Supplement for: CD4 cell dynamics in untreated HIV-1 infection: overall rates, and effects of age, viral load, gender and calendar time. Supplement for: CD4 cell dynamics in untreated HIV-1 infection: overall rates, and effects of age, viral load, gender and calendar time. Anne Cori* 1, Michael Pickles* 1, Ard van Sighem 2, Luuk Gras 2,

More information

Discontinuation and restarting in patients on statin treatment: prospective open cohort study using a primary care database

Discontinuation and restarting in patients on statin treatment: prospective open cohort study using a primary care database open access Discontinuation and restarting in patients on statin treatment: prospective open cohort study using a primary care database Yana Vinogradova, 1 Carol Coupland, 1 Peter Brindle, 2,3 Julia Hippisley-Cox

More information

Part [1.0] Introduction to Development and Evaluation of Dynamic Predictions

Part [1.0] Introduction to Development and Evaluation of Dynamic Predictions Part [1.0] Introduction to Development and Evaluation of Dynamic Predictions A Bansal & PJ Heagerty Department of Biostatistics University of Washington 1 Biomarkers The Instructor(s) Patrick Heagerty

More information

Identifying patients with undetected pancreatic cancer in primary care:

Identifying patients with undetected pancreatic cancer in primary care: Research Gary S Collins and Douglas G Altman Identifying patients with undetected pancreatic cancer in primary care: an independent and external validation of QCancer (Pancreas) Abstract Background Despite

More information

Supplemental Material

Supplemental Material Supplemental Material Supplemental Results The baseline patient characteristics for all subgroups analyzed are shown in Table S1. Tables S2-S6 demonstrate the association between ECG metrics and cardiovascular

More information

Development and validation of a multivariable prediction model for all-cause cancer incidence based on health behaviours in the population setting

Development and validation of a multivariable prediction model for all-cause cancer incidence based on health behaviours in the population setting Development and validation of a multivariable prediction model for all-cause cancer incidence based on health behaviours in the population setting Courtney Maskerine A thesis submitted to the Faculty of

More information

Outline of Part III. SISCR 2016, Module 7, Part III. SISCR Module 7 Part III: Comparing Two Risk Models

Outline of Part III. SISCR 2016, Module 7, Part III. SISCR Module 7 Part III: Comparing Two Risk Models SISCR Module 7 Part III: Comparing Two Risk Models Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington Outline of Part III 1. How to compare two risk models 2.

More information

Analyzing diastolic and systolic blood pressure individually or jointly?

Analyzing diastolic and systolic blood pressure individually or jointly? Analyzing diastolic and systolic blood pressure individually or jointly? Chenglin Ye a, Gary Foster a, Lisa Dolovich b, Lehana Thabane a,c a. Department of Clinical Epidemiology and Biostatistics, McMaster

More information

Evidence Based Medicine

Evidence Based Medicine Course Goals Goals 1. Understand basic concepts of evidence based medicine (EBM) and how EBM facilitates optimal patient care. 2. Develop a basic understanding of how clinical research studies are designed

More information

OUTCOMES OF DICHOTOMIZING A CONTINUOUS VARIABLE IN THE PSYCHIATRIC EARLY READMISSION PREDICTION MODEL. Ng CG

OUTCOMES OF DICHOTOMIZING A CONTINUOUS VARIABLE IN THE PSYCHIATRIC EARLY READMISSION PREDICTION MODEL. Ng CG ORIGINAL PAPER OUTCOMES OF DICHOTOMIZING A CONTINUOUS VARIABLE IN THE PSYCHIATRIC EARLY READMISSION PREDICTION MODEL Ng CG Department of Psychological Medicine, Faculty of Medicine, University Malaya,

More information

Supplementary Material. Serum Chloride is an Independent Predictor of Mortality in Hypertensive Patients

Supplementary Material. Serum Chloride is an Independent Predictor of Mortality in Hypertensive Patients Supplementary Material Serum Chloride is an Independent Predictor of Mortality in Hypertensive Patients Linsay McCallum,* Panniyammakal Jeemon,* Claire E Hastie, Rajan K Patel, Catherine Williamson, Adyani

More information

Lecture Outline Biost 517 Applied Biostatistics I. Statistical Goals of Studies Role of Statistical Inference

Lecture Outline Biost 517 Applied Biostatistics I. Statistical Goals of Studies Role of Statistical Inference Lecture Outline Biost 517 Applied Biostatistics I Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Statistical Inference Role of Statistical Inference Hierarchy of Experimental

More information

Observed survival (%) time women men

Observed survival (%) time women men Manuscript ID BMJ.2016.035185 entitled "Development and validation of risk prediction equations to estimate survival in patients with colorectal cancer: cohort study." Thank you for sending us your paper.

More information

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1 From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Contents Dedication... iii Acknowledgments... xi About This Book... xiii About the Author... xvii Chapter 1: Introduction...

More information

RESEARCH. Predicting risk of osteoporotic fracture in men and women in England and Wales: prospective derivation and validation of QFractureScores

RESEARCH. Predicting risk of osteoporotic fracture in men and women in England and Wales: prospective derivation and validation of QFractureScores Predicting risk of osteoporotic fracture in men and women in England and Wales: prospective derivation and validation of QFractureScores Julia Hippisley-Cox, professor of clinical epidemiology and general

More information

Development, validation and application of risk prediction models

Development, validation and application of risk prediction models Development, validation and application of risk prediction models G. Colditz, E. Liu, M. Olsen, & others (Ying Liu, TA) 3/28/2012 Risk Prediction Models 1 Goals Through examples, class discussion, and

More information

Central pressures and prediction of cardiovascular events in erectile dysfunction patients

Central pressures and prediction of cardiovascular events in erectile dysfunction patients Central pressures and prediction of cardiovascular events in erectile dysfunction patients N. Ioakeimidis, K. Rokkas, A. Angelis, Z. Kratiras, M. Abdelrasoul, C. Georgakopoulos, D. Terentes-Printzios,

More information

The index of prediction accuracy: an intuitive measure useful for evaluating risk prediction models

The index of prediction accuracy: an intuitive measure useful for evaluating risk prediction models Kattan and Gerds Diagnostic and Prognostic Research (2018) 2:7 https://doi.org/10.1186/s41512-018-0029-2 Diagnostic and Prognostic Research METHODOLOGY Open Access The index of prediction accuracy: an

More information

QStatin Update Information

QStatin Update Information QStatin -2014 Update Information Revision History Revision date Document Version Summary of Changes 19.08.2014 V1.0 First issue. Contents 1 Purpose of document... 3 2 Rationale for regular updates of QStatin...

More information

Statistical Analysis of Biomarker Data

Statistical Analysis of Biomarker Data Statistical Analysis of Biomarker Data Gary M. Clark, Ph.D. Vice President Biostatistics & Data Management Array BioPharma Inc. Boulder, CO NCIC Clinical Trials Group New Investigator Clinical Trials Course

More information

DISEASE-SPECIFIC REFERENCE EQUATIONS FOR LUNG FUNCTION IN PATIENTS WITH CYSTIC FIBROSIS

DISEASE-SPECIFIC REFERENCE EQUATIONS FOR LUNG FUNCTION IN PATIENTS WITH CYSTIC FIBROSIS DISEASE-SPECIFIC REFERENCE EQUATIONS FOR LUNG FUNCTION IN PATIENTS WITH CYSTIC FIBROSIS Michal Kulich, PhD, Margaret Rosenfeld, MD, MPH, Jonathan Campbell, MS, Richard Kronmal, PhD, Ron L. Gibson, MD,

More information

RESEARCH. Predicting risk of type 2 diabetes in England and Wales: prospective derivation and validation of QDScore

RESEARCH. Predicting risk of type 2 diabetes in England and Wales: prospective derivation and validation of QDScore Predicting risk of type 2 diabetes in England and Wales: prospective derivation and validation of QDScore Julia Hippisley-Cox, professor of clinical epidemiology and general practice, 1 Carol Coupland,

More information

Journal: Nature Methods

Journal: Nature Methods Journal: Nature Methods Article Title: Network-based stratification of tumor mutations Corresponding Author: Trey Ideker Supplementary Item Supplementary Figure 1 Supplementary Figure 2 Supplementary Figure

More information

Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers

Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers Tutorial in Biostatistics Received 21 November 2012, Accepted 17 July 2013 Published online 23 August 2013 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.5941 Graphical assessment of

More information

Assessment of performance and decision curve analysis

Assessment of performance and decision curve analysis Assessment of performance and decision curve analysis Ewout Steyerberg, Andrew Vickers Dept of Public Health, Erasmus MC, Rotterdam, the Netherlands Dept of Epidemiology and Biostatistics, Memorial Sloan-Kettering

More information

SISCR Module 4 Part III: Comparing Two Risk Models. Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington

SISCR Module 4 Part III: Comparing Two Risk Models. Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington SISCR Module 4 Part III: Comparing Two Risk Models Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington Outline of Part III 1. How to compare two risk models 2.

More information

Module Overview. What is a Marker? Part 1 Overview

Module Overview. What is a Marker? Part 1 Overview SISCR Module 7 Part I: Introduction Basic Concepts for Binary Classification Tools and Continuous Biomarkers Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington

More information

Issues in popular designs for observational studies

Issues in popular designs for observational studies Issues in popular designs for observational studies Doug Altman University of Oxford on behalf of STRATOS TG5: Study Design CSM celebration, 29 March 2017 TG5 Members Chairs: Mitchell Gail, Suzanne Cadarette

More information

Computer Models for Medical Diagnosis and Prognostication

Computer Models for Medical Diagnosis and Prognostication Computer Models for Medical Diagnosis and Prognostication Lucila Ohno-Machado, MD, PhD Division of Biomedical Informatics Clinical pattern recognition and predictive models Evaluation of binary classifiers

More information

Abstract: Heart failure research suggests that multiple biomarkers could be combined

Abstract: Heart failure research suggests that multiple biomarkers could be combined Title: Development and evaluation of multi-marker risk scores for clinical prognosis Authors: Benjamin French, Paramita Saha-Chaudhuri, Bonnie Ky, Thomas P Cappola, Patrick J Heagerty Benjamin French Department

More information

Using dynamic prediction to inform the optimal intervention time for an abdominal aortic aneurysm screening programme

Using dynamic prediction to inform the optimal intervention time for an abdominal aortic aneurysm screening programme Using dynamic prediction to inform the optimal intervention time for an abdominal aortic aneurysm screening programme Michael Sweeting Cardiovascular Epidemiology Unit, University of Cambridge Friday 15th

More information

Summarising and validating test accuracy results across multiple studies for use in clinical practice

Summarising and validating test accuracy results across multiple studies for use in clinical practice Summarising and validating test accuracy results across multiple studies for use in clinical practice Richard D. Riley Professor of Biostatistics Research Institute for Primary Care & Health Sciences Thank

More information

Peter C. Austin Institute for Clinical Evaluative Sciences and University of Toronto

Peter C. Austin Institute for Clinical Evaluative Sciences and University of Toronto Multivariate Behavioral Research, 46:119 151, 2011 Copyright Taylor & Francis Group, LLC ISSN: 0027-3171 print/1532-7906 online DOI: 10.1080/00273171.2011.540480 A Tutorial and Case Study in Propensity

More information

VU Biostatistics and Experimental Design PLA.216

VU Biostatistics and Experimental Design PLA.216 VU Biostatistics and Experimental Design PLA.216 Julia Feichtinger Postdoctoral Researcher Institute of Computational Biotechnology Graz University of Technology Outline for Today About this course Background

More information

Statistical modelling for thoracic surgery using a nomogram based on logistic regression

Statistical modelling for thoracic surgery using a nomogram based on logistic regression Statistics Corner Statistical modelling for thoracic surgery using a nomogram based on logistic regression Run-Zhong Liu 1, Ze-Rui Zhao 2, Calvin S. H. Ng 2 1 Department of Medical Statistics and Epidemiology,

More information

Biostatistics II

Biostatistics II Biostatistics II 514-5509 Course Description: Modern multivariable statistical analysis based on the concept of generalized linear models. Includes linear, logistic, and Poisson regression, survival analysis,

More information

Statistics, Probability and Diagnostic Medicine

Statistics, Probability and Diagnostic Medicine Statistics, Probability and Diagnostic Medicine Jennifer Le-Rademacher, PhD Sponsored by the Clinical and Translational Science Institute (CTSI) and the Department of Population Health / Division of Biostatistics

More information

Discrimination and Reclassification in Statistics and Study Design AACC/ASN 30 th Beckman Conference

Discrimination and Reclassification in Statistics and Study Design AACC/ASN 30 th Beckman Conference Discrimination and Reclassification in Statistics and Study Design AACC/ASN 30 th Beckman Conference Michael J. Pencina, PhD Duke Clinical Research Institute Duke University Department of Biostatistics

More information

UK Liver Transplant Audit

UK Liver Transplant Audit November 2012 UK Liver Transplant Audit In patients who received a Liver Transplant between 1 st March 1994 and 31 st March 2012 ANNUAL REPORT Advisory Group for National Specialised Services Prepared

More information

STATISTICS IN CLINICAL AND TRANSLATIONAL RESEARCH

STATISTICS IN CLINICAL AND TRANSLATIONAL RESEARCH 09/07/11 1 Overview and Descriptive Statistics a. Application of statistics in biomedical research b. Type of data c. Graphic representation of data d. Summary statistics: central tendency and dispersion

More information

Question 1(25= )

Question 1(25= ) MSG500 Final 20-0-2 Examiner: Rebecka Jörnsten, 060-49949 Remember: To pass this course you also have to hand in a final project to the examiner. Open book, open notes but no calculators or computers allowed.

More information

Comparison of prognostic nomograms based on different nodal staging systems in patients with resected gastric cancer

Comparison of prognostic nomograms based on different nodal staging systems in patients with resected gastric cancer 950 Ivyspring International Publisher Research Paper Journal of Cancer 2017; 8(6): 950-958. doi: 10.7150/jca.17370 Comparison of prognostic nomograms based on different nodal staging systems in patients

More information

research methods & reporting

research methods & reporting Prognosis and prognostic research: Developing a prognostic model research methods & reporting Patrick Royston, 1 Karel G M Moons, 2 Douglas G Altman, 3 Yvonne Vergouwe 2 In the second article in their

More information

Statistics: A Brief Overview Part I. Katherine Shaver, M.S. Biostatistician Carilion Clinic

Statistics: A Brief Overview Part I. Katherine Shaver, M.S. Biostatistician Carilion Clinic Statistics: A Brief Overview Part I Katherine Shaver, M.S. Biostatistician Carilion Clinic Statistics: A Brief Overview Course Objectives Upon completion of the course, you will be able to: Distinguish

More information

BEST PRACTICES FOR IMPLEMENTATION AND ANALYSIS OF PAIN SCALE PATIENT REPORTED OUTCOMES IN CLINICAL TRIALS

BEST PRACTICES FOR IMPLEMENTATION AND ANALYSIS OF PAIN SCALE PATIENT REPORTED OUTCOMES IN CLINICAL TRIALS BEST PRACTICES FOR IMPLEMENTATION AND ANALYSIS OF PAIN SCALE PATIENT REPORTED OUTCOMES IN CLINICAL TRIALS Nan Shao, Ph.D. Director, Biostatistics Premier Research Group, Limited and Mark Jaros, Ph.D. Senior

More information

The Potential of Genes and Other Markers to Inform about Risk

The Potential of Genes and Other Markers to Inform about Risk Research Article The Potential of Genes and Other Markers to Inform about Risk Cancer Epidemiology, Biomarkers & Prevention Margaret S. Pepe 1,2, Jessie W. Gu 1,2, and Daryl E. Morris 1,2 Abstract Background:

More information

Individual Participant Data (IPD) Meta-analysis of prediction modelling studies

Individual Participant Data (IPD) Meta-analysis of prediction modelling studies Individual Participant Data (IPD) Meta-analysis of prediction modelling studies Thomas Debray, PhD Julius Center for Health Sciences and Primary Care Utrecht, The Netherlands March 7, 2016 Prediction

More information

Supplementary Online Content

Supplementary Online Content Supplementary Online Content Schnabel RB, Aspelund T, Li G, et al. Validation of an atrial fibrillation risk algorithm in whites and African Americans. Arch Intern Med. 2010;170(21):1909-1917. eappendix.

More information

Heart Rate in Patients with Coronary Artery Disease - the Lower the Better? An Analysis from the Treating to New Targets (TNT) trial

Heart Rate in Patients with Coronary Artery Disease - the Lower the Better? An Analysis from the Treating to New Targets (TNT) trial Heart Rate in Patients with Coronary Artery Disease - the Lower the Better? An Analysis from the Treating to New Targets (TNT) trial Sripal Bangalore, MD, MHA, Chuan-Chuan Wun, PhD, David A DeMicco, PharmD,

More information

Systematic reviews of prognostic studies 3 meta-analytical approaches in systematic reviews of prognostic studies

Systematic reviews of prognostic studies 3 meta-analytical approaches in systematic reviews of prognostic studies Systematic reviews of prognostic studies 3 meta-analytical approaches in systematic reviews of prognostic studies Thomas PA Debray, Karel GM Moons for the Cochrane Prognosis Review Methods Group Conflict

More information

Title:Prediction of poor outcomes six months following total knee arthroplasty in patients awaiting surgery

Title:Prediction of poor outcomes six months following total knee arthroplasty in patients awaiting surgery Author's response to reviews Title:Prediction of poor outcomes six months following total knee arthroplasty in patients awaiting surgery Authors: Eugen Lungu (eugen.lungu@umontreal.ca) François Desmeules

More information

Health-related quality of life assessed before and during chemotherapy predicts for survival in multiple myeloma

Health-related quality of life assessed before and during chemotherapy predicts for survival in multiple myeloma British Journal of Haematology, 1997, 97, 29 37 Health-related quality of life assessed before and during chemotherapy predicts for survival in multiple myeloma FINN WISLØFF 1 AND MARTIN HJORTH 2 for the

More information

Supplemental data to article. Association of ESA hypo-responsiveness and haemoglobin variability with. mortality in hemodialysis patients

Supplemental data to article. Association of ESA hypo-responsiveness and haemoglobin variability with. mortality in hemodialysis patients Supplemental data to article Association of ESA hypo-responsiveness and haemoglobin variability with mortality in hemodialysis patients A. Kainz, B. Mayer, R. Kramar, R. Oberbauer Table of Contents Hb

More information

Linear Regression in SAS

Linear Regression in SAS 1 Suppose we wish to examine factors that predict patient s hemoglobin levels. Simulated data for six patients is used throughout this tutorial. data hgb_data; input id age race $ bmi hgb; cards; 21 25

More information

The effect of systemic exposure to efavirenz, sex and age on the risk of virological nonsuppression in HIV-infected African children.

The effect of systemic exposure to efavirenz, sex and age on the risk of virological nonsuppression in HIV-infected African children. The effect of systemic exposure to efavirenz, sex and age on the risk of virological nonsuppression in HIV-infected African children. Andrzej Bienczak Paolo Denti, Adrian Cook, Veronica Mulenga, Cissy

More information

Patient characteristics of training and validation set. Patient selection and inclusion overview can be found in Supp Data 9. Training set (103)

Patient characteristics of training and validation set. Patient selection and inclusion overview can be found in Supp Data 9. Training set (103) Roepman P, et al. An immune response enriched 72-gene prognostic profile for early stage Non-Small- Supplementary Data 1. Patient characteristics of training and validation set. Patient selection and inclusion

More information

Sample size considerations for the external validation of a multivariable prognostic model: a resampling study

Sample size considerations for the external validation of a multivariable prognostic model: a resampling study Research Article Received: 10 March 2015, Accepted: 12 October 2015 Published online 9 November 2015 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.6787 Sample size considerations for

More information

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis

Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis EFSA/EBTC Colloquium, 25 October 2017 Recent developments for combining evidence within evidence streams: bias-adjusted meta-analysis Julian Higgins University of Bristol 1 Introduction to concepts Standard

More information

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers

SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers SISCR Module 7 Part I: Introduction Basic Concepts for Binary Biomarkers (Classifiers) and Continuous Biomarkers Kathleen Kerr, Ph.D. Associate Professor Department of Biostatistics University of Washington

More information

Supplementary Appendix

Supplementary Appendix Supplementary Appendix This appendix has been provided by the authors to give readers additional information about their work. Supplement to: Serra AL, Poster D, Kistler AD, et al. Sirolimus and kidney

More information

Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets

Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets Application of Artificial Neural Network-Based Survival Analysis on Two Breast Cancer Datasets Chih-Lin Chi a, W. Nick Street b, William H. Wolberg c a Health Informatics Program, University of Iowa b

More information

Estimating interaction on an additive scale between continuous determinants in a logistic regression model

Estimating interaction on an additive scale between continuous determinants in a logistic regression model Int. J. Epidemiol. Advance Access published August 27, 2007 Published by Oxford University Press on behalf of the International Epidemiological Association ß The Author 2007; all rights reserved. International

More information

Stratified Cost-Effectiveness Analysis (with implications for sub-group analysis) (and applications to value-based pricing)

Stratified Cost-Effectiveness Analysis (with implications for sub-group analysis) (and applications to value-based pricing) Stratified Cost-Effectiveness Analysis (with implications for sub-group analysis) (and applications to value-based pricing) Andrew H Briggs William R Lindsay Chair of Health Economics Stratified CEA: Overview

More information

Selection Criteria for Lung-Cancer Screening

Selection Criteria for Lung-Cancer Screening T h e n e w e ngl a nd j o u r na l o f m e dic i n e original article Selection Criteria for Lung-Cancer Screening Martin C. Tammemägi, Ph.D., Hormuzd A. Katki, Ph.D., William G. Hocking, M.D., Timothy

More information

Diagnostic methods 2: receiver operating characteristic (ROC) curves

Diagnostic methods 2: receiver operating characteristic (ROC) curves abc of epidemiology http://www.kidney-international.org & 29 International Society of Nephrology Diagnostic methods 2: receiver operating characteristic (ROC) curves Giovanni Tripepi 1, Kitty J. Jager

More information

Systematic reviews of prediction modeling studies: planning, critical appraisal and data collection

Systematic reviews of prediction modeling studies: planning, critical appraisal and data collection Systematic reviews of prediction modeling studies: planning, critical appraisal and data collection Karel GM Moons, Lotty Hooft, Hans Reitsma, Thomas Debray Dutch Cochrane Center Julius Center for Health

More information

RISK PREDICTION MODEL: PENALIZED REGRESSIONS

RISK PREDICTION MODEL: PENALIZED REGRESSIONS RISK PREDICTION MODEL: PENALIZED REGRESSIONS Inspired from: How to develop a more accurate risk prediction model when there are few events Menelaos Pavlou, Gareth Ambler, Shaun R Seaman, Oliver Guttmann,

More information

M. J. Rutherford 1,*, T. M-L. Andersson 2, H. Møller 3, P.C. Lambert 1,2.

M. J. Rutherford 1,*, T. M-L. Andersson 2, H. Møller 3, P.C. Lambert 1,2. Understanding the impact of socioeconomic differences in breast cancer survival in England and Wales: avoidable deaths and potential gain in expectation of life. M. J. Rutherford 1,*, T. M-L. Andersson

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH PROPENSITY SCORE Confounding Definition: A situation in which the effect or association between an exposure (a predictor or risk factor) and

More information

APPENDIX D REFERENCE AND PREDICTIVE VALUES FOR PEAK EXPIRATORY FLOW RATE (PEFR)

APPENDIX D REFERENCE AND PREDICTIVE VALUES FOR PEAK EXPIRATORY FLOW RATE (PEFR) APPENDIX D REFERENCE AND PREDICTIVE VALUES FOR PEAK EXPIRATORY FLOW RATE (PEFR) Lung function is related to physical characteristics such as age and height. In order to assess the Peak Expiratory Flow

More information

BREAST CANCER EPIDEMIOLOGY MODEL:

BREAST CANCER EPIDEMIOLOGY MODEL: BREAST CANCER EPIDEMIOLOGY MODEL: Calibrating Simulations via Optimization Michael C. Ferris, Geng Deng, Dennis G. Fryback, Vipat Kuruchittham University of Wisconsin 1 University of Wisconsin Breast Cancer

More information

2017 American Medical Association. All rights reserved.

2017 American Medical Association. All rights reserved. Supplementary Online Content Borocas DA, Alvarez J, Resnick MJ, et al. Association between radiation therapy, surgery, or observation for localized prostate cancer and patient-reported outcomes after 3

More information

Inferential Statistics

Inferential Statistics Inferential Statistics and t - tests ScWk 242 Session 9 Slides Inferential Statistics Ø Inferential statistics are used to test hypotheses about the relationship between the independent and the dependent

More information

Supplementary Online Content

Supplementary Online Content Supplementary Online Content Ebbing M, Bønaa KH, Nygård O, et al. Cancer incidence and mortality after treatment with folic acid and vitamin B 1. JAMA. 9;3(19):119-1. etable 1. Circulating Levels of B

More information

AP Statistics Practice Test Ch. 3 and Previous

AP Statistics Practice Test Ch. 3 and Previous AP Statistics Practice Test Ch. 3 and Previous Name Date Use the following to answer questions 1 and 2: A researcher measures the height (in feet) and volume of usable lumber (in cubic feet) of 32 cherry

More information

Individualized Treatment Effects Using a Non-parametric Bayesian Approach

Individualized Treatment Effects Using a Non-parametric Bayesian Approach Individualized Treatment Effects Using a Non-parametric Bayesian Approach Ravi Varadhan Nicholas C. Henderson Division of Biostatistics & Bioinformatics Department of Oncology Johns Hopkins University

More information

Supplementary Online Content

Supplementary Online Content 1 Supplementary Online Content 2 3 4 5 Hay AD, Little P, Harnden A, et al. Effect of oral prednisolone on symptom duration in nonasthmatic adults with acute lower respiratory tract infection: a randomized

More information

RESEARCH. An independent external validation and evaluation of QRISK cardiovascular risk prediction: a prospective open cohort study

RESEARCH. An independent external validation and evaluation of QRISK cardiovascular risk prediction: a prospective open cohort study An independent external validation and evaluation of cardiovascular risk prediction: a prospective open cohort study Gary S Collins, medical statistician, 1 Douglas G Altman, professor of statistics in

More information

ORIGINAL RESEARCH & CONTRIBUTIONS

ORIGINAL RESEARCH & CONTRIBUTIONS Association of Age to Mortality and Repeat Revascularization in End-Stage Renal Disease Patients: Implications for Clinicians and Future Health Policies Ashok Krishnaswami, MD, MAS; Thomas Alloggiamento,

More information

RESEARCH. An independent and external validation of QRISK2 cardiovascular disease risk score: a prospective open cohort study

RESEARCH. An independent and external validation of QRISK2 cardiovascular disease risk score: a prospective open cohort study Centre for Statistics in Medicine, Wolfson College Annexe, University of Oxford, Oxford OX2 6UD Correspondence to: G Collins gary.collins@csm.ox.ac.uk Cite this as: BMJ 2010;340:c2442 doi:10.1136/bmj.c2442

More information

Roadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23:

Roadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23: Roadmap for Developing and Validating Therapeutically Relevant Genomic Classifiers. Richard Simon, J Clin Oncol 23:7332-7341 Presented by Deming Mi 7/25/2006 Major reasons for few prognostic factors to

More information

From single studies to an EBM based assessment some central issues

From single studies to an EBM based assessment some central issues From single studies to an EBM based assessment some central issues Doug Altman Centre for Statistics in Medicine, Oxford, UK Prognosis Prognosis commonly relates to the probability or risk of an individual

More information