Sarwar Islam Mozumder 1, Mark J Rutherford 1 & Paul C Lambert 1, Stata London Users Group Meeting

Similar documents
Up-to-date survival estimates from prognostic models using temporal recalibration

Overview. All-cause mortality for males with colon cancer and Finnish population. Relative survival

Multi-state survival analysis in Stata

Estimating and Modelling the Proportion Cured of Disease in Population Based Cancer Studies

Multivariate dose-response meta-analysis: an update on glst

4. STATA output of the analysis

Quantifying cancer patient survival; extensions and applications of cure models and life expectancy estimation

Estimating and modelling relative survival

An overview and some recent advances in statistical methods for population-based cancer survival analysis

Notes for laboratory session 2

Regression Output: Table 5 (Random Effects OLS) Random-effects GLS regression Number of obs = 1806 Group variable (i): subject Number of groups = 70

1. Objective: analyzing CD4 counts data using GEE marginal model and random effects model. Demonstrate the analysis using SAS and STATA.

Immortal Time Bias and Issues in Critical Care. Dave Glidden May 24, 2007

Prediction and Inference under Competing Risks in High Dimension - An EHR Demonstration Project for Prostate Cancer

About me. An overview and some recent advances in statistical methods for population-based cancer survival analysis. Research interests

ECON Introductory Econometrics Seminar 7

Age (continuous) Gender (0=Male, 1=Female) SES (1=Low, 2=Medium, 3=High) Prior Victimization (0= Not Victimized, 1=Victimized)

Modelling prognostic capabilities of tumor size: application to colorectal cancer

Statistical Methods for the Evaluation of Treatment Effectiveness in the Presence of Competing Risks

Assessment of lead-time bias in estimates of relative survival for breast cancer

Least likely observations in regression models for categorical outcomes

M. J. Rutherford 1,*, T. M-L. Andersson 2, H. Møller 3, P.C. Lambert 1,2.

Meta-analysis using individual participant data: one-stage and two-stage approaches, and why they may differ

Multiple Linear Regression Analysis

Biostatistics II

Editor Executive Editor Associate Editors Copyright Statement:

Issues for analyzing competing-risks data with missing or misclassification in causes. Dr. Ronny Westerman

Simulation shows undesirable results for competing risks analysis with time-dependent covariates for clinical outcomes

Joint Modelling of Event Counts and Survival Times: Example Using Data from the MESS Trial

merlin: Mixed effects regression for linear, non-linear and user-defined models

Dynamic prediction using joint models for recurrent and terminal events: Evolution after a breast cancer

Modeling unobserved heterogeneity in Stata

Application of EM Algorithm to Mixture Cure Model for Grouped Relative Survival Data

Sociology Exam 3 Answer Key [Draft] May 9, 201 3

Assessing methods for dealing with treatment switching in clinical trials: A follow-up simulation study

competing risks; event-specific hazard; incidence rate; model misspecification; proportional hazards; survival analysis

This tutorial presentation is prepared by. Mohammad Ehsanul Karim

Workplace Project Portfolio C (WPP C) Laura Rodwell

An application of the new irt command in Stata

Multivariable Cox regression. Day 3: multivariable Cox regression. Presentation of results. The statistical methods section

G , G , G MHRN

Self-assessment test of prerequisite knowledge for Biostatistics III in R

Supplement for: CD4 cell dynamics in untreated HIV-1 infection: overall rates, and effects of age, viral load, gender and calendar time.

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers

Statistical Models for Censored Point Processes with Cure Rates

ethnicity recording in primary care

Controlling for time-dependent confounding using marginal structural models

Supplementary Appendix

Recent Advances in Methods for Quantiles. Matteo Bottai, Sc.D.

Supplementary Information for Avoidable deaths and random variation in patients survival

Survival analysis beyond the Cox-model

m 11 m.1 > m 12 m.2 risk for smokers risk for nonsmokers

Heritability. The Extended Liability-Threshold Model. Polygenic model for continuous trait. Polygenic model for continuous trait.

Survival Prediction Models for Estimating the Benefit of Post-Operative Radiation Therapy for Gallbladder Cancer and Lung Cancer

2. Scientific question: Determine whether there is a difference between boys and girls with respect to the distance and its change over time.

Received: 14 April 2016, Accepted: 28 October 2016 Published online 1 December 2016 in Wiley Online Library

MEASURING CANCER CURE IN IRELAND

Projection of power and events in clinical trials with a time-to-event outcome

Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data

ASSESSMENT OF LEAD-TIME BIAS IN ESTIMATES OF RELATIVE SURVIVAL FOR BREAST CANCER

LOGO. Statistical Modeling of Breast and Lung Cancers. Cancer Research Team. Department of Mathematics and Statistics University of South Florida

Implementing double-robust estimators of causal effects

ESM1 for Glucose, blood pressure and cholesterol levels and their relationships to clinical outcomes in type 2 diabetes: a retrospective cohort study

A novel approach to estimation of the time to biomarker threshold: Applications to HIV

TWISTED SURVIVAL: IDENTIFYING SURROGATE ENDPOINTS FOR MORTALITY USING QTWIST AND CONDITIONAL DISEASE FREE SURVIVAL. Beth A.

Generalized Mixed Linear Models Practical 2

For any unreported outcomes, umeta sets the outcome and its variance at 0 and 1E12, respectively.

Cross-validated AUC in Stata: CVAUROC

An overview and some recent advances in statistical methods for population-based cancer survival analysis

Case Studies in Bayesian Augmented Control Design. Nathan Enas Ji Lin Eli Lilly and Company

Age-standardised Net Survival & Cohort and Period Net Survival Estimates

Evaluation of AJCC, UICC, and Brigham and Women's Hospital Tumor Staging for Cutaneous Squamous Cell Carcinoma

Cross-validation. Miguel Angel Luque Fernandez Faculty of Epidemiology and Population Health Department of Non-communicable Diseases.

Syntax Menu Description Options Remarks and examples Stored results Methods and formulas References Also see

Biological Cure. Margaret R. Stedman, Ph.D. MPH Angela B. Mariotto, Ph.D.

40th anniversary for Section of Biostatistics and Two day meeting in Danish Society of Theoretical Statistics

Using dynamic prediction to inform the optimal intervention time for an abdominal aortic aneurysm screening programme

Statistical Science Issues in HIV Vaccine Trials: Part I

Landmarking, immortal time bias and. Dynamic prediction

SCHOOL OF MATHEMATICS AND STATISTICS

Analysis of Rheumatoid Arthritis Data using Logistic Regression and Penalized Approach

Demystifying causal inference in randomised trials. Lecture 3: Introduction to mediation and mediation analysis using instrumental variables

A Study on Type 2 Diabetes Mellitus Patients Using Regression Model and Survival Analysis Techniques

The improved cure fraction for esophageal cancer in Linzhou city

Final Exam - section 2. Thursday, December hours, 30 minutes

Downloaded from jhs.mazums.ac.ir at 13: on Friday July 27th 2018 [ DOI: /acadpub.jhs ]

Parvovirus B19 in five European countries: disentangling the sociological and microbiological mechanisms underlying infectious disease transmission

cloglog link function to transform the (population) hazard probability into a continuous

Application of the Multiplicative-Additive Model in the Bone Marrow Transplantation Survival Data Including Competing Risks

Linear Regression in SAS

The Stata Journal. Editor Nicholas J. Cox Geography Department Durham University South Road Durham City DH1 3LE UK

MODEL SELECTION STRATEGIES. Tony Panzarella

Estimation and prediction in multistate models

Regression Methods in Biostatistics: Linear, Logistic, Survival, and Repeated Measures Models, 2nd Ed.

validscale: A Stata module to validate subjective measurement scales using Classical Test Theory

Investigation of relative survival from colorectal cancer between NHS organisations

The SAS SUBTYPE Macro

Testing the treatment effect on competing causes of death in oncology clinical trials

Matched Cohort designs.

Estimating relative survival using the strel command

Transcription:

2016 Stata London Users Group Meeting stpm2cr: A Stata module for direct likelihood inference on the cause-specific cumulative incidence function within the flexible parametric modelling framework Sarwar Islam Mozumder 1, Mark J Rutherford 1 & Paul C Lambert 1, 2 1 Department of Health Sciences, University of Leicester, Leicester, UK 2 Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden

Multi-state Model Death from Cancer, k = 1 Alive at Diagnosis Death from Cause k = 2 Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 2/22

Multi-state Model Death from Cancer, k = 1 Alive at Diagnosis Death from Cause k = 2 Death from Cause k = K Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 2/22

The Cumulative Incidence Function (CIF) Cause-specific CIF, F k (t) F k (t) = P(T t, D = k) Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 3/22

The Cumulative Incidence Function (CIF) Cause-specific CIF, F k (t) The probability that a patient will die from cause D = k by time t whilst also being at risk from dying of other causes We obtain this by either, 1 Estimating using all cause-specific hazard functions, or 2 Transforming using a direct relationship with the subdistribution hazard functions Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 3/22

The Cumulative Incidence Function (CIF) Cause-specific CIF, F k (t) The probability that a patient will die from cause D = k by time t whilst also being at risk from dying of other causes We obtain this by either, 1 Estimating using all cause-specific hazard functions, or 2 Transforming using a direct relationship with the subdistribution hazard functions Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 3/22

Approach (1) Cause-specific Hazards, h cs k (t) hk cs P(t < T t + t, D = k T > t) (t) = lim t 0 t Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 4/22

Approach (1) Cause-specific Hazards, h cs k (t) Instantaneous conditional rate of mortality from cause D = k given that the patient is still alive at time t Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 4/22

Approach (1) Cause-specific Hazards, h cs k (t) Instantaneous conditional rate of mortality from cause D = k given that the patient is still alive at time t Estimating Cause-specific CIF using CSH F k (t) = t 0 S(u)h cs k (u)du Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 4/22

Approach (1) Cause-specific Hazards, h cs k (t) Instantaneous conditional rate of mortality from cause D = k given that the patient is still alive at time t Estimating Cause-specific CIF using CSH t s K F k (t) = exp hj cs (u)du hk cs (s)ds 0 0 j=1 Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 4/22

Approach (2) Subdistribution Hazards, h sd k (t) hk sd(t) = lim P(t < T t + t, D = k T > t (T t cause k) t 0 t Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 5/22

Approach (2) Subdistribution Hazards, h sd k (t) The instantaneous rate of failure at time t from cause D = k amongst those who have not died, or have died from any of the other causes, where D k Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 5/22

Approach (2) Subdistribution Hazards, h sd k (t) The instantaneous rate of failure at time t from cause D = k amongst those who have not died, or have died from any of the other causes, where D k Direct Transformation of the Cause-specific CIF [ t ] F k (t) = 1 exp hk sd (u)du 0 Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 5/22

Regression Modelling SDH Regression Model [ ] hk sd (t x) = hsd 0,k (t) exp x k β sd k Subdistribution hazard ratio = exp [ β sd ] k Association on the effect of a covariate on risk Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 6/22

Why Flexible Parametric Survival Models? [Royston and Lambert, 2011] Models baseline (log-cumulative) SDH function using restricted cubic splines Log-Cumulative SDH Flexible Parametric Model ln(h sd k (t x ik)) = s k (ln(t) γ k, m 0k ) + x ik β k Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 7/22

Why Flexible Parametric Survival Models? [Royston and Lambert, 2011] Models baseline (log-cumulative) SDH function using restricted cubic splines Log-Cumulative SDH Flexible Parametric Model ln(h sd k (t x ik)) = s k (ln(t) γ k, m 0k ) + x ik β k Easy to include time-dependent effects Relaxing Assumption of Proportionality ln(h sd k (t)) = s k(ln(t); γ k, m 0k ) + x k β k + E s k (ln(t); α lk, m lk )x lk l=1 Can predict time-dependent HRs, absolute differences and more... Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 7/22

The Likelihood Function [Jeong and Fine, 2006] Direct Parametrisation (competing risks) n K [ ] L = (fj s (t i x j )) δ ij [S(t x)] 1 K i=1 j=1 j=1 δ ij Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 8/22

The Likelihood Function [Jeong and Fine, 2006] Direct Parametrisation (competing risks) n K [ ] L = (fj s (t i x j )) δ ij [S(t x)] 1 K i=1 j=1 j=1 δ ij Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 8/22

The Likelihood Function [Jeong and Fine, 2006] Direct Parametrisation (competing risks) n K [ ] L = (fj s (t i x j )) δ ij [S(t x)] 1 K i=1 CSH Approach n K [ L = i=1 j=1 j=1 (S(t x)h cs j j=1 δ ij ] (t i x j )) δ ij [S(t x)] 1 K j=1 δ ij Estimates covariate effects on the cause-specific CIF rather than the CSH rate Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 8/22

The Likelihood Function [Jeong and Fine, 2006] Direct Parametrisation (competing risks) n K [ ] K L = (fj s (t i x j )) δ ij 1 F j (t x j ) i=1 j=1 j=1 1 K j=1 δ ij Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 8/22

Existing Approaches with Implementation in Stata stcrreg: Fine & Gray model stcrprep: Restructures the data and calculates appropriate weights [Lambert et al., 2016 (submitted] Both commands above are computationally intensive due to the requirement of numerical integration and fitting models to an expanded dataset. Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 9/22

Quick Note on stcrreg. stset survmm, failure(cause == 1) scale(12) id(id) exit(time 180) (output omitted ). stcrreg i.stage, compete(cause == 2, 3) failure _d: cause == 1 analysis time _t: survmm/12 exit on or before: time 180 id: id Iteration 0: log pseudolikelihood = -27756.886 Iteration 1: log pseudolikelihood = -27756.795 Iteration 2: log pseudolikelihood = -27756.795 Competing-risks regression No. of obs = 14,162 No. of subjects = 14,162 Failure event : cause == 1 No. failed = 3,042 Competing events: cause == 2 3 No. competing = 4,803 No. censored = 6,317 Wald chi2(1) = 880.27 Log pseudolikelihood = -27756.795 Prob > chi2 = 0.0000 (Std. Err. adjusted for 14,162 clusters in id) Robust _t SHR Std. Err. z P> z [95% Conf. Interval] stage Regional 3.502575.1479802 29.67 0.000 3.224223 3.804958 Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 10/22

Quick Note on stcrreg. stset survmm, failure(cause == 1, 2, 3) scale(12) id(id) exit(time 180) (output omitted ). gen cause2 = cond(_d==0,0,cause). stset survmm, failure(cause2 == 1) scale(12) id(id) exit(time 180) (output omitted ). stcrreg i.stage, compete(cause2 == 2, 3) failure _d: cause2 == 1 analysis time _t: survmm/12 exit on or before: time 180 id: id Iteration 0: log pseudolikelihood = -27756.886 Iteration 1: log pseudolikelihood = -27756.795 Iteration 2: log pseudolikelihood = -27756.795 Competing-risks regression No. of obs = 14,162 No. of subjects = 14,162 Failure event : cause2 == 1 No. failed = 3,042 Competing events: cause2 == 2 3 No. competing = 4,795 No. censored = 6,325 Wald chi2(1) = 880.27 Log pseudolikelihood = -27756.795 Prob > chi2 = 0.0000 (Std. Err. adjusted for 14,162 clusters in id) Robust _t SHR Std. Err. z P> z [95% Conf. Interval] stage Regional 3.502575.1479802 29.67 0.000 3.224223 3.804958 Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 10/22

The stpm2cr Command Fit log-cumulative SDH flexible parametric models simultaneously for all cause-specific CIFs Uses individual patient data - significant computational time gains Initial values obtained using stcompet, i.e. the Aalen-Johansen approach, and reg Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 11/22

Main Syntax stpm2cr [ equation1 ][ equation2 ]... [ equationn ] [ if ] [ in ], events(varname) [ censvalue(#) cause(numlist) level(#) alleq noorthog eform oldest mlmethod(string) lininit ] maximise options Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 12/22

Equation Syntax The syntax of each equation is: causename: [ varlist ], scale(scalename) [ df(#) knots(numlist) tvc(varlist) dftvc(df list) knotstvc(numlist) bknots(knotslist) bknotstvc(numlist) noconstant cure ] Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 13/22

U.S. SEER Colorectal Data Survival of males diagnosed with colorectal cancer from 1998 to 2013 Localised and regional stage at diagnosis and ages 75 to 84 years old (14,215) Time to death from: 1 Colorectal cancer 2 Heart disease 3 Other causes Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 14/22

Fitting a Model. stset survmm, failure(cause == 1, 2, 3) scale(12) id(id) exit(time 180) (output omitted ). stpm2cr [colrec_cancer: stage2, scale(hazard) df(5)] /// > [other_causes: stage2, scale(hazard) df(5)] /// > [heart_disease: stage2, scale(hazard) df(5)] /// >, events(cause) cause(1 2 3) cens(0) eform nolog (output omitted ) Obtaining Initial Values Starting to Fit Model Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 15/22

Fitting a Model Log likelihood = -26795.633 Number of obs = 14,162 exp(b) Std. Err. z P> z [95% Conf. Interval] colrec_cancer stage2 3.429293.1448444 29.18 0.000 3.156836 3.725265 cr_rcs_c1_1 2.413135.0384796 55.24 0.000 2.338882 2.489744 cr_rcs_c1_2 1.14997.0120123 13.38 0.000 1.126665 1.173756 cr_rcs_c1_3 1.029327.0059401 5.01 0.000 1.01775 1.041035 cr_rcs_c1_4 1.066262.004378 15.63 0.000 1.057716 1.074877 cr_rcs_c1_5 1.014686.0030352 4.87 0.000 1.008754 1.020652 _cons.0672821.0025488-71.24 0.000.0624675.0724678 other_causes stage2.7203278.0248887-9.49 0.000.673162.7707984 cr_rcs_c2_1 2.977916.0637639 50.96 0.000 2.855527 3.10555 cr_rcs_c2_2.9215077.0122849-6.13 0.000.8977415.9459031 cr_rcs_c2_3.9148877.006712-12.13 0.000.9018266.9281379 cr_rcs_c2_4 1.012339.0052904 2.35 0.019 1.002023 1.022762 cr_rcs_c2_5.996456.0034676-1.02 0.308.9896827 1.003276 _cons.1208134.0033707-75.75 0.000.1143843.1276038 heart_disease stage2.686007.0343982-7.52 0.000.6217948.7568504 cr_rcs_c3_1 2.795411.0817361 35.16 0.000 2.639715 2.960291 cr_rcs_c3_2.9261574.016438-4.32 0.000.8944935.9589422 cr_rcs_c3_3.9187738.0092581-8.41 0.000.9008063.9370997 cr_rcs_c3_4.9981656.0071221-0.26 0.797.9843037 1.012223 cr_rcs_c3_5 1.00047.0047326 0.10 0.921.9912372 1.009789 _cons.0578301.0023139-71.23 0.000.0534682.0625479 Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 15/22

stpm2cr Post-estimation predict newvarname [ if ] [ in ] [, at(varname # [ varname # ] ) cause(numlist) chrdenominator(varname # [ varname #... ] ) chrnumerator(varname # [ varname #... ] ) ci cif cifdiff1(varname # [ varname #... ] ) cifdiff2(varname # [ varname #... ] ) cifratio csh cumodds cumsubhazard cured shrdenominator(varname # [ varname #... ] ) shrnumerator(varname # [ varname #... ] ) subdensity subhazard survivor timevar(varname) uncured xb zeros deviance dxb level(#) ] Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 16/22

Comparison with Aalen-Johansen Estimates 0.6 0.5 Regional Stage Patients Aged 75 to 84 yrs old Cancer (Aalen-Johansen) Other Causes (Aalen-Johansen) Heart Disease (Aalen-Johansen) Cumulative Incidence 0.4 0.3 0.2 0.1 0.0 0 3 6 9 12 15 Years since Diagnosis Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 17/22

Comparison with Aalen-Johansen Estimates. predict cif_reg, cif at(stage1 0 stage2 1) ci Calculating predictions for the following causes: 1 2 3 0.6 0.5 Regional Stage Patients Aged 75 to 84 yrs old Cancer (Aalen-Johansen) Cancer (PSDH) Other Causes (Aalen-Johansen) Other Causes (PSDH) Heart Disease (Aalen-Johansen) Heart Disease (PSDH) Cumulative Incidence 0.4 0.3 0.2 0.1 0.0 0 3 6 9 12 15 Years since Diagnosis Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 17/22

Relaxing the Proportionality Assumption. stpm2cr [colrec_cancer: stage2, scale(hazard) df(5) tvc(stage2) dftvc(3)] /// > [other_causes: stage2, scale(hazard) df(5) tvc(stage2) dftvc(3)] /// > [heart_disease: stage2, scale(hazard) df(5) tvc(stage2) dftvc(3)] /// >, events(cause) cause(1 2 3) cens(0) eform nolog (output omitted ) Obtaining Initial Values Starting to Fit Model (output omitted ) Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 18/22

Comparison with Aalen-Johansen Estimates. predict cif_reg_tvc, cif at(stage1 0 stage2 1) ci Calculating predictions for the following causes: 1 2 3 0.6 0.5 Regional Stage Patients Aged 75 to 84 yrs old Cancer (Aalen-Johansen) Cancer (Non-PSDH) Other Causes (Aalen-Johansen) Other Causes (Non-PSDH) Heart Disease (Aalen-Johansen) Heart Disease (NonPSDH) Cumulative Incidence 0.4 0.3 0.2 0.1 0.0 0 3 6 9 12 15 Years since Diagnosis Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 19/22

Relationship with the CSH (Beyersmann & Schumacher, 2007) [ K j=1 h k (t) = λ k (t) 1 F j(t)] F k (t) + 1 F (t) Can also calculate the CSH from the model To calculate from Fine & Gray model, need to fit models for all causes separately (this could take a long, long time) Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 20/22

Relationship with the CSH (Beyersmann & Schumacher, 2007) [ K j=1 h k (t) = λ k (t) 1 F j(t)] F k (t) + 1 F (t) Can also calculate the CSH from the model To calculate from Fine & Gray model, need to fit models for all causes separately (this could take a long, long time) Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 20/22

stpm2cif [Hinchliffe and Lambert, 2013] vs. stpm2cr 0.3 Regional Stage Patients Aged 75 to 84 yrs old Cancer (stpm2cif) Other Causes (stpm2cif) Heart Disease (stpm2cif) 0.2 Hazard 0.1 0.0 0 3 6 9 12 15 Years since Diagnosis Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 21/22

stpm2cif [Hinchliffe and Lambert, 2013] vs. stpm2cr. predict csh_reg_tvc, csh at(stage1 0 stage2 1) Calculating predictions for the following causes: 1 2 3 0.3 Regional Stage Patients Aged 75 to 84 yrs old Cancer (stpm2cif) Cancer (stpm2cr) Other Causes (stpm2cif) Other Causes (stpm2cr) Heart Disease (stpm2cif) Heart Disease (stpm2cr) 0.2 Hazard 0.1 0.0 0 3 6 9 12 15 Years since Diagnosis Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 21/22

Which way should we model? If interest is just on the effect of one cause - no need to model all cause-specific CIFs Aetiological = CSH regression models Directly model covariate effects on the hazard rate for those at risk Prognostic (decision-making) = SDH regression models Understand why a covariate affects the cause-specific CIF in a certain way Make inferences on both scales for a better understanding [Latouche et al., 2013, Beyersmann et al., 2007] Advantage of FPMs: Computationally efficient, useful out-of-sample predictions... Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 22/22

Selected References I S. I. Mozumder, M.J. Rutherford, and P.C. Lambert. Direct likelihood inference on the cause-specific cumulative incidence function: a flexible parametric regression modelling approach. Statistics in Medicine, 2016 (submitted). P. Royston and P. C. Lambert. Flexible parametric survival analysis in Stata: Beyond the Cox model. Stata Press, 2011. Jong-Hyeon Jeong and Jason Fine. Direct parametric inference for the cumulative incidence function. Journal of the Royal Statistical Society: Series C (Applied Statistics), 55(2):187 200, 2006. P.C. Lambert, Wilkes S. R., and M.J. Crowther. Flexible parametric modelling of the cause-specific cumulative incidence function. Statistics in Medicine, 2016 (submitted). Sally R Hinchliffe and Paul C Lambert. Flexible parametric modelling of cause-specific hazards to estimate cumulative incidence functions. BMC medical research methodology, 13(1):1, 2013. Aurelien Latouche, Arthur Allignol, Jan Beyersmann, Myriam Labopin, and Jason P Fine. A competing risks analysis should report results on all cause-specific hazards and cumulative incidence functions. Journal of clinical epidemiology, 66(6):648 653, 2013. Jan Beyersmann, Markus Dettenkofer, Hartmut Bertz, and Martin Schumacher. A competing risks analysis of bloodstream infection after stem-cell transplantation using subdistribution hazards and cause-specific hazards. Statistics in medicine, 26(30):5360 5369, 2007. Hein Putter, M Fiocco, and RB Geskus. Tutorial in biostatistics: competing risks and multi-state models. Statistics in medicine, 26(11):2389 2430, 2007. Sarwar I Mozumder Direct Likelihood FPM Approach for the Cause-specific CIF 9 Sept 2016 22/22