Bayesian growth mixture models to distinguish hemoglobin value trajectories in blood donors

Similar documents
Modeling Longitudinal Data of Blood Donors

Monitoring of iron in whole blood donors at Sanquin Blood Bank

A novel approach to estimation of the time to biomarker threshold: Applications to HIV

Latent Class Analysis Identification of Syndromes in Alzheimer's Disease: A Bayesian Approach

Using dynamic prediction to inform the optimal intervention time for an abdominal aortic aneurysm screening programme

Bayesian Joint Modelling of Longitudinal and Survival Data of HIV/AIDS Patients: A Case Study at Bale Robe General Hospital, Ethiopia

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

Individual Differences in Attention During Category Learning

Meta-analysis of few small studies in small populations and rare diseases

Bayesian hierarchical modelling

Accommodating informative dropout and death: a joint modelling approach for longitudinal and semicompeting risks data

Mediation Analysis With Principal Stratification

Sensitivity of heterogeneity priors in meta-analysis

Bayesian Latent Subgroup Design for Basket Trials

Dynamic borrowing of historical data: Performance and comparison of existing methods based on a case study

Bayesians methods in system identification: equivalences, differences, and misunderstandings

Introduction. Patrick Breheny. January 10. The meaning of probability The Bayesian approach Preview of MCMC methods

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill)

Biostatistical modelling in genomics for clinical cancer studies

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling

Bayesian Methods p.3/29. data and output management- by David Spiegalhalter. WinBUGS - result from MRC funded project with David

An application of a pattern-mixture model with multiple imputation for the analysis of longitudinal trials with protocol deviations

Bayesian Inference. Thomas Nichols. With thanks Lee Harrison

Bayesian random-effects meta-analysis made simple

Bayesian Inference Bayes Laplace

Fitting growth curve models in the Bayesian framework

Learning from data when all models are wrong

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3

Meta-analysis of two studies in the presence of heterogeneity with applications in rare diseases

Combining Risks from Several Tumors Using Markov Chain Monte Carlo

Ordinal Data Modeling

Multiple indicator hidden Markov model with an application to medical utilization data

Estimating and comparing cancer progression risks under varying surveillance protocols: moving beyond the Tower of Babel

Systematic review with multiple treatment comparison metaanalysis. on interventions for hepatic encephalopathy

Advanced Bayesian Models for the Social Sciences

Treatment effect estimates adjusted for small-study effects via a limit meta-analysis

Modelling prognostic capabilities of tumor size: application to colorectal cancer

Regression Discontinuity Designs: An Approach to Causal Inference Using Observational Data

Introduction to Bayesian Analysis 1

Application of Multinomial-Dirichlet Conjugate in MCMC Estimation : A Breast Cancer Study

CROSS-VALIDATION IN GROUP-BASED LATENT TRAJECTORY MODELING WHEN ASSUMING A CENSORED NORMAL MODEL. Megan M. Marron

Bayesian Methods for Medical Test Accuracy. Broemeling & Associates Inc., 1023 Fox Ridge Road, Medical Lake, WA 99022, USA;

Day Hospital versus Ordinary Hospitalization: factors in treatment discrimination

Technical Appendix: Methods and Results of Growth Mixture Modelling

For general queries, contact

Growth Modeling with Non-Ignorable Dropout: Alternative Analyses of the STAR*D Antidepressant Trial

Advanced IPD meta-analysis methods for observational studies

Bayesian graphical models for combining multiple data sources, with applications in environmental epidemiology

Tutorial #7A: Latent Class Growth Model (# seizures)

AALBORG UNIVERSITY. Prediction of the Insulin Sensitivity Index using Bayesian Network. Susanne G. Bøttcher and Claus Dethlefsen

Fitting growth curve models in the Bayesian framework

HIDDEN MARKOV MODELS FOR ALCOHOLISM TREATMENT TRIAL DATA

Bayesian Joint Modelling of Benefit and Risk in Drug Development

Hidden Markov Models for Alcoholism Treatment Trial Data

Historical controls in clinical trials: the meta-analytic predictive approach applied to over-dispersed count data

Bayesian Statistics Estimation of a Single Mean and Variance MCMC Diagnostics and Missing Data

Real-Time Individual Predictions of Prostate Cancer Recurrence Using Joint Models

Bayesian versus maximum likelihood estimation of treatment effects in bivariate probit instrumental variable models

Case Studies in Bayesian Augmented Control Design. Nathan Enas Ji Lin Eli Lilly and Company

A Case Study: Two-sample categorical data

Modelling Spatially Correlated Survival Data for Individuals with Multiple Cancers

Comparative Study of K-means, Gaussian Mixture Model, Fuzzy C-means algorithms for Brain Tumor Segmentation

Statistics for Social and Behavioral Sciences

BAYESIAN HYPOTHESIS TESTING WITH SPSS AMOS

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Kernel Density Estimation for Random-effects Meta-analysis

Combining Risks from Several Tumors using Markov chain Monte Carlo (MCMC)

Item Analysis: Classical and Beyond

TRIPODS Workshop: Models & Machine Learning for Causal I. & Decision Making

General growth mixture modeling for randomized preventive interventions

Individualized Treatment Effects Using a Non-parametric Bayesian Approach

CopulaDTA: An R package for copula-based bivariate beta-binomial models for diagnostic test accuracy studies in Bayesian framework

Bayesian Nonparametric Methods for Precision Medicine

Impact of an equality constraint on the class-specific residual variances in regression mixtures: A Monte Carlo simulation study

George B. Ploubidis. The role of sensitivity analysis in the estimation of causal pathways from observational data. Improving health worldwide

Risk-prediction modelling in cancer with multiple genomic data sets: a Bayesian variable selection approach

Statistical methods for the meta-analysis of full ROC curves

A Multilevel Testlet Model for Dual Local Dependence

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

WinBUGS : part 1. Bruno Boulanger Jonathan Jaeger Astrid Jullion Philippe Lambert. Gabriele, living with rheumatoid arthritis

Bayesian Hierarchical Models for Fitting Dose-Response Relationships

A Bayesian Analysis of the Effect of Polyaromatic Hydrocarbons on Hormone Levels During the Human Menstrual Cycle

Bivariate lifetime geometric distribution in presence of cure fractions

Multiple equilibria in social interaction models: estimating a model of adolescent smoking behavior.

Response to Comment on Cognitive Science in the field: Does exercising core mathematical concepts improve school readiness?

Review: Logistic regression, Gaussian naïve Bayes, linear regression, and their connections

A Brief Introduction to Bayesian Statistics

MODELING NONCOMPENSATORY CHOICES WITH A COMPENSATORY MODEL FOR A PRODUCT DESIGN SEARCH

No Large Differences Among Centers in a Multi-Center Neurosurgical Clinical Trial

Bayesian Variable Selection Tutorial

Clinical trials with incomplete daily diary data

Bayesian Models for Combining Data Across Subjects and Studies in Predictive fmri Data Analysis

Quantifying and Mitigating the Effect of Preferential Sampling on Phylodynamic Inference

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

THE ANALYSIS OF METHADONE CLINIC DATA USING MARGINAL AND CONDITIONAL LOGISTIC MODELS WITH MIXTURE OR RANDOM EFFECTS

Bayesian Joinpoint Regression Model for Childhood Brain Cancer Mortality

Technical Appendix. Response to: Quality review of a proposed EEPRU review of the EQ-5D-5L value set for England

SLAUGHTER PIG MARKETING MANAGEMENT: UTILIZATION OF HIGHLY BIASED HERD SPECIFIC DATA. Henrik Kure

A Latent Mixture Approach to Modeling Zero- Inflated Bivariate Ordinal Data

Transcription:

Bayesian growth mixture models to distinguish hemoglobin value trajectories in blood donors Kazem Nasserinejad 1 Joost van Rosmalen 1 Mireille Baart 2 Katja van den Hurk 2 Dimitris Rizopoulos 1 Emmanuel Lesaffre 1,3 Wim de Kort 2,4 1 Department of Biostatistics, Erasmus University Medical Center, Rotterdam, the Netherlands 2 Sanquin Research, Department of Donor Studies, Amsterdam, the Netherlands 3 L-Biostat, KU Leuven, Leuven, Belgium 4 Department of Public Health, Academic Medical Center, Amsterdam, the Netherlands Bayes2014, June, 2014 Bayes2014, June, 2014 1 / 36

Introduction Blood donation leads to a temporary reduction in the Hb level. This needs a period after donation for the Hb value to recover to its pre-donation level. Individual donors may differ in their recovery and minimum interval (56 days) may not be safe for each individual. We aim to classify the longitudinal Hb values measured in blood donors. Bayes2014, June, 2014 2 / 36

Blood donation recovery process Blood donation recovery process Hemoglobin level 8.2 8.3 8.4 8.5 8.6 Recovery Time Time Hb threshold for eligibility is 8.4(7.8) mmol/l for male(female) Bayes2014, June, 2014 3 / 36

Data A random sample of 2000 male and 2000 female donors collected by Sanquin Blood Supply (the Netherlands) (2005-2012). New-entrant donors, at least one donation and one visit afterward. Covariates Season that donation takes place Age at 1st visit/donation Time since previous donation Number of donations in past two years Gender Bayes2014, June, 2014 4 / 36

Hemoglobin profiles for male and female donors Hemoglobin level (mmol/l) 7.0 7.5 8.0 8.5 9.0 9.5 10.0 10.5 11.0 11.5 Donor I Donor II Male Hemoglobin level (mmol/l) 7.0 7.5 8.0 8.5 9.0 9.5 10.0 10.5 11.0 11.5 Donor I Donor II Female 0 400 900 1500 2100 2700 3300 3900 Time since first visit (day) 0 400 900 1500 2100 2700 3300 3900 Time since first visit (day) Bayes2014, June, 2014 5 / 36

Descriptive statistics Male Female Donors deferred at least once due to low Hb 18.4% 32.3% Donations in cold season 49.9% 49.5% Age at screening visit (years) 34.8 (24.1, 45.8) 29.4 (21.6, 42.2) Number of donations 9 (4, 16) 5 (2, 9) Hb value at screening visit (mmol/l) 9.4 (9.0, 9.9) 8.4 (8.0, 8.8) Inter-donation interval (days) 90 (74, 126) 138 (120, 188) Bayes2014, June, 2014 6 / 36

A statistical model Capture the unobserved heterogeneity by (continuous latent classes) random effects. Mixed-effects model Hb it = θ 1 + b i0 + β 1 Age i1 + β 2 Season it + β 3 TSPD it +β 4 TSPD 2 it + β 5 BFD it + (θ 2 + b i1 )NODY 2 it + ɛ it Non-informative priors! Bayes2014, June, 2014 7 / 36

Results of LMM (Male) Estimate CI σ 2 θ 1 9.589 9.556 9.624 3.1 10 4 θ 2-0.0545-0.0596-0.0496 6.3 10 6 Bayes2014, June, 2014 8 / 36

A statistical model Capture the unobserved heterogeneity by latent class mixed model (growth mixture model). The density function of y in a Gaussian growth mixture model can be expressed as: f (y θ) = K w k f k (y θ k ), k=1 f k (y θ k ) (k=1,...,k) are density functions describe each class w k is mixing distribution, 0 w k 1 and K k=1 w k = 1. Bayes2014, June, 2014 9 / 36

A GMM with 2 latent classes A growth mixture model with two classes where one class restricted to be stable regarding to number of donations. If person i belongs to class 1: (Stable class) Hb it = θ 11 + b i10 + β 1 Age i1 + β 2 Season it + β 3 TSPD it If person i belongs to class 2: +β 4 TSPD 2 it + β 5 BFD it + ɛ it Hb it = θ 21 + b i20 + β 1 Age i1 + β 2 Season it + β 3 TSPD it +β 4 TSPD 2 it + β 5 BFD it + (θ 22 + b i21 )NODY 2 it + ɛ it Data dependent priors! Elliott et al. 2005 Biostatistics. Tends to force the posterior trajectories of the classes to be more equal! Identifiable! Bayes2014, June, 2014 10 / 36

Two scenarios for the mixing distribution Scenario 1: Latent Class Model no covariates, Dirichlet prior on mixing distribution. g[j] dcat(w[j, ]) # Latent class indicator w[i, 1 : k] ddirch(a 1, a 2,..., a k ) Fruhwirth-Schnatter 2006: a k > 1 to ensure no empty class! Rousseau et al., 2011: a k < d/2 Ensure no over fitting! Bayes2014, June, 2014 11 / 36

Two scenarios for the mixing distribution Scenario 2: Latent Class Model depends on some other covariates g[j] dcat(w[j, ]) # Latent class indicator w[j, class] < phi[j, class]/sum(phi[j, ]) log(phi[j, class]) <- gamma[class, 1] + gamma[class, 2] Age0[j] + gamma[class, 3] Hb0[j] Informative or non-informative priors? gamma[class, c] dnorm(0, 4/9) After log transformation, this provides relatively flat prior! Garrett and Zeger, 2000 Biometrics. Bayes2014, June, 2014 12 / 36

Priors distribution after log transformation! tau=40 tau=0.4 tau=0.04 Density 0 2 4 6 8 10 Density 0.0 0.5 1.0 1.5 Density 0 2 4 6 8 10 0.40 0.45 0.50 0.55 0.60 0.0 0.2 0.4 0.6 0.8 1.0 wi wi 0.0 0.2 0.4 0.6 0.8 1.0 wi Bayes2014, June, 2014 13 / 36

Results of GMM with 2 latent classes Estimate CI σ 2 w i θ 11 9.037 8.998 9.072 3.5 10 4 42.5% θ 21 9.831 9.789 9.872 4.3 10 4 57.5% θ 22-0.06849-0.0749-0.0622 1.1 10 5 Bayes2014, June, 2014 14 / 36

Software Jags via Rjags package in R! http://mcmc-jags.sourceforge.net/ Stan via Rstan package in R! http://mc-stan.org/index.html Stan, less iterations to be converged, e.g 800-1500 samples! Jags, more iterations to be converged, e.g 20,000-100,000 samples! Both computationally expensive and regarding to time almost the same! Bayes2014, June, 2014 15 / 36

Jags/Bugs syntax for growth mixture model model { for(j in 1:n) { for(i in offset[j]:(offset[j+1]-1)){ Hb[i]~dnorm(mu[i],tau) mu[i]<-beta[1]*age1[j]+beta[2]*season[i]+ +beta[5]*d[i]+ equals(g[j],1)*(theta11+b01[j])+ # class I equals(g[j],2)*(theta21+b02[j,1]+(theta22+b02[j,2])*nody2[i]) # class II } g[j] ~ dcat(w[j,]) # Latent class indicator for(class in 1:2) { w[j,class]<-phi[j,class]/sum(phi[j,]) log(phi[j,class])<-gamma[class,1]+ gamma[class,2]*age0[j]+gamma[class,3]*hb0[j] } } Bayes2014, June, 2014 16 / 36

Stan syntax for growth mixture model transformed parameters { vector[n] w1; //mixing proportions vector[n] w2; //mixing proportions vector[n] ppi1; // for(j in 1:n){ ppi1[j]<-exp(gamma[1,1]+ gamma[2,1]*age0[j]+ gamma[3,1]*hb0[j]); } for(j in 1:n){ w1[j]<-ppi1[j]/(ppi1[j]+1); w2[j]<-1/(ppi1[j]+1); } } model { for(j in 1:n) { for(i in offset[j]:(offset[j+1]-1)){ psi[1]<- log(w1[j])+normal_log(hb[i],theta11+b1[j]+beta[1]*age1[i]+, sigma);// Class I psi[2]<- log(w2[j])+normal_log(hb[i],theta21+b2[j,1]+beta[1]*age1[i]+ +(theta22+b2[j,2])*dony2[i], sigma);// Class II increment_log_prob(log_sum_exp(psi)); } } } Bayes2014, June, 2014 17 / 36

A GMM with 3 latent classes A growth mixture model with three classes where one class restricted to be stable regarding to number of donation. If person i belongs to class 1: (Stable class) Hb it = θ 11 + b i10 + β 1 Age i1 +... + β 5 BFD it + ɛ it If person i belongs to class 2: Hb it = θ 21 + b i20 + β 1 Age i1 +... + β 5 BFD it + (θ 22 + b i21 )NODY 2 it + ɛ it If person i belongs to class 3: Hb it = θ 31 + b i30 + β 1 Age i1 +... + β 5 BFD it + (θ 32 + b i31 )NODY 2 it + ɛ it Data dependent priors! Unidentifiable! Bayes2014, June, 2014 18 / 36

Label switching problem f (y θ) = K! permutations K w k f k (y θ k ) = w 1 f 1 (y θ 1 ) +... + w k f k (y θ k ) k=1 Simple solution: Imposing artificial identifiability constraints on the class parameters, to make them identifiable. Relabeling algorithm (on McMc output): Stephens 2000. More algorithms (on McMc output)!!! When the data set is large, label switching very seldom occurs! Pritchard et al., 2000; Guillot et al., 2005 Bayes2014, June, 2014 19 / 36

Label switching problem example θ 8.0 9.0 10.0 11.0 Original McMc output θ 8.0 9.0 10.0 11.0 STEPHENS 0 500 1000 1500 2000 2500 3000 0 500 1000 1500 2000 2500 3000 Iteration Iteration 0 50 100 150 9.4 9.6 9.8 10.0 10.2 10.4 9.4 9.6 9.8 10.0 10.2 10.4 θ21 θ21 0 50 100 150 0 10 20 30 40 50 0 10 20 30 40 9.4 9.6 9.8 10.0 10.2 10.4 θ31 9.4 9.6 9.8 10.0 10.2 10.4 θ31 Bayes2014, June, 2014 20 / 36

Results of GMM with 3 latent classes Estimate CI σ 2 w i θ 11 8.96 8.91 9.02 7.6 10 4 22.5% θ 21 9.59 9.54 9.63 6.5 10 4 50.3% θ 22-0.053-0.063-0.044 2.4 10 5 θ 31 10.21 10.14 10.28 1.3 10 3 27.2% θ 32-0.085-0.102-0.069 6.7 10 5 Bayes2014, June, 2014 21 / 36

Model comparison (How many classes?) WinBUGS currently does not compute DIC if the likelihood depends on any discrete parameters Jags currently needs more than one chain to compute DIC. (between chains label switching problems) Computing DIC manually outside the Bugs for large data set is almost impossible! Solution: Computing BIC manually outside the Bugs based on marginal posterior parameter! GMM with 4 latent classes selected. Bayes2014, June, 2014 22 / 36

Assess model fit Using PPC by computing a Bayesian P-value, χ 2 discrepancy measure to test both the distributional and latent class number assumption of the model. Posterior classification table. Bayes2014, June, 2014 23 / 36

Assess model fit Ex: Class in each iteration: 1 1 1 2 1 3 1...4 1 1 Posterior classification table. Male Female Latent Class Class I Class II Class III Class IV Class I Class II Class III Class IV Class I 0.795 0.204 0.001 0.000 0.771 0.227 0.002 0.000 Class II 0.135 0.717 0.144 0.004 0.103 0.744 0.142 0.001 Class III 0.001 0.150 0.736 0.113 0.001 0.163 0.680 0.156 Class IV 0.000 0.001 0.191 0.808 0.000 0.008 0.121 0.780 Bayes2014, June, 2014 24 / 36

Intercepts of different classes for male donors θ 8.5 9.0 9.5 10.0 10.5 Linear mixed effects model θ 8.5 9.0 9.5 10.0 10.5 Growth mixture model (2) 0 1000 2000 3000 4000 5000 0 1000 2000 3000 4000 5000 iteration iteration θ 8.5 9.0 9.5 10.0 10.5 Growth mixture model (3) θ 8.5 9.0 9.5 10.0 10.5 Growth mixture model (4) 0 1000 2000 3000 4000 5000 iteration 0 1000 2000 3000 4000 5000 iteration Bayes2014, June, 2014 25 / 36

Parameter estimation (GMM IV) for male donors Male donors Female donors Parameter Estimation 95% CI Estimation 95% CI Intercept I 8.88 8.81 8.89 7.93 7.85 8.00 Intercept II 9.34 9.27 9.44 8.30 8.24 8.36 Intercept III 9.84 9.78 9.94 8.77 8.68 8.84 Intercept IV 10.38 10.23 10.59 9.13 9.02 9.25 NODY2 II 0.05 0.06 0.04 0.02 0.04 0.01 NODY2 III 0.06 0.08 0.05 0.06 0.10 0.03 NODY2 IV 0.09 0.12 0.07 0.14 0.17 0.10 Age 0 (year) 1.7 10 4 2.8 10 3 3.1 10 3 9.6 10 3 6.4 10 3 1.3 10 2 Season 7.7 10 2 8.9 10 2 6.5 10 2 5.2 10 2 6.2 10 2 4.2 10 2 TSPD(month) 2.5 10 2 1.9 10 2 3.0 10 2 1.9 10 2 1.4 10 2 2.3 10 2 TSPD 2 (month) 1.0 10 4 1.3 10 4 7.3 10 5 4.5 10 5 6.2 10 5 2.8 10 5 BFD 8.7 10 4 3.2 10 2 3.1 10 2 8.3 10 2 5.4 10 2 11.1 10 2 Bayes2014, June, 2014 26 / 36

Hemoglobin profiles for different classes for male donors Hemoglobin level (mmol/l) 7.0 8.0 9.0 10.0 11.0 12.0 Class I (13.5%) Hemoglobin level (mmol/l) 7.0 8.0 9.0 10.0 11.0 12.0 Class II (37.9%) 0 300 700 1100 1600 2100 2600 3100 3600 0 300 700 1100 1600 2100 2600 3100 3600 Time since first visit (day) Time since first visit (day) Hemoglobin level (mmol/l) 7.0 8.0 9.0 10.0 11.0 12.0 Class III (37.7%) Hemoglobin level (mmol/l) 7.0 8.0 9.0 10.0 11.0 12.0 Class IV (10.9%) 0 300 700 1100 1600 2100 2600 3100 3600 Time since first visit (day) 0 300 700 1100 1600 2100 2600 3100 3600 Time since first visit (day) Bayes2014, June, 2014 27 / 36

Hemoglobin profiles for different classes for female donors Hemoglobin level (mmol/l) 7.0 8.0 9.0 10.0 11.0 Class I (15.4%) Hemoglobin level (mmol/l) 7.0 8.0 9.0 10.0 11.0 Class II (43.5%) 0 300 700 1100 1600 2100 2600 3100 3600 0 300 700 1100 1600 2100 2600 3100 3600 Time since first visit (day) Time since first visit (day) Hemoglobin level (mmol/l) 7.0 8.0 9.0 10.0 11.0 Class III (28.4%) Hemoglobin level (mmol/l) 7.0 8.0 9.0 10.0 11.0 Class IV (12.7%) 0 300 700 1100 1600 2100 2600 3100 3600 Time since first visit (day) 0 300 700 1100 1600 2100 2600 3100 3600 Time since first visit (day) Bayes2014, June, 2014 28 / 36

Latent classes description Male donors Class I Class II Class III Class IV Size of the class 13.5% 37.9% 37.7% 10.9% Donors deferred at least once due to low Hb 39.3 % 21.6 % 10.6% 8.2 % Number of donations 7(2, 13) 9(5, 16) 10(5, 17) 11(5, 18) Hb value at screening visit (mmol/l) 8.7(8.5, 8.9) 9.2(9.0, 9.4) 9.7(9.4, 10.0) 10.4(10.1, 10.7) Age at screening visit (year) 39(29, 51) 36(26, 48) 32(24, 43) 33(24, 44) Female donors Class I Class II Class III Class IV Size of the class 15.4% 43.5% 28.4% 12.7% Donors deferred at least once due to low Hb 66.6% 35.4% 16.6% 15.8% Number of donations 2(1, 5) 4(2, 9) 6(3, 11) 6(3, 9) Hb value at screening visit (mmol/l) 7.8(7.6, 8.0) 8.2(8.0, 8.5) 8.7(8.5, 9.0) 9.3(9.1, 9.6) Age at screening visit (year) 43.5(33, 52) 29(22, 41) 27(21, 40) 23(20, 31) Bayes2014, June, 2014 29 / 36

Donors deferred proportion Kaplan-Meier curves of the latent classes Cumulative proportion 0.0 0.2 0.4 0.6 0.8 1.0 Class IV Class III Class II Class I Cumulative proportion 0.0 0.2 0.4 0.6 0.8 1.0 Class IV Class III Class II Class I 0 5 10 15 20 25 30 5 10 15 20 Donation number Donation number Bayes2014, June, 2014 30 / 36

Predicting latent class membership The probability that individual i belongs to the kth latent class can be calculated: P(c i = k Hb i, Age 0i, Hb 0i, θ k ) = w ik f k (Hb i θ k ) K k=1 w ikf k (Hb i θ k ) This is a dynamic prediction for the latent class of a donor. In each visit it can be updated! Bayes2014, June, 2014 31 / 36

EX:Predicting latent class membership For a male donor with Hb value of 8.9 mmol/l and age of 29 years at the screening visit. 8.5 9.0 9.5 Screening visit Class III Class I Class IV Visit 1 Class III Class I Class IV Class II Class II Visit 2 Visit 3 8.5 9.0 9.5 Class I Class II Class III Class IV Class I Class IV Class II Class III Visit 4 Visit 5 8.5 9.0 9.5 Class I Class IV Class II Class III Class I Class IV Class II Class III Visit 6 Visit 7 8.5 9.0 9.5 Class I Class IV Class II Class III Class I Class IV Class II Class III Visit 8 Visit 9 8.5 9.0 9.5 Class I Class IV Class II Class III Class I Class IV Class II Class III 1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10 Bayes2014, June, 2014 32 / 36

Conclusion To capture the unobserved heterogeneity of Hb profiles, we implemented a Bayesian GMM. This model assumes that each donor belongs to one of several latent classes. Within each class, the Hb trajectory follows a linear mixed model. In addition we let the latent class membership depend on the age and hemoglobin value at first visit. Bayes2014, June, 2014 33 / 36

Conclusion Our fitted GMM suggests 4 different classes of Hb trajectories. This model gives some insight in the donation process and is a start to better predict for which donors care needs to be exercised not to produce a too low Hb level. Bayes2014, June, 2014 34 / 36

Future research Find a better way to determine the number of optimum latent classes: Reversible Jump McMC (Green 1995) Variational Bayes algorithm (Bruneau 2010) Robust Bayesian growth mixture model. Cost-effectiveness analysis. Bayes2014, June, 2014 35 / 36

Thank you! K.nasserinejad@erasmusmc.nl Bayes2014, June, 2014 36 / 36