A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY

Similar documents
S Imputation of Categorical Missing Data: A comparison of Multivariate Normal and. Multinomial Methods. Holmes Finch.

Missing Data and Imputation

Sequential nonparametric regression multiple imputations. Irina Bondarenko and Trivellore Raghunathan

Catherine A. Welch 1*, Séverine Sabia 1,2, Eric Brunner 1, Mika Kivimäki 1 and Martin J. Shipley 1

Methods for Computing Missing Item Response in Psychometric Scale Construction

SCHIATTINO ET AL. Biol Res 38, 2005, Disciplinary Program of Immunology, ICBM, Faculty of Medicine, University of Chile. 3

Kelvin Chan Feb 10, 2015

Applied Medical. Statistics Using SAS. Geoff Der. Brian S. Everitt. CRC Press. Taylor Si Francis Croup. Taylor & Francis Croup, an informa business

An Introduction to Multiple Imputation for Missing Items in Complex Surveys

Multiple imputation for multivariate missing-data. problems: a data analyst s perspective. Joseph L. Schafer and Maren K. Olsen

Selected Topics in Biostatistics Seminar Series. Missing Data. Sponsored by: Center For Clinical Investigation and Cleveland CTSC

Statistical data preparation: management of missing values and outliers

Section on Survey Research Methods JSM 2009

Advanced Handling of Missing Data

Logistic Regression with Missing Data: A Comparison of Handling Methods, and Effects of Percent Missing Values

Help! Statistics! Missing data. An introduction

PubH 7405: REGRESSION ANALYSIS. Propensity Score

Appendix 1. Sensitivity analysis for ACQ: missing value analysis by multiple imputation

Analysis of TB prevalence surveys

BIOSTATISTICAL METHODS

Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study

Should a Normal Imputation Model Be Modified to Impute Skewed Variables?

Module 14: Missing Data Concepts

Multiple imputation for handling missing outcome data when estimating the relative risk

Strategies for handling missing data in randomised trials

Inclusive Strategy with Confirmatory Factor Analysis, Multiple Imputation, and. All Incomplete Variables. Jin Eun Yoo, Brian French, Susan Maller

Bayesian approaches to handling missing data: Practical Exercises

Evaluators Perspectives on Research on Evaluation

Mediation Analysis With Principal Stratification

Bayesian Logistic Regression Modelling via Markov Chain Monte Carlo Algorithm

Master thesis Department of Statistics

Impact and adjustment of selection bias. in the assessment of measurement equivalence

Complier Average Causal Effect (CACE)

Using Test Databases to Evaluate Record Linkage Models and Train Linkage Practitioners

Unit 1 Exploring and Understanding Data

Multiple Imputation For Missing Data: What Is It And How Can I Use It?

Evaluating health management programmes over time: application of propensity score-based weighting to longitudinal datajep_

An application of a pattern-mixture model with multiple imputation for the analysis of longitudinal trials with protocol deviations

Missing data. Patrick Breheny. April 23. Introduction Missing response data Missing covariate data

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

Score Tests of Normality in Bivariate Probit Models

Bias reduction with an adjustment for participants intent to dropout of a randomized controlled clinical trial

2017 American Medical Association. All rights reserved.

MISSING DATA AND PARAMETERS ESTIMATES IN MULTIDIMENSIONAL ITEM RESPONSE MODELS. Federico Andreis, Pier Alda Ferrari *

Analysis Strategies for Clinical Trials with Treatment Non-Adherence Bohdana Ratitch, PhD

Statistical Audit. Summary. Conceptual and. framework. MICHAELA SAISANA and ANDREA SALTELLI European Commission Joint Research Centre (Ispra, Italy)

UN Handbook Ch. 7 'Managing sources of non-sampling error': recommendations on response rates

A Review of Hot Deck Imputation for Survey Non-response

Detection of Unknown Confounders. by Bayesian Confirmatory Factor Analysis

Exploring the Impact of Missing Data in Multiple Regression

Running Head: BAYESIAN MEDIATION WITH MISSING DATA 1. A Bayesian Approach for Estimating Mediation Effects with Missing Data. Craig K.

Business Statistics Probability

The Relative Performance of Full Information Maximum Likelihood Estimation for Missing Data in Structural Equation Models

SESUG Paper SD

Combining Risks from Several Tumors Using Markov Chain Monte Carlo

arxiv: v1 [stat.ap] 11 Jan 2013

Comparing Multiple Imputation to Single Imputation in the Presence of Large Design Effects: A Case Study and Some Theory

Supplementary Online Content

Estimating HIV incidence in the United States from HIV/AIDS surveillance data and biomarker HIV test results

Addendum: Multiple Regression Analysis (DRAFT 8/2/07)

Hakan Demirtas, Anup Amatya, Oksana Pugach, John Cursio, Fei Shi, David Morton and Beyza Doganay 1. INTRODUCTION

Clinical trials with incomplete daily diary data

Missing data in medical research is

What s New in SUDAAN 11

Analysis of Vaccine Effects on Post-Infection Endpoints Biostat 578A Lecture 3

Advanced Bayesian Models for the Social Sciences

Subject index. bootstrap...94 National Maternal and Infant Health Study (NMIHS) example

Missing data in clinical trials: making the best of what we haven t got.

Biostatistics II

How should the propensity score be estimated when some confounders are partially observed?

Abstract. Introduction A SIMULATION STUDY OF ESTIMATORS FOR RATES OF CHANGES IN LONGITUDINAL STUDIES WITH ATTRITION

IAPT: Regression. Regression analyses

Clincial Biostatistics. Regression

EPI 200C Final, June 4 th, 2009 This exam includes 24 questions.

Instrumental Variables Estimation: An Introduction

Discussion. Ralf T. Münnich Variance Estimation in the Presence of Nonresponse

You must answer question 1.

Imputation classes as a framework for inferences from non-random samples. 1

Modern Strategies to Handle Missing Data: A Showcase of Research on Foster Children

Estimands, Missing Data and Sensitivity Analysis: some overview remarks. Roderick Little

Advanced Bayesian Models for the Social Sciences. TA: Elizabeth Menninga (University of North Carolina, Chapel Hill)

Comparison of imputation and modelling methods in the analysis of a physical activity trial with missing outcomes

Imputation approaches for potential outcomes in causal inference

Estimating drug effects in the presence of placebo response: Causal inference using growth mixture modeling

Regression Methods in Biostatistics: Linear, Logistic, Survival, and Repeated Measures Models, 2nd Ed.

In this module I provide a few illustrations of options within lavaan for handling various situations.

Proof. Revised. Chapter 12 General and Specific Factors in Selection Modeling Introduction. Bengt Muthén

Propensity Score Methods for Estimating Causality in the Absence of Random Assignment: Applications for Child Care Policy Research

Missing Data and Institutional Research

Methods for Addressing Selection Bias in Observational Studies

Depressive illness has been shown to be associated with

Accuracy of Range Restriction Correction with Multiple Imputation in Small and Moderate Samples: A Simulation Study

WATCHMAN PROTECT AF Study Rev. 6

Data Analysis Using Regression and Multilevel/Hierarchical Models

Comparison And Application Of Methods To Address Confounding By Indication In Non- Randomized Clinical Studies

Bayesian Statistics Estimation of a Single Mean and Variance MCMC Diagnostics and Missing Data

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Missing Data: Our View of the State of the Art

Modeling Nonresponse Bias Likelihood and Response Propensity

The prevention and handling of the missing data

Transcription:

A COMPARISON OF IMPUTATION METHODS FOR MISSING DATA IN A MULTI-CENTER RANDOMIZED CLINICAL TRIAL: THE IMPACT STUDY Lingqi Tang 1, Thomas R. Belin 2, and Juwon Song 2 1 Center for Health Services Research, UCLA Neuropsychiatric Institute Los Angeles, CA 90024 2 Department of Biostatistics, UCLA School of Public Health, Los Angeles, CA 90095 EMAIL: lqtang@ucla.edu KEY WORDS: Multiple Imputation, Hot-deck Imputation, Mental Health ABSTRACT: IMPACT is a multi-center randomized controlled trial of a disease management program for late life depression. Like many longitudinal clinical trials, this study faces problems of item-nonresponse, unit-level non-response and drop out. In this paper, we compare two approaches to handle incomplete data. The first approach is based on hot-deck multiple imputation of missing response, using a modified predicted mean matching method for item-nonresponse (Bell, 1999) and the approximate Bayesian bootstrap for unit-non-response (Lavori, Dawson and Shera 1995). In the second method, we apply multiple imputation based on the multivariate normal model using SAS PROC MI software. The two methods as well as complete-case analysis are compared in a simulation study. Overall both hot-deck multiple imputations performed well with good coverage rates for Monte Carlo means and testing intervention effects. For dichotomous variables, multiple imputation under the multivariate normal model has lower coverage in three variables, which were derived from a highly skewed variable. On the other hand, complete-case analysis showed that 47% varaibles had low coverage in Monte Carlo means, but it had good coverage in the intervention effects because the intervention and the control group have the same direction in bias and biases were cancelled. 1. THE IMPACT STUDY AND MISSING DATA Project IMPACT (Improving Mood: Promoting Access to Collaborative Treatment for Late Life Depression) is a multi-center study to test the effectiveness of a new disease management model for late life depression. 1,801 patients aged 60 and older with major depression or dysthymic disorder were enrolled in the study from 8 organizations. In each clinic, participants are randomly assigned to either the new treatment model or the usual care. Usual care patients could use any primary care or specialty mental health care services available to them as usual. Intervention participants had access to a depression care manager who was supervised by a psychiatrist and a primary care expert for up to 12 months in addition to the usual care component. After 12 months, all study participants continued with their regular primary care provider in care as usual. Telephone surveys were conducted for a two-year period to assess the effects of the new depression treatment model on various health and economic outcomes, as compared to the usual care delivery models in place at each site. This paper uses IMPACT data from baseline, 3 and 6 month follow-ups. Most variables have missing rates of less than 2% at item-level missing data. The unit response rates to the 3 and 6-month follow-up assessments were 90.2% and 87.2% respectively. Missing rates in the control group are higher than in the intervention group (10.1% vs. 8.3% at Month-3 and 12.6% vs. 10.7% at Month-6), while these differences are not significant. The missing data pattern is not monotone in terms that some of patients (2.6%) missed Month-3 assessments and participated at the Month-6 (Table 1). 1521 participants completed both follow-ups with 783 in the intervention group and 738 in the care as usual group. Table 1. Unit Response Pattern Over Waves Month Response status Over all (N=1801) Interventi on (N=906) Control (N=895) 3 Responded 1624 825 799 Missing 150 63 87 Dropout 16 12 4 Death 11 6 5 6 Responded 1570 801 769 Missing 118 41 77 Dropout 90 55 35 Death 23 * 9* 14* *: cumulative number including month 3 To investigate whether unit-nonresponse behavior differs between the intervention and control groups, we tested bivariate association for each independent variable in a large set of predictors with the outcome of response (coded 1 if response and 0 if nonresponse). The independent variables included in the bivariate analyses were demographic, clinical characteristics and prior treatments at baseline. For Month-6 analyses, we also included clinical characteristics and depression treatment variables at 3 month. A number of covariates are associated with nonresponse at 3 and 6 months in both the intervention and the control group although the strength and statistical significance of the associations varies somewhat. 3430

At 3 months, higher rates of nonresponse are associated with greater cognitive impairment at baseline in both groups. At 6 months, higher rates of nonresponse are associated with education level, functional impairment at baseline and quality of life at 3 months in both groups. There are, however, a number of covariates that are significantly associated with nonresponse in one group but not the other. For example, the nonresponse rates significantly differ across the organizations at both follow-ups in the intervention group but not in the control group. The baseline treatment preferences for counseling rather than antidepressant medication are associated with lower nonresponse rates in the intervention but not the control group. This could be explained by the fact that the intervention offers free counseling to all participants. Nonresponders in the intervention but not in the control group are more likely to state a treatment preference for "neither counseling nor psychotherapy". This might be explained by the fact that intervention patients are actively encouraged to try one of these two treatments while this is less likely to be the case in the usual care controls. Similarly, nonresponders to both follow-ups in the intervention group have significantly lower rates of prior antidepressant use than responders. This is not the case in the usual care group, and it might reflect the fact that in the intervention group where patients were actively encouraged to try treatments such as antidepressants those who do not wish to take such treatments are more likely to drop out than in the usual care group where there is less such pressure. These findings indicate that the unit nonresponse behavior in the IMPACT study differs between the intervention group and usual care group. And this behavior changes over time. We chose mixed-effects models to conduct intent-to-treat analyses. In the mixed-effects models, we specified the covariance structure within subjects using an unstructured model to account for the within-subject correlation over time. Mixed-effects models handles cases with incomplete follow-up, and it is known that it provides unbiased estimates when the missing data are missing at random (Littell, 1996). However, this assumption is correct only under the included variables in the model. If there are variables not included in the mixed-effects model but related to the missing data mechanism, the model without those variables may lead to biased estimates. In addition, such programs are designed to handle missing outcomes, and it is typical as with many other regression programs to drop subjects from analyses when any explanatory variable is missing. 2. IMPUTATION PROCEDURES In this paper, we considered two multiple imputation methods to handle missing data. The first is referred as the hot-deck multiple imputation procedure since it uses a modified predicted mean matching method for item-nonresponse (Bell, 1999, Little 1988) and the approximate Bayesian bootstrap for unit-non-response (Lavori, Dawson and Shera 1995). In the second method, we apply multiple imputation using a multivariate normal model using a missing data procedure (PROC MI) in SAS software. 2.1. HOT-DECK MULTIPLE IMPUTATION For the hot-deck multiple imputation procedure, we dealt with the item-level missing data and unit-level missing data sequentially. We first impute baseline data 5 times. We then conduct unit-level imputation for 3 month followup data using information from the imputed baseline. The item-level imputation at 3 month follow-up is then performed after the unit-level imputation. For 6 month follow-up data, we impute the unit-level imputation using information from 5 imputed baseline and 3 month data followed by the item-level imputation. These sequential procedures will be continued towards the end of the study, resulting five imputed data sets in each follow-up. The hot-deck imputation for item-level missing data modifies the predictive mean matching method (Bell, 1999). This method involves two steps: (1) forming imputation classes; and (2) drawing imputations at random from observed data within each class. In order to impute variable Y, for instance, we select 20 variables, X1-X20, as predictors. In step 1, each missing value of Y is predicted using a multiple regression of Y on all of the independent variables that were observed for that case. Suppose that Respondent A was missing Y and X15 to X20. Respondent A s prediction would be based on a multiple linear regression for people with complete data for Y and X1 to X14 (without regard to which of X15 to X20 were observed). Predicted values are then computed for anyone with complete data on X1 to X14 (everyone in the regression, Respondent A, and all others with the same missing data pattern as Respondent A). The predicted values from this regression are then sorted into equal-sized cells based on the predicted values from all cases. A value of Y is imputed for Respondent A in Step 2 by choosing an observed value of Y at random (with replacement) from the same cell that Respondent A fell in. To reflect the uncertainty of donor cells we created bootstrap weights before each hot-deck imputation. Then we used the 3431

bootstrap weights in the weighted multiple regression as well as in the selection of donors. In each follow-up month, the imputed data sets differs by only the bootstrap weight and the seed used to obtain the random number employed in the hot deck imputation. The hot-deck imputation for unit-level missing data is based on the approximate Bayesian bootstrap method, stratifying by propensity scores of response (Lavori etc., 1995). This method consists of two steps: (1) forming imputation cells by propensity-scores based stratification; (2) imputation based on the approximate Bayesian bootstrap approach within each cell. Let T be the number of follow-up waves and z t be the indicator for response to the t-th wave (coded 1 if response and 0 if nonresponse), t=1, 2,, T. In step 1, we model the propensity of response to the t-th wave through baseline and prior waves: e 1 (X 0 )=Pr(z 1 =1 X 0 ), : : e t (X 0,Y 1,,Y t-1 )=Pr(z 2 =1 X 0,Y 1, Y t-1 ), where X 0 is the baseline covariate vector and Y t is the outcome vector at the t-th wave. Given the stratification based on the quartiles of the propensity scores among the nonrespondants, at step (2), we use the algorithm of the approximate Bayesian bootstrap (Lavori, Dawson and Shera 1995) to select donors: (a) Sample n obs (the number of respondents to the wave) at random with replacement from the observed responses in an imputation cell. This forms a potential set of observed responses. (b) Sample n miss (the number of non-respondents to the wave) with replacement from the potential observed sample from (a). Here the sampled n miss are donors for the imputation. (c) Missing data are replaced by the observed data from the donors determined in (b). 2.2. MULTIPLE IMPUTATION BASED ON A MULTIVARIATE NORMAL MODEL Multiple imputation based on multivariate normal model assumes that complete data follow a multivariate normal distribution (Schafer 1997). Here we chose the SAS MI procedure to generate five multiply imputed data sets. IMPACT data contains both continuous and categorical variables. Even if categorical variables do not follow the normal distribution assumption, multiple imputation may be robust when the amounts of missing information are not large (Schafer, 1997). The percentage of missing values in each variable was quite low in the IMPACT study, and we used dummy codes for categorical variables. Several variables measured as counts were not normally distributed with highly skewed distribution. The log transformation was applied to those variables, and imputed values were transformed back to the original scale. Imputations under the normal distribution resulted in imputed values in a continuous scale and we rounded off those numbers to the possible range of each variable. IMPACT study measures outcome variables repeatedly over time. However, the measured time points were only 3 times (baseline, 3 months, and 6 months), we considered a cross-sectional imputation model instead of a longitudinal one. In the imputation model, we considered longitudinally measured variable as separate variables. That is, the mean SCL-20 depression score was measured at baseline, three months, and six months and treated as three different variables in the model. In some sense, this model may be more general than a longitudinal model, because it considers all possible correlations among variables. 3. APPLICATION This section describes the implementation of the two imputation procedures and presents numeric results from the data analyses. 3.1. HOT-DECK MULTIPLE IMPUTATION To conduct the unit-level imputation, we first fit the multiple logistic regressions to estimate the propensity scores of response stratified by the intervention arms. We started with a large set of independent variables to be considered for a logistic regression on the outcome of response (coded 1 if response and 0 if nonresponse). In order to control for multiple comparisons, we only included in the modeling procedures with a bivariate association with the response that was partially statistically significant (P-value<0.1) The final model included the predictors that were significant (P-value<.05) in at least one of the 5 multiply imputation data sets for either intervention group or the care as usual group. We also forced in independent variables of gender and two design variables: recruitment method (screening or referral) and site (7 dummy variables for 8 sites). To form the imputation cell, we used the intervention status and gender as the primary stratification variable because men and women may have different characteristics. Within each intervention and gender group, the 4 imputation cells were formed based on the quartiles of the estimated propensity scores e t (X 0,Y 1,,Y t-1 ) among the nonresponse group (see Table 2). Averaged across 5 imputation data sets, the propensity scores differ by 2 percent between the intervention group 3432

and care as usual group (mean(sd) =.90 (.04) vs..92(.05) at Month-3, and means(sd) =.87 (.08) vs..89 (.08) at Month-6). Table 2 shows the distribution of the possible matches in each imputation cells, stratified by intervention status, gender and the quartiles of propensity s cores. Table 2. Imputation Cells Stratified by Gender- Propensity Scores Possible matches Month 3 Month 6 Grou p Sex Quartile Group Missing wave Obs. Missing wave Obs. UC M 1 9 61 12 23 2 9 66 11 24 3 9 55 13 56 4 10 85 13 147 F 1 13 68 15 59 2 13 106 16 89 3 12 152 15 158 4 15 204 16 211 IV M 1 7 26 9 17 2 7 54 9 44 3 7 46 9 78 4 8 166 10 142 F 1 11 38 14 33 2 11 105 15 90 3 12 114 15 96 4 12 275 15 300 All 165 1621 207 1567 3.2. MULTIPLE IMPUTATION BASED ON A MULTIVARIATE NORMAL MODEL The imputation model based on multivariate normal model contained 91 variables. It includes an indicator variable for intervention status and 35 variables of baseline characteristics, 20 outcome variables at baseline, 17 outcome variables at Month-3 follow-up, and 18 outcome variables at Month-6 follow-up. All variables had missing rates less than 2%. Outcome variables at each time point contained 8-9 primary outcome variables of our main interest as well as 7-11 secondary outcome variables for later analysis. Five outcome variables of main interest were derived variables from three variables. The model contained those three original variables, and five outcome variables were derived from them after imputation. Two outcome variables (dysthymia and current employment status) were measured only at baseline, and major depression (SCID) was not measured at 3 month follow-up, resulting different number of outcome variables at each time point. Some outcome variables were composite measures from many item variables. Since much bigger model with each item variables had a problem in convergence of the EM algorithm, we did not consider item variable imputation here. The convergence of data augmentation was checked with time-series plots and autocorrelation plots. The first thousand iterations were discarded as a burn-in period. We considered a single chain multiple imputation and 5 imputed data sets were generated at every one thousand iterations. Imputation of categorical variables could generate unreasonable imputed values. For example, missing ethnicity might result in two ethnicities. In that case, imputed values were considered as a probability and randomly redrawn ethnicity variables until only one ethnicity were chosen. In Month-3 and Month-6, 11 and 12 participants were deceased, respectively, and their values should not be imputed. Since the MI procedure did not allow us to choose who should not be imputed, those people s values were deleted after imputation. That is, final imputed data were missing for those participants on follow-up measures after their death. 3.3. RESULTS We conducted outcome analyses using the data sets imputed by two imputation procedures. Dependent variables in IMPACT included 9 binary and 3 interval valued variables: self reported use of antidepressants or psychotherapy in the last 3 months, potentially effective use of antidepressants or psychotherapy, SCL-20 depression scores, rates of remission of depression (defined by an SCL-20 depression score <0.5), rates of treatment response (defined as a 50 % or greater drop in SCL-20 depression score from baseline), the proportion of subjects who met diagnostic criteria for major depression on the SCID, health related functional impairment, and quality of life at 3 and 6-months. For each dependent variable, we conducted an intent-to-treat analysis of repeated measures. We fitted mixed-effects regression models (or mixedeffects logistic regression for dichotomous variables) using baseline, 3 and 6 months followup data with regression adjustment for recruitment method (screening or referral) and participating study organizations. In the mixed-effects models, to account for the within-subject correlation over time, we specified the covariance structure within subjects using an unstructured model. Table 3 presents the imputation effects on unadjusted analyses for 3 key outcome variables (DEPTX=any depression treatment, MAJDEP= Major depression, SCL= SCL-20 depression score). Point estimates from two imputation methods were compared to the complete-data analysis. SE0 stands for the stander error from complete-data analysis. λ is the estimated fraction of missing information. Both methods produced similar point estimates. On the average, the hotdeck multiple imputation produced higher standard errors, larger λ and slightly less significant 3433

intervention effects. Among 12 dependent variables, the difference of t-statistics, on the average, is less than 0.04. Table 3 Imputation Effects for Key Outcome Varaibles from Unadjusted Analyses MI Hot-deck MI multivariate normal Difference on point Difference on point Y Data estimations SE/SE0 100λ estimations SE/SE0 100λ DEPTX03* All -0.0328 0.99 5.67 0.0067 0.95 1.67 UC -0.0260 0.98 6.43 0.0157 0.96 2.83 I -0.0379 1.02 2.27-0.0005 0.96 0.92 DEPTX06** All -0.0056 0.99 10.14 0.0040 0.95 2.84 UC -0.0032 1.03 17.95 0.0132 0.95 3.11 I -0.0052 1.05 18.73-0.0023 0.97 3.85 MAJDEP06** All 0.0065 0.98 7.41 0.0111 1.07 22.25 UC 0.0049 0.99 11.54 0.0111 0.97 6.93 I 0.0065 1.02 12.40 0.0096 1.18 37.51 SCL03* All 0.0043 1.00 8.71 0.0104 0.99 7.81 UC 0.0060 0.99 8.38 0.0057 0.98 7.10 I -0.0002 1.02 12.18 0.0121 1.01 10.53 SCL06** All 0.0143 1.04 18.47 0.0183 0.95 1.77 UC 0.0179 1.02 17.43 0.0118 0.98 9.17 I 0.0078 1.07 22.11 0.0217 0.97 3.77 * Represents 3-month follow-up, ** Represents 6-month follow-up. 4. SIMULATION To compare bias and coverage rates in two imputation methods in the IMPACT study, we performed a Monte Carlo simulation study. The group of 1521 participants who completed all two follow-ups was considered to be the population for this simulation study (intervention group=783, care as usual group =738). For each intervention arm and follow-up wave, simple random samples of n=250 were drawn without replacement. After a sample was drawn, the following missing data mechanism was imposed for each person. The missing mechanism mimicked MAR mechanism with estimated coefficients in the multiple logistic regressions of estimating propensity scores of response from all participates, adjusting intercept such that the percentage of the missingness rates matched the IMPACT observed data. The procedure was conducted separately by the intervention and usual care group as well as the follow-up months. The missingmess rates matched the true data with 8% and 11% for intervention group at months 3 and 6, and 10% and 11% for care as usual group. After imposing a pattern of unit-missingness, the two imputation methods were applied and point estimates (the group means) and intervention effects were calculated for each dependent variable. The estimated variances were adjusted by the finite population correction factor because the sampling fraction is large in the simulation. The procedure was carried out 1000 times. Tables 4 and 5 display the simulation results from point estimation and intervention effects for 3 key outcome varaibles. The tables report the Monte Carlo mean estimated bias, the actual coverage rate (cvg.) indicating the percentage of 95% intervals out of 1000 that covered the true estimand, the length of the 95% confidence interval (lngh). Overall hot-deck multiple imputations performed well with good coverage rates. Multiple imputations under the multivariate normal model showed good coverage in all continuous variables. For dichotomous variables, it has lower coverage in three variables, which were derived from highly skewed variable, the number of all mental health therapy visit. Imputed items tended to be more positive than the observed ones; the percentage of zero was 40.9% among imputed cases and 66.7% among observed cases. Since the distribution of this variable is very skewed, the mean is much higher than the median, and it might end up imputing higher values than observed values. Similar patterns were shown in other simulated data with large bias in this variable and variables derived from this variable. On the other hand, 47% varaibles in complete-case analysis showed lower coverage in Monte Carlo means, but it showed good coverage in the intervention effects because the intervention and the control group have the same direction in bias and biases were cancelled. 3434

Table 4. Point estimates for key outcome varaibles Complete-case analysis Hot-deck Multivariate Normal Y Data bias Cvg. lngh bias Cvg. lngh 100λ bias Cvg. lngh 100λ DEPTX03 All 0.0009 0.96 0.07-0.0011 0.96 0.08 10.21 0.0377 0.45 0.07 5.47 UC 0.0029 0.96 0.11 0.0021 0.96 0.11 12.17 0.0432 0.65 0.11 6.32 IV 0.0037 0.95 0.09 0.0032 0.96 0.09 9.98 0.0395 0.56 0.09 5.10 DEPTX06 All 0.0060 0.94 0.07 0.0001 0.96 0.08 15.51 0.0119 0.91 0.07 7.99 UC 0.0101 0.93 0.11 0.0039 0.96 0.12 16.72 0.0211 0.91 0.11 8.83 IV 0.0067 0.93 0.09 0.0034 0.95 0.09 16.16 0.0098 0.91 0.09 7.95 MAJDEP06 All -0.0119 0.88 0.07-0.0048 0.95 0.07 17.47 0.0009 0.94 0.07 13.85 UC -0.0133 0.91 0.10-0.0074 0.93 0.11 17.98-0.0032 0.95 0.11 13.35 IV -0.0130 0.89 0.09-0.0060 0.94 0.10 17.68 0.0011 0.96 0.09 14.04 SCL03 All 0.0009 0.96 0.10 0.0023 0.96 0.11 10.90 0.0036 0.95 0.10 5.65 UC -0.0025 0.94 0.14-0.0020 0.95 0.15 12.90-0.0015 0.95 0.14 6.68 IV -0.0011 0.96 0.14-0.0016 0.95 0.15 9.99 0.0005 0.95 0.14 5.14 SCL06 All -0.0298 0.78 0.11-0.0140 0.91 0.11 16.23-0.0077 0.93 0.11 8.31 UC -0.0326 0.83 0.15-0.0195 0.91 0.17 17.65-0.0133 0.92 0.15 8.38 IV -0.0322 0.83 0.14-0.0163 0.92 0.15 15.94-0.0098 0.94 0.14 8.31 Table 5. Intervention effects for key outcome varaibles Complete-case analysis Hotdeck Multivariate Normal Y bias cvg. lngth bias cvg. lngth 100λ bias cvg. lngth 100λ DEPTX03 0.0004 95.60 0.14 0.0007 96.30 0.15 9.98-0.0042 98.10 0.14 5.10 DEPTX06-0.0031 94.60 0.14-0.0003 95.90 0.15 16.16-0.0110 95.30 0.14 7.95 MAJDEP06 0.0012 94.50 0.13 0.0022 94.50 0.15 17.68 0.0051 95.30 0.14 14.04 SCL03 0.0007 94.90 0.20-0.0003 96.40 0.21 9.99 0.0013 95.20 0.20 5.14 SCL06 0.0001 94.70 0.21 0.0029 95.30 0.22 15.94 0.0032 95.00 0.21 8.31 REFERENCE Little R. J. (1988) Missing data adjustments in large surveys,.j Business and Economic Statistics, 6, 287-301. Bell R (1999). Presentation at Depression PORT Methods Workshop (I). RAND, Santa Monica, CA. Rubin DB. (1987), Multiple imputation for nonresponse in surveys. New York: J Wiley & Sons. Lavori P., Dawson R. and Shera D. A multiple imputation strategy for clinical trials with truncation of patient data. Statistics in Medicine 1995; 14: 1913-1925 Littell, R. C., Milliken, G. A., Stroup, W. W., and Wolfinger, R. D., (1996) SAS System for Mixed Models, SAS Institute, Inc. Schafer, JL, (1997) Analysis of Incomplete Multivariate Data, Chapman & Hall. 3435