Validity of Proxies and Correction for Proxy Use When Evaluating Social Determinants of Health in Stroke Patients

Similar documents
Fatalism, Optimism, Spirituality, Depressive Symptoms, and Stroke Outcome A Population-Based Analysis

Does Socioeconomic Status or Acculturation Modify the Association Between Ethnicity and Hypertension Treatment Before Stroke?

There is little information available about stroke risk after

Access to Care, Acculturation, and Risk Factors for Stroke in Mexican Americans. The Brain Attack Surveillance in Corpus Christi (BASIC) Project

A review of statistical methods in the analysis of data arising from observer reliability studies (Part 11) *

Unit 1 Exploring and Understanding Data

Fast Food and Neighborhood Stroke Risk

THE APPLICATION OF ORDINAL LOGISTIC HEIRARCHICAL LINEAR MODELING IN ITEM RESPONSE THEORY FOR THE PURPOSES OF DIFFERENTIAL ITEM FUNCTIONING DETECTION

FAMILY RESILIENCY, UNCERTAINTY, OPTIMISM, AND THE QUALITY OF LIFE OF INDIVIDUALS WITH HIV/AIDS

More than 35% of US adults are obese. 1 Obesity accounts

Sikha Naik Mark Vosvick, Ph.D, Chwee-Lye Chng, Ph.D, and John Ridings, A.A. Center for Psychosocial Health

Personal Well-being Among Medical Students: Findings from a Pilot Survey

Chapter 1: Exploring Data

Evaluation of Patient and Proxy Responses on the Activity Measure for Postacute Care

PSYCHOMETRIC PROPERTIES OF CLINICAL PERFORMANCE RATINGS

RECOVERY OF LINGUISTIC DEFICITS IN STROKE PATIENTS; A THREE- YEAR-FOLLOW UP STUDY.

NIH Public Access Author Manuscript Stroke. Author manuscript; available in PMC 2015 January 16.

Comparability Study of Online and Paper and Pencil Tests Using Modified Internally and Externally Matched Criteria

BIOSTATISTICAL METHODS AND RESEARCH DESIGNS. Xihong Lin Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA

APPENDIX 11: CASE IDENTIFICATION STUDY CHARACTERISTICS AND RISK OF BIAS TABLES

Identifying Adult Mental Disorders with Existing Data Sources

Adjusting for mode of administration effect in surveys using mailed questionnaire and telephone interview data

Small-area estimation of mental illness prevalence for schools

Methods for Computing Missing Item Response in Psychometric Scale Construction

Optimistic Versus Pessimistic Life-Orientation Beliefs among University Teachers

Development of a self-reported Chronic Respiratory Questionnaire (CRQ-SR)

Measures. David Black, Ph.D. Pediatric and Developmental. Introduction to the Principles and Practice of Clinical Research

BEST PRACTICES FOR IMPLEMENTATION AND ANALYSIS OF PAIN SCALE PATIENT REPORTED OUTCOMES IN CLINICAL TRIALS

How accurately does the Brief Job Stress Questionnaire identify workers with or without potential psychological distress?

Sex Differences in Depression in Patients with Multiple Sclerosis

DESIGNING MULTI-ETHNIC STROKE STUDIES: THE BRAIN ATTACK SURVEILLANCE IN CORPUS CHRISTI (BASIC) PROJECT

CONTINUOUS AND CATEGORICAL TREND ESTIMATORS: SIMULATION RESULTS AND AN APPLICATION TO RESIDENTIAL RADON

Understandable Statistics

Agreement between Proxy and Patient Reports of HRQoL using the EQ-5D:

Investigating the robustness of the nonparametric Levene test with more than two groups

Maximizing the Accuracy of Multiple Regression Models using UniODA: Regression Away From the Mean

PRINCIPLES OF STATISTICS

Sources of Comparability Between Probability Sample Estimates and Nonprobability Web Sample Estimates

CRITICALLY APPRAISED PAPER (CAP)

SELF-PROXY RESPONSE STATUS AND QUALITY OF CIGARETTE-RELATED INFORMATION

SUMMARY AND DISCUSSION

COMPUTING READER AGREEMENT FOR THE GRE

Pain Assessment in Elderly Patients with Severe Dementia

A 3-Factor Model for the FACIT-Sp

MMI 409 Spring 2009 Final Examination Gordon Bleil. 1. Is there a difference in depression as a function of group and drug?

11/18/2013. Correlational Research. Correlational Designs. Why Use a Correlational Design? CORRELATIONAL RESEARCH STUDIES

NEARLY 12 MILLION COSmetic

STATISTICS & PROBABILITY

Self-rated Mental Health Status (G1) Behavioral Risk Factors Surveillance System (BRFSS).

An Empirical Study on Causal Relationships between Perceived Enjoyment and Perceived Ease of Use

Chapter 2 Norms and Basic Statistics for Testing MULTIPLE CHOICE

Toward a Comprehensive Measure of Hope

Measuring Perceived Social Support in Mexican American Youth: Psychometric Properties of the Multidimensional Scale of Perceived Social Support

Biases in clinical research. Seungho Ryu, MD, PhD Kanguk Samsung Hospital, Sungkyunkwan University

A semantic verbal fluency test for English- and Spanish-speaking older Mexican-Americans

2008 Ohio State University. Campus Climate Study. Prepared by. Student Life Research and Assessment

Examining Relationships Least-squares regression. Sections 2.3

The Impact of Cellphone Sample Representation on Variance Estimates in a Dual-Frame Telephone Survey

S Imputation of Categorical Missing Data: A comparison of Multivariate Normal and. Multinomial Methods. Holmes Finch.

S P O U S A L R ES E M B L A N C E I N PSYCHOPATHOLOGY: A C O M PA R I SO N O F PA R E N T S O F C H I LD R E N W I T H A N D WITHOUT PSYCHOPATHOLOGY

Emotional distress is common after stroke. The prevalence

Aggregation of psychopathology in a clinical sample of children and their parents

Technical Brief for the THOMAS-KILMANN CONFLICT MODE INSTRUMENT

ANOVA in SPSS (Practical)

Still important ideas

2.75: 84% 2.5: 80% 2.25: 78% 2: 74% 1.75: 70% 1.5: 66% 1.25: 64% 1.0: 60% 0.5: 50% 0.25: 25% 0: 0%

Research Article Estimating Measurement Error of the Patient Activation Measure for Respondents with Partially Missing Data

CHAPTER VI RESEARCH METHODOLOGY

Analysis of Confidence Rating Pilot Data: Executive Summary for the UKCAT Board

A Coding System to Measure Elements of Shared Decision Making During Psychiatric Visits

Alternate Site Testing for Hemoglobin A1c in Children with Diabetes. Sarah L. Flores, MS, RN, BC Manager, Specialty Clinics

AnExaminationoftheQualityand UtilityofInterviewerEstimatesof HouseholdCharacteristicsinthe NationalSurveyofFamilyGrowth. BradyWest

Adjusting the Oral Health Related Quality of Life Measure (Using Ohip-14) for Floor and Ceiling Effects

Learning Objectives 9/9/2013. Hypothesis Testing. Conflicts of Interest. Descriptive statistics: Numerical methods Measures of Central Tendency

9/4/2013. Decision Errors. Hypothesis Testing. Conflicts of Interest. Descriptive statistics: Numerical methods Measures of Central Tendency

Critical Review: Group Therapy for Post-Stroke Aphasia Rehabilitation

Dispositional Optimism and Therapeutic Expectations in Early-Phase Oncology Trials

This article is the second in a series in which I

Reliability of Reported Age at Menopause

Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers

Repeatability of a questionnaire to assess respiratory

ExperimentalPhysiology

Gender-Based Differential Item Performance in English Usage Items

LEVEL ONE MODULE EXAM PART TWO [Reliability Coefficients CAPs & CATs Patient Reported Outcomes Assessments Disablement Model]

Calculating CAHPS Hospice Survey Top-Box Scores

Georgina Salas. Topics EDCI Intro to Research Dr. A.J. Herrera

Interaction Effects: Centering, Variance Inflation Factor, and Interpretation Issues

Section on Survey Research Methods JSM 2009

Mantel-Haenszel Procedures for Detecting Differential Item Functioning

Reliability and Validity of the Pediatric Quality of Life Inventory Generic Core Scales, Multidimensional Fatigue Scale, and Cancer Module

Unequal Numbers of Judges per Subject

ASSESSING THE EFFECTS OF MISSING DATA. John D. Hutcheson, Jr. and James E. Prather, Georgia State University

The EuroQol and Medical Outcome Survey 36-item shortform

Table of Contents. Plots. Essential Statistics for Nursing Research 1/12/2017

Small-area estimation of prevalence of serious emotional disturbance (SED) in schools. Alan Zaslavsky Harvard Medical School

RESPONSE SURFACE MODELING AND OPTIMIZATION TO ELUCIDATE THE DIFFERENTIAL EFFECTS OF DEMOGRAPHIC CHARACTERISTICS ON HIV PREVALENCE IN SOUTH AFRICA

Self esteem, optimism and exams anxiety among high school students

Calculating CAHPS Hospice Survey Top-Box Scores

Transcription:

Validity of Proxies and Correction for Proxy Use When Evaluating Social Determinants of Health in Stroke Patients Lesli E. Skolarus, MD; Brisa N. Sánchez, PhD; Lewis B. Morgenstern, MD; Nelda M. Garcia, BS; Melinda A. Smith, DrPH; Devin L. Brown, MD, MS; Lynda D. Lisabeth, PhD Background and Purpose The purpose of this study was to evaluate stroke patient proxy agreement with respect to social determinants of health, including depression, optimism, and spirituality, and to explore approaches to minimize proxy-introduced bias. Methods Stroke patient proxy pairs from the Brain Attack Surveillance in Corpus Christi Project were interviewed (n 34). Evaluation of agreement between patient proxy pairs included calculation of intraclass correlation coefficients, linear regression models (ProxyResponse 0 1 PatientResponse, where 0 0 and 1 1 denotes no bias) and statistics. Bias introduced by proxies was quantified with simulation studies. In the simulated data, we applied 4 approaches to estimate regression coefficients of stroke outcome social determinants of health associations when only proxy data were available for some patients: (1) substituting proxy responses in place of patient responses; (2) including an indicator variable for proxy use; (3) using regression calibration with external validation; and (4) internal validation. Results Agreement was fair for depression (intraclass correlation coefficient, 0.41) and optimism (intraclass correlation coefficient, 0.48) and moderate for spirituality (, 0.48 to 0.53). Responses of proxies were a biased measure of the patients responses for depression, with 0 4.88 (CI, 2.24 to 7.52) and 1 0.39 (CI, 0.09 to 0.69), and for optimism, with 0 3.82 (CI, 1.04 to 8.69) and 1 0.81 (CI, 0.41 to 1.22). calibration with internal validation was the most accurate method to correct for proxy-induced bias. Conclusion Fair/moderate patient proxy agreement was observed for social determinants of health. Stroke researchers who plan to study social determinants of health may consider performing validation studies so corrections for proxy use can be made. (Stroke. 2010;41:510-515.) Key Words: stroke proxy social determinants of health More than 25% of stroke survivors have cognitive or language deficits that prohibit their direct participation in outcome studies. 1 The study of these patients is critical to avoid the selection bias that is introduced if only patients with mild disability are evaluated. Using family members or friends to represent the more severely affected patient is a common strategy to minimize selection bias. 2 However, although the use of proxies may reduce selection bias, disagreement between patient and proxy responses introduces measurement error and may alter study results. Previous work has shown that proxies generally report greater disability than stroke patients in studies surveying global assessment of poststroke function. 3 5 Little is known about agreement between stroke patients and proxies with regard to social determinants of health (SDH). SDH are described by the World Health Organization as the cause of the causes of disease, directing attention to the social factors shaping people s health. 6 SDH, such as spirituality and depression, may be important contributors to outcomes after stroke. 7 14 Decreased disability after stroke was observed in patients who frequently attended religious services. 7 Depression has been inversely linked to stroke outcomes, with several studies reporting that depressed patients have increased disability and mortality after stroke. 8 10,12 However, most studies of the impact of religion and depression on stroke outcomes have included only minimally affected stroke patients, 7 10,12 14 limiting the generalizability of the study results to the broader stroke population. The objectives of this validation study were 2-fold: (1) to quantify the agreement between stroke patient and proxy responses to questions regarding spirituality, depression, and optimism; and (2) to evaluate different methodologic approaches to incorporating proxy data into stroke outcome studies. Methods This was a prespecified subproject of the Brain Attack Surveillance in Corpus Christi (BASIC) project. BASIC is a population-based stroke surveillance study in Nueces County, Texas. Detailed methods of the BASIC project have been described previously. 15,16 Received October 30, 2009; final revision received October 30, 3009; accepted November 19, 2009. From the Stroke Program (L.E.S., L.B.M., N.M.G., M.A.S., D.L.B., L.D.L.), University of Michigan Medical School, Ann Arbor, Mich; and the Departments of Biostatistics (B.N.S.) and Epidemiology (L.B.M., L.D.L.), University of Michigan School of Public Health, Ann Arbor, Mich. Correspondence to Lynda Lisabeth, PhD, 1415 Washington Heights, Room 4641, Ann Arbor, MI 48109. E-mail llisabet@umich.edu 2010 American Heart Association, Inc. Stroke is available at http://stroke.ahajournals.org DOI: 10.1161/STROKEAHA.109.571703 510

Skolarus et al Proxies and Social Determinants of Health 511 A convenience sample of consecutively interviewed ischemic stroke/transient ischemic attack patients were identified from September to November 2007 for this validation study. Eligibility was based on the ability of patients to correctly answer a brief set of questions to evaluate their cognitive and language capabilities. Eligible subjects were then asked to identify a proxy, defined as the person who knew the patient best, to participate in the validation study. Interviews took place at various intervals after the patients acute event based on patient availability and ease. Patients were queried in person (95%) or over the telephone (5%). Patients and proxies were asked identical questions, with proxies instructed to respond to the questions as they believed the patients would respond. For all patient proxy pairs used in final data analysis, proxies were blinded to the patients responses. Proxy demographics and the type and duration of the proxy patient relationship were collected. Written informed consent was obtained from all subjects, and the study was approved by the institutional review board at University of Michigan and the Nueces County hospital systems. SDH Scales The validation study focused on 3 measures of SDH: depression, optimism, and spirituality. Depression was measured with the Patient Health Questionnaire 9 (PHQ-9), which has been validated in stroke patients. 17 For the purposes of this study, the PHQ-9 was scored as a continuous measure based on the total of the 9 individual questions (range, 0 to 27). Optimism was queried via a modified revised Life Orientation Test (LOT-R). 18 Respondents indicated level of agreement with 6 statements on a Likert scale. Items from the optimism scale were presented both positively and negatively. Therefore, for the purposes of the analysis, some items were reverse scored. A composite optimism score was computed as the sum of the 6 individual responses, with a lower score corresponding to increased optimism (range, 6 to 24). Finally, spirituality was assessed using 2 questions ascertaining the importance of religious or spiritual beliefs in the patient s daily life using Strawbridge s religiosity scale. 19 For each question, respondents indicated level of importance on a Likert scale. Statistical Analysis Descriptive statistics were calculated using medians and interquartile ranges (IQRs) for continuous variables. Categorical variables were analyzed using frequencies and percentages. Evaluation of Agreement Between Stroke Patient and Proxy Responses for SDH Agreement between stroke patient and proxy responses for the ordinal spirituality questions was assessed by calculating percentage agreement and weighted statistics. 20 is a measure of overall agreement, ranging from 0 to 1, with higher values representing more agreement with values 0.5 representing fair to moderate agreement. 21,22 Agreement between stroke patient and proxy responses for the continuous optimism and depression scores was assessed using intraclass correlation coefficients and linear regression models of the form: ProxyResponse 0 1 PatientResponse. The intraclass correlation coefficient is a measure of overall agreement, ranging from 0 to 1, with higher values representing more agreement. The coefficients from the regression models describe the direction and extent of bias introduced by use of proxies (ie, when 0 0 and 1 1, the response of the proxy is unbiased). Evaluation of Bias Introduced by Different Approaches to Incorporating Proxy Data We conducted simulation studies to assess the extent of bias in future regression-based analysis of SDH stroke outcome studies, which include proxy responses for SDH measures. The simulation study consisted of simulating data sets of 500 stroke patients according to assumed true models for the SDH stroke outcome and patient proxy associations, and subsequently estimating regression coefficients for the SDH stroke outcome associations from the simulated data using the approaches described below. Table 1. Values for Factors in the Simulation Study Simulation Factor Values 0 1, 2 1 0.10, 0.25, 1, 2 (range of weak to strong effects) SD of the error term, 2, 4, and 6 0 Point estimate in validation study, and lower and upper confidence limits 1 Point estimate in validation study, and lower and upper confidence limits Standard deviation of the proxy s response error, Percentage of patients with proxies Total simulation scenarios per SDH scale Observed SD in the validation study, as well as half and twice the observed SD 5%, 10%, 25%, and 50% of the sample 2592 For each data set, we first simulated patient responses to optimism and depression scales by resampling with replacement from the distributions observed in the validation study. We then simulated stroke outcomes for each patient assuming that the association between the outcome and optimism or depression followed a linear relationship (eg, Outcome 0 1 Optimism ). We then randomly selected a subgroup of patients as having only proxy responses. To generate proxy responses, we assumed a linear relationship: ProxyResponse 0 1 PatientResponse with 0 and 1 equal to those estimated with our validation study data, for both SDH scales. The simulation study followed a factorial design with 7 factors, with values shown in Table 1. We estimated the regression coefficients 0 and 1 for the SDH stroke outcome model, applying 4 approaches to the simulated data. First, we substituted the proxy responses in place of the patients responses and fit the regression model. The second approach was similar to the first, except we added an indicator variable for proxy use to the regression model (ie, adjustment for proxy use). Third, we used regression calibration, 23 which consisted of substituting a corrected proxy response in place of the patient s response (for patients with proxies). The corrected response was calculated as: CorrectedProxy ˆ0 ˆ1ProxyResponse, where ˆ0 and ˆ1 were obtained from fitting a regression model PatientResponse 0 1 ProxyResponse to the validation data. This approach is termed regression calibration with external validation because ˆ0 and ˆ1 were obtained from our validation study and did not use any data from the simulated patient population. The fourth approach was similar to the third except that for each simulated data set, we simulated an additional 34 patients with proxies to serve as a validation sample and estimated ˆ0 and ˆ1 from that sample. This fourth approach is labeled regression calibration with internal validation. The 34 simulated internal validation patients were not used to estimate the outcome model so that the total number of patients remained at 500 across all simulated samples and so that the percentage of patients with proxies remained at 5%, 10%, 25%, and 50%. For each combination of assumed SDH stroke outcome and patient proxy associations, 1000 data sets were simulated. For each data set, the percentage bias in the estimated 1, ˆ1 was calculated as percentage bias 100 ( ˆ1 1 )/ 1. The percentage bias was averaged across the 1000 simulated data sets for the given scenario, and the corresponding 2.5th and 97.5th percentiles across the 1000 estimates of percent bias were calculated and were compared with 0. As a way of summarizing the results of the 2592 simulation scenarios for each SDH measure, the relative importance of each simulation factor was assessed by conducting an ANOVA. For the ANOVA, the average percentage of bias estimated in each simulation scenario served as the outcome, and the simulation factors (values of various parameters) were explanatory variables.

512 Stroke March 2010 Figure 1. Percentage bias in the estimated association between optimism (top) and depression (bottom) measures and stroke outcome when the true association is 1 1 and 4 (ie, R 2 0.36 for optimism and R 2 0.68 depression) and varying the percentage of patients with proxy. Reg indicates regression; int, interior; ext, exterior; Inds, indicator variable for proxy use. Results A total of 44 ischemic stroke/transient ischemic attack patients were eligible for this validation study. Patients were excluded if they refused to be interviewed separate from their proxy (n 6), were unable to name a proxy (n 3), or a proxy could not be located (n 1). The validation study was completed with the remaining 34 patient proxy pairs. Women comprised 59% of patients but 74% of proxies. Patients were 47% Mexican American whereas proxies were 44% Mexican American. Median age of patients was 63 years (IQR, 55 to 77) compared with 52 years (IQR, 40 to 63) for proxies. Proxies were most commonly the patients spouses (38%) or children (47%) and had long-term relationships with patients (median, 37 years; IQR, 25 to 55). Median score on the PHQ-9 depression scale was 6 (IQR, 2 to 9), whereas median proxy score was 5 (IQR, 3 to 13). Fair agreement, intraclass correlation coefficient of 0.41, was found between patient and proxy responses. However, the proxies response was a biased measure of the patients response, with 0 4.88 (CI, 2.24 to 7.52) and 1 0.39 (CI, 0.09 to 0.69) and residual SD of 5.22; ie, for patients with lower levels of depression, proxies overestimated the depression score by 5 points, but this gap narrowed as the patients depression score increased. Median patient score on the optimism LOT-R scale was 11 (IQR, 9 to 14), whereas median proxy optimism score was 12 (IQR, 11 to 15). Fair agreement was also found between patient and proxy responses for optimism, with an intraclass correlation coefficient of 0.48. Again, the proxies response leaned toward being a biased measure of the patients response, with 0 3.82 (CI, 1.04 to 8.69) and 1 0.81 (CI, 0.41 to 1.22) and residual SD of 2.57. Note that the CIs include 0 and 1, such that the statistical evidence of a biased proxy response is lower compared with the PHQ-9 scale. Furthermore, the residual SD is smaller, showing that although the proxies are somewhat biased in reporting optimism, their responses are generally more reliable than for depression. Agreement for the spirituality questions was moderate, with values of 0.55 and 0.46 and percentage agreement ranging from 74% to 79%. Notably, the majority of patients felt that spirituality was at least fairly important to what they do every day (88%). Figure 1 shows the average percentage of bias and the percentiles in the estimated associations between a potential stroke outcome and the SDH measures obtained with various approaches to incorporating proxy data. Substituting the crude proxy responses resulted in biased regression coefficients, regardless of the use of a proxy indicator. Further, as the percentage of patients with proxy responses was increased, the bias increased. In contrast, the bias in the regression coefficients obtained with regression calibration with internal or external validation was nearly 0, although the precision of the estimates decreased as the percentage of patients with proxies increased. The estimated values of 0, 1 used to correct proxy responses in the regression calibration with internal validation varied across data sets; those used for regression calibration with external validation were ˆ0 3.20 and ˆ1 0.44 for depression, and ˆ0 6.17 and ˆ1 0.41 for optimism. Figure 2 shows the impact of the variability in the proxy s error, SD( ), on the percentage bias in the estimated SDH stroke outcome association. When the variability of this error increases, the magnitude of the bias increases. Figure 3 shows the impact of the true association, 1, when all other simulation factors remain fixed. As 1 increases while the residual error SD,, remains fixed, the R 2 of the outcome model increases. The change in the R 2 ultimately impacts the precision of the estimated percentage bias. That is, when the R 2 is high (Figure 3, bottom), the bias in the estimated regression coefficient was negative for nearly every data set analyzed. However, when the R 2 is low (Figure 3, top), the

Skolarus et al Proxies and Social Determinants of Health 513 Figure 2. Percentage bias in the estimated association between depression (PHQ-9) and stroke outcome, when the true association is 1 1 and 4 (ie, R 2 0.68) and varying the percentage of patients with proxy. Top, Percentage bias when the SD of the proxy s error is SD( ) 5.2 (observed in the validation study); middle, when the SD is half of what was observed; and bottom, when it is twice the observed. Reg indicates regression; int, interior; ext, exterior; Inds, indictor variable for proxy use. degree of bias for a given data set is less certain and may be either positive or negative (although on average, it will be negative). The results were similar for optimism (data not shown). Table 2 shows the relative importance of the simulation factors on the percentage of bias estimated in the simulation study. As can be observed, the percentage of patients with proxies and the SD of the proxy s error are the most relevant factors impacting the bias in the estimated coefficients (largest mean squared error; P values 0.001 for all methods). The bias coefficients 0 and 1 explain relatively less variability in the percentage bias. Discussion This validation study found fair stroke patient proxy agreement for the measurement of depression and optimism. Patient proxy agreement was higher for spirituality but was nevertheless only modest. These results are consistent with previous studies that have shown greater stroke patient proxy agreement in objective domains such as physical abilities 3 5 but less agreement with more subjective domains such as energy, 3 emotion, 5 and mood. 4 In this study, proxies commonly scored the patients as more depressed and less optimistic than the patients scored themselves. Other studies have also shown that proxies overestimate subjective end points such as poststroke quality of life and depression. 3,4,24 Given the potential importance of SDH to stroke outcomes, understanding methods to minimize bias in the use of proxy data is critical. We performed a simulation study to explore 4 different methods to correct for the proxy-induced bias. Our results suggest that directly substituting proxy data for the patient or adjusting for proxy use in models results in bias, and the degree of this bias increases with increasing frequency of proxy use and with decreasing reliability in the proxies responses. On the other hand, regression calibration with internal validation, where proxy data are corrected based on known (or estimated) information about the patient proxy association, results in nearly unbiased estimates. Applying estimates of the patient proxy association estimated with external validation studies reduced the bias compared with substituting proxy responses but did not always eliminate the bias (Figure 2). Extrapolating results from external validation studies warrants caution. 23 As an alternate to regression calibration methods, additional approaches that may improve the use of proxy information include developing more objective scales to access the patients SDH designed specifically for proxies. Studies have also proposed alternative statistical methods to correct for proxy-induced bias such as propensity score adjustment 25 and psychometric profile analysis. 26 There are several limitations to this study. Because of the limited number of patient proxy pairs, we were unable to compare agreement and bias in subgroups defined by gender, race/ethnicity, or relationship of the proxy to the patient. Further, the study population was limited to patients able to self-report their responses, and therefore the population

514 Stroke March 2010 Figure 3. Percentage bias in the estimated association between depression (PHQ-9) and stroke outcome when the true association is 1 0.1 (top), 1 0.25 (middle), or 1 2 (bottom), but remains constant at 4 (ie, outcome model R 2 0.02, 0.11, and 0.89, respectively), the SD of the proxy s error is SD( ) 5.2 (observed in the validation study), and varying the percentage of patients with proxy. Reg indicates regression; int, interior; ext, exterior; Inds, indicator variable for proxy use. studied is different from the population that actually requires proxies. Finally, also because of a small sample size, we were unable to assess whether there is lack of linearity in the association between proxy and patient responses. Linearity assumptions were used in the regression calibration methods, although nonlinear models can also be used to correct the proxy responses. Proxies should continue to be included in stroke outcome studies to avoid selection bias. However, caution is recommended when collecting SDH data from proxies. In particular, appropriate methods should be used to incorporate proxy responses when there are larger percentages of patients with proxies and when the reliability of proxies is low. Researchers who plan to study SDH in stroke patients or other critically ill populations may consider performing validation work to identify and quantify the measurement error introduced by use of proxies in their population so corrections can be made. Additional research is needed to Table 2. ANOVA Results Using Percentage Bias in the Estimated Outcome SDH Association as the Dependent Variable and Simulation Factors as Explanatory Variables PHQ-9 Depression (Internal) (External) LOT-R Optimism (Internal) (External) Factor df Proxy Proxy Indicator Proxy Proxy Indicator 0 1 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 1 3 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 2 0.000 0.000 0.000 0.000 0.000 0.000 0.001 0.000 0 2 0.292* 0.000 0.000 0.015 1.450* 0.000 0.000 0.312* 1 2 0.400* 0.593* 0.020* 0.700* 1.457* 0.107* 0.006* 0.521* SD ( ) 2 27.39* 31.22* 0.010* 5.583* 8.229* 18.91* 0.008* 3.30* % Proxy 3 12.79* 10.80* 0.081* 0.926* 10.23* 7.49* 0.022* 2.06* Residual 0.007 0.008 0.000 0.004 0.017 0.006 0.000 0.010 Degrees of freedom (df) and mean squared error associated with each simulation factor for each method of incorporating proxies and for each scale. Larger mean squared error implies that the factor explains a larger amount of variance in the percentage bias. *P 0.001; P 0.05; residual df 2576.

Skolarus et al Proxies and Social Determinants of Health 515 understand the bias introduced when other measures of SDH are under study. Sources of Funding This study was funded by the National Institutes of Health (National Institute on Neurological Disorders and Stroke; R01 NS38916). L.E.S. is funded by the American Academy of Neurology Clinical Research Training Fellowship. D.L.B. is funded by National Institute for Neurologic Disorders and Stroke (K23 NS051202). L.D.L. is funded by National Institute for Neurologic Disorders and Stroke (K23 NS050161). None. Disclosures References 1. Pedersen PM, Jorgensen HS, Nakayama H, Raaschou HO, Olsen TS. Aphasia in acute stroke: incidence, determinants, and recovery. Ann Neurol. 1995;38:659 666. 2. Townend E, Brady M, McLaughlan K. A systematic evaluation of the adaptation of depression diagnostic methods for stroke survivors who have aphasia. Stroke. 2007;38:3076 3083. 3. Hilari K, Owen S, Farrelly SJ. Proxy and self-report agreement on the stroke and aphasia quality of life scale-39. J Neurol Neurosurg Psychiatry. 2007;78:1072 1075. 4. Williams LS, Bakas T, Brizendine E, Plue L, Tu W, Hendrie H, Kroenke K. How valid are family proxy assessments of stroke patients healthrelated quality of life? Stroke. 2006;37:2081 2085. 5. Duncan PW, Lai SM, Tyler D, Perera S, Reker DM, Studenski S. Evaluation of proxy responses to the stroke impact scale. Stroke. 2002; 33:2593 2599. 6. Graham H. Social determinants and their unequal distribution: clarifying policy understandings. Milbank Q. 2004;82:101 124. 7. Berges IM, Kuo YF, Markides KS, Ottenbacher K. Attendance at religious services and physical functioning after stroke among older Mexican Americans. Exp Aging Res. 2007;33:1 11. 8. Black SA, Markides KS. Depressive symptoms and mortality in older Mexican Americans. Ann Epidemiol. 1999;9:45 52. 9. Herrmann N, Black SE, Lawrence J, Szekely C, Szalai JP. The Sunnybrook Stroke Study: a prospective study of depressive symptoms and functional outcome. Stroke. 1998;29:618 624. 10. House A, Knapp P, Bamford J, Vail A. Mortality at 12 and 24 months after stroke may be associated with depressive symptoms at 1 month. Stroke. 2001;32:696 701. 11. Williams LS, Ghose SS, Swindle RW. Depression and other mental health diagnoses increase mortality risk after ischemic stroke. Am J Psychiatry. 2004;161:1090 1095. 12. Everson SA, Roberts RE, Goldberg DE, Kaplan GA. Depressive symptoms and increased risk of stroke mortality over a 29-year period. Arch Intern Med. 1998;158:1133 1138. 13. Giaquinto S, Spiridigliozzi C, Caracciolo B. Can faith protect from emotional distress after stroke? Stroke. 2007;38:993 997. 14. Johnstone B, Franklin KL, Yoon DP, Burris J, Shigaki C. Relationships among religiousness, spirituality, and health for individuals with stroke. J Clin Psychol Med Settings. 2008;15:308 313. 15. Smith MA, Risser JM, Moye LA, Garcia N, Akiwumi O, Uchino K, Morgenstern LB. Designing multi-ethnic stroke studies: the Brain Attack Surveillance in Corpus Christi (BASIC) project. Ethn Dis. 2004;14:520 526. 16. Morgenstern LB, Smith MA, Lisabeth LD, Risser JM, Uchino K, Garcia N, Longwell PJ, McFarling DA, Akuwumi O, Al-Wabil A, Al-Senani F, Brown DL, Moye LA. Excess stroke in Mexican Americans compared with non-hispanic whites: the Brain Attack Surveillance in Corpus Christi project. Am J Epidemiol. 2004;160:376 383. 17. Williams LS, Brizendine EJ, Plue L, Bakas T, Tu W, Hendrie H, Kroenke K. Performance of the PHQ-9 as a screening tool for depression after stroke. Stroke. 2005;36:635 638. 18. Scheier MF, Carver CS, Bridges MW. Distinguishing optimism from neuroticism (and trait anxiety, self-mastery, and self-esteem): a reevaluation of the life orientation test. J Pers Soc Psychol. 1994;67:1063 1078. 19. Strawbridge WJ, Shema SJ, Cohen RD, Roberts RE, Kaplan GA. Religiosity buffers effects of some stressors on depression but exacerbates others. J Gerontol B Psychol Sci Soc Sci. 1998;53:S118 S126. 20. Jakobsson U, Westergren A. Statistical methods for assessing agreement for ordinal data. Scand J Caring Sci. 2005;19:427 431. 21. Altman D. Practical statistics for medical research. London: Chapman and Hall; 1991. 22. Fleiss J. Statistical Methods for Rates and Proportions. New York, NY: John Wiley & Sons; 1981. 23. Carroll RJ, Ruppert D, Stefanski LA. Measurement Error in Non-Linear Models. New York, NY: Wiley; 1995. 24. Berg A, Lonnqvist J, Palomaki H, Kaste M. Assessment of depression after stroke: a comparison of different screening instruments. Stroke. 2009;40:523 529. 25. Ellis BH, Bannister WM, Cox JK, Fowler BM, Shannon ED, Drachman D, Adams RW, Giordano LA. Utilization of the propensity score method: an exploratory comparison of proxy-completed to self-completed responses in the Medicare Health Outcomes Survey. Health Qual Life Outcomes. 2003;1:47. 26. Weinfurt KP, Trucco SM, Willke RJ, Schulman KA. Measuring agreement between patient and proxy responses to multidimensional health-related quality-of-life measures in clinical trials. An application of psychometric profile analysis. J Clin Epidemiol. 2002;55:608 618.