Mapping the EORTC QLQ C-30 onto the EQ-5D Instrument: The Potential to Estimate QALYs without Generic Preference Data

Similar documents
Mapping EORTC QLQ-C30 onto EQ-5D for the assessment of cancer patients

To what extent do people prefer health states with higher values? A note on evidence from the EQ-5D valuation set

Japan Journal of Medicine

Mapping QLQ-C30, HAQ, and MSIS-29 on EQ-5D

Session 6: Choosing and using HRQoL measures vs Multi-Attribute Utility Instruments QLU-C10D and EQ-5D as examples

Multidimensional fatigue and its correlates in hospitalized advanced cancer patients

Health Technology Assessment (HTA)

Impact of Chronic Liver Disease and Cirrhosis on Health Utilities Using SF-6D and the Health Utility Index

Women s Health Development Unit, School of Medical Science, Health Campus, Universiti Sains Malaysia b

NICE DSU TECHNICAL SUPPORT DOCUMENT 8: AN INTRODUCTION TO THE MEASUREMENT AND VALUATION OF HEALTH FOR NICE SUBMISSIONS

EORTC QLQ-C30 descriptive analysis with the qlqc30 command

Mapping the Positive and Negative Syndrome Scale scores to EQ-5D- 5L and SF-6D utility scores in patients with schizophrenia

Change in health-related quality of life over 1 month in cancer patients with high initial levels of symptoms and problems

Introducing the QLU-C10D A preference-based utility measure derived from the QLQ-C30

A comparison of direct and indirect methods for the estimation of health utilities from clinical outcomes

Time to tweak the TTO: results from a comparison of alternative specifications of the TTO

This is a repository copy of Deriving a preference-based measure for cancer using the EORTC QLQ-C30.

Mapping health outcome measures from a stroke registry to EQ-5D weights

QUALITY OF LIFE IN CANCER CLINICAL TRIALS A Practical Guide For. Research Staff

LIHS Mini Master Class

Condition-Specific Preference-Based Measures: Benefit or Burden?

Using HAQ-DI to estimate HUI-3 and EQ-5D utility values for patients with rheumatoid arthritis in Spain

The response to the Committee request for additional analyses in section 1.4, MSD follows:

Choice of EQ 5D 3L for Economic Evaluation of Community Paramedicine Programs Project

EuroQol Working Paper Series

Learning Effects in Time Trade-Off Based Valuation of EQ-5D Health States

Assessment of Health State in Patients With Tinnitus: A Comparison of the EQ-5D and HUI Mark III

How is the most severe health state being valued by the general population?

Economic evaluation of end stage renal disease treatment Ardine de Wit G, Ramsteijn P G, de Charro F T

Matt D. Stevenson, PhD, 1 Alison Scope, PhD, 1 Paul A. Sutcliffe, DPhil 2. Introduction. Methods

Quality of Life. The assessment, analysis and reporting of patient-reported outcomes. Third Edition

Tails from the Peak District: Adjusted Limited Dependent Variable Mixture Models of EQ-5D Questionnaire Health State Utility Values

Sanjeewa Kularatna 1,2*, Jennifer A. Whitty 1,2,3, Newell W. Johnson 2, Ruwan Jayasinghe 4 and Paul A. Scuffham 1,2

VALUE IN HEALTH 19 (2016) Available online at journal homepage:

Comparing the UK EQ-5D-3L and English EQ-5D-5L value sets

Time Trade-Off and Ranking Exercises Are Sensitive to Different Dimensions of EQ-5D Health States

Table of Contents. Preface to the third edition xiii. Preface to the second edition xv. Preface to the fi rst edition xvii. List of abbreviations xix

A panel data comparison of two commonly-used health-related quality of life instruments

Clustering Rasch Results: A Novel Method for Developing Rheumatoid Arthritis States for Use in Valuation Studiesvhe_

UCLA UCLA Previously Published Works

This is a repository copy of The estimation of a preference-based measure of health from the SF-36.

The EuroQol and Medical Outcome Survey 36-item shortform

Basic Economic Analysis. David Epstein, Centre for Health Economics, York

White Rose Research Online URL for this paper:

Patient Reported Outcomes: are they appropriate for clinical practice? Jose M Valderas Clinical Lecturer

HEDS Discussion Paper 05/05

Guideline scope Persistent pain: assessment and management

Valuation of the SF-6D Health States Is Feasible, Acceptable, Reliable, and Valid in a Chinese Population

Setting The setting was an outpatients department. The economic study was carried out in the UK.

Making Value Based Pricing A Reality: Issue Panel

Validity of the EuroQoL (EQ-5D) Instrument in a Greek General Population

University of Bristol - Explore Bristol Research. Publisher's PDF, also known as Version of record

Work in progress: please do not quote without authors permission

XX Encuentro de Economía Pública Sevilla, 31 de enero y 1 de febrero de 2013

An Introduction to Costeffectiveness

Interim Scoring for the EQ-5D-5L: Mapping the EQ-5D-5L to EQ-5D-3L Value Sets

Since 1980, obesity has more than doubled worldwide, and in 2008 over 1.5 billion adults aged 20 years were overweight.

Background. 2 5/30/2017 Company Confidential 2015 Eli Lilly and Company

This is a repository copy of A comparison of the EQ-5D and the SF-6D across seven patient groups.

Reliability and Validity of the Taiwan Chinese Version of the EORTC QLQ-PR25 in Assessing Quality of Life of Prostate Cancer Patients

An empirical evaluation of the SF-12, SF-6D, EQ-5D and Michigan Hand Outcome Questionnaire in patients with rheumatoid arthritis of the hand

Analysis of EQ-5D scores from two phase 3 clinical trials of romiplostim in the treatment of immune thrombocytopenia (ITP)

CHAPTER 3 RESEARCH METHODOLOGY

To what extent can we explain time trade-off values from other information about respondents?

Original Article on Palliative Radiotherapy

Reliability and validity of the Cancer Therapy Satisfaction Questionnaire in lung cancer

INTERMEZZO ONCOQUEST: A TOUCH-SCREEN COMPUTER ASSISTED SYSTEM TO MONITOR HEALTH RELATED QUALITY OF LIFE VIA PATIENT REPORTED OUTCOME MEASURES

HEDS Discussion Paper 08/02

A Brief Information Sheet On Opioid Effects Improves Quality Of Life In Cancer Patients On Opioids

Health Services Research and Health Economics. Paul McCrone Institute of Psychiatry, King s College London

Analysis of relationship between emotional intelligence and quality of life in oncology patients

Carol M. Mangione, MD NEI VFQ-25 Scoring Algorithm August 2000

The European Organization for Research and Treatment of Cancer Quality of Life Questionnaire (EORTC QLQ-C30): Validation study of the Thai version

Formulário de acesso a dados do Registo Nacional de Doentes Reumáticos (Reuma.pt) da SPR,

Setting The setting was primary care. The economic study was carried out in Brazil, France, Germany and Italy.

Estimating EQ-5D values from the Neck Disability Index and numeric rating scales for neck and arm pain

Using EQ-5D to Derive General Population-based Utilities for the Quality of Life Assessment of Growth Hormone Deficiency in Adults (QoL-AGHDA)

Patient Reported Outcomes

Using Discrete Choice Experiments with duration to model EQ-5D-5L health state preferences: Testing experimental design strategies

Prognostic value of changes in quality of life scores in prostate cancer

Valuing health using visual analogue scales and rank data: does the visual analogue scale contain cardinal information?

Interchangeability of the EQ-5D and the SF-6D in Long-Lasting Low Back Pain

Are preferences over health states informed?

Mapping scores from the Strengths and Difficulties Questionnaire (SDQ) to preferencebased

P E R S P E C T I V E S

Fanxing Hong 1*, Jaclyn L F Bosco 2, Nigel Bush 3 and Donna L Berry 2

Economics evaluation of three two-drug chemotherapy regimens in advanced non-small-cell lung cancer Neymark N, Lianes P, Smit E F, van Meerbeeck J P

This is a repository copy of Estimating an EQ-5D population value set: the case of Japan.

Comparing Generic and Condition-Specific Preference-Based Measures in Epilepsy: EQ-5D-3L and NEWQOL-6D

A non-linear beta-binomial regression model for mapping EORTC QLQ- C30 to the EQ-5D-3L in lung cancer patients: a comparison with existing approaches

The Impact of Statistically Adjusting for Rater Effects on Conditional Standard Errors for Performance Ratings

Citation for published version (APA): Ebbes, P. (2004). Latent instrumental variables: a new approach to solve for endogeneity s.n.

Quality of Life assessment: A perspective from The EORTC

Technology appraisal guidance Published: 27 January 2016 nice.org.uk/guidance/ta380

The usefulness of EuroQol and McGill Quality of Life questionnaires in palliative care in-patients

Is EQ-5D-5L Better Than EQ-5D-3L? A Head-to-Head Comparison of Descriptive Systems and Value Sets from Seven Countries

Comparison of Value Set Based on DCE and/or TTO Data: Scoring for EQ-5D-5L Health States in Japan

Setting The setting was secondary care. The study was carried out in the UK, with emphasis on Scottish data.

Performance of Median and Least Squares Regression for Slightly Skewed Data

White Rose Research Online URL for this paper:

Transcription:

Volume 12 Number 1 2009 VALUE IN HEALTH Mapping the EORTC QLQ C-30 onto the EQ-5D Instrument: The Potential to Estimate QALYs without Generic Preference Data Lynda McKenzie, MSc, Marjon van der Pol, PhD Health Economics Research Unit, University of Aberdeen, Scotland, UK ABSTRACT Objectives: The aim of this article is to map the European Organization for Research and Treatment of Cancer (EORTC) QLQ C-30 onto the EQ-5D measure to enable the estimation of health state values based on the EORTC QLQ C-30 data. The EORTC QLQ C-30 is of interest because it is the most commonly used instrument to measure the quality of life of cancer patients. Methods: Regression analysis is used to establish the relationship between the two instruments. The performance of the model is assessed in terms of how well the responses to the EORTC QLQ C-30 predict the EQ-5D responses for a separate data set. Results: The results showed that the model explaining EQ-5D values predicted well. All of the actual values were within the 95% confidence intervals of the predicted values. More importantly, predicted difference in quality-adjusted life-years (QALYs) between the arms of the trial was almost identical to the actual difference. Conclusion: There is potential to estimate EQ-5D values using responses to the disease-specific EORTC QLQ C-30 measure of quality of life. Such potential implies that in studies that do not include disease-specific measures, it might still be possible to estimate QALYs. Keywords: quality adjusted life-years, quality of life, regression modeling, utility assessment. Introduction The most popular form of economic evaluation, and the one recommended by the National Institute of Health and Clinical Excellence (NICE), is the cost-utility analysis [1]. In this form of analysis, the outcomes are measured in terms of quality-adjusted life-years (QALYs), which combines life-years and quality of life into one index measure. Quality of life is measured on a scale ranging from 0 (death) to 1 (full health). Specific health states are located on this scale based on individuals preferences elicited using methods such as the visual analog scale (VAS), time tradeoff, and the standard gamble method. The majority view is that the preferences of the adult population (the taxpayers) are most appropriate given that they provide the funding for the National Health Service (NHS) [2]. This means that population values need to be obtained for all of the relevant health states in the economic evaluation. Eliciting population s preferences for every single economic evaluation would clearly be prohibitively expensive and too time consuming. Fortunately, population values are readily available for several generic quality of life instruments including the EQ-5D [3] and the SF-6D [4]. To estimate QALYs only the responses to say the EQ-5D instrument need to be elicited which can then be transformed into quality of life values by using the population tariff. Because these instruments use broad health domains, they can be applied to a wide range of conditions. When the interest is in measuring the quality of life of patients, disease-specific instruments are often preferred to generic instruments [5]. Disease-specific instruments focus on particular health problems and tend to be more sensitive to clinically important differences [5]. Using the generic instruments preferred by Address correspondence to: Marjon van der Pol, Health Economics Research Unit, University of Aberdeen, Foresterhill, Aberdeen AB25 2ZD, Scotland, UK. E-mail: m.vanderpol@abdn.ac.uk 10.1111/j.1524-4733.2008.00405.x economists, in addition to the disease-specific instruments, adds to the burden imposed on patients for completing questionnaires. Population values, however, are generally not available for diseasespecific instruments. When using a disease-specific instrument only, QALYs can not be estimated and a cost-utility analysis cannot be performed. One solution is to map disease-specific measures onto generic measures. The relationship between the two instruments can be established using regression analysis on existing data. Mapping would provide a model that would enable the estimation of QALYs in trials which did not include any generic instruments. Mapping between instruments has been attempted by previous studies with mixed results. Brazier [6] used regression analysis to attempt to map the disease-specific instrument, Impact of Weight on Quality of Life-Lite to the SF-6D. Although the models tended to perform well in terms of explanatory power, they were weak in terms of predictive ability. Tsuchiya et al. [7] explored possible ways of converting disease-specific measure (Asthma Quality of Life Questionnaire) to preference-based measure (EQ-5D) using a range of ordinary least squares (OLS) regression models. Although fairly stable functional relationships were achieved, their results showed limited ability to predict accurate EQ-5D values at an individual level. Brennan and Spencer [8] mapped the Oral Health Impact Profile onto the EQ-5D. The predicted values for the hold-out sample were substantially higher than the actual EQ-5D values. The aim of this article is to map the European Organization for Research and Treatment of Cancer (EORTC) QLQ C-30 onto the EQ-5D measure to enable the estimation of health state values based on the EORTC QLQ C-30 data. The EORTC QLQ C-30 is of interest because it is the most commonly used instrument to measure the quality of life of cancer patients [9]. It has been translated into more than 65 languages and is used widely internationally [10]. Given its extensive use, successful mapping of this measure onto a generic measure could provide a substantial 2008, International Society for Pharmacoeconomics and Outcomes Research (ISPOR) 1098-3015/09/167 167 171 167

168 McKenzie and van der Pol benefit. It is also hypothesized that mapping is likely to be more successful because the instrument includes more general domains in addition to cancer-specific symptoms. Some of these more general domains are similar to the dimensions of the EQ-5D. Regression analysis is used to establish the relationship between the two instruments, and the performance of the model is assessed in terms of how well the responses to the EORTC QLQ C-30 predict the EQ-5D responses for a separate data set. Methods Instruments The EQ-5D is the generic instrument recommended for use in economic evaluations by NICE [1]. It classifies patients into one of 243 health states (five dimensions, each with three levels) [11]. The five dimensions are: mobility; self-care; usual activities; pain/ discomfort; and anxiety/depression. The levels of each dimension are roughly: none, moderate, and severe. The EQ-5D is of demonstrated validity and reliability (for an overview of empirical evidence, see Brooks [12] and Brazier et al. [13]). Population values are available for the UK [3], as well as several other countries. The EORTC QLQ C-30 is a popular instrument for measuring general cancer quality of life. It covers several health domains, as well as cancer-specific symptoms of disease, the side effects of treatment, psychological distress, physical functioning, social interaction, global health, and quality of life. Most of the questions have a hierarchical response (Not at all; A little; Quite a bit; and Very much), with two questions relying on the use of a VAS. The raw questionnaire responses are transformed to produce scores on a set of five function scales (physical, role, emotional, cognitive, and social functioning) and nine symptom scales along with a scale representing global quality of life. The EORTC QLQ C-30 scale has undergone extensive psychometric testing based on classical test theory, which has yielded favorable results [9]. Data The EORTC QLQ C-30 instrument is mapped onto the EQ-5D instrument using data from a randomized controlled trial of the cost-effectiveness of palliative therapies for patients with inoperable esophageal cancer [14]. Data are available for 199 patients (29.8% female, 70.2% male), with an average age of 74.8 years. Data were collected at a range of between 1- and 3-weekly intervals from baseline to 90 weeks. The advantage of using this data set is that there are a sufficient number of patients for each of the levels of EQ-5D dimensions. The number of individuals reporting severe problems is often very small, which reduces the scope for consistent modeling of the data. Table 1 shows the distributions of the responses to each of the five EQ-5D dimensions, as well as the mean and range of the EORTC QLQ C-30 scales. Analysis Two approaches are taken in terms of modeling the EQ-5D data: 1) the EQ-5D values are modeled as a function of the EORTC QLQ C-30 data; and 2) the five EQ-5D dimensions are modeled as a function of the EORTC QLQ C-30 data. These two approaches require different regression techniques. The EQ-5D values are continuous and OLS regression is used. The second approach models the dimensions of the EQ-5D. The dimensions in the EQ-5D have three levels. Ordered probit regression is used to allow for the ordinal nature of the dependent variable. Table 1 Descriptive statistics N % EQ-5D Mobility No problems 319 36.4 Moderate problems 507 57.8 Severe problems 51 5.8 Self-care No problems 622 70.9 Moderate problems 197 22.5 Severe problems 58 6.6 Usual activities No problems 251 28.6 Moderate problems 386 44.0 Severe problems 240 27.4 Pain No problems 270 30.8 Moderate problems 543 61.9 Severe problems 64 7.3 Anxiety/depression No problems 376 42.9 Moderate problems 443 50.5 Severe problems 58 6.6 Mean Range EQ-5D Value 0.539-0.594 1.0 EORTC QLQ C-30 Global health status 45.34 0 100 Physical functioning 55.17 0 100 Role functioning 46.50 0 100 Emotional functioning 72.82 0 100 Cognitive functioning 77.08 0 100 Social functioning 59.14 0 100 EORTC, European Organization for Research and Treatment of Cancer. The EQ-5D values and the EQ-5D dimensions are modeled as a function of the EORTC QLQ C-30 global health status scale, the five function scales, and the nine symptom scales. It is hypothesized that global health status and the function scales are highly correlated with the EQ-5D values. The symptoms scales, with the exception of pain, are less likely to be correlated given that they are more cancer-specific. The five function scales are estimated assuming equal weighting of the different items which may not reflect patients preferences. The analysis is therefore repeated using the raw responses to each of the EORTC QLQ C-30 item measures. For the main analyses, data from all time points are used as this increases the sample size and therefore the statistical precision of the estimates. The clustering option is used in STATA (State Corp, College Station, TX) which allows for the fact that there are several observations per individual which may not be independent of each other. It could be hypothesized that the relationship between the two instruments becomes more robust over time as individuals become more familiar with the instruments. To explore this hypothesis, the regression models are reestimated to include interaction terms between the independent variables and time point at which data are collected. Performance of the Models Because the purpose of the model is to predict EQ-5D values based on EORTC QLQ C-30 data, the predictive ability of the models is of foremost importance and needs to be tested carefully. Two approaches are taken: 1) assessing the goodness of fit of the regression models; and 2) prediction of EQ-5D values for a separate data set. To assess goodness of fit, the standard measure of adjusted R 2 is used for the OLS regression. A pseudo R 2 is required for the ordered probit regressions. The McKelvey Zavoina pseudo R 2 is used which is an estimate of the proportion

Mapping the EORTC QLQ C-30 onto the EQ-5D 169 of variance explained by the models. This measure most closely approximates the R 2 in OLS regression [15]. It is also informative to examine the number of correct predictions for each of the dimensions. The Goodman and Kruskal l is used which represents the number of correct predictions adjusted for the largest row marginal. The second approach involves estimating EQ-5D values for a separate data set. Data from a trial on radiotherapy after breast cancer surgery for low-risk elderly women (the Postoperative Radiotherapy in Minimum Risk Elderly [PRIME] study) were available for 254 patients with an average age of 72.5 years [16]. Data were collected at four time points; baseline, 3.5 months, 9 months, and 15 months. EQ-5D values are predicted for this data set using both types of models. When using the ordered probit models, the predicted EQ-5D dimensions are transformed into EQ-5D values using the UK population tariff [3]. EQ-5D values are estimated for the overall sample as well as by time points. Predictive performance for this data set is assessed in terms of absolute differences between actual and predicted mean EQ-5D values, the relative prediction error, and whether actual EQ-5D values lie within the 95% confidence interval (CI). As well as statistical significance, it is also important to explore whether the difference in EQ-5D values constitutes a quantitative significant difference. Trivial differences may be statistically significant if sample sizes are relatively large. There are no clear guidelines as to what constitutes a quantitative significant difference and this can only be determined pragmatically. The minimal clinically important difference (MCID) for the EQ-5D has been suggested as 0.03 because this is the smallest difference in EQ-5D value for moving from one level to another on any of the five dimensions [17]. The predicted EQ-5D values are also used to estimate QALYs and these are compared with the actual QALYs. The interest, from a trial perspective, is in terms of differences between health states rather than absolute health state values. When comparing two alternative interventions, the difference in quality of life between the two arms is assessed. It is therefore investigated whether predicted differences in both EQ-5D values and QALYs between the two arms of the trial are similar to the actual differences in EQ-5D values and QALYs. Moreover, it is assessed whether the same conclusion regarding effectiveness would have been reached using the predicted values rather than the actual values. Results Correlations between the individual responses for each of the five EQ-5D questions and for each of the 30 EORTC QLQ C-30 questions were first explored. Statistically significant correlations were found between scores for a number of dimensions which measure similar aspects of quality of life. Table 2 shows the OLS regression results for the EQ-5D values. All statistical significance levels are at the 95% level. The coefficient on global health status is positive and statistically significant (P = 0.003). Three of the function scales are statistically significant, namely role functioning (P 0.000), emotional functioning (P 0.000), and cognitive functioning (P = 0.027). As hypothesized, pain is a statistically significant symptom scale (P 0.000). Fatigue is also statistically significant (P 0.000). Table 3 shows the ordered probit regression results for each of the five EQ-5D dimensions. In the case of mobility, role and cognitive functioning are statically significant (both P 0.000), as well as fatigue (P = 0.003) and dyspnea (P = 0.004). The same variables, apart from dyspnea, are statistically significant with respect to self-care [role functioning (P 0.000), cognitive functioning (P = 0.004), fatigue (P = 0.013)] with the addition of Table 2 Regression results for EQ-5D values financial problems (P = 0.036). In the case of usual activities, global health status (P 0.000), role functioning (P 0.000), and fatigue (P = 0.004) are significant. In the case of pain, emotional functioning, fatigue, and pain are significant [(P = 0.002), (P = 0.03) and (P 0.000), respectively]. Finally, physical and emotional functioning are statistically significant with respect to anxiety and depression [(P = 0.006) and (P 0.000), respectively]. Models were also estimated using raw responses to the 31 EORTC QLQ C-30 questions. Similar results were found (full results can be obtained from the authors). In terms of the interaction terms between time and EORTC QLQ C-30 scales, only the interaction term with pain was statistically significant. This implies that, with the exception of one scale, the relationship between the scales and the EQ-5D values does not change as patients become more familiar with the instruments. Predictive Performance The goodness of fit statistics for the different regression models are reported in Tables 2 and 3. The adjusted R 2 for the OLS regression is relatively high, namely 0.611. The McKelvey and Zavoina s R 2 ranges from 0.370 to 0.668 for the ordered probit regression. Most models have relatively high R 2 indicating that a relatively large part of the variation is explained by the models. The models for mobility and self-care seem to perform less well in terms of goodness of fit. The Goodman and Kruskal l for the ordered probit regression results, reported in Table 3, show that the models reduce prediction error by about 35% on average. The reduction in prediction error is highest for the anxiety and depression dimension (52.8%) and lowest for the self-care dimension (9%). Table 4 shows the predicted EQ-5D values for the PRIME data set. The EQ-5D values predicted on the basis of the OLS regression are reported first. In terms of the overall sample, the predicted EQ-5D value is higher than the actual EQ-5D value. The mean difference is relatively small, namely 0.014. The actual EQ-5D value of 0.760 is just inside the 95% CI of the predicted EQ-5D value which ranges from 0.760 to 0.788. The difference of 0.014 is below an MCID of 0.03. Actual and predicted EQ-5D values are also reported for each time point. The difference between actual and predicted values seems to increase over time. Nevertheless, the actual EQ-5D values are all within the 95% CI of the predicted EQ-5D values. Moreover, all of the differences b t-value Global health status* 0.0016 3.06 Physical functioning 0.0004 1.01 Role functioning* 0.0022 5.26 Emotional functioning* 0.0028 6.16 Cognitive functioning* 0.0009 2.22 Social functioning 0.0002 0.67 Fatigue* -0.0021-3.76 Nausea/vomiting -0.0005-1.59 Pain* -0.0024-6.11 Dyspnea 0.0004 1.46 Insomnia 0.00004 0.14 Appetite loss 0.0003 1.15 Constipation 0.0001 0.64 Diarrhea -0.0003-0.62 Financial problems -0.0006-1.41 Constant 0.2376 3.40 Adjusted R 2 0.611 N observations 877 *denotes a statistically significant attribute.

170 McKenzie and van der Pol Table 3 Regression results for EQ-5D dimensions Mobility Self-care Usual activities Pain Anxiety/depression b t-value b t-value b t-value b t-value b t-value Global health status -0.0048-1.46 0.0027 0.90-0.0110-3.81-0.0001-0.04-0.0009-0.27 Physical functioning -0.0026-1.13-0.0049-1.97 0.0016 0.74 0.0026 1.11-0.0063-2.77 Role functioning -0.0132-5.20-0.0133-5.20-0.0231-8.81 0.0040 1.63 0.0019 0.85 Emotional functioning -0.00005-0.02-0.0019-0.67-0.0040-1.21-0.0090-3.10-0.0415-11.35 Cognitive functioning -0.0104-3.87-0.0066-2.85-0.0019-0.72 0.0006 0.21-0.0027-1.15 Social functioning 0.0014 0.74-0.0010-0.49-0.0022-1.07 0.0031 1.53-0.0027-1.28 Fatigue 0.0112 2.96 0.0078 2.47 0.0085 2.86 0.0073 2.17 0.0051 1.59 Nausea/vomiting 0.0015 0.71 0.0004 0.21-0.0004-0.17 0.0035 1.45 0.0010 0.43 Pain -0.0017-0.72 0.0010 0.41-0.0012-0.50 0.0420 12.29-0.0019-0.92 Dyspnea 0.0052 2.85-0.0014-0.74-0.0023-1.40-0.0020-1.04-0.000001 0.01 Insomnia 0.0002 0.12-0.0007-0.35-0.000001 0.01 0.0026 1.50-0.0024-1.52 Appetite loss -0.0019-1.16-0.0024-1.26-0.0001-0.09 0.0008 0.44 0.0003 0.16 Constipation -0.0017-1.18 0.0002 0.14-0.0003-0.16 0.0019 1.20 0.0005 0.32 Diarrhea 0.0026 0.82 0.0037 1.68 0.0041 1.66-0.0017-0.58-0.0012-0.37 Financial problems 0.0005 0.22 0.0045 2.09 0.0030 1.30 0.0007 0.30-0.0010-0.44 N 877 877 877 877 877 McKelvey and Zavoina s R 2 0.473 0.370 0.621 0.668 0.565 Goodman and Kruskal l 0.289 0.090 0.442 0.425 0.528 are below an MCID of 0.03. The mean QALYs are also reported in Table 4. Given that the predicted EQ-5D values are lower than the actual EQ-5D values, it follows that the mean QALYs estimated from predicted EQ-5D values are also higher than the mean QALYs estimated from actual EQ-5D values. The difference is equal to 0.018 QALYs. The actual mean QALYs is well within the 95% CI of the predicted QALYs. Table 4 also shows the EQ-5D values based on predicted EQ-5D dimensions from the ordered probit regression results. The models seem to predict less well as the difference between predicted and actual EQ-5D values is much larger, namely 0.050. Moreover, the actual value of 0.760 is outwith the 95% CI of the predicted EQ-5D value which ranges from 0.797 to 0.822. The difference of 0.050 is also greater than the MCID of 0.03. The difference between actual and predicted values again seems to increase over time. The models also predict less well in terms of QALYs as the difference between predicted and actual QALYs is equal to 0.072. Results by Arm of the Trial As highlighted before, the real question is whether the differences in predicted values between the arms of the trial are similar to the differences in actual values. That is, would the same conclusions have been reached regarding effectiveness and cost-effectiveness? In the PRIME study, no statistically significant differences in either the EQ-5D values or QALYs were found between the two arms of the trial. Table 5 shows the EQ-5D values and QALYs by arm of the trial based on the predictions from the OLS regression model. None of the differences in the predicted EQ-5D values between the two arms of trial are statistically significant. Moreover, the difference in predicted QALYs between the two arms of the trial is almost identical to the actual difference in QALYs, namely 0.017 versus 0.019. Conclusions The aim of this article was to map the EORTC QLQ C-30 onto the EQ-5D measure to enable the estimation of health state values based on the EORTC QLQ C-30 data. Both the EQ-5D values as well as the EQ-5D dimensions were modeled as a function of the EORTC QLQ C-30 global health status, functioning, and symptom scales. The predictive ability of the model was tested on a separate data set. The results showed that the model explaining EQ-5D values predicted well. All of the actual Table 4 Predicted versus actual EQ-5D values and quality-adjusted life-years (QALYs) Actual Predicted (95% CI) Difference Relative prediction error Predictions based on predicted values from OLS regression Mean EQ-5D 0.760 0.774 (0.760 0788) 0.014 1.84% Mean EQ-5D by time point Baseline 0.765 0.772 (0.746 0.798) 0.007 0.92% 3.5 months 0.778 0.780 (0.752 0.807) 0.002 0.26% 9 months 0.756 0.780 (0.751 0.808) 0.024 3.17% 15 months 0.741 0.764 (0.735 0.794) 0.023 3.10% Mean QALYs 0.945 0.963 (0.933 0.9937) 0.018 1.90% Predictions based on predicted dimensions from ordered probit regressions Mean EQ-5D 0.760 0.810 (0.797 0.822) 0.050 6.58% Mean EQ-5D by time point Baseline 0.765 0.801 (0.776 0.826) 0.035 4.71% 3.5 months 0.778 0.814 (0.789 0.839) 0.036 4.63% 9 months 0.756 0.824 (0.800 0.848) 0.068 8.99% 15 months 0.741 0.800 (0.773 0.828) 0.059 7.96% Mean QALYs 0.945 1.017 (0.980 1.054) 0.072 7.62% CI, confidence interval; OLS, ordinary least squares.

Mapping the EORTC QLQ C-30 onto the EQ-5D 171 Table 5 Actual and predicted EQ-5D values and quality-adjusted lifeyears (QALYs) by arm of the trial Radiotherapy No radiotherapy Difference between arms t-value EQ-5D value Actual 0.765 0.755-0.010 0.77 Predicted 0.779 0.769-0.009 0.70 Baseline Actual 0.764 0.766 0.002 0.07 Predicted 0.770 0.774 0.004 0.15 3.5 months Actual 0.783 0.773-0.011 0.47 Predicted 0.784 0.776-0.008 0.27 9 months Actual 0.774 0.740-0.033 1.16 Predicted 0.792 0.768-0.025 0.85 15 months Actual 0.743 0.740-0.003 0.09 Predicted 0.770 0.759-0.011 0.37 QALYs Actual 0.954 0.935-0.019 0.69 Predicted 0.972 0.955-0.017 0.54 values were within the 95% CI of the predicted values. More importantly, the predicted difference in QALYs between the arms of the trial was almost identical to the actual difference in QALYs. Using predicted rather than actual values would therefore not have changed the conclusions regarding the costeffectiveness of the intervention. This indicates that there is potential to estimate EQ-5D values using responses to the disease-specific EORTC QLQ C-30 measure of quality of life. Such potential implies that in studies that do not include diseasespecific measures, it might still be possible to estimate QALYs. The prediction of the EQ-5D values based on predicted levels for each of the dimensions was less successful. This is most likely because of the fact that the analysis did not take any correlation between the dimensions into account because the dimensions were modeled separately. The use of multivariate analysis might improve the predictive ability. Unfortunately, multivariate ordered probit analysis is currently not available within standard software packages such as STATA and Limdep (Econometric Software, Plainview, NY). The mapping was very successful in this study in that the model predicted the EQ-5D levels and QALYs well for a separate data set. The question arises whether these results are generalizable. The data used were collected from patients with esophageal cancer. An advantage of this data set was that there were sufficient numbers of patients in each of the levels of the five EQ-5D dimensions. This patient group, however, is unlikely to be representative of the average cancer patient group. As well as the type and stage of cancer factors such as age and sex may affect the predictive performance of the model. Although the results showed that the model did predict well for a group of patients with different type of cancer, namely breast cancer, the average age of the patients was similar in the two data sets. Further research exploring predictive performance for different patient groups is clearly required before the application of the model should become a recommended approach for converting the EORTC QLQ-C30 data into EQ-5D values. The data used in this study came from two trials, both funded by the NHS HTA Programme. The first trial is A pragmatic randomized controlled trial of the cost-effectiveness of palliative therapies for patients with inoperable esophageal cancer NHS HTA 96/06/07, and the second trial is The PRIME breast cancer trial (postoperative radiotherapy in minimumrisk elderly) NHS HTA 96/03/01. We would like to thank Paul McNamee for making the first set of data available to us. The Chief Scientist Office of the Scottish Executive Health Department (SEHD) funds HERU. The views expressed in this article are those of the author only and not those of the funding body. Source of financial support: None. References 1 National Institute for Clinical Excellence. Guide to the methods of technology appraisal. Available from: http://www.nice.org.uk/ nicemedia/pdf/tap_methods.pdf [Accessed November 2, 2007]. 2 Dolan P. Output measures and valuation in health. In: Drumond M, McGuire A, eds. Economics Evaluation in Health Care. New York: Oxford University Press, 2001. 3 Dolan P. Modelling valuation for EuroQol health states. Med Care 1997;35:351 63. 4 Brazier J, Roberts J, Deverill M. The estimation of a preferencebased measure of health from the SF-36. J Health Econ 2002;21: 271 92. 5 Patrick DL, Deyo RA. Generic and disease-specific measures in assessing health status and quality of life. Med Care 1989;27: S217 32. 6 Brazier JE, Kolotkin RL, Crosby RD, Williams GR. Estimating a preference-based single index for the Impact of Weight on Quality of Life (IWQOL-Lite) Instrument from the SF-6D. Value Health 2004;7:490 8. 7 Tsuchiya A, Brazier JE, McColl E, Parkin D. Deriving preferencebased single indices from non-preference based condition-specific instruments: converting AQLQ into EQ5D indices. Sheffield Health Economics Group Discussion Paper Series 02/1, University of Sheffield, 2002. 8 Brennan DS, Spencer AJ. Mapping oral health related quality of life to generic health state values. BMC Health Serv Res 2006;6: 96 106. 9 Aaronson NK, Ahmedzai S, Bergman B, et al. The European Organization for Research and Treatment of Cancer QLQ C-30: a quality-of-life instrument for use in international clinical trials in oncology. J Natl Cancer Inst 1993;85:365 76. 10 Bottomley A, Aaronson NK. International perspective on healthrelated quality-of-life research in cancer clinical trials: the European Organisation for Research and Treatment of Cancer experience. J Clin Oncol 2007;25:5082 6. 11 EuroQol Group. EQ-5D User Guide. Rotterdam: EuroQol Group, 2000. 12 Brooks R, EuroQol Group. EuroQol: the current state of play. Health Policy 1996;3:53 72. 13 Brazier J, Deverill M, Green C, et al. A review of the use of health status measures in economic evaluation. Health Technol Assess 1999;3:9. 14 Shenfine J, McNamee P, Steen N, et al. A pragmatic randomised controlled trial of the cost-effectiveness of palliative therapies for patients with oesophageal cancer. Health Technol Assess 2005;9:5. 15 Veall MR, Zimmermann KF. Pseudo-R2 measures for some common limited. Dependent variable models. J Econ Surv 1996; 10:241 59. 16 Prescott RJ, Kunkler IH, King CC, et al. PRIME: a randomised controlled trial of post-operative radiotherapy following breast conserving surgery in a minimum risk older population. Health Technol Assess 2007;11:31. 17 Marra CA, Woolcott JC, Kopec JA, et al. A comparison of generic, indirect utility measures (the HU12, HU13, SF-6D, and the EQ-5D) and disease-specific instruments (the RAQoL and the HAQ) in rheumatoid arthritis. Soc Sci Med 2005;60:1571 782.