

ORIGINAL ARTICLE

Computerized Adaptive Testing for Follow-Up After Discharge From Inpatient Rehabilitation: I. Activity Outcomes

Stephen M. Haley, PhD, PT, Hilary Siebens, MD, Wendy J. Coster, PhD, OTR, Wei Tao, BS, Randie M. Black-Schaffer, MD, MA, Barbara Gandek, MS, Samuel J. Sinclair, MEd, Pengsheng Ni, MD, MPH

From the Health and Disability Research Institute, Boston University, Boston, MA (Haley, Tao, Ni); Department of Rehabilitation Medicine, University of Virginia, Charlottesville, VA (Siebens); Department of Occupational Therapy and Rehabilitation Counseling, Sargent College of Health and Rehabilitation Sciences, Boston University, Boston, MA (Coster); Spaulding Rehabilitation Hospital and the Department of Physical Medicine and Rehabilitation, Harvard Medical School, Boston, MA (Black-Schaffer); and Health Assessment Lab, Waltham, MA (Gandek, Sinclair).

Supported by the National Institute of Child Health and Human Development (grant no. R01 HD043568) and the Agency for Healthcare Research and Quality, and an independent scientist award (grant no. K02 HD ).

A commercial party having a direct financial interest in the results of the research supporting this article has conferred or will confer a financial benefit upon the author or 1 or more of the authors. Haley has stock interest in CRE Care LLC, which distributes the Activity Measure for Post-Acute Care products.

Reprint requests to Stephen M. Haley, PhD, PT, Health and Disability Research Institute, Boston University, 53 Bay State Rd, Boston, MA 02215, e-mail: smhaley@bu.edu.

/06/ $32.00/0 doi: /j.apmr

ABSTRACT. Haley SM, Siebens H, Coster WJ, Tao W, Black-Schaffer RM, Gandek B, Sinclair SJ, Ni P. Computerized adaptive testing for follow-up after discharge from inpatient rehabilitation: I. Activity outcomes.
Arch Phys Med Rehabil 2006;87:

Objective: To examine score agreement, precision, validity, efficiency, and responsiveness of a computerized adaptive testing (CAT) version of the Activity Measure for Post-Acute Care (AM-PAC-CAT) in a prospective, 3-month follow-up sample of inpatient rehabilitation patients recently discharged home.

Design: Longitudinal, prospective 1-group cohort study of patients followed approximately 2 weeks after hospital discharge and then 3 months after the initial home visit.

Setting: Follow-up visits conducted in patients' home setting.

Participants: Ninety-four adults who were recently discharged from inpatient rehabilitation, with diagnoses of neurologic, orthopedic, and medically complex conditions.

Interventions: Not applicable.

Main Outcome Measures: Summary scores from the AM-PAC-CAT, including 3 activity domains of movement and physical, personal care and instrumental, and applied cognition, were compared with scores from a traditional fixed-length version of the AM-PAC with 66 items (AM-PAC-66).

Results: AM-PAC-CAT scores were in good agreement (intraclass correlation coefficient model 3,1 range, .77-.86) with scores from the AM-PAC-66. On average, the CAT programs required 43% of the time and 33% of the items compared with the AM-PAC-66. Both formats discriminated across functional severity groups. The standardized response mean (SRM) was greater for the movement and physical fixed form than the CAT; the effect size and SRM of the 2 other AM-PAC domains showed similar sensitivity between CAT and fixed formats. Using patients' own report as an anchor-based measure of change, the CAT and fixed-length formats were comparable in responsiveness to patient-reported change over a 3-month interval.

Conclusions: Accurate estimates for functional activity group-level changes can be obtained from CAT administrations, with a considerable reduction in administration time.
Key Words: Outcome assessment (health care); Psychometrics; Rehabilitation.

by the American Congress of Rehabilitation Medicine and the American Academy of Physical Medicine and Rehabilitation

COMPUTERIZED ADAPTIVE TESTING (CAT) has been proposed as an alternative to fixed-format instruments that traditionally have been applied to monitoring functional progress in rehabilitation programs. 1-3 In contrast to an assessment in which all items must be scored for every person, CAT 4 selects only questions that are appropriate to a person's functional level based on previous responses and skips items that are obviously too easy or too hard. 5

In previous work, investigators have examined the potential of CAT for group-level adult rehabilitation assessments by conducting empirical simulations. These simulations use item responses from previously collected data to replicate a CAT session by using the most informative items (items that have good discrimination and represent a level of functioning that is near the person's functional ability) for each individual. The simulations clearly indicate that CAT software has potential to minimize response burden and produce accurate group-level scores in samples from a general rehabilitation population, 2,6,7 and in persons with stroke, 8 medically complex conditions, 9 lower-extremity functional deficits, 10 and chronic headaches. 11 These simulation studies, however, may tend to overestimate CAT results in real patient care settings because the same item responses used to estimate the item parameters also are being used to estimate person scores. As a next step, prospective studies of CAT that replicate prospective outcome assessment conditions in patient care environments are needed to further evaluate the accuracy, validity, and responsiveness of the activity measures generated using CAT software.
Reports of prospective CAT applications have been much less common than simulation studies in the field of rehabilitation to date, and those prospective studies have had relatively small samples. For example, Ware et al 2 conducted a cross-sectional prospective pilot study of 20 adult rehabilitation patients using a CAT with a selected set of physical functioning content, and reported a high level of agreement with an alternative short-form and improved discriminant validity over the comparison short-form. In a small sample of children with and without disabilities using a CAT program measuring physical functioning, the CAT program was found to approximate closely the discriminant validity and scoring estimates of the full instrument. 12 In the only prospective longitudinal study of CAT conducted in a rehabilitation environment of which we are aware, Haley et al 13 found that both the full-length functional mobility instrument and the CAT version were able to detect statistically significant functional changes during a 16-week fitness intervention for children with disabilities, with a large reduction in test burden as compared with the full-length instrument.

COMPUTERIZED ADAPTIVE TESTING: ACTIVITY OUTCOMES, Haley

One of the largest assessment challenges in rehabilitation is conducting follow-up of patients once they have returned to the community. The predominant functional assessment system used in rehabilitation, the FIM instrument, may not be the most effective measure for long-term follow-up of patients who have been discharged from inpatient settings. 14 Recent work by Coster et al 15 using patient-reported outcomes suggests that a short-form version of the Activity Measure for Post-Acute Care (AM-PAC) 16 may be more sensitive than the FIM in assessing functional gains and losses once a patient returns to the community. The AM-PAC 17 is a patient-reported outcome system that includes 3 functional activity domains: movement & physical, 6 personal care & instrumental, 18 and applied cognitive. 19 Our research group has built a CAT version of the AM-PAC and undertaken testing of its psychometric properties compared with a fixed-format version in a prospective follow-up study of patients who were recently discharged from inpatient rehabilitation.

CAT applications require: (1) a large set of items (item banks) that are empirically calibrated with fixed item parameters (difficulty, discrimination) for each functional area of interest, (2) items that scale consistently along a dimension of low to high functional proficiency and that target the range of functional ability in the intended sample, and (3) rules that guide starting, stopping, and scoring procedures.
Item response theory (IRT) methods are used to create hierarchically ordered item pools, and then software algorithms select items to match the person's functional level. We have built and tested CAT software based on earlier versions of the AM-PAC item bank. 2,7 In the current study, we have revised some items from previous work and have collected new item calibration data for each of the 3 AM-PAC domains.

Our objective in the current project was to examine essential psychometric properties of CAT scores used to measure functional recovery of patients who have recently been discharged from inpatient rehabilitation. First, we examined the agreement between CAT-generated scores (AM-PAC-CAT) and those derived from a 66-item fixed-length form of the AM-PAC (AM-PAC-66), and we report the amount of time needed for each assessment format. Second, we examined the ability of CAT scores to discriminate among patient groups classified using a known severity index. Finally, we examined the sensitivity of the CAT to detect changes and examined its responsiveness in relation to patient-centered estimates of their own functional changes during the follow-up period. We expected that the advantage of a CAT-based assessment of activity outcomes for follow-up after inpatient discharge would be reduced respondent burden with only marginal losses in score accuracy, discriminant validity, sensitivity, and responsiveness as compared with the longer fixed-length form assessment.

METHODS

Sample

This study is a longitudinal, prospective 1-group cohort study of patients followed within approximately 2 weeks of hospital discharge and then 3 months later. The final sample comprised 94 patients (mean age ± standard deviation [SD], y; range, 20-90y) who had recently been discharged from the inpatient rehabilitation program at a major rehabilitation hospital (designated as a long-term acute care facility). The average inpatient rehabilitation hospital length of stay (LOS) for the study sample was days.
The 94 study participants completed both the initial and follow-up assessments once they returned home. To achieve the final study sample, we originally recruited 149 patients, of whom 111 completed the initial home visit approximately 2 to 6 weeks after hospital discharge and 94 completed the final follow-up visit approximately 3 months after the initial home visit. For the final sample, the average time between hospital discharge and the initial visit was days (range, 5-42d) and the average time between visits was days (range, d).

Eligibility criteria included: 18 years of age or older, receiving inpatient rehabilitation services at the time of recruitment, ability to speak English, and having a planned discharge back home. Participants also needed to pass a cognitive screen 20 to ensure that each person could report reliably on their own functional status. In addition, patients were excluded if the facility recruiter judged that they were unable to give informed consent based on information in the medical record and/or discussions with treating clinicians. Specifically, the presence of any of the following criteria indicated ineligibility: (1) any orientation deficit, (2) difficulty remembering the day's events, or (3) receptive or expressive communication deficits that precluded the patient from communicating responses reliably (verbally or nonverbally).

The final study sample was stratified to include approximately equal numbers of subjects in 3 major patient groups: (1) 38.7% with neurologic disorders (eg, stroke, multiple sclerosis, Parkinson's disease, brain injury, spinal cord injury, neuropathy); (2) 33.3% with musculoskeletal disorders (eg, fractures, joint replacements, orthopedic surgery, joint or muscular pain); and (3) 28.0% with medically complex disorders (eg, debility resulting from illness, cardiopulmonary conditions, postsurgical recovery).
To ensure good representation of levels of functional severity, recruitment was also stratified to yield a distribution of subjects representing 2 distinct severity levels: slight to mild (41.5%) and moderate to severe (58.5%), based on scores from an adapted Modified Rankin Scale (MRS), 21 in which we converted the original 4 categories into 2. This was done so we could make meaningful comparisons in severity groups with the available sample size. The sample was heterogeneous and reflects the racial and ethnic distribution of the recruitment site, although we did make a strong effort to over-recruit minorities. See table 1 for a full description of the demographic characteristics of the final study sample. Eighty-three percent of the patients were receiving some form of home or outpatient skilled rehabilitation services at the time of the initial visit, and 55% continued to receive services at the follow-up visit. The institutional review boards of Boston University and the recruitment facility approved the study and all persons signed informed consent forms prior to participation.

Table 1: Demographic Characteristics of Final Study Sample (N=94)

Characteristics                             Values
Mean age ± SD (y)
Sex (%)
  Male                                      45.2
  Female                                    54.8
Marital status (%)
  Married                                   67.6
Race (%)
  White                                     82.6
  Black/African American                    14.1
  Asian                                     3.3
Education (%)
  High school or less                       44.1
  Bachelor/certificate                      40.9
  Graduate degree                           15.1
Impairment group (%)
  Neurology                                 38.7
  Orthopedics                               33.3
  Medically complex                         28.0
MRS (%)
  Severe/moderate                           58.5
  Mild                                      41.5
Living with (%)
  Alone                                     25.5
  Spouse/partner                            28.7
  Family                                    42.6
  Nonfamily                                 3.2
Living location (%)
  House                                     54.3
  Apartment/condominium                     38.3
  Senior housing                            4.3
  Assisted living                           3.2
Problems with (%)
  Eyesight                                  22.3
  Hearing                                   13.8
  Speech                                    13.8
  Thinking/understanding/remembering        14.9
  Use of legs                               80.9
  Use of arms                               36.2
  Grasping and use of fingers               38.3
  Walking                                   79.8
Mean FIM discharge ± SD
  Motor score
  Cognitive score
  Total score

AM-PAC Item Banks

We developed the item banks used to build the CAT in this study from a calibration study that included 535 patients from inpatient and transitional care rehabilitation units (48%), outpatient (26%), and home care settings (25%). To reduce response burden, and to avoid administering irrelevant items to patients in either a hospital or community setting, data on core items were obtained on all patients across settings, and data on additional items were collected depending on whether the patient was in an inpatient or community rehabilitation setting. These procedures have been outlined in previous item calibration studies on the AM-PAC. 6 SF-8 Health Survey 22 data indicated that the health of the item calibration sample based on the physical component summary (mean ± SD, ) was below the U.S. population norms (mean ± SD, 50±10), although the mental component summary (mean, ) was consistent with U.S. population norms (mean ± SD, 50±10).

The full AM-PAC item pool consisted of 233 activity items, of which 117 movement & physical, 52 personal care & instrumental, and 47 applied cognitive items were retained for the final analyses. The majority of items removed were items that involved use of wheelchairs, because of the small number of persons using wheelchairs in the calibration sample; other items were deleted because of poor fit or redundancy of content. Item parameter estimations and model fit tests were conducted using Parscale software. 23,a A generalized partial credit model was used for all 3 domains, because the data did not support the requirement of common item slopes. However, to obtain convergence for 2 domains (movement & physical, applied cognitive), we used a variation of the generalized partial credit model, in which a 1-parameter model first was estimated on a core set of items completed by all patients (15 for movement & physical, 18 for applied cognition).
We then estimated a second 1-parameter model (for each domain) for the remaining items, anchoring on the core items. We evaluated fit to the model based on the comparison of expected and observed values across the distribution of the latent variable, following a method described by Mislevy and Bock 24 and adapted with slight modification for use in Parscale. Bonferroni-adjusted P values were used for significance testing. Assumptions of unidimensionality and local independence were evaluated prior to finalizing the item banks using factor analysis of categorical data, 25,26 because violations of these model assumptions can affect the estimation of item information discrimination parameters. 27

We estimated IRT-based scores for the item banks using weighted maximum likelihood estimation. 28 Weighted maximum likelihood is less biased than maximum likelihood estimation with the same asymptotic variance and normal distribution, and is more accurate than other procedures with a CAT fixed item stop rule. 29 The final IRT-based AM-PAC scores were standardized to a mean of 50 and SD of 10 based on the current rehabilitation sample. This was done so that future new items could be easily integrated onto the same scale metric.

AM-PAC-CAT

We based the AM-PAC algorithms on the DYNHA software b developed at QualityMetric Inc. 11,30 The AM-PAC-CAT was designed to be completed by patients and was administered from a stand-alone laptop computer using a Windows operating system. For each of the 3 separate activity domains, we selected an initial item with a high information function in the middle of the scoring range and content that seemed appropriate for most respondents. We chose to use an item in the middle of the range for the initial item because the test information function usually peaks in this range.
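As an illustration of what "a high information function in the middle of the scoring range" means, the following minimal sketch computes category probabilities and Fisher information for an item under a generalized partial credit model and picks the most informative item at a given score. It is not the DYNHA or Parscale implementation. The function names, the three-item bank, and its parameter values are hypothetical, and the score is expressed on the logit scale (0 = middle of the range) rather than the 50±10 metric used in the article.

```python
import numpy as np

def gpcm_probs(theta, a, thresholds):
    """Category probabilities for one item under the generalized partial credit model.

    theta: person score on the latent (logit) scale
    a: item discrimination (slope)
    thresholds: step difficulties b_1..b_m for an item scored 0..m
    """
    steps = a * (theta - np.asarray(thresholds, dtype=float))
    # Cumulative sums of the step terms; category 0 has an empty sum (0).
    logits = np.concatenate(([0.0], np.cumsum(steps)))
    logits -= logits.max()  # subtract the maximum for numerical stability
    expl = np.exp(logits)
    return expl / expl.sum()

def item_information(theta, a, thresholds):
    """Fisher information: a^2 times the variance of the item score at theta."""
    p = gpcm_probs(theta, a, thresholds)
    k = np.arange(p.size)
    mean = np.sum(k * p)
    return a ** 2 * np.sum((k - mean) ** 2 * p)

def most_informative_item(theta, bank, administered=()):
    """Return (item_id, information) for the best not-yet-administered item."""
    best_id, best_info = None, -np.inf
    for item_id, (a, thresholds) in bank.items():
        if item_id in administered:
            continue
        info = item_information(theta, a, thresholds)
        if info > best_info:
            best_id, best_info = item_id, info
    return best_id, best_info

# Hypothetical three-item bank: item_id -> (slope, step difficulties).
bank = {
    "easy": (1.2, [-2.5, -1.5]),
    "middle": (1.5, [-0.5, 0.5]),
    "hard": (1.0, [1.5, 2.5]),
}

# Starting item: the most informative item at the middle of the scale.
start_id, start_info = most_informative_item(0.0, bank)
print(start_id, round(start_info, 3))
```

In an adaptive session the same selection function would be called again after each response, with `theta` replaced by the updated score estimate and the administered items excluded.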
The response to the first item is fed into the DYNHA engine, and the application calculates a probable score, as well as a person-specific measure of score precision. Rules for stopping each AM-PAC-CAT domain were based on score precision or maximum number of items. If the score is not estimated with sufficient precision, additional questions are selected and administered until the precision of the score on the 50±10 metric, expressed as the 95% confidence interval (CI) around the score, meets the set limit (standard error, 5) or the defined maximum number of items has been administered (10 per domain).

AM-PAC Short-Forms

We selected 66 items (AM-PAC-66) from the AM-PAC item banks for inclusion on the AM-PAC short forms to maximize content coverage and information value of items across the range of content within each activity domain. We felt that these 66 items represented all aspects of content and the full range of functional ability expected in the sample. Thus, approximately 20 items per activity domain were selected to approximate the actual IRT latent trait scores estimated by the full set of items in each activity bank. We selected items at or near the extreme score ranges to minimize ceiling and floor effects. Twenty-six items were selected for the movement & physical (22% of the item bank), 20 items for personal care & instrumental (38% of the item bank), and 20 items for applied cognitive (43% of the item bank) domains.

Testing Procedures

We collected AM-PAC-66 and AM-PAC-CAT data from patient interviews at 2 time points: approximately 2 weeks after discharge from the inpatient rehabilitation hospital and 3 months after the initial home visit. An on-site recruiter at the inpatient hospital explained the study to potential participants, answered any questions, and obtained signed consent forms prior to hospital discharge. The data collector abstracted information from the medical record including basic demographic, medical, and diagnostic information, and all FIM inpatient rehabilitation admission and discharge data, as well as selected information from the Uniform Data System for Medical Rehabilitation 31 data fields. All data were entered into files on laptop computers without personal identifiers.

Patient interviews were conducted by trained interviewers at the subjects' current living location. Research staff contacted each subject 1 to 2 weeks before each interview was scheduled to set up a convenient time for the interview. A window of 6 weeks from the due date to be interviewed was applied. Subjects not interviewed within 6 weeks of hospital discharge were dropped from the study. The administration sequence of the AM-PAC-66 and AM-PAC-CAT was alternated systematically to avoid an order effect. Each person enrolled was consecutively assigned a test order pattern: either CAT first and short-form second on the initial visit, then short-form first and CAT second on the follow-up visit, or the reverse pattern, starting with the short-form first on the initial visit. Because of drop-outs, the order pattern of CAT first on the initial visit occurred 52.1% of the time, while the reverse order occurred 47.9% of the time.

We collected the AM-PAC-66 and all other data besides the CAT by interview. During CAT administration, participants viewed the computer screen along with the data collectors. If the person was computer literate and chose to interact with the computer directly, the respondent was asked to enter item responses into the computer by use of mouse or touch pad.
Most often the data collector served as the person who used the mouse to record the response for the patient. At the follow-up visit, and after both CAT and AM-PAC-66 administrations, we asked each person to rate his/her functional status (worse, about the same, better) in each activity domain, since the start of the study, using a standard Likert scale. For each domain, the patient first rated him-/herself as worse, about the same, or better compared with 3 months earlier, and then scored the amount of change using a 15-point scale ranging from -7 (a very great deal worse) through 0 (no change) to +7 (a very great deal better). Participants completed all 3 global rating questions at the end of the interview. Each interview lasted about 45 minutes to an hour. We collected the actual time (to the closest minute) required for administration of the AM-PAC-66; the AM-PAC-CAT had an internal clock to track the amount of time and the number of items needed to meet preset levels of precision. Additional data were collected at each visit if time permitted, including participation outcomes using short forms and CAT, the results of which are reported in a companion article.

Analyses

Intraclass correlation coefficients, model 3,1 (ICC 3,1 ), 35 between CAT initial visit and follow-up scores and the best-estimate IRT-based latent trait scores from the AM-PAC-66 were calculated to assess the extent to which CAT scores accurately reproduced an estimate of the item bank score based on the 66-item short-form. ICCs were calculated as a ratio of the variance of scores between subjects to the total variance of scores between and within subjects. For group estimates, reliability is considered high if the ICC is greater than .80, substantial if it is between .61 and .80, moderate between .41 and .60, and poor to fair if it is .40 or less. 35

The ability of the AM-PAC-CAT, compared with the AM-PAC-66 versions, to discriminate between groups of patients on the basis of severity of disability (adapted MRS score 36 ) was evaluated by a series of independent sample t tests. For this analysis, we found similar patterns in the initial and follow-up data; thus, to decrease the number of tests and limit dependencies, we report just the follow-up data for the discriminant validity comparisons.

We used a series of paired t tests to examine differences in sensitivity between the AM-PAC-66 and CAT versions. We defined sensitivity as the amount of positive change detected by an instrument. As no one index of sensitivity appears to be used consistently, we calculated 2 sensitivity indices. The simple effect size (ES) for correlated samples is the average change between initial and follow-up measurements, divided by the SD of the initial measurement. 37 The standardized response mean (SRM) is the ratio of mean change to the SD of the change score. 38 Within the same sample, the SRM is identical to the Cohen statistic (d) for correlated samples, which is calculated by taking the paired t statistic and dividing it by the square root of the sample size. 39 The ES and SRM often yield the same ranking of measures, but the absolute values can be different. The magnitude of ES is dependent on the variability of scores within the initial measurement session. In situations in which the correlation between the initial and follow-up scores is large, the SRM is considerably larger than the ES. 40 To compare the equality of the ES and SRM between activity domain pairs of the AM-PAC-66 and AM-PAC-CAT, we generated 95% CIs using a total of 5000 bootstrap random samples with replacement.
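The two sensitivity indices and their bootstrap intervals can be sketched as follows. This is a minimal illustration, not the authors' analysis code: the function names are hypothetical, the scores are synthetic, and the percentile method for the 95% CI is an assumption, since the text states only that 5000 bootstrap samples were drawn with replacement.

```python
import numpy as np

def effect_size(initial, follow_up):
    """ES: mean change divided by the SD of the initial measurement."""
    initial = np.asarray(initial, dtype=float)
    follow_up = np.asarray(follow_up, dtype=float)
    return (follow_up - initial).mean() / initial.std(ddof=1)

def standardized_response_mean(initial, follow_up):
    """SRM: mean change divided by the SD of the change scores."""
    change = np.asarray(follow_up, dtype=float) - np.asarray(initial, dtype=float)
    return change.mean() / change.std(ddof=1)

def bootstrap_ci(stat, initial, follow_up, n_boot=5000, alpha=0.05, seed=0):
    """Percentile bootstrap CI (assumed method), resampling patients as pairs."""
    rng = np.random.default_rng(seed)
    initial = np.asarray(initial, dtype=float)
    follow_up = np.asarray(follow_up, dtype=float)
    n = initial.size
    estimates = np.empty(n_boot)
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)  # sample patients with replacement
        estimates[b] = stat(initial[idx], follow_up[idx])
    lower, upper = np.percentile(estimates, [100 * alpha / 2, 100 * (1 - alpha / 2)])
    return lower, upper

# Synthetic scores for 94 hypothetical patients on a 50±10 metric.
rng = np.random.default_rng(1)
initial = rng.normal(50, 10, size=94)
follow_up = initial + rng.normal(4, 6, size=94)

es = effect_size(initial, follow_up)
srm = standardized_response_mean(initial, follow_up)
srm_ci = bootstrap_ci(standardized_response_mean, initial, follow_up)
print(f"ES={es:.2f}  SRM={srm:.2f}  95% CI for SRM=({srm_ci[0]:.2f}, {srm_ci[1]:.2f})")
```

Resampling the initial and follow-up scores of each patient together keeps the within-person correlation intact, which is what makes the ES and SRM intervals comparable across test formats.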
To examine the efficiency and difference in response burden of the CAT, we used a series of 1-sample t tests to examine the difference between the number of items required in the AM-PAC-CAT versus the AM-PAC-66 (fixed), and a series of paired t tests to examine differences in the amount of time needed for the AM-PAC-CAT (internal computer clock) and AM-PAC-66 (timing by test administrators).

Finally, we examined responsiveness using 2 methods. We reserved the term responsiveness to mean changes based on an external anchor, in this case, the global ratings of change between initial and follow-up visits provided by the patients. We collapsed data from an original 15-point Likert scale of change that has been used in previous responsiveness studies 34 (appendix 1). We grouped the absolute values of the ratings (either worse or better) using the following categories: 0 and 1 as no change, 2 to 5 as small to medium change (some), and 6 and 7 as large change. We limited our analyses to 3 rather than the customary 4 categories 32,34 because of negligible numbers in the smallest change category. We compared the mean AM-PAC-CAT and AM-PAC-66 scores for each change category. Then, we contrasted the responsiveness of the 2 test formats by producing receiver operator characteristic (ROC) curves 41,42 for detecting at least a small to medium change (either worse or better) based on patient-centered ratings. The construction of paired ROC curves for the AM-PAC-CAT and AM-PAC-66 involved plotting sensitivity against (1 - specificity) along multiple cutpoints based on the absolute change values of patient scores from either the AM-PAC-CAT or the AM-PAC-66 domains. A series of change cutpoints using absolute values ( 89 per domain) were used to develop the ROC curves. The true positive rate (sensitivity) is the proportion of those patients exceeding each absolute change cutpoint relative to those who reported making at least a small to medium change based on their own global rating.
The false positive rate (ie, 1 - specificity) is the proportion of patients exceeding each absolute change cutpoint relative to those who did not report at least a small to medium change based on their own ratings. Chi-square tests were conducted to examine if the areas under the ROC curves were different than expected by chance alone (curve that follows the diagonal) and if the areas under the paired AM-PAC-CAT and AM-PAC-66 ROC curves were statistically different. When comparing 2 ROC curves constructed on the same individuals, statistical analyses between the curves should take into account the correlated nature of the data. 43

RESULTS

Sample Follow-Up

There were no significant differences in average age, proportions of sex or race, or average inpatient LOS between the study group (n=94) and persons who enrolled, but dropped out of the study prior to the first visit (n=38). Persons who dropped out had a higher average discharge FIM total (mean difference, 4.1; t=2.67, P=.008) and discharge FIM motor scores (mean difference, 4.3; t=2.93, P=.004) than the study group. A greater percentage of persons who dropped out had less severe levels of physical disability as measured by an adapted severity classification of the MRS 21 (χ² test=10.95, P=.004). There were no differences in demographic or initial activity scores from the CAT or the fixed-length form between those subjects who completed both visits in comparison to those who were lost to follow-up (n=17).

Table 2: Score Agreement Between the AM-PAC-CAT and the AM-PAC-66

Activity Domain                 Interview    ICC 3,1 (N=94)
Movement & physical             Initial      .829
                                Follow-up    .864
Personal care & instrumental    Initial      .794
                                Follow-up    .771
Applied cognition               Initial      .807
                                Follow-up    .844

ICC 3,1 (average within domain) .850
ICC 3,1 (average across domains)

Table 4: Respondent Burden of the AM-PAC Versions in a Longitudinal Sample (N=94)

Version                    Mean ± SD    Range    CAT as % of Fixed
AM-PAC-66
  Time (min)
AM-PAC-CAT
  Time (min) *
  No. of items

*Paired t test (t=16.59, P<.001).
One-sample t test (t , P<.001).
Reasons for loss to follow-up included: death (n=3), unable to contact or no longer living at home (n=6), refused (n=6), and missing data (n=2).

Score Comparability and Validity

Intraclass correlations between score estimates of all of the CATs and the AM-PAC-66 indicate a high to substantial degree of correspondence (ICC range, .77-.86) for group-level data. For the entire sample, the average ICC across all 3 domains and test occasions was .82 (table 2). We saw no substantial differences in score agreement between the initial (ICC mean, .810) and follow-up (ICC mean, .826) test occasions. With extreme outliers removed (difference 10 points, or 1 SD, between the AM-PAC-66 and the AM-PAC-CAT; n=5 to 7 per domain for the movement & physical and applied cognition domains; n=16 for the personal care & instrumental domain), greater than 75% of all individual scores between the CAT and fixed format instruments were within a margin of 5 points (0.5 SD).

Fifty-five (58.5%) patients were coded as having moderate to severe disability on the initial visit, and 42 (44.7%) patients were coded as moderate to severe on the follow-up visit; almost 60% of the sample did not change disability status between visits. The AM-PAC-66 and the AM-PAC-CAT were successful in discriminating between known severity groups in each of the 3 activity domains; that is, both were able to detect statistically significant differences in scores between the 2 severity groups. As expected, because the MRS emphasizes primarily physical functioning, the movement & physical and personal care & instrumental domains in general (both CAT and fixed-form) were more discriminating between physical disability severity groups than was applied cognitive (table 3).
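For readers who want to reproduce the score-agreement statistic on their own data, ICC model 3,1 can be computed from the mean squares of a two-way analysis of variance with patients as rows and test formats as columns. The sketch below is a minimal illustration under that standard formulation; the function name and the small score matrix are hypothetical and are not study data.

```python
import numpy as np

def icc_3_1(scores):
    """ICC(3,1): two-way mixed model, consistency, single measurement.

    scores: array of shape (n_subjects, k_formats), eg, one column of
    CAT scores and one column of fixed-form scores.
    """
    x = np.asarray(scores, dtype=float)
    n, k = x.shape
    grand = x.mean()
    ss_total = ((x - grand) ** 2).sum()
    ss_subjects = k * ((x.mean(axis=1) - grand) ** 2).sum()
    ss_formats = n * ((x.mean(axis=0) - grand) ** 2).sum()
    ss_error = ss_total - ss_subjects - ss_formats
    bms = ss_subjects / (n - 1)            # between-subjects mean square
    ems = ss_error / ((n - 1) * (k - 1))   # residual mean square
    return (bms - ems) / (bms + (k - 1) * ems)

# Hypothetical paired scores (CAT, fixed form) for 6 patients.
paired = np.array([
    [42.0, 45.5],
    [55.0, 53.0],
    [61.5, 64.0],
    [38.0, 41.0],
    [49.5, 47.0],
    [70.0, 66.5],
])
print(round(icc_3_1(paired), 3))
```

Because model 3,1 measures consistency rather than absolute agreement, a constant offset between the two formats does not lower the coefficient; only disagreement in the ordering and spacing of patients does.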
CAT Item Selection and Time Burden

The average number of items SD required for the AM-PAC-CAT (averaged across the initial and follow-up visits) was for movement & physical, for personal care & instrumental, and for applied cognition domains. The minimum number of items per domain was 2 and the maximum (established by the item-stop rule) was 10. The average number of items per person for the 3 domains was For the movement & physical domain, 50 of the 118 items were selected 1 or more times across both initial and follow-up visits; 22 of the 52 items were selected from the personal care & instrumental domain, and 14 of the 47 items were selected from the applied cognition domain. Within each of the activity domains, the 10 most frequently selected items accounted for 62% of the item administrations for movement & physical, 81% for personal care & instrumental, and 97% for applied cognition.

Table 4 summarizes the relative burden for the 94 respondents with complete initial and follow-up visits. Overall, the AM-PAC-CAT yielded large decreases in respondent burden as compared with the AM-PAC-66, requiring 33% of the number of items and 44% of the administration time of the full-length survey. The differences between the 2 formats in the number of items and the amount of time required were significant, favoring the more efficient AM-PAC-CAT.

Table 3: CAT Versus Fixed-Form Test Format Discrimination by Severity of Physical Disability (Follow-Up Data)
Columns: Activity Domain; Version; Mean ± SD, Severe/Moderate (n=42); Mean ± SD, Mild/Slight (n=52); Difference; t; df; P
Rows: AM-PAC-66 and AM-PAC-CAT for each of movement & physical, personal care & instrumental, and applied cognition

Sensitivity

Using bootstrap methods to generate 95% CIs for each ES and SRM, we found only 1 statistically significant paired difference. The movement & physical AM-PAC-66 SRM was statistically higher than the corresponding AM-PAC-CAT SRM (table 5). None of the ES comparisons were statistically significant. The personal care & instrumental domain yielded similar ES and SRM values for the 2 testing formats. Neither the CAT nor the fixed-length format of the applied cognition domain detected significant levels of change between initial and follow-up visits, largely because of a high ceiling effect.

Table 5: Sensitivity of the AM-PAC-CAT Versus the AM-PAC-66 Over 3-Month Interval
Columns: Activity Domain; Test Format; Mean ± SD at Initial Visit, Follow-Up, and Change; t Value (df=93); P; ES; SRM
Rows: AM-PAC-66 and AM-PAC-CAT for each of movement & physical, personal care & instrumental, and applied cognition
*Movement & physical: 95% CIs of the AM-PAC-66 SRM and AM-PAC-CAT SRM do not overlap, indicating significant difference; all other comparisons of ES and SRM are nonsignificant.

Responsiveness

Figure 1 depicts a series of bar charts that highlight the amount of absolute change in the AM-PAC-66 and AM-PAC-CAT formats that corresponds to no change, some change (small to medium), or large change categories as defined by patients' global ratings of change within each respective functional domain.
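The sensitivity comparison can be sketched as follows, assuming the conventional definitions of ES (mean change divided by the SD of the initial scores) and SRM (mean change divided by the SD of the change scores) and a simple percentile bootstrap that resamples patients. The function names, the number of resamples, and the example scores are illustrative assumptions, not the study's analysis code or data.

```python
import random
import statistics


def effect_size(initial, follow_up):
    """ES: mean change divided by the SD of the initial scores."""
    change = [f - i for i, f in zip(initial, follow_up)]
    return statistics.mean(change) / statistics.stdev(initial)


def srm(initial, follow_up):
    """SRM: mean change divided by the SD of the change scores."""
    change = [f - i for i, f in zip(initial, follow_up)]
    return statistics.mean(change) / statistics.stdev(change)


def bootstrap_ci(stat, initial, follow_up, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI, resampling patients with replacement."""
    rng = random.Random(seed)
    n = len(initial)
    estimates = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        try:
            estimates.append(stat([initial[i] for i in idx],
                                  [follow_up[i] for i in idx]))
        except (ZeroDivisionError, statistics.StatisticsError):
            continue  # skip degenerate resamples with zero variance
    estimates.sort()
    m = len(estimates)
    lo = estimates[int(alpha / 2 * m)]
    hi = estimates[max(int((1 - alpha / 2) * m) - 1, 0)]
    return lo, hi


def intervals_overlap(ci_a, ci_b):
    """True when two (low, high) intervals share at least one point."""
    return ci_a[0] <= ci_b[1] and ci_b[0] <= ci_a[1]


# Hypothetical initial and follow-up scores for 10 patients on each format.
fixed_initial = [48, 52, 55, 60, 45, 50, 58, 62, 47, 53]
fixed_follow = [53, 55, 61, 63, 49, 57, 60, 68, 50, 59]
cat_initial = [47, 54, 53, 62, 44, 51, 59, 60, 49, 52]
cat_follow = [50, 53, 62, 61, 51, 55, 58, 69, 48, 60]

fixed_ci = bootstrap_ci(srm, fixed_initial, fixed_follow)
cat_ci = bootstrap_ci(srm, cat_initial, cat_follow)
print("fixed-form SRM 95% CI:", fixed_ci)
print("CAT SRM 95% CI:", cat_ci)
print("intervals overlap:", intervals_overlap(fixed_ci, cat_ci))
```

Under this approach, a pair of statistics is flagged as different only when the 2 intervals fail to overlap, which is a conservative criterion compared with bootstrapping the paired difference directly.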
We used absolute values in these calculations because a number of persons reported worse functional activity status on the follow-up visit than at the initial home visit after hospital discharge. In general, the AM-PAC-CAT and AM-PAC-66 were equally able to detect levels of absolute change across the 3 anchor-based change categories: no statistical differences were found between the AM-PAC-CAT and the AM-PAC-66 in any of the paired comparisons within each change category. We defined some change as a minimal level of change that appeared to have meaning to patients. We also examined separately the differences in change across the 3 categories for only those persons who changed in a positive direction (better function at follow-up than at initial), and found no statistical differences between the AM-PAC-CAT and the AM-PAC-66 scores.

Of the 38 individuals who reported some change in movement & physical, the average change values for the AM-PAC-CAT (mean SD, ) and the AM-PAC-66 (mean, ) were nearly identical. Forty persons indicated that they made some change in the personal care & instrumental domain, and their AM-PAC-CAT (mean, ) and AM-PAC-66 (mean, ) scores reflected analogous levels of responsiveness among those who reported some change. Similarly, even though the magnitude of the changes was smaller, we found comparable levels of change for the 20 individuals who reported some change on the applied cognition domain when comparing the AM-PAC-CAT (mean, ) with the AM-PAC-66 (mean, ).

Fig 1. Comparison of changes (absolute values) detected by CAT and fixed-length formats of the AM-PAC based on categories of patient-reported ratings of change.

Using the "some change" category as the cutpoint for examining paired ROC curves to detect minimal levels of patient-reported change, we found in general that the AM-PAC-CAT and the AM-PAC-66 performed equally well in 2 of the 3 activity domains. For the movement & physical domain, both the AM-PAC-66 (ROC, ; P=.017) and AM-PAC-CAT (ROC, ; P=.006) ROC curves were statistically different from chance levels, and there was no statistically significant difference between the paired ROC curves (ROC difference, ; χ² test with 1 df, .103; P=.748). Similarly, for the personal care & instrumental domain, both the AM-PAC-66 (ROC, ; P=.025) and AM-PAC-CAT (ROC, ; P=.028) ROC curves were statistically different from chance levels, and there was no statistically significant difference between the paired ROC curves (ROC difference, ; χ² test with 1 df, .003; P=.959). Because of the relatively small amounts of change detected in the applied cognition domain, the ROC curves for both the fixed and CAT forms were not statistically different from chance levels.

DISCUSSION

Previous studies of simulations using real patient data have suggested that CAT programs can offer substantial reductions in response burden with relatively small compromises in accuracy or sensitivity to change. In this study, we examined this question prospectively in a cohort of patients who were recently discharged from inpatient rehabilitation and followed over a 3-month interval during which, in keeping with standard rehabilitation practice, most received additional physical, occupational, and/or speech therapy services at home or in an outpatient setting. At the time of the initial visits, 83% were receiving services; this proportion declined to 55% by the time of the second visit.
The results suggest that CAT programs achieve good score correspondence with longer, fixed-length instruments. The average score agreement across the 3 activity domains was an ICC of approximately .81 for the full sample. This level of agreement is considered acceptable for group-level studies. Because we are at an early stage of using CAT in patient assessments, it is not yet clear whether some of the higher correlations seen in recent empirical simulations (.90) will be realized in real-world patient care assessments using CAT programs. The level of score agreement found in this study may not be acceptable when change scores of individual patients are of interest. This can be rectified in future studies by assigning a higher level of precision to the CAT stop rule. Future studies also need to examine the test-retest reliability of the CAT programs over short intervals to estimate the level of measurement error. Previous studies have suggested that test-retest reliability of aggregate scores of functional items is quite acceptable, 44 yet this needs to be confirmed with the CAT platform.

Several alternative explanations may account for score disagreements between the CAT and fixed-format instruments. We noted that most of the largest scoring disagreements between the 2 test formats occurred at the extremes of the range. This is where the CAT may have some important advantages over the fixed form, because items can be better tailored to an individual using the CAT when a person has very low or high functional ability. The fixed form, even though it employs more items, may provide a less accurate estimate of function than the CAT. Both fixed forms and CATs are typically more precise for persons who score in the middle of the range. CATs, however, have more flexibility to meet the content level of persons at the extremes, and thus may have some advantages over fixed forms for score precision at the extremes.
In the case of the personal care & instrumental domain, disagreements in scores for 14 persons can be attributed to the greater content breadth of the CAT, which yielded a higher ceiling score on the CAT than on the fixed form. The personal care & instrumental fixed form did not have as wide a content range as the CAT version. We also identified a number of individuals who made unexpected and inconsistent (relative to the IRT model) responses on a number of items on the CAT. On average across the 3 domains, 30% of the exact items that were answered in both the AM-PAC-66 and AM-PAC-CAT were answered inconsistently. These individuals, who tended to have relatively low functional scores, made it more challenging to obtain a precise estimate of activity function with a CAT based on few items. Certainly, differences in the 2 testing formats (interview vs computer) and the particular sequence of item administration may both contribute to score differences between the CAT and the alternative fixed forms.

We found that CAT-based scores have good discriminant validity vis-à-vis severity of disability across all 3 activity domains. We found only 1 comparison (SRMs for movement & physical) in which the fixed-length form had greater sensitivity than the CAT. This can be attributed mainly to the larger standard deviations seen with CAT scores, as there is more variability in scores when the estimates are generated from substantially fewer items than in the fixed forms. This finding of reduced sensitivity in the movement & physical domain is not entirely unexpected, given that the average number of CAT items was 6 versus the 26 movement & physical items within the AM-PAC-66. Although we established a maximum stop rule of 10 items per activity domain, on average only 6 items were needed to meet the precision requirements (5 points or 0.5 SD) established for each activity CAT.
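The interplay between the item cap and the precision requirement can be illustrated with a toy adaptive test. The sketch below is not the AM-PAC-CAT software: it assumes dichotomous 2-parameter logistic items, a standard normal prior on ability expressed in SD units, maximum-information item selection, and a stop rule of 10 items or a posterior SD of 0.5 or less. The item bank, the parameter values, and all function names are hypothetical.

```python
import math

# Ability grid in SD units, used for a simple numerical posterior.
GRID = [g / 10.0 for g in range(-40, 41)]


def p_endorse(theta, a, b):
    """Probability of endorsing an item with discrimination a, difficulty b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of a 2-parameter logistic item at ability theta."""
    p = p_endorse(theta, a, b)
    return a * a * p * (1.0 - p)


def posterior_mean_sd(responses):
    """Posterior mean and SD of ability under a standard normal prior."""
    weights = []
    for theta in GRID:
        w = math.exp(-0.5 * theta * theta)
        for (a, b), answer in responses:
            p = p_endorse(theta, a, b)
            w *= p if answer else (1.0 - p)
        weights.append(w)
    total = sum(weights)
    mean = sum(t * w for t, w in zip(GRID, weights)) / total
    var = sum((t - mean) ** 2 * w for t, w in zip(GRID, weights)) / total
    return mean, math.sqrt(var)


def run_cat(item_bank, answer_fn, max_items=10, se_target=0.5):
    """Administer items until the precision target or the item cap is reached."""
    remaining = list(item_bank)
    responses = []
    theta, se = 0.0, 1.0  # start at the prior mean and prior SD
    while remaining and len(responses) < max_items and se > se_target:
        # Choose the unused item that is most informative at the current estimate.
        item = max(remaining, key=lambda ab: item_information(theta, *ab))
        remaining.remove(item)
        responses.append((item, answer_fn(item)))
        theta, se = posterior_mean_sd(responses)
    return theta, se, len(responses)


# Hypothetical bank of 25 items and a respondent who endorses every item
# whose difficulty lies below an ability of 1.0 SD.
bank = [(2.0, b / 4.0) for b in range(-12, 13)]
theta, se, n_used = run_cat(bank, lambda item: item[1] < 1.0)
print(n_used, round(theta, 2), round(se, 2))
```

Tightening se_target in this sketch lengthens the test, which mirrors the trade-off between sensitivity and respondent burden that any stop rule has to balance.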
In retrospect, this setting may not have been adequate to detect the full amount of change in this functional domain. In simulation studies, we have seen that sensitivity can be improved quite markedly as the number of items administered increases. 12 In future work, we intend to explore different item-stop rules and precision levels to strike the appropriate balance between sensitivity and test burden.

Using the full study sample, we found that neither the AM-PAC-CAT nor the AM-PAC-66 format of the applied cognition domain detected statistically significant group change. This finding likely reflects a very high ceiling effect (38%) for both the AM-PAC-66 and AM-PAC-CAT scores at the initial home visit. The applied cognition scale was developed to measure cognitive functional activities that involve limited movement requirements, including reading, communication, problem solving, organization and management of routines, as well as telephone use, money management, and management of medications. 19 In our initial calibration work on this scale, we defined the scale only in terms of patients with neurologic disorders such as stroke and brain injury. However, in subsequent work with a short-form version of the applied cognition scale, we found that many individuals with complex medical conditions also were limited in a number of cognitive functional skills. 15 The amount of change on the AM-PAC-CAT in patients with neurologic disorders (n=36) was nearly twice the change (mean, 2.16; ES=.29) seen in the entire longitudinal sample (change mean, 1.23; ES=.17). However, in this study, the persons with complex medical disorders exhibited on average very small amounts of change on the CAT version of this scale (change mean, .59; ES=.08). Based on these results, we may emphasize use of the applied cognition scale with patients who have neurologic impairments or individuals with either suspected or clear signs of cognitive deficits.

We found that approximately 25% of the patients who were followed for approximately 3 months in this study actually obtained lower AM-PAC-CAT or AM-PAC-66 scores on 1 or more domains (indicating less functional ability) on the follow-up visit than on the initial visit. This is not surprising because 23 of 94 (24.5%) patients within the final study sample reported significant new illnesses or injuries between study visits, and 17 of these patients reported that they had been rehospitalized during the study interval. Patients who had declining activity scores were primarily from the neurologic (42%) and complex medical (55%) subgroups.

Because of the relatively large number of patients who lost function during the study, we analyzed the responsiveness of the AM-PAC-66 and AM-PAC-CAT using absolute change values. Absolute values assume that the meaning of change in either direction is likely to be similar, and in the case of a general rehabilitation patient, this assumption is probably warranted. However, for conditions that almost always result in some functional deterioration over time, conditions in which small improvements are unexpected, or cohorts with much larger sample sizes in which the direction of functional change is critical, separate analyses should perhaps be conducted to evaluate patients with either functional improvement or deterioration. 45 By using absolute change in this study, we were able to include all subjects and to compare the ability of the CAT to detect both deterioration and improvement. In contrast, the overall change scores, on which the sensitivity analyses were based, combined patients who deteriorated, were stable, or improved into the mean change scores.
The comparison of the responsiveness of the AM-PAC-66 and AM-PAC-CAT in this study was based on a global rating of change made by the patient at the follow-up home visit. Although patients are considered to be the ideal respondents regarding their own functioning, there are, nevertheless, a number of concerns regarding patient-based anchors of change. These include recall bias and accommodation to the illness or condition. 46 However, methodologic issues aside, patient-based anchors are a very important consideration in evaluating the relative responsiveness of an instrument. Importantly, we found that the AM-PAC-CAT was just as responsive to patient-reported change as the much longer AM-PAC-66 in both the movement & physical and personal care & instrumental domains.

We note impressive efficiency gains in using the AM-PAC-CAT, as it required only one third of the items in the longer fixed-length form, with relatively minor losses in accuracy, sensitivity, or responsiveness. These results suggest that the CAT format is a very promising technology for long-term follow-up. We may have actually underestimated the efficiency gains from the CAT in this study. The amount of time needed to administer the CAT was calculated by an internal computer clock that could not be stopped. During the interview in the home environment, a number of interruptions (eg, phone calls, persons coming to the door, greetings by spouses and family members) occurred during the CAT administration. We were unable to account for these interruptions in calculating the time for CAT administration; thus, the amount of time recorded is likely an overestimate, although this may represent the realistic time required to administer the CAT in a home setting, where interruptions are less controllable than in a clinical setting.
When conducting the interviews of the fixed-length forms, data collectors were instructed to record administration time, but to record the length of interruptions greater than 1 minute and subtract this time from the overall administration time. Future studies in both clinical and home environments should continue to examine the potential efficiency gains by use of the CAT platform for functional assessments. A potential limitation of using primarily an interview format for collecting CAT data is that the responses may not generalize to situations in which a patient is responding to questions directly on the computer. We chose to err on the side of data completeness and quality in this early CAT work on activity functioning; comparisons of item response to the use of an interviewer versus full patient-report could be the focus of future studies. Future CAT development will also need to balance the utility of generating scores for groups of patients with the perceived need by some clinicians for these assessments to provide usable information for individual patient treatment planning and monitoring. For example, CAT item selection programs may be developed in the future to choose items based on content considerations as well as maximizing information value, as was done in this study. A challenge for CAT applications in rehabilitation is to provide enough information at the individual level, and still minimize response burden so that CAT remains feasible in rehabilitation practice. There was considerable attrition from the time of initial recruitment into the study prior to hospital discharge to the 3-month follow-up visit, and persons with higher level functional skills were disproportionately represented in the dropouts. It appears that individuals who have fully recovered or have other community responsibilities such as return to work are less likely to stay involved with a longitudinal follow-up study. 
This factor has important implications for the generalizability of the findings and for retention of individuals in long-term follow-up studies. It should also be noted that because this was an observational design, the final functional status of each patient included changes due to rehabilitation services, the passage of time, and other factors specific to that patient.

CONCLUSIONS

The results of this study support the continued evaluation and development of CAT-based functional assessments in rehabilitation. The current CAT programs were developed for group-level analyses and not for care planning or identifying limitations of individuals in specific areas of functioning. The efficiency gains coupled with very strong psychometric performance of the activity scales suggest that CAT assessments may provide an important tool for future follow-up studies and group monitoring in postacute care.

Acknowledgments: We thank Jakob B. Bjorner, MD, PhD, for statistical support with the item bank development and subsequent analyses. Appreciation is extended to Kristen Foget for recruitment of patients for this study. We also acknowledge Maryann McGerigle, Cindy Garven, and Susanne Fantasia for their data collection activities, Christine Cahalan, Erika Wright, and Jeanne McGerigle for data entry, and Julie Cam and Ashley Harper for help with manuscript preparation.

APPENDIX 1: PATIENT-REPORTED GLOBAL RATING OF CHANGE*

We would like you to think about how much functional change in (movement & physical; personal care & instrumental; applied cognition... each domain described in lay language) has occurred since when you entered the study. Overall, would you say that your functioning is:
1. Worse,
2. About the same, or
3. Better?


More information
