Low back pain (LBP) is a common and costly condition.

Similar documents
Dethroning the Clinical Prediction Rule WPTA Fall Conference 2017

Independent evaluation of a clinical prediction rule for spinal manipulative therapy: a randomised controlled trial

Pros and Cons of Clinical Prediction Rules. Clinical Prediction Rules 7/25/2016

Physiotherapy Interventions for Low Back Pain - Subgrouping Patients with Improved Efficacy. Raymond Tsang. SPT, QMH FHKCOP 24 April 2010.

Psychological predictors of recovery from low back pain: a prospective study

1/26/16 TREATMENT MONITORING. Treatment Monitoring. Psychologically Informed Physical Therapy Practice: Pragmatic Application for Low Back Pain

orthopaedic physical therapy world. Clinical prediction rules are decision-making tools that

What do we want? Cervicothoracic Workgroup. ICF Scheme. start with end in mind. What do consumers want?

Better Outcomes for Older People with Spinal Trouble (BOOST) Research Programme

CHAMP: CHecklist for the Appraisal of Moderators and Predictors

Jones and Bartlett Publishers, LLC. NOT FOR SALE OR DISTRIBUTION

Supplementary Online Content

Aalborg Universitet. Statistical analysis plan Riis, Allan; Karran, E. L. ; Jørgensen, Anette; Holst, S.; Rolving, N. Publication date: 2017

TITLE: Chiropractic Interventions for Acute or Chronic Lower Back Pain in Adults: A Review of the Clinical and Cost-Effectiveness

Low back pain (FS-17)

PRE-ASS ESSMENT. Spinal Manipulation for Lower Back Pain

Straight Leg Raising Test a snapshot summary of evidence (May 2013) Key messages

United Kingdom back pain exercise and manipulation (UK BEAM) randomised trial: effectiveness of physical treatments for back pain in primary care

Stynes, et al. (2016). Classification of patients with low back-related leg pain: a systematic review. BMC Musculoskeletal Disorders; 17:226

Randomised Controlled Trial for Low Back Pain

Systematic Reviews and Meta- Analysis in Kidney Transplantation

Role of evidence from observational studies in the process of health care decision making

Behavioural treatment for chronic low-back pain (Review)

Exercise for Rehabilitation and Treatment: Summary of Research

Arthritis Research UK National Primary Care Centre Winner of a Queen s Anniversary Prize For Higher and Further Education 2009

Generic prognostic factors for musculoskeletal pain in primary care: a systematic review

Time for quality of manual physical therapy Where do we stand now and are we moving in the right direction?

Low back pain (LBP) is one of the most common musculoskeletal

treatment effects (Hedges and Olkin 1980). We conducted a systematic review of randomised clinical trials of spinal manipulative therapy for low back

USDA Nutrition Evidence Library: Systematic Review Methodology

CONSORT 2010 checklist of information to include when reporting a randomised trial*

Tammy Filby ( address: 4 th year undergraduate occupational therapy student, University of Western Sydney

Clinical practice guidelines have proliferated in recent

1/22/2016 PATIENT SUBGROUPING. Patient Subgrouping. Psychologically Informed Physical Therapy Practice: Pragmatic Application for Low Back Pain

Evidence Based Practice. Objectives. Overarching Goal. What it is. Barriers to EBP 4/11/2016

The costs of chronic low back

Musculoskeletal Annotated Bibliography

Research Report. Critical Appraisal of Clinical Prediction Rules That Aim to Optimize Treatment Selection for Musculoskeletal Conditions

Searching for Clinical Prediction Rules in MEDLINE

Systematic reviews and meta-analyses of observational studies (MOOSE): Checklist.

Authors: Claire P MacKelvie BSc (Hons) 1, Chris Seenan, PhD, 2 Cormac G Ryan, PhD 3 Affiliations: 1. Keywords:

MINDFULNESS-BASED INTERVENTIONS IN EPILEPSY

GRADE. Grading of Recommendations Assessment, Development and Evaluation. British Association of Dermatologists April 2018

Brief History of Manipulation The History is on our side

Learning objectives. Examining the reliability of published research findings

Critical Appraisal of a Systematic Review

Osteopathy and the NHS a snapshot summary statement (Nov 2012)

A Systematic Review of the Efficacy and Clinical Effectiveness of Group Analysis and Analytic/Dynamic Group Psychotherapy

Perspective. Treatment-Based Classification System for Low Back Pain: Revision and Update

Controlled Trials. Spyros Kitsiou, PhD

Critical Review: Group Therapy for Post-Stroke Aphasia Rehabilitation

Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist

A systematic review on the effectiveness of complementary and alternative medicine for chronic non-specific low-back pain

Setting The setting was an outpatients department. The economic study was carried out in the UK.

The diagnosis of Chronic Pancreatitis

Functional Tools Pain and Activity Questionnaire

SkillBuilder Shortcut: Levels of Evidence

Evidence based practice. Dr. Rehab Gwada

Using return to work as outcome, 6% to 10% of

Is compensation bad for health? A systematic meta-review. Injury January 8, 2010

Is it useful for chronic pain patients? Therapeutic Patient Education (TPE) Psycho-education Cognitive Behavioral Therapy (CBT) Françoise LAROCHE, MD

Eighty percent of patients with chronic back pain (CBP)

Protocol Development: The Guiding Light of Any Clinical Study

Research Report. Which Prognostic Factors for Low Back Pain Are Generic Predictors of Outcome Across a Range of Recovery Domains?

Essential Skills for Evidence-based Practice Understanding and Using Systematic Reviews

Physical Therapists and Physicians

CHECK-LISTS AND Tools DR F. R E Z A E I DR E. G H A D E R I K U R D I S TA N U N I V E R S I T Y O F M E D I C A L S C I E N C E S

Systematic reviews: From evidence to recommendation. Marcel Dijkers, PhD, FACRM Icahn School of Medicine at Mount Sinai

Workshop: Cochrane Rehabilitation 05th May Trusted evidence. Informed decisions. Better health.

Results. NeuRA Mindfulness and acceptance therapies August 2018

PROSPERO International prospective register of systematic reviews

COMPARISON BETWEEN THE EFFECTIVENESS OF MCKENZIE EXTENSION EXERCISES AND WILLIAM FLEXION EXERCISES FOR TREATMENT OF NON-SPECIFIC LOW BACK PAIN

According to the 2010 Global Burden of Disease study,

Overview and Comparisons of Risk of Bias and Strength of Evidence Assessment Tools: Opportunities and Challenges of Application in Developing DRIs

MUSCULOSKELETAL PROGRAM OF CARE

Supplementary Online Content

A value proposition for early physical therapist management of neck pain: a retrospective cohort analysis

Evidence-based Laboratory Medicine: Finding and Assessing the Evidence

Executive Summary. Musculoskeletal Disorders and Commercial Motor Vehicle Driver Safety. April 30, 2008

Supplemental Video Available at

Research Report. Preliminary Evaluation of a Modified STarT Back Screening Tool Across Different Musculoskeletal. pain conditions.

Pre-exercise stretching does not prevent lower limb running injuries.

Effectiveness of Low Back Pain Manipulative Therapy in Combination with Physical Therapy as Compared to Standard Physical Therapy

Systematic Reviews. Simon Gates 8 March 2007

Can Audit and Feedback Reduce Antibiotic Prescribing in Dentistry?

PROSPERO International prospective register of systematic reviews

GATE CAT Intervention RCT/Cohort Studies

MUSCULOSKELETAL CALCULATOR 42,103. 1in6 SUMMARY. Second Local Authority Bulletin Prevalence of back pain in England and Wolverhampton

Does maintained Spinal manipulation therapy for chronic non-specific. low back pain result in better long term outcome? ACCEPTED

RATING OF A RESEARCH PAPER. By: Neti Juniarti, S.Kp., M.Kes., MNurs

Informed Consent Form

Combined chiropractic interventions for low-back pain (Review)

Does the Manual Therapy Technique Matter?

ACR OA Guideline Development Process Knee and Hip

Systematic Review of Diagnostic Utility and Therapeutic Effectiveness of Thoracic Facet Joint Interventions

Critical appraisal: Systematic Review & Meta-analysis

Allergen immunotherapy for the treatment of allergic rhinitis and/or asthma

Guidelines for Reporting Non-Randomised Studies

Problem solving therapy

Fracture REduction Evaluation (FREE) Study

Transcription:

SPINE Volume 38, Number 9, pp 762 769 2013, Lippincott Williams & Wilkins LITERATURE REVIEW Systematic Review of Randomized Controlled Trials of Clinical Prediction Rules for Physical Therapy in Low Back Pain Shilpa Patel, C. Psychol, * Tim Friede, PhD, Robert Froud, PhD, David W. Evans, PhD, * and Martin Underwood, MD * Study Design. Systematic review. Objective. To evaluate randomized controlled trials validating the effects of a clinical prediction rule for patients with non-specific low back pain (LBP). The outcomes of interest were any back pain or pain-related measures. Summary of Background Data. LBP is a common and costly condition. Interventions for back pain seem to have, at best, small to moderate mean beneficial effects. Identifying subgroups of patients who may respond better to certain treatments may help to improve clinical outcomes in back pain. The development of clinical prediction rules is an attempt to determine who will respond best to certain treatments. Methods. We conducted electronic searches of MEDLINE (1980 2009), EMBASE (1980 2009), PsycINFO (1980 2009), Allied and Complementary Medicine (1980 2009), PubMed (1980 2009), ISI Web of Knowledge (1980 2009), and the Cochrane Library (1980 2009). The reference lists of relevant articles were searched for further references. Results. We identified 1821 potential citations; 3 articles were included. The results from the available data do not support the use of clinical prediction rules in the management of non-specific LBP. From the * University of Warwick, Warwick Clinical Trials Unit, Warwick Medical School, Coventry, United Kingdom ; Department of Medical Statistics, University Medical Centre Göttingen, Göttingen, Germany ; Centre for Primary Care and Public Health, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom ; and University College of Health Sciences, Campus Kristiania, Oslo, Norway. Acknowledgment date: May 9, 2012. First revision date: October 24, 2012. Acceptance date: October 25, 2012. The manuscript submitted does not contain information about medical device(s)/drug(s). This project benefited from facilities funded through Birmingham Science City Translational Medicine Clinical Research and infrastructure Trials platform, with support from Advantage West Midlands. Relevant financial activities outside the submitted work: consultancy, stock/ stock options, travel/accommodations/meeting expenses. Address correspondence and reprint requests to Shilpa Patel, University of Warwick, Clinical Trials Unit, Warwick Medical School, Gibbet Hill Road, Coventry, CV4 7AL, West Midlands, United Kingdom; E-mail: shilpa.patel@ warwick.ac.uk Conclusion. There is a lack of good quality randomized controlled trials validating the effects of a clinical prediction rule for LBP. Furthermore, there is no agreement on appropriate methodology for the validation and impact analysis. The evidence for, and development of, the existing prediction rules is generally weak. Key words: clinical prediction rules, back pain, subgroups, predictors of response to treatment, systematic review. Spine 2013 ; 38 : 762 769 Level of Evidence: 1 Low back pain (LBP) is a common and costly condition. 1, 2 Effective interventions for LBP have, at best, small to moderate effects, when averaged over populations. 3 6 The true potential value of these interventions might have been underestimated because most trials encompass all people with non-specific LBP as a single group, assuming homogeneity. 7 11 The identification of subgroups is an important research priority. 8, 12, 13 There is growing interest in the development and use of clinical prediction rules (CPRs) in the physical therapy literature, where focus has been on using such rules to 14 18 determine who will best respond to a given intervention. It is likely that outcomes will be improved if subgroups of patients with LBP could be identified and better matched to 19, 20 treatment. Clinical prediction rules are defined as: the process by which combinations of clinical findings that have been statistically demonstrated to be meaningful predictors of a condition or outcome of interest are used to categorize a heterogeneous group of patients into subgroups based on a shared likelihood of the presence of that condition or outcome. 7 CPRs can be useful in determining prognosis, assessing the likelihood of the presence or absence of a condition, and to help classify patients into groups more likely to benefit from treatment. Strictly speaking the latter is not covered by the definition given above, but for the purpose of this review we take a wider view. They can help with screening patients to decide when further investigations are likely or unlikely to DOI: 10.1097/BRS.0b013e31827b158f 21 23 yield meaningful findings. 762 www.spinejournal.com April 2013

Developing a CPR should be a 3-step process of derivation, validation, and impact analysis. 24 26 During the derivation stage, it is important to identify known individual predictors of response to treatment. For validation, external validity should be examined by applying the rule to different settings, patients, and clinicians before generalizing. 26, 27 Unfortunately, validation studies are rarely reported in the literature. 28 If these tools are well-designed and validated in appropriate populations, they will have the potential to identify those patients most likely to benefit from a particular treatment, which in turn will help improve clinical decision making and practice. 29 There is currently considerable interest in reviewing the literature on CPRs for back pain. This largely focuses on synthesizing the outcomes of the original studies. If, however, the underpinning research methods used are not robust, little weight can be given to the conclusions. Here, we report a systematic review evaluating validation studies of CPRs for patients with LBP using randomized controlled trials (RCTs). Our outcomes of interest were any back pain or pain-related measures. We provide a methodological and statistical critique of the included articles, different to articles already published in this area. MATERIALS AND METHODS Inclusion Criteria We included RCTs that validated the effects of a CPR. We defined a CPR as any clinical tool with various components drawn from the history, examination, and laboratory tests used to inform treatment choices. We included studies of non-specific LBP, that is, we excluded studies concerned with diagnosis and management of malignancy, infection, fracture, or inflammatory disorders ( e.g., ankylosing spondylitis). Trials were of interventions to treat LBP of any duration with outcomes of pain, disability, and psychological distress. The age of participants in trials was restricted to adults (18 yr or older). Only articles published in English were included. Studies described as RCTs that did not allow the performance of the CPR to be assessed using random allocation were excluded. We excluded studies of surgical techniques and derivation studies (studies prospectively examining the predictive ability of selected variables for a CPR). Search Strategy We searched the following electronic databases from 1980 to 2009: MEDLINE, EMBASE, PsycINFO, AMED, PubMed, ISI Web of Knowledge, and the Cochrane Library. We used MeSH (Medical Subject Heading) terms to identify articles containing material relevant to back pain and low back pain. We combined these using Boolean operators with keyword terms subgroup or subgroups or classification or criteria or diagnostic criteria or rule or rules or decision rules or prediction rule or clinical predictors. Databases were limited to RCTs. We also searched reference lists of relevant articles for further citations. The validation studies included in a recent review of CPRs for Figure 1. Describes the study selection process for the systematic review. the management of LBP were assessed for inclusion in this review. 30 Inclusion Process Two reviewers independently reviewed titles and abstracts of citations identified from the electronic searches ( Figure 1 ). We found that the inter-observer reliability for screening titles and abstracts was poor; we therefore used a third reviewer to screen all titles and abstracts. The third reviewer screened the titles and abstracts and developed a list of the potentially included articles. These were compared with those selected by reviewers 1 and 2. Those that matched were included and those that did not were discussed by all 3 reviewers to reach a consensus. The agreed upon full articles were obtained and reviewed by 2 reviewers. SP and RF independently extracted data and assessed the quality of each article. We extracted data on the derivation and composition of the CPR tested, the treatments being compared, and the study design ( Table 1 ). All disagreements over data extraction and quality assessment were resolved by means of discussion. Quality Assessment We used the Pincus 31 quality assessment tool for studies of treatment moderators. Each study was assessed for quality independently by 2 reviewers. Studies that met all 5 criteria were classified as providing confirmatory evidence, those complying with criteria 3, 4, and 5 were classified as providing Spine www.spinejournal.com 763

exploratory evidence. All other studies were classified as providing insufficient evidence ( Table 2 ). RESULTS We identified 1821 citations for possible inclusion from bibliographic searches. We obtained 35 full texts for detailed inspection. Following examination and discussion, 32 articles were excluded, 27 articles were not validating the effects of a prediction rule, 3 presented results from mixed samples, 1 study was not randomized, and 1 study recruited some participants under the age of 18 years. Subsequently, 3 studies were included in this systematic review ( Table 1 ). Statistical pooling of the data was not performed because of the small number of studies found as well as heterogeneity in the interventions applied, outcome measures collected, and populations from which samples were drawn. Brennan et al 32 reported a 3-armed trial of manipulation, stabilization, and exercise. The CPR used was on the basis of work by Fritz et al. 33 They examined the inter-rater reliability of individual examination items for a classification decision-making algorithm. At baseline history and physical examination, data were collected before randomizing patients to one of the 3 treatments. After completion of the study, the baseline signs and symptoms were used by 2 physical therapists to decide which subgroup the patient fitted and a third therapist was consulted if agreement could not be reached. Patients were then classified as matched (if they received the treatment that matched their subgroup classification) or unmatched (if they received a treatment different to their subgroup classification). The results were analyzed using a 3-way interaction between randomized treatment, classification, and time that was found to be significant, whereas both the 2-way interaction between randomized treatment and time and that between classification subgroup and time were not statistically significant. The authors reported greater change in the Oswestry Disability Questionnaire for matched subjects than unmatched both at 4-week and 1-year followup. Childs et al 14 used a CPR in a 2-arm trial of spinal manipulation and exercise. The rule had previously been developed in a prospective cohort study with a small sample of participants from an army medical center. 17 The prediction rule identified patients with LBP who had a good prognosis when treated with spinal manipulation, based on 5 clinical factors. Data on these 5 factors were collected at baseline by a physical therapist blind to the participant s treatment group. Those participants who met 4 or more of the 5 criteria were classified as positive and therefore likely to respond to manipulation. Those classified with 3 or fewer positive criteria were classified as negative. The authors tested for a 3-way interaction between patients status on the rule, treatment group, and time. The authors concluded that those positive on the prediction rule that received a form of lumbosacral spinal manipulation had the greatest treatment benefit at 4- and 6-month follow-up. Hancock et al 34 evaluated the spinal manipulation rule used by Childs et al 14 to assess generalizibility of setting and sample of patients receiving spinal manipulation. They carried out a preplanned analysis of data from an attention-controlled trial of spinal manipulation therapy. After collection of baseline data, participants were randomized to one of 4 groups. Participants were assessed on the 5 criteria to determine status on the prediction rule. A researcher who was blinded to allocation collected data on 2 of the 5 criteria, the treating physiotherapist collected data on the remaining 3 criteria. The authors conducted a 3-way interaction between a patient s status on the rule, treatment group, and time, which was not statistically significant at the 5% level. The authors concluded no clinically worthwhile interaction effects between treatment group and status on the rule for either pain or disability at any time point. Critique of Included Articles Brennan et al 32 adopted a pragmatic approach to treatment progression, patients moved to the second subacute stage if they achieved a predefined reduction in their Oswestry Disability Index score. For these patients the therapist could select only those treatments permitted based on the patient s original treatment group. Therapists were permitted to use their own clinical judgment to determine exercise dosage for individual patients. Although this is more representative of normal clinical practice, it is difficult to determine confounding factors that may have an effect on response to treatment. Statistically, comparing those that are classified as matched with those that are unmatched is likely to produce larger effects than if comparing those randomized with the different treatments because some participants will receive the correct treatment by chance. A closer inspection reveals that the statistical analysis used by Brennan seems not to have included a test for an interaction between all subgroups and treatment, making it difficult to conclude which subgroup of patients would respond best to which treatment. In addition quality assessment of this article provided inconclusive evidence. In the analysis conducted by Childs et al, 14 the authors do test for a 3-way interaction between CPR, treatment group, and time and concluded a positive effective for the CPR. However, the CPR has been criticized as comprising of items that would ordinarily be associated with a favorable prognosis, 35, 36 which challenges the usefulness of such a rule in clinical practice. Despite their positive result we think that the methods used to develop this rule are not robust and the items included in the rule select those that would get better ordinarily. The quality assessment of this article provided inconclusive evidence. Hancock et al, 34 when independently assessing the findings reported by Childs et al, 14 found no significant interactions between the treatment group and status on the rule for either pain or disability. They found that a positive status on the rule predicted better prognosis regardless of treatment received for pain and disability at 2 and 12 weeks. Importantly, because Hancock et al 34 did not find a difference between intervention and control treatments, it would be unlikely that there would be an important interaction between CPR and treatment group unless it was hypothesized the intervention was harmful for at least some individuals. These results do not 764 www.spinejournal.com April 2013

TABLE 1. Summary of the Included Studies Author Setting and Inclusion Criteria Sample Size Baseline Data Brennan et al 32 Clinics in Utah. Patients aged 18 to 65 yr, LBP of < 90 d, with or without referral into lower extremity, and an OSW score of 25% 123 History and standardized physical examination. Demographic data, current pain intensity, FABQ/, modified OSW Childs et al 14 Clinics mainly with facilities within the US Air Force. Patients aged 18 to 60 yr, LBP with or without referral into lower extremity, and an OSW score of at least 30% 131 History and standardized physical examination. Demographic data, body diagram to assess symptoms, current pain intensity, and best and worst pain in last 24 hr, FABQ, modified OSW Hancock Primary care. et al 34 Patients with LBP of < 6 wk, pain between the 12th rib and buttock crease causing moderate pain and moderate disability 239 Demographics, history, RMDQ, FABQ, catastrophizing, selfstatement, coping self-statement, numerical pain rating scale, patient specific functional scale. Treatments Being Compared Manipulation vs. specific exercise vs. stabilization Spinal manipulation plus an exercise program vs. exercise program alone PL SMT & PL diclofenac vs PL SMT & AT diclofenac vs AT SMT & PL diclofenac vs AT SMT & AT diclofenac 4-arm trial Details of Clinical Prediction Rule Retrospectively baseline data was used to classify them into one of 3 groups: manipulation, specific exercise, or stabilization (classification was based on previous work by Fritz et al 34 ) by 2 therapists blind to the treatment group assignment. If agreement was not reached a third therapist was consulted Patients were examined on the basis of the 5 criteria for spinal manipulation (Flynn et al 17 ) after randomization, therapists were not instructed in the rules criteria and were unaware of the patients status on the rule. After completion of the trial an examiner blinded to the patients treatment assignment determined patients status on the rule using baseline data (positive = 4/5 criteria met, negative = 3 or fewer) Classification based on Childs et al 14 spinal manipulation clinical prediction rule initially developed by Flynn et al 17 Outcome Measure and Follow-up OSW at 4 wk after baseline assessment. Long-term follow-up at 1 yr OSW at 1 wk. A therapist blinded to the patients status on the rule repeated the history and physical examination 1 and 4 wk after randomization. 6-mo followup questionnaire Pain (11 point scale) and disability (RMDQ) at 1, 2, 4, and 12 wk Authors Results and Conclusions 3-way interaction between randomized treatment, classification and time significant. Greater changed in OSW in matched than unmatched for both shortand long-term follow-up. 2 way interactions not significant. 4 wk treatment effect 0.4 3-way interaction between status on rule, treatment group, and time significant. Patients positive on the rule that received spinal manipulation had great improvement in pain and disability at 1 and 4 wk and 6- mo follow-up than positive on the rule and got exercise or negative on the rule and received spinal manipulation 3-way interaction between treatment group, clinical prediction rule status and time not significant for pain or disability. No significant interactions between treatment group and status on rule for either pain or disability. Positive status on rule predicted better prognosis regardless of treatment received for pain and disability at 2 and 12 wk Quality Assessment Inconclusive evidence Inconclusive evidence Providing exploratory evidence (Continued ) Spine www.spinejournal.com 765

TABLE 1. (Continued ) Quality Assessment Authors Results and Conclusions Outcome Measure and Follow-up Details of Clinical Prediction Rule Treatments Being Compared Baseline Data Sample Size Setting and Inclusion Criteria Author ( + ) or (-) on the rule. Researcher blinded to treatment allocation collected data on 2 items (FABQ work subscale and duration of current episode). Treating therapist collected data on the remain 3 criteria in the initial assessment LBP indicates low back pain; FABQ, Fear Avoidance Belief Questionnaire; OSW, Oswestry; RMDQ, Roland and Morris Disability Questionnaire; PL, placebo; SMT, spinal manipulative therapy; AT, Active. support the wider use of the Childs rule and we agree that this conclusion is supported by the analysis of results from the Hancock trial. The differences in findings between Hancock et al 34 and Childs et al 14 can be attributed to a variation on the type and application of spinal manipulation, the setting, and the patient population. More specifically, the spinal manipulation delivered in both studies was not the same. Hancock et al 34 reported baseline pain and disability scores between the spinal manipulation groups and placebo group, and these were small and not statistically significant. Therefore, testing for an interaction effect may be misleading. This was the only trial providing exploratory evidence. DISCUSSION In this review, we focused on evaluating validation studies of CPRs for LBP using RCTs. We have therefore not included studies looking specifically at the derivation of rules. Two systematic reviews, published after we had completed our review, support our conclusions. The first, examined the validity of CPRs and concluded that derivation studies were mainly of high quality, whereas the cross-sectional validation studies were weak, limiting application of the rules in clinical practice. 37 The second, a more recent review of CPRs for the management of LBP conclude the current evidence does not support the clinical application of these rules. 30 Our findings add to this work by demonstrating that the evidence from randomized trials validating CPRs for non-specific LBP is weak. Only 3 articles met our inclusion criteria, 2 of which were small studies with a total sample of less than 150 patients. In all cases, the prediction rule had been developed in small selected populations compromising external validity. Even the largest of these studies (Hancock et al, 25 n = 239) is likely to be too small to validate a CPR adequately, and thus their negative findings are unlikely to be robust due to the possibility of type II error; indeed one of the challenges to performing interaction tests is having adequate statistical power. After completing the work for this review, the results of the STarT back trial, a well conducted RCT of subgrouping and targeted treatment for patients with LBP, which achieved a positive result, were published. 38 This trial would not have met our inclusion criteria as the actual subgrouping tool was not being tested but rather the targeted treatments were being tested. To test the targeting tool the trial design would need to randomize patients to receive the tool versus no tool, with those receiving the tool being allocated to treatment and those without being randomly assigned to treatment. This trial design would allow for the statistical comparison of tool versus no tool, giving a better indication of the tools ability to subgroup patients to targeted treatments. A systematic review by Haskins et al 30 excluded the article by Brennan et al, 32 in which a classification approach was used. We have included this article in our review as we have used a slightly different definition of CPRs that allows trials using classification and categorization to be included. We have excluded the article by Cleland et al, 39 which was included in this recent review because the authors only included patients 766 www.spinejournal.com April 2013

TABLE 2. Methodological Quality of Included Articles Question Brennan Childs Hancock 1. Was the subgroup analysis specified a priori? Yes Yes Preplanned secondary analysis 2. Was the selection of subgroup factors for analysis theory/evidence driven? 3. Were subgroup factors measured prior to randomization? 4. Were subgroup factors measured by adequate (reliable and valid) measurements, appropriate for the target population? 5. Does the analysis contain an explicit test of the interaction between moderator and treatment? Yes based on work by Fritz et al 32 Yes- based on work by Flynn et al 16 Yes independent evaluation of work by Childs et al 14 whose rule was based on Flynn et al 17 Yes Yes Yes Yes Yes Yes No No Yes positive on the CPR, therefore not allowing for the rule to be validated appropriately. Kent et al 40 conclude that treatments targeted to subgroups may be effective; however, the results of the studies need to be interpreted with caution. In their review they include a trial by Long et al, 41 which we have excluded from our review because the authors include patients only with a directional preference and exclude those without, therefore not allowing the clinical prediction rule to be tested appropriately. The discussions within the review by Kent et al 11 focus on the effect sizes of the included trials and significance of this. In our article we focus on the authors ability to test a clinical prediction rule effectively in a RCT. We present a methodological and statistical critique, different than that presented by Kent et al. 11 The development of the CPRs tested in these studies lacked methodological rigor. 37 It remains unclear as to which candidate domains should be included in a prediction rule for LBP. It seems that identification of such domains, or indeed concluding that such domains cannot be identified will require a substantial study with an a priori design. To develop a rule that enables clinicians to choose between interventions, the developmental work of derivation and validation needs to be thoroughly designed and systematically validated. The methodology for quality assessing studies of CPRs is poorly developed. We used a tool developed for a different study of moderators. 31 Based on these criteria only one of the included studies provides exploratory evidence, none fall within the remit of confirmatory evidence ( Table 2 ). The focus of this review was to critique the statistical methods used for the testing of a CPR. Therefore, we did not provide any extensive data on the quality of the underpinning trial. If we had found an apparently robust evaluation of a CPR on which changes in clinical practice might be based, then it would be very important to know the quality of the trials by conventional measures. At present, we do not know if the disappointing performance of CPRs in RCTs is because inappropriate rules have been tested, the trials have been poorly designed, underpowered, or indeed whether it is impossible to develop CPRs that are fit for this purpose. Although most of the derivation studies have been of high quality, 37 our review, as well as previous reviews, have found the quality of validation studies to 28, 37 have been poor. In a perspective article, Hancock et al 25 make a number of useful suggestions for future work on predictors of response to treatment which could also be applied to validation of a CPR. Alternatively, a CPR could be validated as the intervention in a RCT. Any such trials are likely to need very large numbers of subjects. The effect size for main treatment effects in current positive LBP pain trials are typically small to moderate. Any interaction effect is unlikely to be greater than the main treatment effect; indeed if it was, it would suggest that for a substantial group of people the intervention was likely to have no positive effect or for an identifiable group to make their condition worse. Thus, as a rule of thumb, trials at least 4 times the size of current large trials of LBP treatments; perhaps 2000 to 3000 participants will be needed to validate CPRs satisfactorily. 42 Ideally, it would be better to have CPRs that can be applied to a varied range of clinical decisions, but this is unlikely. CPRs would be useful in LBP as the treatment effects currently seen in populations remain small; this is probably, at least in part, due to the heterogeneity of the un-subgrouped back pain population. In the physical therapy literature there are currently 10 prediction rule derivation studies that have not been validated. 28 Notwithstanding this, authors continue 43, 44 to cite the effective management of LBP using CPR. There is a need for future well-designed validation studies of these rules to enable better matching of patients to treatment that in turn may lead to better patient outcomes and less health care usage and thus cost to the health care system. The task of developing, validating, and testing such CPRs should not be underestimated. It is difficult to justify the very large cost entailed in taking one CPR through full testing to inform just one treatment choice, therefore our research efforts and our funders resources may be better directed in alternative directions. However, we warn against the application of CPRs without sufficient evidence because patients in need of treatment may be denied treatments that they might have benefited from. Spine www.spinejournal.com 767

Key Points Identifying subgroups of patients with LBP who can be better matched to treatments might help to improve clinical outcomes. The existing evidence for CPRs is weak. Despite this, prediction rules are still being promoted in the physical therapy literature. There is a need for well-designed validation studies of CPRs to enable progression to clinical implementation. Acknowledgment The authors thank Professor Sallie Lamb for her contribution in the early stage of this review. References 1. Luo X, Pietrobon R, Sun SX, et al. Estimates and patterns of direct health care expenditures among individuals with back pain in the United States. Spine 2004 ; 29 : 79 86. 2. Maniadakis N, Gray A. The economic burden of back pain in the UK. Pain 2000 ; 84 : 95 103. 3. Assendelft WJ, Morton SC, Yu EI, et al. Spinal manipulative therapy for low back pain. Cochrane Database Syst Rev 2004 ; 1 : CD000447. 4. van Middelkoop M, Rubinstein SM, Kuijpers T, et al. A systematic review on the effectiveness of physical and rehabilitation interventions for chronic non-specific low back pain. Eur Spine J 2011 ; 20 : 19 39. 5. van Tulder M, Malmivaara A, Esmail R, et al. Exercise therapy for low back pain: a systematic review within the framework of the cochrane collaboration back review group. Spine 2000 ; 25 : 2784 96. 6. van Tulder MW, Ostelo R, Vlaeyen JW, et al. Behavioral treatment for chronic low back pain: a systematic review within the framework of the Cochrane Back Review Group. Spine 2000 ; 25 : 2688 99. 7. Beattie P, Nelson R. Clinical prediction rules: what are they and what do they tell us? Aust J Physiother 2006 ; 52 : 157 63. 8. Borkan JM, Koes B, Reis S, et al. A report from the second international forum for primary care research on low back pain. Reexamining priorities. Spine 1998 ; 23 : 1992 6. 9. Bouter LM, van Tulder MW, Koes BW. Methodologic issues in low back pain research in primary care. Spine 1998 ; 23 : 2014 20. 10. Deyo RA. Treatments for back pain: can we get past trivial effects? Ann Intern Med 2004 ; 141 : 957 8. 11. Kent P, Keating J. Do primary-care clinicians think that non-specific low back pain is one condition? Spine 2004 ; 29 : 1022 31. 12. Bouter LM, Pennick V, Bombardier C. Cochrane back review group. Spine 2003 ; 28 : 1215 8. 13. Foster NE, Dziedzic KS, Windt DAWM, et al. Research priorities for non-pharmacological therapies for common musculoskeletal problems: nationally and internationally agreed recommendations. BMC Musculoskeletal Disord 2009 ; 10 : 3. 14. Childs J, Fritz J, Flynn T, et al. A clinical prediction rule to identify patients with low back pain most likely to benefit from spinal manipulation: a validation study. Ann Intern Med 2004 ; 141 : 920 8. 15. Cleland JA, Childs JD, Fritz JM, et al. Development of a clinical prediction rule for guiding treatment of a subgroup of patients with neck pain: use of thoracic spine manipulation, exercise, and patient education. Phys Ther 2007 ; 87 : 9 23. 16. Fernandez-de-las-Penas C, Cleland JA, Cuadrado ML, et al. Predictor variables for identifying patients with chronic tension-type headache who are likely to achieve short-term success with muscle trigger point therapy. Cephalalgia 2008 ; 28 : 264 75. 17. Flynn T, Fritz J, Whitman J, et al. A clinical prediction rule for classifying patients with low back pain who demonstrate shortterm improvement with spinal manipulation. Spine 2002 ; 27 : 2835 43. 18. Iverson CA, Sutlive TG, Crowell MS, et al. Lumbopelvic manipulation for the treatment of patients with patellofemoral pain syndrome: development of a clinical prediction rule. J Orthop Sports Phys Ther 2008 ; 38 : 297 309. 19. Froud R, Eldridge S, Lall R, et al. Estimating the number needed to treat from continuous outcomes in randomised controlled trials: methodological challenges and worked example using data from the UK Back Pain Exercise and Manipulation (BEAM) trial. BMC Med Res Methodol 2009 ; 9 : 35. 20. Kraemer HC, Frank E, Kupfer DJ. Moderators of treatment outcomes: clinical, research, and policy importance. JAMA 2006 ; 296 : 1286 9. 21. Bachmann LM, Kolb E, Koller MT, et al. Accuracy of Ottawa ankle rules to exclude fractures of the ankle and mid-foot: systematic review. BMJ 2003 ; 326 : 417. 22. Carragee EJ, Hannibal M. Diagnostic evaluation of low back pain. Orthop Clin N Am 2004 ; 35 : 7 16. 23. Mauck KF, Cuddihy MT, Atkinson EJ, et al. Use of clinical prediction rules in detecting osteoporosis in a population-based sample of postmenopausal women. Arch Intern Med 2005 ; 165 : 530 6. 24. Childs JD, Cleland JA. Development and application of clinical prediction rules to improve decision making in physical therapist practice. Phys Ther 2006 ; 86 : 122 31. 25. Hancock M, Herbert RD, Maher CG. A guide to interpretation of studies investigating subgroups of responders to physical therapy interventions. Phys Ther 2009 ; 89 : 698 704. 26. McGinn TG, Guyatt GH, Wyer PC, et al. Users guides to the medical literature: XXII: how to use articles about clinical decision rules. Evidence-Based Medicine Working Group. JAMA 2000 ; 284 : 79 84. 27. Reilly BM, Evans AT. Translating clinical research into clinical practice: impact of using prediction rules to make decisions. Ann Intern Med 2006 ; 144 : 201 9. 28. Beneciuk JM, Bishop MD, George SZ. Clinical prediction rules for physical therapy interventions: a systematic review. Phys Ther 2009 ; 89 : 114 24. 29. Cook CE. Potential pitfalls of clinical prediction rules. J Man Manip Ther 2008 ; 16 : 69 71. 30. Haskins R, Rivett DA, Osmotherly PG. Clinical prediction rules in the physiotherapy management of low back pain: a systematic review. Man Ther 2012 ; 17 : 9 21. Epub 2011 June. 31. Pincus T, Miles C, Froud R, et al. Methodological criteria for the assessment of moderators in systematic reviews of randomised controlled trials: a consensus study. BMC Med Res Methodol 2011 ; 11 : 14. 32. Brennan GP, Fritz JM, Hunter SJ, et al. Identifying subgroups of patients with acute/subacute non-specific low back pain: results of a randomized clinical trial. Spine 2006 ; 31 : 623 31. 33. Fritz JM, Brennan GP, Clifford SN, et al. An examination of the reliability of a classification algorithm for subgrouping patients with low back pain. Spine 2006 ; 31 : 77 82. 34. Hancock MJ, Maher CG, Latimer J, et al. Independent evaluation of a clinical prediction rule for spinal manipulative therapy: a randomised controlled trial. Eur Spine J 2008 ; 17 : 936 43. 35. Bekkering GE, Hendriks HJ, van Tulder MW, et al. Prognostic factors for low back pain in patients referred for physiotherapy: comparing outcomes and varying modeling techniques. Spine 2005 ; 30 : 1881 6. 36. Croft PR, Dunn KM, Raspe H. Course and prognosis of back pain in primary care: the epidemiological perspective. Pain 2006 ; 122 : 1 3. 37. May S, Rosedale R. Prescriptive clinical prediction rules in back pain research: a systematic review. J Man Manip Ther 2009 ; 17 : 36 45. 38. Hill JC, Whitehurst DG, Lewis M, et al. Comparison of stratified primary care management for low back pain with current best practice (STarT Back): a randomised controlled trial. Lancet 2011 ; 378 : 1560 71. 768 www.spinejournal.com April 2013

39. Cleland JA, Fritz JM, Kulig K, et al. Comparison of the effectiveness of three manual physical therapy techniques in a subgroup of patients with low back pain who satisfy a clinical prediction rule. A randomized clinical trial. Spine 2009 ; 34 : 2720 9. 40. Kent P, Mjøsund HL, Petersen DH. Does targeting manual therapy and/or exercise improve patient outcomes in non-specific low back pain? A systematic review. BMC Med 2010 ; 8 : 22. 41. Long A, Donelson R, Fung T. Does it matter which exercise? A randomized control trial of exercise for low back pain. Spine 2004 ; 29 : 2593 602. 42. Brookes ST, Whitley E, Peters TJ, et al. Subgroup analyses in randomised controlled trials: quantifying the risks of false-positives and false-negatives. Health Technol Assess 2001 ; 5 : 1 56. 43. Bialosky JE, Bishop MD, Robinson ME, et al. Spinal manipulative therapy has an immediate effect on thermal pain sensitivity in people with low back pain: a randomized controlled trial. Phys Ther 2009 ; 89 : 1292 303. 44. George SZ, Zeppieri G Jr, Cere AL, et al. A randomized trial of behavioral physical therapy interventions for acute and sub-acute low back pain (NCT00373867). Pain 2008 ; 140 : 145 57. Spine www.spinejournal.com 769