The Test-Retest Reliability and Minimal Detectable Change of the Balance Error Scoring System

Similar documents
Instant Poll. Do you manage sport related concussions as a part of your medical practice?

The Reliability of the Modified Balance Error Scoring System

Pre and Post Concussion Management

A THESIS SUBMITTED TO THE GRADUATE DIVISION OF THE UNIVERSITY OF HAWAI I AT MĀNOA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF

Test-retest Reliability of the NeuroCom VSR Sport in Division I Collegiate Female Soccer Players

HEADACHE DOES NOT PREDICT SIDELINE NEUROSTATUS OR BALANCE IN HIGH SCHOOL FOOTBALL PLAYERS MATTHEW WAYNE NOHREN THESIS

Influence of Testing Environment on Balance Error Scoring System Performance

KINETISENSE 3D CONCUSSION WHITE PAPER

Psychometric and Measurement Properties of Concussion Assessment Tools in Youth Sports

Landing development: A first look at young children

Participants: Male collegiate athletes, male and female high school athletes. add the # of subjects.

The Effectiveness of a Dual Task Concussion Assessment for Identifying Impairments in Concussed Athletes

Inter- and Intra-rater Reliabililty of a Modified Approach to Scoring the Balance Error Scoring System BESS in Undistracted and Distracted Conditions

The Test- Retest Reliability and Minimal Detectable Change of the Sensory Organization Test and Head- Shake Sensory Organization Test

One-Month Test Retest Reliability of the ImPACT Test Battery

Concussion: The Basics. Bill Meehan, MD Micheli Center for Sports Injury Prevention Sports Concussion Clinic, Boston Children s Hospital

Baseline Concussion Testing: The Effects of Learning Disabilities and Sleep

Relationship between Body Core Stabilization and Athletic Function in. Football, Basketball and Swimming Athletes

Comparison of static and dynamic balance among male Amateur basketball players

Tolland High School. Concussion Procedure Manual

Scoring Sway Balance. Scoring System and BESS Comparison. October 2 nd, Copyright 2013 Sway Medical LLC

Knee-Joint Position Sense: The Relationship Between Open and Closed Kinetic Chain Tests

Even if cleared by a physician

Reliability of two dimensional video assessment of frontal plane dynamic knee valgus during common athletic screening tasks

Current Trends in Athletic Training Practice for Concussion Assessment and Management

The Need for Baseline Concussion Testing. Brandie Messer, DNP, RN DNP Coordinator Assistant Professor Saint Anthony College of Nursing

Version of record first published: 25 Apr 2012.

PEDIATRIC SPORTS RELATED CONCUSSIONS

Return of Postural Control to Baseline After Anaerobic and Aerobic Exercise Protocols

IOWA STATE UNIVERSITY SPORTS MEDICINE STANDARD OPERATING PROCEDURES FOR THE ATHLETIC TRAINER MEDICAL EMERGENCIES Cerebral Concussions

Test Retest Reliability of Computerized Neurocognitive Testing in Youth Ice Hockey Players

Triple-Hop Distance as a Valid Predictor of Lower Limb Strength and Power

Case Study. Assessment of Mild Head Injury Using Measures of Balance and Cognition: A Case Study

Ashley Cameron Littleton

What is concussion? DR MARTIN RAFTERY CHIEF MEDICAL OFFICER NOVEMBER 2016

Concussions and the Athlete Child Neurology of Tulsa Page 1 of 5

David R Howell, Louis R Osternig, Li-Shan Chou Department of Human Physiology, University of Oregon, Eugene, OR, USA

Balance Assessment in the Management of Sport-Related Concussion

Reliability of time to stabilization in single leg standing

Flawless. Mike Cendoma, MS, ATC. Athletic Trainer. Sports Medicine Concepts. Get Your Coffee and Get Comfy. We ll Start in Just A Few!

Current Concepts in Sports Concussion: Opportunities for the Physical Therapist. Concussion in Sport

performance of 40 subjects on the single-leg triple hop (SLTH), with quadriceps (Quad 60,

The Clinical Advantage

MEASUREMENT OF VALIDITY FOR BALANCE ASSESSMENTS USING A MODIFIED CTSIB SWAY INDEX VERSUS A BIODEX SWAY INDEX. A Thesis.

Effects of Badminton Training on Physical Parameters of Players

The effects of multiple head micro-trauma on cognitive function in high school football and soccer athletes

The University of North Carolina at Chapel Hill Sport Concussion Policy

Effect of proprioceptive training along with breathing and imagery on performance of gymnasts at various levels

Effects of a Single-Task Versus a Dual-Task Paradigm on Cognition and Balance in Healthy Subjects

EFFECTIVENESS OF NEUROMUSCULAR TRAINING IN YOUNG ATHLETES AT RISK OF LOWER EXTREMITY MUSCULOSKELETAL INJURIES

The Effects of Fatigue on a Dual-Task Postural Control Measure

Two-year Test Retest Reliability of ImPACT in High School Athletes

COMPARISON OF THE BALANCE ERROR SCORING SYSTEM AND THE NEUROCOM SENSORY ORGANIZATION TEST IN HEALTHY, PHYSICALLY ACTIVE ADULTS

When are athletes ready for return to sports??? Functional Testing for Return to Sports. Important Factors Involved in Return to Sport

Sports Concussion After the Injury. Carol Scott, MD UNR Student Health Center Sports Medicine September 22, 2016

BALANCE AND FUNCTIONAL ABILITIES ASSESSMENT AND INTERVENTION

Mind-body Balance Training for Special Populations

Natus Medical Incorporated is the sole designer, manufacturer, and seller of the following systems:

Summary of evidence-based guideline update: Evaluation and management of concussion in sports

Reliability and Concurrent Validity of Select C3 Logix Test Components

Noncontact Anterior Cruciate Ligament Injuries in Collegiate Female Soccer Players: The Effects of a 4-Week Prevention Program on Landing Kinematics

The Changing Landscape of Sports Concussions

Concussions in Sport Definitions, Mechanisms, and Current Issues

lntertester and lntratester Reliability of a Dynamic Balance Protocol Using the Biodex Stability System

THE PSYCHOMETRIC PROPERTIES OF CONCUSSION ASSESSMENT TOOLS IN HIGH SCHOOL ATHLETICS TAMERAH NICOLE HUNT. (Under the Direction of Michael S.

A Prospective Evaluation of the Landing Error Scoring System (LESS) as a Screening Tool for Anterior Cruciate Ligament Injury Risk

Concussion Information

BELMONT ABBEY COLLEGE ATHLETIC TRAINING CONCUSSION MANAGEMENT PLAN. Revised September 7, 2016

Concussion in the Young Athlete:

ADHD and Concussion. Mary Alexis Iaccarino, MD

Emerging Topics in Concussion Treatment: Lower Extremity Injury and Active Recovery

Is There An Increased Risk Of Subsequent Musculo-skeletal Injury Following A Concussion?

THE EFFECTS OF PHYSICAL AND COGNITIVE ACTIVITY LEVELS POST- CONCUSSION ON RECOVERY. Ashley Cameron Littleton

A preliminary investigation into a qualitative assessment tool to identify. Movement Assessment Score (CMAS)

THE RELATIONSHIP BETWEEN SUBCONCUSSIVE IMPACTS AND CONCUSSION HISTORY ON CLINICAL MEASURES OF CONCUSSION IN COLLEGIATE FOOTBALL PLAYERS

Coastal Carolina University Athletic Training Department Policy and Procedure Manual Concussion Management Revised/Reviewed 6/2012

Concussion Management: A Multidimensional & Interdisciplinary Approach

A COMPARISON STUDY: BALANCE ERROR SCORING SYSTEM USING REAL-TIME AND SLOW-MOTION VIDEO PLAYBACK. A Thesis by. Danielle C. Stern

Concussion Management

Sport Concussion Updates:

Concussion and neuropsychological testing (literature search) Dr Ryan Kohler December 2011

STEP HURDLE LADDER IMPORTANT INFORMATION PLEASE READ AND KEEP DISCLAIMER

Concussion guidance. Introduction CONCUSSION FACTS

COS SPORTS MEDICINE CONCUSSION ASSESSMENT FORM BASELINE TEST Adapted from SCAT3

FIFA 11 + WARM-UP TORONTO HIGH PARK FOOTBALL CLUB

The Effectiveness of Balance Training Programs on Reducing the Incidence of Ankle Sprains in Adolescent Athletes

Webinars on Demand 2017

Overview The BBS is a widely-used, clinician-rated scale used to assess sitting and standing, static and dynamic balance.

Non-Linear Dynamical Analysis of a Visual-Motor Tracking Task for Baseline Testing and Post- Concussion Evaluation of Intercollegiate Athletes

SMART EquiTest. Physical Dimensions. Electrical Characteristics. Components. Performance Characteristics. Accessories Included

Concussion, as that occurring in the context of sports, is

Equipment Stopwatch A clear pathway of at least 10 m (32.8 ft) in length in a designated area over solid flooring 2,3.

BCLFC Prehab U915 s/17s 3 rd December 2013 Jade Moore and Tim Colledge

Diagnostic Tools. Objectives UPDATES AND ADVANCES IN THE DIAGNOSIS AND MANAGEMENT OF SPORTS RELATED CONCUSSION

Davidson College Sports Medicine Concussion Management Policy

The Reliability of Four Different Methods. of Calculating Quadriceps Peak Torque Angle- Specific Torques at 30, 60, and 75

Cognitive Risk in High School. Football: A Pilot Study. By: Bernadette Darcy

Journal of Sport Rehabilitation. The reliability of strength tests performed in elevated shoulder positions using a hand-held dynamometer

The Star Excursion Balance Test (SEBT) is a unilateral, UNILATERAL BALANCE PERFORMANCE IN FEMALE COLLEGIATE SOCCER ATHLETES

CORE MEASURE: CORE MEASURE: BERG BALANCE SCALE (BBS)

Transcription:

Journal of Sports Science 2 (2014) 200-207 doi: 10.17265/2332-7839/2014.04.005 D DAVID PUBLISHING The Test-Retest Reliability and Minimal Detectable Change of the Balance Error Scoring System Daniel James Amin 1, Jaclyn Coleman 2, and Lee C. Herrington 3 1. Department of Sport, University Centre Doncaster, Doncaster, South Yorkshire DN57SZ, United Kingdom 2. Physiotherapy Department, Heywood, Middleton & Rochdale Primary Care Trust, Rochdale, Lancashire OL161XU, United Kingdom 3. College of Health and Social Care, University of Salford, Salford, Lancashire M66PU, United Kingdom Abstract: Context: Balance is an important aspect of sports performance, which can be impaired following MHI (minor head injury). The BESS (balance error scoring system) is a tool commonly used for assessment of post-mhi recovery. Objective: This study aimed to ascertain test-retest reliability and minimal detectable change of the BESS in order to ensurem that it had appropriate clinical levels of reliability and sensitivity. Design: Test-retest reliability. Setting: University indoor sport facility. Participants: N = 36 sub-elite athletes. Interventions: The BESS was undertaken by the participants on two separate testing sessions, 1 week apart. Main Outcome Measures: Test-retest reliability and minimal detectable change. Results: Strong test-retest reliability of the total BESS score was identified (ICC (intraclass correlation coefficients) = 0.784) with a MDC (minimal detectable change) of 6-10 error points, dependent on confidence interval used. Conclusions: The strong reliability score and the relatively low MDC when using conservative measures would recommend the use of BESS in post-mhi assessment of young academy-level athletes. However, more research is required to further determine BESS MDC in both sub-elite and elite groups of athletic performers so as to provide clinicians with an accurate guideline regarding whether an athlete is presenting with post-mhi balance impairment. Key words: Concussion, balance, reliability, sensitivity. 1. Introduction 1.1 Balance and Minor Head Injury Balance is defined as the ability to maintain a position within the limits of stability or base of support [1]. The factors affecting balance involve somatosensory, visual and vestibular systems as well as musculoskeletal coordination, joint range of motion and strength [1, 2]. MHI (minor head injury) is traumatically induced physiological disruption of brain function, as manifested by at least one of the following: any period of loss of consciousness; any loss of memory for events immediately before or after the accident; any alteration in mental state at the time of the accident (e.g., feeling dazed, disoriented or confused); and focal neurological deficit(s) that may Corresponding author: Daniel James Amin, M.Sc., research field: balance and pedagogy. E-mail: dan.amin@don.ac.uk. or may not be transient [3]. MHI is known to have detrimental effects on sensorimotor functioning of this type, through the impairment of the neuromusculoskeletal and/or sensory systems, and has, therefore, long been associated with loss of balance [3, 4]. Post-MHI, it is important to monitor the athlete s recovery and essential to make a correct return-to-play decision, as it has been reported that, in contact sport, players who sustain an MHI are 3 times more likely to sustain another during the same season [5]. In recent years, various balance assessments have become more ubiquitous when evaluating sport-related MHI [6, 7]. The BESS (Balance Error Scoring System) [8] is one such assessment, which is said to provide an objective method of assessing static balance after MHI [9]. Information obtained from this clinical balance tool can be used to assist clinicians as part of the

The Test-Retest Reliability and Minimal Detectable Change of the Balance Error Scoring System 201 process in making post MHI, return-to-play decisions. Baseline scores are usually ascertained at the beginning of the season when the athlete is healthy, then re-administered post-mhi to measure the athlete s recovery [10]. If an athlete s performance does not improve on subsequent test administrations, then clinicians should assume that an enduring abnormality may be present. BESS scores have been shown to correlate well with force plate stability measures in post-mhi athletes (r = 0.78-0.96) [7]. However, Valovich et al. [9] found that administration of multiple trials of the BESS, spaced two days apart, resulted in a learning effect, with the number of errors decreasing with each consecutive trial. This learning effect can last up to two weeks [11] and should alert clinicians to be cautious when administrating the BESS post-mhi to assess recovery in an athlete, as this could mask the actual improvements due to neurological recovery [10]. 1.2 Reliability and Sensitivity of Clinical Assessment Tools A method of assessment needs to be psychometrically robust, in terms of reliability and sensitivity, for it is to be used with confidence in the clinical setting. The test-retest reliability of an assessment tool is important to know, as it aims to ensure that a tool is reliable and consistent when measuring a clinical trait of a participant. Inconsistencies can be caused by tester error, variations in the testing procedure, or changes in the participant s physical or mental condition [12]. The test-retest reliability is of particular importance when considering that there could be a learning effect present as in the case of the BESS. Previous studies have found the BESS to be classified as moderate in terms of test-retest reliability, when considering young athletes (aged 9-14; intraclass correlations ICC (2, 1) = 0.71) [13] and young adults (mean age: 20.4; G (generalizability) = 0.64) [14]. Identifying a change score is another important psychometric measure as it highlights the sensitivity of a tool and whether a change in score is clinically relevant or simply attributed to measurement error [15]. Various methods have been utilized to calculate change scores, including MCID (minimal clinically important difference), which is determined subjectively or objectively and is the smallest meaningful change, as judged by the patient or experts in the field [16]. RCI (reliability change index) is also used to calculate change scores and assesses intra-individual differences over time [6], and has been investigated in the BESS [13] and other neuropsychological measures of concussion [17]. MDC (minimal detectable change), which is the smallest meaningful change in terms of a quantifiable objective measure of subjective performance, however, is the most commonly used in the literature [16]. Once the MDC is determined on a particular test for a given population, therapists can interpret whether the change score for their patient is at or above this minimal level of detectable change. Scores are due to patient improvement on the test rather than measurement error [18]. In the case of the BESS, MDC would need to be known to ensure that post-mhi rehabilitation is the reason for any changes in BESS score and, conversely, to ascertain whether balance has been affected by the MHI; however, to the knowledge of the authors, no previously published investigations have reported the MDC of the BESS. The objectives of this study, therefore, are to ascertain the level of test-retest reliability of the BESS in order to ensure that it has clinically acceptable reliability and also to identify the MDC, so that a measure of true change can be established. 2. Methods 2.1 Participants Thirty six healthy participants (25 female, 11 male; age = 19.66 ± 2.60 years; height = 170.66 ± 8.25 cm).

202 The Test-Retest Reliability and Minimal Detectable Change of the Balance Error Scoring System Inclusion criteria required that each participant was performing at sub-elite level within their respective sports (19 trampolinists, 11 soccer players, six dancers). The following exclusion criteria were applied: current musculoskeletal or orthopaedic condition affecting lower limb function; any head injury during the previous six months; previous lower limb surgery; any neurological, visual or vestibular condition affecting balance. The procedures followed protocol with the ethical standards of the responsible institutional review board and was granted approval, in the spirit of the Helsinki declaration. 2.2 Procedures The BESS protocol was developed by researchers and clinicians at the University of North Carolina s Sports Medicine Research Laboratory, Chapel Hill, NC 27599-8700 and is readily accessible online. It shall be referred to as the protocol in these procedures. According to the protocol, participants performed 3 barefoot stances: double-leg, single-leg, and tandem. Each stance lasted 20 seconds, whereby the participants attempted to maintain postural stability, and was performed on both a firm surface and on a 47 39 6.5 cm 3 block of medium-density foam (Power Systems Airex Balance Pad 81000, Knoxville, TN.), with eyes closed and hands on the iliac crests. The BESS has set criteria for scoring errors in the form of error points and were counted by the investigator. Each error was scored as one error point and included: opening the eyes; stepping, stumbling, and falling out of the test position; lifting the hands off the iliac crests; lifting the toes or heels; moving the leg into more than 30 of flexion or abduction; and remaining out of the test position for more than 5 seconds [13]. Multiple errors occurring simultaneously were scored as one error point, whereas an inability to maintain the test position for more than 5 seconds was scored as 10 error points. The total number of error points for each of the six testing conditions were calculated and summed for the participant s total BESS error score. A maximum of 10 error points was attributed to each testing condition, thus a maximum total error score of 60 was attributed to the BESS. Participants were instructed on how to undertake each test condition via the script within the protocol and undertook each test 3 times, with a mean being used for further analyses [19]. In order to ensure that the research design focussed on assessing test-retest reliability, participants were re-tested only seven days later in order to try and ensure that a minimal amount of physiological change took place [12] and to mimic the re-test time-frame when assessing athlete s recovering post-mhi. Any participant who had suffered an injury during this time was removed from the investigation completely (three in total). The investigator had four years clinical experience and had received previous training on BESS administration, within three months of undertaking the investigation. 2.3 Statistical Analyses All statistical analyses were conducted using SPSS for Windows Version 16.0. Alpha levels were set at 0.05 for all tests. Independent samples t-tests were carried out to determine any differences between gender and between sports, and intraclass correlation coefficients (ICC (2, 1) ) were carried out for analysis of test-retest reliability of the BESS. ICC (2, 1) was specifically used as, in effect, only one judge, taken at random from a larger population of judges, scored the errors [20]. The data, therefore, were analysed within SPSS using a two-way random, single measures analysis. Absolute agreement was used as the study looked to identify the test-retest reliability when considering each individual participant [21]. Test-retest reliability strength based on ICC values was classified accordingly [22]: weak (< 0.50), moderate (0.50-0.75), strong (> 0.75), ideal (> 0.90). MDC was calculated to

The Test-Retest Reliability and Minimal Detectable Change of the Balance Error Scoring System 203 establish random error and, therefore, a true difference in subsequent scores, using the following formula [16]: MDC = z-score level of confidence SD baseline (2 [1 r test-retest ]) z-score represents the confidence interval from a normal distribution; SD baseline is the standard deviation at baseline; r is the test-retest reliability coefficient; The multiplier of 2 was used to account for the additional uncertainty introduced by using difference scores from measurements at 2 points in time [16]. Some researchers [16, 23] suggested using a confidence interval of 90% due to its use being more common in the literature; however, a confidence interval of 95% increases the precision of score estimation and is also commonly known in the literature as the SDD (smallest detectable difference) [16]. More conservative confidence intervals of 70% and 80% have also been used previously when considering post-concussive balance assessment [13], as such; MDC s using 70%, 80%, 90% and 95% confidence intervals were measured here. 3. Results Independent samples t-tests confirmed no significant differences in total BESS score, on either testing session, between genders or between sport-types (see Table 1). As such, ICC and MDC analyses were undertaken on the whole subject group. Table 2 summarises the key findings of the statistical analyses for test-retest reliability, measurement error and 95% confidence intervals for the various test conditions. ICC (2, 1) values for the individual test conditions and the total error ranged from 0 to 0.823, ICC (2, 1) values were significant in the following conditions total BESS error score, single-leg firm and foam, tandem leg firm and double leg foam. Table 3 highlights the MDC values for the BESS conditions at various levels of confidence. These values ranged from 5.527 to 9.207 raw error points for total BESS error score. Table 1 Significant differences between gender and sport for various test conditions. Test groups Test condition(s) T-value P-value Male vs. female Retest session (tandem, firm) -2.059 0.047 Retest session (double-leg, foam) -2.224 0.034 Initial test session (tandem, firm) -3.122 0.005 Dancers vs. trampolinists Retest session (single-leg, firm) 2.964 0.007 Retest session (double-leg, foam) -2.781 0.012 Initial test session (single-leg, foam) 2.187 0.037 Footballers vs. trampolinists Retest session (tandem, firm) -2.336 0.027 Retest session (double-leg, foam) -2.407 0.025 Retest session (single-leg, foam) 2.252 0.039 Table 2 Values for mean, standard deviation and intraclass correlation coefficient for the various BESS conditions and total error scores. Condition Initial test session Retest session Mean Error Points raw SD Mean error points raw SD ICC (2, 1) Double-leg, firm surface 0 0 0.13 0.34 0 Single-leg, firm surface 2.06 2.06 2.06 1.39 0.484* Tandem-stance, firm surface 1.75 1.66 0.88 1.09 0.635* Double-leg, foam surface 0.81 1.81 0.06 0.25 0.823* Single-leg, foam surface 6.08 2.77 6.63 2.83 0.655* Tandem-stance, foam surface 4.08 2.61 4.88 3.77 0.231 Total error score 14.78 7.69 14.44 6.78 0.784* ICC (2,1) : intraclass correlation coefficient; SD: standard deviation.*p < 0.05.

204 The Test-Retest Reliability and Minimal Detectable Change of the Balance Error Scoring System Table 3 Minimal detectable change for each BESS condition at varying levels of confidence. Condition MDC (minimal detectable change) based on level of confidence, raw score (test unit) MDC 95 MDC 90 MDC 80 MDC 70 Double-leg, firm surface 0 0 0 0 Single-leg, firm surface 4.102 (5) 3.442 (4) 2.679 (3) 2.176 (3) Tandem-stance, firm surface 2.780 (3) 2.333 (3) 1.815 (2) 1.475 (2) Double-leg, foam surface 2.111 (3) 1.772 (2) 1.378 (2) 1.120 (2) Single-leg, foam surface 4.510 (5) 3.758 (4) 2.945 (3) 2.393 (3) Tandem-stance, foam surface 6.344 (7) 5.325 (6) 4.143 (5) 3.366 (4) Total error score 9.907 (10) 8.314 (9) 6.471 (7) 5.527 (6) 4. Discussion 4.1 Principle findings In order for an assessment tool to be recommended for clinical practice, it must be classified psychometrically as having good reliability, validity and sensitivity [22]. The results show that the BESS has varying levels of test-retest reliability when considering each test condition (0-0.823). However, the test-retest reliability for total error score (0.784), which is the score used within the clinical environment, is classed as being strong [22]. These results compare favourably with the literature, which have found moderate test-retest reliability amongst children (ICC (2, 1) = 0.71) [13] and adults (generalizability = 0.64) [14]. The latter study found that if an average of two test scores were taken at two time points, then the reliability increased to a more clinically acceptable G = 0.82-0.85, when sexes were examined independently. Direct comparisons are difficult here, however, as generalizability is a different statistical methodology to the intraclass correlation coefficient methodology used in this study [22]. But the concept of taking an average of multiple tests at each session, as was administered in this investigation and advocated elsewhere [19], seems to be beneficial for ensuring higher test-retest values. The study with 9-14 years old subjects [13] did not administer this multiple test approach, so may have had an improved test-retest value if it did, and achieve a strong clinical outcome, as the value is only 0.4 from the statistical indicators. The strong test-retest reliability results found in this investigation coupled with that already seen in the literature suggests that the BESS test is a consistent clinical tool. The weak reliability values associated with the single-leg, firm surface and tandem-stance, foam surface conditions may suggest a favourable learning effect [11] present. However, this is still not being seen consistently enough in the literature to warrant an alteration to the protocol at this time. The MDC for total error score, which highlights the amount of change in error points required for the change to be clinically relevant, ranges from 6 to 10 based on level of confidence. Based on the largest confidence interval MDC 95 (SDD), for a true change in total error score to have taken place, a change of > 9 error points is required. In the clinical setting, this would suggest that a swing of > 9 from baseline would be required before a clinician could consider whether neurological impairment has occurred; conversely, if the test was initially administered post-mhi to monitor recovery, the athlete would need to show a reduction in score of > 9 error points in order to show true signs of neurological improvement. Using more conservative confidence intervals (MDC 70, MDC 80 and MDC 90 ) as advocated in under-18 athletes [24], would mean a smaller change in error points required to infer a true change in score (6, 7 and 9, respectively). When comparing the more conservative MDC 70, MDC 80 and MDC 90 of total BESS error score to the RCI s of similar confidence intervals identified in the study by Valovich McLeod et al. [13], we can see some similarity with only 1 unit difference at each

The Test-Retest Reliability and Minimal Detectable Change of the Balance Error Scoring System 205 of these confidence levels. The SDD for total BESS error score of 10 found in this investigation can be considered high since an average difference of seven errors (range: 6-9 errors) from baseline score or control group has been seen in post-concussive subjects [25, 26]. Using SDD in this instance has the potential implication that an athlete, who has undergone MHI and suffered noticeable balance impairment, may not have this change registered as true if undertaking the BESS. 4.2 Considerations for the BESS The reliability of the BESS may be enhanced further when considering any possible test-design limitations. When a lone administrator administers the test, in situ, it is difficult to score all errors that are actually occurring. For instance, some errors occur in different planes (e.g., 30 flexion of the leg is in the sagittal plane, whereas, 30 abduction of the leg is in the frontal plane where does the administrator stand to ensure they capture both of these potential errors?). Some errors occur at various height levels (e.g., an administrator may be looking for the eyes opening, yet miss the heel raising). It is also difficult to accurately determine the time frame of 5 seconds, which has implication upon various error scores. This may be rectified by using video replay to confirm all the errors that have taken place, as seen previously [27]. Previous studies into other dynamic balance assessment tools have found that using video replay can increase reliability scores [28, 29]. Also, multiple errors occurring simultaneously are suggested to be scored as only one error, despite the multiple postural deviations occurring. Once again, video replay could potentially measure more accurately the number of errors occurring, which, when bearing in mind the nature of the test in terms of assessing recovery from MHI, maybe worthwhile despite the obvious time costs. It would be interesting to determine whether test-retest reliability and MDC, which is calculated using test-retest correlation coefficients, is enhanced due to the experience of the administrators. Considering what the BESS is assessing, it is vital that all test administrators can accurately score the errors consistently. It may, therefore, require a clinician to prove their proficiency prior to administering this test with their athletes. Previous literature in the field of post-concussive assessment, with particular reference to BESS, suggested that training is encouraged to establish consistency among multiple raters and that this training should be highlighted [19, 24]. However, the actual training period required for someone to be able to administer the test is not referred to in the literature. In the case of this investigation, the experience of the test administrator was three months and a strong test-retest reliability was achieved, however, the mean total error score was 14.06, which is high when considering the range of means in healthy subjects identified during a recent systematic review of the BESS (8.4-14.1 total error score) [19]. It may have been that the test administrator in this study was being overly liberal when scoring errors, which is partly confirmed when comparing the mean total error score of the soccer players in this study (14.73 ± 6.03) with previous literature assessing total error score in soccer players (12.5 ± 5.16) [30]. However, it must be stated that the gender of the subjects were not comparable between these two studies. As such, it is still uncertain as to whether tester expertise will have an effect on reliability of the BESS test. 4.3 Limitations and Future Research Possibilities The limitations of this study included the fact that the test administrator was possibly not experienced enough as a BESS tester; however, as has been discussed prior, there are still no set recommendations for training period prior to clinical administration. Also, the participants were all healthy athletes, which does not infer to the basis of the BESS test. However, this use of healthy participants is in alignment with

206 The Test-Retest Reliability and Minimal Detectable Change of the Balance Error Scoring System other similar BESS test studies [1, 9, 27]. It is now important to further determine the MDC of the BESS, using various subject groups, so as to provide clinicians with an accurate guideline regarding whether an athlete is presenting with post-mhi balance impairment, however, it must be noted that MDC should not be the single factor in making return-to-play decisions, post-mhi; instead, this true change must be taken into consideration along with a variety of clinical assessment methods both subjective and objective [13]. 4.4 Clinical Implications There is currently no recognised gold standard measure for balance activity in patients with neurological conditions [10], however, this study adds to the body of literature in terms of demonstrating that the test-retest reliability of the BESS is strong when using healthy, young adult academy athletes. It also advocates the protocol recommendations of using multiple administrations per session in order to enhance the reliability of the scores recorded [19]. However, there is still room for identifying a minimum trainer experience to see whether this factor has in influence on improving test-retest reliability of the BESS, and thus providing a more accurate MDC. Identifying that MDC 95 is greater than the differences seen in the literature for BESS error score pre- and post-concussion, may suggest that the BESS is not sensitive enough to carry out clinical testing on MHI, however, it has to be noted that the literature has recommended more conservative assessment measures when assessing in this field [24]. So it may be that using an MDC with a lower confidence interval is preferable. In which case, MDC 70 and MDC 80 would be sensitive enough to identify these noted changes. It is also worth pointing out that there is still a paucity in the literature in terms of using concussed athletes as subjects, as such, the average change in score seen pre- and post-mhi thus far may not be a true representation of the changes that take place. References [1] Erkmen, N., Taskin, H., Kaplan, T., and Sanioglu, A. 2009. The Effect of Fatiguing Exercise on Balance Performance as Measured by the Balance Error Scoring System. Isokinetics and Exercise Science 17 (3): 121-7. [2] Palmieri, R. M., Ingersoll, C. D., Cordova, M. L., Kinzey, S. J., Stone, M. B., and Krause, B. A. 2003. The Effect of a Simulated Knee Joint Effusion on Postural Control in Healthy Subjects. Archives of Physical Medicine and Rehabilitation 84 (7): 1076-9. [3] Vieira, T. M., de Oliveira, L. F., and Nadal, J. 2009. An Overview of Age-Related Changes in Postural Control during Quiet Standing Tasks Using Classical and Modern Stabilometric Descriptors. Journal of Electromyography and Kinesiology 19 (6): 513-9. [4] Broglio, S. P., and Puetz, T. W. 2008. The Effect of Sport Concussion on Neurocognitive Function, Self-Report Symptoms, and Postural Control: A Meta-analysis. Sports Medicine 38 (1): 53-67. [5] Guskiewicz, K. M., Weaver, N. L., Padua, D. A., and Garrett, W. E. 2000. Epidemiology of Concussion in Collegiate and High School Football Players. American Journal of Sports Medicine 28 (5): 643-50. [6] Barr, W. B., and McCrea, M. 2001. Sensitivity and Specificity of Standardized Neurocognitive Testing Immediately Following Sports Concussion. Journal of the International Neuropsychology Society 7 (6): 396-402. [7] Riemann, B. L., Guskiewicz, K. M., and Shields, E. W. 1999. Relationship between Clinical and Forceplate Measures of Postural Stability. Journal of Sport Rehabilitation 8 (2): 71-82. [8] Riemann, B. L., and Guskiewicz, K. M. 2000. Effects of Mild Head Injury on Postural Stability as Measured through Clinical Balance Testing. Journal of Athletic Training 35 (1): 19-25. [9] Valovich, T. C., Perrin, D. H., and Gansneder, B. M. 2003. Repeat Administration Elicits a Practice Effect with the Balance Error Scoring System but Not with the Standardized Assessment of Concussion in High School Athletes. Journal of Athletic Training 38 (1): 51-6. [10] Tyson, S. F., and Connell, L. A. 2009. How to Measure Balance in Clinical Practice. A Systematic Review of the Psychometrics and Clinical Utility of Measures of Balance Activity for Neurological Conditions. Clinical Rehabilitation 23 (9): 824-40. [11] Mulligan, I. V., Boland, M. A., and McIlhenny, C. V. 2013. The Balance Error Scoring System Learned Response among Young Adults. Sports Health 5 (1): 22-6. [12] Steffen, T., and Seney, M. 2008. Test-Retest Reliability

The Test-Retest Reliability and Minimal Detectable Change of the Balance Error Scoring System 207 and Minimal Detectable Change on Balance and Ambulation Tests, the 36-Item Short-Form Health Survey, and the Unified Parkinson Disease Rating Scale in People with Parkinsonism. Physical Therapy 88 (6): 733-46. [13] Valovich McLeod, T. C., Barr, W. B., McCrea, M., and Guskiewicz, K. M. 2006. Psychometric and Measurement Properties of Concussion Assessment Tools in Youth Sports. Journal of Athletic Training 41 (4): 399-408. [14] Broglio, S. P., Zhu, W., Sopiarz, K., and Park, Y. 2009. Generalizability Theory Analysis of Balance Error Scoring System Reliability in Health Young Adults. Journal of Athletic Training 44 (5): 497-502. [15] Munro, A. G., and Herrington, L. C. 2010. Between-Session Reliability of the Star Excursion Balance Test. Physical Therapy in Sport 11 (4): 128-32. [16] Haley, S. M., and Fragala-Pinkham, M. A. 2006. Interpreting Change Scores of Tests and Measures Used in Physical Therapy. Physical Therapy 86 (5): 735-43. [17] Parsons, T. D., Notebaert, A. J., Shields, E. W., and Guskiewicz, K. M. 2009. Application of Reliable Change Indices to Computerized Neuropsychological Measures of Concussion. International Journal of Neurology 119 (4): 492-507. [18] Teixeira da Cunha-Filho, I., Henson, H., Qureshy, H., Williams, A. L., Holmes, S. A., and Protas, E. J. 2003. Differential Responses to Measures of Gait Performance among Healthy and Neurologically Impaired Individuals. Archives of Physical Medicine and Rehabilitation 84 (12): 1774-9. [19] Bell, D. R., Guskiewicz, K. M., Clark, M. A., and Padua, D. A. 2011. Systematic Review of the Balance Error Scoring System. Sports Health 3 (3): 287-95. [20] Rankin, G., and Stokes, M. 1998 Reliability of Assessment Tools in Rehabilitation: An Illustration of Appropriate Statistical Analyses. Clinical Rehabilitation 12 (3): 187-9. [21] Ntoumanis, N. 2007. A Step-by-Step Guide to SPSS for Sport and Exercise Studies. Oxon: Routledge, 146-9. [22] Portney, L. G., and Watkins, M. P. 2000. Foundations of Clinical Research: Applications to Practice. 2nd ed. Norwalk, CT: Appleton & Lange, 34-7. [23] Iyer, L. V., Haley, S. M., Watkins, M. P., and Dumas, H. M. 2003. Establishing Minimal Clinically Important Differences for Scores on the Pediatric Evaluation of Disability Inventory for Inpatient Rehabilitation. Physical Therapy 83 (10): 888-97. [24] Guskiewicz, K. M., Bruce, S. L., Cantu, R. C., Ferrara, M. S., Kelly, J. P., McCrae, M., Putukian, M., and Valovich McLeod, T. C. 2004. National Athletic Trainers Association Position Statement: Management of Sport-Related Concussion. Journal of Athletic Training 39 (3): 280-97. [25] Guskiewicz, K. M., Ross, S. E., and Marshall, S. W. 2001. Postural Stability and Neuropsychological Deficits after Concussion in Collegiate Athletes. Journal of Athletic Training 36 (3): 263-73. [26] McCrea, M., Guskiewicz, K. M., Marshall, S. W., Barr, W., Randolph, C., Cantu, R. C., Onate, J. A., Yang, J., and Kelly, J. P. 2003. Acute Effects and Recovery Time Following Concussion in Collegiate Football Players: The NCAA Concussion Study. Journal of the American Medical Association 290 (19): 2556-63. [27] Hunt, T. N., Ferrara, M. S., Bornstein, R. A., and Baumgartner, T. A. 2009. The Reliability of the Modified Balance Error Scoring System. Clinical Journal of Sport Medicine 19 (6): 471-5. [28] Padua, D. A., Marshall, S. W., Boling, M. C., Thigpen, C. A., Garrett, W. E., and Beutler, A. I. 2009. The Landing Error Scoring System (LESS) Is a Valid and Reliable Clinical Assessment Tool of Jump-Landing Biomechanics: The JUMP-ACL Study. American Journal of Sports Medicine 37 (10): 1996-2002. [29] Padua, D. A., Bolin, M. C., DiStefano, L. J., Onate, J. A., and Beutler, A. I. 2011. Reliability of the Landing Error Scoring System Real Time, a Clinical Assessment of Jump-Landing Biomechanics. Journal of Sport Rehabilitation 20 (2): 145-56. [30] Bressel, E. B., Yonker, J. C., Kras, J., and Heath, E. M. 2007. Comparison of Static and Dynamic Balance in Female Collegiate Soccer, Basketball, and Gymnastics Athletes. Journal of Athletic Training 42 (1): 42-6. [31] Kay, T., Harrington, D. E., Adams, R., Anderson, T., Berrol, S., Cicerone, K., Dahlberg, C., Gerber, D., Goka, R., Harley, P., Hilt, J., Horn, L., Lehmkuhl, D., and Malec, J. 1993. Definition of Mild Traumatic Brain Injury. Journal of Head Trauma Rehabilitation 8 (3): 86-7.