Intraobserver and Interobserver Reliability of the Classification of Thoracic Adolescent Idiopathic Scoliosis**

Similar documents
Adolescent Idiopathic Scoliosis A NEW CLASSIFICATION TO DETERMINE EXTENT OF SPINAL ARTHRODESIS

Kao-Wha Chang, MD, Ku-I Chang, MD, and Chi-Ming Wu, MD

Selective fusion in adolescent idiopathic scoliosis: a radiographic evaluation of risk factors for imbalance

Wh e n idiopathic adolescent scoliosis involves 2

Could Structural and Noncompensatory Lenke 3 and 4C Lumbar Curves Be Nonstructural and Compensatory?

Lowest instrumented vertebra selection in Lenke 3C and 6C scoliosis: what if we choose lumbar apical vertebra as distal fusion end?

Spinal Deformity Pathologies and Treatments

Department of Orthopedics, Hai an Hospital Affiliated to Nantong University, Hai an, Nantong, Jiangsu, China; 2

There is No Remarkable Difference Between Pedicle Screw and Hybrid Construct in the Correction of Lenke Type-1 Curves

Postoperative standing posteroanterior spine

Anterior lumbar instrumentation improves correction of severe lumbar Lenke C curves in double major idiopathic scoliosis

Idiopathic scoliosis Scoliosis Deformities I 06

Screws versus hooks: implant cost and deformity correction in adolescent idiopathic scoliosis

COLUNA/COLUMNA - VOLUME 3 (2) - JUNHO ARTIGO DE ATUALIZAÇÃO

Computer-aided King classification of scoliosis

The effectiveness of selective thoracic fusion for treating adolescent idiopathic scoliosis: a systematic review protocol

Results of preoperative pulmonary function testing of adolescents with idiopathic scoliosis: A study of six hundred and thirty-one patients

Ishikawa et al. Scoliosis and Spinal Disorders (2017) 12:16 DOI /s

Jean-Luc Clément Edouard Chau Marie-José Vallade Anne Geoffray. Introduction

LIV selection in selective thoracic fusions

King II Adolescent idiopathic scoliosis (Lenke B and C): Prediction of coronal decompensation

Society (SRS) Classification for Adult Spinal Deformity

Effect of direct vertebral body derotation on the sagittal profile in adolescent idiopathic scoliosis

The ideal correction system for adolescent. Segmental Derotation Using Alternate Pedicular Screws in Adolescent Idiopathic Scoliosis ABSTRACT

Usefulness of Simple Rod Rotation to Correct Curve of Adolescent Idiopathic Scoliosis

Long-term three-dimensional changes of the spine after posterior spinal instrumentation and fusion in adolescent idiopathic scoliosis*

Spinal deformities, such as increased thoracic

Spinal deformity progression after posterior segmental instrumentation and fusion for idiopathic scoliosis

Presented at the 2013 Joint Spine Section Meeting. Shriners Hospitals for Children, Philadelphia, Pennsylvania

As edited by Dr. Oheneba Boachie-Adjei, Dr. Matthew Cunningham, Dr. John Kostuik, Dr. Raymund Woo and the Complex Spine Study Group et al

Pedicle subtraction osteotomy for the treatment of fixed sagittal imbalance: Surgical technique.

Late Complications of Adult Idiopathic Scoliosis Primary Fusions to L4 and Above

Scoliosis classifications Adopted for Non operative Treatment. Manuel D. Rigo MD PhD Institut Elena Salvá Barcelona

18th International Scientific Meeting of the VCFS Educational Foundation Steven M. Reich, MD. July 15-17, 2011 New Brunswick, New Jersey USA

The effect of body mass index on lumbar lordosis on the Mizuho OSI Jackson spinal table

Factors Influencing the Outcome of Arthrodesis for Congenital Kyphosis and Kyphoscoliosis

Degenerative spondylolisthesis at the L4 L5 in a 32-year-old female with previous fusion for idiopathic scoliosis: A case report

Younes Majdouline 1,2, Carl-Eric Aubin 1,2*, Xiaoyu Wang 1, Archana Sangole 1,2 and Hubert Labelle 2

Per D. Trobisch Amer F. Samdani Randal R. Betz Tracey Bastrom Joshua M. Pahys Patrick J. Cahill

Surgery for Idiopathic Scoliosis: Currently Applied Techniques

Maintenance of Thoracic Kyphosis in the 3D Correction of Thoracic Adolescent Idiopathic Scoliosis Using Direct Vertebral Derotation

Choice of Lowest Instrumented Vertebras for Lenke I Adolescent Idiopathic Scoliosis Orthopedics

Author's response to reviews

Introduction. Our hope is that this Surgical Technique Guide enhances your knowledge and contributes to clinical success for your patients.

Adolescent Idiopathic Scoliosis

Correlation of Radiographic Parameters and Clinical Symptoms in Adult Scoliosis

Robert W Gaines 1*, Kan Min 2 and Daniel Zarzycki 3

Long lumbar instrumented fusions have been described

Medical Journal of the Islamic Republic of Iran.Vol. 23, No. 3, November, pp

Forced Lordosis on the Thoracolumbar Junction Can Correct Coronal Plane Deformity in Adolescents With Double Major Curve Pattern Idiopathic Scoliosis

PARAPLEGIA. B FIG. 6 A, B and C, Same patient three years after spinal grafting shows a most remarkable improvement of spinal deformity and posture.

Prevalance of neural axis abnormalities in patients with infantile idiopathic scoliosis

Treatment of thoracolumbar burst fractures by vertebral shortening

Prevention and management of iatrogenic flatback deformity

Does Thoracic Hypokyphosis Matter in Lenke Type 1 Adolescent Idiopathic Scoliosis?

A Comparison of Cobb Angle: Standing Versus Supine Images of Late-Onset Idiopathic Scoliosis

Adolescent Idiopathic Scoliosis

Posterior spinal arthrodesis for adolescent idiopathic scoliosis using pedicle screw instrumentation

Non-invasive quantitative assessment of scoliosis spinal surgery outcome Lama Seoud* a, Farida Cheriet a,b, Hubert Labelle b, Stefan Parent b

Proximal junctional kyphosis in adult spinal deformity with long spinal fusion from T9/T10 to the ilium

Idiopathic Scoliosis: Anterior Approach and Fixation from the Concavity

Original Article Selection of proximal fusion level for degenerative scoliosis and the entailing proximal-related late complications

Dr. Theodoros B Grivas MD, PhD

QUT Digital Repository:

Postoperative Change of Thoracic Kyphosis after Corrective Surgery for Adult Spinal Deformity

Application Residency Grant Project Section I Project Leader: Teddy E. Kim Credentials: MD, DO,

Treatment Options. CallenChiro.com

Jianru Wang, MD, Xiang Li, MD, and Zhaomin Zheng, MD, PhD

Clinical Study Acute Reciprocal Changes Distant from the Site of Spinal Osteotomies Affect Global Postoperative Alignment

Temporary use of shape memory spinal rod in the treatment of scoliosis

Exercises for Scoliosis within the braces and Brace modifications for exercises

ApiFix New minimal invasive method to treat Adolescent Idiopathic Scoliosis Short fixation followed by Specific Physiotherapy Program

Early Failure of Short-Segment Pedicle Instrumentation for Thoracolumbar Fractures

Management of fractures of the pedicle after instrumentation with transpedicular screws

LESS IS MORE SIGNFICANT CORONAL CORRECTION OF AIS DEFORMITY PREDICTS THORACIC HYPOKYPHOSIS

4.5 System. Surgical Technique. This publication is not intended for distribution in the USA.

A three-dimensional retrospective analysis of the evolution of spinal instrumentation for the correction of adolescent idiopathic scoliosis

Results of surgical treatment for kyphotic deformity of the spine secondary to trauma or Scheuermann s disease

Porcine model for early onset scoliosis created with a posterior mini-invasive method

Association between Sacral Slanting and Adjacent Structures in Patients with Adolescent Idiopathic Scoliosis

Scoliosis is considered to be the most common skeletal

Aneurysmal Bone Cyst of the Spine

Change of Sagittal Spinopelvic Parameters after Selective and Non-Selective Fusion in Lenke Type 1 Adolescent Idiopathic Scoliosis Patients

Surgical treatment for adult spinal deformity: Conceptual approach and surgical strategy

The Wilmington Brace in the Treatment of. Adolescent Idiopathic Scoliosis

Ning Liu, MD, 1,3 and Kirkham B. Wood, MD 1,2

Current status of managing pediatric kyphosis deformity Papers divided into 3 categories

Research Article TheHeightGaininScolioticDeformityCorrection: Assessed by New Predictive Formula

Spinal Fusion. North American Spine Society Public Education Series

PROGRESSIVE AND RESOLVING INFANTILE IDIOPATHIC SCOLIOSIS. The Differential Diagnosis

Demonstration of active Side Shift Type1(Mirror Image ) in Right (Major) Thoracic curve.

How to use: Hold the Baseline scoliosis. Fabrication Enterprises Incorporated

Raymond Wiegand, D.C. Spine Rehabilitation Institute of Missouri

Postoperative shoulder imbalance in Lenke Type 1A adolescent idiopathic scoliosis and related factors


5/27/2016. Sagittal Balance What is It and How Did We Get Here? Sagittal Balance. Steven J. Tresser, MD Tampa, FL. Concept:

Lumbar degenerative spinemodalities

Competence of the Deltoid Ligament in Bimalleolar Ankle Fractures After Medial Malleolar Fixation *

Use of the iphone for Cobb angle measurement in scoliosis

Ebrahim Ghayem Hassankhani, 1 Farzad Omidi-Kashani, 1 Shahram Moradkhani, 2 Golnaz Ghayem Hassankhani, 3 and Mohammad Taghi Shakeri 4. 1.

Transcription:

Copyright 1998 by The Journal of Bone and Joint Surgery, Incorporated Intraobserver and Interobserver Reliability of the Classification of Thoracic Adolescent Idiopathic Scoliosis** BY LAWRENCE G. LENKE, M.D4, RANDAL R. BETZ, M.D., KEITH H. BRIDWELL, M.D4, DAVID H. CLEMENTS, M.D.#, JURGEN HARMS, M.D.**, THOMAS G. LOWE, M.D.ft, AND HARRY L. SHUFFLEBARGER, M.D.tt, ST. LOUIS MISSOURI Investigation performed at Washington University, St. Louis ABSTRACT The system described by King et al. is the standard method for the classification of thoracic adolescent idiopathic scoliosis. Although it is widely used and referenced, its reliability and reproducibility among scoliosis surgeons are unknown. We used a scoliosis casepresentation format to examine the interobserver and intraobserver reliability of the classification of thoracic adolescent idiopathic scoliosis with the system of King et al. Eight active, current members of the Scoliosis Research Society reviewed twenty-seven full-length radiographs that had been made before operative correction of the scoliotic deformity. On the basis of these images, which included posteroanterior and lateral radiographs made with the patient standing as well as right and left forced-side-bending radiographs made with the patient supine, the reviewers assigned a type to each curve according to the classification system of King et al. Kappa coefficients were used to test statistical reliability. The mean interobserver reliability of the classification was only 64 per cent (range, 54 to 77 per cent) when the responses of seven of the reviewers were compared with those of one of the originators of the classification. The mean kappa coefficient was 0.49 (range, 0.27 to 0.73), which indicates poor reliability. When each reviewer's responses were compared with those of the other reviewers, the reliability was similarly poor (interobserver reliability, 55 per cent [range, 33 to 81 per cent] and mean kappa coefficient, 0.40 *No benefits in any form have been received or will be received from a commercial party related directly or indirectly to the subject of this article. No funds were received in support of this study. fread at the Annual Meeting of the Scoliosis Research Society, Ottawa, Canada, September 26,1996. ^Department of Orthopaedic Surgery, Spinal Deformity Service, Washington University, One Barnes Plaza, Suite 11300, St. Louis, Missouri 63110. E-mail address for Dr. Lenke: lenkel@msnotes.wustl.edu. Please address requests for reprints to Dr. Lenke. Shriners Hospital, 8400 Roosevelt Boulevard, Philadelphia, Pennsylvania 19152. #Temple University Hospital, 3401 North Broad Street, Philadelphia, Pennsylvania 19140. "Rehabilitations Krankenhaus, Guttmannwtr.l, Postfach 327, 7516 Karlsbad, Langensteinbach, Germany. tt3550 Lutheran Parkway, Suite 201, Wheat Ridge, Colorado 80033. ttll50 Campo Sano Avenue, Suite 300, Coral Gables, Florida 33146. [range, 0.21 to 0.63]). Intraobserver reliability was evaluated in a trial in which five reviewers in a group setting were shown the same radiographs in a different order at two different viewings. Comparison of the results at the two viewings revealed a mean intraobserver reliability of 69 per cent (range, 56 to 85 per cent) and a mean kappa coefficient of 0.62 (range, 0.34 to 0.95), which indicates fair reliability. The current method of classification of adolescent idiopathic scoliosis does not appear to have sufficient intraobserver or interobserver reliability among scoliosis surgeons to portray curve types accurately. Thus, it may not help to guide treatment with use of modern spinal fixation methods. Classification systems play a crucial role in many areas of orthopaedic surgery. Such systems are used in the assessment of fractures and in the determination of treatment guidelines. The classification of thoracic adolescent idiopathic scoliosis was presented in a landmark article by King et al. 9, published in 1983, in which five types of curves (types I through V) were described in detail. According to this system, a type-i curve is an s-shaped curve in which the lumbar curve is larger and less flexible than the thoracic curve, a type-ii curve is an s-shaped curve in which the thoracic curve is larger and less flexible than the lumbar curve, a type-ill curve is a single thoracic curve in which no compensatory lumbar curve crosses the midline, a type-iv curve is a long thoracic curve in which the fourth lumbar vertebra tilts into the thoracic curve, and a type-v curve is a double thoracic curve. Recommendations for the appropriate levels of arthrodesis for these types were also an important aspect of the article. The system was based on 405 patients who had had instrumentation with a Harrington rod. The system of King et al. has remained the principal classification for thoracic adolescent idiopathic scoliosis despite marked changes in operative instrumentation methods during the last decade 19. However, with modern methods of segmental spinal fixation, coronal decompensation of type-ii curves has occurred 412 ". Many theories have been proposed to explain this problem and numerous possible solutions have been described 4 ' 1217-2022. In addition, there recently has been a greater emphasis on alignment in the sagittal VOL. 80-A, NO. 8, AUGUST 1998 1097

1098 L. G. LENKE ET AL. plane during correction of the scoliosis". Recently, other classification systems used in orthopaedic surgery have come under scrutiny because of the realization that a system must demonstrate validity, reliability, and reproducibility in order to be considered truly effective 5. In the case of scoliotic curves, interobserver reliability refers to the agreement between different observers regarding the classification of the same curve pattern. Reproducibility, or intraobserver reliability, refers to agreement by the same observer with regard to the classification of a curve pattern on two separate occasions. When the reliability of classifications in other areas of orthopaedic surgery has been evaluated, the results have been rather disappointing. As far as we know, the classification of thoracic adolescent idiopathic scoliosis described by King et al. 9 has not been tested with regard to interobserver or intraobserver reliability. There are obvious implications regarding the usefulness of a classification system that is unreliable or non-reproducible, or both. The purpose of the present study was to determine both the interobserver and the intraobserver reliability of the classification system of King et al. 9. Any difficulties that occurred during the classification were analyzed to determine their nature and the reasons why they occurred. Published treatment guidelines that are based on specific types of curves will not be reliable if the classification system that is used to evaluate the curves is unreliable. Materials and Methods Eight members of the Scoliosis Research Society participated in this study. Four surgeons represented two centers, whereas the other four surgeons, one of whom was Dr. H. A. King, were from different scoliosis centers. At the time of the study, all of the participants were active scoliosis surgeons but all had different backgrounds with regard to their training in the operative treatment of scoliosis. Five of us (L. G. L., R. R. B., D. H. C, J. H., and T. G. L.) are members of the Harms International Scoliosis Study Group. The lead author (L. G. L.) chose twenty-seven fulllength, good-quality radiographs that had been made before operative treatment of adolescent idiopathic scoliosis. The images included posteroanterior and lateral radiographs made with the patient standing as well as right and left forced-side-bending radiographs made with the patient supine. All curves had been treated during a two-year time-span during which the lead author personally operated on approximately sixty patients who had adolescent idiopathic scoliosis. The mean coronal Cobb angle for the twenty-seven curves was 64 degrees (range, 45 to 105 degrees). Six of the twenty-seven thoracic curves had a Cobb angle of more than 75 degrees. Each radiograph was labeled with the Cobb angle for each coronal curve, with the value for the thoracic kyphosis as measured from the cephalad end plate of the fifth thoracic vertebra to the caudad end plate of the twelfth thoracic vertebra, and with the value for the lumbar lordosis as measured from the cephalad end plate of the twelfth thoracic vertebra to the cephalad end plate of the sacrum. All measurements were performed by an experienced scoliosis nurseclinician and were repeated by the lead author to document the consistency of the measured values. Each radiograph was then photographed in order to convert it into a standard-size projection-type slide. The slides were projected onto a screen so that the axial skeleton was at least thirty-six inches (91.4 centimeters) long so as to mimic the actual size of a long-cassette radiograph. None of the slides were deemed uninterpretable because of poor quality. Each reviewer was provided with a diagrammatic summary of the five types of curves according to the classification system of King et al. taken directly from the original article 9. Also included was a description of the flexibility index, which was defined as the percentage of correction of the thoracic curve subtracted from the percentage of correction of the lumbar curve, as analyzed on side-bending radiographs. In addition, a table that described the five types of curves was included as a reference, and a note describing King's revised criteria for type-ii curves was provided as well 8. To distinguish between type-ii and double-major curves, the reviewers also used comparative analysis of the ratios of the Cobb angles, apical vertebral translation, and rotation between the thoracic and lumbar regions, as well as the flexibility seen on side-bending radiographs 12. Each reviewer had a preprinted form on which to record his specific classification for each of the curves presented. The reviewers determined whether the curve was type I, II, III, IV, or V; thoracolumbar; lumbar; double major; or other (Trial 1). Five of the reviewers participated in a group setting, during meetings of the Harms International Scoliosis Study Group, and three (K. H. B., H. L. S., and Dr. King) participated on an individual basis at their home institution. All of the reviewers had ample time to review the radiographs; no time-limit was imposed on the review. The five reviewers who participated in the group setting worked independently throughout the review process. No discussion was allowed during the time between the presentations of the curves, and the reviewers sat far enough away from each other so that there was no opportunity to view each other's responses. The same twenty-seven curves were reviewed again in a different order in a group setting at a second viewing (Trial 2) by the same five reviewers who had participated in a group setting in Trial 1. The reviewers were asked again to classify each curve. Therefore, Trial 2 was used to test the intraobserver reliability (reproducibility) of the results from Trial 1. None of the curves were reviewed by the group as a whole until both trials had been performed. THE JOURNAL OF BONE AND JOINT SURGERY

INTRAOBSERVER AND INTEROBSERVER RELIABILITY OF THE CLASSIFICATION OF SCOLIOSIS 1099 FIG. 1-A FIG. 1-B FIG. 1-C Figs. 1-A, 1-B, and 1-C: Radiographs of a fifteen-year and one-month-old girl. The curve was classified as type 1 by five reviewers and as double major by three. Fig. 1-A: Posteroanterior radiograph, made with the patient standing, showing a scoliotic curve of 20 degrees from the first through the fifth thoracic vertebra, 62 degrees from the fifth through the eleventh thoracic vertebra, and 72 degrees from the eleventh thoracic through the fourth lumbar vertebra. Fig. 1-B: Left-side-bending radiograph, made with the patient supine, showing correction of the lumbar curve to 50 degrees. Fig. 1-C: Right-side-bending radiograph, made with the patient supine, showing correction of the primary thoracic curve to 49 degrees. Kappa statistics were used to analyze the data with both simple and weighted components and were compared with use of 95 per cent confidence intervals established with SAS software (Statistical Analysis System, Cary, North Carolina). These statistics determine the proportion of agreement that occurs by random chance subtracted from the actual portion of agreement that was obtained. Kappa coefficients range in value from +1.0 (perfect agreement) to 0.0 (chance agreement) to -1.0 (less agreement than expected by chance). Kappa statistics were used to quantify both interobserver reliability (the results of the evaluations performed by seven of the reviewers compared with those of Dr. King in Trial 1) and intraobserver reliability (the results in Trial 1 compared with those in Trial 2 for each of the five reviewers who participated in both trials). In addition, kappa coefficients were generated to compare each reviewer's results with those of the other reviewers within the group. According to Svanholm et al., a kappa coefficient of more than 0.75 indicates excellent reliability; a value of 0.50 to 0.75, fair reliability; and a value of less than 0.50, poor reliability. Results The mean interobserver reliability was only 64 per cent (range, 54 to 77 per cent) when the responses of Dr. King were compared with those of the other seven reviewers (a total of 189 comparisons). The mean kappa coefficient for interobserver reliability was 0.49 (range, 0.27 to 0.73). All eight reviewers agreed on the classification of two (7 per cent) of the twenty-seven curves. Two classifications were listed for four curves (15 per cent), three were listed for eighteen curves (67 per cent), four were listed for one curve (4 per cent), and five were listed for two curves (7 per cent). When the responses of each reviewer were compared with those of the rest of the group (a total of 189 comparisons), the mean interobserver reliability was 55 per cent (range, 33 to 81 per cent) and the mean kappa coefficient was 0.40 (range, 0.21 to 0.63). The mean intraobserver reliability (a total of 135 comparisons) was 69 per cent (range, 56 to 85 per cent), and the mean kappa coefficient was 0.62 (range, 0.34 to 0.95). According to the criteria proposed by Svanholm et al., the mean interobserver reliability was poor when the responses of seven of the reviewers were compared with those of Dr. King (K = 0.49) as well as when the responses of each reviewer were compared with those of the other reviewers (K = 0.40). The mean intraobserver reliability was fair (K = 0.62) when the results in Trial 1 were compared with those in Trial 2. The most common difficulties encountered by the reviewers included distinction of a type-ii curve from a double major curve (seven cases of disagreement); identification of a primary thoracic curve when the VOL. 80-A, NO. 8, AUGUST 1998

1100 L. G. LENKE ET AL. fourth lumbar vertebra was tilted into the curve or when the lumbar curve had structural characteristics of its own, as evidenced by rotation of the lumbar curve opposite to that of the thoracic spine (two cases of disagreement); distinction of a type-ii curve from a type-ill curve on the basis of the amount of rotation of the lumbar curve and its deviation from the midline (seven cases of disagreement); identification of structural cephalad thoracic curves that were not classic type-v curves (seven cases of disagreement); and distinction of a type-i curve from a true double major curve (four cases of disagreement). Discussion Classification schemes help clinicians to organize their thoughts with regard to the type of problem that is being treated and to design appropriate methods of treatment. Thus, classification systems not only organize an approach to a problem and suggest a method of treatment but also may provide an estimate of the outcome of a particular treatment 7. It is becoming increasingly clear, however, that various classification schemes in all areas of medicine, including orthopaedic surgery, may not fulfill the basic requirements that are necessary in order for them to be considered valid. Specifically, for a classification system to be useful, intraobserver and interobserver reliability must be proved. Different practitioners must agree on the classification of the data on a particular patient (interobserver reliability), and a practitioner must assign the same classification every time the data on that patient are reviewed (intraobserver reliability or repeatability). Only after intraobserver and interobserver reliability has been confirmed can an attempt be made to prove that a classification scheme is useful for guiding treatment and testing clinical outcomes 5. If the validity of widely utilized classification schemes is in doubt, the role of multicenter analyses and the results of studies in which these classification schemes have been used become suspect. During reviews of radiographs of scoliotic curves at meetings of the Harms International Scoliosis Study Group, it became evident that there was a great deal of disagreement among the surgeons regarding the appropriate classification, according to the system of King et al. 9, of most of the curves. This disagreement prompted the current study. The main purpose of the present study was to determine the reliability of the classification of thoracic adolescent idiopathic scoliosis described by King et al. Although this system has been the primary method for the classification of thoracic adolescent idiopathic curves, to our knowledge its reliability has never been determined. In 1991, at the annual meeting of the Scoliosis Research Society, Lonstein reported the results of a multicenter study of twenty-nine scoliotic curves FIG. 2-A FIG. 2-B FIG. 2-C Figs. 2-A, 2-B, and 2-C: Radiographs of a seventeen-year and nine-month-old girl. The curve was classified as type II by four reviewers and as type III by four. Fig. 2-A: Posteroanterior radiograph, made with the patient standing, showing a scoliotic curve of 22 degrees from the first through the fifth thoracic vertebra, 58 degrees from the fifth through the eleventh thoracic vertebra, and 44 degrees from the eleventh thoracic through the third lumbar vertebra. The apex of the lumbar spine does not completely cross the midline. Fig. 2-B: Left-side-bending radiograph, made with the patient supine, showing correction of the cephalad thoracic curve to 10 degrees and correction of the lumbar curve to 13 degrees. Fig. 2-C: Right-side-bending radiograph, made with the patient supine, showing correction of the primary thoracic curve to 35 degrees. THE JOURNAL OF BONE AND JOINT SURGERY

INTRAOBSERVER AND INTEROBSERVER RELIABILITY OF THE CLASSIFICATION OF SCOLIOSIS 1101 that had had coronal decompensation after arthrodesis with Cotrel-Dubousset instrumentation. In that study, twelve spine surgeons were asked to classify the curves and to select the appropriate levels of arthrodesis on the preoperative radiographs. There was general agreement regarding the classification of eleven (38 per cent) of the curves; however, the agreement varied for the remaining eighteen (62 per cent). No specific interobserver reliability data were provided in that study of obviously difficult curves. The problem with the reliability of the classification of thoracic adolescent idiopathic scoliosis in the current study is similar to that seen in other recent evaluations of classification schemes used in orthopaedic surgery 510. Specifically, the interobserver reliability in the present study was only 64 per cent when the responses of seven of the reviewers were compared with those of Dr. King. When the possibility that agreement with regard to the classification was due to statistical chance alone was eliminated, the kappa coefficient was 0.49, which is considered borderline poor reliability 21. The findings were similar when the responses of each reviewer were compared with those of the other reviewers: the interobserver reliability was 55 per cent and the kappa coefficient was 0.40, which also indicates poor reliability. The intraobserver reliability was somewhat better: 69 per cent with a kappa coefficient of 0.62, which is considered fair reliability 21. There were several common problems that appeared to produce the inconsistent responses and suboptimum reliability in the present study. The reviewers found it difficult to distinguish between a type-i curve and a true double major curve (four cases of disagreement). With a type-i scoliosis, the lumbar curve has a greater Cobb angle and is less flexible than the thoracic curve 9. Classically, with a double major curve, the thoracic and lumbar curves have nearly equal Cobb angles, apical rotation, and deviation from the midline. However, there are no strict criteria with regard to how different the structural characteristics of the thoracic and lumbar curves must be in order to differentiate a double major curve from a type-i curve. The precise amount by which the Cobb angle of the lumbar curve must exceed that of the thoracic curve in order for the curve to be classified as type I is unclear (Figs. 1-A, 1-B, and 1-C). Often, when the lumbar curve has a greater Cobb angle, the thoracic curve is less flexible because of interposition of the thoracic rib cage and the sternum. The distinction between type-i and double major curves may not be necessary if the curves are to be treated in a similar manner. However, if the treatment modalities differ for example, if an anterior release and arthrodesis is performed for a larger and stiffer lumbar curve then this distinction becomes more important, especially when the results of such treatment are analyzed on the basis of the preoperative classification of the curves. Another frequent problem experienced by the re- FIG. 3 Radiograph of a fourteen-year and five-month-old girl, showing a scoliotic curve of 30 degrees from the first through the fifth thoracic vertebra, 55 degrees from the fifth through the twelfth thoracic vertebra, and 40 degrees from the twelfth thoracic through the fourth lumbar vertebra. The apex of the lumbar curve is directly on the center sacral line, and there is contralateral rotation of the lumbar spine compared with the rotation of the thoracic spine. viewers in the present study involved distinguishing between a type-ii curve and a type-ill curve on the basis of specific structural characteristics of the lumbar curve when the patient had a larger, more structural thoracic curve (seven cases of disagreement). The difficulty in determining exactly how far the lumbar curve was situated from the midline and how much rotation was present led to disagreement regarding whether these curves should be classified as type II or type III (Figs. 2-A, 2-B, and 2-C). Although the degree of deviation from the midline that distinguishes a type-ii curve from a type-ill curve was not strictly quantified in the original classification system of King et al. 9, in a subsequent article 8 King stated that the lumbar curve crosses the midline in type II and does not cross the midline in type III. The clinical examination may also provide important clues in that a lumbar hump is present in a type-ii curve and is virtually absent in a type-ill curve 8. However, some curves cannot be classified according to this strict definition because the position of the VOL. 80-A, NO. 8, AUGUST 1998

1102 L. G. LENKE ET AL. FIG. 4-A FIG. 4-B Figs. 4-A through 4-D: Radiographs of a fourteen-year and three-month-old boy. Fig. 4-A: Posteroanterior radiograph, made with the patient standing, showing a scoliotic curve of 36 degrees from the first through the fifth thoracic vertebra, 65 degrees from the fifth through the twelfth thoracic vertebra, and 53 degrees from the twelfth thoracic through the fourth lumbar vertebra. Fig. 4-B: Left-side-bending radiograph, made with the patient supine, showing correction of the cephalad thoracic curve to 23 degrees and correction of the lumbar curve to 17 degrees. apex of the lumbar curve is somewhere between those of type-ii and type-ill curves. An example is a 55- degree right thoracic and 40-degree left lumbar curve with the thoracic curve having more deviation and more rotation than the lumbar curve but the medial apex of the lumbar curve lying directly on the center sacral line with contralateral rotation of the lumbar spine compared with that of the thoracic spine (Fig. 3). It is difficult to pinpoint whether that curve should be classified as type II or type III because of the intermediate position of the lumbar portion. This is a challenging curve to evaluate because almost invariably the thoracic curve is more structural than the lumbar curve, even if the curves have similar Cobb angles, because of the rigidity contributed by the thoracic rib cage. However, this type of curve is fairly common and thus must be classified appropriately. Similarly, our reviewers had difficulty distinguishing a type-ii curve from a double major curve (seven cases of disagreement). This difficulty appeared to be due to variability in the comparison of the extent of structural characteristics of the thoracic spine with those of the lumbar spine; in addition, some reviewers placed added importance on the alignment of the thoracolumbar junction in the sagittal plane when deciding whether both the thoracic and the lumbar curve should be considered structural curves. Certainly, there is still controversy regarding the strict definition of a type-ii curve that can be successfully treated with selective thoracic arthrodesis 4,812151718 ' 22. Thus, the type-ii curves that were appropriate for selective thoracic arthrodesis in the coronal plane were often classified as double major if a reviewer believed that both curves needed arthrodesis with instrumentation because of a thoracolumbar kyphosis in the sagittal plane (Figs. 4-A through 4-D). Such a kyphosis was not accounted for in the classification system of King et al. 9. Use of the flexibility index alone can be somewhat misleading because most thoracic curves with a Cobb angle that is nearly equal to that of the lumbar curve are inherently less flexible because of the interposition of the thoracic rib cage and the sternum. Thus, comparative analysis of the ratio of the thoracic Cobb angle to the lumbar Cobb angle, apical vertebral translation, and rotation, as well as the flexibility seen on side-bending radiographs, has also been used to distinguish a type- II curve from a double major curve' 218. However, other authors still classify a curve as type II even if they perform instrumentation and arthrodesis of both the thoracic and the lumbar curve 1. Since one of the objectives of any classification system should be to direct decisions regarding treatment 7, we recommend classification THE JOURNAL OF BONE AND JOINT SURGERY

INTRAOBSERVER AND INTEROBSERVER RELIABILITY OF THE CLASSIFICATION OF SCOLIOSIS 1103 of a curve as type II when selective thoracic arthrodesis can be performed successfully, as recommended by King et al. 9. Another situation that caused disagreement with regard to the classification was the presence of a primary thoracic curve with the fourth lumbar vertebra tilted into it as well as a lumbar curve that had structural characteristics of its own as evidenced by rotation of the lumbar curve opposite to the rotation of the thoracic spine and inflexibility on side-bending (two cases of disagreement). These curves were often classified as type II, III, or IV by the reviewers (Figs. 5-A, 5-B, and 5-C). Although not very common, this pattern of reversed rotation with little or no contralateral deviation of the lumbar curve from the midline was also not well defined in the classification system of King et al. 9. The reviewers also had trouble identifying structural cephalad thoracic curves that were not classic type-v curves (seven cases of disagreement). According to the classic definition, a patient is considered to have a type-v curve if positive tilt of the first thoracic vertebra is seen on a posteroanterior radiograph, made with the patient standing, and the left shoulder is higher than the right as seen clinically. This dictates treatment of two structural thoracic curves and the inclusion of the cephalad thoracic curve in the instrumentation and arthrodesis of the main thoracic curve. However, it has been shown that the cephalad thoracic curve may be structural even when there is neutral or negative tilt of the first thoracic vertebra and the shoulders are level or the right shoulder is elevated. The diagnosis of a structural cephalad thoracic curve depends on various radiographic and clinical parameters 111 ' 1, including a Cobb angle of more than 30 degrees as measured on the posteroanterior radiograph, side-bending flexibility (a residual cephalad thoracic curve of more than 20 degrees), rotation (at least grade 1 according to the system of Nash and Moe), and at least one centimeter of deviation of the apex of the cephalad thoracic curve from the midline. Thus, some structural cephalad thoracic curves are not classic type-v curves. These curves fit more than one classification in the system of King et al. 9. In fact, any type of curve may be accompanied by a structural cephalad thoracic curve that needs instrumentation and arthrodesis" 14 (Figs. 6-A, FIG. 4-C FIG. 4-D Fig. 4-C: Right-side-bending radiograph, made with the patient supine, showing correction of the primary thoracic curve to 29 degrees. Fig. 4-D: Lateral radiograph, made with the patient standing, showing a thoracolumbar junctional kyphosis of 19 degrees from the twelfth thoracic through the second lumbar vertebra. Thus, although this curve satisfies all of the criteria for a type-ii curve on the basis of the findings on Figs. 4-A, 4-B, and 4-C, it was classified as type II by three reviewers and as double major by five reviewers on the basis of the lateral radiograph. VOL. 80-A, NO. 8, AUGUST 1998

1104 L. G. LENKE ET AL. FIG. 5-A FIG. 5-B FIG. 5-C Figs. 5-A, 5-B, and 5-C: Radiographs of a twelve-year and nine-month-old girl. The curve was classified as type II by two reviewers, as type III by five, and as type IV by one. Fig. 5-A: Posteroanterior radiograph, made with the patient standing, showing a scoliotic curve of 47 degrees from the first through the sixth thoracic vertebra, 88 degrees from the sixth thoracic through the first lumbar vertebra, and 54 degrees from the eleventh thoracic through the fifth lumbar vertebra. Fig. 5-B: Left-side-bending radiograph, made with the patient supine, showing correction of the cephalad thoracic curve to 31 degrees and correction of the lumbar curve to 20 degrees. Fig. 5-C: Right-side-bending radiograph, made with the patient supine, showing correction of the primary thoracic curve to 61 degrees. 6-B, and 6-C). It is unclear whether all such curves should be classified as type V or if two classifications (for example, type IV and type V) should be listed because the curve satisfies the criteria for both. King et al. 9 developed their classification system during the era of Harrington instrumentation and arthrodesis. During that time, unidimensional (coronal) assessment was the principal manner in which curves were classified and appropriate treatment was recommended. With the advent of segmental spinal fixation, three-dimensional analysis of scoliosis has become routine. Thus, it appears logical that a classification scheme that promotes three-dimensional analysis of the scoliotic deformity would provide additional information with which to classify these curves and would help physicians to determine appropriate three-dimensional treatment. All of the patients in this study had posteroanterior and lateral radiographs made, while they were standing, with use of a thirty-six by sixteen-inch (91.4-centimeter by 40.6-centimeter) cassette, and all had side-bending flexibility radiographs made, as part of the standard preoperative analysis. Lateral radiographs were not used by King et al. to develop their classification; however, the classification of scoliotic curves and decisions regarding treatment could potentially be based on various features (such as thoracolumbar kyphosis) noted on these radiographs. In addition, there will always be some overlap between the characteristics of thoracic curves and other types of curves (such as primary thoracolumbar or lumbar curves) in adolescent idiopathic scoliosis. Thus, the classification system of King et al. is not inclusive of all types of adolescent idiopathic scoliosis and therefore does not allow comprehensive evaluation of the various patterns that are seen. The present study has some shortcomings. First, there may have been an overall reviewer bias as five of the eight reviewers were members of a scoliosis study group. However, the reviewers represented a wide geographical distribution, were all active members of the Scoliosis Research Society, and had different training and backgrounds. Second, the curves were preselected by the lead author and many were chosen because he had had difficulty classifying them himself. Thus, it is certainly possible that the reliability data would have been better if more straightforward types of curves had been analyzed. However, all twenty-seven of the curves were treated in a two-year time-span and reflect the type of idiopathic curves that were treated operatively during that time. One of the strongest criticisms of this study could THE JOURNAL OF BONE AND JOINT SURGERY

INTRAOBSERVER AND 1NTEROBSERVER RELIABILITY OF THE CLASSIFICATION OF SCOLIOSIS 1105 FIG. 6-A FIG. 6-B FIG. 6-C Figs. 6-A, 6-B, and 6-C: Radiographs of a fifteen-year and nine-month-old girl. The curve satisfied criteria for both type IV and type V and was classified as type IV by two reviewers, as type V by five, and as type IV and V by one. Fig. 6-A: Posteroanterior radiograph, made with the patient standing, showing a scoliotic curve of 57 degrees from the second through the seventh thoracic vertebra and 65 degrees from the seventh thoracic through the third lumbar vertebra. The first thoracic vertebra is tilted into the concavity of the cephalad thoracic curve, and the fourth lumbar vertebra is tilted into the long thoracic curve. Fig. 6-B: Left-side-bending radiograph, made with the patient supine, showing correction of the cephalad thoracic curve to 41 degrees. Fig. 6-C: Right-side-bending radiograph, made with the patient supine, showing complete correction of the curve to 0 degrees. be that the reviewers lacked an accurate understanding of the proper use of the classification system of King et al. 9. In other words, a problem with the reliability of the classification may be inherent to the education of the reviewers and not an inherent weakness of this or other methods of classification. We tried to minimize this possibility by familiarizing the reviewers with the classification both in writing and schematically. The reviewers had used this system to classify all of their patients prospectively for data analysis during the four years preceding the study and thus had used it routinely in their practices. However, it was not determined if the reviewers used the classification in a manner that reflected complete understanding of the system. Incomplete understanding may have contributed to the poor reliability data. Additionally, for practical purposes, slides of radiographs instead of actual long-cassette radiographs were used. Although these slides were of sufficiently good quality, in reality preoperative decisions are usually made on the basis of long-cassette radiographs, not slides of radiographs. In addition, clinical examination, which was not a part of this analysis, can be extremely helpful in determining the type of curve and the ultimate decisions regarding treatment. The goal of operative of treatment of thoracic adolescent idiopathic scoliosis is safe correction of the deformity with spinal instrumentation and a solid fusion. However, the definition of a successful result is quite controversial and depends on both clinical and radiographic parameters. Currently, there is no appropriate radiographic scoring system for the objective comparison of the results of operative treatment of idiopathic scoliosis performed by one surgeon with those of the operations performed by another. If a reliable classification system that is able to direct appropriate treatment is developed, then it will be possible to objectively compare the results of operative treatment of similar curves among surgeons in order to determine the ideal treatment and to direct future outcome studies in this field. In the present study, we found poor-to-fair interobserver and intraobserver reliability with use of the system of King et al. 9 for the classification of thoracic adolescent idiopathic scoliosis. Variable interpretations of the classification were noted. In addition, we identified several recurring problems that led to less-thandesirable reliability, and these inconsistencies may interfere with a thorough evaluation of the spinal deformity. Thus, we recommend caution when the results of operations for thoracic adolescent idiopathic scoliosis are compared on the basis of the classification system of King et al. 9. Appropriate analysis of operative outcome data is possible only if the classification of the curves is reliable. NOTE: The authors thank Bradley Wilson, M.A., Division of Biostatistics, Washington University, for statistical analysis; Howard A. King, M.D., for his participation; and Lutz Biederman, for his support of the study group. VOL. 80-A, NO. 8, AUGUST 1998

1106 L. G. LENKE ET AL. References 1. Barr, S. J.; Schuette, A. M.; and Emans, J. B.: Lumbar pedicle screws versus hooks: results in double major curves in adolescent idiopathic scoliosis. Orthop. Trans., 21: 76,1997. 2. Bernhardt, M., and Bridwell, K. H.: Segmental analysis of the sagittal plane alignment of the normal thoracic and lumbar spines and thoracolumbar junction. Spine, 14:717-721,1989. 3. Bridwell, K. H.; Betz, R.; Capelli, A. M.; Huss, G.; and Harvey, C: Sagittal plane analysis in idiopathic scoliosis patients treated with Cotrel-Dubousset instrumentation. Spine, 15:921-926,1990. 4. Bridwell, K. H.; McAllister, J. W.; Betz, R. R.; Huss, G.; Clancy, M.; and Schoenecker, P. L.: Coronal decompensation produced by Cotrel-Dubousset "derotation" maneuver for idiopathic right thoracic scoliosis. Spine, 16:769-777,1991. 5. Burstein, A. H.: Editorial. Fracture classification systems: do they work and are they useful? J. Bone and Joint Surg., 75-A: 1743-1744, Dec. 1993. 6. Cobb, J. R.: Outline for the study of scoliosis. In Instructional Course Lectures, American Academy of Orthopaedic Surgeons. Vol. 5, pp. 261-275. Ann Arbor, J. W. Edwards, 1948. 7. Kalen, V., and Conklin, M.: The behavior of the unfused lumbar curve following selective thoracic fusion for idiopathic scoliosis. Spine, 15:271-274,1990. 8. King, H. A.: Analysis and treatment of type II idiopathic scoliosis. Orthop. Clin. North America, 25:225-237,1994. 9. King, H. A.; Moe, J. H.; Bradford, D. S.; and Winter, R. B.: The selection of fusion levels in thoracic idiopathic scoliosis. J. Bone and Joint Surg., 65-A: 1302-1313, Dec. 1983. 10. Lee, C. K.: Lumbar spinal instability (olisthesis) after extensive posterior spinal decompression. Spine, 8:429-433,1983. 11. Lee, C. K.; Denis, E; Winter, R. B.; and Lonstein, J. E.: Analysis of the upper thoracic curve in surgically treated idiopathic scoliosis. A new concept of the double thoracic curve pattern. Spine, 18:1599-1608,1993. 12. Lenke, L. G.; Bridwell, K. H.; Baldus, C; and Blanke, K.: Preventing decompensation in King type II curves treated with Cotrel- Dubousset instrumentation. Strict guidelines for selective thoracic fusion. Spine, 17(8S): S274-S281,1992. 13. Lenke, L. G.; Bridwell, K. H.; Baldus, C; Blanke, K.; and Schoenecker, P. L.: Cotrel-Dubousset instrumentation for adolescent idiopathic scoliosis. /. Bone and Joint Surg., 74-A: 1056-1067, Aug. 1992. 14. Lenke, L. G.; Bridwell, K. H.; O'Brien, M. F.; Baldus, C; and Blanke, K.: Recognition and treatment of the proximal thoracic curve in adolescent idiopathic scoliosis treated with Cotrel-Dubousset instrumentation. Spine, 19:1589-1597,1994. 15. Lonstein, J. E.: Decompensation with Cotrel Dubousset instrumentation a multicenter study. Orthop. Trans., 16:158,1992. 16. Nash, C. L., Jr., and Moe, J. H.: A study of vertebral rotation. J. Bone and Joint Surg., 51-A: 223-229, March 1969. 17. Richards, B. S.: Lumbar curve response in type II idiopathic scoliosis after posterior instrumentation of the thoracic curve. Spine, 17(8S): S282-S286,1992. 18. Richards, B. S.; Birch, J. G.; Herring, J. A.; Johnston, C. E.; and Roach, J. W.: Frontal plane and sagittal plane balance following Cotrel-Dubousset instrumentation for idiopathic scoliosis. Spine, 14:733-737,1989. 19. Roye, D. P., Jr.; Farcy, J. P.; Rickert, J. B.; and Godfried, D.: Results of spinal instrumentation of adolescent idiopathic scoliosis by King type. Spine, 17(8S): S270-S273,1992. 20. Shufflebarger, H. L., and Clark, C. E.: Fusion levels and hook patterns in thoracic scoliosis with Cotrel-Dubousset instrumentation. Spine, 15: 916-920,1990. 21. Svanholm, H.; Starklint, H.; Gundersen, H. J.; Fabricius, J.; Barlebo, H.; and Olsen, S.: Reproducibility of histomorphologic diagnoses with special reference to the kappa statistic. APMIS: Acta Pathol, Microbiol, Immunol. Scandinavica, 97:689-698,1989. 22. Thompson, J. P.; Transfeldt, E. E.; Bradford, D. S.; Ogilvie, J. W.; and Boachie-Adjei, O.: Decompensation after Cotrel-Dubousset instrumentation of idiopathic scoliosis. Spine, 15:927-931,1990. THE JOURNAL OF BONE AND JOINT SURGERY