Interobserver and Intraobserver Reliability of Sub-Axial Injury Classification and Severity Scale between Radiologist, Resident and Spine Surgeon

Woo Jin Lee; Seung Hwan Yoon; Yeo Ju Kim; Ji Yong Kim; Hyung Chun Park; Chon Oon Park

doi:10.3340/jkns.2012.52.3.200

Abstract

Objective

The sub-axial injury classification (SLIC) and severity scale was developed to help decide whether a patient with a cervical spine injury should undergo surgery, but its reliability among different types of physicians is not well known. Therefore, we evaluated the reliability of the SLIC among a spine surgeon, a neurosurgery resident and a neuro-radiologist.

Methods

In a retrospective review at a single hospital covering 2002 to 2009, 75 patients with sub-axial spine injury who underwent surgery were included. Each case was blindly scored on the SLIC and severity scale by 3 different observers on two occasions 4 weeks apart, with the case order randomly allocated. The axes compared were the injury morphology score, the disco-ligamentous complex score, the neurological status score and the total SLIC score; the neurological status score was derived from review of the medical record. The kappa value was used for the statistical analysis.

Results

Interobserver agreement on the SLIC and severity scale was substantial for the injury morphology score [intraclass correlation (ICC)=0.603] and for the total SLIC and severity scale score (ICC value=0.775), but fair for the disco-ligamentous complex score (ICC value=0.304). Intraobserver agreement was almost perfect across the whole scale, with ICC values of 0.974 for the spine surgeon, 0.948 for the neurosurgery resident, and 0.963 for the neuro-radiologist.

Conclusion

The SLIC and severity scale is a comprehensive and easily applicable tool for patients with spine injury. Moreover, it is a very useful tool for communication among spine surgeons, neurosurgery residents and neuro-radiologists, with sufficient reproducibility.

Key Words: Cervical trauma · Sub-axial injury classification scale · Cervical spine injury · Interobserver agreement · Intraobserver reliability.

INTRODUCTION

Despite technological advances in spine surgery, classification of sub-axial cervical spine injuries remains largely descriptive, lacking standardization and any relationship to prognosis or clinical decision making¹⁰⁾. The sub-axial injury classification (SLIC) and severity scale was developed by Vaccaro et al.¹⁰⁾ to define a classification system for sub-axial cervical spine trauma that conveys information about injury pattern and severity as well as treatment considerations and prognosis, and it is now used worldwide¹,⁵,⁶,¹⁰⁾. The reproducibility and reliability of the SLIC scale are very important for deciding whether to operate and for communication among physicians. Unfortunately, only a few studies on this topic have been reported in the literature⁷⁾. Therefore, we conducted this study to evaluate the reliability of the SLIC among spine surgeons, neurosurgery residents and neuro-radiologists.

MATERIALS AND METHODS

This study was designed as a retrospective review of patients with sub-axial spine injury who were treated surgically. A total of 95 patients with sub-axial spine injury underwent surgery from April 2002 to December 2009. On review of the images, the radiological studies were of poor quality in 20 cases, and these cases were excluded from the study. Finally, 75 cases with a medical record, computed tomography (CT) and magnetic resonance imaging (MRI) were included as subjects in this study. The mean age of these subjects was 56 years (range, 18-82 years); 56 (74.7%) were male and 19 (25.3%) were female.

We classified the patients with cervical injury according to the SLIC and severity scale, the comprehensive classification system developed by Vaccaro et al.¹¹⁾ The SLIC scale uses three major categories as indicators to direct the treatment of subaxial injuries: 1) injury morphology, as determined by the pattern of spinal column disruption on available imaging studies; 2) integrity of the disco-ligamentous complex (DLC), represented by both the anterior and posterior ligamentous structures as well as the intervertebral disc; and 3) neurological status of the patient (Table 1). These three categories are widely recognized as predictors of clinical outcome and influence treatment decisions. Within each of the three categories, subgroups have been identified and graded from the least to the most severe (Table 1).

The SLIC and severity scale is a useful classification system for sub-axial cervical trauma, incorporating pertinent characteristics for generating prognoses and courses of management⁹⁾. A numeric value, specific to the descriptive identifier, is generated from each axis (injury morphology, integrity of the disco-ligamentous complex, and neurological status). Injury patterns that are known to result in worse outcomes or to require surgical intervention (spinal instability, neurological injury) are weighted to receive greater point values. These three numbers, one from each axis, are summed to provide an overall SLIC and severity scale score. The higher the number of points assigned to a particular category, the more severe the injury and the more likely it is that a surgical procedure is indicated. In this study, surgery was indicated for scores of more than 5 points on the SLIC and severity scale, and non-surgical treatment was chosen for scores of less than 3 points. A score of four is considered equivocal⁴⁾. All cases included in this study were surgically treated cases with more than 5 points on the SLIC and severity scale.
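The summation described above can be sketched in a few lines of Python. This is only an illustration: the class and function names are hypothetical, the point values passed in are example inputs rather than entries from Table 1, and the cut-offs are read here as "5 or more" for surgery and "3 or less" for non-surgical treatment, which is an assumption chosen so that a score of four remains the only equivocal value.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SlicScore:
    """One reading of a case: a point value for each of the three axes."""
    morphology: int  # injury morphology axis
    dlc: int         # disco-ligamentous complex axis
    neurology: int   # neurological status axis

    @property
    def total(self) -> int:
        # The overall score is the plain sum of the three axis scores.
        return self.morphology + self.dlc + self.neurology


def treatment_category(total: int) -> str:
    """Map a total score to a treatment category.

    ASSUMPTION: cut-offs are taken as >= 5 surgical and <= 3 non-surgical,
    leaving 4 as the equivocal score.
    """
    if total >= 5:
        return "surgical"
    if total == 4:
        return "equivocal"
    return "non-surgical"


if __name__ == "__main__":
    reading = SlicScore(morphology=4, dlc=2, neurology=2)  # example values only
    print(reading.total, treatment_category(reading.total))
```

Keeping the three axis scores separate, rather than storing only the total, makes it possible to compare observers axis by axis as well as on the summed score.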

The SLIC and severity scale score was determined retrospectively by three different observers, who carefully reviewed the preoperative plain radiographs, preoperative CT, preoperative MRI and the medical records. The observers were a spine surgeon, a neurosurgery resident and a neuro-radiologist. Patient information was not given to the observers, and the order of the patient list was randomly allocated. The neurological status score was derived from the medical record provided. Each case was scored twice, 4 weeks apart, to assess intraobserver reliability.
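Agreement between two sets of categorical scores, such as one observer's first and second readings of the same cases, can be quantified with Cohen's kappa. The sketch below computes the unweighted form in plain Python; the function name and the example scores are hypothetical, and whether the study's SPSS analysis used the unweighted or a weighted form is not stated here, so this should be read as one possible computation rather than the authors' procedure.

```python
from collections import Counter


def cohen_kappa(first, second):
    """Unweighted Cohen's kappa for two equally long lists of category labels."""
    if len(first) != len(second) or not first:
        raise ValueError("both readings must be non-empty and of equal length")
    n = len(first)

    # Observed agreement: fraction of cases given the same score both times.
    observed = sum(a == b for a, b in zip(first, second)) / n

    # Chance agreement: product of the marginal proportions, summed over categories.
    count_first = Counter(first)
    count_second = Counter(second)
    expected = sum(count_first[c] * count_second[c] for c in count_first) / (n * n)

    if expected == 1.0:
        # Every case received the same single score in both readings.
        return 1.0
    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    # Hypothetical DLC scores for eight cases, read twice by the same observer.
    session_1 = [0, 0, 1, 1, 2, 2, 2, 1]
    session_2 = [0, 0, 1, 2, 2, 2, 2, 1]
    print(round(cohen_kappa(session_1, session_2), 3))
```

Kappa corrects the raw proportion of matching scores for the agreement expected by chance, which is why it can be much lower than the simple percentage of matches when one category dominates.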

Consistency of the data across observers was assessed in two ways: between-observer reliability (interobserver agreement) and within-observer reliability (intraobserver reliability). The interobserver agreement and the intraobserver reliability were assessed for each axis and for the total sum of the SLIC and severity scale, and were evaluated using the intraclass correlation (ICC) together with Cohen's kappa. Cohen's kappa was interpreted according to the guideline of Viera and Garrett¹²⁾: κ≤0.20, slight agreement; 0.21-0.40, fair agreement; 0.41-0.60, moderate agreement; 0.61-0.80, substantial agreement; 0.81-0.99, almost perfect agreement. Statistical analyses were performed with SPSS 12.0 (SPSS, Chicago, IL, USA).
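The interpretation bands above can be expressed as a small lookup. This is a sketch under stated assumptions: the function name is hypothetical, values that fall between two printed bands (for example 0.603) are assigned using the upper bound of each band, values at or below 0.20 (including negative values) are all labelled slight, and a value of exactly 1 is labelled perfect, as the Results section does.

```python
def interpret_kappa(kappa: float) -> str:
    """Return the agreement label for a kappa (or ICC) value.

    ASSUMPTION: each band is closed at its upper bound, so a value lying
    between two printed bands goes to the higher band.
    """
    if kappa >= 1.0:
        return "perfect"  # assumption: 1.000 is reported as perfect agreement
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


if __name__ == "__main__":
    for value in (0.304, 0.775, 0.974):
        print(value, interpret_kappa(value))
```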

RESULTS

A total of 450 readings were analyzed in this study, as each of the 75 surgically treated cases of sub-axial spine injury was scored twice by each of the three observers. The interobserver agreement and the intraobserver reliability for each axis and for the total sum of the SLIC and severity scale are summarized in Table 2. The interobserver agreement differed by axis of the SLIC and severity scale. The interobserver agreement for injury morphology and for the total sum of the SLIC and severity scale was substantial [ICC value=0.603 (CI 0.455-0.731) and ICC value=0.775 (CI 0.665-0.860), respectively]. However, the interobserver agreement for the integrity of the DLC was only fair [ICC value=0.304 (CI 0.133-0.483)]. The intraobserver reliability of the SLIC and severity scale was almost perfect. The Cohen's kappa for intraobserver reliability was 0.921 for injury morphology, 0.876 for the integrity of the DLC, and 0.962 for the total sum of the SLIC and severity scale. For the neurological status, both the interobserver agreement and the intraobserver reliability showed perfect agreement (ICC value=1.000), because this score was taken from retrospective review of the medical records.

More detailed results for each of the different observers are shown in Table 3. The intraobserver reliability of the SLIC and severity scale was almost perfect in ICC value for every observer. The results for the injury morphology axis, the neurological status axis and the total sum of the SLIC and severity scale were similar between observers. However, the results for the integrity of the DLC differed slightly: the agreement of the neuro-radiologist was higher than that of the spine surgeon or the neurosurgery resident (ICC value=0.963 vs. ICC value=0.814 or 0.852). The neuro-radiologist's reproducibility in assessing the integrity of the DLC was better than that of the other observers.

DISCUSSION

Sub-axial spine injuries account for the majority of cervical injuries, making up about 65% of fractures and more than 75% of all dislocations¹³⁾. Treatment options are conservative treatment and surgical treatment. Posterior stabilization of the cervical spine is a common surgical procedure used in a variety of spinal disorders, including cervical spondylosis, postsurgical deformity or instability, tumor and trauma⁴,¹²⁾. Because surgical treatment may result in complications, it is crucial to determine whether surgical treatment is necessary. For this reason, many studies have reported classifications of spine injuries according to severity¹,⁵,⁸,¹¹⁾.

Historically, Holdsworth⁶⁾ first emphasized the importance of the posterior ligamentous complex for maintaining spinal stability in 1970. He published an early paradigm for categorizing spinal injuries, which was predicated upon the presumed mechanism of injury. Despite this bold attempt, the system is limited in that it does not differentiate cervical conditions from those affecting the thoracolumbar spine.

In 1982, Allen et al.¹⁾ proposed their mechanistic classification scheme for subaxial injuries, which was also based upon the findings of plain radiographs. In 1986, Harris et al.⁵⁾ took into account the rotational vectors of the forces applied to the spine. However, these systems are limited because the mechanism of injury is inferred from plain radiographs alone¹⁴⁾. Despite this long-standing effort to characterize subaxial trauma, no single system has gained widespread acceptance.

In 2007, a subcommittee of the Spine Trauma Study Group introduced the SLIC and severity scale with the participation of about 30 spine surgeons. This scoring system consists of injury morphology, integrity of the DLC, and the patient's neurological status. It was intended to compensate for the limitations of previous classification systems. That study reported that the SLIC and severity scale has superior validity and reliability compared with previous systems¹⁰⁾.

However, the SLIC and severity scale has some limitations. First, the preliminary study that introduced the SLIC and severity scale was quite restricted, involving only eleven cases. Related research since that first study has collected a large amount of data and verified the reliability and validity of the SLIC and severity scale, and the present study provides further support²,³⁾. Second, the participants in the preliminary study were limited to clinical spine surgeons. An ideal classification system should be a tool for communication not only between spine surgeons but also with the neuro-radiologists and neurosurgery residents who work with them.

Therefore, in this study the intraobserver reliability and interobserver agreement were evaluated for three different types of physicians: a spine surgeon, a neurosurgery resident and a neuro-radiologist. The results indicate that the intraobserver reliability and interobserver agreement of the SLIC and severity scale are sufficient for classifying cervical spine injuries and for communication among physicians of several departments. Cohen's kappa is commonly used to assess clinical reliability, and a kappa of at least 0.55 is considered to indicate acceptable reliability⁸⁾. Although Oner et al.⁶⁾ argued that this 0.55 standard is too strict to apply to a classification system for traumatic vertebral fractures, our study showed excellent reliability.

In detail, the intraobserver reliability of all three observers showed almost perfect agreement (ICC value >0.8). The interobserver agreement for the total score showed substantial agreement (ICC value=0.775). It is notable that the intraobserver reliability for the DLC component was higher in the neuro-radiologist than in the spine surgeon or the neurosurgery resident. This may reflect the difficulty of classifying the DLC component as 'intermediate' or 'disrupted' when it is assessed only by imaging studies. Consequently, it may be difficult to establish an objective and definite standard for assessing the severity of injury based on images (Fig. 1).

Although this study showed sufficient reproducibility of the SLIC and severity scale, it has some limitations. First, because the data came from patients who had undergone surgery, the SLIC and severity scale scores were higher than in previous studies (median 7.86, range from 4 to 10). For example, Vaccaro et al.¹⁰⁾ reported an ICC value of 0.83 for intraobserver reliability and 0.71 for interobserver agreement, which are lower than the values in our study, 0.962 and 0.775, respectively. The inclusion of severely injured patients with distinct imaging findings likely contributed to the high reliability results. Second, the high reliability results could have been affected by the fact that all three physicians worked in the same medical center. Another limitation is that the data were collected retrospectively, so the neurological status component was assessed by reviewing the medical chart, which resulted in an ICC value indicating perfect agreement. A prospective study of cervical trauma should be done to overcome these limitations.

CONCLUSION

The SLIC and severity scale is a comprehensive and easily applicable tool for patients with spine injury. It appears to be a very useful tool for communication among spine surgeons, neurosurgery residents and neuro-radiologists, with sufficient reproducibility.