Literature DB >> 30796507

The diagnostic accuracy of imaging modalities to detect pseudarthrosis after spinal fusion-a systematic review and meta-analysis of the literature.

Marloes J M Peters1, Carolien H G Bastiaenen2, Boudewijn T Brans3, René E Weijers4, Paul C Willems5.   

Abstract

OBJECTIVE: The aim of the study was to determine the diagnostic accuracy of imaging modalities to detect pseudarthrosis after thoracolumbar spinal fusion, with surgical exploration as reference standard.
MATERIALS AND METHODS: A systematic literature search for original studies was performed on the diagnostic accuracy of imaging as index test compared to surgical exploration as reference standard to diagnose pseudarthrosis after thoracolumbar spinal fusion. Diagnostic accuracy values were extracted and methodologic quality of studies was evaluated by the Quality Assessment of Diagnostic Accuracy Studies 2 (QUADAS-2) tool. Per modality, clinically comparable studies were included in subgroup meta-analysis and weighted odds ratios (ORs) were calculated using the random effects model.
RESULTS: Fifteen studies were included. Risk of bias was classified as high/unclear in 58% of the studies. Concerns of applicability was classified as high/unclear in 40% of the studies. Four scintigraphy studies including 93 patients in total were pooled to OR = 2.91 (95% confidence interval [CI]: 0.93-9.13). Five studies on plain radiography with 398 patients in total were pooled into OR = 7.07 (95% CI: 2.97-16.86). Two studies evaluating flexion-extension radiography of 75 patients in total were pooled into OR = 4.00 (95% CI: 0.15-105.96). Two studies of 68 patients in total were pooled for CT and yielded OR = 17.02 (95% CI: 6.42-45.10). A single study reporting on polytomography, OR = 10.15 (95% CI 5.49-18.78), was also considered to be an accurate study.
CONCLUSIONS: With a pooled OR of 17.02, CT can be considered the most accurate imaging modality for the detection of pseudarthrosis after thoracolumbar spinal fusion from this review.

Entities:  

Keywords:  Diagnostics accuracy; Imaging; Meta-analysis; Pseudarthrosis; Spinal fusion

Mesh:

Year:  2019        PMID: 30796507      PMCID: PMC6702186          DOI: 10.1007/s00256-019-03181-5

Source DB:  PubMed          Journal:  Skeletal Radiol        ISSN: 0364-2348            Impact factor:   2.199


Introduction

Low back pain is a global health and socio-economic problem [1], as it is the leading cause of disability and work absenteeism in the Western world [2]. When conservative measures fail, operative intervention can be considered. Spinal fusion is a surgical procedure in which rigid fixation of vertebral segments is achieved by means of osteosynthesis and bone grafting to create definite bony fusion of the vertebrae involved. Failed spinal fusion may occur in 30–40% of spinal fusion patients [3, 4]. Pseudarthrosis is defined as the absence of solid bony fusion at a minimum follow-up of 6 months after surgery [5, 6]. Pseudarthrosis can be associated with persistent or recurrent back and/or leg pain [7], but can also be asymptomatic [7-9]. Whether symptomatic or asymptomatic, pseudarthrosis increases the risk of material failure, late deformity, and neurological symptoms [10, 11]. Revision surgery is the preferred treatment in spinal fusion patients suffering from symptoms due to pseudarthrosis. Revision surgery is invasive, expensive, and may have a worse outcome than primary surgery [12, 13] and should only be performed when the pseudarthrosis diagnosis is irrefutable. Since symptoms of pseudarthrosis may be nonspecific and multiple individual sources of pain may contribute to the complex of symptoms [14], diagnostic tools are required to set the diagnosis. The gold standard for the diagnosis of pseudarthrosis is surgical exploration [5, 7, 15, 16], an invasive, costly, and nowadays rarely used test which is not desirable or ethical in patients without symptoms. The aim of the study was to determine the diagnostic accuracy of imaging modalities to detect pseudarthrosis after thoracolumbar spinal fusion, with surgical exploration as the reference standard.

Materials and methods

Identification of studies

This review was performed according to the PRISMA statement guidelines [17, 18]. A systematic literature search was conducted in the PubMed, EMBASE, and CINAHL databases from inception until February 2017 to identify relevant studies. A list of keywords and text words was formulated to describe the detection of pseudarthrosis by imaging as index test compared to surgical exploration as reference standard in patients after spinal fusion surgery. Terms for imaging: tomography, radiography, plain radiographs, MRI, CT, scintigraphy, SPECT, SPECT/CT, PET, PET/CT, DEXA. Terms for study design: diagnostic accuracy, precision, predictive value, sensitivity, specificity, false positive, false negative. Terms for patient population: spine, vertebrae, vertebral column, spinal fusion, spinal arthrodesis, spondylodesis, bone graft, pseudarthrosis, non-union, delayed union, clinical failure, surgical exploration, re-operation, second-look operation. The search was limited to the English language. Once the search was completed, the resulting articles were checked for duplicates. Subsequently, two independent reviewers (PW, orthopedic surgeon with over 10 years of experience in spinal surgery and MP, junior researcher specialized in imaging) screened the identified citations to determine whether they met predefined in- and exclusion criteria. If disagreements could not be resolved by consensus, a third reviewer (CB, clinical epidemiologist with over 15 years of experience in conducting systematic reviews) was consulted. Only original studies that provided data to construct contingency tables were included. Exclusion criteria were spinal fusion for the indications bone fracture, tumor, infection; time interval between surgery and index test less than 6 months; patient population smaller than ten; cervical fusion; animal studies; in vitro studies.

Data extraction

Standard reference data, population characteristics, details on spinal fusion, index test, reference test, and time intervals were extracted by the reviewers (PW, MP). Disagreements were resolved by consensus. Besides study characteristics, diagnostic accuracy data was extracted. Since the outcome was dichotomous (diagnosis was either pseudarthrosis or fusion), contingency tables were constructed. We also recorded whether the results originated from per-patient-, per-level-, or per-side-based analysis.

Methodological quality

The methodological quality of each selected study was assessed independently by the reviewers according to the Quality Assessment for Diagnostic Accuracy Studies 2 (QUADAS-2) tool [19]. The QUADAS-2 tool consists of four key domains that discuss patient selection, index test, reference standard, flow of patients through the study, and timing of the index test and reference standard. Each domain was scored in terms of risk of bias and concerns regarding applicability to the research question. Disagreements were resolved by consensus.

Data synthesis and statistical analysis

Pseudarthrosis was defined as a positive test result and fusion as a negative test result. Diagnostic accuracy values were calculated from the extracted contingency tables. Continuity correction was applied to studies with zero-cell counts by adding 0.5 to all cells of the study [20]. Per index test, the studies describing that test were considered for inclusion into subgroup meta-analysis.

Inclusion in meta-analysis

Meta-analysis was only performed when studies evaluating the same modality were not significantly hampered by clinical heterogeneity. Studies were considered clinically heterogeneous when patient groups, outcome measures, and/or the execution of index tests were considerably different. The random effect model was employed during meta-analyses to account for unobserved sources of variation [21]. The odds ratio (OR) was used as the principal summary measure in meta-analysis. The higher the OR, the better the discriminatory performance. An OR of 1 indicates a test that does not discriminate between patients with pseudarthrosis and patients with fusion [22]. An OR below 1 suggests a negative association between index test and surgical exploration. Analyses were performed using the Stata statistical software package, version 14.1 (StataCorp, College Station, TX, USA).

Results

One hundred sixty-five potentially relevant references were identified through database search. After screening, 15 studies were included in this review, reporting on eight modalities: plain radiography, flexion extension radiography (FE radiography), computed tomography (CT), single-photon emission computed tomography (SPECT), planar scintigraphy, polytomography, ultra sound/sonography (US) and 18F-fluoride positron emission tomography/computed tomography (PET/CT). The study selection flowchart is detailed in Fig. 1. The level of evidence of the included studies ranged from I to III.
Fig. 1

Flowchart showing the selection of studies from electronic search (identification) until inclusion in the subgroup meta-analyses. Initially, 165 potentially relevant references were identified through database search. One hundred thirty-two were obtained for further screening after removal of 33 duplicates. After removal based on title and abstract screening, the full text of 35 articles was screened and their reference sections were scanned for additional eligible studies. Hereafter, 15 studies were included this review, reporting on eight modalities. The meta-analysis part at the bottom of the figure will be discussed in ‘inclusion in meta-analysis’, which can be found hereafter in the result section. * 3 of the 15 studies described 2 to 4 modalities, leading to 22 included items

Flowchart showing the selection of studies from electronic search (identification) until inclusion in the subgroup meta-analyses. Initially, 165 potentially relevant references were identified through database search. One hundred thirty-two were obtained for further screening after removal of 33 duplicates. After removal based on title and abstract screening, the full text of 35 articles was screened and their reference sections were scanned for additional eligible studies. Hereafter, 15 studies were included this review, reporting on eight modalities. The meta-analysis part at the bottom of the figure will be discussed in ‘inclusion in meta-analysis’, which can be found hereafter in the result section. * 3 of the 15 studies described 2 to 4 modalities, leading to 22 included items Study characteristics of the 15 included studies are listed in Table 1. The number of levels fused in a single patient during initial surgery ranged from 1 to 13 levels. Eight articles monitored pseudarthrosis per patient, five monitored each level separately, and two made a distinction between the left and right side of each operated level. All articles reported that persistent low back pain and/or suspicion of pseudarthrosis was the reason for surgical exploration. The time interval between initial surgery and surgical exploration ranged from 6 to 120 months.
Table 1

Study characteristics

Author, yearPatient characteristicsFusion surgery characteristicsModality, index testSurgical exploration (SE), reference standardTime intervals
IndicationFusion technique (number of patients/total)Modality (number of patients/levels /sides)Settings/protocol of modalityDefinition pseudarthrosis (PA)Description of intraoperative assessmentDefinition pseudarthrosis (PA)Between fusion surgery and modalityBetween modality and SE
McMaster et al., 1980 [23]

- 110 patients in study and scored

- average age: 13 years, age range: 8–25 years

- multiple levels fused

- SE for PA suspicion

ScoliosisInstrumented interfacetal fusion (110/110), with posterior iliac crest autograft (86/110)Scintigraphy (110 patients)Scanning 3 h after intravenous injection of technetium-99 m; nuclear scanning gamma cameraLocalized or generalized patchy areas of increased uptakeRemoval of soft tissue; exploration of fusion mass; movement of spinous processes is notedIrregular crevice filled with fibrous tissue; hairline PA; defective area in cancellous bone6 months2 days
Slizofski et al., 1987 [24]

- 26 patients in study, 11 patients were scored

- median age: 58 years, age range: 27–71 years

- 1–5 levels fused

- SE for persistent low back pain

Degenerative disc or facet disease, spondylolisthesisBilateral transverse process fusion (16/26), posterior facet fusion (10/26); instrumented (19/26)SPECT (11 patients)Scanning 3 h after intravenous injection of technetium-99 m; 6-mm-thick tomogramsFocal areas of increased activity within the fusion mass noticed by ≥ 2 observersNSNSMedian: 20 months, range: 6–120 monthsNS
Laasonen et al., 1988 [25]

- 48 patients in study, 20 patients were scored- age range: 16–64 years

- 1–3 levels fused

- SE for persistent low back pain w/or w/o radiating leg pain

Spondylolisthesis, chronic pain after disc surgery, miscellaneousPosterolateral fusion (48/48) combined with intercorporal fusion (4/48)CT (48 patients)Slice thickness: 6 mm; reconstructions: selective sagittalNSRemoval of fragmentation/ PA or widening of the spinal / nerve root canalNS7.3 yearsNS
Brodsky et al., 1991 [15]

- 175 patients in study, 214 sides were scored

- age range: 17–79 years

- 1–6 levels fused

- SE for hardware removal / PA suspicion / stenosis / radiculopathy

NSPosterolateral fusion (175/175)Plain radiography (214 sides)Anteroposterior, lateral, obliqueNSVisualization of bony bridging; manual manipulationLack of solid bony bridging; motion between vertebraeMean: 33.7 monthsNS
Polytomography (68 sides)Anteroposterior, obliqueNS
FE radiography (64 sides)Biplane bending filmsNS
CT (42 sides)NSNS
Blumenthal et al., 1993 [26]

- 49 patients in study and scored

- age range: 22–54 years

- 1–2 levels fused

- SE for persistent low back discomfort

NSInterbody fusion combined with posterolateral fusion (49/49)Plain radiography (49 patients)Anteroposterior, lateralBlindly judged by two spinal surgeons and two musculoskeletal radiologistsBone mass inspection; mechanical stress test (Kocher clamp)No bridging bony mantle; motionAverage: 9 monthsJust before surgery
Kant et al., 1995 [27]

- 75 patients in study, 126 levels were scored

- age: NS

- NS levels fused

- SE for persistent low back pain / radiculopathy / PA on radiography / infection

NSPosterolateral fusion (75/75), combined with interbody fusion with cancellous bone chips (37/75)Plain radiography (126 levels)Five viewsNo solid intertransverse bone or facet joint fusion, judged by an uninvolved orthopedic surgeonRemoval of hardware and soft tissue; exploration of motion, bone mass, facet joints, intertransverse process areaAbsence of solid intertransverse bone mass or obliteration of facet joints; interspinous or facet motionMean: 51 weeks1–4 weeks
Larsen et al., 1996 [14]

- 25 patients in study and scored

- age: NS

- 1–3 levels fused

- SE for persistent and severe pain

NSInstrumented posterolateral lumbar fusion (25/25)Plain radiography (21 patients)Anteroposterior, lateral, oblique; while standingNo bridging bony trabeculaeRemoval of instrumentation; fusion inspectionNSNS, fusion surgery to SE: > 1 yearNS
FE radiography (11 patients)While standing> 3 degrees of motion
CT (24 patients)Slice thickness: 5 mm; reconstructions: sagittal and coronal; overlapping slicesNo bridging bony trabeculae
Scintigraphy (20 patients)Intravenous injection of technetium-99 mIncreased uptake
Albert et al., 1997 [28]

- 38 patients in study and scored

- mean age: 42.8 years, age range: 22–73 years

- NS levels fused

- SE for persistent pain

Lumbar degenerative disc disease, kyphosisInstrumented (35/38) spinal fusion (38/38)SPECT (38 patients)NSIncreased uptake beyond background signal judged by one blinded nuclear radiologistRemoval of instrumentation; subperiosteal inspection; stress testing using curettesMotion between two fused levelsAverage: 23.9 months, range: 9–120 monthsNS
Jacobson et al., 1997 [29]

- 10 patients in study, 20 sides were scored

- average age: 43.2 years,

age range: 23–69 years

- 1–13 levels fused

- SE for persistent pain and PA suspicion

Postlaminectomy, scoliosis, spondylolisthesisInstrumented (9/10) posterolateral fusion with iliac crest autograft (10/10)Ultrasound (20 sides)Patient in prone position; posterior approach; linear 7.5 MHz and curvilinear 5.0 MHz transducersNo bridging bone visible / presence of scattered and nonbridging echogenic foci at the fusion site, judged by one musculoskeletal radiologistRemoval of hardware; visual inspection of bridging bone; motion assessmentNo solid bridging between vertebral segmentsAt least 9 monthsLess than 1 week
Bohnsack et al., 1999 [30]

- 42 patients in study and scored

- mean age: 42 years

- average of 4 levels fused

- SE for persistent back pain fusion area

NSDorsolateral fusion (32/42), combined procedures (10/42)Scintigraphy (42 patients)Intravenous injection of technetium-99 mReport by radiologist of external institutionRemoval of hardwareNSMean: 27 monthsJust before SE
Brantigan et al., 2000 [31]

- 221 patients in study, 115 levels were scored

- age range: 24–77 years

- NS levels fused

- SE for disabling back and/or radicular pain and degenerative change on MRI or discogram

Recurrent disc disease, spondylolisthesis, failed fusionInstrumented posterior lumbar interbody fusion with cages (221/221)Plain radiography (115 levels)StandardLucency in bony bridging of the disc spaceFusion status examinationNSNS, fusion surgery to SE: 24 monthsNS
Carreon et al., 2007 [16]

- 93 patients in study, 163 levels scored

- mean age: 57 years, age range: 19–86 years

- 1–4 levels fused

- SE for PA suspicion, painful instrumentation, adjacent level degeneration

NSInstrumented posterolateral lumbar fusion (93/93)CT (163 levels)Slice thickness: 1 mm; reconstructions: sagittal and coronalNo obliteration of the facet joint space; interrupted trabeculated bone between transverse processesInspection of bony continuity; motion evaluation with laminar spreader between screwheads and pediclesNoncontinuity of bone; presence of motionNS, fusion surgery to SE average: 49 monthsAverage: 4 months
Fogel et al., 2007 [32]

- 90 patients in study, 172 levels were scored

- average age: 43 years, age range: 27–70 years

- NS levels fused

- SE for persistent low back pain, adjacent level instability / stenosis / PA on radiography

Degenerative disc disease, failed back surgery and spondylolisthesisInstrumented posterior lumbar interbody fusion with cages and iliac crest autograft, combined with posterolateral fusion (90/90)Plain radiography (interbody, 172 levels)Anteroposterior, lateral; parallel to each levelTwo of the authors blindly and independently graded for evidence of interbody and posterolateral pseudarthrosis

Removal of hardware and soft tissue;

exploration of fusion mass, facet joints, intertransverse areas;

exploration of motion by distraction and compression

Posterolateral PA:

defect in the bridging bone;

visible motion in the posterolateral fusion area

Interbody PA:

any relative motion between segments

Before SE; fusion surgery to SE range: 12–65 monthsNS
Plain radiography (posterolateral, 172 levels)
CT (interbody, 109 levels)Slice thickness: 1 mm; reconstructions: sagittal and coronal; high-speed helical CT scannerGraded by 1 of 3 blinded radiologists; posterolateral PA = graft resorption; no fusion mass (Lenke B, C, D) interbody PA = collapse of construct, loss of disc height;cage displacement; bone resorption; graft lucency (BSF-1 and BSF-2)

Average: 30 months,

Range: 10–60 months

NS
CT (posterolateral, 109 levels)
Carreon et al., 2008 [33]

- 49 patients in study, 69 levels scored

- mean age: 43 years,

age range: 21–65 years

- 1–3 levels fused

- SE for preoperative PA diagnosis / adjacent level degeneration

NSPosteriorly instrumented (28/69) anterior interbody fusion with metallic cages (69/69)CT (69 levels)Slice thickness: fine-cut axial cuts reconstructions: sagittal and coronal; bone and soft tissue windowsFive experienced spine surgeons were asked to consider the disc space medial and lateral to cages, anterior and posterior to cages, and through cagesInspection of fusion mass; distraction forces to detect motionAbsence of bony continuity; presence of motion across the fused levelsNS, fusion surgery to SE average: 22 monthsNS
Quon et al., 2012 [34]

- 22 patients in study, 15 levels were scored

- age range: 36–80 years

- 1 level fused

- SE for recurrent symptoms after spinal fusion, equivocal CT, based on PET/CT

NSNS18F-fluoride PET/CT (15 levels)CT: slice thickness: 1.25–2 mm; 100–140 kV; 180–230 mAs; PET: scanning 45 min after intravenous injection of 222–370 MBq 18F-NaFNuclear medicine physician and a radiologist with musculoskeletal expertise reviewed the PET/CT images for lesions amenable to surgical interventionProbing manually testing the fusion region for loosening and hardware failure at sites of abnormal tracerNSRange: 8–96 monthsNS

SE surgical exploration, PA pseudarthrosis, mm millimeter, NS not specified, MHz megahertz, kV kilovolt, mAs milliampere second, MBq megabecquerel, NaF sodium fluoride

Study characteristics - 110 patients in study and scored - average age: 13 years, age range: 8–25 years - multiple levels fused - SE for PA suspicion - 26 patients in study, 11 patients were scored - median age: 58 years, age range: 27–71 years - 1–5 levels fused - SE for persistent low back pain - 48 patients in study, 20 patients were scored- age range: 16–64 years - 1–3 levels fused - SE for persistent low back pain w/or w/o radiating leg pain - 175 patients in study, 214 sides were scored - age range: 17–79 years - 1–6 levels fused - SE for hardware removal / PA suspicion / stenosis / radiculopathy - 49 patients in study and scored - age range: 22–54 years - 1–2 levels fused - SE for persistent low back discomfort - 75 patients in study, 126 levels were scored - age: NS - NS levels fused - SE for persistent low back pain / radiculopathy / PA on radiography / infection - 25 patients in study and scored - age: NS - 1–3 levels fused - SE for persistent and severe pain - 38 patients in study and scored - mean age: 42.8 years, age range: 22–73 years - NS levels fused - SE for persistent pain - 10 patients in study, 20 sides were scored - average age: 43.2 years, age range: 23–69 years - 1–13 levels fused - SE for persistent pain and PA suspicion - 42 patients in study and scored - mean age: 42 years - average of 4 levels fused - SE for persistent back pain fusion area - 221 patients in study, 115 levels were scored - age range: 24–77 years - NS levels fused - SE for disabling back and/or radicular pain and degenerative change on MRI or discogram - 93 patients in study, 163 levels scored - mean age: 57 years, age range: 19–86 years - 1–4 levels fused - SE for PA suspicion, painful instrumentation, adjacent level degeneration - 90 patients in study, 172 levels were scored - average age: 43 years, age range: 27–70 years - NS levels fused - SE for persistent low back pain, adjacent level instability / stenosis / PA on radiography Removal of hardware and soft tissue; exploration of fusion mass, facet joints, intertransverse areas; exploration of motion by distraction and compression Posterolateral PA: defect in the bridging bone; visible motion in the posterolateral fusion area Interbody PA: any relative motion between segments Average: 30 months, Range: 10–60 months - 49 patients in study, 69 levels scored - mean age: 43 years, age range: 21–65 years - 1–3 levels fused - SE for preoperative PA diagnosis / adjacent level degeneration - 22 patients in study, 15 levels were scored - age range: 36–80 years - 1 level fused - SE for recurrent symptoms after spinal fusion, equivocal CT, based on PET/CT SE surgical exploration, PA pseudarthrosis, mm millimeter, NS not specified, MHz megahertz, kV kilovolt, mAs milliampere second, MBq megabecquerel, NaF sodium fluoride

Methodological quality assessment

Table 2 displays the quality assessment according to QUADAS-2. An overview of the distribution of QUADAS-2 scores is presented in Fig. 2. Risk of bias on ‘flow and timing’, ‘patient selection’, ‘index test’, and ‘reference standard’ was classified as high or unclear in 58% of cases. Common weaknesses related to poor documentation of patient selection and description of the reference standard. Two studies were considered to have low risk of bias in all four domains. Concerns of applicability on ‘patient selection’, ‘index test’, and ‘reference standard’ was classified as high or unclear in 42% of cases. Three studies were considered to suffer from low applicability concerns over all three domains.
Table 2

QUADAS-2 results for the 15 studies included in this review

Fig. 2

Stacked bar charts of QUADAS-2 scores presenting a quick overview of the methodological quality of the 15 included studies, expressed as a percentage of studies that met each criterion. For each quality domain, the proportion of included studies that suggest low, high, or unclear risk of bias and/or concerns regarding applicability are displayed in green, orange, and blue, respectively

QUADAS-2 results for the 15 studies included in this review Stacked bar charts of QUADAS-2 scores presenting a quick overview of the methodological quality of the 15 included studies, expressed as a percentage of studies that met each criterion. For each quality domain, the proportion of included studies that suggest low, high, or unclear risk of bias and/or concerns regarding applicability are displayed in green, orange, and blue, respectively Table 3 shows the diagnostic accuracy values of the included studies, grouped per index test.
Table 3

Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), positive and negative likelihood ratios (LR+, LR-), prevalence of pseudarthrosis, accuracy ((true positive + true negative) / (total)) and OR values with corresponding 95% confidence intervals for the seven index tests

AuthorSensitivitySpecificityPPVNPVLR+LR-PrevalenceAccuracyOR(95% CI)
Scintigraphy
 McMaster et al. 1980 [23]0.860.940.500.9914.710.150.060.9497.00(10.00–940.69)
 Slizofski et al. 1987 [24]0.750.830.940.504.500.300.770.7715.00(0.52–430.47)
 Larsen et al. 1996 [14]0.250.830.500.631.500.900.400.601.67(0.18–15.13)
 Albert et al. 1997 [28]0.500.580.410.671.200.860.370.551.40(0.37–5.27)
 Bohnsack et al. 1999 [30]0.500.920.400.956.330.540.100.8811.67(1.18–114.90)
Plain radiography
 Brodsky et al. 1991 [15]0.600.890.780.765.300.450.410.7711.77(8.13–17.04)
 Blumenthal et al. 1993 [26]0.550.710.180.931.900.630.100.693.01(1.55–5.84)
 Kant et al. 1995 [27]0.380.850.540.762.570.720.310.713.56(1.48–8.52)
 Larsen et al. 1996 [14]0.890.420.530.831.520.270.430.625.71(0.53–61.41)
 Brantigan et al. 2000 [31]0.550.970.670.9518.910.470.100.9340.40(7.75–210.65)
 Fogel et al. 2007 [32]0.900.770.101.003.850.130.030.7729.51(1.55–560.03)
FE radiography
 Brodsky et al. 1991 [15]0.370.960.860.719.740.660.390.7314.86(5.48–40.28)
 Larsen et al. 1996 [14]0.100.810.250.590.531.110.380.540.48(0.02–14.70)
CT
 Laasonen et al. 1988 [25]0.800.800.800.804.000.250.500.8016.00(1.78–143.15)
 Brodsky et al. 1991 [15]0.390.280.130.630.552.140.220.310.26(0.12–0.57)
 Larsen et al. 1996 [14]0.780.530.500.801.670.420.380.634.00(0.62–25.96)
 Carreon et al. 2007 [16]0.910.690.410.972.900.140.200.7321.22(6.11–73.67)
 Fogel et al. 2007 [32]0.900.700.130.993.030.140.050.7121.29(1.11–407.21)
 Carreon et al. 2008 [33]0.930.460.570.901.730.140.430.6712.00(2.51–57.48)
Polytomography
 Brodsky et al. 1991 [15]0.840.650.730.792.440.240.530.7510.15(5.49–18.78)
US
 Jacobson et al. 1997 [29]0.950.590.700.932.330.080.500.7730.33(1.39–660.76)
PET/CT
 Quon et al. 2012 [34]0.970.250.910.501.290.130.880.889.67(0.14–688.10)
Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), positive and negative likelihood ratios (LR+, LR-), prevalence of pseudarthrosis, accuracy ((true positive + true negative) / (total)) and OR values with corresponding 95% confidence intervals for the seven index tests The studies discussing the index tests SPECT [24, 28] and planar scintigraphy [14, 23, 30] were considered for inclusion into subgroup meta-analysis further referred to as scintigraphy. McMaster et al. was not included because the time interval between fusion surgery and surgical exploration was deviating too much from the other studies. The remaining four studies were pooled. Six studies were considered for inclusion in meta-analysis for plain radiography [14, 15, 26, 27, 31, 32]. Fogel et al. was excluded since the low prevalence of pseudarthrosis made the study population incomparable to the other studies (see Table 3). The remaining five studies were considered comparable enough to be pooled. Two articles diagnosed pseudarthrosis per patient [14, 26], two per level [27, 31], and one per side [15]. We chose to pool these studies despite differences in analysis region since we were mainly interested in the correlation between findings on imaging and surgical exploration. Using the same rationale, no distinction was made between studies on posterolateral and interbody fusion. Two articles were considered for FE radiography meta-analysis [14, 15]. Apart from differences in analysis regions, the study characteristics were considered comparable and the studies were therefore pooled. Six articles were considered for inclusion in CT meta-analysis [14–16, 25, 32, 33]. The study of Brodsky et al. was excluded for lack of sagittal and coronal reconstructions, essential in the assessment of interbody bony fusion [14, 16, 33, 35]. Laasonen et al. and Larsen et al. were excluded on slice thickness. Thicknesses of 5 and 6 mm were used respectively, while bony bridging should be assessed using thin slice CT to be reliable [16, 32, 33, 35]. Fogel et al. was excluded for low prevalence of pseudarthrosis compared to the other studies. The posterolateral fusion patient group of Carreon et al. [16] and the interbody fusion patient group Carreon et al. [33] were pooled for CT. Figure 3 shows a forest plot of the studies selected for subgroup meta-analysis, with their respective weights and resulting pooled ORs. Index tests for which only one study was identified, i.e., US, polytomography, 18F-fluoride PET/CT [15, 29, 34], could inevitably not undergo subgroup meta-analysis. These single studies were, however, evaluated on the same grounds and if considered reliable, included in Table 4 to complement the meta-analysis results. This was only the case for the study on polytomography [15]. For the study on US [29], the authors considered that with the evaluation of ten patients only, US was not investigated thoroughly enough for pseudarthrosis detection. In the 18F-fluoride PET/CT study [34], the reference standard was either surgical exploration or clinical follow-up, based on the index test outcome. This introduced a bias in the patient population that underwent surgical exploration; only patients with a suspicion of pseudarthrosis on 18F-fluoride PET/CT were surgically explored and used to calculate diagnostic accuracy.
Fig. 3

Forest plot of the included studies in the meta-analysis per modality. The size of each square is proportional to the study’s weight

Table 4

Overview of ORs as determined from included studies

Number of studiesNumber of patients(Pooled) OR [95% CI]
Scintigraphy [14, 24, 28, 30]4932.91 [0.93–9.13]
Plain radiography [14, 15, 26, 27, 31]53987.07 [2.97–16.86]
FE radiography [14, 15]2754.00 [0.15–105.96]
CT [16, 33]214217.02 [6.42–45.10]
Polytomography [15]16810.15 [5.49–18.78]
Forest plot of the included studies in the meta-analysis per modality. The size of each square is proportional to the study’s weight Overview of ORs as determined from included studies

Discussion

This systematic review summarizes studies in literature that investigated the diagnostic accuracy of imaging modalities to detect pseudarthrosis after thoracolumbar spinal fusion with surgical exploration as the reference standard. Diagnostic accuracy values of individual studies were determined, and for studies of the same modality that were clinically comparable, a pooled OR was calculated. Patients after spinal fusion can be monitored by several modalities. Plain radiographs attempt to reveal deficient morphology of the fusion mass as a sign of pseudarthrosis. However, plain radiographs are projections only [35, 36] whereas pseudarthrosis is a three-dimensional problem. The pooled OR of radiography was 7.07. In FE radiography, radiographs are made during flexion and extension of the spinal column to detect motion in the operated segment as a sign of pseudarthrosis. Cases exist where no signs of pseudarthrosis were found on plain radiography, CT, and MRI, but FE radiography detected the pseudarthrosis by unveiling motion between the segments [37]. However, on the other hand, absence of motion does not necessarily correspond with solid fusion and the presence of motion is not directly related to pseudarthrosis [12, 38–40]. Furthermore, no consensus exists on the threshold of allowable motion in a fused segment [40-42]. With a pooled OR of 4.00, FE radiography does not seem to outperform plain radiography. In polytomography, several radiographs along different sectional planes are taken. Going from a single slice in radiography to several planes in polytomography, the OR increased to 10.15. However, polytomography seems to be outdated by CT developments and currently not frequently used. CT offers three-dimensional osseous detail [33, 35]. After meta-analysis, CT was the modality with the highest OR in this review. Besides detection of bridging trabecular bone, CT is able to detect subsidence and lucency around fusion material as possible signs of pseudarthrosis [35]. On the downside, assessment can be complicated by artefacts when metallic cages and/or instrumentation are used [14, 32, 33, 35]. Technological improvements such as iterative reconstruction and dual-energy scanning are likely to improve accuracy [43]. Whether CT alone is sufficient for clinical decision-making is under debate. Choudhri et al. stated that multiple modalities should be considered for the noninvasive evaluation of symptomatic patients with suspected failure of spinal fusion [38]. US can demonstrate callus formation and bone healing [44, 45]. Although the first study assessing the role of US for pseudarthrosis detection in ten patients seemed promising in 1997 [29], it has been the only study since. Pseudarthrosis diagnosis can also be based on abnormalities in bone metabolism. Studies on SPECT and planar scintigraphy were grouped together in meta-analysis since both modalities use 99mTc-labeled phosphonates as tracer. 99mTc-labeled phosphonates are adsorbed onto or into the crystalline structure of hydroxyapatite to mark bone remodeling. With a pooled OR of 2.91, scintigraphy amounted to the lowest OR value after subgroup meta-analyses. An analog to 99mTc-labeled phosphonates is 18F-fluoride. Both tracers have similar uptake mechanisms [46] but 18F-fluoride decays via positron emission and can therefore be imaged by PET. Compared to 99mTc SPECT, 18F-fluoride PET provides higher resolution, higher sensitivity, and better quantification capabilities [47]. PET combined with CT allows localization of abnormal uptake, which might enhance discriminative power [6]. Quon et al. evaluated PET/CT as index test for pseudarthrosis diagnosis [34]. The results seem promising but studies of higher methodological quality should be conducted to draw firmer conclusions on its value in pseudarthrosis diagnosis. In the database search, one paper evaluating MRI [48] and one paper evaluating RSA as index test [49] were identified but not included. In MRI, bridging bone between endplates can be visualized [50] and changes in the vertebral body marrow signal as a sign of functional instability can be detected [48, 51]. On the downside, metal instrumentation complicates pseudarthrosis assessment in MRI. Length of follow-up was too short for the study of Lang et al. to be included. RSA is able to accurately quantify micromovements of vertebrae relative to each other, to evaluate lumbosacral stability [38, 42]. The study of Pape et al. could not be used to calculate the diagnostic accuracy of RSA for pseudarthrosis detection since all patients attained fusion. A strength of the present review was that the patient populations of the included studies resemble patient populations that would undergo these tests in clinical practice to either confirm or exclude pseudarthrosis, since all suffered from persisting or recurrent pain after spinal fusion. The methodological choice to only include studies that compared an index modality to the gold standard of surgical exploration was a strength on one hand since it is the most valid way to assess the diagnostic accuracy of a modality [14]. However, it was a weakness on the other hand, since it meant the exclusion of newer studies that evaluate state-of-the-art modalities. The study design of using surgical exploration as gold standard is no longer ethical or practical in clinical practice. As a result, the value of state-of-the-art modalities could not be discussed in this review and are still left to be evaluated. Another weakness of the study was that studies in meta-analysis, although relatively comparable, did show differences in spinal fusion technique, types of cages and instrumentation, imaging characteristics, pseudarthrosis definition, experience of the observers, and patient characteristics. Especially the time interval between spinal fusion and index test was highly variable between studies. Furthermore, the interpretation of index test results was incomplete in some studies. Imaging findings were reported but not always classified as either pseudarthrosis or fused. In these cases, the cut-off point was determined by the writers of this review, which is arbitrary, although not necessarily far from clinical practice. Studies also reported poorly on patient population inclusion criteria. Lack of information may have led to incorrect inclusion of studies in meta-analyses and weakens the findings of this review. To conclude, with a pooled OR of 17.02, CT can be considered the most accurate non-invasive imaging modality for the detection of pseudarthrosis after spinal fusion from this review.
  6 in total

1.  Pseudarthrosis in adult spine deformity surgery: risk factors and treatment options.

Authors:  Manuel Fernandes Marques; Vincent Fiere; Ibrahim Obeid; Yann-Philippe Charles; Khaled El-Youssef; Abi Lahoud; Joe Faddoul; Emmanuelle Ferrero; Guillaume Riouallon; Clément Silvestre; Jean-Charles Le Huec; David Kieser; Louis Boissiere
Journal:  Eur Spine J       Date:  2021-05-05       Impact factor: 3.134

2.  Comparison between conventional CT and grayscale inversion CT images in the assessment of the post-operative spinal orthopaedic implants.

Authors:  A Patel; S Haleem; R Rajakulasingam; S L James; A M Davies; R Botchu
Journal:  J Clin Orthop Trauma       Date:  2021-08-19

Review 3.  Successful fusion versus pseudarthrosis after spinal instrumentation: a comprehensive imaging review.

Authors:  John C Benson; Vance T Lehman; Arjun S Sebastian; Noelle A Larson; Ahmad Nassr; Felix E Diehn; John T Wald; Naveen S Murthy
Journal:  Neuroradiology       Date:  2022-06-14       Impact factor: 2.995

Review 4.  Glossary of terms for musculoskeletal radiology.

Authors:  William Palmer; Laura Bancroft; Fiona Bonar; Jung-Ah Choi; Anne Cotten; James F Griffith; Philip Robinson; Christian W A Pfirrmann
Journal:  Skeletal Radiol       Date:  2020-06-02       Impact factor: 2.199

5.  Local Autograft Versus Iliac Crest Bone Graft PSF-Augmented TLIF in Low-Grade Isthmic and Degenerative Lumbar Spondylolisthesis.

Authors:  Ali M Abou-Madawi; Sherif H Ali; Ahmed M Abdelmonem
Journal:  Global Spine J       Date:  2020-09-11

6.  Randomized Controlled Trial of Posterior Lumbar Interbody Fusion With Ti- and CaP-Nanocoated Polyetheretherketone Cages: Comparative Study of the 1-Year Radiological and Clinical Outcome.

Authors:  Karel Willems; Philippe Lauweryns; Gino Verleye; Johan VAN Goethem
Journal:  Int J Spine Surg       Date:  2019-12-31
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.