Literature DB >> 31190245

Strength in numbers? The fragility index of studies from the Scandinavian knee ligament registries.

Eleonor Svantesson1, Eric Hamrin Senorski2, Adam Danielsson3, David Sundemo3, Olof Westin3,4, Olufemi R Ayeni5, Kristian Samuelsson3,4.   

Abstract

PURPOSE: The fragility index (FI) is a metric to evaluate the robustness of statistically significant results. It describes the number of patients who would need to change from a non-event to an event to change a result from significant to non-significant. This systematic survey aimed to evaluate the feasibility of applying the FI to findings related to anterior cruciate ligament (ACL) reconstruction in the Scandinavian knee ligament registries.
METHODS: The PubMed, EMBASE, Cochrane Library and AMED databases were searched. Studies from the Scandinavian knee ligament registers were eligible if they reported a statistically significant result (p < 0.05) for any of the following dichotomous outcomes; ACL revision, contralateral ACL reconstruction or the presence of postoperative knee laxity. Only studies with a two-arm comparative analysis were included. Eligibility assessment, data extraction and quality assessment were performed by two independent reviewers. The dichotomous analyses were stratified according to the grouping variable for the two comparative arms as follows; age, patient sex, activity at injury, graft choice, drilling technique, graft fixation, single- versus double-bundle, concomitant cartilage injury and country. The two-sided Fisher's exact test was used to calculate the FI of all statistically significant analyses.
RESULTS: From 158 identified studies, 13 studies were included. They reported statistical significance for a total of 56 dichotomous analyses, of which all but two had been determined by a time-to-event analysis. The median sample size for the arms was 5540 (range 92-38,666). The mean FI for all 56 dichotomous analyses was 80.6 (median 34.5), which means that a mean of 80.6 patients were needed to change outcome status to generate a non-significant result instead of a significant one. Seventeen analyses (30.4%) immediately became non-significant when performing the two-sided Fisher's exact test and, therefore, had an FI of 0. The analyses related to age were the most robust, with a mean FI of 178.5 (median 116, range 1-1089). The mean FI of the other grouping variables ranged from 0.5 to 48.0.
CONCLUSION: There was large variability in the FI in analyses from the Scandinavian knee ligament registries and almost one third of the analyses had an FI of zero. The FI is a rough measurement of robustness when applied to registry studies, however, future studies are needed to determine the most appropriate metric for robustness in registry studies. The use of the FI can provide clinicians with a deeper understanding of significant study results and promotes an evidence-based approach in the clinical care of patients. LEVEL OF EVIDENCE: Systematic review of prospective cohort studies, Level II.

Entities:  

Keywords:  ACL; Anterior cruciate ligament; Contralateral; Fragility; Laxity; Registry; Revision; Statistics

Mesh:

Year:  2019        PMID: 31190245      PMCID: PMC6995986          DOI: 10.1007/s00167-019-05551-x

Source DB:  PubMed          Journal:  Knee Surg Sports Traumatol Arthrosc        ISSN: 0942-2056            Impact factor:   4.342


Introduction

A large number of studies related to anterior cruciate ligament (ACL) reconstruction have been published from the Scandinavian knee ligament registries over the past decade [1, 2]. Many of these studies have aimed to determine predictors and risk factors for an additional ACL reconstruction, i.e. a revision or a contralateral ACL reconstruction, or used the presence of postoperative laxity as a measurement of primary ACL reconstruction failure [2]. A P value of less than 0.05 or a 95% confidence interval (CI) excluding the defined null value have been used as the threshold of significance when drawing conclusions on statistically significant predictors. Meeting these criteria implies that the null hypothesis, stating that there would be no difference in outcome depending on the investigated predictor, has been rejected, meaning that the predictor is likely to have a true effect on the outcome after ACL reconstruction. The concept of a P value was first described by Sir Ronald Fisher and aids in the interpretation of a given result [3]. Although Fisher never did set a threshold for significance, a P value of less than 0.05 shows that a result that is similar to or more extreme than that observed would be found in fewer than 5% of repeated tests, on condition that the null hypothesis was true. It is therefore commonly accepted that a level of significance of 5% is sufficient to conclude that the observed result has not occurred by chance. Nonetheless, the P value says nothing about the robustness of an analysis and the interpretation of P values is many times misunderstood by researchers [4-6]. The fragility index (FI) was developed to evaluate the robustness of significant findings in randomised controlled trials (RCTs). More specifically, the FI describes the minimum number of patients in the group with the fewest events that would need to change from a non-event to an event to change the result from significant to non-significant [7]. Although the FI has not previously been applied to registry studies, it should follow the same principle. For example, there are studies from the Scandinavian knee ligament registries reporting that the use of hamstring tendon (HT) autograft significantly increases the risk of ACL revision compared with the use of patellar tendon (PT) autograft [8-10]. The FI for these studies would describe how many patients in the PT group would need to change from not undergoing an ACL revision to undergoing one to change the analysis to non-significant. The FI is thus a measurement of the number of events (e.g., ACL revisions) on which the statistical significance depends. In other words; the lower the FI, the more fragile the result. Recently, the FI was evaluated for 48 RCTs in sports medicine and arthroscopic surgery [11]. Worryingly, the median FI of the included studies was two [11], meaning that drawing conclusions in current clinical trials of sports medicine is in fact based on the outcome of a very limited number of patients. One of the main methodological strengths of the Scandinavian registries is the prospective data collection from a large population. In fact, the registries together comprise data from over 70,000 primary ACL reconstructions [12]. Large study samples increase the robustness of a statistical analysis, however, the FI of the statistically significant findings presented from the Scandinavian registries has not been evaluated. This is important knowledge since it allows for a more precise interpretation of the results and promotes an evidence-based approach in the clinical care of patients. The purpose of this systematic survey was to evaluate the applicability of the FI to registry studies by determining the FI of all analyses from the Scandinavian registries related to any of the following dichotomised outcomes; ACL revision, contralateral ACL reconstruction and the presence of postoperative residual knee laxity.

Materials and methods

Eligibility criteria

Original studies written in English from the Danish, Norwegian and Swedish knee ligament registries were eligible for inclusion if they reported statistically significant results for any of the following dichotomous outcomes; ACL revision, contralateral ACL reconstruction, or the presence of residual knee laxity after ACL reconstruction. A statistically significant result was defined as a P value of < 0.05 or a 95% CI excluding a null value, under the null hypothesis that there would be no difference between groups. For ratio calculations, such as relative risk calculations, odds ratios or hazard ratio calculations, the definition of significance was a 95% CI excluding one. Only studies comparing a dichotomised outcome between two study groups were included, including studies using a dichotomised time-to-event analysis. Studies were excluded if information needed to calculate the FI was missing, e.g., data on the number/proportion of patients in each group, or the number/proportion of events in each group. For studies where only a proportion (%) was presented, the number of patients or the number of events was calculated for each group. Additionally, studies including data from registries outside Scandinavia were excluded.

Literature search

The literature search was performed by an expert in electronic search methods at the Sahlgrenska University Hospital library on 9 May 2017. An updated literature search was performed on 20 April 2018. The searched databases were the PubMed, EMBASE, the Cochrane Library and AMED electronic databases. Search terms were mapped to relevant MeSH terms or subject headings where possible. Three concepts were used to enter search terms into the databases: Concept 1—‘Register’, ‘registry’, ‘registers’, and ‘registries’. Concept 2—‘Sweden’, ‘Swedish’, ‘Denmark’, ‘Danish’, ‘Norway’, ‘Norwegian’, ‘Scandinavia’, ‘Scandinavian’ and ‘Nordic countries’. Concept 3—‘Anterior cruciate ligament’, ‘Anterior cruciate ligament injuries’, ‘Anterior cruciate ligament reconstruction’, ‘Posterior cruciate ligament’ and ‘Posterior cruciate ligament reconstruction’. The ‘OR’ operator was used to group the keywords in each concept. Subsequently, the results from each concept were combined with the ‘AND’ operator. In addition, an e-mail was sent to the registry holder of each Scandinavian registry with a request for a list of publications from the registry. Two authors independently screened all abstracts and full texts, where needed, to identify eligible studies.

Data extraction

Data were extracted independently by two authors using an electronic piloted form (Microsoft Excel, Microsoft Corp; Version 1812). The following data were extracted for each included study; total sample size, specification of the dichotomous outcome measurement, specification of the grouping variable, number of patients in each group, number of patients experiencing an event (the outcome) in each group, the unadjusted and adjusted (where applicable) statistically significant P value or 95% CI and information on the statistical analysis. All statistically significant results from a study originating from a dichotomous analysis by comparing two groups were extracted. So, if a study performed more than one two-group comparison for a dichotomous outcome, data for each analysis were extracted. If there was any disagreement, it was resolved by consulting a third author.

Outcome

The dichotomised outcomes considered for this review were additional ACL reconstruction (either revision or contralateral ACL reconstruction) or postoperative knee joint laxity (yes/no). The dichotomous evaluation of postoperative knee joint laxity was defined according to the definition used in the original studies, i.e. positive pivot shift test (yes/no) and increased anteroposterior laxity of > 2 mm compared with the healthy knee (yes/no).

Quality assessment

A standardised method for assessing internal validity (bias) in registry studies is lacking. The Downs and Black checklist for randomised and non-randomised studies primarily assesses the reporting quality of studies [13] and was determined to be the best available tool for quality assessment in this study. The checklist originally comprised 27 items scored on a 0–2 scale, yielding a maximum score of 30 points. Items number 14, 15, 23 and 24 are related to randomisation and were, therefore, excluded due to not being applicable to the included studies. Similarly, item 27 (power analysis) and item 21 could not be applied to the included studies. Item 21 was excluded as all the studies aiming to analyse two or more registries would score zero (patients not recruited from the same population), even though the quality of the multi-registry studies could be high. Therefore, a modified checklist yielding a maximum score of 22 points was used. Each study was assessed independently by two authors.

Statistical analysis

The FI was calculated using two-by-two tables, according to the method described by Walsh et al. [7]. The P values for the extracted original data were first recalculated by applying a two-sided Fisher’s exact test. If the result was still significant (p < 0.05), the FI was calculated by adding the number of events to the group with the fewest number of events (or lowest risk of event/outcome), while subtracting the same number from the non-events in the group to keep the group sample size constant. Events were added until the P value of the two-sided Fisher’s exact test was no longer significant (p ≥ 0.05). The smallest number of patients that were required to change from a non-event to an event to obtain a p ≥ 0.05 was defined as the FI. All calculations were performed using Microsoft Excel (Microsoft Corp; Version 1812) and SPSS version 25 (IBM Corp; 2017). The common methodology for reporting the FI for RCTs has been to report only one FI per study, by limiting the FI calculation to only the primary outcome measurement or the first statistically significant result presented in the study abstract [7, 11]. Registry studies often perform multiple analyses for a dichotomous outcome. For example, the dichotomous outcome of ACL revision may be compared between two age groups, two ACL graft choices and two graft fixation devices in the same study. The FI was, therefore, calculated for all the statistically significant results in each study and the results were organised and reported according to the specific predictor studied, i.e., the grouping variable. The grouping variables were classified as either patient- or surgery-related and reported under separate subheadings for readability purposes. Additionally, a subanalysis for the mean and median FI was performed after excluding analyses with an FI of zero. An FI of zero is thought to describe a highly fragile significance, as it means that zero patients need to change from a non-event to an event in order not to obtain significance when applying Fisher’s exact test to the analysis. However, as most studies from the Scandinavian knee ligament registries originally used statistics other than Fisher’s exact test [1, 2], there is a risk that using Fisher’s exact test might underestimate the FI. An FI of zero would be the most extreme underestimation and the subanalysis was therefore performed to compare the overall FI with and without analyses with an FI of zero.

Results

Study selection

The literature search yielded a total of 157 studies assessed for eligibility and one additional study was identified via communication with a registry holder. After a full-text review, 26 studies originating solely from the Scandinavian registries remained, which also reported on additional ACL reconstruction or postoperative residual knee laxity. Of these, two studies were excluded, as they did not perform any dichotomised statistical test [14, 15], one study was excluded due to not reporting any statistically significant result [16] and three studies were excluded due to applying a statistical analysis that did not enable FI calculation [17-19]. The remaining 20 studies reported at least one dichotomous outcome with statistical significance and a statistical test that enabled the calculation of FI. However, seven of these studies were excluded on the basis of not reporting the data needed for the calculation of the FI [20-26]. Finally, 13 studies were included for further analysis. The study selection process is presented in Fig. 1 and Online Appendix 1 presents the reason for excluding the studies that reported on additional ACL reconstruction or residual knee laxity.
Fig. 1

The study selection process. FI fragility index

The study selection process. FI fragility index

Overall study characteristics

The 13 included studies had a total of 56 separate dichotomous analyses, of which 49 analyses determined the outcome of ACL revision, three determined the outcome of a contralateral ACL reconstruction, three determined the outcome of residual knee laxity one year postoperatively and one analysis determined the outcome of either an ACL revision or a contralateral ACL reconstruction. The following variables were identified as determining the groups in the studies; age [8, 27–31], patient sex [30], activity at the time of injury [8, 32], HT versus PT autograft [8-10], femoral drilling technique [28, 33], graft fixation technique [27, 34, 35], single- versus double-bundle ACL reconstruction [27, 36], concomitant cartilage injury [8, 28, 30] and country where the ACL reconstruction was performed [27]. The Downs and Black score ranged from 13 to 18, with a median score of 17 of a maximum possible score of 22 (Table 1).
Table 1

Quality appraisal of included studies according to the Downs and Black checklist

AuthorJournalHypothesis describedMain outcome describedPatient characteristics describedInterventions describedPrincipal confounders statedMain findings describedEstimates of outcome variabilityAdverse events reportedCharacteristics of patients lost to F/UActual probability valuesSubject asked to participate representative
1234567891011
Aga (2017)CORR11112110011
Ahldén (2012)AJSM11110110111
Desai (2017)KSSTA11111110111
Fauno (2014)OJSM11112110101
Gifstad (2014)AJSM11111110111
Persson (2014)AJSM11112110111
Persson (2015)AJSM11112110111
Persson (2018)Acta Orthop01002110011
Rahr-Wagner (2014)AJSM11012110001
Rahr-Wagner (2013)Arthroscopy11012110011
Snæbjörnsson (2017)KSSTA11111110011
Soreide (2016)AJSM11112010111
Svantesson (2016)KSSTA11111110011

Acta Orthop acta orthopaedica, AJSM American journal of sports medicine, CORR clinical orthopaedics and related research, KSSTA knee surgery, sport traumatology, arthroscopy, OJSM orthopaedic journal of sports medicine

Quality appraisal of included studies according to the Downs and Black checklist Acta Orthop acta orthopaedica, AJSM American journal of sports medicine, CORR clinical orthopaedics and related research, KSSTA knee surgery, sport traumatology, arthroscopy, OJSM orthopaedic journal of sports medicine The median sample size for the arms was 5540 (range 92–38,666). The median in sample size difference between the two arms was 5464.5 patients (range 26–31,930). The median number of events for the outcome of additional ACL reconstruction (ACL revision or contralateral ACL reconstruction) was 159 (range 9–1171) and for residual laxity 264 (range 195–729). The mean FI for all 56 dichotomous analyses was 80.6, while the median was 34.5. Seventeen analyses (30.4%) became non-significant when performing the two-sided Fisher’s exact test to their contingency table and had therefore an FI of 0. All the analyses are summarised in Tables 2 and 3.
Table 2

The fragility index of patient-related factors for dichotomous events

Grouping variableAuthor (year)Dichotomous eventArm 1Arm 2Sample size arm 1Sample size arm 2Events arm 1Events arm 2p value or 95% CI (unadjusted/adjusted*)Statistical testFragility indexMean fragility indexMedian fragility index
AgeAga et al. (2017)RevisionAge 14–19Age 20–2414,73312,645985568< 0.001*Cox regression204178.5116.0
RevisionAge 14–19Age 25–2914,7339280985252< 0.001*Cox regression309
RevisionAge 14–19Age 30–6014,73324,120985403< 0.001*Cox regression1089
Desai et al. (2017)RevisionAge 13–15Age 36–49130033507436< 0.001Kaplan–Meier108
RevisionAge 16–20Age 36–495075335025236< 0.001Kaplan–Meier100
RevisionAge 21–25Age 36–493667335011736< 0.001Kaplan–Meier45
RevisionAge 26–30Age 36–492513335043360.040Kaplan–Meier1
RevisionAge 13–25Age 26–4910,0427640443109< 0.001Kaplan–Meier183
Fauno et al. (2014)RevisionAge 13–15Age > 2032711,496222333.48 (2.24–5.38)/3.23 (2.05–5.08)*Cox regression263
RevisionAge 15–20Age > 20288811,4961402332.57 (2.09–3.18)/2.50 (2.01–3.11)*Cox regression230
Gifstad et al. (2014)RevisionAge 15–19Age 20–2410,94785184802860.77 (0.67–0.90)/0.78 (0.67–0.90)*Cox regression39
RevisionAge 15–19Age 25–2910,94767024801450.47 (0.39–0.57)/0.47 (0.39–0.57)*Cox regression108
RevisionAge 15–19Age 30–3410,9475471480830.32 (0.25–0.40)/0.31 (0.25–0.40)*Cox regression121
RevisionAge 15–19Age 35–3910,9475093480700.29 (0.23–0.37)/0.28 (0.22–0.37)*Cox regression119
RevisionAge 15–19Age 40–4410,9474073480480.25 (0.19–0.34)/0.25 (0.19–0.34)*Cox regression101
RevisionAge 15–19Age ≥ 4510,9473400480300.20 (0.14–0.28)/0.19 (0.13–0.28)*Cox regression93
Snaebjornsson et al. (2017)Contralateral ACLRAge 13–15Age 36–491300335088450.002Kaplan–Meier132
Soreide et al. (2016)RevisionAge 15–19Age > 2916803166106a41a0.001*Cox regression116
RevisionAge 20–29Age > 292647316682a41a0.001*Cox regression31
Patient sexSnaebjornsson et al. (2017)Contralateral ACLRFemaleMale766910,0132662600.001Kaplan–Meier35
Activity at injuryAhlden et al. (2012)Revision/contralateral ACLRFemale football players 15–18yMale football players 15–18y1189226a9a0.02X2 test216.05.5
Gifstad et al. (2014)RevisionFootballAlpine activities18,81060835271100.65 (0.53–0.79)/0.65 (0.53–0.79)*Cox regression32
RevisionFootballOther sports18,81012,1035272850.83 (0.72–0.96)/0.85 (0.73–0.98)*Cox regression9
RevisionFootballOther/unknown18,8101033527190.57 (0.36–0.90)/0.54 (0.34–0.86)*Cox regression0
RevisionHandballFootball526018,8101915271.28 (1.08–1.51)/1.23 (1.04–1.45)*Cox regression53
RevisionTraffic/workFootball211318,810665271.07 (0.83–1.38)/1.44 (1.12–1.87)*Cox regression0

Arm 1 has the highest risk of an event across all analyses

aNumber has been calculated from a proportion (%)

*Adjusted P value

ACLR anterior cruciate ligament reconstruction, CI confidence interval, y years

Table 3

The fragility index of surgery-related factors for dichotomous events

Grouping variableAuthor (year)Dichotomous eventArm 1Arm 2Sample size arm 1Sample size arm 2Events arm 1Events arm 2p value or 95% CI (unadjusted/adjusted*)Statistical testFragility indexMean fragility indexMedian fragility index
HT vs PTGifstad et al. (2014)RevisionHT graftPT graft38,66667361042156< 0.001Log-rank test015.010.0
Persson et al. (2014)RevisionHT graftPT graft9215342836269< 0.001*Cox regression40
Rahr Wagner et al. (2014)RevisionHT graftPT graft11,6761971312471.50 (1.11–2.04)/1.41 (1.03–1.92)*Cox regression0
1y positive pivot shiftPT graftHT graft10254554195a729a0.81 (0.68–0.96)*Logistic regression20
Femoral drilling techniqueDesai et al. (2017)RevisionTP referenceTT non-anatomic66851296162400.049/0.041*Cox regression048.017.0
RevisionTP anatomicTP reference403666851461620.028/0.018*Cox regression34
RevisionTP drillingTT drilling12,4405110380167< 0.001/< 0.001*Cox regression0
Rahr-Wagner et al. (2013)RevisionAM drillingTT drilling19456430391022.01 (1.39–2.92)/2.04 (1.39–2.99)*Cox regression0
1y positive pivot shiftAM drillingTT drilling10562949206a401a2.86 (2.40–3.41)*Cox regression95
1y sagittal laxity > 2 mmAM drillingTT drilling10512807208a320a3.70 (2.09–4.43)*Cox regression159
Graft fixationAga et al. (2017)RevisionFemoral metal screwFemoral others (not bioscrew or button)978825,089357902<0.001*Cox regression037.41.0
RevisionFemoral metal screwFemoral button978824,8723579060.01*Cox regression0
RevisionTibial bioscrewTibial metal screw10,85917,5643996090.02*Cox regression0
RevisionTibial others (not bioscrew or button)Tibial metal screw30,87317,56411716090.04*Cox regression0
Persson et al. (2015)RevisionHT endobutton/RCI screwPT233938067224< 0.001*Cox regression62
RevisionHT EzLoc/WasherLocPT135238062924< 0.001*Cox regression27
RevisionHT Endobutton/Biosure HAPT120938064924< 0.001*Cox regression87
RevisionHT Endobutton/IntrafixPT68738062324< 0.001*Cox regression55
RevisionHT TransFix II/metal interference screwPT62038069240.047*Cox regression2
Persson et al. (2018)RevisionFemoral EndobuttonFemoral Rigidfix14,10612,0413423160,7 (0.6–0.8)/0,7 (0.6–0.8)*Cox regression0
RevisionFemoral EndobuttonFemoral Transfix14,10636523421000.7 (0.5–0.8)/0.7 (0.6–0.9)*Cox regression0
RevisionTibial retro interference screwTibial interference screw50818,640274621.8 (1.2–2.6)/1.9 (1.3–2.9)*Cox regression216
SB vs DBAga et al. (2017)RevisionDB HT graftSB PT graft9947790372190.01*Cox regression00.50
Svantesson et al. (2017)RevisionSBDB21,846614689120.01/0.019*Cox regression0
RevisionSB TP referenceDB5609614146120.015/0.037*Cox regression0
RevisionSB TP anatomicDB3449614133120001/0002*Cox regression2
Concomitant cartilage injuryDesai et al. (2017)RevisionNo cartilage injuryCartilage injury13,08445984351170.002Kaplan–Meier919.79.0
Gifstad et al. (2014)RevisionNo cartilage injuryCartilage injury35,61897841007191< 0.001Log-rank test50
Snaebjornsson et al. (2017)Contralateral ACLRNo cartilage injuryCartilage injury13,08445984081180.01Kaplan–Meier0
CountryAga et al. (2017)RevisionNorwaySweden14,64826,299613868< 0.001*Cox regression130

Arm 1 has the highest risk of an event across all analyses

aNumber has been calculated from a proportion (%)

*Adjusted P value

ACLR anterior cruciate ligament reconstruction, AM anteromedial, CI confidence interval, DB double-bundle, HT hamstring tendon autograft, PT patellar tendon autograft, SB single-bundle, TP transportal; TT transtibial, y years

The fragility index of patient-related factors for dichotomous events Arm 1 has the highest risk of an event across all analyses aNumber has been calculated from a proportion (%) *Adjusted P value ACLR anterior cruciate ligament reconstruction, CI confidence interval, y years The fragility index of surgery-related factors for dichotomous events Arm 1 has the highest risk of an event across all analyses aNumber has been calculated from a proportion (%) *Adjusted P value ACLR anterior cruciate ligament reconstruction, AM anteromedial, CI confidence interval, DB double-bundle, HT hamstring tendon autograft, PT patellar tendon autograft, SB single-bundle, TP transportal; TT transtibial, y years

Patient-related factors

The following variables were identified as patient-related; age [8, 27–31], patient sex [30] and activity at time of injury [8, 32]. A total of 19 two-arm analyses for age (18 related to the outcome of ACL revision and one to contralateral ACL reconstruction) were identified. The FI for age ranged from 1 to 1089, with a mean FI of 178.5 and a median of 116.0. The analysis on patient sex as a factor for contralateral ACL reconstruction had an FI of 35.0. There were six analyses on activity at time of injury (five related to the outcome of ACL revision and one to either ACL revision or a contralateral ACL reconstruction). The FI for these analyses ranged from 0 to 53, with a mean FI of 16.0 and a median of 5.5. All analyses with a patient-related factor as the grouping variable are summarised in Table 2.

Surgery-related factors

The following variables were identified as surgery-related; HT versus PT autograft [8-10], femoral drilling technique [28, 33], graft fixation [27, 34, 35], single- versus double-bundle ACL reconstruction [27, 36], concomitant cartilage injury [8, 28, 30] and country where the ACL reconstruction was performed [27]. With regard to HT versus PT autograft, three analyses were related to the outcome of ACL revision and one analysis to the outcome of a positive pivot shift one year postoperatively. The FI ranged from 0 to 40, with a mean FI of 15.0 and median of 10.0. The drilling technique comparisons were made between transtibial drilling and the anteromedial or transportal drilling technique (four analyses related to the outcome of ACL revision, one to the outcome of a positive pivot shift test at one year postoperatively and one to the outcome of > 2 mm sagittal laxity at one year postoperatively). The FI ranged from 0 to 159, with a mean FI of 48.0 and a median of 17.0. Graft fixation was investigated in 12 two-arm analyses. The FI ranged from 0 to 216, with a mean FI of 37.4 and a median of 1.0. Single- versus double-bundle was investigated in four analyses, with an FI ranging from 0 to 2 (mean FI 0.5 and median FI 0). With regard to concomitant cartilage injury, there were two analyses related to the outcome of ACL revision and one to contralateral ACL reconstruction. The FI ranged from 0 to 50, with a mean FI of 19.7 and a median of 9.0. With regard to country where the ACL reconstruction was performed, one analysis related to the outcome of ACL revision. The FI of the significant difference between the countries was 130. All analyses with a surgery-related factor as the grouping variable are summarised in Table 3.

Subanalysis

When excluding the 17 analyses with an FI of 0, a total of 39 analyses remained. The FI of those analyses ranged from 1 to 1089, with a mean FI of 115.7 and a median FI of 87.0 (data not shown).

Discussion

This most important finding of this study was that the FI varied substantially across dichotomous analyses from the Scandinavian knee ligament registries. Although almost one third of the analyses had an FI of zero, the analyses related to age generally had the most robust FI, with a mean FI of 178.5 (range 1–1089). In fact, the majority of the analyses had a higher FI than what previously has been reported from RCTs related to orthopaedic surgery [11, 37]. However, the variable FI underlines that there are difficulties in the interpretation of robustness in analyses from these registry studies. The FI has previously been applied exclusively to RCTs. A median FI of 2 (IQR 1–3) was reported when assessed in 40 RCTs related to orthopaedic spine surgery [37]. Similarly, a median FI of 2 (IQR 1–2.8) was found in 48 RCTs related to arthroscopy and sports medicine surgery [11]. It was concluded that the statistical significance in current orthopaedic RCTs is fragile and that relatively small sample sizes and few outcome events are contributory factors [37]. The large study samples provided by registry studies could theoretically increase the robustness of significant findings. On the other hand, it is not known whether the use of the FI is feasible for registry studies, as there are some fundamental discrepancies in the study design compared with RCTs. A well-designed RCT is thought to exclude confounding factors by assuming an equal distribution of both measured and unmeasured factors due to randomisation and blinding. Registry studies are instead susceptible to confounders and bias, which is commonly dealt with by statistical adjustments. The FI is calculated independently of whether or not the tested P value originates from an adjusted analysis, which is important, as most analyses included in this study were adjusted. Moreover, despite the fact that the FI has previously been applied to time-to-event outcomes in RCTs [7, 11], time-to-event outcomes are more common in registry studies. Time-to-event is not considered when calculating the FI, which means that the FI becomes a rougher measurement for these analyses. It is obvious that the FI had greater variability in registry studies compared with previous studies of RCTs [11, 37]. The variability of the FI questions the feasibility of using this metric on registry studies, especially since a strong contributor to the variable FI probably is the heterogeneous data analyses. For example, there is no consensus on how to stratify age groups in the Scandinavian knee ligament registry studies. This aggravates a comparison of the FI across studies, since the FI in addition to describe the actual robustness also will be affected by group size and age difference between groups. Moreover, almost one third of the analyses had an FI of zero, which is difficult to interpret for analyses which originally used statistics other than Fisher’s exact test and found significance. Does an FI of zero indicate fragility or is it the result of applying a statistical test that was not deemed to be the most appropriate test in the original study? Interestingly, Walsh et al. [7] applied the FI to 399 trials published in high-impact journals and found that 70% of the trials with an FI of zero originally were analysed using time-to-event analysis [7]. This indicates that time-to-event analyses are particularly susceptible to an FI of zero, which might explain the large proportion of analyses with an FI of zero in this study. Not surprisingly, the mean and median FI were considerably higher when the analyses with an FI of zero were excluded in the subanalysis. Further research is needed to determine the most appropriate methodology for FI calculation in studies using time-to-event analysis. Until then, it could be argued that time-to-event analyses with an FI of zero should be excluded to not severely skew the overall FI. To draw conclusions regarding the feasibility of using the FI on registry studies, the FI perhaps needs to be assessed in a larger number of registry studies, which could strengthen the data and narrow the range of the FI. That could potentially also enable a determination of the most robust predictors for ACL failure. In this study, most predictors included analyses from only one or two studies. The inclusion of few analyses per predictor makes the FI analysis sensitive to outliers and makes the process of determining the most robust predictor vulnerable. One should however bear in mind that only significant analyses are considered for the FI calculation and few included analyses for a predictor could mean that the reported significance is an exception among several non-significant findings. Patient sex could be used to exemplify this, where only one analysis was included with an FI of 35. Although the FI for patient sex is difficult to interpret based on a single analysis, it should be remembered that seven other studies using ACL revision as the outcome and three other studies using contralateral ACL reconstruction as the outcome found non-significant results when comparing patient sex in the Scandinavian registries [2]. The interpretation of the FI must therefore also be set in the context of the cumulative evidence. Not all predictors were, however, limited by few analyses, and the feasibility of using the FI is strengthened when determining the FI for the two predictors with most analyses (age with 19 analyses and graft fixation with 12 analyses), as the result reflects previous literature on the subject. The literature is unanimous when it comes to young age as a risk factor for an additional ACL reconstruction [38-42]. In agreement with this, the FI for age was by far the highest, which indicates that the FI calculation is able to provide a reliable estimate of robustness. This is further emphasised by the higher FI in analyses with an increasing age difference between the compared groups. With regard to graft fixation, six of 12 analyses had an FI of 0 and the mean FI for all the analyses was 37.4. For this reason, graft fixation does not predict an additional ACL reconstruction with the same certainty as age, which is supported by the contradictory literature regarding the impact of graft fixation [43-45]. In the light of the limitations associated with using the FI on registry studies, it is our opinion that the FI could be regarded as a rough measurement of robustness for registry studies and that it could be used to compare confidence in the results across analyses with a similar statistical methodology. There is reason to believe that the FI could be a valuable method for registry studies, especially for those using statistical methods that are perfectly compatible with the FI, such as Chi-square or Fisher’s exact test. The large amount of data comprised in registries should intuitively increase the robustness, but it is important to find a metric to quantify this objectively in registry studies. Although there might be outcomes in registry studies that are more or less appropriate for the use of FI, researchers should be encouraged to calculate and report the FI whenever possible. The strength in numbers of registry studies does not compensate for other limitations, such as confounders, bias and an inability to provide causality. Care must be taken not to overestimate the effect of a higher FI in registry studies compared with RCTs, as RCTs still remain the gold standard to determine the efficacy of an intervention. This study is also limited by the fact that analyses from seven studies needed to be excluded, as data on study arm size and the number of events were not reported. Future studies should preferably report these numbers. It should also be mentioned that the outcome of additional ACL reconstruction may underestimate the true rate of failed ACL reconstructions. This becomes especially relevant to consider in studies with small FIs, since the robustness of significance in these circumstances is likely to be even more vulnerable to “hidden” failures not proceeding to an ACL revision.

Conclusion

There was large variability in the FI in analyses from the Scandinavian knee ligament registries and almost one third of the analyses had an FI of zero. The FI is a rough measurement of robustness when applied to registry studies, however, future studies are needed to determine the most appropriate metric for robustness in registry studies. The use of the FI can provide clinicians with a deeper understanding of significant study results and promotes an evidence-based approach in the clinical care of patients. Below is the link to the electronic supplementary material. Supplementary material 1 (DOCX 15 kb)
  44 in total

1.  The feasibility of creating a checklist for the assessment of the methodological quality both of randomised and non-randomised studies of health care interventions.

Authors:  S H Downs; N Black
Journal:  J Epidemiol Community Health       Date:  1998-06       Impact factor: 3.710

2.  The Fragility of Statistically Significant Findings From Randomized Trials in Sports Surgery: A Systematic Survey.

Authors:  Moin Khan; Nathan Evaniew; Mark Gichuru; Anthony Habib; Olufemi R Ayeni; Asheesh Bedi; Michael Walsh; P J Devereaux; Mohit Bhandari
Journal:  Am J Sports Med       Date:  2016-12-14       Impact factor: 6.202

3.  Registry data highlight increased revision rates for endobutton/biosure HA in ACL reconstruction with hamstring tendon autograft: a nationwide cohort study from the Norwegian Knee Ligament Registry, 2004-2013.

Authors:  Andreas Persson; Asle B Kjellsen; Knut Fjeldsgaard; Lars Engebretsen; Birgitte Espehaug; Jonas M Fevang
Journal:  Am J Sports Med       Date:  2015-05-14       Impact factor: 6.202

4.  Increased risk of revision after anteromedial compared with transtibial drilling of the femoral tunnel during primary anterior cruciate ligament reconstruction: results from the Danish Knee Ligament Reconstruction Register.

Authors:  Lene Rahr-Wagner; Theis Muncholm Thillemann; Alma Becic Pedersen; Martin Carøe Lind
Journal:  Arthroscopy       Date:  2013-01       Impact factor: 4.772

5.  The Effect of Limited Perioperative Nonsteroidal Anti-inflammatory Drugs on Patients Undergoing Anterior Cruciate Ligament Reconstruction.

Authors:  Endre Soreide; Lars-Petter Granan; Geir A Hjorthaug; Birgitte Espehaug; Sigbjørn Dimmen; Lars Nordsletten
Journal:  Am J Sports Med       Date:  2016-08-05       Impact factor: 6.202

6.  Associations between inadequate knee function detected by KOOS and prospective graft failure in an anterior cruciate ligament-reconstructed knee.

Authors:  Lars-Petter Granan; Valborg Baste; Lars Engebretsen; Maria C S Inacio
Journal:  Knee Surg Sports Traumatol Arthrosc       Date:  2014-03-12       Impact factor: 4.342

7.  Results from the Swedish national anterior cruciate ligament register.

Authors:  Joanna Kvist; Jüri Kartus; Jon Karlsson; Magnus Forssblad
Journal:  Arthroscopy       Date:  2014-04-18       Impact factor: 4.772

8.  Patient demographic and surgical characteristics in anterior cruciate ligament reconstruction: a description of registries from six countries.

Authors:  Heather A Prentice; Martin Lind; Caroline Mouton; Andreas Persson; Henrik Magnusson; Ayman Gabr; Romain Seil; Lars Engebretsen; Kristian Samuelsson; Jon Karlsson; Magnus Forssblad; Fares S Haddad; Tim Spalding; Tadashi T Funahashi; Liz W Paxton; Gregory B Maletis
Journal:  Br J Sports Med       Date:  2018-03-24       Impact factor: 13.800

9.  Predictors of contralateral anterior cruciate ligament reconstruction: a cohort study of 9061 patients with 5-year follow-up.

Authors:  Daniel Andernord; Neel Desai; Haukur Björnsson; Sofia Gillén; Jón Karlsson; Kristian Samuelsson
Journal:  Am J Sports Med       Date:  2014-11-10       Impact factor: 6.202

10.  Double-bundle anterior cruciate ligament reconstruction is superior to single-bundle reconstruction in terms of revision frequency: a study of 22,460 patients from the Swedish National Knee Ligament Register.

Authors:  Eleonor Svantesson; David Sundemo; Eric Hamrin Senorski; Eduard Alentorn-Geli; Volker Musahl; Freddie H Fu; Neel Desai; Anders Stålman; Kristian Samuelsson
Journal:  Knee Surg Sports Traumatol Arthrosc       Date:  2016-11-23       Impact factor: 4.342

View more
  7 in total

Review 1.  Statistical Fragility of Single-Row Versus Double-Row Anchoring for Rotator Cuff Repair: A Systematic Review of Comparative Studies.

Authors:  Nathan P Fackler; Cooper B Ehlers; Kylie T Callan; Arya Amirhekmat; Eric J Smith; Robert L Parisien; Dean Wang
Journal:  Orthop J Sports Med       Date:  2022-05-10

Review 2.  Fragility Index in Cardiovascular Randomized Controlled Trials.

Authors:  Muhammad Shahzeb Khan; Rohan Kumar Ochani; Asim Shaikh; Muhammad Shariq Usman; Naser Yamani; Safi U Khan; M Hassan Murad; John Mandrola; Rami Doukky; Richard A Krasuski
Journal:  Circ Cardiovasc Qual Outcomes       Date:  2019-12-11

Review 3.  The Statistical Fragility of Single-Bundle vs Double-Bundle Autografts for ACL Reconstruction: A Systematic Review of Comparative Studies.

Authors:  Cooper B Ehlers; Andrew J Curley; Nathan P Fackler; Arjun Minhas; Ariel N Rodriguez; Kory Pasko; Edward S Chang
Journal:  Orthop J Sports Med       Date:  2021-12-20

4.  The Fragility of Significance in the Hip Arthroscopy Literature: A Systematic Review.

Authors:  Robert L Parisien; David P Trofa; Michaela O'Connor; Brock Knapp; Emily J Curry; Paul Tornetta; T Sean Lynch; Xinning Li
Journal:  JB JS Open Access       Date:  2021-10-21

Review 5.  Scoping Review on ACL Surgery and Registry Data.

Authors:  Janina Kaarre; Bálint Zsidai; Eric Narup; Alexandra Horvath; Eleonor Svantesson; Eric Hamrin Senorski; Alberto Grassi; Volker Musahl; Kristian Samuelsson
Journal:  Curr Rev Musculoskelet Med       Date:  2022-07-13

6.  How Robust are the Evidences that Formulate Surviving Sepsis Guidelines? An Analysis of Fragility and Reverse Fragility of Randomized Controlled Trials that were Referred in these Guidelines.

Authors:  Nang S Choupoo; Saurabh K Das; Priyam Saikia; Samarjit Dey; Sumit Ray
Journal:  Indian J Crit Care Med       Date:  2021-07

7.  How fragile are Mediterranean diet interventions? A research-on-research study of randomised controlled trials.

Authors:  Meletios P Nigdelis; Xenophon Theodoridis; Maria G Grammatikopoulou; Konstantinos Gkiouras; Antigoni Tranidou; Theodora Papamitsou; Dimitrios P Bogdanos; Dimitrios G Goulis
Journal:  BMJ Nutr Prev Health       Date:  2021-03-09
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.