| Literature DB >> 32365490 |
María Teresa Martínez-Romero1, Francisco Ayala2, Mark De Ste Croix3, Francisco J Vera-Garcia2, Pilar Sainz de Baranda1, Fernando Santonja-Medina4,5, Julio Sánchez-Meca6.
Abstract
This meta-analysis aimed to estimate the inter- and intra-tester reliability of endurance measures obtained through trunk extension field-based tests and to explore the influence of the moderators on the reliability estimates. The reliability induction rate of trunk extension endurance measures was also calculated. A systematic search was conducted using various databases, and subsequently 28 studies were selected that reported intraclass correlation coefficients for trunk extension endurance measures. Separate meta-analyses were conducted using a random-effects model. When possible, analyses of potential moderator variables were carried out. The inter-tester average reliability of the endurance measure obtained from the Biering-Sorensen test was intraclass correlation coefficient (ICC) = 0.94. The intra-session reliability estimates of the endurance measures recorded using the Biering-Sorensen test, the prone isometric chest raise test, and the prone double straight-leg test were ICC = 0.88, 0.90, and 0.86, respectively. The inter-session average reliability of the endurance measures from the Biering-Sorensen test, the prone isometric chest raise test, and the dynamic extensor endurance test were ICC = 0.88, 0.95, and 0.99, respectively. However, due to the limited evidence available, the reliability estimates of the measures obtained through the prone isometric chest raise, prone double straight-leg, and dynamic extensor endurance tests should be considered with a degree of caution. Position control instruments, tools, and familiarization session demonstrated a statistical association with the inter-session reliability of the Biering-Sorensen test. The reliability induction rate was 72.8%. Only the trunk extension endurance measure obtained through the Biering-Sorensen test presented sufficient scientific evidence in terms of reliability to justify its use for research and practical purposes.Entities:
Keywords: core endurance; physical education; pre-participation assessment; reliability generalization meta-analysis; sports performance
Year: 2020 PMID: 32365490 PMCID: PMC7246735 DOI: 10.3390/ijerph17093088
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) flow diagram of the literature search.
Summary of findings (SoF).
| Reliability | Pooled Result a | Overall Rating b | Quality of Evidence c |
|---|---|---|---|
|
| |||
| Inter-tester reliability | ICC = 094 (0.84–0.98) | Sufficient | Moderate (as there are multiple doubtful [ |
| Intra-tester (intra-session) reliability | ICC = 0.88 (0.83–0.92) | Sufficient | Low (as all studies are doubtful [ |
| Intra-tester (inter-session) reliability | ICC = 0.88 (0.80–0.92) | Sufficient | Moderate (as there are multiple doubtful and five very good studies [ |
|
| |||
| Inter-tester reliability | ICC = 0.90 (0.80–0.95) | Indeterminate | Did not pool the results or grade the evidence due to there being one study available [ |
| Intra-tester (intra-session) reliability | ICC = 0.90 (0.83–0.94) | Indeterminate | Did not pool the results or grade the evidence due to there being one study available [ |
| Intra-tester (inter-session) reliability | ICC = 0.95 (0.91–0.97) | Sufficient | Low (as all studies are doubtful [ |
|
| |||
| Inter-tester reliability | ICC = 0.83 (0.67–0.93) | Indeterminate | Did not pool the results or grade the evidence due to there being one study available [ |
| Intra-tester (intra-session) reliability | ICC = 0.86 (0.77–0.92) | Indeterminate | Did not pool the results or grade the evidence due to there being one study available [ |
|
| |||
| Intra-tester (inter-session) reliability | ICC = 0.99 (0.88–1.00) | Sufficient | Low (as there is one inadequate study [ |
a Pooled results obtained from mean reliability analysis and adjusted according to the publication bias analysis. b Overall rating was graded as sufficient (intraclass correlation coefficient (ICC) > 0.70), insufficient (ICC < 0.70), or indeterminate (either ICC reported by just one study or by none). c Quality of evidence (high, moderate, low, very low) based on the Grading of Recommendations Assessment, Development and Evaluation (GRADE) approach that uses three factors: (1) risk of bias assessed with the consensus-based standards for the selection of health measurement instruments (COSMIN) risk of bias check-list; (2) inconsistency, solved by pooling the results; and (3) imprecision, the total sample included in the studies. When the total sample size of the pooled studies is below 100, downgrade with one level and with two levels when the total sample size is below 50. ICC: intraclass correlation coefficient.
Average reliability, 95% confidence intervals, and heterogeneity statistics for each field-based test and reliability type.
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
| Inter-tester reliability | 8 | 0.94 | 0.84 | 0.98 | 102.37 | 7 | <0.001 | 93.1% |
| Intra-tester and intra-session reliability | 12 | 0.88 | 0.83 | 0.92 | 22.69 | 11 | 0.02 | 51.2% |
| Intra-tester and inter-session reliability | 27 | 0.88 | 0.80 | 0.92 | 234.40 | 26 | <0.001 | 88.9% |
|
| ||||||||
| Inter-tester reliability | 1 | 0.90 | 0.80 | 0.95 | -- | -- | -- | -- |
| Intra-tester and intra-session reliability | 2 | 0.90 | 0.83 | 0.94 | 0.034 | 1 | 0.853 | 0% |
| Intra-tester and inter-session reliability | 5 | 0.95 | 0.91 | 0.97 | 13.57 | 4 | 0.009 | 70.5% |
|
| ||||||||
| Inter-tester reliability | 1 | 0.83 | 0.67 | 0.92 | -- | -- | -- | -- |
| Intra-tester and intra-session reliability | 2 | 0.86 | 0.77 | 0.92 | 0.08 | 9 | 0.777 | 0% |
|
| ||||||||
| Intra-tester and inter-session reliability | 5 | 0.99 | 0.88 | 1.00 | 15.73 | 4 | 0.003 | 74.5% |
k: number of cohorts, ICC: mean intraclass correlation coefficient, ICC and ICC: 95% lower and upper CI for ICC: heterogeneity statistic, DF: degrees of freedom for Q statistic, p: probability level associated to Q statistic, I2: heterogeneity index.
Figure 2Forest plot of the intra-tester and inter-session reliability obtained in the studies that applied the Biering-Sorensen test.
Results of the mixed-effects meta-regressions for the continuous moderator variables on the intraclass correlation coefficient (ICC) estimates obtained from intra-tester and inter-session reliability of the Biering-Sorensen test.
| Moderator Variable |
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| Publication year | 27 | 0.003 | 0.03 | 0.855 | 232.21 * | 0 |
| Final sample size | 27 | −0.003 | 0.11 | 0.741 | 230.26 * | 0 |
| Sex (% female) | 24 | 0.002 | 0.37 | 0.542 | 200.78 * | 0 |
| Mean age (years) | 26 | −0.017 | 1.62 | 0.203 | 210.81 * | 0.01 |
| SD age | 19 | −0.009 | 0.31 | 0.578 | 177.94 * | 0 |
| % attrition | 27 | −0.008 | 1.39 | 0.238 | 227.47 * | 0 |
| Number of measurements | 27 | −0.034 | 0.01 | 0.933 | 232.93 * | 0 |
| Time interval between measurement | 26 | −0.006 | 0.18 | 0.668 | 229.87 * | 0 |
| Mean test score from total sample | 26 | 0.002 | 0.98 | 0.321 | 210.60 * | 0 |
| SD test score from total sample | 21 | 0.009 | 0.94 | 0.331 | 201.42 * | 0.02 |
| Mean test score from reliability sample | 24 | 0.001 | 0.42 | 0.514 | 207.27 * | 0 |
| SD test score from reliability sample | 20 | 0.012 | 1.54 | 0.214 | 180.06 * | 0.10 |
k: number of studies, b: regression coefficient for the predictor variable, Q: statistic for testing the statistical significance of the predictor variable, p: probability level for the Q statistic, Q: statistic for testing the model misspecification, R2: proportion of variance explained by the predictor variable, * p < 0.001.
Results of the mixed-effects ANOVAs for the qualitative moderator variables on the intraclass correlation coefficient (ICC) estimates obtained from intra-tester and inter-session reliability of the Biering-Sorensen test.
| Moderator Variables |
|
| 95% CI | ANOVA Results | |
|---|---|---|---|---|---|
|
|
| ||||
| Reliability analysis was done with the same sample: | |||||
| Yes | 24 | 0.88 | 0.81 | 0.92 | |
| No | 3 | 0.77 | 0.26 | 0.94 | |
| Sex: | 9 | ||||
| Males | 2 | 0.88 | 0.77 | 0.94 | |
| Females | 15 | 0.97 | 0.89 | 0.99 | |
| Males and females | 0.83 | 0.72 | 0.89 | ||
| Sample type: | |||||
| Children and adolescents | 4 | 0.95 | 0.86 | 0.98 | |
| Adults | 23 | 0.85 | 0.77 | 0.90 | |
| Target population: | |||||
| Asymptomatic | 18 | 0.88 | 0.80 | 0.93 | |
| Clinical | 9 | 0.85 | 0.69 | 0.93 | |
| Physical activity level: | |||||
| Sedentary | 12 | 0.84 | 0.71 | 0.92 | |
| Recreationally active | 15 | 0.89 | 0.81 | 0.94 | |
| Validated modification: | |||||
| Yes | 17 | 0.86 | 0.76 | 0.92 | |
| No | 10 | 0.89 | 0.78 | 0.95 | |
| Tool: | |||||
| Test bench | 22 | 0.85 | 0.77 | 0.90 | |
| Roman chair | 5 | 0.94 | 0.86 | 0.98 | |
| Hands position: | |||||
| Crossed on the chest | 22 | 0.88 | 0.80 | 0.93 | |
| Along the body | 3 | 0.87 | 0.55 | 0.97 | |
| At the level of the ears | 2 | 0.80 | 0.23 | 0.96 | |
| Part of the body on the edge: | |||||
| Not reported | 12 | 0.90 | 0.80 | 0.95 | |
| ASIS | 6 | 0.88 | 0.68 | 0.95 | |
| Upper border of the iliac crest | 6 | 0.85 | 0.62 | 0.95 | |
| Pubis | 3 | 0.77 | 0.28 | 0.94 | |
| Test duration: | |||||
| Until exhaustion | 25 | 0.88 | 0.82 | 0.93 | |
| Until 240 s | 2 | 0.72 | 0.08 | 0.94 | |
| Position control instruments: | |||||
| Visual | 14 | 0.81 | 0.73 | 0.87 | |
| Inclinometer | 5 | 0.83 | 0.67 | 0.91 | |
| Stadiometer | 4 | 0.89 | 0.78 | 0.95 | |
| Light sensor | 2 | 0.87 | 0.59 | 0.96 | |
| Plumb-line | 2 | 0.99 | 0.98 | 0.99 | |
| Familiarization session: | |||||
| Yes | 6 | 0.96 | 0.92 | 0.98 | |
| No | 20 | 0.82 | 0.73 | 0.88 | |
| Test conditions: | |||||
| Similar conditions | 18 | 0.88 | 0.80 | 0.93 | |
| Unclear conditions | 9 | 0.85 | 0.68 | 0.93 | |
| The profession of tester: | |||||
| Sports sciences | 6 | 0.92 | 0.80 | 0.97 | |
| Physical therapy | 17 | 0.86 | 0.76 | 0.92 | |
| Medicine | 3 | 0.77 | 0.30 | 0.94 | |
| Continent: | |||||
| Europe | 16 | 0.88 | 0.78 | 0.93 | |
| America | 10 | 0.88 | 0.74 | 0.94 | |
| Study objective: | |||||
| Psychometric | 20 | 0.88 | 0.79 | 0.93 | |
| Not psychometric | 7 | 0.86 | 0.67 | 0.95 | |
| Conflict of interest: | |||||
| No conflict | 8 | 0.87 | 0.71 | 0.94 | |
| Not reported | 19 | 0.88 | 0.79 | 0.93 | |
| COSMIN Risk of Bias check-list: | |||||
| Very good | 6 | 0.85 | 0.62 | 0.94 | |
| Adequate | 2 | 0.80 | 0.22 | 0.96 | |
| Doubtful | 17 | 0.85 | 0.79 | 0.93 | |
| Inadequate | 2 | 0.92 | 0.56 | 0.99 | |
k: number of cohorts, ASIS: anterosuperior iliac spine, ANOVA: analysis of variance, ICC: mean reliability coefficient, ICC and ICC: lower and upper 95% confidence limits for ICC, Q: statistic for testing the statistical significance of the predictor variable, Q: statistic for testing the model misspecification, R: proportion of variance explained by the predictor variable.
Results of the multiple meta-regression model assuming a mixed-effects model.
| Source |
|
|
|
|
|---|---|---|---|---|
| Intercept | 1.09 | 0.11 | 9.89 | <0.001 |
| Sample type | 0.65 | 0.27 | 2.39 | 0.017 |
| Familiarization session | 0.68 | 0.24 | 2.88 | 0.004 |
| Full model: | ||||
b: unstandardized regression coefficient, SE: standard error of b, Z: statistic for testing the statistical significance of each moderator variable, Q: statistic for testing the global significance of the model, Q: statistic for testing the model misspecification, R2: proportion of variance accounted for by the full model.