| Literature DB >> 27229897 |
Sarah Domone1, Steven Mann2, Gavin Sandercock2,3, Matthew Wade2, Chris Beedie2,4.
Abstract
Previous research has reported the validity and reliability of a range of field-based tests of children's cardiorespiratory fitness. These two criteria are critical in ensuring the integrity and credibility of data derived through such tests. However, the criterion of scalability has received little attention. Scalability determines the degree to which tests developed on small samples in controlled settings might demonstrate real-world value, and is of increasing interest to policymakers and practitioners. The present paper proposes a method by which the scalability of cardiorespiratory field-based tests suitable for school-aged children might be assessed. We developed an algorithm to estimate scalability based on a six-component model; delivery, evidence of operating at scale, effectiveness, costs, resource requirements and practical implementation. We tested the algorithm on data derived through a systematic review of research that has used relevant fitness tests. A total of 229 studies that had used field based cardiorespiratory fitness tests to measure children's fitness were identified. Initial analyses indicated that the 5-min run test did not meet accepted criteria for reliability, whilst the 6-min walk test likewise failed to meet the criteria for validity. Of the remainder, a total of 28 studies met the inclusion criteria, 22 reporting the 20-m shuttle-run and seven the 1-mile walk/run. Using the scalability algorithm we demonstrate that the 20-m shuttle run test is substantially more scalable than the 1-mile walk/run test, with tests scoring 34/48 and 25/48, respectively. A comprehensive analysis of scalability was prohibited by the widespread non-reporting of data, for example, those relating to cost-effectiveness. Of all sufficiently valid and reliable candidate tests identified, using our algorithm the 20-m shuttle run test was identified as the most scalable. We hope that the algorithm will prove useful in the examination of scalability in either new data relating to existing tests or in data pertaining to new tests.Entities:
Mesh:
Year: 2016 PMID: 27229897 PMCID: PMC5097078 DOI: 10.1007/s40279-016-0553-6
Source DB: PubMed Journal: Sports Med ISSN: 0112-1642 Impact factor: 11.136
Scoring schedule for components of scalability
| Variable | Operational definition | Assessment criteria | Maximum score | Weight | |
|---|---|---|---|---|---|
| Delivery | Test context | Can this test be conducted in a school setting? | (2) Strong evidence | 2 | |
| Test duration | Can this test be carried out within the time limits of a normal PE lesson? | (2) Strong evidence | 2 | ||
| Testing interval | Is this test suitable for use within a longitudinal testing programme? | (2) Strong evidence | 2 | ||
| Delivery staff | Can the test be administered by PE teachers and/or school staff? | (2) Strong evidence | 2 | ||
| Total | 8 | 1 | |||
| Evidence of operating at scale | Sample size | Is this test appropriate for population level testing? | (2) Strong evidence = field test administered at a national or international level | 2 | |
| Number of schools | Is implementation of this field test likely to be acceptable to multiple target schools when scaled up? | (2) Strong evidence | 2 | ||
| Total | 4 | 2 | |||
| Effectiveness | Validity | Is the criterion-related validity of the test acceptable for the target population? | (2) Strong evidence | 2 | |
| Test–retest reliability | Is the test–retest reliability validity of test acceptable for the target population? | (2) Strong evidence | 2 | ||
| Reach and adoption | Is there a high level of participation of the intended target population? | (2) Strong evidence | 2 | ||
| Completion rates | Can the test be completed safely and is the test acceptable to the target participants? | (2) Strong evidence | 2 | ||
| Total | 8 | 1 | |||
| Cost | Cost effectiveness | Is the test affordable? | (2) Strong evidence = i.e. NCMP estimated cost is £123,000 based on collection of annual data from 147 PCTs (3 person days each) | 2 | |
| Total | 2 | 4 | |||
| Resource requirements | Are there additional requirements in terms of equipment, space, skills, competencies and workforce requirements? | (2) None | 2 | ||
| Total | 2 | 4 | |||
| Practical implementation issues | Can the field test be undertaken, administered and scored with ease? | (2) Strong evidence | 2 | ||
| Total | 2 | 4 | |||
PE physical education, NCMP National Child Measurement Programme, PCT primary care trust
Fig. 1Flowchart of test assessment. PE physical education
Details of review items relating to scalability framework
| Component | Variable | Operational definition | Assessment criteria |
|---|---|---|---|
| Delivery | Test environment | Information relating to whether the field testing was conducted in a school setting | Yes = test performed in a school setting |
| Test duration | Expected or actual duration of the field test protocol reported | Yes = duration of test/trial reported | |
| Testing interval | Duration relating to the interval over which the testing was conducted | Yes = duration reported | |
| Delivery staff | Information relating to the personnel used to administer the testing protocols and record the results | Yes = tests performed by usual service delivery staff (PE teachers) | |
| Evidence of operating at scale | Sample size | Evidence that the field test has been used to assess fitness of young people at a national/population level | Yes = field test administered at a national or international level |
| Number of schools | Evidence that the implementation of the field test is likely to be acceptable to multiple target schools when scaled up | Yes = multiple schools used in study | |
| Effectiveness | Validity | How well a specific test measures what it intends to measure | Yes = strong or moderate evidence of acceptable criterion related validity of test |
| Test–retest reliability | The consistency of performer/s scoring over repeated rounds of testing | Yes = strong or moderate evidence of acceptable test–retest reliability | |
| Reach and adoption | Differential effect, reach and adoption across target groups, socioeconomic status and settings | Yes = reach and adoption is reported | |
| Completion rates | Measure of acceptability to individuals | Yes = completion rates are reported | |
| Cost considerations | Cost effectiveness | Information relating to the cost of the field test per head is provided | Yes = cost per head of test is reported |
| Resource requirements | Information relating to the required resources in terms of equipment, space, skills, competencies, workforce, and financial requirements provided | Yes = resource requirements are reported | |
| Practical implementation issues/considerations | The ease with which the field test can be undertaken, administered and scored | Yes = feasibility/practicality is discussed |
PE physical education, NR not reported
Scalability properties of reviewed articles
| Field test | Study | Sample size ( | Match to review criteria | Schools ( | Delivery | Test duration | Testing interval | Staff | Reach and adoption | Completion rate (%) | Cost-effectiveness | Resources required | Practical issues |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 20-m shuttle run | |||||||||||||
| Baquet et al. [ | 503 | No | NR | Yes | NR | 70 days | Yes | No | 100 % | NR | Partial | Partial | |
| Beets and Pitetti [ | 241 | No | No = 1 | Yes | NR | 21 days | Yes | Yes | 100 % | NR | NR | Yes | |
| Boddy et al. [ | 27,942 | Partial | Yes >1 | No | NR | 12 years | No | Yes | NR | NR | NR | Partial | |
| Borehamet al. [ | 1015 | Yes | Yes = 16 | NR | NR | NR | No | Yes | 78 % | NR | Partial | NR | |
| Burns et al. [ | 134 | Partial | Yes = 3 | Yes | NR | NR | No | Yes | NR | NR | Partial | NR | |
| Castro-Piñeiro et al. [ | 2752 | Partial | Yes = 18 | NR | NR | NR | NR | Yes | 95 % | NR | Yes | NR | |
| Jenner et al. [ | 1311 | Partial | Yes = 27 | Yes | NR | 4 months | No | Yes | 84 % | NR | Partial | NR | |
| Kim et al. [ | 6297 | Partial | Yes = 15 | Yes | NR | 3 years | Yes | Yes | 89 % | NR | Partial | NR | |
| Mahar et al. [ | 266 | Partial | Yes = 26 | Yes | NR | 7 days | Yes | No | NR | NR | Partial | NR | |
| Mahoney et al. [ | 103 | No | No = 1 | Yes | NR | 28 days | No | Yes | 100 % | NR | Partial | Partial | |
| Matsuzaka et al. [ | 132 | Partial | Yes = >1 | NR | NR | 2 months | NR | Yes | NR | NR | Partial | NR | |
| Ortega et al. [ | 123 | Yes | No = 0 | No | NR | 14 days | No | Yes | 100 % | NR | Partial | NR | |
| Ortega et al. [ | 3528 | Yes | No = 0 | No | 90 minsa | 2 years | No | Yes | NR | NR | Partial | NR | |
| Quinart et al. [ | 30 | No | No = 0 | No | NR | 9 months | No | Yes | 88 % | NR | NR | Partial | |
| Roberts et al. [ | 15,315 | Partial | Yes = >1 | Yes | NR | 4 years | No | Yes | NR | NR | NR | NR | |
| Sandercock et al. [ | 2041 | Partial | Yes = 5 | Yes | NR | 3 months | No | Yes | NR | NR | Yes | NR | |
| Sandercock et al. [ | 6628 | Partial | Yes = 28 | Yes | NR | 1 years | No | Yes | NR | NR | Partial | NR | |
| Sandercock et al. [ | 7393 | Partial | Yes = 26 | Yes | NR | 4 years | No | Yes | NR | NR | Partial | NR | |
| Stratton et al. [ | 15,621 | Partial | Yes = 106 | NR | NR | 6 years | No | Yes | 74 % | NR | NR | NR | |
| Voss and Sandercock [ | 208 | No | NR | Yes | NR | 3 months | No | No | NR | NR | Yes | NR | |
| Voss and Sandercock [ | 5927 | Partial | Yes = 23 | Yes | NR | 1 years | No | Yes | NR | NR | Partial | NR | |
| Voss and Sandercock [ | 4029 | Partial | Yes = 26 | Yes | NR | 1 years | No | Yes | NR | NR | Partial | NR | |
| 1-mile walk/run | |||||||||||||
| Beets and Pitetti [ | 241 | No | No −1 | Yes | NR | 21 days | Yes | Yes | 100 % | NR | NR | Yes | |
| Buono et al. [ | 90 | No | No = 1 | Yes | NR | 2 days | NR | Yes | 100 % | NR | Partial | NR | |
| Burns et al. [ | 134 | Partial | Yes = 3 | Yes | NR | NR | No | Yes | NR | NR | Partial | NR | |
| Castro-Piñeiro et al. [ | 2752 | Partial | Yes = 18 | NR | NR | NR | NR | Yes | 95 % | NR | Yes | NR | |
| Cureton et al. [ | 753 | Partial | NR | NR | NR | 4 years | NR | Yes | 99.30 % | NR | NR | NR | |
| Hunt et al. [ | 86 | No | NR | Yes | NR | 14 days | No | Yes | 97 % | NR | Yes | Partial | |
| Mahar et al. [ | 266 | Partial | Yes = 26 | Yes | NR | 7 days | Yes | No | NR | NR | Partial | NR |
NR not reported
aTest battery
Scalability of field based cardiovascular fitness tests
| Assessment item | 20-m shuttle run | 1-mile walk/run |
|---|---|---|
| Delivery | ||
| Ease of integration into usual service delivery | 66.7 % (four teachers) had previous experience of test [ | Number of children that can be tested at once depends on space restrictions and capacity for timing individuals |
| Burden on delivery staff and other stakeholders | Considered feasible based on survey results from six PE teachers who were asked about factors relating to: (1) whether children wore appropriate clothing to perform, (2) ease of instructions, (3) ease of implementation, (4) rejections and appropriateness of facilities [ | – |
| Preparation requirements | Two lines set up 20 m apart, speakers equal distance from each [ | Measure distance if track unavailable |
| Test duration | Preparation = 5 min, testing = 10 min (a group of 20 individuals) [ | Mean ± SD time for 8 = 11 years = 9.2 ± 1.8 mins (males), 10.3 ± 1.8 mins (females) [ |
| Effectiveness | ||
| Reach and adoption | Shown to be the preferable choice over the one mile run for student’s motivation for participation [ | Physical activity engagement (duration of activity, pace, energy expenditure) was significantly greater in the one mile run than the 20-m shuttle run, particularly for the low-performing students with a relatively high BMI [ |
| Completion rates | One participant ( | – |
| Resource requirements | ||
| Equipment | Audio device, speakers, cones to mark length [ | Stopwatch |
| Space | Flat surface, indoor (preferred) or outdoor (weather dependent), 20 m in length + room to turn round, 1-m width per child [ | Outside measurable area, flat surface, no standard surface for this test therefore outdoor 400-m athletics track [ |
| Human resource | Two members of staff = one to ensure protocols are followed correctly, one to record scores [ | Two members of staff = one to time and one to record results [ |
| Training | CD provides audio instructions = no technical training required [ | No advanced technical training requirements |
| Costs | – | – |
| Practical implementation issues and considerations | For a single study, 22 (37.9 %) children and 25 (33.3 %) adolescents experienced some degree of DOMS, from whom six children (10.3 %) and four adolescents (5.3 %) indicated that their DOMS was severe. Three (2.3 %) subjects reported having severe pain in the upper body, 29 (21.8 %) in the lower body and 14 (10.5 %) in the whole body. Most (39 participants; 29.3 %) assumed that the 20-m shuttle run test could be the cause. For 11 (19 %) children and 14 (18.7%) adolescents, DOMS caused difficulties in daily activities, especially stair climbing and walking [ | Participants may have difficulty in developing an appropriate pace; participants may either start too fast so that they are not able to keep up the speed all through the test, or they may start too slow so that when they want to increase speed, the test is already finished [ |
PE physical education, DOMS delayed onset muscle soreness, CD compact disc, SD standard deviation, BMI body mass index
Assessment percentage scores for reviewed articles
| Assessment item | Review items percentage score (%) | |||
|---|---|---|---|---|
| Delivery | Yes | Partial | No | NR |
| Test context | 72 | 16 | 12 | |
| Test duration | 3 | 97 | ||
| Testing interval | 88 | 12 | ||
| Delivery staff | 20 | 68 | 12 | |
| Effectiveness | ||||
| Reach and adoption | 85 | 15 | ||
| Completion rates | 45 | 55 | ||
| Cost considerations | ||||
| Cost effectiveness | 0 | 100 | ||
| Evidence of operating at scale | ||||
| Sample size | 12 | 56 | 20 | 12 |
| Number of schools | 56 | 28 | 16 | |
| Resource requirements | 11 | 65 | 24 | |
| Practical implementation issues/considerations | 8 | 19 | 73 | |
NR not reported
Scalability scores for the 20-m shuttle run test and the 1-mile walk/run test
| Component | Variable | 20-m shuttle run test | 1 mile run/walk | ||
|---|---|---|---|---|---|
| Score | Comment | Score | Comment | ||
| Delivery | Test context | 2 | 14 studies conducted in school setting | 2 | Five studies conducted in school setting |
| Test duration | 1 | One study reported = 90 mins (test battery) | 0 | Not reported | |
| Testing interval | 2 | 19 studies used test for longitudinal testing, testing period range 7 days: 12 years | 2 | Four studies used test for longitudinal studies, range 7 days: 4 years | |
| Delivery staff | 2 | Three studies reported using PE staff to administer test | 1 | Two studies reported using PE staff to administer test | |
| Total | 7 | 5 | |||
| Evidence of operating at scale | Sample size | 2 | Three studies at population level (national, international), 13 studies multiple settings within local area | 2 | Three studies multiple settings within local area |
| Number of schools | 2 | 15 studies administered test in multiple schools (range 1–106) | 1 | Three studies administered test in multiple schools (range 1–26) | |
| Total | 4 | 3 | |||
| Effectiveness | Validity | 2 | Strong evidence [ | 1 | Moderate evidence [ |
| Test–retest reliability | 2 | Strong evidence [ | 1 | Moderate evidence [ | |
| Reach and adoption | 2 | Reach and adoption across target groups and differential effect considered in 19 studies | 2 | Reach and adoption across target groups and differential effect considered in five studies | |
| Completion rates | 1 | Where reported completion rates varied from 74–100 % | 2 | Where reported completion rates varied from 97–100 % | |
| Total score | 7 | 6 | |||
| Cost | Cost-effectiveness | 0 | Not reported | 0 | Not reported |
| Total | 0 | 0 | |||
| Resource requirements | 1 | Equipment = audio device, speakers, cones to mark length [ | 1 | Equipment = stopwatch. Space = outside measurable area, flat surface, no standard surface for this test therefore outdoor 400-m athletics track [ | |
| Total | 1 | 1 | |||
| Practical implementation issues | 2 | For a single study, 22 (37.9 %) children and 25 (33.3 %) adolescents experienced some degree of DOMS, from which six children (10.3 %) and four adolescents (5.3 %) indicated that their DOMS was severe. Three (2.3 %) subjects reported having severe pain in the upper body, 29 (21.8 %) in lower body, and 14 (10.5 %) in the whole body. Most (39 participants; 29.3 %) assumed that the 20-m shuttle run test could be the cause. For 11 (19 %) children and 14 (18.7 %) adolescents, DOMS caused difficulties in daily activities, especially stair climbing and walking [ | 1 | Participants may have difficulty in developing an appropriate pace; participants may either start too fast so that they are not able to keep up the speed all through the test, or they may start too slow so that when they want to increase speed, the test is already finished [ | |
| Total | 2 | 1 | |||
| Overall weighted score | 34 | 25 | |||
PE physical education, CD compact disc, DOMS delayed onset muscle soreness
Fig. 2Scalability scores for 20m SRT compared with 1 mile walk/run test. SRT shuttle run test
| Previous research has reported the validity and reliability of a number of tests of children’s fitness. |
| Our systematic review indicated that the 5-min run test did not meet accepted criteria for reliability, whilst the 6-min walk test failed to meet the criteria for validity. |
| We further identified that of all sufficiently valid and reliable tests of children’s fitness, the 20-m shuttle run test was identified as the most scalable. |