| Literature DB >> 25897837 |
Martani J Lombard1, Nelia P Steyn2, Karen E Charlton3, Marjanne Senekal4.
Abstract
BACKGROUND: Several statistical tests are currently applied to evaluate validity of dietary intake assessment methods. However, they provide information on different facets of validity. There is also no consensus on types and combinations of tests that should be applied to reflect acceptable validity for intakes. We aimed to 1) conduct a review to identify the tests and interpretation criteria used where dietary assessment methods was validated against a reference method and 2) illustrate the value of and challenges that arise in interpretation of outcomes of multiple statistical tests in assessment of validity using a test data set.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25897837 PMCID: PMC4471918 DOI: 10.1186/s12937-015-0027-y
Source DB: PubMed Journal: Nutr J ISSN: 1475-2891 Impact factor: 3.271
Summary of identified statistical tests and interpretation criteria for validation of dietary intake assessment methods
| Statistical test | Facet of validity reflected | Interpretation criteria | ||
|---|---|---|---|---|
| Good outcome | Acceptable outcome | Poor outcome | ||
| Correlation coefficient [ | Strength and direction of association at individual level [ | ≥0.50 [ | 0.20 - 0.49 [ | <0.20 [ |
| Paired | Agreement at group level [ | P > 0.05 [ | P ≤ 0.05 [ | |
| Percent difference [ | Agreement at group level (size and direction of error) [ | 0.0 - 10.0% [ | >10% | |
| Cross-classification (tertiles/ quartiles or quintiles) [ | Agreement (including chance), at individual level [ | ≥50% in same tertile/quartile [ | <50% in same tertile/quartile [ | |
| • In same tertile | ||||
| • In opposite tertile | ||||
| Weighted Kappa statistics (coefficient) [ | Agreement (excluding chance) at individual level [ | ≥0.61 [ | 0.20 - 0.60 [ | <0.20 [ |
| Bland Altman analysis: Correlation between mean and mean difference) [ | Presence, direction and extent of bias at group level [ | P > 0.05 [ | P ≤ 0.05 [ | |
Summary of statistical test combinations applied in reviewed validation studies
| Combination of tests ranked from most to least frequent use | Statistical test (number of combinations | Number of studies the combination (total n = 60) | References for identified studies | |||||
|---|---|---|---|---|---|---|---|---|
| Correlation coefficient | Cross-classification | % difference | Kappa Statistic | Bland Altman | ||||
| 1 | X | X | 8 | [ | ||||
| 2 | X | 7 | [ | |||||
| 3 | X | X | X | 6 | [ | |||
| 4 | X | X | X | 5 | [ | |||
| 5 | X | X | 5 | [ | ||||
| 6 | X | X | 5 | [ | ||||
| 7 | X | X | X | 4 | [ | |||
| 8 | X | X | X | 2 | [ | |||
| 9 | X | X | X | X | X | 2 | [ | |
| 10 | X | X | X | X | 2 | [ | ||
| 11 | X | X | 2 | [ | ||||
| 12 | X | X | X | X | 2 | [ | ||
| 13 | X | X | X | X | 2 | [ | ||
| 14 | X | X | X | 1 | [ | |||
| 15 | X | X | X | 1 | [ | |||
| 16 | X | X | X | 1 | [ | |||
| 17 | X | X | X | X | 1 | [ | ||
| 18 | X | X | X | X | 1 | [ | ||
| 19 | X | X | X | 1 | [ | |||
| 20 | X | 1 | [ | |||||
| 21 | X | 1 | [ | |||||
Mean(SD) and median(IQ Range) estimates for energy and select nutrient intakes derived from the test data set
| Nutrient | EER/RDA | Test method (g) | Reference method (g) | Difference (g) | |||||
|---|---|---|---|---|---|---|---|---|---|
| Males | Females | Mean (SD) | Median (IQ Range) | Mean (SD) | Median (IQ Range) | Mean (SD) | 95% CI of difference | Median (IQ Range) | |
| Energy (kJ) | 12881 | 10093 | 12463 (5854) | 12475 (7686–17091) | 13819 (3677) | 13661 (11003–16225) | −1356 (6025) | −9192 - 8602 | −1642 (−6439 - 2830) |
| Protein (g) | 56 | 46 | 67.3 (34.0) | 66.0 (44.7-91.3) | 84.9 (26.2) | 81.6 (68.8-99.5) | −17.6 (38.5) | −75.5-48.0 | −19.3 (−42.4 - 4.2) |
| Fat (g) | 118 | 93 | 69.5 (46.9) | 54.8 (38.3 – 86.5) | 83.9 (31.5) | 82.8 (62.7-102.5) | −14.4 (55.5) | −83.3-62.1 | −12.8 (−54.0-16.9) |
| Carbohydrates (g) | 130 | 130 | 475.7 (222.3) | 475.7 (289.1-618.3) | 494.1 (148.6) | 487.7 (397.3-572.0) | −18.5 (215.4) | −326.2-325.5 | −16.3 (−153.1-140.3) |
| Folate (mcg) | 400 | 400 | 558.0 (355.2) | 454.0 (331.0-788.0) | 419.6 (235.7) | 405.8 (247.5-552.7) | 138.4 (393.7) | −401.6-842.9 | 108.6 (0.9-305.0) |
| Vitamin A (mcg) | 900 | 700 | 192.6 (231.1) | 92.0 (51.0-242.0) | 346.7 (276.9) | 277.0 (171.1-402.4) | −154.2 (360.9) | −895.2-373.9 | −105.2 (−278.8-5.1) |
| Iron (mg) | 8 | 18 | 12.1 (7.1) | 12.3 (6.5-15.7) | 16.1 (5.5) | 15.4 (11.4-0.2) | −6.0 (8.2) | −18.9 – 10.5 | −6.2 (−11.1- -1.7) |
1Energy and nutrient intake derived from a QFFQ and four repeated 24-hour recalls conducted in 18–65 year old adults (n = 47, 11 males & 36 females) in a rural area in the Eastern Cape.
RDA: Recommended dietary allowances.
SD = standard deviation, IQ Range = inter quartile range, CI = 95% Confidence interval.
Statistical test outcomes and interpretation for energy and nutrient intakes derived from the test data set
| Nutrient | Spearman correlation (r value) | Wilcoxon signed rank test (P value) | Percentage difference (%) | Cross-classification (Tertiles) | Weighted Kappa statistics (value) | Bland – Altman 2,3 Spearman Correlation (r value) | |
|---|---|---|---|---|---|---|---|
| Association (strength & direction) | Agreement | Agreement (size & direction of error) | Agreement (including chance) | Agreement (excluding chance) | Presence, direction and extent of bias | ||
| % in same tertile | % in opposite tertile | ||||||
| Level of validation | Individual | Group | Group | Individual | Individual | Individual | Group |
| Energy (kJ) | 0.26 | P > 0.05 | −9.8 | 46.8 | 19.2 | 0.20 | P < 0.001 |
| Validly interpretation | Acceptable | Good | Good | Poor | Poor | Acceptable | Biased |
| Protein (g) | 0.23 | P < 0.01 | −19.1 | 42.6 | 23.4 | 0.12 | P < 0.05 |
| Validly interpretation | Acceptable | Poor | Acceptable | Poor | Poor | Poor | Biased |
| Fat (g) | 0.01 | P > 0.05 | −6.9 | 34.0 | 31.9 | −0.01 | P > 0.05 |
| Validly interpretation | Poor | Good | Good | Poor | Poor | Poor | Not biased |
| Carbohydrates (g) | 0.40 | P > 0.05 | −1.4 | 50.0 | 17.4 | 0.25 | P < 0.01 |
| Validly interpretation | Acceptable | Good | Good | Good | Poor | Acceptable | Biased |
| Folate (mcg) | 0.40 | P < 0.05 | 33.0 | 53.2 | 8.5 | 0.30 | P < 0.01 |
| Validly interpretation | Acceptable | Poor | Poor | Good | Good | Acceptable | Biased |
| Vitamin A (mcg) | 0.15 | P < 0.01 | −22.9 | 34.0 | 14.9 | 0.03 | P > 0.05 |
| Validly interpretation | Poor | Poor | Poor | Poor | Poor | Poor | Not biased |
| Iron (mg) | 0.38 | P > 0.05 | −24.8 | 51.1 | 23.4 | 0.29 | P < 0.01 |
| Validly interpretation | Acceptable | Good | Poor | Good | Poor | Acceptable | Biased |
1Energy and nutrient intake derived from a QFFQ and four repeated 24-hour recalls conducted in 18–65 year old adults (n = 47, 11 males & 36 females) in a rural area in the Eastern Cape.
2% in LOA: Energy: 93.6%; protein: 95.7%, fat: 97.9%, carbohydrate: 95.7%; folate: 95.7%, vitamin A: 89.4%; iron: 98%.
3Upper & lower limits of agreement (LOA) and DRI for females aged 18 to 55: energy (kJ): −13406 & 10694 (EER: 10093 kJ); protein (g): −94.6 & 59.4 (RDA: 46 g); fat: −125.4 & 96.6 (no DRI); folate (mcg): −649 & 925.8 (RDA: 400mcg); vitamin A (mcg): −876 &576.4 (RDA: 700mcg); iron (mg): −22,4 &10.4 (RDA: ????mg).
Interpretation criteria for statistical tests.
Wilcoxon signed rank test: Good: p > 0.05; Poor: ≤ 0.05 [9].
Percentage difference: Good: 0.0 – 10.9%; Acceptable: 11.0 – 20.0%; Poor: > 20.0% [92].
Correlations coefficient (Spearman): Good: ≥ 0.50; Acceptable: 0.20 – 0.49; Poor < 0.20[2]
Cross-classification (Tertiles) (% in same tertile): Good: ≥ 50%; Poor: < 50% [2].
Cross-classification (% in opposite tertile): Good: ≤ 10%, Poor: > 10% [2].
Weighted Kappa statistics: Good: ≥ 0.61; Acceptable: 0.20 – 0.59; Poor: < 0.20 [2].
Bland-Altman - Correlation coefficient (Spearman): Good: P > 0.05; Poor: P ≤ 0.05 [2].