| Literature DB >> 24106484 |
Juan Botella1, Huiling Huang, Manuel Suero.
Abstract
Studies that evaluate the accuracy of binary classification tools are needed. Such studies provide 2 × 2 cross-classifications of test outcomes and the categories according to an unquestionable reference (or gold standard). However, sometimes a suboptimal reliability reference is employed. Several methods have been proposed to deal with studies where the observations are cross-classified with an imperfect reference. These methods require that the status of the reference, as a gold standard or as an imperfect reference, is known. In this paper a procedure for determining whether it is appropriate to maintain the assumption that the reference is a gold standard or an imperfect reference, is proposed. This procedure fits two nested multinomial tree models, and assesses and compares their absolute and incremental fit. Its implementation requires the availability of the results of several independent studies. These should be carried out using similar designs to provide frequencies of cross-classification between a test and the reference under investigation. The procedure is applied in two examples with real data.Entities:
Keywords: binary classification; diagnostic accuracy; gold standard; imperfect reference; multinomial tree models
Year: 2013 PMID: 24106484 PMCID: PMC3789284 DOI: 10.3389/fpsyg.2013.00694
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Contingency table of the binary classifications of a reference (R) and a test (T).
Figure 2Tree diagram of Model 1 (GS) and the corresponding equations.
Figure 3Tree diagram of Model 2 (IR) and the corresponding equations.
Parameters estimated in the two solutions provided by Multitree for Model 2, IR, in the two examples with real data.
| AUDIT | Set 1 | 0.996 | 1.00 | 0.637 | 0.960 | 13.985 |
| Set 2 | 0.000 | 0.004 | 0.040 | 0.363 | ||
| MMSE | Set 1 | 0.876 | 1.00 | 0.864 | 0.872 | 12.136 |
| Set 2 | 0.000 | 0.124 | 0.128 | 0.136 |
Raw frequencies of the primary studies included in the two examples with real data.
| AUDIT | Bradley et al., | 58 | 47 | 6 | 150 | 0.552 | 0.962 | 0.402 |
| Philpot et al., | 13 | 4 | 7 | 104 | 0.765 | 0.937 | 0.133 | |
| Reid et al., | 22 | 3 | 6 | 148 | 0.880 | 0.961 | 0.140 | |
| Reid et al., | 7 | 3 | 8 | 243 | 0.700 | 0.968 | 0.038 | |
| MMSE | Brayne and Calloway, | 24 | 5 | 31 | 205 | 0.828 | 0.869 | 0.109 |
| Brodaty et al., | 66 | 16 | 48 | 153 | 0.805 | 0.761 | 0.290 | |
| Clarke et al., | 137 | 17 | 28 | 122 | 0.890 | 0.813 | 0.507 | |
| Cullen et al., | 40 | 4 | 138 | 933 | 0.909 | 0.871 | 0.039 |
Parameter estimates, goodness-of-fit and Minimum Description Length of the two models for the AUDIT and the MMSE data.
| AUDIT | 1—GS | – | – | 0.637 | 0.038 | 0.960 | 0.008 | 13.99 | 6 | 0.030 | 17.9 | 574.8 | ||
| 2—IR | 0.996 | 0.054 | 1.00 | 0.011 | 0.637 | 0.044 | 0.960 | 0.011 | 13.99 | 4 | 0.007 | 20.1 | 577.0 | |
| MMSE | 1—GS | – | – | 0.864 | 0.020 | 0.852 | 0.009 | 21.14 | 6 | 0.002 | 20.1 | 1495.2 | ||
| 2—IR | 0.876 | 0.040 | 1.00 | 0.004 | 0.864 | 0.023 | 0.872 | 0.011 | 12.14 | 4 | 0.016 | 23.0 | 1493.6 | |