| Literature DB >> 19915664 |
Patricia Scolari Fontela1, Nitika Pant Pai, Ian Schiller, Nandini Dendukuri, Andrew Ramsay, Madhukar Pai.
Abstract
BACKGROUND: Poor methodological quality and reporting are known concerns with diagnostic accuracy studies. In 2003, the QUADAS tool and the STARD standards were published for evaluating the quality and improving the reporting of diagnostic studies, respectively. However, it is unclear whether these tools have been applied to diagnostic studies of infectious diseases. We performed a systematic review on the methodological and reporting quality of diagnostic studies in TB, malaria and HIV.Entities:
Mesh:
Year: 2009 PMID: 19915664 PMCID: PMC2771907 DOI: 10.1371/journal.pone.0007753
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Flow diagram for study selection.
Biases in diagnostic accuracy test studies.
| Bias | Definition |
| Spectrum composition bias | When the spectrum of patients is not representative of the patients who will receive the test in practice |
| Disease progression bias | When the time period between reference standard and index test is not short enough to be reasonably sure that the target condition did not change between the two tests |
| Partial verification bias | When the whole sample or a random selection of the sample does not receive verification using a reference standard of diagnosis |
| Differential verification bias | When patients receive different reference standard depending on the index test result |
| Incorporation bias | When the reference standard is not independent of the index test, i.e., when the index test forms part of the reference standard |
| Test review bias | When the index test results are interpreted with knowledge of the results of the reference standard |
| Reference standard review bias | When the reference standard test results are interpreted with knowledge of the results of the index test |
| Clinical review bias | When test results are interpreted in the light of the clinical data that would not be available when the test is used in practice |
Adapted from: Whiting P, Rutjes A, Reitsma J, Bossuyt P, Kleijnen J. The development of QUADAS: a tool for the quality assessment of studies of diagnostic accuracy included in systematic reviews. BMC Medical Research Methodology 2003;3:25.
Characteristics of the studies included (N = 90).
| Characteristic | Frequency (%) |
|
| |
| Tuberculosis | 45 (50) |
| Malaria | 18 (20) |
| HIV | 27 (30) |
|
| |
| Africa | 16 |
| Asia | 29 |
| Australia and Oceania | 01 |
| Europe | 27 |
| North America | 11 |
| South America | 06 |
|
| |
| Median (interquartile range) | 209 (110–555) |
|
| 39 (43) |
|
| 38 (42) |
|
| |
| 2004 | 42 (47) |
| 2005 | 21 (23) |
| 2006 | 27 (30) |
|
| 46 |
The total number of countries is not 90 because there were some studies that were performed in more than one country.
Assessment of methodological quality using QUADAS* stratified by disease.
| QUADAS item(scored as “Yes”) | Disease | Total | ||
| Tuberculosis (N = 45) | Malaria (N = 18) | HIV (N = 27) | (N = 90) | |
| n (%) | n (%) | n (%) | n (%) | |
|
| ||||
| Adequate spectrum composition | 26 (58) | 13 (72) | 17 (63) | 56 (62) |
|
| ||||
| Clear description of selection criteria | 21 (47) | 12 (67) | 13 (48) | 46 (51) |
|
| ||||
| Adequate reference standard | 44 (98) | 18 (100) | 24 (89) | 86 (96) |
|
| ||||
| Absence of disease progression bias | 42 (93) | 15 (83) | 21 (78) | 78 (87) |
|
| ||||
| Absence of partial verification bias | 44 (98) | 17 (94) | 22 (81) | 83 (92) |
|
| ||||
| Absence of differential verification bias | 42 (93) | 17 (94) | 17 (63) | 76 (84) |
|
| ||||
| Absence of incorporation bias | 45 (100) | 18 (100) | 25 (93) | 88 (98) |
|
| ||||
| Adequate description of the index test execution | 15 (33) | 3 (17) | 7 (26) | 25 (28) |
|
| ||||
| Adequate description of the reference test execution | 6 (13) | 2 (11) | 1 (4) | 9 (10) |
|
| ||||
| Absence of index test review bias | 6 (13) | 5 (28) | 6 (22) | 17 (19) |
|
| ||||
| Absence of reference test review bias | 7 (16) | 8 (44) | 7 (26) | 22 (24) |
|
| ||||
| Absence of clinical review bias | 14 (31) | 12 (67) | 8 (30) | 34 (38) |
|
| ||||
| Report of uninterpretable results | 9 (20) | 1 (6) | 10 (37) | 20 (22) |
|
| ||||
| Description of withdrawals | 3 (7) | 1 (6) | 1 (4) | 5 (6) |
Whiting P, Rutjes A, Reitsma J, Bossuyt P, Kleijnen J. The development of QUADAS: a tool for the quality assessment of studies of diagnostic accuracy included in systematic reviews. BMC Medical Research Methodology 2003;3:25.
Assessment of methodological quality using QUADAS* stratified by year of publication.
| QUADAS item (scored as “Yes”) | Year | Total | ||
| 2004 (N = 42) | 2005 (N = 21) | 2006 (N = 27) | (N = 90) | |
| n (%) | n (%) | n (%) | n (%) | |
|
| ||||
| Adequate spectrum composition | 26 (62) | 14 (67) | 16 (59) | 56 (62) |
|
| ||||
| Clear description of selection criteria | 21 (50) | 12 (57) | 13 (48) | 46 (51) |
|
| ||||
| Adequate reference standard | 41 (98) | 20 (95) | 25 (93) | 86 (96) |
|
| ||||
| Absence of disease progression bias | 38 (91) | 16 (76) | 24 (89) | 78 (87) |
|
| ||||
| Absence of partial verification | 40 (95) | 17 (81) | 26 (96) | 83 (92) |
|
| ||||
| Absence of differential verification bias | 36 (86) | 16 (76) | 24 (89) | 76 (84) |
|
| ||||
| Absence of incorporation bias | 42 (100) | 19 (91) | 27 (100) | 88 (98) |
|
| ||||
| Adequate description of the index test execution | 11 (26) | 4 (19) | 10 (37) | 25 (28) |
|
| ||||
| Adequate description of the reference test execution | 3 (7) | 0 (0) | 6 (22) | 9 (10) |
|
| ||||
| Absence of index test review bias | 10 (24) | 4 (19) | 3 (11) | 17 (19) |
|
| ||||
| Absence of reference test review bias | 10 (24) | 3 (14) | 9 (33) | 22 (24) |
|
| ||||
| Absence of clinical review bias | 17 (41) | 7 (33) | 10 (37) | 34 (38) |
|
| ||||
| Report of uninterpretable results | 10 (24) | 4 (19) | 6 (22) | 20 (22) |
|
| ||||
| Description of withdrawals | 2 (5) | 0 (0) | 3 (11) | 5 (6) |
Whiting P, Rutjes A, Reitsma J, Bossuyt P, Kleijnen J. The development of QUADAS: a tool for the quality assessment of studies of diagnostic accuracy included in systematic reviews. BMC Medical Research Methodology 2003;3:25.
Assessment of the quality of report using STARD* stratified by disease.
| Section and Topic in the STARD checklist (scored as “Reported”) | Disease | Total | ||
| TB (N = 45) | Malaria (N = 18) | HIV (N = 27) | (N = 90) | |
| n (%) | n (%) | n (%) | n (%) | |
|
| ||||
| Identify the article as a study of diagnostic accuracy (recommend MeSH heading ‘sensitivity and specificity’). | 44 (98) | 18 (100) | 27 (100) | 89 (99) |
|
| ||||
| State the research questions or study aims, such as estimating diagnostic accuracy or comparing accuracy between tests or across participant groups. | 44 (98) | 17 (94) | 25 (93) | 86 (96) |
|
| ||||
|
| ||||
| The study population: the inclusion and exclusion criteria, setting and locations where the data were collected. | 30 (67) | 17 (94) | 23 (85) | 70 (78) |
| Participant recruitment: was recruitment based on presenting symptoms, results from previous tests, or the fact that the participants had received the index tests or the reference standard?γ | 28 (62) | 13 (29) | 13 (48) | 54 (60) |
| Participant sampling: was the study population a consecutive series of participants defined by the selection criteria in the previous 2 items? If not, specify how participants were further selected.§ | 14 (31) | 8 (44) | 6 (22) | 28 (31) |
| Data collection: was data collection planned before the index test and reference standard were performed (prospective study) or after (retrospective study)?θ | 38 (84) | 16 (89) | 21 (78) | 75 (83) |
|
| ||||
| The reference standard and its rationale. | 45 (100) | 18 (100) | 25 (93) | 88 (98) |
| Technical specifications of material and methods involved including how and when measurements were taken, and/or cite references for index tests and reference standard. | 44 (98) | 16 (89) | 21 (78) | 81 (90) |
| Definition of and rationale for the units, cutoffs, and/or categories of the results of the index tests and the reference standard. | 41 (91) | 16 (89) | 18 (67) | 75 (83) |
| The number, training, and expertise of the persons executing and reading the index tests and the reference standard. | 3 (7) | 7 (39) | 5 (19) | 15 (17) |
| Whether or not the readers of the index tests and reference standard were blind (masked) to the results of the other test and describe any other clinical information available to the readers. | 5 (11) | 6 (33) | 3 (11) | 14 (16) |
|
| ||||
| Methods for calculating or comparing measures of diagnostic accuracy, and the statistical methods used to quantify uncertainty (e.g., 95% confidence intervals). | 16 (36) | 12 (67) | 14 (52) | 42 (47) |
| Methods for calculating test reproducibility, if done. | 0 (0) | 0 (0) | 0 (0) | 0 (0) |
|
| ||||
|
| ||||
| When study was done, including beginning and ending dates of recruitment. | 34 (76) | 16 (89) | 16 (59) | 66 (73) |
| Clinical and demographic characteristics of the study population (e.g., age, sex, spectrum of presenting symptoms, comorbidity, current treatments, recruitment centers). | 27 (60) | 13 (29) | 19 (70) | 59 (66) |
| The number of participants satisfying the criteria for inclusion that did or did not undergo the index tests and/or the reference standard; describe why participants failed to receive either test (a flow diagram is strongly recommended). | 7 (16) | 4 (22) | 2 (7) | 13 (14) |
|
| ||||
| Time interval from the index tests to the reference standard, and any treatment administered between. | 36 (80) | 13 (29) | 18 (67) | 67 (74) |
| Distribution of severity of disease (define criteria) in those with the target condition; other diagnoses in participants without the target condition. | 6 (13) | 1 (6) | 3 (11) | 10 (11) |
| A cross tabulation of the results of the index tests (including indeterminate and missing results) by the results of the reference standard; for continuous results, the distribution of the test results by the results of the reference standard. | 45 (100) | 18 (100) | 26 (96) | 89 (99) |
| Any adverse events from performing the index tests or the reference standard. | 1 (2) | 0 (0) | 0 (0) | 1 (1) |
|
| ||||
| Estimates of diagnostic accuracy and measures of statistical uncertainty (e.g., 95% confidence intervals). | 43 (96) | 17 (94) | 27 (100) | 87 (97) |
| How indeterminate results, missing responses, and outliers of the index tests were handled. | 8 (18) | 0 (0) | 7 (26) | 15 (17) |
| Estimates of variability of diagnostic accuracy between subgroups of participants, readers or centers, if done. | 1 (2) | 2 (11) | 6 (22) | 9 (10) |
| Estimates of test reproducibility, if done. | 0 (0) | 0 (0) | 0 (0) | 0 (0) |
|
| ||||
| Discuss the clinical applicability of the study findings. | 44 (98) | 18 (100) | 27 (100) | 89 (99) |
TB = tuberculosis MeSH = medical subject heading γ = recruitment based on symptoms § = consecutive sampling θ = prospective study.
Adapted from Bossuyt PM, Reitsma JB, Bruns DE, et al. Towards complete and accurate reporting of studies of diagnostic accuracy: The STARD Initiative. Ann Intern Med 2003;138:40-4.
Assessment of the quality of report using STARD* stratified by year of publication.
| Section and Topic in the STARD checklist (scored as “Reported”) | Year | Total | ||
| 2004 (N = 42) | 2005 (N = 21) | 2006 (N = 27) | (N = 90) | |
| n (%) | n (%) | n (%) | n (%) | |
|
| ||||
| Identify the article as a study of diagnostic accuracy (recommend MeSH heading ‘sensitivity and specificity’). | 42 (100) | 21 (100) | 26 (96) | 89 (99) |
|
| ||||
| State the research questions or study aims, such as estimating diagnostic accuracy or comparing accuracy between tests or across participant groups. | 39 (93) | 21 (100) | 26 (96) | 86 (96) |
|
| ||||
|
| ||||
| The study population: the inclusion and exclusion criteria, setting and locations where the data were collected. | 34 (81) | 17 (81) | 19 (70) | 70 (78) |
| Participant recruitment: was recruitment based on presenting symptoms, results from previous tests, or the fact that the participants had received the index tests or the reference standard? γ | 24 (57) | 13 (62) | 17 (63) | 54 (60) |
| Participant sampling: was the study population a consecutive series of participants defined by the selection criteria in the previous 2 items? If not, specify how participants were further selected.§ | 14 (33) | 8 (38) | 6 (22) | 28 (31) |
| Data collection: was data collection planned before the index test and reference standard were performed (prospective study) or after (retrospective study)? θ | 36 (86) | 18 (86) | 21 (78) | 75 (83) |
|
| ||||
| The reference standard and its rationale. | 45 (100) | 18 (100) | 25 (93) | 88 (98) |
| Technical specifications of material and methods involved including how and when measurements were taken, and/or cite references for index tests and reference standard. | 41 (98) | 20 (95) | 27 (100) | 88 (98) |
| Definition of and rationale for the units, cutoffs, and/or categories of the results of the index tests and the reference standard. | 38 (91) | 19 (91) | 24 (89) | 81 (90) |
| The number, training, and expertise of the persons executing and reading the index tests and the reference standard. | 34 (81) | 17 (81) | 24 (89) | 75 (83) |
| Whether or not the readers of the index tests and reference standard were blind (masked) to the results of the other test and describe any other clinical information available to the readers. | 8 (19) | 3 (14) | 4 (15) | 15 (17) |
|
| ||||
| Methods for calculating or comparing measures of diagnostic accuracy, and the statistical methods used to quantify uncertainty (e.g., 95% confidence intervals). | 17 (41) | 11 (52) | 14 (52) | 42 (47) |
| Methods for calculating test reproducibility, if done. | 0 (0) | 0 (0) | 0 (0) | 0 (0) |
|
| ||||
|
| ||||
| When study was done, including beginning and ending dates of recruitment. | 34 (81) | 13 (62) | 19 (70) | 66 (73) |
| Clinical and demographic characteristics of the study population (e.g., age, sex, spectrum of presenting symptoms, comorbidity, current treatments, recruitment centers). | 29 (69) | 13 (62) | 17 (70) | 59 (66) |
| The number of participants satisfying the criteria for inclusion that did or did not undergo the index tests and/or the reference standard; describe why participants failed to receive either test (a flow diagram is strongly recommended). | 7 (17) | 2 (10) | 4 (24) | 13 (14) |
|
| ||||
| Time interval from the index tests to the reference standard, and any treatment administered between. | 33 (79) | 14 (67) | 20 (74) | 67 (74) |
| Distribution of severity of disease (define criteria) in those with the target condition; other diagnoses in participants without the target condition. | 4 (10) | 2 (10) | 4 (24) | 10 (11) |
| A cross tabulation of the results of the index tests (including indeterminate and missing results) by the results of the reference standard; for continuous results, the distribution of the test results by the results of the reference standard. | 42 (100) | 20 (95) | 27 (100) | 89 (99) |
| Any adverse events from performing the index tests or the reference standard. | 1 (2) | 0 (0) | 0 (0) | 1 (1) |
|
| ||||
| Estimates of diagnostic accuracy and measures of statistical uncertainty (e.g., 95% confidence intervals). | 40 (95) | 20 (95) | 27 (100) | 87 (97) |
| How indeterminate results, missing responses, and outliers of the index tests were handled. | 8 (18) | 4 (19) | 3 (11) | 15 (17) |
| Estimates of variability of diagnostic accuracy between subgroups of participants, readers or centers, if done. | 5 (12) | 2 (10) | 2 (7) | 9 (10) |
| Estimates of test reproducibility, if done. | 0 (0) | 0 (0) | 0 (0) | 0 (0) |
|
| ||||
| Discuss the clinical applicability of the study findings. | 41 (98) | 21 (100) | 27 (100) | 89 (99) |
MeSH = Medical Subject Heading γ = recruitment based on symptoms § = consecutive sampling θ = prospective study.
Adapted from Bossuyt PM, Reitsma JB, Bruns DE, et al. Towards complete and accurate reporting of studies of diagnostic accuracy: The STARD Initiative. Ann Intern Med 2003;138:40-4.