| Literature DB >> 36268103 |
Clare McGenity1,2, Patrick Bossuyt3, Darren Treanor1,2,4,5.
Abstract
Artificial intelligence (AI) research is transforming the range tools and technologies available to pathologists, leading to potentially faster, personalized and more accurate diagnoses for patients. However, to see the use of tools for patient benefit and achieve this safely, the implementation of any algorithm must be underpinned by high quality evidence from research that is understandable, replicable, usable and inclusive of details needed for critical appraisal of potential bias. Evidence suggests that reporting guidelines can improve the completeness of reporting of research, especially with good awareness of guidelines. The quality of evidence provided by abstracts alone is profoundly important, as they influence the decision of a researcher to read a paper, attend a conference presentation or include a study in a systematic review. AI abstracts at two international pathology conferences were assessed to establish completeness of reporting against the STARD for Abstracts criteria. This reporting guideline is for abstracts of diagnostic accuracy studies and includes a checklist of 11 essential items required to accomplish satisfactory reporting of such an investigation. A total of 3488 abstracts were screened from the United States & Canadian Academy of Pathology annual meeting 2019 and the 31st European Congress of Pathology (ESP Congress). Of these, 51 AI diagnostic accuracy abstracts were identified and assessed against the STARD for Abstracts criteria for completeness of reporting. Completeness of reporting was suboptimal for the 11 essential criteria, a mean of 5.8 (SD 1.5) items were detailed per abstract. Inclusion was variable across the different checklist items, with all abstracts including study objectives and no abstracts including a registration number or registry. Greater use and awareness of the STARD for Abstracts criteria could improve completeness of reporting and further consideration is needed for areas where AI studies are vulnerable to bias.Entities:
Year: 2022 PMID: 36268103 PMCID: PMC9576989 DOI: 10.1016/j.jpi.2022.100091
Source DB: PubMed Journal: J Pathol Inform
Figure 1Flowchart of process for identification of artificial intelligence abstracts.
Figure 2Flowchart of screening process to identify diagnostic accuracy studies of artificial intelligence.
The STARD for Abstracts criteria as outlined by Cohen et al. 2017.
| The STARD for abstracts 11 checklist items |
Identification as a study of diagnostic accuracy using at least one measure of accuracy (such as sensitivity, specificity, predictive values or AUC) |
Study objectives |
Data collection: whether this was a prospective or retrospective study |
Eligibility criteria for participants and the settings where the data were collected |
Whether participants formed a consecutive, random, or convenience series |
Description of the index test and reference standard |
Number of participants with and without the target condition included in the analysis |
Estimates of diagnostic accuracy and their precision (such as 95% confidence intervals) |
General interpretation of the results |
Implications for practice, including the intended use of the index test |
Registration number and name of registry |
Distribution of abstracts included in final assessment by pathological subspecialty of study.
| Pathological subspecialty of study | No. of abstracts | Percent (%) |
|---|---|---|
| Gastrointestinal Pathology | 13 | 26 |
| Breast Pathology | 11 | 21 |
| Urological Pathology | 9 | 18 |
| Cardiothoracic pathology | 7 | 14 |
| Dermatopathology | 3 | 5.9 |
| Gynaecological pathology | 3 | 5.9 |
| Haematopathology | 3 | 5.9 |
| Nephropathology | 1 | 2.0 |
| Neuropathology | 1 | 2.0 |
Summary of dataset descriptions provided by each abstract.
| Abstract number | Summarized dataset information provided in each abstract | Abstract number | Summarized dataset information provided in each abstract |
|---|---|---|---|
| 1 | 1522 H&E images, 64 cases | 27 | 410 patients, 1136 biopsy instances |
| 2 | 805 H&E images | 28 | 443 cropped images from 580 WSIs, 129 biopsies, 129 patients |
| 3 | >1288 WSIs (exact total no. not specified) | 29 | 35 slides for training, 80 cases for testing |
| 4 | 28 patients | 30 | 1461 biopsies, 238 patients |
| 5 | 1500 cases | 31 | 252 cases, 385 slides |
| 6 | 115 H&E images | 32 | 1000 cases |
| 7 | 63 WSIs | 33 | 53 cases |
| 8 | 532 glass slides, 2162 biopsies | 34 | 266 patients |
| 9 | 54,587 pixel patches | 35 | 100 cases |
| 10 | 58 cases | 36 | 225 cases |
| 11 | Dataset numbers not given | 37 | 173 WSI |
| 12 | 19 H&E images | 38 | 417 biopsies |
| 13 | 23 patients, 23 WSIs, 23 specimens | 39 | 250 cases |
| 14 | Dataset numbers not given | 40 | 55,000 patches, 50 cases |
| 15 | 50 WSIs | 41 | 13 patients |
| 16 | 3858 patients + 867 external samples | 42 | 60 slides |
| 17 | Dataset numbers not given | 43 | 90 tumours |
| 18 | 58 cases for training, 29 WSIs for testing (exact total no. not specified) | 44 | 232 patients |
| 19 | 36 biopsies | 45 | Dataset numbers not given |
| 20 | 765 biopsy sections | 46 | Dataset numbers not given |
| 21 | 300 cases | 47 | 73 WSI |
| 22 | 1294 WSIs | 48 | 19 slides |
| 23 | 108 patients | 49 | 156 WSI |
| 24 | 24 cases, 221 H&E images | 50 | 184 images |
| 25 | Dataset numbers not given | 51 | 182,590 patches, 170 patients, 400 biopsies |
| 26 | 21 cases |
Figure 3Graph showing the type and frequency of statistical performance measure used in abstracts selected for final assessment.
Completeness of reporting of abstracts against STARD for Abstracts criteria by numbers and percentages of abstracts.
| STARD for Abstracts checklist item | No. (%) abstracts |
|---|---|
Identification as a study of diagnostic accuracy using at least one measure of accuracy (such as sensitivity, specificity, predictive values or AUC) | 51 (100) |
Study objectives | 51 (100) |
Data collection: whether this was a prospective or retrospective study | 20 (39) |
Eligibility criteria for participants and the settings where the data were collected | 9 (18) |
Whether participants formed a consecutive, random, or convenience series | 8 (16) |
Description of the index test and reference standard | 35 (69) |
Number of participants with and without the target condition included in the analysis | 28 (55) |
Estimates of diagnostic accuracy and their precision (such as 95% confidence intervals) | 8 (16) |
General interpretation of the results | 42 (82) |
Implications for practice, including the intended use of the index test | 42 (84) |
Registration number and name of registry | 0 (0) |
Figure 4Graph showing the proportion of abstracts with between 1 and 11 of the STARD for Abstracts checklist items provided.