| Literature DB >> 35584845 |
Baptiste Vasey1,2,3, Myura Nagendran4, Bruce Campbell5,6, David A Clifton2, Gary S Collins7, Spiros Denaxas8,9,10,11, Alastair K Denniston12,13,14, Livia Faes14, Bart Geerts15, Mudathir Ibrahim16,17, Xiaoxuan Liu12,13, Bilal A Mateen8,18,19, Piyush Mathur20, Melissa D McCradden21,22, Lauren Morgan23, Johan Ordish24, Campbell Rogers25, Suchi Saria26,27, Daniel S W Ting28,29, Peter Watkinson3,30, Wim Weber31, Peter Wheatstone32, Peter McCulloch16.
Abstract
Entities:
Mesh:
Year: 2022 PMID: 35584845 PMCID: PMC9116198 DOI: 10.1136/bmj-2022-070904
Source DB: PubMed Journal: BMJ ISSN: 0959-8138
Fig 1Comparison of development pathways for drug therapies, artificial intelligence (AI) in healthcare, and surgical innovation. The coloured lines represent reporting guidelines, some of which are study design specific (TRIPOD-AI, STARD-AI, SPIRIT/CONSORT, SPIRIT/CONSORT-AI), others stage specific (DECIDE-AI, IDEAL). Depending on the context, more than one study design can be appropriate for each stage. *Only apply to AI in healthcare
Overview of existing and upcoming artificial intelligence (AI) reporting guidelines
| Name | Stage | Study design | Comment |
|---|---|---|---|
|
| Preclinical development | Prediction model evaluation* | Extension of TRIPOD. Used to report prediction models (diagnostic or prognostic) development, validation and updates. Focuses on model performance |
|
| Preclinical development, offline validation | Diagnostic accuracy studies* | Extension of STARD. Used to report diagnostic accuracy studies, either at development stage or as an offline validation in clinical settings. Focuses on diagnostic accuracy |
|
| Early live clinical evaluation* | Various (prospective cohort studies, non-randomised controlled trials, . . .)† with additional features such as modification of intervention, analysis of prespecified subgroups, or learning curve analysis | Stand alone guideline. Used to report the early evaluation of AI systems as an intervention in live clinical settings (small scale, formative evaluation), independently of the study design and AI system modality (diagnostic, prognostic, therapeutic). Focuses on clinical utility, safety, and human factors |
|
| Comparative prospective evaluation | Randomised controlled trials (protocol)* | Extension of SPIRIT. Used to report the protocols of randomised controlled trials evaluating AI systems as interventions |
|
| Comparative prospective evaluation | Randomised controlled trials* | Extension of CONSORT. Used to report randomised controlled trials evaluating AI systems as interventions (large scale, summative evaluation), independently of the AI system modality (diagnostic, prognostic, therapeutic). Focuses on effectiveness and safety |
Primary target of the guidelines, either a specific stage or a specific study design.
Although existing reporting guidelines exist for some of these study designs (eg, STROBE for cohort studies), none of them cover all the core aspects of AI system early stage evaluation and none would fit all possible study designs; DECIDE-AI was therefore developed as a new stand alone reporting guideline for these studies.
DECIDE-AI checklist
| Item No | Theme | Recommendation | Reported on page |
|---|---|---|---|
|
|
|
| |
| I-X | Generic reporting items | ||
|
| |||
|
|
|
|
|
| I | Abstract | Provide a structured summary of the study. Consider including: intended use of the AI system, type of underlying algorithm, study setting, number of patients and users included, primary and secondary outcomes, key safety endpoints, human factors evaluated, main results, conclusions | |
|
| |||
|
|
|
|
|
|
|
| ||
| II | Objectives | State the study objectives | |
|
| |||
| III | Research governance | Provide a reference to any study protocol, study registration number, and ethics approval | |
|
|
|
| |
|
| |||
|
| |||
|
|
|
| |
|
| |||
|
| |||
|
|
|
| |
|
| |||
| IV | Outcomes | Specify the primary and secondary outcomes measured | |
|
|
|
|
|
|
|
| ||
|
|
|
|
|
| V | Analysis | Describe the statistical methods by which the primary and secondary outcomes were analysed, as well as any prespecified additional analyses, including subgroup analyses and their rationale | |
|
|
|
|
|
| VI | Patient involvement | State how patients were involved in any aspect of: the development of the research question, the study design, and the conduct of the study | |
|
| |||
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
| ||
| VII | Main results | Report on the prespecified outcomes, including outcomes for any comparison group if applicable | |
| VIII | Subgroups analysis | Report on the differences in the main outcomes according to the prespecified subgroups | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
| ||
|
| |||
|
|
|
|
|
|
|
|
|
|
| IX | Strengths and limitations | Discuss the strengths and limitations of the study | |
|
| |||
|
|
|
|
|
| X | Conflicts of interest | Disclose any relevant conflicts of interest, including the source of funding for the study, the role of funders, any other roles played by commercial companies, and personal conflicts of interest for each author | |
AI=artificial intelligence.
AI specific items are numbered in Arab numerals, generic items in Roman numerals.