| Literature DB >> 32908283 |
Xiaoxuan Liu1,2,3,4,5, Samantha Cruz Rivera5,6,7, David Moher8,9, Melanie J Calvert4,5,6,7,10,11,12, Alastair K Denniston13,14,15,16,17,18.
Abstract
The CONSORT 2010 statement provides minimum guidelines for reporting randomized trials. Its widespread use has been instrumental in ensuring transparency in the evaluation of new interventions. More recently, there has been a growing recognition that interventions involving artificial intelligence (AI) need to undergo rigorous, prospective evaluation to demonstrate impact on health outcomes. The CONSORT-AI (Consolidated Standards of Reporting Trials-Artificial Intelligence) extension is a new reporting guideline for clinical trials evaluating interventions with an AI component. It was developed in parallel with its companion statement for clinical trial protocols: SPIRIT-AI (Standard Protocol Items: Recommendations for Interventional Trials-Artificial Intelligence). Both guidelines were developed through a staged consensus process involving literature review and expert consultation to generate 29 candidate items, which were assessed by an international multi-stakeholder group in a two-stage Delphi survey (103 stakeholders), agreed upon in a two-day consensus meeting (31 stakeholders) and refined through a checklist pilot (34 participants). The CONSORT-AI extension includes 14 new items that were considered sufficiently important for AI interventions that they should be routinely reported in addition to the core CONSORT 2010 items. CONSORT-AI recommends that investigators provide clear descriptions of the AI intervention, including instructions and skills required for use, the setting in which the AI intervention is integrated, the handling of inputs and outputs of the AI intervention, the human-AI interaction and provision of an analysis of error cases. CONSORT-AI will help promote transparency and completeness in reporting clinical trials for AI interventions. It will assist editors and peer reviewers, as well as the general readership, to understand, interpret and critically appraise the quality of clinical trial design and risk of bias in the reported outcomes.Entities:
Mesh:
Year: 2020 PMID: 32908283 PMCID: PMC7598943 DOI: 10.1038/s41591-020-1034-x
Source DB: PubMed Journal: Nat Med ISSN: 1078-8956 Impact factor: 87.241
CONSORT-AI checklist
| Section | CONSORT 2010 itema | CONSORT-AI item | Addressed on page numberb | ||
|---|---|---|---|---|---|
| Title and abstract | |||||
| 1a | Identification as a randomized trial in the title | CONSORT-AI 1a,b Elaboration | (i) Indicate that the intervention involves artificial intelligence/machine learning in the title and/or abstract and specify the type of model. | ||
| 1b | Structured summary of trial design, methods, results, and conclusions (for specific guidance see CONSORT for abstracts) | (ii) State the intended use of the AI intervention within the trial in the title and/or abstract. | |||
| Introduction | |||||
| 2a | Scientific background and explanation of rationale | CONSORT-AI 2a (i) Extension | Explain the intended use of the AI intervention in the context of the clinical pathway, including its purpose and its intended users (for example, healthcare professionals, patients, public). | ||
| 2b | Specific objectives or hypotheses | ||||
| Methods | |||||
| 3a | Description of trial design (such as parallel, factorial) including allocation ratio | ||||
| 3b | Important changes to methods after trial commencement (such as eligibility criteria), with reasons | ||||
| 4a | Eligibility criteria for participants | CONSORT-AI 4a (i) Elaboration | State the inclusion and exclusion criteria at the level of participants. | ||
| CONSORT-AI 4a (ii) Extension | State the inclusion and exclusion criteria at the level of the input data. | ||||
| 4b | Settings and locations where the data were collected | CONSORT-AI 4b Extension | Describe how the AI intervention was integrated into the trial setting, including any onsite or offsite requirements. | ||
| 5 | The interventions for each group with sufficient details to allow replication, including how and when they were actually administered | CONSORT-AI 5 (i) Extension | State which version of the AI algorithm was used. | ||
| CONSORT-AI 5 (ii) Extension | Describe how the input data were acquired and selected for the AI intervention. | ||||
| CONSORT-AI 5 (iii) Extension | Describe how poor quality or unavailable input data were assessed and handled. | ||||
| CONSORT-AI 5 (iv) Extension | Specify whether there was human–AI interaction in the handling of the input data, and what level of expertise was required of users. | ||||
| CONSORT-AI 5 (v) Extension | Specify the output of the AI intervention | ||||
| CONSORT-AI 5 (vi) Extension | Explain how the AI intervention’s outputs contributed to decision-making or other elements of clinical practice. | ||||
| 6a | Completely defined pre-specified primary and secondary outcome measures, including how and when they were assessed | ||||
| 6b | Any changes to trial outcomes after the trial commenced, with reasons | ||||
| 7a | How sample size was determined | ||||
| 7b | When applicable, explanation of any interim analyses and stopping guidelines | ||||
| Randomization | |||||
| 8a | Method used to generate the random allocation sequence | ||||
| 8b | Type of randomization; details of any restriction (such as blocking and block size) | ||||
| 9 | Mechanism used to implement the random allocation sequence (such as sequentially numbered containers), describing any steps taken to conceal the sequence until interventions were assigned | ||||
| 10 | Who generated the random allocation sequence, who enrolled participants, and who assigned participants to interventions | ||||
| 11a | If done, who was blinded after assignment to interventions (for example, participants, care providers, those assessing outcomes) and how | ||||
| 11b | If relevant, description of the similarity of interventions | ||||
| 12a | Statistical methods used to compare groups for primary and secondary outcomes | ||||
| 12b | Methods for additional analyses, such as subgroup analyses and adjusted analyses | ||||
| Results | |||||
| 13a | For each group, the numbers of participants who were randomly assigned, received intended treatment, and were analyzed for the primary outcome | ||||
| 13b | For each group, losses and exclusions after randomization, together with reasons | ||||
| 14a | Dates defining the periods of recruitment and follow-up | ||||
| 14b | Why the trial ended or was stopped | ||||
| 15 | A table showing baseline demographic and clinical characteristics for each group | ||||
| 16 | For each group, number of participants (denominator) included in each analysis and whether the analysis was by original assigned groups | ||||
| 17a | For each primary and secondary outcome, results for each group, and the estimated effect size and its precision (such as 95% confidence interval) | ||||
| 17b | For binary outcomes, presentation of both absolute and relative effect sizes is recommended | ||||
| 18 | Results of any other analyses performed, including subgroup analyses and adjusted analyses, distinguishing pre-specified from exploratory | ||||
| 19 | All important harms or unintended effects in each group (for specific guidance see CONSORT for harms) | CONSORT-AI 19 Extension | Describe results of any analysis of performance errors and how errors were identified, where applicable. If no such analysis was planned or done, justify why not. | ||
| Discussion | |||||
| 20 | Trial limitations, addressing sources of potential bias, imprecision, and, if relevant, multiplicity of analyses | ||||
| 21 | Generalizability (external validity, applicability) of the trial findings | ||||
| 22 | Interpretation consistent with results, balancing benefits and harms, and considering other relevant evidence | ||||
| Other Information | |||||
| 23 | Registration number and name of trial registry | ||||
| 24 | Where the full trial protocol can be accessed, if available | ||||
| 25 | Sources of funding and other support (such as supply of drugs), role of funders | CONSORT-AI 25 Extension | State whether and how the AI intervention and/or its code can be accessed, including any restrictions to access or re-use. | ||
aWe strongly recommend reading this statement in conjunction with the CONSORT 2010 Explanation and Elaboration for important clarifications on all the items. bIndicates page numbers to be completed by authors during protocol development.
Fig. 1CONSORT 2010 flow diagram — adapted for AI clinical trials.
CONSORT-AI 4a (i): State the inclusion and exclusion criteria at the level of participants. CONSORT-AI 4a (ii): State the inclusion and exclusion criteria at the level of the input data. CONSORT 13b (core CONSORT item): For each group, losses and exclusions after randomization, together with reasons.
| The science of developing computer systems which can perform tasks normally requiring human intelligence. | |
| A health intervention that relies upon an AI/ML component to serve its purpose. | |
| Consolidated Standards of Reporting Trials. | |
| An additional checklist item to address AI-specific content that is not adequately covered by CONSORT 2010. | |
| Class-activation maps are particularly relevant to image classification AI interventions. Class-activation maps are visualizations of the pixels that had the greatest influence on predicted class, by displaying the gradient of the predicted outcome from the model with respect to the input. They are also referred to as ‘saliency maps’ or ‘heat maps’. | |
| Measured variables in the trial that are used to assess the effects of an intervention. | |
| The process of how users (humans) interact with the AI intervention, for the AI intervention to function as intended. | |
| Measured variables in the trial which are used to assess the effects of an intervention. | |
| A research method that derives the collective opinions of a group through a staged consultation of surveys, questionnaires, or interviews, with an aim to reach consensus at the end. | |
| The clinical and operational settings from which the data used for training the model is generated. This includes all aspects of the physical setting (such as geographical location, physical environment), operational setting (such as integration with an electronic record system, installation on a physical device) and clinical setting (such as primary, secondary and/or tertiary care, patient disease spectrum). | |
| Modifications or additional training performed on the AI intervention model, done with the intention of improving its performance. | |
| The data that need to be presented to the AI intervention to allow it to serve its purpose. | |
| A field of computer science concerned with the development of models/algorithms that can solve specific tasks by learning patterns from data, rather than by following explicit rules. It is seen as an approach within the field of AI. | |
| The environment in which the AI intervention will be deployed, including the infrastructure required to enable the AI intervention to function. | |
| The predicted outcome given by the AI intervention based on modeling of the input data. The output data can be presented in different forms, including a classification (including diagnosis, disease severity or stage, or recommendation such as referability), a probability, a class activation map, etc. The output data typically provide additional clinical information and/or trigger a clinical decision. | |
| Instances in which the AI intervention fails to perform as expected. This term can describe different types of failures, and it is up to the investigator to specify what should be considered a performance error, preferably based on prior evidence. This can range from small decreases in accuracy (compared to expected accuracy) to erroneous predictions or the inability to produce an output, in certain cases. | |
| Standard Protocol Items: Recommendations for Interventional Trials. | |
| An additional checklist item to address AI-specific content that is not adequately covered by SPIRIT 2013. | |
| Additional considerations to an existing SPIRIT 2013 item when applied to AI interventions. |