| Literature DB >> 32908284 |
Samantha Cruz Rivera1,2,3, Xiaoxuan Liu3,4,5,6,7, An-Wen Chan8, Alastair K Denniston9,10,11,12,13,14, Melanie J Calvert1,2,3,6,15,16,17.
Abstract
The SPIRIT 2013 statement aims to improve the completeness of clinical trial protocol reporting by providing evidence-based recommendations for the minimum set of items to be addressed. This guidance has been instrumental in promoting transparent evaluation of new interventions. More recently, there has been a growing recognition that interventions involving artificial intelligence (AI) need to undergo rigorous, prospective evaluation to demonstrate their impact on health outcomes. The SPIRIT-AI (Standard Protocol Items: Recommendations for Interventional Trials-Artificial Intelligence) extension is a new reporting guideline for clinical trial protocols evaluating interventions with an AI component. It was developed in parallel with its companion statement for trial reports: CONSORT-AI (Consolidated Standards of Reporting Trials-Artificial Intelligence). Both guidelines were developed through a staged consensus process involving literature review and expert consultation to generate 26 candidate items, which were consulted upon by an international multi-stakeholder group in a two-stage Delphi survey (103 stakeholders), agreed upon in a consensus meeting (31 stakeholders) and refined through a checklist pilot (34 participants). The SPIRIT-AI extension includes 15 new items that were considered sufficiently important for clinical trial protocols of AI interventions. These new items should be routinely reported in addition to the core SPIRIT 2013 items. SPIRIT-AI recommends that investigators provide clear descriptions of the AI intervention, including instructions and skills required for use, the setting in which the AI intervention will be integrated, considerations for the handling of input and output data, the human-AI interaction and analysis of error cases. SPIRIT-AI will help promote transparency and completeness for clinical trial protocols for AI interventions. Its use will assist editors and peer reviewers, as well as the general readership, to understand, interpret and critically appraise the design and risk of bias for a planned clinical trial.Entities:
Mesh:
Year: 2020 PMID: 32908284 PMCID: PMC7598944 DOI: 10.1038/s41591-020-1037-7
Source DB: PubMed Journal: Nat Med ISSN: 1078-8956 Impact factor: 53.440
SPIRIT-AI checklist
| Section | Item | SPIRIT 2013 itema | SPIRIT-AI item | Addressed on page numberb | ||
|---|---|---|---|---|---|---|
| Administrative information | ||||||
| 1 | Descriptive title identifying the study design, population, interventions, and, if applicable, trial acronym | SPIRIT-AI 1 (i) Elaboration | Indicate that the intervention involves artificial intelligence/machine learning and specify the type of model. | |||
| SPIRIT-AI 1 (ii) Elaboration | Specify the intended use of the AI intervention. | |||||
| 2a | Trial identifier and registry name. If not yet registered, name of intended registry | |||||
| 2b | All items from the World Health Organization Trial Registration Dataset | |||||
| 3 | Date and version identifier | |||||
| 4 | Sources and types of financial, material, and other support | |||||
| 5a | Names, affiliations, and roles of protocol contributors | |||||
| 5b | Name and contact information for the trial sponsor | |||||
| 5c | Role of study sponsor and funders, if any, in study design; collection, management, analysis, and interpretation of data; writing of the report; and the decision to submit the report for publication, including whether they will have ultimate authority over any of these activities | |||||
| 5d | Composition, roles, and responsibilities of the coordinating center, steering committee, endpoint adjudication committee, data management team, and other individuals or groups overseeing the trial, if applicable (see Item 21a for data monitoring committee) | |||||
| Introduction | ||||||
| 6a | Description of research question and justification for undertaking the trial, including summary of relevant studies (published and unpublished) examining benefits and harms for each intervention | SPIRIT-AI 6a (i) Extension | Explain the intended use of the AI intervention in the context of the clinical pathway, including its purpose and its intended users (for example, healthcare professionals, patients, public). | |||
| SPIRIT-AI 6a (ii) Extension | Describe any pre-existing evidence for the AI intervention. | |||||
| 6b | Explanation for choice of comparators | |||||
| 7 | Specific objectives or hypotheses | |||||
| 8 | Description of trial design including type of trial (for example, parallel group, crossover, factorial, single group), allocation ratio, and framework (for example, superiority, equivalence, noninferiority, exploratory) | |||||
| Methods: participants, interventions and outcomes | ||||||
| 9 | Description of study settings (for example, community clinic, academic hospital) and list of countries where data will be collected. Reference to where list of study sites can be obtained | SPIRIT-AI 9 Extension | Describe the onsite and offsite requirements needed to integrate the AI intervention into the trial setting. | |||
| 10 | Inclusion and exclusion criteria for participants. If applicable, eligibility criteria for study centers and individuals who will perform the interventions (for example, surgeons, psychotherapists) | SPIRIT-AI 10 (i) Elaboration | State the inclusion and exclusion criteria at the level of participants. | |||
| SPIRIT-AI 10 (ii) Extension | State the inclusion and exclusion criteria at the level of the input data. | |||||
| 11a | Interventions for each group with sufficient detail to allow replication, including how and when they will be administered | SPIRIT-AI 11a (i) Extension | State which version of the AI algorithm will be used. | |||
| SPIRIT-AI 11a (ii) Extension | Specify the procedure for acquiring and selecting the input data for the AI intervention. | |||||
| SPIRIT-AI 11a (iii) Extension | Specify the procedure for assessing and handling poor-quality or unavailable input data. | |||||
| SPIRIT-AI 11a (iv) Extension | Specify whether there is human–AI interaction in the handling of the input data, and what level of expertise is required for users. | |||||
| SPIRIT-AI 11a (v) Extension | Specify the output of the AI intervention. | |||||
| SPIRIT-AI 11a (vi) Extension | Explain the procedure for how the AI intervention’s output will contribute to decision-making or other elements of clinical practice. | |||||
| 11b | Criteria for discontinuing or modifying allocated interventions for a given trial participant (for example, drug dose change in response to harms, participant request, or improving/worsening disease) | |||||
| 11c | Strategies to improve adherence to intervention protocols, and any procedures for monitoring adherence (for example, drug tablet return, laboratory tests) | |||||
| 11d | Relevant concomitant care and interventions that are permitted or prohibited during the trial | |||||
| 12 | Primary, secondary, and other outcomes, including the specific measurement variable (for example, systolic blood pressure), analysis metric (for example, change from baseline, final value, time to event), method of aggregation (for example, median, proportion), and time point for each outcome. Explanation of the clinical relevance of chosen efficacy and harm outcomes is strongly recommended | |||||
| 13 | Time schedule of enrollment, interventions (including any run-ins and washouts), assessments, and visits for participants. A schematic diagram is highly recommended (Fig. | |||||
| 14 | Estimated number of participants needed to achieve study objectives and how it was determined, including clinical and statistical assumptions supporting any sample size calculations | |||||
| 15 | Strategies for achieving adequate participant enrollment to reach target sample size | |||||
| Methods: assignment of interventions (for controlled trials) | ||||||
| 16a | Method of generating the allocation sequence (for example, computer-generated random numbers), and list of any factors for stratification. To reduce predictability of a random sequence, details of any planned restriction (for example, blocking) should be provided in a separate document that is unavailable to those who enroll participants or assign interventions | |||||
| 16b | Mechanism of implementing the allocation sequence (for example, central telephone; sequentially numbered, opaque, sealed envelopes), describing any steps to conceal the sequence until interventions are assigned | |||||
| 16c | Who will generate the allocation sequence, who will enroll participants, and who will assign participants to interventions | |||||
| 17a | Who will be blinded after assignment to interventions (for example, trial participants, care providers, outcome assessors, data analysts), and how | |||||
| 17b | If blinded, circumstances under which unblinding is permissible, and procedure for revealing a participant’s allocated intervention during the trial | |||||
| Methods: data collection, management and analysis | ||||||
| 18a | Plans for assessment and collection of outcome, baseline, and other trial data, including any related processes to promote data quality (for example, duplicate measurements, training of assessors) and a description of study instruments (for example, questionnaires, laboratory tests) along with their reliability and validity, if known. Reference to where data collection forms can be found, if not in the protocol | |||||
| 18b | Plans to promote participant retention and complete follow-up, including list of any outcome data to be collected for participants who discontinue or deviate from intervention protocols | |||||
| 19 | Plans for data entry, coding, security, and storage, including any related processes to promote data quality (for example, double data entry; range checks for data values). Reference to where details of data management procedures can be found, if not in the protocol | |||||
| 20a | Statistical methods for analyzing primary and secondary outcomes. Reference to where other details of the statistical analysis plan can be found, if not in the protocol | |||||
| 20b | Methods for any additional analyses (for example, subgroup and adjusted analyses) | |||||
| 20c | Definition of analysis population relating to protocol non-adherence (for example, as randomized analysis), and any statistical methods to handle missing data (for example, multiple imputation) | |||||
| Methods: monitoring | ||||||
| 21a | Composition of data monitoring committee (DMC); summary of its role and reporting structure; statement of whether it is independent from the sponsor and competing interests; and reference to where further details about its charter can be found, if not in the protocol. Alternatively, an explanation of why a DMC is not needed | |||||
| 21b | Description of any interim analyses and stopping guidelines, including who will have access to these interim results and make the final decision to terminate the trial | |||||
| 22 | Plans for collecting, assessing, reporting, and managing solicited and spontaneously reported adverse events and other unintended effects of trial interventions or trial conduct | SPIRIT-AI 22 Extension | Specify any plans to identify and analyze performance errors. If there are no plans for this, justify why not. | |||
| 23 | Frequency and procedures for auditing trial conduct, if any, and whether the process will be independent from investigators and the sponsor | |||||
| Ethics and dissemination | ||||||
| 24 | Plans for seeking research ethics committee/institutional review board (REC/IRB) approval | |||||
| 25 | Plans for communicating important protocol modifications (for example, changes to eligibility criteria, outcomes, analyses) to relevant parties (for example, investigators, REC/IRBs, trial participants, trial registries, journals, regulators) | |||||
| 26a | Who will obtain informed consent or assent from potential trial participants or authorized surrogates, and how (see Item 32) | |||||
| 26b | Additional consent provisions for collection and use of participant data and biological specimens in ancillary studies, if applicable | |||||
| 27 | How personal information about potential and enrolled participants will be collected, shared, and maintained in order to protect confidentiality before, during, and after the trial | |||||
| 28 | Financial and other competing interests for principal investigators for the overall trial and each study site | |||||
| 29 | Statement of who will have access to the final trial dataset, and disclosure of contractual agreements that limit such access for investigators | SPIRIT-AI 29 Extension | State whether and how the AI intervention and/or its code can be accessed, including any restrictions to access or re-use. | |||
| 30 | Provisions, if any, for ancillary and post-trial care, and for compensation to those who suffer harm from trial participation | |||||
| 31a | Plans for investigators and sponsor to communicate trial results to participants, healthcare professionals, the public, and other relevant groups (for example, via publication, reporting in results databases, or other data sharing arrangements), including any publication restrictions | |||||
| 31b | Authorship eligibility guidelines and any intended use of professional writers | |||||
| 31c | Plans, if any, for granting public access to the full protocol, participant-level dataset, and statistical code | |||||
| Appendices | ||||||
| 32 | Model consent form and other related documentation given to participants and authorized surrogates | |||||
| 33 | Plans for collection, laboratory evaluation, and storage of biological specimens for genetic or molecular analysis in the current trial and for future use in ancillary studies, if applicable | |||||
aIt is strongly recommended that this checklist be read in conjunction with the SPIRIT 2013 Explanation & Elaboration for important clarification on the items.
bIndicates page numbers to be completed by authors during protocol development.
Fig. 1CONSORT 2010 flow diagram — adapted for AI clinical trials.
SPIRIT-AI 10 (i): State the inclusion and exclusion criteria at the level of participants. SPIRIT-AI 10 (ii): State the inclusion and exclusion criteria at the level of the input data. SPIRIT 13 (core CONSORT item): Time schedule of enrollment, interventions (including any run-ins and washouts), assessments, and visits for participants. A schematic diagram is highly recommended.
| The science of developing computer systems that can perform tasks normally requiring human intelligence. | |
| A health intervention that relies upon an AI/ML component to serve its purpose. | |
| Consolidated Standards of Reporting Trials. | |
| An additional checklist item to address AI-specific content that is not adequately covered by CONSORT 2010. | |
| Class-activation maps are particularly relevant to image classification AI interventions. Class-activation maps are visualizations of the pixels that had the greatest influence on predicted class, by displaying the gradient of the predicted outcome from the model with respect to the input. They are also referred to as ‘saliency maps’ or ‘heat maps’. | |
| Measured variables in the trial that are used to assess the effects of an intervention. | |
| The process of how users (humans) interact with the AI intervention, for the AI intervention to function as intended. | |
| Measured variables in the trial that are used to assess the effects of an intervention. | |
| A research method that derives the collective opinions of a group through a staged consultation of surveys, questionnaires, or interviews, with an aim to reach consensus at the end. | |
| The clinical and operational settings from which the data used for training the model is generated. This includes all aspects of the physical setting (such as geographical location, physical environment), operational setting (such as integration with an electronic record system, installation on a physical device) and clinical setting (such as primary, secondary and/or tertiary care, patient disease spectrum). | |
| Modifications or additional training performed on the AI intervention model, done with the intention of improving its performance. | |
| The data that need to be presented to the AI intervention to allow it to serve its purpose. | |
| A field of computer science concerned with the development of models/algorithms which can solve specific tasks by learning patterns from data, rather than by following explicit rules. It is seen as an approach within the field of AI. | |
| The environment in which the AI intervention will be deployed, including the infrastructure required to enable the AI intervention to function. | |
| The predicted outcome given by the AI intervention based on modeling of the input data. The output data can be presented in different forms, including a classification (including diagnosis, disease severity or stage, or recommendation such as referability), a probability, a class activation map, etc. The output data typically provide additional clinical information and/or trigger a clinical decision. | |
| Instances in which the AI intervention fails to perform as expected. This term can describe different types of failures, and it is up to the investigator to specify what should be considered a performance error, preferably based on prior evidence. This can range from small decreases in accuracy (compared to expected accuracy) to erroneous predictions or the inability to produce an output, in certain cases. | |
| Standard Protocol Items: Recommendations for Interventional Trials. | |
| An additional checklist item to address AI-specific content that is not adequately covered by SPIRIT 2013. | |
| Additional considerations to an existing SPIRIT 2013 item when applied to AI interventions. |