| Literature DB >> 35156003 |
Arnaud Serret-Larmande1,2, Jonathan R Kaltman3, Paul Avillach1,2.
Abstract
Reproducibility in medical research has been a long-standing issue. More recently, the COVID-19 pandemic has publicly underlined this fact as the retraction of several studies reached out to general media audiences. A significant number of these retractions occurred after in-depth scrutiny of the methodology and results by the scientific community. Consequently, these retractions have undermined confidence in the peer-review process, which is not considered sufficiently reliable to generate trust in the published results. This partly stems from opacity in published results, the practical implementation of the statistical analysis often remaining undisclosed. We present a workflow that uses a combination of informatics tools to foster statistical reproducibility: an open-source programming language, Jupyter Notebook, cloud-based data repository, and an application programming interface can streamline an analysis and help to kick-start new analyses. We illustrate this principle by (1) reproducing the results of the ORCHID clinical trial, which evaluated the efficacy of hydroxychloroquine in COVID-19 patients, and (2) expanding on the analyses conducted in the original trial by investigating the association of premedication with biological laboratory results. Such workflows will be encouraged for future publications from National Heart, Lung, and Blood Institute-funded studies.Entities:
Keywords: FAIR principles; clinical trial; statistical reproducibility
Year: 2022 PMID: 35156003 PMCID: PMC8826998 DOI: 10.1093/jamiaopen/ooac001
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.Primary outcome of the ORCHID Clinical Trial. The top panel represents the bar plot as published in the original article. The bottom panel is the figure as displayed in the notebook reproducing the analysis.
Figure 2.Elements composing the reproducible workflow.
Sensitivity analysis: COVID-19 Outcomes Scale at randomization, day 14, and day 28, according to premedication by azithromycin (prescription of azithromycin before inclusion in the trial)
| At randomization | 14 d after randomization | 28 d after randomization | ||||
|---|---|---|---|---|---|---|
| COVID-19 Outcomes Scale | Yes ( | No ( | Yes ( | No ( | Yes ( | No ( |
| (1) Death | 0 (0%) | 0 (0%) | 10 (6.7%) | 22 (6.7%) | 17 (11.3%) | 33 (10%) |
| (2) Invasive mechanical ventilation or extracorporeal membrane oxygenation | 16 (10.7%) | 16 (4.9%) | 20 (13.3%) | 22 (6.7%) | 12 (8%) | 11 (3.3%) |
| (3) Noninvasive ventilation or high flow nasal cannula | 22 (14.7%) | 33 (10%) | 3 (2%) | 9 (2.7%) | 0 (0%) | 0 (0%) |
| (4) Hospitalized with oxygen | 69 (46%) | 155 (47.1%) | 11 (7.3%) | 29 (8.8%) | 4 (2.7%) | 15 (4.6%) |
| (5) Hospitalized without oxygen | 43 (28.7%) | 125 (38%) | 8 (5.3%) | 29 (8.8%) | 3 (2%) | 8 (2.4%) |
| (6) Discharged, limitation in activity | 0 (0%) | 0 (0%) | 63 (42%) | 106 (32.2%) | 50 (33.3%) | 97 (29.5%) |
| (7) Discharged, no limitation in activity | 0 (0%) | 0 (0%) | 35 (23.3%) | 112 (34%) | 64 (42.7%) | 165 (50.2%) |
Figure 3.Supplemental analysis based on ORCHID clinical trial data: laboratory test trajectories according to premedication by azithromycin.
Details of the “FAIR Guiding Principles for scientific data management and stewardship”
| FAIR principles | Details |
|---|---|
| Findable |
F1: (Meta)data are assigned a globally unique and persistent identifier F2: Data are described with rich metadata (defined by R1 below) F3: Metadata clearly and explicitly include the identifier of the data they describe F4: (Meta)data are registered or indexed in a searchable resource |
| Accessible |
A1: (Meta)data are retrievable by their identifier using a standardized communications protocol A1.1: The protocol is open, free, and universally implementable A1.2: The protocol allows for an authentication and authorization procedure, where necessary A2: Metadata are accessible, even when the data are no longer available |
| Interoperable |
I1: (Meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation I2: (Meta)data use vocabularies that follow FAIR principles I3: (Meta)data include qualified references to other (meta)data |
| Reusable |
R1: (Meta)data are richly described with a plurality of accurate and relevant attributes R1.1: (Meta)data are released with a clear and accessible data usage license R1.2: (Meta)data are associated with detailed provenance R1.3: (Meta)data meet domain-relevant community standards |