Streamlining statistical reproducibility: NHLBI ORCHID clinical trial results reproduction.

Arnaud Serret-Larmande^1,2, Jonathan R Kaltman^3, Paul Avillach^1,2.

Abstract

Reproducibility in medical research has been a long-standing issue. More recently, the COVID-19 pandemic has publicly underlined this fact, as the retraction of several studies reached general media audiences. A significant number of these retractions occurred after in-depth scrutiny of the methodology and results by the scientific community. Consequently, these retractions have undermined confidence in the peer-review process, which is not considered sufficiently reliable to generate trust in the published results. This partly stems from opacity in published results, the practical implementation of the statistical analysis often remaining undisclosed. We present a workflow that combines informatics tools to foster statistical reproducibility: an open-source programming language, a Jupyter Notebook, a cloud-based data repository, and an application programming interface together can streamline an analysis and help to kick-start new analyses. We illustrate this principle by (1) reproducing the results of the ORCHID clinical trial, which evaluated the efficacy of hydroxychloroquine in COVID-19 patients, and (2) expanding on the analyses conducted in the original trial by investigating the association of premedication with biological laboratory results. Such workflows will be encouraged for future publications from National Heart, Lung, and Blood Institute-funded studies.

Entities: Chemical

Keywords: FAIR principles; clinical trial; statistical reproducibility

Year: 2022 PMID: 35156003 PMCID: PMC8826998 DOI: 10.1093/jamiaopen/ooac001

Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531

As of February 7, 2021, the website retractionwatch.com referenced 69 retracted COVID-19-related papers. This recent spate has highlighted an increasing mistrust in biomedical research overall, among both scientists and general audiences. Peer review alone is insufficient to validate reported results, a realization heightened by the fact that some retractions occurred in the most prestigious medical journals. This mistrust is a further reflection of the ongoing reproducibility crisis in biomedical research. Therefore, as a proof of concept, we present how we leveraged a biomedical informatics platform (BioData Catalyst powered by PIC-SURE) that we developed for the National Heart, Lung, and Blood Institute (NHLBI) to reproduce the results of a recently published clinical trial evaluating the efficacy of hydroxychloroquine in patients hospitalized for COVID-19.

The numerous publications on the reproducibility crisis have sounded an alarm, leading to the emergence of multiple guidelines to tackle the problem, including the use of large sample sizes, multiple-comparison accounting, preregistration of research hypotheses, and standardization of reporting guidelines. From a bioinformatics perspective, a strong emphasis has been placed on sharing data and statistical analysis source code to enable the research community to internally validate the results. Despite the encouragement of source code sharing, the most influential medical journals have not mandated the practice. A second enabler of reproducibility is making experimental data accessible. Sharing medical data is critical to scientific knowledge dissemination; however, it conflicts with individual privacy concerns. Given these two constraints, others have proposed openly shared anonymized datasets and data sharing on request to the investigators. However, these solutions are suboptimal; a better answer lies in dedicated health data repositories offering centralized, controlled access to sensitive data.
Examples of such repositories include the NCTN/NCORP Data Archive, which hosts datasets from clinical trials of the National Clinical Trials Network (NCTN) and the NCI Community Oncology Research Program (NCORP), and the UK Data Service. However, the use of such solutions is not yet widespread because sharing data conflicts not only with individuals' privacy but also with investigators' direct interests. Building a cohort involves a significant investment of time and money; thus, investigators are reluctant to share it. As a consequence, data sharing through standardized means remains relatively limited, despite substantial funding. A recent study examined the proportion of clinical trials that share their data after publication. Of 487 clinical trials published in three of the most influential medical journals (NEJM, Lancet, and JAMA), only 17 (3.5%) shared data through repositories, even though 89 had pledged to do so during the publication process. Clearly, there is a pressing need for broader data sharing, amplified by the recent questioning of the internal validity of some COVID-19 studies.

In response to this concern, we present a workflow by which we reproduced the NHLBI-funded ORCHID clinical trial. ORCHID was a multicenter, randomized controlled clinical trial comparing hydroxychloroquine against placebo, which took place between April and July 2020 in the United States. The primary outcome was patients' clinical status assessed at 14 days after inclusion. The analysis reproduction process took advantage of NHLBI BioData Catalyst. This ecosystem was created to ease the reuse of data from NHLBI and other NIH-funded studies, providing a set of tools for efficient data exploration, analysis, and reporting.
Data are managed in a secure cloud environment and can be explored using graphical user interfaces or retrieved through an application programming interface (API), in this case the PIC-SURE API, which enforces high standards in cybersecurity and mediates user-specific data access authorization. The API is accessible via clients in two open-source programming languages, R and Python. The NHLBI encourages investigators to contribute to the BioData Catalyst ecosystem by making their source code available at the time of publication, preferably in an investigator-friendly format such as a Jupyter Notebook. This format combines plain text, source code, and outputs in a single file; it is widely used in the data science community and has already been described as a valuable tool to make analyses reproducible.

We accessed the data of the ORCHID clinical trial using the BioData Catalyst powered by PIC-SURE API and reproduced the entire analysis using the R programming language (R 4.0.3). This reproduction is based on the original investigators' source code, consisting of several SAS analysis source files, and the statistical analysis plan. From data retrieval to plotted results, the entire process is displayed in the form of a Jupyter Notebook, freely available on GitHub. All the published results were identically reproduced (Figure 1), except for one safety outcome (lymphopenia was mistakenly reported instead of cytopenia). The authors acknowledged the error, noting that it was caused by a typographical error in the data management source code. Other than this mistake, our results and interpretation are identical to those published in the original article: hydroxychloroquine did not demonstrate superior performance to placebo in hospitalized patients with COVID-19.
The reproduction of the results of an RCT based on its published protocol, although seemingly straightforward, is far from guaranteed: a systematic review found that discrepancies or selective reporting are common between the published analysis of an RCT and what was initially planned in the protocol.

Figure 1.

Primary outcome of the ORCHID Clinical Trial. The top panel represents the bar plot as published in the original article. The bottom panel is the figure as displayed in the notebook reproducing the analysis.

Leveraging preexisting and newly developed tools, our work showcased how the combination of simple standard tools (open-source programming language, notebook, data repository, and API) can streamline the reproducibility process (Figure 2). Moreover, by using the cloud computing environments provided by BioData Catalyst, any investigator registered in the ecosystem and authorized to access this dataset in the database of Genotypes and Phenotypes (dbGaP) can execute the notebook in one click, without the need to download the data. Finally, because the heavy lifting of data management has already been done, the entry cost is lower for investigators, who can reuse the data and code right away and kick-start new analyses.

Figure 2.

Elements composing the reproducible workflow.

We illustrate this idea by carrying out sensitivity analyses that extend the work of the original investigators. We studied the association between outcomes and premedication with azithromycin (that is, azithromycin received before entering the clinical trial). This off-label drug has been commonly used as a COVID-19 treatment, especially at the beginning of the pandemic (150 of the 479 patients in the ORCHID clinical trial population received it before inclusion). Our analysis showed that baseline clinical status as well as outcomes were worse in the population of patients who received azithromycin, possibly reflecting an incentive to use off-label drugs in more severe cases (Table 1). We also studied the differences in laboratory value trajectories during the first 5 days of the clinical trial, according to whether patients had received azithromycin. Among the laboratory values studied, systematic differences can be observed in AST, ALP, ALT, and troponin concentrations (Figure 3). More extreme values are present in the group that did not receive azithromycin, highlighting the differences in baseline characteristics between patient groups. One potential explanation could be the lower rate of prescription of azithromycin to patients with liver or cardiac conditions.

Table 1.

Sensitivity analysis: COVID-19 Outcomes Scale at randomization, day 14, and day 28, according to premedication by azithromycin (prescription of azithromycin before inclusion in the trial)

	At randomization		14 d after randomization		28 d after randomization
COVID-19 Outcomes Scale	Yes (n = 150)	No (n = 329)	Yes (n = 150)	No (n = 329)	Yes (n = 150)	No (n = 329)
(1) Death	0 (0%)	0 (0%)	10 (6.7%)	22 (6.7%)	17 (11.3%)	33 (10%)
(2) Invasive mechanical ventilation or extracorporeal membrane oxygenation	16 (10.7%)	16 (4.9%)	20 (13.3%)	22 (6.7%)	12 (8%)	11 (3.3%)
(3) Noninvasive ventilation or high flow nasal cannula	22 (14.7%)	33 (10%)	3 (2%)	9 (2.7%)	0 (0%)	0 (0%)
(4) Hospitalized with oxygen	69 (46%)	155 (47.1%)	11 (7.3%)	29 (8.8%)	4 (2.7%)	15 (4.6%)
(5) Hospitalized without oxygen	43 (28.7%)	125 (38%)	8 (5.3%)	29 (8.8%)	3 (2%)	8 (2.4%)
(6) Discharged, limitation in activity	0 (0%)	0 (0%)	63 (42%)	106 (32.2%)	50 (33.3%)	97 (29.5%)
(7) Discharged, no limitation in activity	0 (0%)	0 (0%)	35 (23.3%)	112 (34%)	64 (42.7%)	165 (50.2%)
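
The percentages in Table 1 can be rechecked directly from the reported counts. The following Python sketch is not part of the original notebook (which was written in R); it is a minimal illustration that recomputes each percentage from the counts transcribed from Table 1, assuming half-up rounding to one decimal place, and confirms that every column sums to its group size. The names `ROWS`, `percent`, `check_table`, `column_totals`, and `share_in_categories` are illustrative and do not come from the source.

```python
from decimal import Decimal, ROUND_HALF_UP

# Group sizes from Table 1: premedication with azithromycin (yes / no).
N_YES, N_NO = 150, 329

# (time point, scale category, count yes, reported % yes, count no, reported % no),
# transcribed from Table 1.
ROWS = [
    ("randomization", 1, 0, "0", 0, "0"),
    ("randomization", 2, 16, "10.7", 16, "4.9"),
    ("randomization", 3, 22, "14.7", 33, "10"),
    ("randomization", 4, 69, "46", 155, "47.1"),
    ("randomization", 5, 43, "28.7", 125, "38"),
    ("randomization", 6, 0, "0", 0, "0"),
    ("randomization", 7, 0, "0", 0, "0"),
    ("day 14", 1, 10, "6.7", 22, "6.7"),
    ("day 14", 2, 20, "13.3", 22, "6.7"),
    ("day 14", 3, 3, "2", 9, "2.7"),
    ("day 14", 4, 11, "7.3", 29, "8.8"),
    ("day 14", 5, 8, "5.3", 29, "8.8"),
    ("day 14", 6, 63, "42", 106, "32.2"),
    ("day 14", 7, 35, "23.3", 112, "34"),
    ("day 28", 1, 17, "11.3", 33, "10"),
    ("day 28", 2, 12, "8", 11, "3.3"),
    ("day 28", 3, 0, "0", 0, "0"),
    ("day 28", 4, 4, "2.7", 15, "4.6"),
    ("day 28", 5, 3, "2", 8, "2.4"),
    ("day 28", 6, 50, "33.3", 97, "29.5"),
    ("day 28", 7, 64, "42.7", 165, "50.2"),
]


def percent(count, total):
    """Percentage of total, rounded half-up to one decimal (an assumed convention)."""
    value = Decimal(count) * 100 / Decimal(total)
    return value.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


def check_table(rows):
    """Return (time, category, group) for every reported percentage that does not match."""
    mismatches = []
    for time, category, c_yes, p_yes, c_no, p_no in rows:
        for group, count, reported, total in (
            ("yes", c_yes, p_yes, N_YES),
            ("no", c_no, p_no, N_NO),
        ):
            if percent(count, total) != Decimal(reported):
                mismatches.append((time, category, group))
    return mismatches


def column_totals(rows):
    """Sum the counts per time point as (total yes, total no)."""
    totals = {}
    for time, _category, c_yes, _p_yes, c_no, _p_no in rows:
        yes, no = totals.get(time, (0, 0))
        totals[time] = (yes + c_yes, no + c_no)
    return totals


def share_in_categories(rows, time, categories):
    """Pooled percentage of each group falling in the given scale categories."""
    selected = [r for r in rows if r[0] == time and r[1] in categories]
    yes = sum(r[2] for r in selected)
    no = sum(r[4] for r in selected)
    return percent(yes, N_YES), percent(no, N_NO)


if __name__ == "__main__":
    print("mismatches:", check_table(ROWS))
    print("column totals:", column_totals(ROWS))
    print("categories 2-3 at randomization:",
          share_in_categories(ROWS, "randomization", {2, 3}))
```

Under these assumptions, every reported percentage matches its recomputed value and each column sums to 150 or 329. Pooling categories 2 and 3 at randomization gives 25.3% (38/150) of premedicated patients versus 14.9% (49/329) of the others, which is consistent with the worse baseline status described in the text.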

Figure 3.

Supplemental analysis based on ORCHID clinical trial data: laboratory test trajectories according to premedication by azithromycin.

An essential part of producing sound and reproducible analyses lies in following established guidelines when reporting results. Our work implements the FAIR principles, following the Three-point FAIRification Framework: findable (data loaded in BioData Catalyst are assigned globally unique and persistent identifiers, and variable names are searchable globally without requiring specific authorization), accessible (the API and web-based graphical user interface are implemented in open-source languages, with different levels of authorization), interoperable (data vocabulary and variables are made accessible through open PIC-SURE), and reusable (reusable by every person granted individual patient-level access). A more detailed version of these principles is provided in Table 2, and the FAIR initiative website provides a checklist for investigators who want to implement these principles.

Table 2.

Details of the “FAIR Guiding Principles for scientific data management and stewardship”

FAIR principles	Details
Findable	F1: (Meta)data are assigned a globally unique and persistent identifier F2: Data are described with rich metadata (defined by R1 below) F3: Metadata clearly and explicitly include the identifier of the data they describe F4: (Meta)data are registered or indexed in a searchable resource
Accessible	A1: (Meta)data are retrievable by their identifier using a standardized communications protocol A1.1: The protocol is open, free, and universally implementable A1.2: The protocol allows for an authentication and authorization procedure, where necessary A2: Metadata are accessible, even when the data are no longer available
Interoperable	I1: (Meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation I2: (Meta)data use vocabularies that follow FAIR principles I3: (Meta)data include qualified references to other (meta)data
Reusable	R1: (Meta)data are richly described with a plurality of accurate and relevant attributes R1.1: (Meta)data are released with a clear and accessible data usage license R1.2: (Meta)data are associated with detailed provenance R1.3: (Meta)data meet domain-relevant community standards

It is worth noting that the advancements presented in our work primarily address the statistical aspect of reproducibility. An entire body of literature has been devoted to identifying and handling other elements of reproducibility. Setting up reproducible environments is an essential part of reproducibility, but it usually requires a certain degree of technical expertise. Almugbel et al have facilitated the use of containers by setting up a web-based interface that automatically generates Dockerfiles. Eyal-Altman et al created a platform (PCM-SABRE) for reproducing and expanding on previous work in the domain of prediction in oncology. Reproducibility issues can also stem from experimental design, selective reporting, and journal publication biases.
Study preregistration has been proposed as a potential solution for these issues. Nonetheless, the presented solution represents a significant step forward in reproducibility by giving NHLBI's investigators the possibility to reuse data and transparently showcase their work. However, the impact of such a process depends on the level of adoption by the research community. As a comparison, publishing a clinical trial protocol on ClinicalTrials.gov has become ubiquitous in the medical research community because it addressed a crucial need of the scientific community: preventing selective reporting of results and data-driven inferences. Similarly, such an initiative will only be helpful if embraced by the research community on a large scale. Therefore, we advocate that the practice of submitting a report of the analysis code in a Jupyter-like notebook format be encouraged by updating the clinical trial reporting guidelines. Moreover, medical journals could also make it a required document for publication, especially because the minimal requirements to implement such a workflow rely on only a few elements: using an open-source programming language, displaying the code and results in a notebook, and accessing the data from a cloud-based repository. Such initiatives would go a long way in incentivizing investigators to produce robust analyses, thereby fostering trust in published results.

Besides BioData Catalyst, other initiatives emphasize reproducibility through transparent workflows and results, such as AnVIL, a genomic data resource that leverages a cloud-based infrastructure for democratizing genomic data access, or the Cancer Research Data Commons, which connects diverse datasets with analytical tools in the cloud.
Over the course of the COVID-19 pandemic, the NIH launched the National COVID Cohort Collaborative, which aims at sharing and harmonizing individual-level clinical data to accelerate COVID-19 research, and in which transparency and reproducibility are promoted as cornerstones of the project and facilitated by cloud-based platforms and tools. Moreover, the NIH set up the Cloud Platform Interoperability Effort to allow investigators using BioData Catalyst data to find and integrate data across three other platforms: Cancer Research Data Commons, Kids First Data Resource Center, and AnVIL, thereby making BioData Catalyst part of a larger network of connected cloud-based data repositories.

We do not anticipate that our effort will solve the reproducibility crisis. The practices laid out here are only effective if combined with other best practices for conducting reproducible science. It is worth noting that this protocol cannot help with flaws intrinsic to a given study, like data collection issues, flaws in the study design, or population selection bias. Neither can it eliminate the need to reproduce experimental results using different populations or variations in the methodology. Although these issues are of paramount importance, they would be addressed by a completely different framework. However, the principles demonstrated can address reproducibility issues that stem from inaccuracies in the statistical analysis or data-management process; discrepancies between prerecorded and shared outcomes; selective reporting; and other practices like p-hacking or outlier trimming. Those principles can have a real impact. The heated debate over hydroxychloroquine might have benefited from more transparency in the analyses, helping science prevail over opinion and eventually translating into more informed treatment choices and public health policies.

Funding

This work was supported by the National Institutes of Health, National Heart, Lung, and Blood Institute, through the BioData Catalyst program (award 1OT3HL142479-01, 1OT3HL142478-01, 1OT3HL142481-01, and 1OT3HL142480-01).

Author Contributions

PA and JK conceptualized the work; PA set up the software platform to access the data; ASL and PA designed the methodology; ASL carried out the statistical analyses; PA supervised the statistical analyses; ASL and PA accessed and verified the underlying data; ASL, JK, and PA interpreted and validated the results; ASL drafted the initial version of the manuscript; ASL, PA, and JK reviewed and edited the manuscript. All authors approved the final version to be published. All authors agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.

Ethics Approval

This study was deemed institutional review board (IRB) exempt by the Harvard Medical School IRB based on its being nonhuman subject research.
