Literature DB >> 32220289

Early in the epidemic: impact of preprints on global discourse about COVID-19 transmissibility.

Maimuna S Majumder1, Kenneth D Mandl2.   

Abstract

Entities:  

Mesh:

Year:  2020        PMID: 32220289      PMCID: PMC7159059          DOI: 10.1016/S2214-109X(20)30113-3

Source DB:  PubMed          Journal:  Lancet Glob Health        ISSN: 2214-109X            Impact factor:   26.763


× No keyword cloud information.
Since it was first reported by WHO in Jan 5, 2020, over 80 000 cases of a novel coronavirus disease (COVID-19) have been diagnosed in China, with exportation events to nearly 90 countries, as of March 6, 2020. Given the novelty of the causative pathogen (named SARS-CoV-2), scientists have rushed to fill epidemiological, virological, and clinical knowledge gaps—resulting in over 50 new studies about the virus between January 10 and January 30 alone. However, in an era where the immediacy of information has become an expectation of decision makers and the general public alike, many of these studies have been shared first in the form of preprint papers—before peer review. For the past three decades, preprint servers have become commonplace in the scientific publication ecosystem, and COVID-19 has prompted a seemingly unprecedented use of these platforms. Although peer-review is crucial for the validation of science, the ongoing outbreak has showcased the speed with which preprints can disseminate information during emergencies. In this Comment, we used both preprint and peer-reviewed studies that estimated the transmissibility potential (ie, basic reproduction number [R 0]) of SARS-CoV-2 on or before Feb 1, 2020 to investigate the role that preprints have had in information dissemination during the ongoing outbreak. We also analysed the agreement of preprint estimates compared with those presented by peer-reviewed studies and propose a consensus-based approach for evaluating the validity of preprint findings during public health crises. For our analysis, we collected publicly available data from scientific studies, news reports, and search trends pertaining to SARS-CoV-2 and its R 0. Defined as the average number of secondary infections that a new case might transmit in a fully susceptible population, estimates of R 0 can provide decision makers with insights into the epidemic potential of a given outbreak. Relevant news reports were discovered through MediaCloud and search trends by use of Google Search Trends, and both served as a proxy indicator for information dissemination. Meanwhile, relevant scientific studies were discovered through a combination of searches executed with use of Google Scholar and, to address possible delays in indexing, four popular public preprint servers (ie, arXiv, bioRxiv, medRxiv, and Social Science Research Network [SSRN]) that we believe are representative of the relevant preprint literature. Search terms and specifications for each data source are outlined in the appendix (p 2). All studies discovered through Google Scholar, arXiv, bioRxiv, medRxiv, and SSRN were manually checked for relevance to the topic area of interest. We retained only studies that included estimates for the R 0 associated with SARS-CoV-2 in the body of the text. After this initial data discovery phase, which yielded 11 individual studies, date of first publication, publication platform, review status (ie, preprint vs peer-reviewed), and methodological details were manually curated from each study (appendix p 3).4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 R 0 estimates were also extracted from each study for further analysis. In the event of multiple R 0 estimates—because of preprint revisions after the first version or the use of multiple approaches in a single study—each estimate was recorded and treated as a separate entry to represent all available knowledge at any given point in time (appendix p 3). Given that the first known preprint estimates for R 0 were posted to SSRN by us on Jan 23, we plotted search trend fractions and news report volume between Jan 23 and Feb 1 (appendix p 4). Baseline data for both sources before Jan 23, 2020, yielded negligible search trend interest and news report volume, and data collected up to Feb 9, 2020, showed diminishing interest and volume after the catchment window (appendix p 4). To illustrate when each of the 11 relevant studies became available to the public, indicator bars were overlaid against the search trend and news report data by date of publication (appendix p 4). We then plotted each of the 16 R 0 estimates produced by the 11 studies, including both the mean and the estimate range (eg, 95% CI, 95% credible interval, and so on) presented (appendix p 3). Estimates were plotted by date of publication and alphabetically there-in, offering a side-by-side comparison of preprint versus peer-reviewed results; averages and 95% CIs were also computed for both groups (figure ).
Figure

R0 mean and range estimates from 11 different studies of COVID–19 as a function of time

For preprints that were revised before publication of the first relevant peer-reviewed study on Jan 29, the version number is indicated between parentheses as (n). When multiple R0 estimates were presented in a single study because of the use of multiple approaches, the version number is followed by a single decimal place to indicate the approach used (n.n). If a first author published more than one relevant independent study before Feb 1, the version number is followed immediately by an alphabetical marker ordered by date of publication (nx). Ranges presented vary by study (eg, 95% CI, 95% credible interval, and so on) and are presented in the appendix (p 3). R0=basic reproduction number.

R0 mean and range estimates from 11 different studies of COVID–19 as a function of time For preprints that were revised before publication of the first relevant peer-reviewed study on Jan 29, the version number is indicated between parentheses as (n). When multiple R0 estimates were presented in a single study because of the use of multiple approaches, the version number is followed by a single decimal place to indicate the approach used (n.n). If a first author published more than one relevant independent study before Feb 1, the version number is followed immediately by an alphabetical marker ordered by date of publication (nx). Ranges presented vary by study (eg, 95% CI, 95% credible interval, and so on) and are presented in the appendix (p 3). R0=basic reproduction number. Google Search Trends and MediaCloud data suggested that both general (ie, search) interest and news media interest in the R 0 associated with COVID–19 peaked before the publication of relevant peer-reviewed studies during the early stages of the epidemic. In the selected time frame, search interest peaked on Jan 27 after a sharp increase between Jan 23 and Jan 25 immediately after the publication of five early preprint studies—all of which estimated R 0—in bioRxiv, medRxiv, and SSRN. Meanwhile, news media interest peaked on Jan 28, coinciding with a sixth preprint study published in arXiv (appendix p 4). The first peer-reviewed estimates were then published by Li and colleagues in The New England Journal of Medicine on Jan 29 at 17:00 h (eastern standard time), followed by four additional peer-reviewed studies in Eurosurveillance, The International Journal of Infectious Diseases, The Lancet, and Journal of Clinical Medicine up to Feb 1.14, 19 Average R 0 estimates across the preprint group were 3·61 (95% CI 2·77–4·45) and 2·54 (2·17–2·91) across the peer-reviewed group—showing overlap in 95% CIs despite a wide diversity of modelling methods and data sources used both in-group and across-group (appendix p 3). Although the average mean for the preprint group was higher than that for the peer-reviewed group, this effect was driven primarily by two upper-limit outlier estimates (with R 0 higher than the 95% CI maximum; figure).9, 10 Exclusion of these two estimates by use of a consensus-based approach based on the 95% CIs yielded an average R 0 estimate of 3·02 (95% CI 2·65–3·39) for the preprint group. Notably, two studies in the peer-reviewed group had previously been published as preprints.15, 16 Although estimates presented by Riou and Althaus remained unchanged after peer review, estimates presented by Zhao and colleagues were higher before peer review than afterwards. Our findings suggest that, because of the speed of their release, preprints—rather than peer-reviewed literature in the same topic area—might be driving discourse related to the ongoing COVID-19 outbreak. Although our analysis focused on search trends and news media data as a measure for general discourse, it is likely that preprints are also influencing policy making discussions, given that WHO announced on Jan 26, 2020, that they would be creating a repository of relevant studies—including those that have not yet been peer-reviewed. Nevertheless, despite the advantages of speedy information delivery, the lack of peer review can also translate into issues of credibility and misinformation, both intentional and unintentional. This particular drawback has been highlighted during the ongoing outbreak, especially after the high-profile withdrawal of a virology study from the preprint server bioRxiv, which erroneously claimed that COVID-19 contained HIV “insertions”. The very fact that this study was withdrawn showcases the power of open peer-review during emergencies; the withdrawal itself appears to have been prompted by outcry from dozens of scientists from around the globe who had access to the study because it was placed on a public server. Much of this outcry was documented on Twitter (a microblogging platform) and on longer-form popular science blogs, signalling that such fora would serve as rich additional data sources for future work on the impact of preprints on public discourse. However, instances such as this one described showcase the need for caution when acting upon the science put forth by any one preprint. With this in mind, taking multiple studies into consideration as presented in our analysis can help operationalise the kind of caution necessitated by preprints while simultaneously allowing for important, robust insights before the publication of a peer-reviewed study in the same topic area. Here, we used a simple method in which we plotted the ten R 0 estimates that were posted as preprints before publication of the first peer-reviewed study on Jan 29; we then took the average of these estimates and excluded the two estimates that qualified as upper-limit outliers—both upon visual inspection and as a function of the 95% CI. Even before outlier elimination, this simple method yielded average R 0 estimates similar to those presented by the peer-reviewed studies subsequently published on and after Jan 29; however, more complex approaches that incorporate weighted averages based on estimate confidence, similar to traditional meta-analytical methods, offer a promising avenue for future work. Such collective, consensus-based approaches will arguably be easiest to use when the research of interest is quantitative in nature; nevertheless, given that many crucial epidemiological parameters that inform decision making (eg, incubation period, generation time, and so on) are quantitative, our proposed approach could work well in these contexts as well. Our work showcases the powerful role preprints can have during public health crises because of the timeliness with which they can disseminate new information. Furthermore, given that two of the preprints included in this analysis were later published in peer-reviewed outlets, the evidence shows that that even prestigious journals now permit the sharing of important findings before peer review and that the use of preprint platforms does not jeopardise future peer-reviewed publication.15, 16 Without question, primacy and peer-reviewed publications are key metrics in individual professional advancement (eg, academic promotion); nevertheless, the impact of preprints on discourse and decision making pertaining to the ongoing COVID-19 outbreak suggests that we must rethink how we reward and recognise community contributions during present and future public health crises.
  7 in total

1.  Estimating the Unreported Number of Novel Coronavirus (2019-nCoV) Cases in China in the First Half of January 2020: A Data-Driven Modelling Analysis of the Early Outbreak.

Authors:  Shi Zhao; Salihu S Musa; Qianying Lin; Jinjun Ran; Guangpu Yang; Weiming Wang; Yijun Lou; Lin Yang; Daozhou Gao; Daihai He; Maggie H Wang
Journal:  J Clin Med       Date:  2020-02-01       Impact factor: 4.241

2.  Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia.

Authors:  Qun Li; Xuhua Guan; Peng Wu; Xiaoye Wang; Lei Zhou; Yeqing Tong; Ruiqi Ren; Kathy S M Leung; Eric H Y Lau; Jessica Y Wong; Xuesen Xing; Nijuan Xiang; Yang Wu; Chao Li; Qi Chen; Dan Li; Tian Liu; Jing Zhao; Man Liu; Wenxiao Tu; Chuding Chen; Lianmei Jin; Rui Yang; Qi Wang; Suhua Zhou; Rui Wang; Hui Liu; Yinbo Luo; Yuan Liu; Ge Shao; Huan Li; Zhongfa Tao; Yang Yang; Zhiqiang Deng; Boxi Liu; Zhitao Ma; Yanping Zhang; Guoqing Shi; Tommy T Y Lam; Joseph T Wu; George F Gao; Benjamin J Cowling; Bo Yang; Gabriel M Leung; Zijian Feng
Journal:  N Engl J Med       Date:  2020-01-29       Impact factor: 176.079

3.  Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study.

Authors:  Joseph T Wu; Kathy Leung; Gabriel M Leung
Journal:  Lancet       Date:  2020-01-31       Impact factor: 79.321

4.  Pattern of early human-to-human transmission of Wuhan 2019 novel coronavirus (2019-nCoV), December 2019 to January 2020.

Authors:  Julien Riou; Christian L Althaus
Journal:  Euro Surveill       Date:  2020-01

5.  Estimation of the Transmission Risk of the 2019-nCoV and Its Implication for Public Health Interventions.

Authors:  Biao Tang; Xia Wang; Qian Li; Nicola Luigi Bragazzi; Sanyi Tang; Yanni Xiao; Jianhong Wu
Journal:  J Clin Med       Date:  2020-02-07       Impact factor: 4.241

6.  Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: A data-driven analysis in the early phase of the outbreak.

Authors:  Shi Zhao; Qianyin Lin; Jinjun Ran; Salihu S Musa; Guangpu Yang; Weiming Wang; Yijun Lou; Daozhou Gao; Lin Yang; Daihai He; Maggie H Wang
Journal:  Int J Infect Dis       Date:  2020-01-30       Impact factor: 3.623

7.  Preliminary prediction of the basic reproduction number of the Wuhan novel coronavirus 2019-nCoV.

Authors:  Tao Zhou; Quanhui Liu; Zimo Yang; Jingyi Liao; Kexin Yang; Wei Bai; Xin Lu; Wei Zhang
Journal:  J Evid Based Med       Date:  2020-02-12
  7 in total
  47 in total

1.  Preprint citation practice in PLOS.

Authors:  Marc Bertin; Iana Atanassova
Journal:  Scientometrics       Date:  2022-05-25       Impact factor: 3.801

2.  The Academic, Societal and Animal Welfare Benefits of Open Science for Animal Science.

Authors:  Christian Nawroth; E Tobias Krause
Journal:  Front Vet Sci       Date:  2022-04-12

3.  The Main Sources and Potential Effects of COVID-19-Related Discrimination.

Authors:  Piotr Rzymski; Hanna Mamzer; Michał Nowicki
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

4.  Overview of current state of research on the application of artificial intelligence techniques for COVID-19.

Authors:  Vijay Kumar; Dilbag Singh; Manjit Kaur; Robertas Damaševičius
Journal:  PeerJ Comput Sci       Date:  2021-05-26

Review 5.  Preprint Servers in Kidney Disease Research: A Rapid Review.

Authors:  Caitlyn Vlasschaert; Cameron Giles; Swapnil Hiremath; Matthew B Lanktree
Journal:  Clin J Am Soc Nephrol       Date:  2020-07-17       Impact factor: 8.237

6.  Challenges in control of COVID-19: short doubling time and long delay to effect of interventions.

Authors:  Lorenzo Pellis; Francesca Scarabel; Helena B Stage; Christopher E Overton; Lauren H K Chappell; Elizabeth Fearon; Emma Bennett; Katrina A Lythgoe; Thomas A House; Ian Hall
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2021-05-31       Impact factor: 6.237

7.  The COVID-19 Diagnostic Technology Landscape: Efficient Data Sharing Drives Diagnostic Development.

Authors:  Eric R G R Aguiar; Jesús Navas; Luis G C Pacheco
Journal:  Front Public Health       Date:  2020-06-18

8.  Preprint servers: a 'rush to publish' or 'just in time delivery' for science?

Authors:  Alan Robert Smyth; Claire Rawlinson; Gisli Jenkins
Journal:  Thorax       Date:  2020-04-20       Impact factor: 9.139

9.  Novel coronavirus 2019-nCoV (COVID-19): early estimation of epidemiological parameters and epidemic size estimates.

Authors:  Jonathan M Read; Jessica R E Bridgen; Derek A T Cummings; Antonia Ho; Chris P Jewell
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2021-05-31       Impact factor: 6.237

10.  Using statistics and mathematical modelling to understand infectious disease outbreaks: COVID-19 as an example.

Authors:  Christopher E Overton; Helena B Stage; Shazaad Ahmad; Jacob Curran-Sebastian; Paul Dark; Rajenki Das; Elizabeth Fearon; Timothy Felton; Martyn Fyles; Nick Gent; Ian Hall; Thomas House; Hugo Lewkowicz; Xiaoxi Pang; Lorenzo Pellis; Robert Sawko; Andrew Ustianowski; Bindu Vekaria; Luke Webb
Journal:  Infect Dis Model       Date:  2020-07-04
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.