| Literature DB >> 33935332 |
Rut Lucas-Dominguez1,2,3, Adolfo Alonso-Arroyo1,2, Antonio Vidal-Infer1,2, Rafael Aleixandre-Benavent2,4.
Abstract
During the previous Ebola and Zika outbreaks, researchers shared their data, allowing many published epidemiological studies to be produced only from open research data, to speed up investigations and control of these infections. This study aims to evaluate the dissemination of the COVID-19 research data underlying scientific publications. Analysis of COVID-19 publications from December 1, 2019, to April 30, 2020, was conducted through the PubMed Central repository to evaluate the research data available through its publication as supplementary material or deposited in repositories. The PubMed Central search generated 5,905 records, of which 804 papers included complementary research data, especially as supplementary material (77.4%). The most productive journals were The New England Journal of Medicine, The Lancet and The Lancet Infectious Diseases, the most frequent keyword was pneumonia, and the most used repositories were GitHub and GenBank. An expected growth in the number of published articles following the course of the pandemics is confirmed in this work, while the underlying research data are only 13.6%. It can be deduced that data sharing is not a common practice, even in health emergencies, such as the present one. High-impact generalist journals have accounted for a large share of global publishing. The topics most often covered are related to epidemiological and public health concepts, genetics, virology and respiratory diseases, such as pneumonia. However, it is essential to interpret these data with caution following the evolution of publications and their funding in the coming months. © Akadémiai Kiadó, Budapest, Hungary 2021.Entities:
Keywords: COVID-19; Data sharing; PubMed central; Repository; Supplementary material
Year: 2021 PMID: 33935332 PMCID: PMC8072296 DOI: 10.1007/s11192-021-03971-6
Source DB: PubMed Journal: Scientometrics ISSN: 0138-9130 Impact factor: 3.801
Fig. 1Number of articles, funded research, supplementary material and number of cases worldwide
Evolution from January to April 2020 of the research data publication on COVID-19 pandemics in the form of supplementary material or deposited in repositories
| Fortnight | Total papers | Journal name | Total papers | Key words | Total papers | Repository | Total papers |
|---|---|---|---|---|---|---|---|
| Jan20 fortnight 2 | 27 | N Engl J Med | 4 | Genome | 4 | GenBank | 3 |
| The Lancet | 4 | MERS–CoV; Wuhan | 3 | ||||
| Emerg Microbes Infect | 4 | Bats; Diagnosis; Emerging; Emerging infectious disease; Pneumonia; Polymerase chain reaction (PCR); Virus | 2 | ||||
| Feb20 fortnight 1 | 56 | J Med Virol | 4 | MERS–CoV | 6 | GenBank | 3 |
| N Engl J Med; Lancet; Euro Surveill; J Virol | 3 | Zoonosis | 5 | BioProject; Github; Sequence Read Archive (SRA) | 2 | ||
| Epidemiology; Pneumonia; SARS-CoV; Wuhan; Antiviral Therapy | 4 | ||||||
| Feb20 fortnight 2 | 77 | The Lancet | 7 | Pneumonia | 7 | Github | 4 |
| The Lancet Infect Dis | 6 | SARS-CoV | 6 | ||||
| J Clin Med | 6 | Epidemic | 5 | ||||
| Mar20 fortnight 1 | 63 | J Infect | 4 | SARS-CoV | 5 | Protein Data Bank (PDB); Electron Microscopy Data Bank (EMDB); GenBank; GISAID | 2 |
| The Lancet | 3 | Spike protein | 4 | ||||
| The Lancet Infect Dis | 3 | Angiotensin converting enzyme 2 (ACE2); MERS–CoV; Pneumonia | 3 | ||||
| Mar20 fortnight 2 | 143 | N Engl J Med | 8 | Pneumonia | 7 | Github | 6 |
| The Lancet Infect Dis | 7 | spike protein | 6 | Gene Expression Omnibus (GEO) | 3 | ||
| Emerg Microbes Infect | 6 | Angiotensin converting enzyme 2 (ACE2); Molecular docking; Pandemics | 5 | ||||
| Apr20 fortnight 1 | 201 | N Engl J Med | 14 | Pandemics | 9 | Github | 4 |
| Eur Heart J | 7 | Outbreak; Public health | 7 | GenBank | 3 | ||
| Int J Nurs Sci | 7 | Polymerase chain reaction (PCR) | 6 | Protein Data Bank (PDB); GISAID | 2 | ||
| Apr20 fortnight 2 | 237 | J Arthroplasty | 12 | Pandemics | 13 | Github | 4 |
| The Lancet | 11 | Antiviral Therapy; Arthroplasty; Clinical characteristics; Diagnosis; E-health; Infectious disease; Statistic model; Therapy | 5 | GenBank | 4 | ||
| The Lancet Infect Dis | 7 | Molecular dynamics; Mortality; Orthopaedic; Pneumonia; Psychological disorders; Public health; SARS; Spike protein; Surveillance | 4 | GISAID; Mendeley Data | 2 |
Fig. 2Bimonthly mean production of articles per journal. Related-samples Wilcoxon signed rank test: p = 0.0001
Fig. 3a Evolution of publications per authors’ country of affiliation. b International collaboration network. (The size of the node is proportional to the number of documents generated by each country. The color of the nodes is the same in all countries belonging to the same continent. The width of the lines represents the numbers papers in collaboration between the countries.)
Fig. 4Bimonthly mean production of articles per country. Related-samples Wilcoxon signed rank test: p = 0.0001
Fig. 5Evolution of collaboration according to the number of authors signing the papers
Type of files of supplementary materials analysed
| Type of files | N° papers | N° files | Average of files/articles | % articles /total | % files /total |
|---|---|---|---|---|---|
| 348 | 530 | 1.52 | 42.75% | 41.31% | |
| DOC/DOCX | 265 | 394 | 1.49 | 32.56% | 30.71% |
| JPEG/JPG/TIF/TIFF/PNG/GIF | 31 | 79 | 2.55 | 3.81% | 6.16% |
| XLS/XLSX | 65 | 110 | 1.69 | 7.99% | 8.57% |
| MOV/MP4/WMV/MPG/AVI | 25 | 45 | 1.80 | 3.07% | 3.51% |
| PPT/PPTX | 10 | 12 | 1.20 | 1.23% | 0.94% |
| EPS | 1 | 2 | 2.00 | 0.12% | 0.16% |
| HTML | 2 | 3 | 1.50 | 0.25% | 0.23% |
| CSV | 8 | 45 | 5.63 | 0.98% | 3.51% |
| XML/GZ | 29 | 32 | 1.10 | 3.56% | 2.49% |
| TXT/PL | 1 | 1 | 1.00 | 0.12% | 0.08% |
| M | 1 | 3 | 3.00 | 0.12% | 0.23% |
| MAT | 1 | 4 | 4.00 | 0.12% | 0.31% |
| SuppDOI | 10 | 10 | 1.00 | 1.23% | 0.78% |
| 7z | 1 | 1 | 1.00 | 0.12% | 0.08% |
| FLV | 3 | 3 | 1.00 | 0.37% | 0.23% |