| Literature DB >> 27679558 |
Florian Leitner1, Concha Bielza2, Sean L Hill3, Pedro Larrañaga2.
Abstract
Neuroscience and molecular biology have been generating large datasets over the past years that are reshaping how research is being conducted. In their wake, open data sharing has been singled out as a major challenge for the future of research. We conducted a comparative study of citations of data publications in both fields, showing that the average publication tagged with a data-related term by the NCBI MeSH (Medical Subject Headings) curators achieves a significantly larger citation impact than the average in either field. We introduce a new metric, the data article citation index (DAC-index), to identify the most prolific authors among those data-related publications. The study is fully reproducible from an executable Rmd (R Markdown) script together with all the citation datasets. We hope these results can encourage authors to more openly publish their data.Entities:
Keywords: DAC-index; citations; data article citation index; data publications; data sharing; open data
Year: 2016 PMID: 27679558 PMCID: PMC5020078 DOI: 10.3389/fnins.2016.00419
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
Figure 1Comparative study of empirical data . Left (A,D): Histogram of the number of data articles published per year. Middle (B,E): Cumulative distribution function (CDF) over the first few citation counts C (x-axis), showing the observed probability for the median citation value of average articles (C = 6 and C = 11, respectively). Right (C,F): Log-log plots of the complementary (1 - CDF) visualizing the growing gap between the data and random citation distributions in the heavy tails. The small horizontal black line marks the observed citation count difference for articles in the top decile (P = 0.1; i.e., the 10% most cited articles).
DAC-index ranking for the top 10 neuroscience (left) and molecular biology (right) authors.
| 113 | 1192 | 21 (29) | 1 | Amos Bairoch | 685 | 22,504 | 88.5 (52) | |
| Anthony Marmarou | 108 | 2246 | 80 (26) | 2 | Eugene V. Koonin | 366 | 21,984 | 89.5 (55) |
| Gordon D. Murray | 76 | 1061 | 81 (33) | 3 | A. Keith Dunker | 308 | 6187 | 54.5 (30.5) |
| Andrew I. R. Maas | 68 | 850 | 79 (18.5) | 4 | Richard M. Durbin | 305 | 23,790 | 148 (78.5) |
| Ewout W. Steyerberg | 67 | 848 | 81 (19.5) | 5 | 272 | 5463 | 42 (35) | |
| Juan Lu | 66 | 760 | 79 (14) | 6 | Jeffrey Skolnick | 254 | 3743 | 42 (28) |
| Michal J. | 62 | 934 | 56 (24) | 7 | Denis F. Hochstrasser | 235 | 3561 | 55 (22.5) |
| Isabella Butcher | 60 | 741 | 79 (18.5) | 8 | Jean-Charles Sanchez | 229 | 3430 | 59 (24) |
| Gillian S. McHugh | 60 | 741 | 79 (25) | 9 | 227 | 4549 | 33.5 (32) | |
| 58 | 1394 | 67 (65) | 10 | Vladimir N. Uversky | 223 | 4984 | 47 (26.5) | |
Data articles except for authors with italic names received significantly more citations than their non-data articles. DAC-idx: DAC-index according to Formula (1). Citations: Authors' cumulative citations on data articles. Median: Author's median number of citations for data articles (in parenthesis: median for all articles by that author).