| Literature DB >> 24115908 |
Adam Eyre-Walker1, Nina Stoletzki.
Abstract
The assessment of scientific publications is an integral part of the scientific process. Here we investigate three methods of assessing the merit of a scientific paper: subjective post-publication peer review, the number of citations gained by a paper, and the impact factor of the journal in which the article was published. We investigate these methods using two datasets in which subjective post-publication assessments of scientific publications have been made by experts. We find that there are moderate, but statistically significant, correlations between assessor scores, when two assessors have rated the same paper, and between assessor score and the number of citations a paper accrues. However, we show that assessor score depends strongly on the journal in which the paper is published, and that assessors tend to over-rate papers published in journals with high impact factors. If we control for this bias, we find that the correlation between assessor scores and between assessor score and the number of citations is weak, suggesting that scientists have little ability to judge either the intrinsic merit of a paper or its likely impact. We also show that the number of citations a paper receives is an extremely error-prone measure of scientific merit. Finally, we argue that the impact factor is likely to be a poor measure of merit, since it depends on subjective assessment. We conclude that the three measures of scientific merit considered here are poor; in particular subjective assessments are an error-prone, biased, and expensive method by which to assess merit. We argue that the impact factor may be the most satisfactory of the methods we have considered, since it is a form of pre-publication review. However, we emphasise that it is likely to be a very error-prone measure of merit that is qualitative, not quantitative.Entities:
Mesh:
Year: 2013 PMID: 24115908 PMCID: PMC3792863 DOI: 10.1371/journal.pbio.1001675
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Figure 1The distribution of the impact factor in the two datasets.
The correspondence between assessor scores for the WT dataset.
| Second Assessor | |||||
| 1 | 2 | 3 | 4 | ||
| First assessor | 1 | 60 (42) | 97 | 13 | 0 |
| 2 | 104 | 229 (222) | 76 | 1 | |
| 3 | 12 | 59 | 42 (23) | 8 | |
| 4 | 0 | 3 | 6 | 6 (0.3) | |
Table gives the number of papers rated 1 to 4 for the WT data. Figures in parentheses are the numbers expected by chance alone. Note the ordering of assessors is of no consequence in the WT data since the assessments were performed simultaneously and independently.
The correspondence between assessor scores for the F1000 dataset.
| Second Assessor | ||||
| Recommended | Must Read | Exceptional | ||
| First assessor | Recommended | 365 (295) | 197 | 39 |
| Must Read | 240 | 255 (223) | 76 | |
| Exceptional | 46 | 66 | 44 (19) | |
Table gives the number of papers rated recommended, must read, or exceptional for F1000 papers when both assessments were made within 12 months. Figures in parentheses are the numbers expected by chance alone. Note the second assessor scored the paper after the first assessor and may have known the score the first assessor gave.
Figure 2The correlation between assessor score and impact factor in the two datasets.
Figure 3The proportion of papers, with between 90 and 110 citations in the F1000 dataset, scored in each category as a function of the IF of the journal in which the paper was published.
The numbers of papers in each category are 131, 194, and 128 for IF<10, 10
Correlations within journals with 100 or more papers in the F1000 dataset.
| Journal | Correlation between Assessor Scores | Correlation between Assessor Score and the Number of Citations | ||
|
| Correlation |
| Correlation | |
| Cell | 114 | 0.23 | 203 | 0.11 |
| Current Biology | 28 | −0.16 | 103 | 0.23 |
| Development | 22 | −0.18 | 100 | −0.089 |
| Journal of Biological Chemistry | 14 | 0.44 | 219 | 0.15 |
| Journal of Cell Biology | 29 | −0.022 | 103 | 0.22 |
| Journal of Neuroscience | 12 | −0.063 | 133 | −0.057 |
| Journal of the American Chemical Society | 22 | 0.42 | 126 | 0.043 |
| Molecular Cell | 32 | −0.049 | 121 | 0.15 |
| Nature | 217 | 0.15 | 375 | 0.20 |
| Neuron | 34 | 0.24 | 116 | 0.13 |
| PNAS | 115 | 0.32 | 531 | 0.093 |
| Science | 199 | 0.019 | 355 | 0.15 |
| Average | 0.11 | 0.11 | ||
p<0.05.
p<0.01.
p<0.001.
Figure 4The correlation between assessor score and the number of citations in the two datasets.
Figure 5The distribution of the number of citations in journals with IF<5 and IF>30 in the F1000 dataset.