Alice Fleerackers, Lise Nehring, Lauren A Maggio, Asura Enkhbayar, Laura Moorhead, Juan Pablo Alperin.
Abstract
The company Altmetric is often used to collect mentions of research in online news stories, yet there have been concerns about the quality of this data. This study investigates these concerns. Using a manual content analysis of 400 news stories as a comparison method, we analyzed the precision and recall with which Altmetric identified mentions of research in 8 news outlets. We also used logistic regression to identify the characteristics of research mentions that influence their likelihood of being successfully identified. We find that, for a predefined set of outlets, Altmetric's news mention data were relatively accurate (F-score = 0.80), with very high precision (0.95) and acceptable recall (0.70), although recall is below 0.50 for some news outlets. Altmetric is more likely to successfully identify mentions of research that include a hyperlink to the research item, an author name, and/or the title of a publication venue. This data source appears to be less reliable for mentions of research that provide little or no bibliometric information, as well as for identifying mentions of scholarly monographs, conference presentations, dissertations, and non-English research articles. Our findings suggest that, with caveats, scholars can use Altmetric news mention data as a relatively reliable source to identify research mentions across a range of outlets with high precision and acceptable recall, offering scholars the potential to conserve resources during data collection. Our study does not, however, offer an assessment of completeness or accuracy of Altmetric news data overall. Supplementary Information: The online version contains supplementary material available at 10.1007/s11192-022-04510-7. 
© Akadémiai Kiadó, Budapest, Hungary 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
Keywords: Accuracy; Altmetric; Data quality; Journalism; News; Scholarly communication
Year: 2022 PMID: 36212767 PMCID: PMC9526208 DOI: 10.1007/s11192-022-04510-7
Source DB: PubMed Journal: Scientometrics ISSN: 0138-9130 Impact factor: 3.801
Number of stories and mentions across news outlets
| Outlet | Number of stories | Number of stories with mentions | Percent of stories with mentions | Number of mentions | Average number of mentions per story with mentions |
|---|---|---|---|---|---|
| Health Day | 50 | 30 | 60 | 32 | 1.1 |
| IFLScience | 50 | 31 | 62 | 62 | 2.0 |
| MedPage Today | 50 | 34 | 68 | 108 | 3.2 |
| New York Times | 50 | 21 | 42 | 54 | 2.6 |
| Popular Science | 50 | 18 | 36 | 37 | 2.1 |
| The Guardian | 50 | 18 | 36 | 31 | 1.7 |
| News Medical | 50 | 43 | 86 | 46 | 1.1 |
| Wired | 50 | 33 | 66 | 132 | 4.0 |
| Total | 400 | 228 | 57 | 502 | 2.2 |
How research was mentioned across news outlets
| Outlet | Number of mentions (N) | Describes as research (n) | Describes as research (%) | Has link (n) | Has link (%) | Institution mentioned (n) | Institution mentioned (%) | Author mentioned (n) | Author mentioned (%) | Journal mentioned (n) | Journal mentioned (%) | Study date mentioned (n) | Study date mentioned (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Health Day | 32 | 31 | 97 | 0 | 0 | 31 | 97 | 30 | 94 | 25 | 78 | 25 | 78 |
| IFLScience | 62 | 45 | 73 | 50 | 81 | 20 | 32 | 21 | 34 | 20 | 32 | 9 | 15 |
| MedPage Today | 108 | 98 | 91 | 93 | 86 | 38 | 35 | 35 | 32 | 61 | 56 | 31 | 29 |
| New York Times | 54 | 36 | 67 | 40 | 74 | 26 | 48 | 28 | 52 | 15 | 28 | 16 | 30 |
| Popular Science | 37 | 29 | 78 | 25 | 68 | 15 | 41 | 14 | 38 | 10 | 27 | 10 | 27 |
| The Guardian | 31 | 24 | 77 | 21 | 68 | 18 | 58 | 11 | 35 | 6 | 19 | 9 | 29 |
| News Medical | 46 | 42 | 91 | 40 | 87 | 30 | 65 | 39 | 85 | 36 | 78 | 34 | 74 |
| Wired | 132 | 84 | 64 | 116 | 88 | 60 | 45 | 59 | 45 | 29 | 22 | 41 | 31 |
| Total | 502 | 389 | 77 | 385 | 77 | 238 | 47 | 237 | 47 | 202 | 40 | 175 | 35 |
Precision and recall of Altmetric research mention data
| Metric | Description | Value |
|---|---|---|
| True positive | Number of correctly identified mentions | 349 |
| False positive | Number of incorrectly identified mentions | 21 |
| False negative | Number of unidentified mentions | 153 |
| Precision | Proportion of identified mentions that were correct | 0.95 |
| Recall | Proportion of actual mentions that were correctly identified | 0.70 |
| F-score | The harmonic mean of precision and recall | 0.80 |
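The definitions in the table above can be checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' analysis code; the function name `precision_recall_f` is an assumption introduced for illustration. It takes the three counts from the table and applies the standard formulas. Results recomputed this way may differ from the reported values in the last digit, since the source gives no detail on rounding.

```python
def precision_recall_f(tp: int, fp: int, fn: int) -> tuple:
    """Return (precision, recall, F-score) from confusion counts.

    Hypothetical helper for illustration; not taken from the study.
    """
    precision = tp / (tp + fp)  # share of identified mentions that were correct
    recall = tp / (tp + fn)     # share of actual mentions that were identified
    f_score = 2 * precision * recall / (precision + recall)  # harmonic mean
    return precision, recall, f_score


# Counts copied from the table above.
precision, recall, f_score = precision_recall_f(tp=349, fp=21, fn=153)
print(f"precision={precision:.3f} recall={recall:.3f} F={f_score:.3f}")
```

Note that the denominator of recall, 349 + 153 = 502, equals the total number of manually coded mentions reported in the first table.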
Precision, recall, and accuracy (F-score) by news outlet
| Outlet | Number of mentions | Precision | Recall | F-score |
|---|---|---|---|---|
| Health Day | 52 | 0.95 | 0.56 | 0.71 |
| IFLScience | 81 | 0.85 | 0.71 | 0.77 |
| MedPage Today | 124 | 1.00 | 0.77 | 0.87 |
| New York Times | 83 | 0.86 | 0.56 | 0.67 |
| Popular Science | 69 | 0.85 | 0.46 | 0.60 |
| The Guardian | 63 | 1.00 | 0.45 | 0.62 |
| News Medical | 53 | 0.97 | 0.83 | 0.89 |
| Wired | 149 | 0.97 | 0.80 | 0.88 |
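As a rough consistency check on the per-outlet table, the sketch below recomputes each F-score from the rounded precision and recall values and lists the outlets whose recall falls below 0.50, the threshold mentioned in the abstract. The dictionary layout and the names `outlets`, `harmonic_f`, and `low_recall` are assumptions made for this illustration. Because the inputs are already rounded to two decimals, small differences from the reported F-scores are expected.

```python
# (precision, recall, reported F-score) per outlet, copied from the table above.
outlets = {
    "Health Day": (0.95, 0.56, 0.71),
    "IFLScience": (0.85, 0.71, 0.77),
    "MedPage Today": (1.00, 0.77, 0.87),
    "New York Times": (0.86, 0.56, 0.67),
    "Popular Science": (0.85, 0.46, 0.60),
    "The Guardian": (1.00, 0.45, 0.62),
    "News Medical": (0.97, 0.83, 0.89),
    "Wired": (0.97, 0.80, 0.88),
}


def harmonic_f(precision: float, recall: float) -> float:
    """F-score as the harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Recompute F from the rounded inputs and compare with the reported value.
recomputed = {name: harmonic_f(p, r) for name, (p, r, _) in outlets.items()}
for name, (_, _, reported) in outlets.items():
    print(f"{name}: recomputed {recomputed[name]:.3f}, reported {reported:.2f}")

# Outlets where fewer than half of the actual mentions were identified.
low_recall = [name for name, (_, r, _) in outlets.items() if r < 0.50]
print("Recall below 0.50:", low_recall)
```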
Results of logistic regression
| Variable | Standard error | Odds ratio (95% confidence interval) | p-value |
|---|---|---|---|
| Describes as research | −1.014 | 0.65 (0.29–1.49) | 0.311 |
| Has link | 9.517 | 53.77 (23.67–122.17) | 0.000 |
| Journal mentioned | 3.002 | 3.85 (1.60–9.28) | 0.003 |
| Author mentioned | 2.587 | 4.23 (1.42–12.61) | 0.010 |
| Institution mentioned | −1.367 | 0.49 (0.18–1.36) | 0.171 |
| Study date mentioned | −0.558 | 0.79 (0.35–1.79) | 0.577 |
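An odds ratio and its 95% confidence interval are enough to recover the underlying log-odds coefficient and an approximate standard error. The sketch below does this for each row of the table, assuming the intervals are symmetric Wald intervals on the log-odds scale with a normal quantile of 1.96; that assumption, and the names `Z_95`, `rows`, and `wald_from_ci`, are not from the source. Under that assumption, the ratio of coefficient to standard error comes out close to the first numeric column of the table, which suggests that column may hold a Wald z-statistic. This is an inference from the arithmetic, not something the source states.

```python
import math

Z_95 = 1.96  # assumed normal quantile for a 95% Wald interval

# (first numeric column, odds ratio, CI lower, CI upper), copied from the table above.
rows = {
    "Describes as research": (-1.014, 0.65, 0.29, 1.49),
    "Has link": (9.517, 53.77, 23.67, 122.17),
    "Journal mentioned": (3.002, 3.85, 1.60, 9.28),
    "Author mentioned": (2.587, 4.23, 1.42, 12.61),
    "Institution mentioned": (-1.367, 0.49, 0.18, 1.36),
    "Study date mentioned": (-0.558, 0.79, 0.35, 1.79),
}


def wald_from_ci(odds_ratio: float, lower: float, upper: float) -> tuple:
    """Recover (coefficient, standard error, z) from an odds ratio and its CI.

    Hypothetical helper; assumes a symmetric interval on the log-odds scale.
    """
    coef = math.log(odds_ratio)
    se = (math.log(upper) - math.log(lower)) / (2 * Z_95)
    return coef, se, coef / se


results = {name: wald_from_ci(o, lo, hi) for name, (_, o, lo, hi) in rows.items()}
for name, (coef, se, z) in results.items():
    tabled = rows[name][0]
    print(f"{name}: coef={coef:.3f} se={se:.3f} z={z:.3f} (table: {tabled})")
```

Because the odds ratios and interval bounds are rounded to two decimals, the recovered values are approximate.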