| Literature DB >> 29772003 |
Zohreh Zahedi1, Rodrigo Costas1,2.
Abstract
The data collection and reporting approaches of four major altmetric data aggregators are studied. The main aim of this study is to understand how differences in social media tracking and data collection methodologies can have effects on the analytical use of altmetric data. For this purpose, discrepancies in the metrics across aggregators have been studied in order to understand how the methodological choices adopted by these aggregators can explain the discrepancies found. Our results show that different forms of accessing the data from diverse social media platforms, together with different approaches of collecting, processing, summarizing, and updating social media metrics cause substantial differences in the data and metrics offered by these aggregators. These results highlight the importance that methodological choices in the tracking, collecting, and reporting of altmetric data can have in the analytical value of the data. Some recommendations for altmetric users and data aggregators are proposed and discussed.Entities:
Mesh:
Year: 2018 PMID: 29772003 PMCID: PMC5957428 DOI: 10.1371/journal.pone.0197326
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Coverage (% of DOIs with at least one metric) of PloS ONE DOIs across altmetric aggregators and aggregators and per data sources.
| 19,185 | 19,073 | 17,926 | 3,623 | 639 | |
| 2364 | N/A | 555 | N/A | 716 | |
| 31,398 | 30,117 | 9,973 | 2,497 | 1,615 | |
| 30,154 | 30,124 | N/A | N/A | N/A | |
| (95.9) | (95.8) | ||||
| 31,418 | 30,389 | 7,526 | 5,149 | 747 | |
(N/A: metrics not available in the platform)
Statistics (sum [t] and mean [m] scores) of altmetric counts across aggregators and per data source.
| 491,630 | 15.6 | 164,919 (143,471) | 5.2 (4.5) | 22,627 | 0.7 | 1,060 | 0.0 |
nP = number of Publication; t = sum score; m = mean score; MR = Mendeley readership counts, TW = (re)tweets, FB = Facebook counts, W = Wikipedia mentions, N/A = metrics not available in the platform, values in parentheses refer to statistics of distinct tweeters (Twitter users)—only for Altmetric.com and CrossRef ED.
Analysis of (dis)agreement among aggregators in Mendeley readership counts.
| readerships | overlapped | equal | > | < | overlapped | equal | > | < | overlapped | equal | > | < |
| 19,015 | 18,613 | 153 | 249 | |||||||||
| 97.9% | 0.8% | 1.3% | ||||||||||
| 30,117 | 14,416 | 13,974 | 1,727 | 19,012 | 7,977 | 9,823 | 1,212 | |||||
| 47.9% | 46.4% | 5.7% | 42.0% | 51.7% | 6.4% | |||||||
| 30,089 | 9,027 | 10,531 | 10,531 | 19,057 | 5,120 | 6,974 | 6,963 | 30,086 | 7,676 | 7,815 | 14,595 | |
| 30.0% | 35.0% | 35.0% | 26.9% | 36.6% | 36.5% | 25.5% | 26.0% | 48.5% | ||||
Analysis of (dis)agreement among aggregators in Twitter counts (re)tweets, and distinct tweeters.
| overlapped | equal | > | < | overlapped | equal | > | < | overlapped | equal | > | < | |
| 546 | 54 | 8 | 484 | |||||||||
| 9.9% | 1.5% | 88.6% | ||||||||||
| 9,763 | 3,135 | 1,027 | 5,601 | 515 | 74 | 404 | 37 | |||||
| 32.1% | 10.5% | 57.4% | 14.4% | 78.4% | 7.2% | |||||||
| 7,356 | 2402 | 258 | 4,696 | 525 | 156 | 355 | 14 | 4,143 | 957 | 895 | 2,291 | |
| 32.7% | 3.5% | 63.8% | 29.7% | 67.6% | 2.7% | 23.1% | 21.6% | 55.3% | ||||
*The values in the parentheses refer to number of tweeters—only available for Altmetric.com and CrossRef ED.
Analysis of (dis)agreement among aggregators in Facebook counts.
| overlapped | equal | > | < | overlapped | equal | > | < | |
| 1193 | 149 | 770 | 274 | |||||
| % | 12.5% | 64.5% | 23.0% | |||||
| 1819 | 225 | 1362 | 232 | 2496 | 1130 | 1330 | 36 | |
| % | 12.4% | 74.9% | 12.8% | 45.3% | 53.3% | 1.4% | ||
Analysis of (dis)agreement among aggregators in Wikipedia counts.
| overlapped | equal | > | < | overlapped | equal | > | < | overlapped | equal | > | < | |
| 464 | 74 | 367 | 23 | |||||||||
| 15.9% | 79.1% | 5.0% | ||||||||||
| 611 | 380 | 218 | 13 | 643 | 97 | 71 | 475 | |||||
| 62.2% | 35.7% | 2.1% | 15.1% | 11.0% | 73.9% | |||||||
| 612 | 527 | 42 | 43 | 518 | 97 | 21 | 400 | 697 | 457 | 8 | 232 | |
| 86.1% | 6.9% | 7.0% | 18.7% | 4.1% | 77.2% | 65.6% | 1.1% | 33.3% | ||||
Pearson correlation analysis across different aggregators and their Mendeley readership counts.
| N = 30,433 | ||||
| 1 | .917 | .918 | .874 | |
| 1 | .998 | .945 | ||
| 1 | .946 | |||
| 1 |
Pearson correlation analysis across different aggregators and their Wikipedia counts.
| N = 1,727 | ||||
| 1 | .380 | .551 | .867 | |
| 1 | .276 | .388 | ||
| 1 | .459 | |||
| 1 |
Pearson correlation analysis across different aggregators and their tweets and retweets.
| N = 18,285 | ||||||
| 1 | .979 | .636 | .602 | .952 | .762 | |
| 1 | .593 | .578 | .955 | .752 | ||
| 1 | .983 | .641 | .516 | |||
| 1 | .622 | .488 | ||||
| 1 | .728 | |||||
| 1 | ||||||
Tweeters (Twitter users) refer to the number of users who have tweeted publications. This information is available for Altmetric.com and CrossRef Event Data.
Pearson correlation analysis across different aggregators and their Facebook counts.
| N = 6,953 | |||
| 1 | .112 | .134 | |
| 1 | .397 | ||
| 1 |
Overview of the main methods of collecting, tracking, and updating metrics across different altmetric data aggregators—As reported by the data aggregators.
| Social media sources | Aggregators | Data collection approaches | Data aggregation and reporting approaches | Data updating approaches | ||
|---|---|---|---|---|---|---|
| Mendeley API. | Tracks, orderly, scholarly objects with DOI, PMID, ArXiv ID and stops the process if any result is found by any of the identifiers. | Aggregated individual user readership counts. | Raw data on readership by academic types, countries, and disciplines is recorded. | Daily updates. | ||
| Is part of Elsevier and does not directly use the Mendeley API. | Tracks any identifiers (DOIs, PMIDs, etc.) | Raw data is not provided. | ||||
| Mendeley API. | Tracks DOIs. | Aggregated individual user and group readership counts. | Raw data is not provided. | |||
| Twitter GNIP API. | Tracks a range of different identifiers (URLs, DOIs, PMIDs, ArXiv ids, SSRN IDs, ADS IDs, Amazon URLs, and ISBNs) | Aggregated count of distinct tweeters. Aggregated counts of (re)tweets provided in the Bookmarklet. | Raw data from Twitter (tweets, retweets, tweeters, followers, etc.) is available in the JSON files through the Altmetric.com API | Real-time update. | ||
| Tracks a range of different identifiers (URLs, DOIs, PMIDs, PMCID, ArXiv IDs, ISBNs, etc. | Aggregated counts of (re) tweets across multiple versions of the same output. | Raw data is not provided. | Real-time update. | |||
| Twitter GNIP Power Track API. | Tracks DOIs and article landing page URLs. No other identifiers. | Only raw data is provided. | Raw data from Twitter (tweets, retweets, tweeters, followers, etc.) | Real-time update. | ||
| Twitter Search API with rate limit of 1,800 requests per hour. | Tracks DOIs and journal landing page URLs. | Aggregated counts of (re)tweets. | Raw data is not provided. | No information. | ||
| Facebook Graph API. | Same as for Twitter. | Aggregated counts of public Facebook posts. | Raw data is not provided. | No information. | ||
| Same as for Twitter. | Combined counts of all public and private Facebook likes, shares, and comments. | Daily update. | ||||
| Tracks journal landing page URLs. | Raw data is not provided. | No information. | ||||
| Wikipedia API. | Tracks all Wikipedia edits searching for links to scholarly domains, and also clearly labeled identifiers (DOIs and PMIDs). | Count of Wikipedia mentions in the references of English pages based on the 2016 version of altmetric.com. | Raw data is available. | Real-time update. | ||
| Retrieves Wikipedia mentions by mining search engine results, watching Wikipedia pages for citation changes, and mining the full text of all Wikipedia pages. | Tracks only URLs. | Count of Wikipedia mentions in the references of English pages (Plum Analytics has recently announced the tracking of Spanish and Portuguese language Wikipedia entries: | Raw data is not provided. | Daily update. | ||
| MediaWiki API. | Tracks DOIs and URLs. | Aggregated score of mentions in the Wikipedia pages and files. | Wikipedia mentions in the references of the 25 most popular languages of Wikipedia pages ( | No information | ||
| Wikipedia MediaWiki Event Streams and the MediaWiki APIs. | Only tracks DOIs and article landing page URLs (not any other identifiers). | Count is not provided. | Raw data on Wikipedia mentions in the references of both old and new versions (edits) of English and non-English pages. | Real-time update. | ||
Fig 1Examples of different readership counts across different altmetric aggregators: Plum Analytics vs. Mendeley (accessed on 15 December 2017).
Fig 2Examples of different readership counts across different altmetric aggregators: Plum Analytics, Altmetric.com, and Lagotto vs. Mendeley (accessed on 15 December 2017).
Fig 3Examples of different Mendeley readership counts across different altmetric aggregators: Mendeley, Plum Analytics, Altmetric.com, and Lagotto are presented orderly (accessed on 29 November 2017).
Fig 4Examples of different Facebook counts across different altmetric aggregators: Plum Analytics, Lagotto, and Altmetirc.com are presented orderly (accessed on 29 November 2017).
Fig 5Examples of different tweets (tweeters) across different altmetric aggregators: Plum Analytics, Altmetric.com, and Lagotto are presented orderly (accessed on 29 November 2017).
Fig 6Examples of Wikipedia counts for an object reported by Lagotto (accessed on 29 November 2017).
Fig 7Examples of different Wikipedia mentions across different altmetric aggregators: Lagotto, Altmetric.com, and Plum Analytics are presented orderly (accessed on 29 November 2017).