Abstract
Acknowledgements represent scholars' relationships as part of the research contribution. While co-authors and citations are often provided in well-formatted bibliometric databases, acknowledged individuals are difficult to identify because they appear only within free-text statements in the paper. We identify acknowledged scholars who appear in papers published in open-access journals by referring to the co-author and citation relationships stored in the Microsoft Academic Graph (MAG). The constructed dataset is therefore compatible with MAG, which accelerates and expands the use of acknowledgements as a data source of scholarly relationships, similar to collaboration and citation analysis. Moreover, the implemented code is publicly available; thus, it can be applied in other studies.
Year: 2022 PMID: 35915099 PMCID: PMC9343655 DOI: 10.1038/s41597-022-01585-y
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1 Overview of the process from raw data to identified acknowledged scholars. Starting from the text of papers, three major steps lead to the identification of the acknowledged scholars.
Fig. 2 Proposed method for identifying authors and acknowledged scholars.
List of names that have been manually removed because they were used as institutions or foundations.
| List of institutions |
|---|
| Instituto de Salud Carlos III |
| Albert Einstein |
| La Jolla |
| Fundação de Amparo |
| Marie Curie |
| Generalitat Valenciana |
| Alice Wallenberg |
| Fundação de Amparo |
| Generalitat de Catalunya |
| Marie Skłodowska-Curie |
| Miguel Servet |
| Salud Carlos III |
| Sara Borrell |
| Severo Ochoa |
| KU Leuven |
| Susan G. Komen |
| Deutsche Forschungsgemeinschaft |
| della ricerca |
| Fondazione Umberto Veronesi |
| Ricerca Corrente |
| Liwen Bianji |
| Institut Curie |
| Irene Feroce |
| chapel Hill |
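
The manual removal step can be pictured as a blocklist filter over candidate names. The following is a minimal sketch, assuming exact, case-sensitive string matching; the source does not say how the removal was implemented. `INSTITUTION_NAMES` (abbreviated to a few entries from the list above) and `drop_institution_names` are hypothetical names introduced for illustration.

```python
# Subset of the list above; a real run would include every entry.
INSTITUTION_NAMES = {
    "Albert Einstein",
    "Marie Curie",
    "Severo Ochoa",
    "Susan G. Komen",
    "chapel Hill",
}


def drop_institution_names(candidates):
    """Keep candidate person names that are not on the blocklist.

    Assumption: matching is exact and case-sensitive.
    """
    return [name for name in candidates if name not in INSTITUTION_NAMES]


# Illustrative input mixing scholar names with blocklisted strings.
candidates = ["Takaji Wakita", "Marie Curie", "Roger Mundry", "Severo Ochoa"]
kept = drop_institution_names(candidates)
```

Using a set keeps each lookup constant-time, which matters little for a short list like this one but scales if the blocklist grows.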
List of datasets for acknowledged scholars. All datasets are CSV format files containing acknowledged scholars’ IDs.
| File | Lines | Short description |
|---|---|---|
| compbiology.csv | 3,186 | Acknowledged scholars in papers published in PLOS Computational Biology |
| biology.csv | 5,189 | Acknowledged scholars in papers published in PLOS Biology |
| medicine.csv | 4,263 | Acknowledged scholars in papers published in PLOS Medicine |
| genetics.csv | 11,357 | Acknowledged scholars in papers published in PLOS Genetics |
| ntds.csv | 6,139 | Acknowledged scholars in papers published in PLOS Neglected Tropical Diseases |
| pathogenes.csv | 9,278 | Acknowledged scholars in papers published in PLOS Pathogens |
| plosone.csv | 155,461 | Acknowledged scholars in papers published in PLOS ONE |
| srep.csv | 40,693 | Acknowledged scholars in papers published in Scientific Reports |
Data type for the acknowledged scholars.
| Index | Type | Short description |
|---|---|---|
| DOI | String | DOI of an acknowledging paper |
| PaperId | Integer | PaperId of an acknowledging paper in MAG |
| AcknowledgedId | Integer | Acknowledged scholar's AuthorId in MAG |
| CollaborationApproach | Boolean | True if a scholar is detected by collaboration relationships, otherwise False |
| CitationApproach | Boolean | True if a scholar is detected by citation relationships, otherwise False |
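
A record with these five fields could be read as follows. This is a minimal sketch, assuming each CSV file has a header row with exactly the column names listed above and that Booleans are serialized as `True`/`False`; neither detail is confirmed by the source. The `Acknowledgement` class, the helper functions, and the sample rows (including the placeholder DOIs and IDs) are made up for illustration.

```python
import csv
import io
from dataclasses import dataclass


@dataclass(frozen=True)
class Acknowledgement:
    doi: str
    paper_id: int
    acknowledged_id: int
    collaboration: bool
    citation: bool


def parse_bool(text):
    # Assumption: Booleans appear as "True"/"False" (case-insensitive).
    value = text.strip().lower()
    if value not in ("true", "false"):
        raise ValueError(f"unexpected boolean value: {text!r}")
    return value == "true"


def read_acknowledgements(handle):
    """Parse rows from an open CSV file object into typed records."""
    return [
        Acknowledgement(
            doi=row["DOI"],
            paper_id=int(row["PaperId"]),
            acknowledged_id=int(row["AcknowledgedId"]),
            collaboration=parse_bool(row["CollaborationApproach"]),
            citation=parse_bool(row["CitationApproach"]),
        )
        for row in csv.DictReader(handle)
    ]


# Placeholder rows, not taken from the real dataset.
SAMPLE = """DOI,PaperId,AcknowledgedId,CollaborationApproach,CitationApproach
10.0000/example.1,101,9001,True,False
10.0000/example.1,101,9002,True,True
10.0000/example.2,102,9001,False,True
"""

records = read_acknowledgements(io.StringIO(SAMPLE))
```

For a file on disk, the same function would take `open("plosone.csv", newline="", encoding="utf-8")` in place of the in-memory sample.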
Descriptive statistics of each dataset. Journal names are abbreviated. “PLOS” has been omitted in the PLOS series except for PLOS ONE.
| | Comput Biol | Biol | Med | Genet | NTDs | Pathog | PLOS ONE | Sci Rep |
|---|---|---|---|---|---|---|---|---|
| Number of identified acknowledged scholars | 2,905 | 4,802 | 3,984 | 9,241 | 5,050 | 7,606 | 127,551 | 37,185 |
| Number of papers including identified acknowledged scholars | 1,539 | 2,045 | 1,044 | 4,343 | 2,857 | 4,034 | 73,869 | 20,612 |
| Average number of identified acknowledged scholars per paper | 2.07 | 2.54 | 4.08 | 2.62 | 2.15 | 2.30 | 2.10 | 1.97 |
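
The reported averages can be cross-checked against the other tables. One reading, not stated in the source, is that each average divides the number of lines in the corresponding CSV file by the number of papers, rather than dividing the number of distinct scholars by the number of papers. The sketch below tests that reading using only values copied from the tables above; the variable names are introduced here for illustration.

```python
# Line counts from the dataset table (file order matches the journal order).
FILE_LINES = {
    "Comput Biol": 3186, "Biol": 5189, "Med": 4263, "Genet": 11357,
    "NTDs": 6139, "Pathog": 9278, "PLOS ONE": 155461, "Sci Rep": 40693,
}
# Number of papers including identified acknowledged scholars.
PAPERS = {
    "Comput Biol": 1539, "Biol": 2045, "Med": 1044, "Genet": 4343,
    "NTDs": 2857, "Pathog": 4034, "PLOS ONE": 73869, "Sci Rep": 20612,
}
# Averages as reported in the statistics table.
REPORTED_AVERAGE = {
    "Comput Biol": 2.07, "Biol": 2.54, "Med": 4.08, "Genet": 2.62,
    "NTDs": 2.15, "Pathog": 2.30, "PLOS ONE": 2.10, "Sci Rep": 1.97,
}

# Lines per paper, rounded to two decimals like the reported values.
computed = {name: round(FILE_LINES[name] / PAPERS[name], 2) for name in FILE_LINES}
matches = all(computed[name] == REPORTED_AVERAGE[name] for name in FILE_LINES)
```

Under this reading all eight journals agree with the table, whereas dividing the distinct-scholar count by the paper count does not (for example, 2,905 / 1,539 is about 1.89 for PLOS Computational Biology).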
Numbers of detected scholar IDs by collaboration and citation approaches. Journal names are abbreviated as in Table 2.
| | Comput Biol | Biol | Med | Genet | NTDs | Pathog | PLOS ONE | Sci Rep | Total |
|---|---|---|---|---|---|---|---|---|---|
| Collaboration | 1,433 | 2,436 | 2,369 | 4,478 | 3,084 | 4,042 | 80,679 | 19,567 | 118,088 |
| Citation | 431 | 826 | 278 | 1,736 | 419 | 1,172 | 15,240 | 7,161 | 27,263 |
| Both | 1,157 | 1,706 | 1,424 | 3,726 | 1,836 | 2,936 | 39,916 | 11,580 | 64,281 |
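
A breakdown of this kind can be derived from the two Boolean columns of the dataset. The sketch below assumes the three rows are mutually exclusive (collaboration only, citation only, both), which the source does not state explicitly; `approach_label` and the sample flags are hypothetical. The second half checks the Total column against the per-journal values in the table above.

```python
from collections import Counter


def approach_label(collaboration, citation):
    """Map the two detection flags to one mutually exclusive category."""
    if collaboration and citation:
        return "Both"
    if collaboration:
        return "Collaboration"
    if citation:
        return "Citation"
    return "Neither"


# Made-up (CollaborationApproach, CitationApproach) pairs.
flags = [(True, False), (True, True), (False, True), (True, False)]
tally = Counter(approach_label(collab, cite) for collab, cite in flags)

# Per-journal counts copied from the table above, in column order.
TABLE = {
    "Collaboration": [1433, 2436, 2369, 4478, 3084, 4042, 80679, 19567],
    "Citation": [431, 826, 278, 1736, 419, 1172, 15240, 7161],
    "Both": [1157, 1706, 1424, 3726, 1836, 2936, 39916, 11580],
}
totals = {row: sum(values) for row, values in TABLE.items()}
```

The computed totals (118,088, 27,263, and 64,281) agree with the Total column.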
Fig. 3 Complementary cumulative distribution of in-degree for the acknowledgement network. The parameter α of the power-law distribution is estimated as 3.11.
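
For readers who want to reproduce a plot like this, the sketch below computes an empirical complementary cumulative distribution and a power-law exponent. The source does not say which estimator produced α = 3.11; the continuous maximum-likelihood estimator α = 1 + n / Σ ln(x_i / x_min) is used here as an assumption, and it is only an approximation for integer-valued degrees. The cutoff `x_min = 1.0` and the synthetic sample are hypothetical and stand in for the real in-degree data.

```python
import math
import random
from collections import Counter


def ccdf(values):
    """Return (value, P(X >= value)) for each distinct value, ascending."""
    n = len(values)
    counts = Counter(values)
    remaining = n
    points = []
    for value in sorted(counts):
        points.append((value, remaining / n))
        remaining -= counts[value]
    return points


def power_law_alpha(values, x_min):
    """Continuous maximum-likelihood estimate of the exponent for x >= x_min."""
    tail = [v for v in values if v >= x_min]
    log_sum = sum(math.log(v / x_min) for v in tail)
    return 1.0 + len(tail) / log_sum


# Synthetic data: inverse-transform sampling from a continuous power law
# with exponent 3.11, used only to exercise the estimator.
rng = random.Random(0)
true_alpha = 3.11
sample = [(1.0 - rng.random()) ** (-1.0 / (true_alpha - 1.0)) for _ in range(20000)]
alpha_hat = power_law_alpha(sample, x_min=1.0)

small_ccdf = ccdf([1, 1, 2, 3])
```

Plotting the `ccdf` points on log-log axes gives a curve of the kind shown in the figure; on such axes a power law with exponent α appears as a straight line with slope −(α − 1).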
The ten highest in-degree scholars.
| Name | AcknowledgedId | In-degree |
|---|---|---|
| Heather Thorne | 1979004069 | 1350 |
| Eveline Niedermayr | 2010482067 | 1348 |
| Judi Maskiell | 2054520798 | 1284 |
| Maggie Angelakos | 2616126658 | 1284 |
| Teresa Selander | 2305142036 | 1271 |
| Helena Kemiläinen | 2615861737 | 1264 |
| Michael Stagner | 2577585844 | 1259 |
| Pei Chao | 2790030099 | 1237 |
| Ursula Eilber | 294419845 | 1188 |
| Irja Erkkilä | 2614792144 | 1181 |
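
In-degree can be computed once the network's edges are fixed. The sketch below assumes a directed edge from every author of an acknowledging paper to each scholar acknowledged in it, so that in-degree counts distinct acknowledging authors; the source does not spell out the edge definition, so this is one plausible reading. `paper_authors` is a hypothetical mapping that would come from MAG's paper-author data, and all IDs are placeholders.

```python
from collections import defaultdict


def in_degrees(paper_authors, acknowledgements):
    """Count distinct acknowledging authors for each acknowledged scholar.

    paper_authors: dict mapping PaperId -> iterable of author IDs.
    acknowledgements: iterable of (PaperId, AcknowledgedId) pairs.
    """
    sources = defaultdict(set)
    for paper_id, acknowledged_id in acknowledgements:
        for author_id in paper_authors.get(paper_id, ()):
            sources[acknowledged_id].add(author_id)
    return {target: len(authors) for target, authors in sources.items()}


# Placeholder data: author 3 wrote both papers and is counted once per target.
paper_authors = {101: [1, 2, 3], 102: [3, 4]}
acknowledgements = [(101, 9001), (102, 9001), (101, 9002)]
degrees = in_degrees(paper_authors, acknowledgements)
```

Under this definition a single paper with a very long author list contributes many incoming edges at once, which would be consistent with in-degrees far larger than the per-paper counts in the next table.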
The ten most highly acknowledged scholars counted per paper.
| Name | AcknowledgedId | Acknowledged count |
|---|---|---|
| Takaji Wakita | 1974321678 | 73 |
| Shizuo Akira | 2149472920 | 51 |
| Bert Vogelstein | 679456835 | 46 |
| Feng Zhang | 2256777311 | 42 |
| Noboru Mizushima | 1985327407 | 41 |
| Charles M. Rice | 2235486152 | 40 |
| Roger Mundry | 345639720 | 39 |
| Bernard Moss | 2104435105 | 34 |
| Norbert Perrimon | 174839232 | 32 |
| Kamil Ugurbil | 1996768038 | 31 |
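
Counting per paper means each scholar is counted at most once for a given acknowledging paper. A minimal sketch, assuming records are `(PaperId, AcknowledgedId)` pairs taken from the dataset columns; the function name and sample IDs are hypothetical.

```python
from collections import Counter


def acknowledged_counts(records):
    """Count distinct acknowledging papers per acknowledged scholar."""
    unique_pairs = set(records)  # collapse repeated (paper, scholar) rows
    return Counter(acknowledged_id for _, acknowledged_id in unique_pairs)


# Placeholder pairs; the duplicate (101, 9001) is counted once.
records = [(101, 9001), (101, 9001), (102, 9001), (102, 9002), (103, 9001)]
counts = acknowledged_counts(records)
top = counts.most_common(1)
```

Calling `counts.most_common(10)` on the full dataset would yield a ranking analogous to the table above.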
| Measurement(s) | acknowledgements section • acknowledged scholar |
| Technology Type(s) | natural language processing • social network analysis |