| Literature DB >> 33870068 |
Ilya Rahkovsky1, Autumn Toney1, Kevin W Boyack2, Richard Klavans3, Dewey A Murdick1.
Abstract
Our work analyzes the artificial intelligence and machine learning (AI/ML) research portfolios of six large research funding organizations from the United States [National Institutes of Health (NIH) and National Science Foundation (NSF)]; Europe [European Commission (EC) and European Research Council (ERC)]; China [National Natural Science Foundation of China (NNSFC)]; and Japan [Japan Society for the Promotion of Science (JSPS)]. The data for this analysis is based on 127,000 research clusters (RCs) that are derived from 1.4 billion citation links between 104.8 million documents from four databases (Dimensions, Microsoft Academic Graph, Web of Science, and the Chinese National Knowledge Infrastructure). Of these RCs, 600 large clusters are associated with AI/ML topics, and 161 of these AI/ML RCs are expected to experience extreme growth between May 2020 and May 2023. Funding acknowledgments (in the corpus of the 104.9 million documents) are used to characterize the overall AI/ML research portfolios of each organization. NNSFC is the largest funder of AI/ML research and disproportionately funds computer vision. The EC, RC, and JSPS focus more efforts on natural language processing and robotics. The NSF and ERC are more focused on fundamental advancement of AI/ML rather than on applications. They are more likely to participate in the RCs that are expected to have extreme growth. NIH funds the largest relative share of general AI/ML research papers (meaning in areas other than computer vision, natural language processing, and robotics). We briefly describe how insights such as these could be applied to portfolio management decision-making.Entities:
Keywords: artificial intelligence; forecasting; machine learning; map of science; research analysis; research funding agencies; research portfolio analysis
Year: 2021 PMID: 33870068 PMCID: PMC8028401 DOI: 10.3389/frma.2021.630124
Source DB: PubMed Journal: Front Res Metr Anal ISSN: 2504-0537
Figure 1Article deduplication process scheme (step 1).
Figure 2Article deduplication process scheme (step 2).
Figure 3Construction of merged academic dataset.
Breakdown of document counts by languages (in millions), language labels are non-exclusive.
| All documents | 382.8 | 307.1 | 72.1 | 7.6 | 6.1 | 5.3 | 3.7 | 13.0 |
| Unique documents | 237.6 | 159.9 | 33.5 | 5.2 | 4.7 | 4.7 | 2.8 | 12.1 |
| Citation graph | 108.9 | 93.7 | 14.1 | 2.0 | 1.1 | 1.0 | 0.8 | 0.5 |
| Core science | 54.7 | 51.6 | 4.0 | 0.6 | 0.3 | 0.2 | 0.2 | 0.1 |
| Clustered papers | 104.9 | 92.0 | 11.4 | 1.9 | 1.1 | 0.9 | 0.8 | 0.5 |
All Documents include all the scientific publications aggregated from Web of Science, Digital Science, Microsoft Academic Graph, and the Chinese National Knowledge Infrastructure; Unique Documents is the deduplicated set of scientific publications; Citation Graph contains the articles with at least one citation or one reference; Core Science contains the articles with at least one citation and one reference; and Clustered Papers contains the core science data clustered using the Leiden algorithm with additional articles assigned to clusters using direct-citation connections.
Breakdown of document counts by country affiliation (in millions), country labels are non-exclusive.
| All documents | 382.8 | 193.7 | 31.4 | 60.3 | 10.9 | 56.7 |
| Unique documents | 237.5 | 157.6 | 12.6 | 24.8 | 4.5 | 23.9 |
| Citation graph | 108.9 | 44.0 | 10.9 | 20.6 | 3.7 | 19.3 |
| Core science | 54.7 | 10.0 | 6.4 | 15.2 | 2.7 | 14.2 |
| Clustered papers | 104.9 | 40.6 | 10.6 | 20.5 | 3.7 | 19.2 |
All Documents include all the scientific article publications from Web of Science, Digital Science, Microsoft Academic Graph, and the Chinese National Knowledge Infrastructure, while Unique Documents implements article-level disambiguation on this set. Citation Graph contains the articles with at least one citation or one reference, Core Science contains the articles with at least one citation and one reference, and Clustered Papers contains the core science data clustered using the Leiden algorithm with additional articles assigned to clusters using direct-citation connections.
Breakdown of document counts by funding agencies (in millions), funding agency acknowledgments are non-exclusive.
| All documents | 382.8 | 328.1 | 2.3 | 0.7 | 2.1 | 8.0 | 11.3 | 2.3 | 32.3 |
| Unique documents | 237.5 | 219.3 | 0.7 | 0.2 | 0.7 | 2.9 | 4.2 | 0.7 | 10.7 |
| Citation graph | 108.9 | 90.9 | 0.7 | 0.2 | 0.7 | 2.9 | 4.0 | 0.7 | 10.5 |
| Core science | 54.7 | 39.7 | 0.6 | 0.2 | 0.6 | 2.5 | 2.6 | 0.6 | 8.7 |
| Clustered papers | 104.9 | 86.9 | 0.7 | 0.2 | 0.7 | 2.9 | 4.0 | 0.7 | 10.5 |
Figure 4Map of science: 126,915 active research clusters (RCs) creating an inferred structural basis for analysis from direct citation links between tens of millions of articles across four data sources.
Publications in AI research clusters: production by region, funding organization, and presence in extreme growth RCs.
| CN | 355,068 | 127,378 | 35.8 | NNSFC | 256,526 | 19.3 | 102,326 |
| EU | 187,624 | 53,867 | 28.7 | EC | 18,731 | 1.3 | 4,702 |
| ERC | 7,037 | 0.2 | 2,999 | ||||
| JP | 40,430 | 11,132 | 27.5 | JSPS | 16,139 | 1.0 | 4,954 |
| U.S. | 183,355 | 72,259 | 39.4 | NIH | 18,821 | 0.4 | 8,223 |
| NSF | 22,984 | 1.7 | 9,327 |
Article content is aggregated from Web of Science, Digital Science, Microsoft Academic Graph, and the Chinese National Knowledge Infrastructure databases and includes publications from 600 research clusters with at least a 50% share of AI-relevant articles and contain at least 20 articles published between May 2019 and May 2020.
Funding organization participation in extreme growth AI research clusters (RCs).
| CN | NNSFC | 30.7 | 69.3 | 70.2 |
| EU | EC | 21.9 | 78.1 | 9.3 |
| ERC | 36.2 | 63.8 | 3.8 | |
| JP | JSPS | 26.7 | 73.3 | 36.7 |
| U.S. | NIH | 30.0 | 70.0 | 8.1 |
| NSF | 35.0 | 65.0 | 12.0 |
Article content is aggregated from Web of Science, Digital Science, Microsoft Academic Graph, and the Chinese National Knowledge Infrastructure databases and includes publications from 600 research clusters with at least a 50% share of AI-relevant articles and contain at least 20 articles published between May 2019 and May 2020.
Top five extreme growth research clusters for each category of AI.
| AI/ML | 1. Interpretable machine learning | NSF, NIH, EC, ERC, JSPS | 9 | 32 | 4 | 32 |
| 2. Multiple attribute decision | NNSFC | 56 | 5 | 0 | 3 | |
| 3. Overparameterized NNs | NSF, ERC | 9 | 13 | 2 | 55 | |
| 4. Bias/Fairness in AI/ML | NSF, ERC, EC, JSPS, NIH | 1 | 28 | 2 | 48 | |
| 5. Actor-critic models | JSPS, ERC | 22 | 20 | 5 | 30 | |
| CV | 1. Deep learning (GANs) | NNSFC, NSF, JSPS, NIH, ERC | 30 | 17 | 5 | 25 |
| 2. R-CNN object detection | NNSFC | 58 | 9 | 3 | 11 | |
| 3.3D object classification | NNSFC, NSF, ERC, EC, JSPS, NIH | 34 | 18 | 3 | 22 | |
| 4. Adversarial neural networks | NSF, NIH, ERC | 24 | 15 | 3 | 36 | |
| 5. Depth estimation | NNSFC, NSF, EC, ERC | 35 | 18 | 3 | 20 | |
| NLP | 1. Natural language inference | ERC | 23 | 13 | 3 | 35 |
| 2. Text completion | NNSFC, ERC, NIH | 28 | 20 | 8 | 22 | |
| 3. Neural conversation models | JSPS | 35 | 12 | 6 | 26 | |
| 4. Cross-lingual word embeddings | JSPS, ERC | 21 | 25 | 5 | 27 | |
| 5. Hate speech detection | NSF, EC, JSPS | 4 | 28 | 2 | 27 | |
| RO | 1. Soft robotics | NNSFC, NSF, JSPS, EC, NIH, ERC | 24 | 19 | 10 | 23 |
| 2. Imitation learning | NSF, EC, NIH, JSPS, ERC | 9 | 22 | 4 | 44 | |
| 3. Gaussian processes | ERC, EC, JSPS, NSF | 5 | 29 | 4 | 20 | |
| 4. Visual odeometry | NNSFC, NSF, EC, ERC, NIH | 33 | 18 | 1 | 26 | |
| 5. Autonomous driving | NNSFC, NSF | 24 | 19 | 3 | 27 |
All funders are listed in order from highest contributor to lowest.
Figure 5Breakdown of research clusters (RCs) by AI field.
Figure 6Research cluster (RC) funding portfolio breakdown by funder and AI field.