| Literature DB >> 35971409 |
Ebru Çağlayan Akay1, Naciye Tuba Yılmaz Soydan1, Burcu Kocarık Gacar2.
Abstract
An extensive literature providing information on published materials in machine learning exists. However, machine learning is still a rather new concept in the fields of economics and econometrics. This study aims to identify different properties of published documents about machine learning in economics and econometrics and therefore to draw a detailed picture of recent publications from bibliometric analysis perspectives. For the aim of the study, the data are collected from the publications indexed by Web of Science and Scopus databases from the period 1991 to 2020. Inthe study, the data have been illustrated by VOSviewer for science mapping. The analysis of variance has also been used to identify the links between the number of citations of articles and years. The findings obtained provides information about the studies on machine learning in the relevant field conducted in the past, as well as providing an opportunity to gain knowledge about the researched area by shedding light on what the future research areas would be. There is no doubt that it attracts attention has increased significantly on machine learning in the field of economics and econometrics and academic publications on machine learning in the relevant field have increased over the last decade.Entities:
Keywords: Bibliometric analysis; Econometrics; Economics; Machine learning; Science mapping; Scopus; Web of science
Year: 2022 PMID: 35971409 PMCID: PMC9365204 DOI: 10.1007/s13278-022-00916-6
Source DB: PubMed Journal: Soc Netw Anal Min
Fig. 1Flowchart according to the documents identification
Fig. 2Number of publications by year
Number and distribution of publications by document type
| Document type | Web of science | Scopus | ||
|---|---|---|---|---|
| 2010–2020 | 2010–2020 | |||
| Number | Percent (%) | Number | Percent (%) | |
| Journal articles | 532 | 63.3 | 1263 | 44.3 |
| Conference papers | 223 | 26.6 | 1245 | 43.7 |
| Reviews | 59 | 7.0 | 161 | 5.6 |
| Others* | 26 | 3.1 | 183 | 6.4 |
| Total | 840 | 100 | 2852 | 100 |
*It expresses books, book chapters and editorial materials
the countries that published the most articles and number of ascriptions made to them
| Web of science | Scopus | ||||
|---|---|---|---|---|---|
| Country | Number of documents | Number of citations | country | Number of documents | Number of citations |
| US | 311 | 4260 | US | 799 | 12,222 |
| China | 119 | 1212 | China | 635 | 4580 |
| UK | 85 | 1049 | UK | 223 | 3861 |
| Germany | 51 | 467 | India | 148 | 648 |
| Australia | 42 | 599 | Germany | 131 | 1315 |
| Italy | 38 | 268 | Canada | 91 | 1056 |
| Spain | 34 | 175 | Italy | 89 | 1118 |
| France | 27 | 391 | Russia | 86 | 294 |
| Canada | 26 | 177 | Australia | 84 | 1265 |
| India | 24 | 41 | S. Korea | 71 | 602 |
| Russia | 23 | 22 | Spain | 69 | 567 |
| S. Korea | 20 | 158 | France | 61 | 790 |
| Netherlands | 19 | 63 | Japan | 59 | 585 |
| Switzerland | 19 | 85 | Switzerland | 46 | 616 |
| Japan | 17 | 48 | Brazil | 43 | 265 |
| Taiwan | 16 | 319 | Netherlands | 43 | 1028 |
| Singapore | 14 | 281 | Taiwan | 42 | 633 |
| Austria | 12 | 105 | Hong Kong | 41 | 612 |
| Sweden | 11 | 140 | Greece | 37 | 245 |
| Turkey | 10 | 37 | Singapore | 36 | 696 |
Fig. 3Network structure relationship among countries
Institutions and organizations that published the most articles in 2010–2020 and the number of ascriptions made to them
| Web of science | Scopus | ||||||
|---|---|---|---|---|---|---|---|
| Organization | Doc | Cit | Country | Organization | Doc | Cit | Country |
| Univ. of California | 21 | 590 | US | Univ. of California | 85 | 3843 | US |
| Harvard Univ | 15 | 340 | US | Stanford Univ | 62 | 2536 | US |
| Oxford Univ | 13 | 243 | UK | Chinese Academy of S | 56 | 582 | China |
| MIT | 12 | 510 | US | Harvard Univ | 46 | 1691 | US |
| Stanford Univ | 12 | 378 | US | Oxford Univ | 39 | 536 | UK |
| Carnegie Mellon Univ | 12 | 131 | US | Univ. of Pennsylvania | 27 | 210 | US |
| Univ. of Pennsylvania | 11 | 121 | US | Carnegie Mellon Univ | 22 | 438 | US |
| Univ. Illinois | 11 | 34 | US | Beijing Jiaotong Univ | 21 | 109 | China |
| New York Univ | 9 | 333 | US | Univ. of Texas | 20 | 90 | US |
| Univ. of Chicago | 9 | 146 | US | Imperial College London | 19 | 317 | UK |
| Univ. College London | 8 | 287 | UK | Univ. of Washington | 19 | 253 | US |
| Univ of California San Diego | 7 | 210 | US | Univ. Illinois | 18 | 137 | US |
| Chinese Academy of S | 7 | 155 | China | Princeton Univ | 17 | 581 | US |
| Rutgers Univ | 7 | 30 | US | Rutgers Univ | 16 | 80 | US |
| Yale Univ | 6 | 184 | US | Univ. of Chicago | 15 | 279 | US |
| Univ. Southampton | 5 | 290 | UK | National Research Univ | 15 | 15 | Russia |
| Google Inc | 4 | 355 | US | MIT | 11 | 116 | US |
| Imperial Collage London | 4 | 190 | UK | Univ. of Cambridge | 10 | 238 | US |
| National Bureau of Economic Res | 3 | 190 | US | Duy Tan Univ | 10 | 34 | Vietnam |
| National Taiwan Univ | 2 | 193 | Taiwan | Yale Univ | 10 | 300 | US |
Doc. and Cit. represent “number of documents” and “number of citations”, respectively. The order in the table is made from the highest number of documents to the least
Fig. 4Intensity map among documents (First Author’s Name Abbreviated)
Fig. 5Intensity map among authors
Ranking of authors with the highest number of publications and who are ascribed to the most
| Web of science | Scopus | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Number of ascriptions | Number of documents | Number of ascriptions | Number of documents | ||||||||
| Author | Docs | Cit | Author | Docs | Cit | Author | Docs | Cit | Author | Docs | Cit |
| Varian, H. R | 2 | 353 | Peysakhovich, A | 4 | 13 | Foley, A. M | 1 | 709 | Wang, Y | 29 | 324 |
| Azaria, A | 1 | 350 | Mullainathan, S | 3 | 186 | Leahy, P. G | 1 | 709 | Zhang, Y | 27 | 317 |
| Ghose, A | 1 | 226 | Athey, S | 3 | 171 | Marvuglia, A | 1 | 709 | Wang, J | 22 | 351 |
| Li, B | 1 | 226 | Hoi, Steven C. H | 3 | 124 | Mckeogh, E. J | 1 | 709 | Li, Y | 21 | 378 |
| Ipeirotis, P. G | 1 | 226 | Li, B | 3 | 124 | Azaria, A | 1 | 611 | Chen, Y | 18 | 108 |
| Mullainathan, S | 3 | 186 | Swanson, N. R | 3 | 19 | Ekblaw, A | 1 | 611 | Wang, L | 18 | 144 |
| Spiess, J | 1 | 183 | Varian, Hal R | 2 | 353 | Lippman, A | 1 | 611 | Liu, Y | 17 | 110 |
| Rust, R. T | 2 | 174 | Rust, Roland T | 2 | 174 | Vieira, T | 1 | 611 | Liu, J | 16 | 242 |
| Loebbecke, C | 1 | 173 | Imbens, Guido W | 2 | 153 | Zuboff, S | 1 | 552 | Li, X | 15 | 134 |
| Picot, A | 1 | 173 | Akter, S | 2 | 146 | Babuška, R | 1 | 537 | Liu, X | 15 | 326 |
| Athey, S | 3 | 171 | Shi, Y | 2 | 76 | Buşoniu, L | 1 | 537 | Wang, S | 15 | 154 |
| West, R | 1 | 171 | Acquisti, A | 2 | 73 | De Schutter, B | 1 | 537 | Wu, J | 14 | 66 |
| Yardley, L | 1 | 171 | Hu, J | 2 | 24 | Ernst, D | 1 | 537 | Zhang, J | 14 | 55 |
| Huang, M.-H | 1 | 167 | Zhang, Y | 2 | 24 | Bates, D. W | 3 | 459 | Lee, J | 13 | 94 |
| Huang, G.Q | 1 | 161 | Vieira, T | 1 | 353 | Escobar, G | 1 | 444 | Li, J | 12 | 30 |
| Lan, S | 1 | 161 | Ghose, A | 1 | 226 | Ohno-Machado, L | 1 | 444 | Li, Z | 12 | 16 |
| Newman, S. T | 1 | 161 | Li, B | 1 | 226 | Saria, S | 1 | 444 | Wang, H | 12 | 31 |
| Zhong, R. Y | 1 | 161 | Ipeirotis, P. G | 1 | 226 | Shah, A | 1 | 444 | Yang, Y | 12 | 182 |
| Imbens, G. W | 2 | 153 | Spiess, J | 1 | 183 | Varian, H. R | 3 | 436 | Zhang, L | 12 | 74 |
| Einav, L | 1 | 147 | Loebbecke, C | 1 | 173 | Xie, M | 4 | 395 | Liu, H | 11 | 76 |
Doc. and Cit. represent “number of documents” and “number of citations”, respectively. The number of ascriptions column in the table are listed from the highest number of citations to the least and the number of documents column are listed from the highest number of documents to the least
Fig. 6The most frequently used keywords (2010–2020)
Results of ANOVA
| ANOVA | Average degree | Sum of Squares | df | Mean Square | ||
|---|---|---|---|---|---|---|
| Economics | Between Groups | 3834.455 | 10 | 383.445 | 5.269 | .006 |
| Inside Groups | 800.500 | 11 | 72.773 | |||
| Total | 4634.955 | 21 | ||||
| Econometrics | Between Groups | 237.273 | 10 | 23.727 | 4.110 | .014 |
| Inside Groups | 63.500 | 11 | 5.773 | |||
| Total | 300.773 | 21 | ||||
| Machine Learning | Between Groups | 24,788.364 | 10 | 2478.836 | 2.469 | .077 |
| Inside Groups | 11,044.000 | 11 | 1004.000 | |||
| Total | 35,832.364 | 21 |
ANOVA is used to by separate variation into components such as between groups and within groups. The total variation is the sum of the between groups variation and within groups variation for each group. df degrees of freedom
Use of Keywords according to Year
| Web of science | Scopus | |||
|---|---|---|---|---|
| Years | Frequently used keywords | Frequency* (respectively) | Frequently used keywords | Frequency*(respectively) |
| 2010 | Data Mining, Economics | 4,2 | Machine Learning, Data Mining | 3,2 |
| 2011 | Economics, Machine Learning, Algorithm, Big Data, Causality, Econometrics | 4,3,2,1,1,1 | Machine Learning | 8 |
| 2012 | Data Mining, Economics, Machine Learning | 3,2,2 | Machine Learning, Big Data | 4,2 |
| 2013 | Data Mining, Economics, Machine Learning, Time Series Analysis | 4,3,2,2 | Machine Learning, Big Data, Smart Grid, Data Mining, Electronic Health Record, Text Mining, Time Series Analysis | 9,6,2,2,2,2,2 |
| 2014 | Economics, Big Data, Machine Learning, Data Mining, Algorithms, Econometrics | 9,7,5,4,4,2,1 | Big Data, Machine Learning, Data Mining, Economics, Econometrics, Cloud Computing, Data Analytics, Data Science, Ensemble Learning Healthcare, Modelling, Predictive Analytics | 22,11,3,3,3,2,2,2,2,2,2 |
| 2015 | Big Data, Economics, Machine Learning, Data Mining, Algorithms, Causality-Causal Inference, Econometrics | 25,13,8,6,3,2,1 | Big Data, Machine Learning, Cloud Computing, Classification, Analytics, Time Series, Data Mining, Finance, Artificial Intelligence, Creative Economy, Data Analytics, Economics, Optimization, Predictive Analytics, Regression, Information Economics, Innovation | 66,19,10,4,4,3,3,3,2,2,2,2,2,2,2,2, |
| 2016 | Big Data, Economics, Data Mining/Data Analytics, Machine Learning, Time Series Analysis, Causality-Causal Inference, Econometrics, Parameter Estimation | 34,22,11,12,16,7,3,4,2 | Big Data, Machine Learning, Cloud Computing, Data Mining, Economics, Classification, Econometrics, Artificial Neural Networks, Data Integration, Forecasting, Analytics, Artificial Intelligence, China, Data Management, Data Science, Economic Development, Genetic Algorithm, Optimization, Text Mining, Time Series, Industry 4.0,Internet of Things, Casual Inference, Causality, Data Analysis, Economic Growth, Experimental Economics, Financial Forecasting, Macroeconomics, Neural Network, Regression Analysis | 80,25,12,11,8,6,6,4,4,4,3,3,3,3,3,3,3,3,3,3,2,2,2,2,2,2,2, 2,2,2,2 |
| 2017 | Big Data, Economics, Machine Learning, Data Mining/Data Analytic, Artificial Intelligence, Econometrics, Internet of Things, Prediction & Forecasting, Causality-Causal Inference, Behavioural Economics, Computational Social Science, Business Intelligence, Game Theory, Spatial Econometrics, Financial Econometrics | 42,28,20,17,5,44,4,3,2,2,2,2, 2,2,1 | Big Data, Machine Learning, Internet of Things, Cloud Computing, Classification, Data Mining, Artificial Intelligence, Prediction, Regression, Decision Making, Data Science, Economic Growth, Economics, Analytics, Behavioral Economics, Data Analysis, Deep Learning, Forecasting, Game Theory, Health Economics, Simulation, Spatial Econometrics, Support Vector Machine, Innovation, Time-Series, Behavioral Finance | 97,51,10,9,7,7,6,5,4,4,4,4,4,3,3,3,3,3,3,3,3,3,3,2 |
| 2018 | Big Data, Machine Learning, Economics, Data Mining/Analysis, Prediction & Forecasting Artificial Intelligence, Econometrics, Digital Platforms, Causality-Causal Inference, Deep Learning, Optimization, IOT, Behavioural Economics, Industry 4.0, Healthcare, Spatial Econometrics, Model Selection, Cloud Computing, Explanatory Econometrics, Machine Learning Econometrics | 35,35 30,24, 10,10, 10,4, 3,3,3, 3,3,3, 3,3,2, 2,1,1 | Big Data, Machine Learning, Cloud Computing, Data Mining, Artificial Intelligence, Industry 4.0, Economics, Naïve Bayes, Internet of Things, Deep Learning, Forecasting, Agriculture, Analytics, Neural Network, Text Mining, Time Series, Behavioral Economics, Causal Inference, Data Analysis, Data Analytics, Economic Growth, Optimization, Sentiment Analysis | 98,79,12,12,149,12,5,9,8,6,4,4,4,4,5,3,3,3, 3,3,3,3 |
| 2019 | Big Data, Machine Learning, Economics, Network Analysis, Artificial Intelligence, Causal Inference /Methods, Deep Learning, Algorithms, Statistical Data Analysis, Social Media, Econometrics, Behavioural Economics, Support Vector Machine, Cloud Computing, Artificial Neural Networks, Classification, Financial Econometrics, Bayesian Econometrics | 50,47,38,13,12,9,7,5,5,5,4,4,4,3,3,1 | Machine Learning, Big Data, Artificial Intelligence, Deep Learning, Classification, Reinforcement Learning, Neural Networks, Data Science, Economics, Industry 4.0, Forecasting, Data Mining, Time Series, Internet of Things, Data Mining, Prediction, Digital Economy, Sharing Economy, Sustainability, Causal Inference, Cloud Computing, Econometrics, Optimization, Decision Making, Digitalization, Sentiment Analysis, Smart Cities, Support Vector Machine, Text mining, Artificial Neural Network, Behavioral Economics, Data Analytics, Health Economics, Demand Response, Simulation, Cluster Analysis, Innovation | 144,128,30,20,13,13,15,10,10,10,9,8,8,7,8,8,7,7,7,6,6,6,6,5,5,5,10,10,5,4,4,4,4,4,4,8,4 |
| 2020 | Economics, Machine Learning, Big Data, Internet of Things, Prediction, Computational Social Sciences, Deep Learning, Industry 4.0, Neural Networks, Algorithms, Stock Markets, Data Analysis, Data Science, Behavioural Finance, Optimization, Sharing Economy, Support Vector Machine, Natural Language Processing, Econometrics, Financial Econometrics Cloud Computing, Causality-Causal Inference, Structural Econometrics | 53,41,36,15,10,9,9, 9,9,6,6,5,5,4,4,3,3,3,4,3,2,1 | Machine Learning, Big Data, Artificial Intelligence, Neural Networks, Internet of Things, Covid-19 Pandemic, Artificial Neural Network, Support Vector Machine, Regression, Time Series, Industry 4.0, Behavioral Economics, Deep Learning, Economics, Forecasting, Data Science, Digitalization, Random Forest, Economic Growth, Optimization, Reinforcement Learning, Blockchain, Cloud Computing, China, Clustering, Data Analytics, Data Mining, Digital Economy, Agriculture, Classification, Climate Change, Data Visualization, Decision Making, Econometrics, Financial Econometrics, Genetic Algorithm, Sentiment Analysis, Smart cities, Social Networks, Supervised Learning, Sustainability, Casual Inference, Data Analysis | 140,115,35,16,14,13,10,7,10,7,5,21,16,10,9,8,13,6,6,6,5,5,5,8,5,5,5,4,4,4,4,4,4,4,4,4,4,4,4,4,3,3 |
*Frequency implies how many times the specified keyword was used in the specified year
Main congresses that published the highest number of articles and that were ascribed to the most in 2010–2020
| Web of science | Scopus | ||||||
|---|---|---|---|---|---|---|---|
| Number of documents | Number of ascriptions | Number of documents | Number of ascriptions | ||||
| Congress | Document | Congress | Citation | Congress | Documents | Congress | Citation |
| Proceedings of the 21st ACM International Conference on Knowledge Discovery and Data Mining | 2 | Proceedings of the 2nd International Conference on Open and Big Data (2016) | 358 | Journal of Physics: Conference Series | 52 | 2nd International Conference on Open and Big Data (2016) | 611 |
| Proceedings of the IEEE ACM International Conference on Advances in Social Networks Analysis and Mining (2015) | 2 | Proceedings of the National Academy of Sciences of the United States | 29 | ACM International Conference Proceeding Series | 50 | Proceedings of the ACM International Conference on Knowledge Discovery and Data Mining | 194 |
| International Conference on Computational Intelligence (2015) | 2 | 17th IEEE International Conference on Machine Learning | 18 | CEUR Workshop Proceedings | 39 | Proceedings of the National Academy of Sciences of the USA | 191 |
| IEEE 37th International Conference on Distributed Computing Systems (2017) | 2 | 37th International Conference on Distributed Computing Systems | 12 | E3S Web of Conference | 23 | IEEE International Congress on Big Data (2014) | 140 |
| IEEE 6th International Congress on Big Data (2017) | 2 | Proceedings of the Tenth ACM International Conference on Web Search | 12 | IOP: Conference Series: Earth and Environmental Science | 16 | IEEE/ACM 6th International Conference on Utility and Cloud Computing (2013) | 97 |
| IEEE International Conference on Big Data (2019) | 2 | Proceedings of the IEEE Second International Conference on Big Data (2016) | 11 | IOP: Conference Series: Materials Science and Engineering | 15 | IEEE International Conference on Big Data (2015) | 83 |
| 13th International Technology, Education and Development Conference | 2 | Proceedings of the Twelfth ACM International Conference on Web Search | 11 | Portland International Conference on Management of Engineering and Technology: Managing Technological Entrepreneurship: The Engine for Economic Growth, Proceedings (2018) | 12 | IEEE International Conference on Services Computing (2011) | 76 |
| 14th International Technology, Education and Development Conference | 2 | IEEE 14th International Conference on Industrial Informatics (2016) | 9 | Proceedings of the National Academy of Sciences of the USA | 11 | Proceedings of the ACM International Conference on Knowledge Discovery and Data Mining | 75 |
| 13th International Days of Statistics and Economics | 2 | Thirtieth AAAI Conference on Artificial Intelligence | 7 | IEEE International Conference on Big Data (2018) | 10 | ||
| Proceedings of the Forty-Third Annual ACM Symposium on Theory of Computing | 1 | IEEE International Congress on Big Data (2016) | 7 | International Conference on Intelligent Transportation, Big Data and Smart City (2015) | 8 | Proceedings of the Annual ACM Symposium on Theory of Computing | 73 |
| IEEE 11th International Conference on Ubiquitous Intelligence and Computing (2014) | 1 | 4th International Conference on Teaching and Computational Science | 7 | Proceedings of the ACM International Conference on Knowledge Discovery and Data Mining | 7 | 17th IEEE International Conference on Machine Learning and Applications (2018) | 60 |
| Proceedings of the forty-sixth annual ACM Symposium on Theory of Computing | 1 | 6th International Congress on Big Data | 6 | IEEE International Conference on Big Data (2015) | 6 | Proceedings of the 23rd International Conference on World Wide Web | 45 |
| IEEE Conference on Computational Intelligence for Financial Economics (2014) | 1 | Third European Network Intelligence Conference (2016) | 6 | Proceedings of the 28th International Business Information Management Association Conference Vision 2020: Innovation Management, Development Sustainability, and Competitive Economic Growth | 6 | International Conference on Smart Technologies and Management for Computing Communication, Controls, Energy and Materials (2015) | 43 |
| Proceedings IEEE International Conference on Big Data (2015) | 1 | Ceur Workshop Proceedings | 5 | Conference on Human Factors in Computing Systems | 42 | ||
| Proceedings of the 29th AAAI Conference on Artificial | 1 | IEEE 22nd International Conference on Intelligent Engineering (2018) | 5 | 36th International Conference on Machine Learning (2019) | 5 | 25th International Association for Management of Technology Conference Proceedings: Technology – Future Thinking (2016) | 38 |
| Proceedings 2nd International Conference on Open and Big Data (2016) | 1 | 25th Australasian Software Engineering Conference (2018) | 4 | Matec Web of Conferences | 5 | 32nd International Conference on Machine Learning (2015) | 37 |
| 17th IEEE International Conference on Machine Learning (2018) | 1 | 4th International Scientific Conference: TOSEE | 4 | Proceedings 16th IEEE International Symposium on Parallel and Distributed Processing with Applications | 5 | IEEE International Conference on Big Data (2018) | 36 |
| Proceedings IEEE Second International Conference on Big Data (2016) | 1 | Proceedings of the 2016 ACM Conference on Economics and Computation | 4 | Proceedings of the 29th International Business Information Management Association Conference Education Excellence and Innovation Management Through Vision 2020: From Regional Development Sustainability to Global Economic Growth | 5 | 10th International Conference Management of Large-Scale System Development (2017) | 36 |
| IEEE International Congress on Big Data (2016) | 1 | 13th International Technology, Education and Development Conference | 3 | 2nd IEEE International Conference on Cloud Computing and Big Data Analysis (2017) | 4 | Proceedings of the 12th ACM International Conference on Web Search and Data Mining | 30 |
| 3rd European Network Intelligence Conference (2016) | 1 | IEEE International Conference on Big Data (2019) | 3 | AIES Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics and Society | 4 | IEEE International Conference on Big Data (2017) | 26 |
| AIP Conference Proceedings | Proceedings of the Conference on Traffic and Transportation Studies | 25 | |||||