Abstract
With big data strategies being implemented in countries around the world, big data has developed rapidly in many fields, and big data research and application practice have quickly attracted attention in the library and information field. Objective. This study explored the current state of research and the research hotspots of big data in the library and information field and discussed future research trends. Methods. In the CNKI database, 16 CSSCI source journals in the discipline of library information and digital library were selected as data sources, and the relevant literature was retrieved with the theme of "big data." The retrieved literature was then screened and expanded according to citation relationships. Co-word analysis and cluster analysis were carried out on the resulting literature with the help of Bicomb and SPSS. Results. The data analysis shows that the research hotspots fall into five major themes: big data and smart library, big data and intelligence research, data mining and cloud computing, big data and information analysis, and library innovation and services. Limitations. The scope and coverage of research on this topic are currently wide, so the research remains at the macro level. Conclusions. Big data research will remain one of the hotspots in the future. However, most studies are still limited to the perspective of library and information science and have not yet analyzed the research status, research hotspots, and development trends in this field from the perspective of big data knowledge structure. Moreover, machine learning, artificial intelligence, knowledge services, AR, and VR may be new directions for future attention and development.
Year: 2022 PMID: 35958385 PMCID: PMC9357669 DOI: 10.1155/2022/2802835
Source DB: PubMed Journal: J Environ Public Health ISSN: 1687-9805
Distribution of retrieved studies among 16 journals.
| Number | Journals | Number of studies issued | Percentage (%) |
|---|---|---|---|
| 1 | | 217 | 14.13 |
| 2 | | 203 | 13.22 |
| 3 | | 162 | 10.55 |
| 4 | | 151 | 9.83 |
| 5 | | 149 | 9.70 |
| 6 | | 119 | 7.75 |
| 7 | | 97 | 6.32 |
| 8 | | 88 | 5.73 |
| 9 | | 78 | 5.08 |
| 10 | | 59 | 3.84 |
| 11 | | 57 | 3.52 |
| 12 | | 47 | 3.06 |
| 13 | | 34 | 2.21 |
| 14 | | 33 | 2.15 |
| 15 | | 24 | 1.56 |
| 16 | | 21 | 1.37 |
The weight results of formula (3).
| Source | Target | Weight |
|---|---|---|
| A | B | 1 |
| A | C | 2 |
| A | D | 1 |
| B | C | 1 |
| B | D | 1 |
| C | D | 1 |
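Formula (3) is not reproduced in this record, so the following is only a minimal sketch under the assumption that the weight of a pair is the number of records in which both items appear together. The function name `cooccurrence_weights` and the two sample records are hypothetical; the records were chosen only because they reproduce the weights listed in the table above.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_weights(keyword_sets):
    """Count, for each unordered pair, how many records contain both items."""
    weights = Counter()
    for keywords in keyword_sets:
        # Deduplicate and sort so each pair is counted once per record
        # and always appears as (source, target) in alphabetical order.
        for source, target in combinations(sorted(set(keywords)), 2):
            weights[(source, target)] += 1
    return weights


# Hypothetical records (an assumption, not data from the paper).
records = [["A", "B", "C", "D"], ["A", "C"]]
weights = cooccurrence_weights(records)

for (source, target), weight in sorted(weights.items()):
    print(source, target, weight)
```

Only the pair A and C occurs in both sample records, which is why it alone receives a weight of 2 under this assumption.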
Figure 1. The distribution trend of the publication volume with the theme of “big data” in the library and information science in China from 2011 to 2020.
The top 30 high-frequency keywords.
| Keywords | Frequency |
|---|---|
| Big data | 241 |
| Library | 135 |
| Smart library | 63 |
| Big data era | 55 |
| Data mining | 43 |
| Information service | 32 |
| Cloud computing | 30 |
| Digital library | 28 |
| Knowledge service | 25 |
| Data analysis | 22 |
| Data processing | 18 |
| Competitive intelligence | 18 |
| Library service | 15 |
| Cloud library | 12 |
| Personalized service | 11 |
| Digital resources | 10 |
| Public library | 10 |
| Knowledge consultation | 9 |
| Internet of things | 8 |
| Intelligence analysis | 8 |
| Unstructured data | 7 |
| Resource construction | 7 |
| Information resource | 6 |
| Information analysis | 6 |
| Information technology | 6 |
| Bibliometrics | 5 |
| Library big data | 5 |
| Data environment | 5 |
| Information science | 5 |
| Big data knowledge service | 5 |
Co-occurrence matrix of the top 30 high-frequency keywords (partial).
| | Big data | Library | Smart library | Big data era | Data mining | Information service | Cloud computing | Digital library |
|---|---|---|---|---|---|---|---|---|
| Big data | 241 | 0 | 10 | 5 | 12 | 9 | 12 | 8 |
| Library | 0 | 135 | 9 | 12 | 1 | 4 | 0 | 6 |
| Smart library | 10 | 9 | 63 | 7 | 6 | 5 | 6 | 0 |
| Big data era | 5 | 12 | 7 | 55 | 8 | 3 | 2 | 1 |
| Data mining | 12 | 1 | 6 | 8 | 43 | 4 | 2 | 0 |
| Information service | 9 | 4 | 5 | 3 | 4 | 32 | 0 | 3 |
| Cloud computing | 12 | 0 | 6 | 2 | 2 | 0 | 30 | 1 |
| Digital library | 8 | 6 | 0 | 1 | 0 | 3 | 1 | 28 |
Similarity matrix of the top 30 high-frequency keywords (partial).
| | Big data | Library | Smart library | Big data era | Data mining | Information service | Cloud computing | Digital library |
|---|---|---|---|---|---|---|---|---|
| Big data | 1.0000 | 0.0000 | 0.0812 | 0.0434 | 0.1179 | 0.1025 | 0.1411 | 0.0972 |
| Library | | 1.0000 | 0.0976 | 0.1393 | 0.0131 | 0.0609 | 0.0000 | 0.0976 |
| Smart library | | | 1.0000 | 0.1189 | 0.1153 | 0.1114 | 0.1380 | 0.0000 |
| Big data era | | | | 1.0000 | 0.1645 | 0.0715 | 0.0492 | 0.0255 |
| Data mining | | | | | 1.0000 | 0.1078 | 0.0557 | 0.0000 |
| Information service | | | | | | 1.0000 | 0.0000 | 0.1002 |
| Cloud computing | | | | | | | 1.0000 | 0.0345 |
| Digital library | | | | | | | | 1.0000 |
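The record does not state how the similarity values were derived. The listed values are, however, consistent with the Ochiai coefficient, that is, the co-occurrence count of two keywords divided by the square root of the product of their individual frequencies. The sketch below assumes that formula and applies it to the first four keywords of the co-occurrence matrix; the function name `ochiai_similarity` is hypothetical.

```python
import math

# First four keywords of the co-occurrence matrix; the diagonal holds
# each keyword's own frequency.
keywords = ["Big data", "Library", "Smart library", "Big data era"]
cooccurrence = [
    [241, 0, 10, 5],
    [0, 135, 9, 12],
    [10, 9, 63, 7],
    [5, 12, 7, 55],
]


def ochiai_similarity(matrix):
    """Return c_ij / sqrt(c_ii * c_jj) for every pair (assumed formula)."""
    n = len(matrix)
    return [
        [matrix[i][j] / math.sqrt(matrix[i][i] * matrix[j][j]) for j in range(n)]
        for i in range(n)
    ]


similarity = ochiai_similarity(cooccurrence)

for name, row in zip(keywords, similarity):
    print(name, [f"{value:.4f}" for value in row])
```

Under this assumption, the pair "Library" and "Big data era" evaluates to 12 / sqrt(135 × 55), which rounds to the 0.1393 shown in the table.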
Dissimilarity matrix of the top 30 high-frequency keywords (partial).
| | Big data | Library | Smart library | Big data era | Data mining | Information service | Cloud computing | Digital library |
|---|---|---|---|---|---|---|---|---|
| Big data | 0.0000 | 1.0000 | 0.9188 | 0.9566 | 0.8821 | 0.8975 | 0.8589 | 0.9028 |
| Library | | 0.0000 | 0.9024 | 0.8607 | 0.9869 | 0.9391 | 1.0000 | 0.9024 |
| Smart library | | | 0.0000 | 0.8811 | 0.8847 | 0.8886 | 0.8620 | 1.0000 |
| Big data era | | | | 0.0000 | 0.8355 | 0.9285 | 0.9508 | 0.9745 |
| Data mining | | | | | 0.0000 | 0.8922 | 0.9443 | 1.0000 |
| Information service | | | | | | 0.0000 | 1.0000 | 0.8998 |
| Cloud computing | | | | | | | 0.0000 | 0.9655 |
| Digital library | | | | | | | | 0.0000 |
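Each dissimilarity value listed here equals one minus the corresponding similarity value. The sketch below reproduces that conversion for a four-keyword subset of the similarity table and then picks the pair with the smallest dissimilarity. Treating that pair as the first candidate for merging is an assumption about how an agglomerative cluster analysis would typically proceed; the record does not describe the settings used in SPSS.

```python
# Similarity values copied from the similarity table (four-keyword subset).
similarity = {
    ("Big data", "Library"): 0.0000,
    ("Big data", "Smart library"): 0.0812,
    ("Big data", "Big data era"): 0.0434,
    ("Library", "Smart library"): 0.0976,
    ("Library", "Big data era"): 0.1393,
    ("Smart library", "Big data era"): 0.1189,
}

# Dissimilarity is taken as 1 - similarity, rounded to four decimals
# to match the precision of the table.
dissimilarity = {pair: round(1.0 - value, 4) for pair, value in similarity.items()}

# The least dissimilar pair within this subset.
closest_pair = min(dissimilarity, key=dissimilarity.get)

for pair, value in dissimilarity.items():
    print(pair, f"{value:.4f}")
print("closest pair:", closest_pair)
```

Because only four of the 30 keywords are included, the closest pair found here applies to this subset only and says nothing about the full clustering result.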
Figure 2. The clustering analysis tree diagram of hotspot keywords on the research topic.