Chunqi Hu, Huaping Gong, Yiqing He.
Abstract
Difficulties in collecting, processing, and identifying massive data have slowed research on cutting-edge science and technology hotspots. Promoting these technologies will not succeed without an effective data-driven method for identifying them. This paper proposes a data-driven model, based on SpaCy, for identifying global cutting-edge science and technology. For this model, we used a Python web crawler to collect data published by 17 well-known American technology media websites from July 2019 to July 2020. The research method combines graph-based neural network learning with active learning. We then trained the model with ten-fold cross-validation over repeated experiments. The experimental results show that the model performed very well in entity recognition tasks, with an F value of 98.11%. The model provides an information source for cutting-edge technology identification. By identifying and tracking such technologies effectively, it can promote innovation in them and help explore more efficient modes of scientific and technological research.
Year: 2022 PMID: 36223420 PMCID: PMC9555621 DOI: 10.1371/journal.pone.0275872
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
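The ten-fold cross-validation mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the helper name `k_fold_indices`, the seed, and the item count of 100 are hypothetical placeholders, and the record does not say how the folds were actually drawn.

```python
import random


def k_fold_indices(n_items, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs for k-fold cross-validation.

    Hypothetical helper for illustration only.
    """
    indices = list(range(n_items))
    random.Random(seed).shuffle(indices)
    # Slice with a stride of k so fold sizes differ by at most one item.
    folds = [indices[i::k] for i in range(k)]
    for held_out in range(k):
        test_idx = folds[held_out]
        train_idx = [
            idx
            for fold_no, fold in enumerate(folds)
            if fold_no != held_out
            for idx in fold
        ]
        yield train_idx, test_idx


# 100 is a placeholder size, not the size of the paper's corpus.
splits = list(k_fold_indices(100, k=10))
for fold_no, (train_idx, test_idx) in enumerate(splits, start=1):
    print(fold_no, len(train_idx), len(test_idx))
```

Each item is held out exactly once, so every fold yields its own precision, recall, and F value, which matches the per-fold layout of the evaluation table.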
Ranking of technology websites.
| Data source | Website address | Techmeme ranking or site description |
|---|---|---|
| Arstechnica | | 17 |
| Theverge | | 1 |
| Engadget | | 28 |
| Arstechnica_tech | | 17 |
| Techcrunch | | 4 |
| Cnet | | 25 |
| Vice | | 13 |
| Geekwire | | 44 |
| Venturebeat | | 19 |
| Fortune | | 46 |
| Theinformation | | 15 |
| Fastcompany | | 42 |
| Zdnet | | 12 |
| Reuters | | 6 |
| Gizmodo | | Well-known technology blog in the United States |
| Scientificamerican | | Popular high-level academic journal |
| Entrepreneur | | News site about entrepreneurs, small business management, and business opportunities |
| Readwrite | | Well-known Internet technology news blog |
Automatic extraction and evaluation of international cutting-edge technology recognition entities based on the SpaCy model.
| Number | P(%) | R(%) | F(%) |
|---|---|---|---|
| 1 | 89.14 | 86.27 | 85.71 |
| 2 | 89.00 | 93.85 | 91.37 |
| 3 | 84.07 | 84.49 | 84.27 |
| 4 | 92.08 | 90.87 | 91.47 |
| 5 | 91.35 | 87.50 | 89.39 |
| 6 | 84.72 | 86.67 | 85.69 |
| 7 | 93.30 | 91.50 | 92.39 |
| 8 | 92.72 | 97.02 | 83.21 |
| 9 | 91.93 | 91.01 | 91.46 |
| 10 | 98.01 | 98.21 | 98.11 |
| Average | 90.63 | 90.74 | 89.31 |
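To make the relationship between the columns concrete, the sketch below recomputes the column means and a per-fold F value from the first ten rows of the table above. It assumes F is the standard harmonic mean of precision and recall, F = 2PR / (P + R); the record does not state the definition the authors used, and the names `f1`, `mean`, and `mismatches` are illustrative.

```python
# (P, R, reported F) in percent, copied from the first ten rows of the table.
folds = [
    (89.14, 86.27, 85.71),
    (89.00, 93.85, 91.37),
    (84.07, 84.49, 84.27),
    (92.08, 90.87, 91.47),
    (91.35, 87.50, 89.39),
    (84.72, 86.67, 85.69),
    (93.30, 91.50, 92.39),
    (92.72, 97.02, 83.21),
    (91.93, 91.01, 91.46),
    (98.01, 98.21, 98.11),
]


def f1(p, r):
    """Harmonic mean of precision and recall (assumed definition of F)."""
    return 2 * p * r / (p + r) if (p + r) else 0.0


def mean(values):
    values = list(values)
    return sum(values) / len(values)


# Column means over the ten folds, rounded as in the table.
mean_p = round(mean(p for p, _, _ in folds), 2)
mean_r = round(mean(r for _, r, _ in folds), 2)
mean_f = round(mean(f for _, _, f in folds), 2)

# F recomputed from P and R under the harmonic-mean assumption.
recomputed = [round(f1(p, r), 2) for p, r, _ in folds]

# 1-based row numbers where the reported F differs by more than 0.05 points.
mismatches = [
    row
    for row, ((_, _, reported), calc) in enumerate(zip(folds, recomputed), start=1)
    if abs(reported - calc) > 0.05
]

print(mean_p, mean_r, mean_f)
print(recomputed)
print(mismatches)
```

The column means reproduce the final row of the table, which suggests that row is the ten-fold average. Under the harmonic-mean assumption, eight of the reported F values agree with the recomputed ones to within rounding, while the first and eighth rows do not; this record alone cannot explain the difference.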