| Literature DB >> 34961813 |
Mingying Xu1, Junping Du1, Zeli Guan1, Zhe Xue1, Feifei Kou1, Lei Shi2, Xin Xu1, Ang Li1.
Abstract
Computer science discipline includes many research fields, which mutually influence and promote each other's development. This poses two great challenges of predicting the research topics of each research field. One is how to model fine-grained topic representation of a research field. The other is how to model research topic of different fields and keep the semantic consistency of research topics when learning the scientific influence context from other related fields. Unfortunately, the existing research topic prediction approaches cannot handle these two challenges. To solve these problems, we employ multiple different Recurrent Neural Network chains which model research topics of different fields and propose a research topic prediction model based on spatial attention and semantic consistency-based scientific influence modeling. Spatial attention is employed in field topic representation which can selectively extract the attributes from the field topics to distinguish the importance of field topic attributes. Semantic consistency-based scientific influence modeling maps research topics of different fields to a unified semantic space to obtain the scientific influence context of other related fields. Extensive experiment results on five related research fields in the computer science (CS) discipline show that the proposed model is superior to the most advanced methods and achieves good topic prediction performance.Entities:
Mesh:
Year: 2021 PMID: 34961813 PMCID: PMC8710157 DOI: 10.1155/2021/1766743
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1The structure of Recurrent Neural Network. (a) Vanilla RNN, (b) LSTM, and (c) GRU.
Figure 2The framework of attention mechanism.
Figure 3Research topic prediction model based on SASC.
Figure 4Scientific influence modeling based on semantic consistency.
The statistics of datasets.
| Field | Total papers | Period time |
|---|---|---|
| Computation and Language ( | 22633 | 2006–2020 |
| Computer Vision and Pattern Recognition ( | 50465 | 2006–2020 |
| Machine Learning ( | 72072 | 2006–2020 |
| Information Retrieval ( | 28154 | 2006–2020 |
| Artificial Intelligence ( | 8667 | 2006–2020 |
RMSE of different models.
| Model | RMSE | |||||
|---|---|---|---|---|---|---|
| CL | CV | ML | IR | AI | Average | |
| ARIMA | 1.179e−4 | 2.967e−4 | 5.822e−4 | 3.640e−4 | 3.893e−4 | 3.500e−4 |
| GRU | 4.385e−4 | 2.899e−4 | 2.411e−4 | 1.792e−4 | 2.122e−4 | 2.720e−4 |
| LSTM | 4.470e−4 | 2.879e−4 | 1.312e−4 | 2.032e−4 | 1.987e−4 | 2.540e−4 |
| ENDE | 4.317e−4 | 2.944e−4 | 2.306e−4 | 2.034e−4 | 2.127e−4 | 2.750e−4 |
| TARNN | 4.375e−4 | 3.013e−4 | 2.385e−4 | 2.050e−4 | 2.094e−4 | 2.780e−4 |
| DARNN | 4.319e−4 | 2.923e−4 | 2.331e−4 | 2.070e−4 | 2.046e−4 | 2.740e−4 |
| CONI | 4.196e−4 | 2.558e−4 | 2.377e−4 | 2.049e−4 | 1.942e−4 | 2.620e−4 |
| SASC | 3.981e−4 | 2.384e−4 | 0.970e−4 | 1.577e−4 | 0.701e−4 | 1.920e−4 |
RMSE and precision of different models of different fields.
| (a) Precision of different models of CL field | ||||||||
| p@10 | p@20 | p@30 | p@40 | p@50 | p@60 | p@70 | p@80 | |
|
| ||||||||
| ARIMA | 0.6000 | 0.5000 | 0.5333 | 0.5000 | 0.4600 | 0.4667 | 0.4857 | 0.4750 |
| GRU | 0.7000 | 0.8000 | 0.8000 | 0.7250 | 0.7200 | 0.6500 | 0.6571 | 0.6750 |
| LSTM | 0.7000 | 0.8000 | 0.7667 | 0.7750 | 0.7200 | 0.7167 | 0.7000 | 0.7250 |
| ENDE | 0.6000 | 0.6000 | 0.6000 | 0.5500 | 0.5800 | 0.5667 | 0.5286 | 0.5500 |
| TARNN | 0.6000 | 0.6000 | 0.6000 | 0.5500 | 0.5600 | 0.5500 | 0.5286 | 0.5375 |
| DARNN | 0.6000 | 0.6000 | 0.6000 | 0.5500 | 0.5800 | 0.5667 | 0.5286 | 0.5375 |
| CONI | 0.7000 | 0.8000 | 0.7667 | 0.7250 | 0.6800 | 0.6333 | 0.6286 | 0.6750 |
| SASC | 0.7000 | 0.9000 | 0.9000 | 0.8500 | 0.8000 | 0.8167 | 0.8143 | 0.8000 |
|
| ||||||||
| (b) Precision of different models of CV field | ||||||||
| ARIMA | 0.8000 | 0.6000 | 0.6000 | 0.6250 | 0.5400 | 0.5500 | 0.5714 | 0.6250 |
| GRU | 0.9000 | 0.6000 | 0.6667 | 0.6250 | 0.5600 | 0.5667 | 0.6000 | 0.6375 |
| LSTM | 0.9000 | 0.6000 | 0.6667 | 0.6250 | 0.5800 | 0.5833 | 0.6000 | 0.6375 |
| ENDE | 0.9000 | 0.6000 | 0.6667 | 0.6250 | 0.5800 | 0.5667 | 0.6000 | 0.6375 |
| TARNN | 0.9000 | 0.6000 | 0.6667 | 0.6250 | 0.5600 | 0.5833 | 0.6000 | 0.6250 |
| DARNN | 0.9000 | 0.6000 | 0.6667 | 0.6250 | 0.5800 | 0.5667 | 0.5857 | 0.6500 |
| CONI | 0.9000 | 0.6500 | 0.7000 | 0.6750 | 0.6800 | 0.7000 | 0.6857 | 0.7000 |
| SASC | 1.0000 | 0.9000 | 0.8333 | 0.8750 | 0.8400 | 0.8333 | 0.8714 | 0.8500 |
|
| ||||||||
| (c) Precision of different models of ML field | ||||||||
| ARIMA | 0.6000 | 0.5500 | 0.5333 | 0.6000 | 0.5800 | 0.5500 | 0.5286 | 0.6125 |
| GRU | 0.7000 | 0.6000 | 0.6000 | 0.7000 | 0.6200 | 0.5500 | 0.6143 | 0.6250 |
| LSTM | 0.8000 | 0.8500 | 0.7667 | 0.7750 | 0.7400 | 0.7500 | 0.7714 | 0.7875 |
| ENDE | 0.7000 | 0.6000 | 0.6000 | 0.7000 | 0.6200 | 0.5667 | 0.6143 | 0.6250 |
| TARNN | 0.7000 | 0.6000 | 0.6000 | 0.7000 | 0.6200 | 0.5833 | 0.6143 | 0.6375 |
| DARNN | 0.7000 | 0.6000 | 0.6000 | 0.7000 | 0.6200 | 0.5833 | 0.6143 | 0.6250 |
| CONI | 0.7000 | 0.6000 | 0.6000 | 0.7000 | 0.6200 | 0.5833 | 0.6143 | 0.6250 |
| SASC | 1.0000 | 0.9000 | 0.9000 | 0.9000 | 0.9000 | 0.9000 | 0.8714 | 0.9250 |
|
| ||||||||
| (d) Precision of different models of IR field | ||||||||
| ARIMA | 0.7000 | 0.5500 | 0.6333 | 0.7000 | 0.7000 | 0.7000 | 0.7000 | 0.7250 |
| GRU | 0.8000 | 0.6000 | 0.7000 | 0.8000 | 0.7600 | 0.8167 | 0.8000 | 0.7875 |
| LSTM | 0.8000 | 0.6000 | 0.6333 | 0.7250 | 0.7600 | 0.7500 | 0.7571 | 0.7625 |
| ENDE | 0.8000 | 0.5500 | 0.6667 | 0.7000 | 0.7600 | 0.7000 | 0.7429 | 0.7625 |
| TARNN | 0.8000 | 0.6000 | 0.6667 | 0.7250 | 0.7600 | 0.7167 | 0.7429 | 0.7500 |
| DARNN | 0.8000 | 0.5500 | 0.6333 | 0.7250 | 0.7600 | 0.7167 | 0.7429 | 0.7500 |
| CONI | 0.8000 | 0.5500 | 0.6333 | 0.7250 | 0.7600 | 0.7167 | 0.7625 | 0.7556 |
| SASC | 0.8000 | 0.7500 | 0.8000 | 0.8000 | 0.8000 | 0.8333 | 0.8143 | 0.8000 |
|
| ||||||||
| (e) Precision of different models of AI field | ||||||||
| ARIMA | 0.5000 | 0.5500 | 0.5000 | 0.5750 | 0.6000 | 0.6667 | 0.6857 | 0.6375 |
| GRU | 0.7000 | 0.6500 | 0.5000 | 0.5750 | 0.6200 | 0.6833 | 0.7000 | 0.6750 |
| LSTM | 0.6000 | 0.7000 | 0.5000 | 0.5750 | 0.6200 | 0.6667 | 0.7000 | 0.6625 |
| ENDE | 0.6000 | 0.6500 | 0.5333 | 0.5750 | 0.6200 | 0.6500 | 0.7000 | 0.6500 |
| TARNN | 0.6000 | 0.6500 | 0.5333 | 0.5750 | 0.6200 | 0.6500 | 0.6714 | 0.6750 |
| DARNN | 0.6000 | 0.6500 | 0.5000 | 0.5750 | 0.6200 | 0.6833 | 0.7000 | 0.6500 |
| CONI | 0.6000 | 0.6500 | 0.5000 | 0.6000 | 0.6400 | 0.6667 | 0.7000 | 0.6500 |
| SASC | 0.9000 | 0.8500 | 0.8333 | 0.9000 | 0.9000 | 0.8667 | 0.8429 | 0.8250 |
| (f) Average precision of different models of five fields | ||||||||
| ARIMA | 0.6400 | 0.5500 | 0.5600 | 0.6000 | 0.5760 | 0.5867 | 0.5943 | 0.6150 |
| GRU | 0.7600 | 0.6500 | 0.6533 | 0.6850 | 0.6560 | 0.6533 | 0.6743 | 0.6800 |
| LSTM | 0.7600 | 0.7100 | 0.6667 | 0.6950 | 0.6840 | 0.6933 | 0.7057 | 0.7150 |
| ENDE | 0.7200 | 0.6000 | 0.6133 | 0.6300 | 0.6320 | 0.6100 | 0.6371 | 0.6450 |
| TARNN | 0.7200 | 0.6100 | 0.6133 | 0.6350 | 0.6240 | 0.6167 | 0.6314 | 0.6450 |
| DARNN | 0.7200 | 0.6000 | 0.6000 | 0.6350 | 0.6320 | 0.6233 | 0.6343 | 0.6425 |
| CONI | 0.7400 | 0.6500 | 0.6400 | 0.6850 | 0.6760 | 0.6600 | 0.6714 | 0.6825 |
| SASC | 0.8800 | 0.8600 | 0.8533 | 0.8650 | 0.8480 | 0.8500 | 0.8429 | 0.8400 |
Figure 5The change curve of precision of different prediction models. (a) Precision@10, (b) Precision@20, (c) Precision@40, and (d) Precision@60.
Figure 6The change curve of RMSE of different prediction models.
Figure 7Precision comparison of SASC with two of its variants. (a) CL, (b) CV, (c) ML, (d) IR, (e) AI, and (f) AVERAGE.
RMSE of SASC and its variants.
| Model | RMSE | |||||
|---|---|---|---|---|---|---|
| CL | CV | ML | IR | AI | Average | |
| SASC | 3.981e−4 | 2.384 e−4 | 0.970 e−4 | 1.577 e−4 | 0.701 e−4 | 1.920 e−4 |
| SASC-SA | 4.084 e−4 | 2.439 e−4 | 1.521 e−4 | 1.615 e−4 | 1.063 e−4 | 2.140 e−4 |
| SASC-SC | 4.668 e−4 | 2.447 e−4 | 0.914 e−4 | 1.583 e−4 | 1.011 e−4 | 2.120 e−4 |
Predicted top 10 research topics of CL, CV, ML, IR, and AI fields in 2020.
| Field | Predicted research topics | True research topics |
|---|---|---|
| CL | Neural, speech, semantic, learning, networks, evaluation, classification, recognition, detection, and extraction | Learning, neural, generation, translation, speech, knowledge, recognition, detection, classification, and extraction |
| CV | Learning, networks, convolutional, neural, recognition, detection, segmentation, estimation, adversarial, and classification | Learning, detection, neural, segmentation, networks, recognition, classification, adversarial, convolutional, and estimation |
| ML | Learning, neural, networks, adversarial, detection, reinforcement, classification, prediction, graph, and optimization | Learning, neural, networks, reinforcement, detection, graph, adversarial, classification, optimization, and prediction |
| IR | Ranking, semantic, learning, recommendation, search, neural, retrieval, information, query, and analysis | Recommendation, semantic, learning, ranking, retrieval, search, embedding, information, query, and recommender |
| AI | Learning, networks, language, machine, deep, neural, reinforcement, data, model, and knowledge | Learning, machine, reinforcement, deep, language, neural, topic, networks, knowledge, and model |