| Literature DB >> 34811711 |
Takuya Takata1, Hajime Sasaki2, Hiroko Yamano2, Masashi Honma3, Mayumi Shikano4.
Abstract
Horizon scanning for innovative technologies that might be applied to medical products and requires new assessment approaches to prepare regulators, allowing earlier access to the product for patients and an improved benefit/risk ratio. The purpose of this study is to confirm that citation network analysis and text mining for bibliographic information analysis can be used for horizon scanning of the rapidly developing field of AI-based medical technologies and extract the latest research trend information from the field. We classified 119,553 publications obtained from SCI constructed with the keywords "conventional," "machine-learning," or "deep-learning" and grouped them into 36 clusters, which demonstrated the academic landscape of AI applications. We also confirmed that one or two close clusters included the key articles on AI-based medical image analysis, suggesting that clusters specific to the technology were appropriately formed. Significant research progress could be detected as a quick increase in constituent papers and the number of citations of hub papers in the cluster. Then we tracked recent research trends by re-analyzing "young" clusters based on the average publication year of the constituent papers of each cluster. The latest topics in AI-based medical technologies include electrocardiograms and electroencephalograms (ECG/EEG), human activity recognition, natural language processing of clinical records, and drug discovery. We could detect rapid increase in research activity of AI-based ECG/EEG a few years prior to the issuance of the draft guidance by US-FDA. Our study showed that a citation network analysis and text mining of scientific papers can be a useful objective tool for horizon scanning of rapidly developing AI-based medical technologies.Entities:
Keywords: Artificial intelligence; Citation network; Delivery of health care/trends; Diagnostic imaging; Horizon scanning
Mesh:
Year: 2021 PMID: 34811711 PMCID: PMC8854249 DOI: 10.1007/s43441-021-00355-z
Source DB: PubMed Journal: Ther Innov Regul Sci ISSN: 2168-4790 Impact factor: 1.778
Key articles and the clusters in which they are contained
| Label | Paper title | published year | Web of science | |
|---|---|---|---|---|
| Cluster No | Times cited within each cluster | |||
| A | Gradient-based learning applied to document recognition. [ | 1998 | 1 | 1590 |
| B | Learning hierarchical features for scene labeling. [ | 2012 | 1 | 304 |
| C | Imagenet classification with deep convolutional neural networks. [ | 2012 | 1 | 1742 |
| D | Deep learning. [ | 2015 | 3 | 1825 |
| E | Pulmonary nodule detection in CT images: false positive reduction using multi-view convolutional networks. [ | 2016 | 3 | 239 |
| F | Improved automated detection of diabetic retinopathy on a publicly available dataset through integration of deep learning. [ | 2016 | 3 | 151 |
| G | Dermatologist-level classification of skin cancer with deep neural networks. [ | 2017 | 3 | 1015 |
| H | A survey on deep learning in medical image analysis. [ | 2017 | 3 | 1127 |
The key articles that have contributed to the development of AI-based medical image analysis were selected based on a review article on AI-based medical image analysis [32]. The clusters obtained from the citation network analysis of these articles are indicated. The clusters are numbered in descending order of the number of constituent papers included. The cells for papers not included in the analysis were shadowed. 8 articles are listed, excluding the 5 articles [31–33, 35, 39] that were excluded
Fig. 1Steps of clustering and making Academic Landscape based on citation network. This figure has been published in reference [10]. The procedure of the citation network is as follows: (1) Extract the dataset of academic papers for analysis. (2) To extract the data, convert the citation network into an unweighted network with papers as nodes and citation relationships as links. (3) Divide the network into several clusters by using the topological clustering method. (4) Use a large graph layout (LGL), based on a force-direct layout algorithm, to display the largest connected component of the network to generate coordinates for the nodes in two dimensions and to visualize the citation network by expressing inter-cluster links with the same color.
Fig. 2Tracking clusters containing key articles. We analyzed papers obtained from WoS published up to the indicated years. We plotted the cluster numbers that contained the eight key articles shown in Table 1, with the circle sizes representing the approximate number of citations in the cluster for each paper
Fig. 3Tracking clusters related to ECG and EEG. We analyzed papers obtained from WoS published up to the indicated years. A cluster number indicates the cluster on ECG and EEG. The circle sizes indicate the approximate citation frequency of the key article, [73] and the number in each circle represents the number of citations in the cluster. Clusters on ECG and EGG were first detected in 2015 as cluster number 10 and were classified into cluster numbers 11, 21, 1, 15, and 15 for 2016, 2017, 2018, 2019, and 2020, respectively
Sub-clustering results for clusters of AI-based medical technologies
| Cluster name | Average year | The number of papers | Top keywords | Hub papers |
|---|---|---|---|---|
| Cluster3 | 2018.8 | 10,992 | Segmentation, cancer, radiomics | Deep learning [ |
| Sub3-1 | 2018.3 | 1179 | Glaucoma, optical coherence tomography, retinal | Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs [ |
| Sub3-2 | 2019.1 | 1017 | Brain tumour segmentation, MRI, lesion | Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation [ |
| Sub3-3 | 2018.9 | 1011 | Whole slide, cancer, pathology | Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer [ |
| Sub3-4 | 2019.1 | 1004 | Radiograph, bone age, aneurysm | Deep Learning at Chest Radiography: Automated Classification of Pulmonary Tuberculosis by Using Convolutional Neural Networks [ |
| Sub3-5 | 2018.8 | 980 | Radiomics, glioma, MRI | Machine Learning methods for Quantitative Radiomic Biomarkers [ |
| Cluster15 | 2018.3 | 3202 | EEG (electroencephalogram), ECG (electrocardiogram), seizure | Real-Time Patient-Specific ECG Classification by 1-D Convolutional Neural Networks [ |
| Sub15-1 | 2019.0 | 606 | ECG, arrhythmia, heartbeat | Real-Time Patient-Specific ECG Classification by 1-D Convolutional Neural Networks [ |
| Sub15-2 | 2017.4 | 560 | EEG, brain–computer interface, motor imagery | A novel deep learning approach for classification of EEG motor imagery signals [ |
| Sub15-3 | 2018.5 | 394 | Seizure, EEG, epilepsy | Deep convolutional neural network for the automated detection and diagnosis of seizure using EEG signals [ |
| Sub15-4 | 2018.3 | 379 | Emotion, EEG, physiological signal | EEG-Based Emotion Recognition in Music Listening [ |
| Sub15-5 | 2018.3 | 239 | Surface electromyography, myoelectric, prosthesis | Electromyography data for non-invasive naturally-controlled robotic hand prostheses [ |
| Cluster12 | 2018.3 | 4101 | Gait, activity recognition, video | 3D Convolutional Neural Networks for Human Action Recognition [ |
| Sub12-1 | 2018.8 | 694 | Action recognition, video, convolutional neural network | 3D Convolutional Neural Networks for Human Action Recognition [ |
| Sub12-2 | 2018.6 | 530 | Human activity recognition, wearable sensor, accelerometer | Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition [ |
| Sub12-3 | 2018.2 | 463 | Facial expression, emotion, ck + | Facial expression recognition based on Local Binary Patterns: A comprehensive study [ |
| Sub12-4 | 2017.4 | 379 | Gait, parkinson, walking | A machine learning approach for automated recognition of movement patterns using basic, kinetic and kinematic gait data [ |
| Sub12-5 | 2018.8 | 251 | Hand pose, sign language, human pose | Real-Time Continuous Pose Recovery of Human Hands Using Convolutional Networks [ |
| Cluster5 | 2017.4 | 7829 | Clinical text, disease, electronic health record | Predicting the Future—Big Data, Machine Learning, and Clinical Medicine [ |
| Sub5-1 | 2016.7 | 828 | Clinical text, radiology report, electronic health record | 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text [ |
| Sub5-2 | 2018.3 | 759 | Recidivism, treatment effect, uplift modelling | Machine Learning: An Applied Econometric Approach [ |
| Sub5-3 | 2019.0 | 753 | Readmission, patient, electronic health record | Scalable and accurate deep learning with electronic health records [ |
| Sub5-4 | 2018.1 | 731 | Sepsis, acute kidney injury, ICU | An Interpretable Machine Learning Model for Accurate Prediction of Sepsis in the ICU [ |
| Sub5-5 | 2018.9 | 710 | Coronary artery, cardiac, angiography | A combined deep-learning and deformable-model approach to fully automatic segmentation of the left ventricle in cardiac MRI [ |
| Cluster13 | 2017.6 | 3800 | Disorder, brain, schizophrenia | Single subject prediction of brain disorders in neuroimaging: Promises and pitfalls [ |
| Sub13-1 | 2017.2 | 546 | Schizophrenia, psychosis, bipolar disorder | Using Support Vector Machine to identify imaging biomarkers of neurological and psychiatric disease: A critical review [ |
| Sub13-2 | 2018.6 | 441 | Alzheimer, MCI (mild cognitive impairment), disease | Hierarchical feature representation and multimodal fusion with deep learning for AD/MCI diagnosis [ |
| Sub13-3 | 2017.1 | 433 | MCI, Alzheimer, mild cognitive impairment, impairment | A review on neuroimaging-based classification studies and associated feature extraction methods for Alzheimer's disease and its prodromal stages [ |
| Sub13-4 | 2018.3 | 350 | Suicide risk, depression, mental health | Predicting Risk of Suicide Attempts Over Time Through Machine Learning [ |
| Sub13-5 | 2018.1 | 315 | Autism spectrum disorder, child, ADHD | Identification of autism spectrum disorder using deep learning and the ABIDE dataset [ |
| Cluster2 | 2016.4 | 13,309 | Protein, drug discovery, peptide | Random forests [ |
| Sub2-1 | 2016.4 | 2056 | Ligand, drug, virtual screening | Deep Neural Nets as a Method for Quantitative Structure–Activity Relationships [ |
| Sub2-2 | 2017.3 | 1873 | Gene, random forest, cancer | Random forests [ |
| Sub2-3 | 2018.0 | 1546 | Enhancer, gene, RNA | Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [ |
| Sub2-4 | 2018.6 | 1158 | Ligand, patient, NGC (NIDA Genetics Consortium) | Scikit-learn: Machine Learning in Python [ |
| Sub2-5 | 2016.5 | 1027 | DNA binding protein, peptide, amino acid composition | Predicting protein structural classes for low-similarity sequences by evaluating different features[ |
The clusters of AI-medical technologies were re-analyzed and the characteristics of the top five sub-clusters, that is, the number and average of publications of constituent papers, specific keywords, and the title of hub paper are shown