| Literature DB >> 34831392 |
Yameng Wu1, Yu Guo1, Jun Ma1, Yu Sa1, Qifeng Li1, Ning Zhang1.
Abstract
In the field of gliomas research, the broad availability of genetic and image information originated by computer technologies and the booming of biomedical publications has led to the advent of the big-data era. Machine learning methods were applied as possible approaches to speed up the data mining processes. In this article, we reviewed the present situation and future orientations of machine learning application in gliomas within the context of workflows to integrate analysis for precision cancer care. Publicly available tools or algorithms for key machine learning technologies in the literature mining for glioma clinical research were reviewed and compared. Further, the existing solutions of machine learning methods and their limitations in glioma prediction and diagnostics, such as overfitting and class imbalanced, were critically analyzed.Entities:
Keywords: gene expression; gliomas; machine learning; prediction; radiomics
Mesh:
Substances:
Year: 2021 PMID: 34831392 PMCID: PMC8622230 DOI: 10.3390/cells10113169
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Figure 1A timeline of the development of the combination of gliomas and machine learning.
Figure 2A typical machine learning process.
Figure 3A line chart showing the exponential growth in published papers using machine learning algorithms to solve glioma tasks. The data were collected using a variety of key word searches through PubMed. In this figure, two axes have been plotted. The y-axis represents the number for publications related to “glioma” and “machine learning methods”. The x-axis represents the publication year. Each line represents the cumulative total of papers published over a year period. The earliest papers appeared in the early 2000s.
Some related articles of other research directions were not discussed in detail in our review.
| Category | Reference | Machine Learning Methods | Data Type |
|---|---|---|---|
| feature selection | Zöllner et al. [ | SVM | dynamic susceptibility contrast MRI |
| Sun et al. [ | L1-SVM + multi-layer perceptron | radiomics | |
| Abusamra [ | SVM, KNN, RF and eight other feature selection methods | gene expression data | |
| automatic segmentation | Wu et al. [ | SVM | T2 weighted MRI |
| Chen et al. [ | multiscale 3D convolutional neural network | MRI | |
| recurrence | X. Gao et al. [ | SVM | Pre- and post-contrast T1WI and T2 FLAIR |
| Rathore [ | PCA | MRI |
Publications discussed in this review.
| Category | Reference | Machine Learning Algorithm | Training Data | Year | Aims |
|---|---|---|---|---|---|
| Biomarkers prediction | Hsu, J.BK. et al. [ | random forests | gene expression profile | 2019 | Identify gene biomarkers |
| J. Haubold et al. [ | linear SVM, random forest | multiparametric 18F-FET PET-MRI and MR Fingerprinting | 2020 | Identify ATRX, IDH1, and 1p19q status | |
| Y. Matsui et al. [ | Neural network | magnetic resonance imaging (MRI), positron emission tomography (PET), and computed tomography(CT) | 2020 | Identify IDH1, and 1p19q status with multimodal data | |
| Grades classification | C. Lu et al. [ | SVM and ensemble learning approaches | multimodal MR radiomics | 2018 | grades classification |
| A. Sengupta et al. [ | random forests and SVM | Conventional MRI images and 3D T1 perfusion MRI data | 2019 | feature selection before calssification | |
| B. Niu et al. [ | random forests and Complement Naive Bayes | gene expression data | 2020 | imbalanced data problem | |
| Prognosis prediction | P. Mobadersany et al. [ | convolutional neural network | pathology images and genomics | 2018 | predict glioma outcomes |
| X. Gong et al. [ | LASSO | Transcriptomic data | 2021 | develop a signature associated with the tumor immune | |
| N. Czarnek et al. [ | Khachiyan and Cox proportional hazards | Axial preoperative fluid-attenuated inversion recovery (FLAIR) and post contrast T1 (T1 + C) images | 2017 | investigated the relationship between tumor shape and prognosis |
Challenges in future research.
| Category | Challenges |
|---|---|
| Data aspect | lack of annotated data |
| data quality and integrity | |
| data class imbalance | |
| Model aspect | Overfitting |
| lack of comparing with different models | |
| generalizability of models | |
| reproducibility of model | |
| Clinical application aspect | Physicians’ knowledge limitations |