| Literature DB >> 31803734 |
XiaoYong Pan1,2,3, Tao Zeng4, Fei Yuan5, Yu-Hang Zhang6, Lei Chen7,8, LiuCun Zhu1, SiBao Wan1, Tao Huang6, Yu-Dong Cai1.
Abstract
Isocitrate dehydrogenase (IDH) is an oncogene, and the expression of a mutated IDH promotes cell proliferation and inhibits cell differentiation. IDH exists in three different isoforms, whose mutation can cause many solid tumors, especially gliomas in adults. No effective method for classifying gliomas on genetic signatures is currently available. DNA methylation may be applied to distinguish cancer cells from normal tissues. In this study, we focused on three subtypes of IDH-mutation gliomas by examining methylation data. Several advanced computational methods were used, such as Monte Carlo feature selection (MCFS), incremental feature selection (IFS), support machine vector (SVM), etc. The MCFS method was adopted to analyze methylation features, resulting in a feature list. Then, the IFS method incorporating SVM was applied to the list to extract important methylation features and construct an optimal SVM classifier. As a result, several methylation features (sites) were found to relate to glioma subclasses, which are annotated onto multiple genes, such as FLJ37543, LCE3D, FAM89A, ADCY5, ESR1, C2orf67, REST, EPHA7, etc. These genes are enriched in biological functions, including cellular developmental process, neuron differentiation, cellular component morphogenesis, and G-protein-coupled receptor signaling pathway. Our results, which are supported by literature reports and independent dataset validation, showed that our identified genes and functions contributed to the detailed glioma subtypes. This study provided a basic research on IDH-mutation gliomas.Entities:
Keywords: IDH-mutation; gliomas; isocitrate dehydrogenase; methylation; multi-class classification
Year: 2019 PMID: 31803734 PMCID: PMC6871504 DOI: 10.3389/fbioe.2019.00339
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
Figure 1The entire procedures for investigating the methylation profiles of patients with three IDH-mutation glioma subclasses.
The 10-fold cross-validation performance of IFS with different classifiers on the training set.
| SVM | 750 | 0.987 | 0.957 | 1.000 | 0.985 | 0.977 |
| SVM | 20 | 1.000 | 0.913 | 1.000 | 0.980 | 0.970 |
| RF | 1,330 | 0.987 | 0.913 | 1.000 | 0.975 | 0.962 |
| RIPPER | 19,270 | 0.962 | 0.848 | 0.950 | 0.931 | 0.895 |
Figure 2Performance of SVM, RF, and RIPPER that changed with the corresponding number of features. (A) RF performance, (B) RIPPER performance, and (C) SVM performance.
The performance of IFS with different classifiers on the independent test set.
| SVM | 750 | 0.947 | 0.780 | 1.000 | 0.936 | 0.899 |
| SVM | 20 | 0.926 | 0.756 | 0.964 | 0.908 | 0.855 |
| RF | 1,330 | 0.968 | 0.756 | 1.000 | 0.940 | 0.907 |
| RIPPER | 19,270 | 0.957 | 1.000 | 1.000 | 0.982 | 0.972 |
Top features (methylation probes) and their targeting genes.
| 1 | cg04437966 | 0.5637 | |
| 2 | cg14159026 | 0.4719 | |
| 3 | cg22519158 | 0.3781 | |
| 4 | cg12450347 | 0.3505 | |
| 5 | cg17482114 | 0.3397 | |
| 6 | cg08415493 | 0.3244 | |
| 7 | cg12760041 | 0.3119 | |
| 8 | cg12930304 | – | 0.2875 |
| 9 | cg26694713 | 0.2846 | |
| 10 | cg04360458 | 0.2591 | |
| 11 | cg17398252 | 0.2497 | |
| 12 | cg21552709 | 0.2374 | |
| 13 | cg20138711 | 0.2327 | |
| 14 | cg11902641 | – | 0.2271 |
| 15 | cg03903398 | 0.2052 | |
| 16 | cg19681793 | 0.1916 | |
| 17 | cg24215279 | 0.1889 | |
| 18 | cg05427966 | 0.1797 | |
| 19 | cg11235583 | 0.1766 | |
| 20 | cg14158583 | 0.1739 |
The significantly enriched GO/KEGG functions with FDR < 0.05.
| GO:0048731 system development | 5.02E-05 | 3.18E-09 |
| GO:0030154 cell differentiation | 9.78E-05 | 1.88E-08 |
| GO:0032502 developmental process | 9.78E-05 | 2.13E-08 |
| GO:0048869 cellular developmental process | 9.78E-05 | 2.48E-08 |
| GO:0007275 multicellular organism development | 0.0001 | 4.69E-08 |
| GO:0048856 anatomical structure development | 0.0001 | 4.33E-08 |
| GO:0048513 animal organ development | 0.0002 | 1.06E-07 |
| GO:0009653 anatomical structure morphogenesis | 0.0003 | 1.98E-07 |
| GO:0032501 multicellular organismal process | 0.0003 | 1.92E-07 |
| GO:0007399 nervous system development | 0.0004 | 2.52E-07 |
| GO:0048518 positive regulation of biological process | 0.0005 | 3.44E-07 |
| GO:0030182 neuron differentiation | 0.0009 | 7.14E-07 |
| GO:0048699 generation of neurons | 0.0010 | 7.99E-07 |
| GO:0022008 neurogenesis | 0.0011 | 9.80E-07 |
| GO:0051239 regulation of multicellular organismal process | 0.0028 | 2.61E-06 |
| GO:0048468 cell development | 0.0050 | 5.02E-06 |
| GO:0009887 animal organ morphogenesis | 0.0054 | 5.86E-06 |
| GO:0048598 embryonic morphogenesis | 0.0066 | 7.53E-06 |
| GO:0000904 cell morphogenesis involved in differentiation | 0.0084 | 1.01E-05 |
| GO:0050793 regulation of developmental process | 0.0088 | 1.11E-05 |
| GO:0001501 skeletal system development | 0.0094 | 1.25E-05 |
| GO:0051240 positive regulation of multicellular organismal process | 0.0108 | 1.51E-05 |
| GO:0048534 hematopoietic or lymphoid organ development | 0.0117 | 1.70E-05 |
| GO:0002520 immune system development | 0.0124 | 1.95E-05 |
| GO:0035295 tube development | 0.0124 | 1.96E-05 |
| GO:0000902 cell morphogenesis | 0.0129 | 2.13E-05 |
| GO:0048522 positive regulation of cellular process | 0.0160 | 2.73E-05 |
| GO:0009790 embryo development | 0.0224 | 3.97E-05 |
| GO:0009888 tissue development | 0.0253 | 4.64E-05 |
| GO:0007187 G-protein coupled receptor signaling pathway, coupled to cyclic nucleotide second messenger | 0.0352 | 6.91E-05 |
| GO:0032989 cellular component morphogenesis | 0.0352 | 6.92E-05 |
| GO:0032736 positive regulation of interleukin-13 production | 0.0356 | 7.21E-05 |
| GO:0048871 multicellular organismal homeostasis | 0.0418 | 8.73E-05 |
| GO:0030097 hemopoiesis | 0.0459 | 9.88E-05 |
| GO:0046703 natural killer cell lectin-like receptor binding | 0.0481 | 1.04E-05 |