| Literature DB >> 32831068 |
Maryam Khoshnejat1,2, Kaveh Kavousi3,4, Ali Mohammad Banaei-Moghaddam2,5, Ali Akbar Moosavi-Movahedi2,6.
Abstract
BACKGROUND: Type 2 diabetes mellitus (T2DM) is a complex multifactorial disease with a high prevalence worldwide. Insulin resistance and impaired insulin secretion are the two major abnormalities in the pathogenesis of T2DM. Skeletal muscle is responsible for over 75% of the glucose uptake and plays a critical role in T2DM. Here, we sought to provide a better understanding of the abnormalities in this tissue.Entities:
Keywords: Classification; Clustering; Flux variability analysis; Insulin resistance; Metabolic modeling; Muscle; Subtype; Type 2 diabetes
Mesh:
Substances:
Year: 2020 PMID: 32831068 PMCID: PMC7444195 DOI: 10.1186/s12920-020-00767-0
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Graphical overview of the study design. This study included two supervised and unsupervised classification sections. At the supervised classification part, we used a machine learning approach to identify a set of genes whose expression patterns could discriminate T2DM individuals from healthy ones. At the unsupervised section, the clustering of T2DM patients was employed for potential subtyping of the disease
Fig. 2Workflow for cluster-based metabolic modeling. HMR2 model was used as the generic model. The personalized metabolic models were reconstructed by integrating gene expression data into the HMR2 using the E-Flux algorithm. Diabetic models were categorized into three groups based on the clusters obtained from the hierarchical clustering of T2DM patients. FVA was employed to obtain maximum and minimum possible fluxes in each reaction. Perturbed reactions in each cluster in comparison to the healthy group were identified by applying t-test on obtained fluxes
Evaluation of the different classifiers for discrimination of T2DM individuals from healthy ones. Here, 247 differentially expressed genes were used as the classification features
| Method | AUC | ACC | F1 | Precision | Recall |
|---|---|---|---|---|---|
| 0.889 | 0.838 | 0.806 | 0.788 | 0.825 | |
| 0.877 | 0.812 | 0.772 | 0.766 | 0.778 | |
| 0.837 | 0.766 | 0.71 | 0.721 | 0.698 | |
| 0.801 | 0.734 | 0.717 | 0.634 | 0.825 | |
| 0.758 | 0.727 | 0.58 | 0.784 | 0.46 |
AUC, ACC, F1 score, precision, and Recall are reported
Performance of different classifiers when the 26 top-ranked genes were used as the features
| Method | AUC | ACC | F1 | Precision | Recall |
|---|---|---|---|---|---|
| 0.958 | 0.942 | 0.927 | 0.950 | 0.905 | |
| 0.966 | 0.903 | 0.878 | 0.900 | 0.857 | |
| 0.896 | 0.818 | 0.791 | 0.746 | 0.841 | |
| 0.836 | 0.799 | 0.735 | 0.796 | 0.683 | |
| 0.829 | 0.721 | 0.538 | 0.833 | 0.397 |
Fig. 3Hierarchical clustering of diabetic samples. The top three clusters were selected, and DEGs and perturbed pathways in each cluster compared to normal samples were found. Some of the specific dysregulated genes and pathways in each cluster are shown in the boxes. Green boxes show down-regulated genes, and peach boxes show up-regulated genes. The blue boxes show perturbed pathways and abnormalities in each cluster
Fig. 4Heatmap representation of the gene expression pattern in three diabetic clusters. The columns of the heatmap represent diabetic individuals and the rows show standardized gene expression (Z scores). Higher expressions are shown in lighter red and lower expressions are shown in lighter green. Clusters with relevant dendrogram obtained from hierarchical clustering with Euclidean distance are shown at the top of columns in which blue, green, and red lines demonstrate clusters 1 to 3, respectively. Row dendrogram indicates the clustering of genes using complete linkage hierarchical clustering with Euclidean distance
Subject characteristics for data. Fasting plasma glucose, fasting serum insulin, BMI, and waist/hip ratio (WHR) in each diabetic cluster and healthy group
| Healthy | Cluster 1 | Cluster 2 | Cluster 3 | ||
|---|---|---|---|---|---|
| 5.62 ± 0.3 | 7.17 ± 0.5 | 6.86 ± 0.5 | 7.39 ± 0.75 | 6.14e-43 | |
| 6.87 ± 3.3 | 10.19 ± 5.3 | 7.79 ± 3.9 | 12.93 ± 8.7 | 1.08e-06 | |
| 26.35 ± 3.5 | 29.03 ± 5.0 | 28.58 ± 4.5 | 30.13 ± 5.5 | 1.96e-04 | |
| 0.92 ± 0.08 | 0.99 ± 0.07 | 0.95 ± 0.06 | 1.02 ± 0.06 | 3.64e-08 |
All values are shown as means ± standard deviation
P values were calculated using ANOVA F-test