| Literature DB >> 36139115 |
Jianbo Qing1,2, Fang Zheng3, Huiwen Zhi1,2, Hasnaa Yaigoub4, Hasna Tirichen4, Yaheng Li2,5, Juanjuan Zhao3, Yan Qiang3, Yafeng Li2,6,7,8.
Abstract
(1) Objective: Identification of potential genetic biomarkers for various glomerulonephritis (GN) subtypes and discovering the molecular mechanisms of GN. (2)Entities:
Keywords: deep learning; glomerulonephritis; immune infiltration; immune-related genes; machine learning
Mesh:
Substances:
Year: 2022 PMID: 36139115 PMCID: PMC9496457 DOI: 10.3390/biom12091276
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
The information of the datasets used for analysis.
| Number | Platform | Tissue | Data Sources |
|---|---|---|---|
| GSE99339 | GPL19109 | Glomeruli | Shved, Natallia et al. [ |
| GPL19184 | Glomeruli | ||
| GSE104948 | GPL22945 | Glomeruli | Grayson, Peter C et al. [ |
| GPL24120 | Glomeruli | ||
| GSE99325 | GPL19109 | Tubulointerstitium | Shved, Natallia et al. [ |
| GPL19184 | Tubulointerstitium | ||
| GSE104954 | GPL22945 | Tubulointerstitium | Grayson, Peter C et al. [ |
| GPL24120 | Tubulointerstitium |
The information of all samples in data 1 and data 2.
| Groups | The Number of Samples | |
|---|---|---|
| Data 1 (Glomeruli) | Data 2 (Tubulointerstitium) | |
| Control | 21 | 25 |
| Diabetic nephropathy (DN) | 26 | 35 |
| Focal and segmental glomerulosclerosis (FSGS) | 40 | 25 |
| Hypertensive nephropathy (HT) | 33 | 40 |
| IgA nephropathy (IgAN) | 53 | 49 |
| Minimal change disease (MCD) | 27 | 25 |
| Membranous glomerulonephritis (MGN) | 42 | 36 |
| Rapidly progressive glomerulonephritis (RPGN) | 45 | 42 |
| Lupus nephritis (LN) | 62 | 62 |
| Total | 349 | 339 |
Figure 1Merging datasets and removing batch effect. (A) The UMAP of sample distribution of each dataset of glomeruli before the removal of batch effect, samples from individual datasets are clustered separately, which indicates the existence of batch effect. (B) The UMAP of sample distribution of each dataset of glomeruli after the removal of batch effect, the samples of each datasets are clustered together, suggesting a good removal of batch effect. (C) The UMAP of sample distribution of each dataset of tubulointerstitium before the removal of batch effect, samples from individual datasets are clustered separately, which indicates the existence of batch effect. (D) The UMAP of sample distribution of each dataset of tubulointerstitium after the removal of batch effect, the samples of each datasets are clustered together, suggesting a good removal of batch effect.
Figure 2Identification of common DIRGs in data 1 and data 2. (A) Volcano map of all DEGs of data 1, there are 3474 up-regulated and 2036 down-regulated genes. (B) Volcano map of all DEGs of data 2, there are 2338 up-regulated and 2441 down-regulated genes. (C) Venn diagram of DEGs and IRGs, there are 274 common DIRGs in glomeruli and tubulointerstitium. (D) log2FC–log2FC plot of data 1 and data 2, there are 170 common up-regulated, 104 common down-regulated, and 19 oppositely expressed DIRGs.
Figure 3Enrichment analysis. (A) KEGG enrichment analysis of 170 common up-regulated DIRGs in data 1 and data 2. (B) GO enrichment analysis of 170 common up-regulated DIRGs in data 1 and data 2. (C) KEGG enrichment analysis of 104 common down-regulated DIRGs in data 1 and data 2. (D) GO enrichment analysis of 104 common down-regulated DIRGs in data 1 and data 2. The numbers next to the bars represent the number of genes enriched in the GO terms.
Figure 4Immune signatures of the glomeruli and tubulointerstitium in GN. (A) Bar graph of 10 major types of immune cells in the glomeruli of 8 GN subtypes. (B) Bar graph of 10 major types of immune cells in the tubulointerstitium of 8 GN subtypes. (C) Boxplot of 10 major types of immune cells in the glomeruli of GN and healthy controls. (D) Boxplot of 10 major types of immune cells in the tubulointerstitium of GN and healthy controls. (E); Boxplot of 14 T-cell subtypes of immune cells in the glomeruli of GN and healthy controls. (F): Boxplot of 14 T-cell subtypes in the tubulointerstitium of GN and healthy controls. (G): Heatmap of 10 major types of immune cells in 8 GN subtypes and healthy controls. (H): Heatmap of 14 T-cell subtypes in 8 GN subtypes and healthy controls.
Figure 5Gene screening and regression model establishment. (A) AUC−AUC plot of 274 common up-regulated or down-regulated DIRGs in data 1 and data 2, the DIRGs with top 20% AUC values both in glomeruli and tubulointerstitium are deep red. (B) The elastic net of 11 DIRGs in data 1. (C) Seven DIRGs were screened based on lambda = 0.02.
Figure 6Construction and assessment of machine learning model. (A) Nomogram model for GN diagnosis, based on the 7 DIRGs (ARG2, CSHL1, CX3CR1, LTF, LYZ, TMSB10, TRIM27). (B) Calibration curve to evaluate the nomogram model. The actual GN risk and the predicted risk are very close. (C) The ROC curve to assess the nomogram model. The AUC value of the nomogram model in data 2 is 0.855935.
Figure 7Workflow of the deep learning. Firstly, we downloaded microarray data from GEO database, screened candidate genes, and constructed MLP network. Then, LRP algorithm was used to calculate the characteristic genes of each GN subtypes.
Figure 8The loss curves, confusion matrix, and ROC curves of deep learning. (A) The loss curve of 274 candidate DIRGs based on data 1. (B) The loss curve of 274 candidate DIRGs based on data 2. (C) The confusion matrix of 274 candidate DIRGs based on data 1. (D) ROC curves of characteristic genes for each GN subtype in data 1. (E) The confusion matrix of 274 candidate DIRGs based on data 2. (F) ROC curves of characteristic genes for each GN subtype in data 2.
The top five characteristic genes of eight GN subtypes.
| Subtypes of GN | Glomeruli | Tubulointerstitium |
|---|---|---|
| DN | AVPR1A, GDF9, SEMA6C, ADA2, SSTR2 | PLSCR1, CXCL8, TRIM22, CXCL1, PLXND1 |
| FSGS | NFYA, MDK, BRD8, ADA2, JAK1 | ADA2, VEGFC, NPY, ZAP70, IFNGR2 |
| HT | TYMP, GRN, AMBN, CRIM1, SHC3 | NR4A3, NR4A1, S100A8, ADIPOR2, SEMA6C |
| IgAN | INSR, PSMD7, GRN, TYMP, GDF9 | MAP2K1, CSF1R, PTPN6, ZAP70, CD86 |
| MCD | PSMD7, GRN, TNFRSF11B, TYMP, GIPR | MDK, PPARG, NFYA, CRABP1, ZAP70 |
| MGN | NFKBIE, IL32, AVPR1A, PSMD7, CCL25 | ZAP70, VEGFC, MAP2K1, GDF15, PPARG |
| RPGN | PAK4, CALCA, C3, C3AR1, ITGAL | OAS1, MAP2K1, GDNF, GDF2, FGF |
| SLE | GRN, B2M, DDX58, EIF2AK2, PTGDS | ADIPOR2, OAS1, EIF2AK2, MX1, PLSCR1 |