| Literature DB >> 34522169 |
Jingni Wu1,2, Xiaoling Fang1, Xiaomeng Xia1.
Abstract
Background: Endometriosis is a common gynecological disorder with high rates of infertility and pelvic pain. However, its pathogenesis and diagnostic biomarkers remain unclear. This study aimed to elucidate potential hub genes and key pathways associated with endometriosis in ectopic endometrium (EC) and eutopic endometrium (EU). Material and Method: EC and EU-associated microarray datasets were obtained from the gene expression omnibus (GEO) database. Gene set enrichment analysis was performed to obtain further biological insight into the EU and EC-associated genes. Weighted gene co-expression network analysis (WGCNA) was performed to find clinically significant modules of highly-correlated genes. The hub genes that belong to both the weighted gene co-expression network and protein-protein interaction (PPI) network were identified using a Venn diagram.Entities:
Keywords: WGCNA; biological markers; endometriosis; pathway
Mesh:
Year: 2021 PMID: 34522169 PMCID: PMC8436105 DOI: 10.7150/ijms.63541
Source DB: PubMed Journal: Int J Med Sci ISSN: 1449-1907 Impact factor: 3.738
Figure 1Flow diagram of strategy for data preparation, preprocessing, and analysis.
Figure 2Differentially expressed genes in endometriosis. (A) Volcano map of differentially expressed genes in EC compared with Ctrl. (B) Volcano map of differentially expressed genes in EU compared with Ctrl. EC: ectopic endometrium from patient with endometriosis. EU: eutopic endometria from patient with endometriosis. Ctrl: endometrium from patient without endometriosis.
Figure 3Gene set enrichment analysis (GSEA) of genes in endometriosis. GSEA-identified biological processes (A) and pathways (B) with significant enrichment in EC compared with Ctrl. GSEA-identified biological processes (C) and pathways (D) with significant enrichment in EU compared with Ctrl.
Figure 4Protein-protein interaction network cluster analysis. The PPI network of differentially expressed genes in EC versus Ctrl or EU versus Ctrl was constructed using the STRING website. The MCODE clustering algorithm was applied to the network to identify the potential hub genes. Top two clusters in each group are shown in the figure. In the EC-associated PPI network, cluster 1 consists of 11 nodes and 31 edges (A) and cluster 2 consists of 11 nodes and 31 edges (B). In the EU-associated PPI network, cluster 1 consists of 21 nodes and 209 edges (C) and cluster 2 consists of 11 nodes and 55 edges (D). PPI: protein-protein interaction; STRING, search tool for retrieval of interacting genes/proteins.
Figure 5Weighted gene co-expression network analysis of genes in endometriosis. Analysis of the scale-free topology fit index and the mean connectivity for various soft-threshold powers (β) for the genes in the EC (A) and EU (B). Dendrogram of all expressed genes in the EC (C) and EU (D) clustered based on a dissimilarity measure (1‐TOM). Determination of module-trait relationship of the EC (E) or EU (F) group in endometriosis and identification of the most clinically relevant modules; each row indicates a module eigengene (the first principal component of the gene expression matrix in a module), and each column represents a clinical trait. TOM: topological overlap matrix.
Figure 6Function annotations of clinically significant module. The GO biological processes (A) and KEGG pathways (B) of genes in EC-associated blue module. The GO biological process (C) and KEGG pathway (D) of genes in EU-associated magenta module. GO: Gene Ontology; KEGG: Kyoto Encyclopedia of Genes and Genomes.
The GO and KEGG Pathway analysis of genes in the blue module
| Category | Term | Description | Log(p-value) |
|---|---|---|---|
| GO Biological Processes | GO:0030029 | actin filament-based process | -20.593297 |
| GO Biological Processes | GO:0071417 | cellular response to organonitrogen compound | -20.2406423 |
| GO Biological Processes | GO:0030155 | regulation of cell adhesion | -20.0365408 |
| GO Biological Processes | GO:0006914 | autophagy | -15.9477686 |
| GO Biological Processes | GO:0010942 | positive regulation of cell death | -14.3658968 |
| GO Biological Processes | GO:0007264 | small GTPase mediated signal transduction | -14.0979829 |
| GO Biological Processes | GO:0070848 | response to growth factor | -13.3811234 |
| GO Biological Processes | GO:0045055 | regulated exocytosis | -13.372418 |
| GO Biological Processes | GO:0051345 | positive regulation of hydrolase activity | -13.0915765 |
| GO Biological Processes | GO:0001568 | blood vessel development | -12.7380519 |
| GO Biological Processes | GO:0000904 | cell morphogenesis involved in differentiation | -12.5393524 |
| GO Biological Processes | GO:0043062 | extracellular structure organization | -12.4268848 |
| GO Biological Processes | GO:0009611 | response to wounding | -12.0333641 |
| GO Biological Processes | GO:0043408 | regulation of MAPK cascade | -11.5789888 |
| GO Biological Processes | GO:0022604 | regulation of cell morphogenesis | -11.4691072 |
| GO Biological Processes | GO:0002576 | platelet degranulation | -11.2263844 |
| GO Biological Processes | GO:0019221 | cytokine-mediated signaling pathway | -10.9869128 |
| GO Biological Processes | GO:0045936 | negative regulation of phosphate metabolic process | -10.9676618 |
| GO Biological Processes | GO:0003012 | muscle system process | -10.2900458 |
| GO Biological Processes | GO:0051129 | negative regulation of cellular component organization | -10.2118548 |
| KEGG Pathway | hsa04068 | FoxO signaling pathway | -9.5 |
| KEGG Pathway | hsa04510 | Focal adhesion | -9 |
| KEGG Pathway | hsa04014 | Ras signaling pathway | -8.6 |
| KEGG Pathway | hsa04010 | MAPK signaling pathway | -8.2 |
| KEGG Pathway | hsa05200 | Pathways in cancer | -7.3 |
| KEGG Pathway | hsa05205 | Proteoglycans in cancer | -7.2 |
| KEGG Pathway | hsa04380 | Osteoclast differentiation | -6.8 |
| KEGG Pathway | hsa04610 | Complement and coagulation cascades | -6.6 |
| KEGG Pathway | hsa04630 | Jak-STAT signaling pathway | -6 |
| KEGG Pathway | hsa05418 | Fluid shear stress and atherosclerosis | -5.9 |
| KEGG Pathway | hsa04270 | Vascular smooth muscle contraction | -5.9 |
| KEGG Pathway | hsa04144 | Endocytosis | -5.9 |
| KEGG Pathway | hsa04727 | GABAergic synapse | -5.8 |
| KEGG Pathway | hsa04137 | Mitophagy - animal | -5.5 |
| KEGG Pathway | hsa04915 | Estrogen signaling pathway | -5.5 |
| KEGG Pathway | hsa04621 | NOD-like receptor signaling pathway | -5.1 |
| KEGG Pathway | hsa05202 | Transcriptional misregulation in cancer | -4.2 |
| KEGG Pathway | hsa04720 | Long-term potentiation | -4.1 |
| KEGG Pathway | hsa04668 | TNF signaling pathway | -3.8 |
| KEGG Pathway | hsa04020 | Calcium signaling pathway | -3.7 |
Note: GO Biological Processes: Gene ontology analysis of biological process. KEGG: Kyoto Encyclopedia of Genes.
The GO and KEGG Pathway analysis of genes in the magenta module
| Category | Term | Description | Log(p-value) |
|---|---|---|---|
| GO Biological Processes | GO:0043408 | regulation of MAPK cascade | -6.05131099 |
| GO Biological Processes | GO:0043409 | negative regulation of MAPK cascade | -4.84419894 |
| GO Biological Processes | GO:1903351 | cellular response to dopamine | -4.25433626 |
| GO Biological Processes | GO:0035666 | TRIF-dependent toll-like receptor signaling pathway | -3.72907579 |
| GO Biological Processes | GO:0060398 | regulation of growth hormone receptor signaling pathway | -3.59181535 |
| GO Biological Processes | GO:1904407 | positive regulation of nitric oxide metabolic process | -3.51272839 |
| GO Biological Processes | GO:0009190 | cyclic nucleotide biosynthetic process | -3.38699864 |
| GO Biological Processes | GO:0000226 | microtubule cytoskeleton organization | -3.37279545 |
| GO Biological Processes | GO:0030258 | lipid modification | -3.34130208 |
| GO Biological Processes | GO:0016569 | covalent chromatin modification | -3.19790154 |
| GO Biological Processes | GO:0046856 | phosphatidylinositol dephosphorylation | -3.11446184 |
| GO Biological Processes | GO:0046777 | protein autophosphorylation | -3.11274876 |
| GO Biological Processes | GO:0090336 | positive regulation of brown fat cell differentiation | -3.07693529 |
| GO Biological Processes | GO:0060391 | positive regulation of SMAD protein signal transduction | -3.07693529 |
| GO Biological Processes | GO:0007099 | centriole replication | -3.06412444 |
| GO Biological Processes | GO:0031570 | DNA integrity checkpoint | -2.93592311 |
| GO Biological Processes | GO:0019835 | cytolysis | -2.87954652 |
| GO Biological Processes | GO:0007030 | Golgi organization | -2.79292724 |
| GO Biological Processes | GO:0048539 | bone marrow development | -2.684576 |
| KEGG Pathway | hsa00310 | Lysine degradation | -5 |
| KEGG Pathway | hsa04668 | TNF signaling pathway | -3.4 |
| KEGG Pathway | hsa04621 | NOD-like receptor signaling pathway | -3.2 |
| KEGG Pathway | hsa05020 | Prion diseases | -2.4 |
| KEGG Pathway | hsa00514 | Other types of O-glycan biosynthesis | -2.4 |
| KEGG Pathway | hsa04140 | Autophagy - animal | -2.3 |
| KEGG Pathway | hsa04912 | GnRH signaling pathway | -2.2 |
| KEGG Pathway | hsa04070 | Phosphatidylinositol signaling system | -2.1 |
| KEGG Pathway | hsa04210 | Apoptosis | -2.1 |
| KEGG Pathway | hsa04550 | Signaling pathways regulating pluripotency of stem cells | -2 |
Figure 7Identification of hub genes in endometriosis. (A) Scatterplot of module eigengenes related to EC in the blue co-expression module. (B) Scatterplot of module eigengenes related to EU in the magenta co-expression module. (C) Venn plot of common hub genes in blue module and PPI network. (D) Venn plot of common hub genes in magenta module and PPI network.
The common hub genes of WGCNA and PPI analysis in the blue and magenta modules
| Gene Symbol | Gene Description | Module | WGCNA | PPI Analysis | Limma Analysis | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| GS | GS P value | MM | MM, P value | K Score | Log2FC | FDR | Up or Down | |||
| ACTG2 | actin, gamma 2, smooth muscle, enteric | blue | 0.922139646 | 7.51E-09 | 0.923366176 | 6.54E-09 | 5.667 | 3.548158459 | 1.02E-08 | up |
| C1R | complement C1r subcomponent | blue | 0.897984733 | 7.81E-08 | 0.96174151 | 1.45E-11 | 4 | 2.208552292 | 1.14E-07 | up |
| CDH3 | cadherin 3 | blue | 0.931420936 | 2.48E-09 | 0.971836642 | 9.56E-13 | 6.2 | 2.775338929 | 3.97E-09 | up |
| CLU | clusterin | blue | 0.863575842 | 9.38E-07 | 0.878310934 | 3.54E-07 | 4 | 2.044619742 | 1.29E-06 | up |
| COL8A1 | collagen type VIII alpha 1 chain | blue | 0.92193141 | 7.69E-09 | 0.941172973 | 6.47E-10 | 3 | 4.329881143 | 1.01E-08 | Up |
| GATA6 | GATA binding protein 6 | blue | 0.98763453 | 6.14E-16 | 0.972736196 | 7.16E-13 | 5.667 | 4.878194698 | 1.42E-15 | up |
| HOXC6 | homeobox C6 | blue | 0.944317318 | 3.99E-10 | 0.90875176 | 2.98E-08 | 4.8 | 3.487499137 | 6.36E-10 | up |
| LMOD1 | leiomodin 1 | blue | 0.880126801 | 3.12E-07 | 0.947799849 | 2.26E-10 | 5.667 | 2.476035697 | 4.14E-07 | up |
| MYH11 | myosin heavy chain 11 | blue | 0.933072838 | 2.00E-09 | 0.975751992 | 2.52E-13 | 5.667 | 3.708376486 | 2.96E-09 | Up |
| MYLK | myosin light chain kinase | blue | 0.914954646 | 1.62E-08 | 0.94723591 | 2.48E-10 | 5.667 | 2.125367821 | 2.54E-08 | up |
| MYOCD | myocardin | blue | 0.859995861 | 1.17E-06 | 0.929712545 | 3.08E-09 | 5.667 | 3.441041169 | 1.43E-06 | up |
| PROS1 | protein S (alpha) | blue | 0.934607399 | 1.64E-09 | 0.984275338 | 5.27E-15 | 4 | 2.899254277 | 2.67E-09 | up |
| SERPING1 | serpin family G member 1 | blue | 0.859067114 | 1.23E-06 | 0.937390849 | 1.12E-09 | 4 | 2.424791429 | 1.61E-06 | up |
| TAGLN | transgelin | blue | 0.909061849 | 2.89E-08 | 0.959903156 | 2.20E-11 | 5.667 | 2.815676431 | 3.97E-08 | up |
| THBS1 | thrombospondin 1 | blue | 0.764480323 | 8.68E-05 | 0.84490284 | 2.77E-06 | 4 | 2.047575043 | 0.000118406 | up |
| THBS2 | thrombospondin 2 | blue | 0.870101501 | 6.18E-07 | 0.92128971 | 8.25E-09 | 3 | 2.397111323 | 8.17E-07 | up |
| ABCC8 | ATP binding cassette subfamily C member 8 | magenta | 0.726050115 | 0.000966995 | 0.922624053 | 1.35214E-07 | 3 | 2.368106299 | 0.000000474 | Up |
| COL4A6 | collagen type IV alpha 6 chain | magenta | 0.831303264 | 3.54879E-05 | 0.873736048 | 4.59798E-06 | 4.71 | 3.592049292 | 5.82E-08 | up |
| COL5A3 | collagen type V alpha 3 chain | magenta | 0.864722139 | 7.50448E-06 | 0.96496314 | 4.02139E-10 | 4.71 | 2.690812909 | 6.65E-08 | up |
| CXCL13 | C-X-C motif chemokine ligand 13 | magenta | 0.676038594 | 2.89E-03 | 0.808787134 | 8.47082E-05 | 20.9 | 2.692421845 | 0.00000669 | up |
| CYP2E1 | cytochrome P450 family 2 subfamily E member 1 | magenta | 0.813393515 | 7.1566E-05 | 0.937472578 | 2.85784E-08 | 4.5 | 2.3219305 | 4.78E-08 | up |
| CYP4B1 | cytochrome P450 family 4 subfamily B member 1 | magenta | 0.736111943 | 0.000754364 | 0.858320627 | 1.04106E-05 | 4.5 | 3.0752333 | 0.000000839 | up |
| DKK1 | dickkopf WNT signaling pathway inhibitor 1 | magenta | 0.697058489 | 0.001871854 | 0.815860925 | 6.53E-05 | 4.71 | 3.533265993 | 0.000000546 | up |
| FSTL3 | follistatin like 3 | magenta | 0.76382863 | 0.000358509 | 0.894831724 | 1.24E-06 | 5.067 | 2.997977144 | 0.000000244 | up |
| KLF4 | Kruppel like factor 4 | magenta | 0.725261009 | 0.000985574 | 0.884368596 | 2.45E-06 | 4.71 | 3.223838581 | 9.88E-08 | up |
| NR4A2 | nuclear receptor subfamily 4 group A member 2 | magenta | 0.800101777 | 0.000115047 | 0.864686086 | 7.52E-06 | 4.71 | 3.44866934 | 4.88E-08 | up |
| PROK1 | prokineticin 1 | magenta | 0.830706844 | 3.6373E-05 | 0.807977649 | 8.72E-05 | 11 | 4.082022638 | 1.93E-08 | up |
| WDR27 | WD repeat domain 27 | magenta | 0.790440633 | 0.000159037 | 0.950669894 | 5.02E-09 | 3 | 2.232448342 | 9.99E-08 | up |
WGCNA: weight gene co-expression network analysis, PPI: protein-protein interaction, GS: gene significance, MM: module membership, log2FC: log2 (Fold-Change) values of differentially expressed genes.
Figure 8Validation of hub genes in the independent GEO datasets. (A) Boxplot shows the hub gene expression in EC and Ctrl. (B) Boxplot shows the hub gene expression in EU and Ctrl (*p< 0.05 versus Ctrl).