Literature DB >> 35378934

Bioinformatical analysis identifies PDLIM3 as a potential biomarker associated with immune infiltration in patients with endometriosis.

Lei Gan1, Jiani Sun1, Jing Sun1.   

Abstract

Background: Endometriosis is a chronic systemic disease, whose classic symptoms are pelvic pain and infertility. This disease seriously reduces the life quality of patients. The pathogenesis, recognition and treatment of endometriosis is still unclear, and cannot be over emphasized. The aim of our study was to investigate the potential biomarker of endometriosis for the mechanism and treatment.
Methods: Using GSE11691, GSE23339 and GSE5108 datasets, differentially expressed genes (DEGs) were identified between endometriosis and normal samples. The functions of DEGs were reflected by the analysis of gene ontology (GO), pathway enrichment and gene set enrichment analysis (GSEA). The LASSO regression model was performed to identify candidate biomarkers. The receiver operating characteristic curve (ROC) was used to evaluate discriminatory ability of candidate biomarkers. The predictive value of the markers in endometriosis were further validated in the GSE120103 dataset. Then, the expression level of biomarkers was detected by qRT-PCR and Western blot. Finally, the relationship between candidate biomarker expression and immune infiltration was estimated using CIBERSORT.
Results: A total of 42 genes were identified, which were mainly involved in cytokine-cytokine receptor interaction, systemic lupus erythematosus and chemokine signaling pathway. We confirmed PDLIM3 was a specific biomarker in endometriosis (AUC = 0.955) and validated in the GSE120103 dataset (AUC = 0.836). The mRNA and protein expression level of PDLIM3 in endometriosis tissue was significantly higher than normal. Immune cell infiltration analysis revealed that PDLIM3 was correlated with M2 macrophages, neutrophils, CD4+ memory resting T cells, gamma delta T cells, M1 Macrophages, resting mast cells, follicular helper T cells, activated NK cells, CD8+ T cells, regulatory T cells (Tregs), naive B cells, plasma cells and resting NK cells.
© 2022 Gan et al.

Entities:  

Keywords:  Diagnostic gene biomarker; Endometriosis; Immune infiltration; Machine learning; PDLIM3; bioinformatic

Year:  2022        PMID: 35378934      PMCID: PMC8976475          DOI: 10.7717/peerj.13218

Source DB:  PubMed          Journal:  PeerJ        ISSN: 2167-8359            Impact factor:   2.984


Introduction

Recently, endometriosis is defined as a chronic systemic disease, characterized by the presence and infiltration of ectopic endometrial tissue (Taylor, Kotlyar & Flores, 2021). It affects nearly 10% of women who are of reproductive age. The most common symptoms are pelvic pain and infertility. Nearly 70% of teens with pelvic pain are later diagnosed with endometriosis (Yeung et al., 2011). Endometriosis seriously reduces the life quality of patients, increases incidence of depression and causes a heavy burden on the direct and indirect healthcare (Agarwal et al., 2019). The gold standard for endometriosis diagnosis is invasive laparoscopy examination. It is difficult to recognize because of patients’ nonspecific descriptions of symptoms, especially at early stages. Despite its prevalence, the average time from symptom onset to diagnosis is 5 to 10 years (Hudelist et al., 2012; Greene et al., 2009). The new opinion about diagnosis of endometriosis is emphasize disease symptoms and their origins rather than lesion presence or absence (Yeung et al., 2011). Specific circulating biomarkers, such as miR-125b-5p, miR-150-5p, miR-61, have been studied as promising early clinical diagnostic tools, although functional researches are needed to confirm their effect on this disease. Currently, medical therapy and surgical resection are the gold standard treatment for endometriosis (Vercellini et al., 2014) while both are associated with high recurrence rates and limited by high occurrence of side effects. The mechanisms underlying the pathogenesis and development of endometriosis will open the door to a new approach to treatment. Various hypotheses have been proposed to explain the underlying molecular mechanisms of endometriosis, consisting of the endometrial implantation theory of Sampson (Kruitwagen et al., 1991), eutopic endometrium determinism, stem cell factors (Burney & Giudice, 2012), epigenetic factors, immunodeficiency as well as immune and inflammatory factors (Ahn et al., 2015). Recent clinical and molecular studies have suggested that innate properties or acquired properties of the endometrium and impaired immune homeostasis are systems of significance in explaining the mechanism of endometriosis (Burney & Giudice, 2012). Immune imbalance is associated with increased implantation, proliferation, and angiogenesis of ectopic endometrium. Previous studies have shown that immune dysfunction happens in peritoneal fluid of endometriosis, which may contribute to the progression of endometriosis. Functional changes in immune components have been indicated, consisting of natural killer (NK) cells, monocytes/macrophages, B cells, T lymphocytes, and cytokines (Zou et al., 2021). Although numerous studies have described occurrence of immune abnormalities in endometriosis, the role of the immune system is not clearly understood. Hence, it is essential to explore mechanism of immune dysfunction in endometriosis, which can contribute to elucidate the role of immune system in pathogenesis of endometriosis and generate applicable insights to develop preventive and control strategies, original non-invasive diagnostic methods and targeted therapeutics. In the present study, we downloaded three microarray datasets (GSE11691, GSE23339, GSE5108) from the Gene Expression Omnibus (GEO) database to detect the potential biomarkers for further analysis. We identified candidate genes, explore immune mechanism and key signaling pathways regulating occurrence, development and progression of endometriosis through bioinformatics method. In conclusion, these findings provide novel insights into understanding the underlying immune mechanisms and seeking for potential molecular treatment.

Materials and Methods

Gene expression profile

We searched the associated gene expression profiles of endometriosis in NCBI-GEO (https://www.ncbi.nlm.nih.gov/geo/) (Barrett et al., 2005), using the keywords “endometriosis” and “Homo sapiens”. Three gene expression datasets containing microarray data from normal endometrial tissues and ectopic endometrial tissues, GSE11691 (Hull et al., 2008), GSE23339 (Hawkins et al., 2011) and GSE5108 (Eyster et al., 2007), were selected for further analysis. The GSE11691 microarray data consisted of nine normal human endometrium and nine ectopic human endometrium (two from the peritoneal wall, seven from the broad ligament) and was generated using the GPL96 platform Affymetrix Human Genome U133A Array. The GSE23339 microarray data consisted of nine normal human endometrium and 10 ectopic human endometrium (all from the endometriotic cysts of the ovaries) and was generated using the GPL6102 Illumina human-6 v2.0 expression beadchip. The GSE5108 microarray data consisted of 11 normal human endometrium and 11 ectopic human endometrium (five from the peritoneal wall, six from the endometriotic cysts of the ovaries) and was generated using the GPL2895 GE Healthcare/Amersham Biosciences CodeLink Human Whole Genome Bioarray. In addition, the GSE120103 (Bhat et al., 2019) dataset, containing 18 endometriosis (all from the endometriotic cysts of the ovaries) and 18 control samples, was used as the validation cohort using the GPL6480 platform Agilent-014850 Whole Human Genome Microarray 4 × 44K G4112F. GSE12768 (Ritchie et al., 2015) dataset, containing two endometriosis (both from the endometriotic cysts of the ovaries) and two control samples, was used as another validation cohort using the GPL7304 Institut Cochin HG18 60mer expression array 47K.

Identification of differentially expressed genes (DEGs)

The raw data of three datasets, GSE11691, GSE23339 and GSE5108, was merged and removed batch effects by “sva” package (http://www.bioconductor.org/) (Ritchie et al., 2015). Then, the “limma” package (http://www.bioconductor.org/) was used to attain DEGs between endometriosis samples and normal samples. The raw data of GSE12768 was analyzed separately to acquired the DEGs list, and compared with the list of three datasets in order to verify the reliability of our analysis. The significant DEGs were identified according to adjusted P value < 0.05 and ∣log fold change (FC)∣ > 2.

Functional and pathway enrichment analysis of DEGs

To reflect gene functions, gene ontology (GO) (http://geneontology.org/) (Thomas, 2017) biological processes were shown in three aspects: biological processes (BP), cellular component (CC) and molecular function (MF). And the analysis process was achieved by “clusterProfiler” package (Wu et al., 2021; Yu et al., 2012) in the present study, using the cut-off criterion P value < 0.05. ToppGene (https://toppgene.cchmc.org/enrichment.jsp) (Chen et al., 2009) is a free tool for uncovering the biological significance behind a cluster of genes. The REACTOME (https://reactome.org/) (Jassal et al., 2020) pathway enrichment analysis, was exactly performed using this tool. A P value < 0.05 was considered as the threshold points. Moreover, to get more information of mechanism behind DEGs, gene set enrichment analysis (GSEA) was employed to explore the potential feature between the endometriosis and control groups. The “c2.cp.kegg.v7.4.symbols.gmt” from the Molecular Signatures Database (MSigDB) was downloaded and used as the reference gene set. An adjusted P < 0.05 and false discovery rate (FDR) < 0.05 were considered as significant.

Protein–protein interaction (PPI) networks construction

STRING (https://string-db.org/) (Szklarczyk et al., 2015) is a tool designed to achieve a comprehensive and objective global network and visualize PPI information. Proteins associated with DEGs were chosen based on information in STRING (confidence score > 0.4), and then PPI networks were constructed using Cytoscape software (http://cytoscape.org/) (Shannon et al., 2003).

Screening of specific gene biomarkers

The least absolute shrinkage and selection operator (LASSO), a regression analysis algorithm, was applied to evaluate the value of DEGs in this study. The “glmnet” package in R did the above analysis and returned candidate potential genes. And the expression levels of candidate biomarkers were further tested in the GSE120103 dataset.

Receiver operating characteristic (ROC) curve analysis

To test the predictive accuracy of the identified biomarkers, we generated ROC curves using “pROC” package (Robin et al., 2011) based on the candidate genes and compare the predictive value of candidate genes by the area under the ROC curve (AUC). And then, the predictive value of these genes was also validated in the GSE120103 dataset.

Relationship between identified genes and infiltrating immune cells in endometriosis

We used the CIBERSORT (https://cibersortx.stanford.edu/) (Newman et al., 2015; Robin et al., 2011) to calculate immune cell infiltrations of endometriosis and control tissues. Then, we applied “corrplot” and “vioplot” R package to visualize the results and explore correlation of infiltrating immune cells. In addition, the relation of the candidate gene markers with the levels of infiltrating immune cells was calculated and visualized by “ggpubr” package.

Quantitative real-time PCR (qRT-PCR)

Total RNA was isolated from ovarian endometriosis and eutopic endometrium (NC). These operations were approved by Ethics committee of Shanghai First Maternity and Infant Hospital (KS21198). We used Trizol (RNAiso Plus; Takara, Tokyo, Japan) to isolate RNA from these two kinds of tissues according to the manufacturer’s instructions. Then, one microgram of RNA was reversely transcribed into cDNA using a PrimeScript™ RT reagent kit (Takara, Tokyo, Japan). Amplification was performed using gene-specific primers (Sangon, Shanghai, China) and TB Green® Premix EX TaqTM II (Takara, Tokyo, Japan) on a qRT-PCR device (QuantStudio5, Thermo Fisher Scientific, Waltham, MA, USA). β-actin was used as an internal control. The relative expression of the genes was calculated using the 2–ΔΔCT method. The primers used were as follows: human β-actin-Forward: 5′-GTGCTATGTTGCTCTAGACTTCG-3′; human β-actin-Reverse: 5′-ATGCCACAGGATTCCATACC-3′; human PDLIM3-Forward: 5′-GAGTCGGACGTGTACCGGA-3′; human PDLIM3-Reverse: 5′-TGCCACTCCCACATTTGTCAC-3′.

Protein extraction and western blot

Normal endometrium tissues and ovarian endometriosis tissues were lysed in RIPA lysis buffer (PC101; EpiZyme, Shanghai, China) with protease and phosphatase inhibitor cocktail. The protein concentration was detected by a BCA protein assay. The same amounts of total protein (32 µg) were loaded and separated onto 12.5% SDS-PAGE and was then transferred to PVDF membranes. After blocking with protein-free rapid blocking buffer (PS108P; EpiZyme, Shanghai, China) for 0.5 h at room temperature, the membranes were incubated with primary antibodies at 4 °C overnight. The primary antibodies rabbit anti-PDLIM3 antibody (1:2,000; Servicebio) was used and rabbit anti-β-Actin (1:5,000; ABclonal) served as the internal control. The membranes were probed with secondary antibodies (1:3,000, Servicebio) for 1.5 h at room temperature and three washes in TBST were processed to remove excess secondary antibody. We then used an ECL western blotting kit (NCM Biotech, Suzhou, China) to visualize and image the targeted protein bands.

Statistical analysis

Most statistical calculations were conducted using R (version 4.1.1). The results of qRT-PCR and western blot were shown as the mean ± SD and GraphPad Prism 8.0 (GraphPad Software, San Diego, CA, USA) were used for statistical analysis. All the comparisons between two groups were analyzed using Student’s t-test for normally distributed variables. If the variables distribution was abnormal, the Mann–Whitney U-test would be used. The relationship between the expression of biomarkers and infiltrating immune cells was analyzed using Spearman’s rank correlation analysis. Two-sided with P < 0.05 (95% confidence interval) and false discovery rate (FDR) < 0.05 were considered as statistically significant.

Results

Identification of DEGs in endometriosis

Data from a total of 30 endometriosis samples (14 peritoneal endometriosis samples, 16 ovarian endometriosis samples) and 29 control endometrium samples from three GEO datasets (GSE11691, GSE23339 and GSE5108) were analyzed. The DEGs were calculated using the “limma” package after removing the batch effects. We obtained a total of 42 DEGs, including upregulated 23 genes and downregulated 19 genes (Table 1). All DEGs were shown in the heatmap and volcano map (Fig. 1). In addition, the DEGs list from GSE12768 dataset was displayed in Table S1.
Table 1

The statistical metrics for key differentially expressed genes (DEGs).

Gene symbollogFCP Valueadj. P Valt valueRegulation
C73.7839990532.71E–131.35E–109.285264Up
PTGIS3.6040980223.16E–203.15E–1613.64407Up
FABP43.4685068923.05E–164.17E–1311.06606Up
PLA2G2A3.4209626612.02E–102.06E–087.609885Up
MYH112.9190649026.89E–105.46E–087.301042Up
WISP22.7507333391.73E–101.81E–087.648918Up
CFH2.6820222622.63E–164.17E–1311.10573Up
PDLIM32.6300600761.29E–148.06E–1210.07301Up
C32.6213357952.33E–126.85E–108.735987Up
CLDN112.614472176.34E–132.53E–109.067731Up
PLN2.5498766172.77E–114.53E–098.110245Up
FZD72.505395622.88E–164.17E–1311.08116Up
DPYSL32.3347331212.79E–181.39E–1412.35555Up
FMO12.2878115281.46E–151.62E–1210.64648Up
FAM129A2.2402307661.77E–141.04E–119.990994Up
S100A82.2371194059.13E–106.96E–087.230211Up
LY962.1951230383.34E–164.17E–1311.04102Up
CPE2.1605359791.47E–124.75E–108.852691Up
COL8A12.1485218447.63E–121.73E–098.435667Up
CXCL22.0824975493.79E–065.78E–055.0803Up
CD1632.068090142.63E–114.38E–098.12315Up
AOX12.0261803821.98E–091.32E–077.034879Up
CCL22.0060012472.29E–102.20E–087.578566Up
RORB–2.070362282.91E–081.12E–06–6.3538Down
TMPRSS4–2.120816884.98E–092.79E–07–6.80216Down
EHF–2.135975217.95E–106.25E–08–7.26498Down
SCNN1A–2.143401327.39E–119.84E–09–7.86305Down
MSX1–2.171018329.74E–107.30E–08–7.21392Down
KIAA1324–2.181486151.16E–101.37E–08–7.74893Down
ASRGL1–2.246122743.73E–103.42E–08–7.45546Down
PRSS8–2.272827312.86E–114.61E–09–8.10186Down
RAB25–2.352554821.18E–112.35E–09–8.32591Down
HSD17B2–2.364712964.09E–079.60E–06–5.67198Down
DEFB1–2.388493433.57E–081.32E–06–6.3015Down
GNLY–2.443901331.59E–086.89E–07–6.50697Down
MMP26–2.601946215.19E–071.15E–05–5.60914Down
AGR2–2.629569011.19E–085.42E–07–6.58042Down
CLDN3–2.674382952.53E–102.41E–08–7.55304Down
CD24–2.681741096.68E–119.13E–09–7.88849Down
GABRP–2.845710261.07E–097.82E–08–7.18967Down
SCGB1D2–2.979031011.58E–050.000193–4.68827Down
SCGB2A1–3.67344245.33E–092.93E–07–6.78496Down
Figure 1

Differentially expressed genes (DEGs) between endometriosis tissues and normal endometrium.

(A) Heatmap of DEGs. (B) Volcano Plot of DEGs.

Differentially expressed genes (DEGs) between endometriosis tissues and normal endometrium.

(A) Heatmap of DEGs. (B) Volcano Plot of DEGs.

GO and pathway enrichment of DEGs in endometriosis

The above list of DEGs was used to evaluate if they were enriched into specific GO or reactome pathways. GO analysis identified that the DEGs were significantly enriched in BP, including humoral immune response, complement activation, alternative pathway, antimicrobial humoral response, and so on (Table 2). In terms of MF, DEGs were mainly enriched in Toll-like receptor binding, G protein-coupled receptor binding, long-chain fatty acid binding, and chemokine receptor binding (Table 2). Moreover, CC demonstrated that filamentous actin, lipid droplet, sperm flagellum and actin filament were the most significantly enriched GO term (Table 2). The results were displayed in Figs. 2A and 2B. In addition, the GSEA results illustrated that the enriched pathways mainly involved cytokine–cytokine receptor interaction, systemic lupus erythematosus, chemokine signaling pathway, complement and coagulation cascades, and leishmania infection (Figs. 2C and 2D).
Table 2

The enriched GO terms of differentially expressed genes.

IDCategoryGO NameP Valueadj. P ValCountGene ID
GO:0006959 BPHumoral immune response4.84E–085.04E–059CFH/C7/C3/PLA2G2A/CCL2/S100A8/GNLY/DEFB1/CXCL2
GO:0006957 BPComplement activation, alternative pathway6.03E–060.0031413CFH/C7/C3
GO:0019730 BPAntimicrobial humoral response9.26E–060.0032165PLA2G2A/S100A8/GNLY/DEFB1/CXCL2
GO:0002237 BPResponse to molecule of bacterial origin6.14E–050.0159846LY96/FMO1/CD24/CCL2/S100A8/CXCL2
GO:0050727 BPRegulation of inflammatory response8.37E–050.017446PTGIS/FABP4/C3/PLA2G2A/S100A8/MMP26
GO:0050729 BPPositive regulation of inflammatory response0.0001450.0251494FABP4/C3/PLA2G2A/S100A8
GO:0097278 BPComplement-dependent cytotoxicity0.0001760.0262062CFH/C3
GO:0032496 BPResponse to lipopolysaccharide0.000470.0598675LY96/FMO1/CCL2/S100A8/CXCL2
GO:0061844 BPAntimicrobial humoral immune response mediated by antimicrobial peptide0.0005170.0598673GNLY/DEFB1/CXCL2
GO:0016338 BPCalcium-independent cell-cell adhesion via plasma membrane cell-adhesion molecules0.0009740.0922212CLDN11/CLDN3
GO:0031941 CCFilamentous actin0.0018580.1709312DPYSL3/PDLIM3
GO:0005811 CCLipid droplet0.0154510.1709312FABP4/CLDN11
GO:0036126 CCSperm flagellum0.0207080.1709312SCNN1A/DEFB1
GO:0005884 CCActin filament0.021410.1709312DPYSL3/PDLIM3
GO:0005923 CCBicellular tight junction0.0228460.1709312CLDN11/CLDN3
GO:0005788 CCEndoplasmic reticulum lumen0.0230030.1709313FMO1/C3/COL8A1
GO:0097729 CC9+2 motile cilium0.0239480.1709312SCNN1A/DEFB1
GO:0070160 CCTight junction0.0254510.1709312CLDN11/CLDN3
GO:1902495 CCTransmembrane transporter complex0.026450.1709313PLN/SCNN1A/GABRP
GO:0005859 CCMuscle myosin complex0.0295640.1709311MYH11
GO:0035325 MFToll-like receptor binding0.0002580.0142792LY96/S100A8
GO:0001664 MFG protein-coupled receptor binding0.0002710.0142795FZD7/C3/CCL2/DEFB1/CXCL2
GO:0036041 MFLong-chain fatty acid binding0.0003050.0142792FABP4/S100A8
GO:0042379 MFChemokine receptor binding0.0003620.0142793CCL2/DEFB1/CXCL2
GO:0005504 MFFatty acid binding0.0025240.0797472FABP4/S100A8
GO:0048020 MFCCR chemokine receptor binding0.0038770.0866812CCL2/DEFB1
GO:0005044 MFScavenger receptor activity0.0040450.0866812CD163/TMPRSS4
GO:0008009 MFChemokine activity0.0043890.0866812CCL2/CXCL2
GO:0033293 MFMonocarboxylic acid binding0.009020.1583412FABP4/S100A8
GO:0038024 MFCargo receptor activity0.0100220.1583412CD163/TMPRSS4
Figure 2

Functional enrichment analyses to identify potential biological processes.

(A, B) Gene ontology enrichment analysis of DEGs between endometriosis and control samples. (C, D) Enrichment analyses via gene set enrichment analysis.

Functional enrichment analyses to identify potential biological processes.

(A, B) Gene ontology enrichment analysis of DEGs between endometriosis and control samples. (C, D) Enrichment analyses via gene set enrichment analysis. REACTOME pathway enrichment analysis was also used to obtain the signaling pathways for different genes. In this study, these DEGs were mainly involved in the regulation of complement cascade, suprofen pathway, indomethacin pathway and mefenamic acid pathway (Table 3).
Table 3

The enriched pathway terms of differentially expressed genes.

IDPathway nameP ValueGene countGene ID
1269250Regulation of Complement cascade4.35E–053CFH, C3, C7
SMP00101Suprofen Pathway1.34E–042PLA2G2A, PTGIS
SMP00104Indomethacin Pathway1.34E–042PLA2G2A, PTGIS
SMP00109Mefanamic acid Pathway1.34E–042PLA2G2A, PTGIS
SMP00113Oxaprozin Pathway1.34E–042PLA2G2A, PTGIS
SMP00114Nabumetone Pathway1.34E–042PLA2G2A, PTGIS
SMP00120Naproxen Pathway1.34E–042PLA2G2A, PTGIS
SMP00289Diflunisal Pathway1.34E–042PLA2G2A, PTGIS
SMP00077Piroxicam Pathway1.34E–042PLA2G2A, PTGIS
SMP00085Ketoprofen Pathway1.34E–042PLA2G2A, PTGIS
SMP00086Ibuprofen Pathway1.34E–042PLA2G2A, PTGIS
SMP00094Sulindac Pathway1.34E–042PLA2G2A, PTGIS
SMP00093Diclofenac Pathway1.34E–042PLA2G2A, PTGIS
SMP00098Ketorolac Pathway1.34E–042PLA2G2A, PTGIS
1457777Antimicrobial peptides1.45E–044S100A8, PLA2G2A, DEFB1, GNLY
SMP00102Bromfenac Pathway1.78E–042PLA2G2A, PTGIS
M22072Alternative Complement Pathway2.28E–042C3, C7
1269241Complement cascade3.52E–043CFH, C3, C7
M39649Complement and Coagulation Cascades4.57E–043CFH, C3, C7
M4732Lectin Induced Complement Pathway4.91E–042C3, C7
M7146Classical Complement Pathway6.59E–042C3, C7
M16894Complement and coagulation cascades7.24E–043CFH, C3, C7
1427857Regulation of TLR by endogenous ligand7.52E–042S100A8, LY96
M39770Epithelial to mesenchymal transition in colorectal cancer7.92E–044FZD7, TMPRSS4, CLDN11, CLDN3
M3952Cells and Molecules involved in local acute inflammatory response8.51E–042C3, C7
M39733Cells and Molecules involved in local acute inflammatory response8.51E–042C3, C7
SMP00083Acetylsalicylic Acid Pathway8.51E–042PLA2G2A, PTGIS
PW:0000485eicosanoids metabolic9.56E–042PLA2G2A, PTGIS
MAP00590MAP00590 Prostaglandin and leukotriene metabolism1.07E–032PLA2G2A, PTGIS
83073Complement and coagulation cascades1.07E–033CFH, C3, C7
M917Complement Pathway1.18E–032C3, C7
SMP00106Meloxicam Pathway1.18E–032PLA2G2A, PTGIS
SMP00116Valdecoxib Pathway1.18E–032PLA2G2A, PTGIS
SMP00084Etodolac Pathway1.18E–032PLA2G2A, PTGIS
SMP00087Rofecoxib Pathway1.18E–032PLA2G2A, PTGIS
SMP00096Celecoxib Pathway1.18E–032PLA2G2A, PTGIS
M39502Complement Activation1.43E–032C3, C7
M39398Allograft Rejection1.52E–033C3, GNLY, C7
M264Endogenous TLR signaling1.71E–032S100A8, LY96
1474301IL-17 signaling pathway1.72E–033S100A8, CCL2, CXCL2
M39382Eicosanoid Synthesis2.00E–032PLA2G2A, PTGIS
M39581Human Complement System2.06E–033CFH, C3, C7
M4005616p11.2 distal deletion syndrome2.16E–032C3, PLN
PW:0000138Vitamin B6 metabolic2.58E–031AOX1
1270236Tight junction interactions2.84E–032CLDN11, CLDN3
M39341Spinal Cord Injury3.47E–033PLA2G2A, CCL2, CXCL2
M11355Tight junction4.64E–033CLDN11, CLDN3, MYH11

PPI networks construction and validation of predictive feature biomarkers

We uploaded a list of proteins associated with DEGs, removed the disconnected nodes and then acquired a PPI network using Cytoscape software (Fig. 3A). There were 26 nodes and 50 edges on this network.
Figure 3

Screening diagnostic biomarker candidates for endometriosis.

(A) A PPI network of DEGs. (B) A plot of biomarkers selection by LASSO regression algorithm. (C) The receiver operating characteristic (ROC) curve of FMO1, PDLIM3, FAM129A, CLDN11, C3, TMPRSS4 and DEFB1 in the metadata cohort. (D) ROC curve of FMO1, PDLIM3, FAM129A, CLDN11, C3, TMPRSS4 and DEFB1 in the GSE120103 dataset.

Screening diagnostic biomarker candidates for endometriosis.

(A) A PPI network of DEGs. (B) A plot of biomarkers selection by LASSO regression algorithm. (C) The receiver operating characteristic (ROC) curve of FMO1, PDLIM3, FAM129A, CLDN11, C3, TMPRSS4 and DEFB1 in the metadata cohort. (D) ROC curve of FMO1, PDLIM3, FAM129A, CLDN11, C3, TMPRSS4 and DEFB1 in the GSE120103 dataset. Furthermore, the LASSO algorithm was used to narrowed down the DEGs, leading to the identification of seven variables as specific biomarkers for endometriosis (Fig. 3B). The seven genes were FMO1, PDLIM3, FAM129A, CLDN11, C3, TMPRSS4 and DEFB1. And then, we used ROC curve to estimate the predictive value of genes. As shown in Fig. 3C, the ability of PDLIM3 in discriminating endometriosis from the control samples demonstrated a favorable diagnostic value, with an AUC of 0.955 (95% CI [0.894–0.997]). Moreover, this result was confirmed in the GSE120103 dataset with an AUC of 0.836 (95% CI [0.691–0.948]). Other six genes showed no significant difference in the GSE120103 dataset (Fig. 3D). At the same time, in order to generate more accurate and reliable results, the GSE120103 dataset was employed to verify the expression levels of the seven features. The expression levels of PDLIM3 in endometriosis tissue were notably higher than those in the normal tissue (Fig. 4A). There was no significant difference between the two groups in other six gene expression (Fig. 4A). Therefore, PDLIM3 became one of the potential specific biomarkers in endometriosis.
Figure 4

Validation of the expression of diagnostic biomarkers.

(A) The expression level of FMO1, PDLIM3, FAM129A, CLDN11, C3, TMPRSS4 and DEFB1 in the GSE120103 dataset. (B) The mRNA expression level of PDLIM3 in endometriosis and control samples. (C) The protein expression level of PDLIM3 in endometriosis and control samples. Mean ± SD, ***P < 0.001.

Validation of the expression of diagnostic biomarkers.

(A) The expression level of FMO1, PDLIM3, FAM129A, CLDN11, C3, TMPRSS4 and DEFB1 in the GSE120103 dataset. (B) The mRNA expression level of PDLIM3 in endometriosis and control samples. (C) The protein expression level of PDLIM3 in endometriosis and control samples. Mean ± SD, ***P < 0.001.

QRT-PCR and western blot

To validate the expression of PDLIM3, qRT-PCR and western blot were used to measure transcriptional and protein expression in both ovarian endometriosis and normal eutopic endometrium tissue from endometriosis-free women. We found PDLIM3 had higher expression level in endometriosis group than normal control (qRT-PCR: n = 6 vs. 6; Western blot: n = 7 vs. 4) (Figs. 4B and 4C). The patients of endometriosis ranged in age from 26 to 43 years (average age 32 years). The BMI was 21.76 ± 2.42 (mean ± SD). And, their rASRM endometriosis stage was III or IV. Normal group samples were from non-endometriosis patients, ranged in age from 31 to 41 years (average age 34 years). The BMI was 21.23 ± 1.21. There was no significant different in their general characteristics.

Correlation analysis between PDLIM3 and infiltrating immune cells

The composition of immune cells in endometriosis tissues and normal control tissues were shown in Fig. 5A. The proportions of naive B cells (P < 0.001), plasma cells (P = 0.012), CD8+ T cells (P = 0.013), follicular helper T cells (P < 0.001), regulatory T cells (Tregs) (P = 0.004), resting NK cells (P = 0.037), and activated NK cells (P < 0.001) in endometriosis tissues were significantly lower than those in control tissues. However, the proportions of CD4+ memory resting T cells (P < 0.001), gamma delta T cells (P = 0.031), M1 Macrophages (P = 0.020), M2 Macrophages (P < 0.001), and neutrophils (P = 0.002) in endometriosis tissues were significantly higher than those in normal tissues (Fig. 5B). However, in GSE120103, only the proportion of Tregs cell (P = 0.037) in endometriosis tissues were significantly lower than that in control tissues (Fig. S1).
Figure 5

Distribution and visualization of immune cell infiltration.

(A) The composition of immune cells in endometriosis tissues and normal control tissues. (B) Comparison of 22 immune cell subtypes between endometriosis tissues and normal tissues. Blue and red colors represent normal and endometriosis samples, respectively. (C) Correlation matrix of all 22 immune cell subtype compositions. Both horizontal and vertical axes demonstrate immune cell subtypes. Immune cell subtype compositions (higher, lower, and same correlation levels are displayed in red, blue, and white, respectively).

Distribution and visualization of immune cell infiltration.

(A) The composition of immune cells in endometriosis tissues and normal control tissues. (B) Comparison of 22 immune cell subtypes between endometriosis tissues and normal tissues. Blue and red colors represent normal and endometriosis samples, respectively. (C) Correlation matrix of all 22 immune cell subtype compositions. Both horizontal and vertical axes demonstrate immune cell subtypes. Immune cell subtype compositions (higher, lower, and same correlation levels are displayed in red, blue, and white, respectively). The association of 22 types of immune cells was calculated and drawn in Fig. 5C. Then, we found PDLIM3 was positively correlated with M2 Macrophages (r = 0.70, P < 0.001), neutrophils (r = 0.45, P < 0.001), CD4+ memory resting T cells (r = 0.44, P < 0.001), gamma delta T cells (r = 0.34, P = 0.0088), resting mast cells (r = 0.31, P = 0.019) and M1 Macrophages (r = 0.28, P = 0.037). However, PDLIM3 was negatively correlated with follicular helper T cells (r = –0.66, P < 0.001), activated NK cells (r = –0.46, P < 0.001), CD8+ T cells (r = –0.41, P = 0.0015), Tregs cells (r = –0.39, P = 0.0028), naive B cells (r = –0.37, P = 0.0047), plasma cells (r = –0.31, P = 0.018) and resting NK cells (r = –0.29, P = 0.031) (Fig. 6). Besides, PDLIM3 was significantly negatively correlated with activated mast cells (r = –0.69, P = 0.038) in GSE120103 (Fig. S1).
Figure 6

Correlation between PDLIM3 and infiltrating immune cells in endometriosis.

(A) Correlation between PDLIM3 and single type of immune cell. (B) A comprehensive correlation analysis of PDLIM3 and infiltrating immune cells in endometriosis.

Correlation between PDLIM3 and infiltrating immune cells in endometriosis.

(A) Correlation between PDLIM3 and single type of immune cell. (B) A comprehensive correlation analysis of PDLIM3 and infiltrating immune cells in endometriosis.

Discussion

The most widly accepted theory about pathogenesis of endometriosis is retrograde menstruation and endometrial implantation reported by Sampson et al. However, it cannot perfectly explain all aspects of endometriosis. More evidence determined a combination of genetic or epigenetic alterations leads to endometriosis (Saha et al., 2015; Dalsgaard et al., 2013; Bulun et al., 2019; Manolio et al., 2009). Therefore, it is critical to acquire a clear understanding of the molecular mechanisms underlying endometriosis if we are to improve the diagnosis and treatment of endometriosis. The main purpose of our study was to detect the potential biomarker of endometriosis for the mechanism and treatment. In this investigation, we compared three public datasets and a total of 42 DEGs were identified, including 23 upregulated genes and 19 downregulated genes. Another DEGs list of the validation set included most of 42 DEGs, which strengthen current results. The results of enrichment analyses suggested that gene function enriched by DEGs were mainly associated with immune system, such as humoral immune response, complement activation, alternative pathway, antimicrobial humoral response, cytokine–cytokine receptor interaction, systemic lupus erythematosus, chemokine signaling pathway, and regulation of complement cascade. Endometriosis is a disease with prolonged inflammatory response. The presence of ectopic tissues in peritoneal cavity, without clear mechanism, reaches a state of balance with a high level of proinflammatory cytokines and immune cells (Li et al., 2020). This evidence is consistent with our findings, proving that the results in the present study are accurate, as well as confirming that the immune response plays a vital role in endometriosis. Based on a machine-learning algorithm, seven variables were identified from 42 DEGs and then verified their diagnosis value by ROC curve. Finally, PDLIM3 was screened out due to it is highly expressed in ectopic endometrium and has a favorable diagnostic value in all datasets. There were totally 14 peritoneal endometriosis samples, 16 ovarian endometriosis samples in three train sets and 18 ovarian endometriosis samples in the fourth verification dataset. Pelvic endometriosis includes ovarian endometriosis (OME), deep infiltrating (DIE) and superficial (SUP) endometriosis. OME is the most common lesion of endometriosis (Dalsgaard et al., 2013), which mostly accounted for our training data and totally accounted for our validating data. Therefore, we believed four datasets used in our study could effectively reflect the clinical proportions of endometriosis of different types. On the other hand, indeed, the origin of samples, which was similar but not identical, between meta-analysis and validation could partly explain the no significant different gene expression for six of the seven genes initially identified. PDLIM3, also known as ALP (actin-associated LIM protein), is a member of the PDZ-LIM domain family (Xia et al., 1997). It is mainly contained a N-terminal PDZ domain and a C-terminal LIM domain. Previous studies revealed PDLIM3 was mainly expressed in skeletal muscle (Passier, Richardson & Olson, 2000) and could regulate skeletal muscle development (Pomiès et al., 2007; Ohsawa et al., 2011). Lak et al. (2021) identified PDLIM3 was one of the rhabdomyosarcoma markers and associated with clinical outcome. Wang et al. (2019) found that polymorphisms in PDLIM3 gene increased risk of idiopathic dilated cardiomyopathy. Other studies indicated PDLIM3 was also correlated to hypertrophic cardiomyopathy (Lopes et al., 2015; Bagnall, Yeates & Semsarian, 2010), atrial fibrillation (Vad et al., 2020), myofibrillar myopathy (Williams et al., 2021), the prognosis of hepatocellular carcinoma (Wu et al., 2021), early regional metastasis of tongue cancer (Lee et al., 2021), muscle-invasive bladder urothelial carcinoma (Feng et al., 2020) and medulloblastoma (Shou et al., 2015). It is the first time that PDLIM3 has been linked to endometriosis. We believe it will play an essential role in better understanding endometriosis mechanism and acting a potential target of therapy. Furthermore, we used CIBERSORT to assess the types of immune cell infiltration in endometriosis and normal samples. A variety of immune cell subtypes were potentially related to the occurrence and development of endometriosis. We performed correlation analysis between PDLIM3 and immune cells, PDLIM3 was found to be correlated with M2 macrophages, neutrophils, CD4+ memory resting T cells, gamma delta T cells, M1 Macrophages, resting mast cells, follicular helper T cells, activated NK cells, CD8+ T cells, Tregs cells, naive B cells, plasma cells and resting NK cells. And, we analyzed the immune microenvironment of validation dataset. The tissue compositions and sample size of the meta-analysis and GSE120103 were different, which may be able to explain the different results. Notably, these two datasets both showed the specificity of the proportion of Tregs cells. Previous studies indicated the alteration of Tregs cell and Th17 cell may lead to the occurrence and development of ectopic endometrium implantation and accelerate the progression of endometriosis towards advanced stage (Vallvé-Juanico, Houshdaran & Giudice, 2019; Khan et al., 2019; Le et al., 2021). Endometriosis is more than a pelvic disease. Changes in multiple systems, such as cardiovascular system, immune system and nervous system, have been found in patients with endometriosis (Taylor, Kotlyar & Flores, 2021). The shifts in circulating immune cell populations as well as the presence of proinflammatory cytokines create a widespread inflammatory environment. A precise control over various types of immune cells is demanded to achieve a safe and effective treatment on endometriosis. The novel biomarkers of endometriosis correlated with the magnitude of immune cell infiltration may be targets of therapy or predictors of therapeutic response. The limitations of this study should also be acknowledged. First, the study was retrospective; thus, important clinical information was unavailable. Second, immune cell infiltration and the functions of PDLIM3 in endometriosis were inferred by bioinformatics analysis, and future prospective studies with larger sample sizes should be employed to validate our findings. Third, the biomarker PDLIM3 in this study is derived from endometriosis tissue, which indicates that this biomarker is not suitable to be used for early clinical diagnosis. Further study should be performed to confirm the function of PDLIM3 in endometriosis.

Conclusions

In a word, our results strongly suggest that PDLIM3 plays a previously unidentified but critical role in pathogenesis and immune environment of endometriosis. Better understanding of PDLIM3 could have a highly beneficial impact on the clinical treatment and outcomes of endometriosis patients.

Raw data.

note: the gene "PINR-LEI" is the target gene "PDLIM3" Click here for additional data file.

The immune microenvironment of GSE120103 dataset.

(A) The composition of immune cells in endometriosis tissues and normal control tissues. (B) Comparison of 22 immune cell subtypes between endometriosis tissues and normal tissues. Blue and red colors represent normal and endometriosis samples, respectively. (C) A comprehensive correlation analysis of PDLIM3 and infiltrating immune cells in endometriosis. (D) Correlation between PDLIM3 and activated mast cell. Click here for additional data file. Click here for additional data file.
  47 in total

1.  Oracle, a novel PDZ-LIM domain protein expressed in heart and skeletal muscle.

Authors:  R Passier; J A Richardson; E N Olson
Journal:  Mech Dev       Date:  2000-04       Impact factor: 1.882

2.  Use of high-throughput targeted exome-sequencing to screen for copy number variation in hypertrophic cardiomyopathy.

Authors:  L R Lopes; C Murphy; P Syrris; C Dalageorgou; W J McKenna; P M Elliott; V Plagnol
Journal:  Eur J Med Genet       Date:  2015-10-09       Impact factor: 2.708

Review 3.  The endometrial immune environment of women with endometriosis.

Authors:  Júlia Vallvé-Juanico; Sahar Houshdaran; Linda C Giudice
Journal:  Hum Reprod Update       Date:  2019-09-11       Impact factor: 15.610

4.  Endometrial epithelial cells in peritoneal fluid during the early follicular phase.

Authors:  R F Kruitwagen; L G Poels; W N Willemsen; I J de Ronde; P H Jap; R Rolland
Journal:  Fertil Steril       Date:  1991-02       Impact factor: 7.329

5.  Differential Levels of Regulatory T Cells and T-Helper-17 Cells in Women With Early and Advanced Endometriosis.

Authors:  Khaleque N Khan; Kazuo Yamamoto; Akira Fujishita; Hideki Muto; Akemi Koshiba; Haruo Kuroboshi; Shigeru Saito; Satoshi Teramukai; Masahiro Nakashima; Jo Kitawaki
Journal:  J Clin Endocrinol Metab       Date:  2019-10-01       Impact factor: 5.958

Review 6.  Endometriosis: pathogenesis and treatment.

Authors:  Paolo Vercellini; Paola Viganò; Edgardo Somigliana; Luigi Fedele
Journal:  Nat Rev Endocrinol       Date:  2013-12-24       Impact factor: 43.330

7.  pROC: an open-source package for R and S+ to analyze and compare ROC curves.

Authors:  Xavier Robin; Natacha Turck; Alexandre Hainard; Natalia Tiberti; Frédérique Lisacek; Jean-Charles Sanchez; Markus Müller
Journal:  BMC Bioinformatics       Date:  2011-03-17       Impact factor: 3.307

8.  Actinin-associated LIM protein: identification of a domain interaction between PDZ and spectrin-like repeat motifs.

Authors:  H Xia; S T Winokur; W L Kuo; M R Altherr; D S Bredt
Journal:  J Cell Biol       Date:  1997-10-20       Impact factor: 10.539

9.  NCBI GEO: mining millions of expression profiles--database and tools.

Authors:  Tanya Barrett; Tugba O Suzek; Dennis B Troup; Stephen E Wilhite; Wing-Chi Ngau; Pierre Ledoux; Dmitry Rudnev; Alex E Lash; Wataru Fujibuchi; Ron Edgar
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.