| Literature DB >> 35710932 |
Shengxin Tang1, Yue Liu2, Bin Liu3.
Abstract
Pulmonary arterial hypertension (PAH) is a chronic cardiopulmonary syndrome with high pulmonary vascular load and eventually causing RV heart failure even death. However, the mechanism of pulmonary hypertension remains unclear. The purpose of this research is to detect the underlying key genes and potential mechanism of PAH using several bioinformatic methods. The microarrays GSE22356, GSE131793 and GSE168905 were acquired from the GEO. Subsequently, a host of bioinformatics techniques such as DAVID, STRING, R language and Cytoscape were utilized to investigate DEGs between PAH and healthy controls and conduct GO annotation, KEGG enrichment analysis and PPI network construction etc. Additionally, we predicted the transcription factors regulating DEGs through iRegulon plugin of Cytoscape and CIBERSORT was used to conduct immune infiltration analysis. One thousand two hundred and seventy-seven DEGs (403 up-regulated and 874 down-regulated) were identified from peripheral blood samples of 32 PAH patients and 29 controls, among which SLC4A1, AHSP, ALAS2, CA1, HBD, SNCA, HBM, SELENBP1, SERPINE1 and ITGA2B were detected as hub genes. The functional enrichment changes of DEGs were mainly enriched in protein binding, extracellular exosome, extracellular space, extracellular region and integral component of plasma membrane. The hub genes are chiefly enriched at extracellular exosome, hemoglobin complex, blood microparticle, oxygen transporter activity. Among TF-DEGs network, 42 target DEGs and 6 TFs were predicted with an NES > 4 (TEAD4, TGIF2LY, GATA5, GATA1, GATA2, FOS). Immune infiltration analysis showed that monocytes occupied the largest proportion of immune cells. The trend analysis results of infiltration immune cells illustrated that PAH patients had higher infiltration of NK cell activation, monocyte, T cell CD4 memory activation, and mast cell than healthy controls and lower infiltration of T cell CD4 naive. We detected SLC4A1, AHSP, ALAS2, CA1, HBD, SNCA, HBM, SELENBP1, SERPINE1 and ITGA2B as the most significant markers of PAH. The PAH patients had higher infiltration of NK cell activation, monocyte, T cell CD4 memory activation, and mast cell than healthy controls and lower infiltration of T cell CD4 naive. These identified genes and these immune cells probably have precise regulatory relationships in the development of PAH.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35710932 PMCID: PMC9203517 DOI: 10.1038/s41598-022-14307-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Flow chart of the analysis program in this study.
Figure 2The identification of EDGs. (A–C) The heat-maps of differential expression genes of GSE22356, GSE131793 and GSE168905 (P < 0.05, |log FC|> 1). Up-regulated genes were in red and down-regulated genes were in blue. (D–F) Volcano plot of genes identified in PAH of GSE22356, GSE131793 and GSE168905 (P < 0.05, |log FC|> 0.5). Red dots represent up-regulated and green dots represent down-regulated.
The top 10 BPs, CCs, MFs of GO functional annotation analyses of DEGs.
| Term | Count | Gene ID | p. adjust |
|---|---|---|---|
| GO:0030099 ~ myeloid cell differentiation | 8 | SLC4A1/LTF/ITGA2B/INHBA/CCR7/BPGM/ALAS2/AHSP | 0.001688296 |
| GO:0048872 ~ homeostasis of number of cells | 7 | SLC4A1/INHBA/IL7R/CCR7/BPGM/ALAS2/AHSP | 0.001033927 |
| GO:0032496 ~ response to lipopolysaccharide | 7 | SNCA/SERPINE1/SELP/LTF/DEFA1B/CCR7/ADM | 0.001688296 |
| GO:0007596 ~ blood coagulation | 7 | SLC4A1/SERPINE1/SELP/KRT1/ITGB3/ITGA2B/HBD | 0.001688296 |
| GO:0007599 ~ hemostasis | 7 | SLC4A1/SERPINE1/SELP/KRT1/ITGB3/ITGA2B/HBD | 0.001688296 |
| GO:0050817 ~ coagulation | 7 | SLC4A1/SERPINE1/SELP/KRT1/ITGB3/ITGA2B/HBD | 0.001688296 |
| GO:0002237 ~ response to molecule of bacterial origin | 7 | SNCA/SERPINE1/SELP/LTF/DEFA1B/CCR7/ADM | 0.001791109 |
| GO:0006959 ~ humoral immune response | 7 | MS4A1/LTF/KRT1/DEFA1B/CCR7/CCR6/C3AR1 | 0.002283905 |
| GO:0043312 ~ neutrophil degranulation | 7 | TNFAIP6/S100P/LTF/KRT1/HP/DEFA1B/C3AR1 | 0.007504148 |
| GO:0002283 ~ neutrophil activation involved in immune response | 7 | TNFAIP6/S100P/LTF/KRT1/HP/DEFA1B/C3AR1 | 0.007506536 |
| GO:0034774 ~ secretory granule lumen | 6 | TIMP3/SERPINE1/S100P/LTF/HP/DEFA1B | 0.001216318 |
| GO:0060205 ~ cytoplasmic vesicle lumen | 6 | TIMP3/SERPINE1/S100P/LTF/HP/DEFA1B | 0.001216318 |
| GO:0031983 ~ vesicle lumen | 6 | TIMP3/SERPINE1/S100P/LTF/HP/DEFA1B | 0.001216318 |
| GO:0009897 ~ external side of plasma membrane | 6 | SELP/MS4A1/ITGA2B/IL7R/CCR7/CCR6 | 0.003575905 |
| GO:0031091 ~ platelet alpha granule | 5 | SNCA/SERPINE1/SELP/ITGB3/ITGA2B | 6.39342E-05 |
| GO:0072562 ~ blood microparticle | 5 | SLC4A1/KRT1/ITGA2B/HP/HBD | 0.000440622 |
| GO:0030667 ~ secretory granule membrane | 5 | SNCA/SELP/ITGB3/ITGA2B/C3AR1 | 0.006030401 |
| GO:0062023 ~ collagen-containing extracellular matrix | 5 | TIMP3/SERPINE1/KRT1/DEFA1B/COL4A1 | 0.018115583 |
| GO:0031092 ~ platelet alpha granule membrane | 4 | SNCA/SELP/ITGB3/ITGA2B | 5.75912E-06 |
| GO:0031838 ~ haptoglobin-hemoglobin complex | 3 | HP/HBM/HBD | 6.39342E-05 |
| GO:0043177 ~ organic acid binding | 6 | TNFAIP6/SELP/PLOD2/HBM/HBD/ALAS2 | 0.002918932 |
| GO:0019955 ~ cytokine binding | 4 | NOG/ITGB3/CCR7/CCR6 | 0.011923107 |
| GO:0140375 ~ immune receptor activity | 4 | IL7R/CCR7/CCR6/C3AR1 | 0.011923107 |
| GO:0004866 ~ endopeptidase inhibitor activity | 4 | TIMP3/SNCA/SERPINE1/LTF | 0.021672983 |
| GO:0030414 ~ peptidase inhibitor activity | 4 | TIMP3/SNCA/SERPINE1/LTF | 0.021672983 |
| GO:0061135 ~ endopeptidase regulator activity | 4 | TIMP3/SNCA/SERPINE1/LTF | 0.021672983 |
| GO:0031406 ~ carboxylic acid binding | 4 | TNFAIP6/SELP/PLOD2/ALAS2 | 0.021672983 |
| GO:0061134 ~ peptidase regulator activity | 4 | TIMP3/SNCA/SERPINE1/LTF | 0.025123719 |
| GO:0004857 ~ enzyme inhibitor activity | 4 | TIMP3/SNCA/SERPINE1/LTF | 0.095869699 |
| GO:0019956 ~ chemokine binding | 3 | ITGB3/CCR7/CCR6 | 0.0065716 |
The top 12 KEGG enrichment pathway analysis of DEGs.
| Term | Count | Genes | P value |
|---|---|---|---|
| hsa04640 ~ Hematopoietic cell lineage | 4 | MS4A1/ITGB3/ITGA2B/IL7R | 0.000392423 |
| hsa04060 ~ Cytokine-cytokine receptor interaction | 4 | INHBA/IL7R/CCR7/CCR6 | 0.020856078 |
| hsa04350 ~ TGF-beta signaling pathway | 3 | NOG/INHBA/ID1 | 0.004707872 |
| hsa04512 ~ ECM-receptor interaction | 3 | ITGB3/ITGA2B/COL4A1 | 0.004707872 |
| hsa04510 ~ Focal adhesion | 3 | ITGB3/ITGA2B/COL4A1 | 0.046127511 |
| hsa05144 ~ Malaria | 2 | SELP/HBD | 0.017943455 |
| hsa05150 ~ Staphylococcus aureus infection | 2 | SELP/C3AR1 | 0.021407095 |
| hsa00010 ~ Glycolysis / Gluconeogenesis | 2 | BPGM/ALDH1A3 | 0.028279271 |
| hsa04610 ~ Complement and coagulation cascades | 2 | SERPINE1/C3AR1 | 0.031583311 |
| hsa05412 ~ Arrhythmogenic right ventricular cardiomyopathy (ARVC) | 2 | ITGB3/ITGA2B | 0.035918255 |
| hsa05410 ~ Hypertrophic cardiomyopathy (HCM) | 2 | ITGB3/ITGA2B | 0.044264599 |
| hsa05222 ~ Small cell lung cancer | 2 | ITGA2B/COL4A1 | 0.046209667 |
Figure 3GO and KEGG analysis of DEGs between PAH and healthy groups. (A–C) The top 10 GO annotation results of DEGs. (D) KEGG pathway analyses of DEGs (top 10 according to enrichment score).
Figure 4(A) The PPI network of DEGs was established using string and Cytoscape, containing 33 nodes and 57 edges. The upregulated genes are red and downregulated are green. (B) The top 10 hub genes were detected from the PPI network. (C) GO terms of the top 10 key genes.
Summary of the function of 10 key genes.
| Gene symbol | Full name | Function |
|---|---|---|
| SLC4A1 | Solute carrier family 4 member 1 | PART of the anion exchanger (AE) family, expressed in the erythrocyte plasma membrane, where it functions as a chloride/bicarbonate exchanger involved in carbon dioxide transport from tissues to lungs;This protein is predominantly dimeric but forms tetramers in the presence of ankyrin. Many mutations in this gene are known in man, and these mutations can lead to two types of disease: destabilization of red cell membrane leading to hereditary spherocytosis, and defective kidney acid secretion leading to distal renal tubular acidosis |
| AHSP | Alpha hemoglobin stabilizing protein | Encodes a molecular chaperone which binds specifically to free alpha-globin and is involved in hemoglobin assembly. The encoded protein binds to monomeric alpha-globin until it has been transferred to beta-globin to form a heterodimer, which in turn binds to another heterodimer to form the stable tetrameric hemoglobin. Diseases associated with AHSP include Beta-Thalassemia and Thalassemia. GO annotations related to this gene include unfolded protein binding and hemoglobin binding. Acts as a chaperone to prevent the harmful aggregation of alpha-hemoglobin during normal erythroid cell development. Specifically protects free alpha-hemoglobin from precipitation |
| ALAS2 | 5′-Aminolevulinate synthase 2 | An erythrocyte-specific mitochondrial localization enzyme and that step one of the heme biosynthetic pathway is catalyzed by this product; Blemishes on ALAS2 will lead to the development of X-linked pyridoxine-responsive sideroblastic anemia.Diseases associated with ALAS2 include Anemia, Sideroblastic, 1 and Protoporphyria, Erythropoietic, X-Linked. Among its related pathways are Metabolism and Metabolism of porphyrins. Gene Ontology (GO) annotations related to this gene include pyridoxal phosphate binding and glycine binding |
| CA1 | Carbonic anhydrase 1 | Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide. They participate in a variety of biological processes, including respiration, calcification, acid–base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva and gastric acid. They show extensive diversity in tissue distribution and in their subcellular localization. This CA1 gene is closely linked to the CA2 and CA3 genes on chromosome 8. It encodes a cytosolic protein that is found at the highest level in erythrocytes. Diseases associated with CA1 include Subacute Thyroiditis and Transient Global Amnesia. Among its related pathways are Metabolism and Reversible hydration of carbon dioxide. Gene Ontology (GO) annotations related to this gene include carbonate dehydratase activity and hydro-lyase activity |
| HBD | Hemoglobin subunit delta | The delta (HBD) gene is normally expressed in the adult, Two alpha chains plus two delta chains constitute HbA-2, which with HbF comprises the adult hemoglobin. Mutations in the delta-globin gene are associated with hemoglobin lepore-beta-thalassemia syndrome and fetal hemoglobin quantitative trait locus1. Among its related pathways are Factors involved in megakaryocyte development and platelet production and Response to elevated platelet cytosolic Ca2 + . Gene Ontology (GO) annotations related to this gene include iron ion binding and oxygen binding |
| SNCA | Synuclein alpha | a member of the synuclein family, which also includes beta- and gamma-synuclein. Synucleins are abundantly expressed in the brain and alpha- and beta-synuclein inhibit phospholipase D2 selectively. serve to integrate presynaptic signaling and membrane trafficking. SNCA peptides are a major component of amyloid plaques in the brains of patients with Alzheimer's disease. Alternatively spliced transcripts encoding different isoforms have been identified for this gene. Diseases associated with SNCA include Dementia, Lewy Body and Parkinson Disease 1, Autosomal Dominant. Gene Ontology (GO) annotations related to this gene include calcium ion binding and enzyme binding |
| HBM | Hemoglobin subunit Mu | HBM gene has an ORF encoding a 141 aa polypeptide which is similar to the delta globins found in reptiles and birds. Diseases associated with HBM include Alcohol-Induced Mental Disorder and Alcoholic Psychosis. Gene Ontology (GO) annotations related to this gene include iron ion binding and oxygen binding |
| SELENBP1 | Serpin family E member 1 | A member of the selenium-binding protein family. Selenium is an essential nutrient that exhibits potent anticarcinogenic properties, and deficiency of selenium may cause certain neurologic diseases. The effects of selenium in preventing cancer and neurologic diseases may be mediated by selenium-binding proteins, and decreased expression of this gene may be associated with several types of cancer. The encoded protein may play a selenium-dependent role in ubiquitination/ deubiquitination-mediated protein degradation |
| SERPINE1 | Serpin family E member 1 | A member of the serine proteinase inhibitor (serpin) superfamily. This member is the principal inhibitor of tissue plasminogen activator (tPA) and urokinase (uPA), and hence is an inhibitor of fibrinolysis. The protein also functions as a component of innate antiviral immunity. Defects in this gene are the cause of plasminogen activator inhibitor-1 deficiency (PAI-1 deficiency), and high concentrations of the gene product are associated with thrombophilia. Among its related pathways are G-protein signaling Ras family GTPases in kinase cascades (scheme) and hypothesized pathways in pathogenesis of cardiovascular disease. GO annotations related to this gene include signaling receptor binding and protease binding |
| ITGA2B | Pro-platelet basic protein | a member of the integrin alpha chain family of proteins. The encoded preproprotein is proteolytically processed to generate light and heavy chains that associate through disulfide linkages to form a subunit of the alpha-IIb/beta-3 integrin cell adhesion receptor. This receptor plays a crucial role in the blood coagulation system, by mediating platelet aggregation. Mutations in this gene are associated with platelet-type bleeding disorders, which are characterized by a failure of platelet aggregation, including Glanzmann thrombasthenia |
Figure 5The TF-DEGs network including 42 DEGs and 6 TFs with an NES > 4.
Summary of the function of 6 TFs.
| Transcription factor | Full name | Function |
|---|---|---|
| TEAD4 | TEA domain transcription factor 4 | a member of the transcriptional enhancer factor (TEF) family of transcription factors, which contain the TEA/ATTS DNA-binding domain. It is preferentially expressed in the skeletal muscle, and binds to the M-CAT regulatory element found in promoters of muscle-specific genes to direct their gene expression. Alternatively spliced transcripts encoding distinct isoforms, some of which are translated through the use of a non-AUG (UUG) initiation codon |
| TGIF2LY | TGFB induced factor homeobox 2 like y-linked | a member of the TALE/TGIF homeobox family of transcription factors. This gene lies within the male specific region of chromosome Y, in a block of sequence that is thought to be the result of a large X-to-Y transposition. The C-terminus of this protein is divergent from that of its chromosome X homolog (TGIF2LX), suggesting that this protein may act as a regulator of TGIF2LX |
| GATA5 | GATA binding protein 5 | a transcription factor that contains two GATA-type zinc fingers. The encoded protein is known to bind to hepatocyte nuclear factor-1alpha (HNF-1alpha), and this interaction is essential for cooperative activation of the intestinal lactase-phlorizin hydrolase promoter. In other organisms, similar proteins may be involved in the establishment of cardiac smooth muscle cell diversity |
| GATA1 | GATA binding protein 1 | Transcriptional activator or repressor which probably serves as a general switch factor for erythroid development. It binds to DNA sites with the consensus sequence 5'-[AT]GATA[AG]-3' within regulatory regions of globin genes and of other genes expressed in erythroid cells. Activates the transcription of genes involved in erythroid differentiation of K562 erythroleukemia cells, including HBB, HBG1/2, ALAS2 and HMBS |
| GATA2 | GATA binding protein 2 | a transcription factor involved in stem cell maintenance with key roles in hematopoietic development. GATA2 mutations are associated with a variety of inherited and acquired immune disorders including myelodysplastic syndrome and acute myeloid leukemia. In addition to a role in hematopoiesis, the maintenance GATA2 expression has been implicated as a requirement in KRAS-driven non-small cell lung cancer. Preclinical models have indicated therapeutic benefit from targeting GATA2-mediated pathways in the context of KRAS-driven NSCLC |
| FOS | Fos proto-oncogene | The Fos gene family consists of 4 members: FOS, FOSB, FOSL1, and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation. In some cases, expression of the FOS gene has also been associated with apoptotic cell death |
Figure 6Summary of immune cell subpopulations between normal controls and PAH. (A) Composition of 22 kinds of immune cells subsets from GSE22356, GSE131793 and GSE168905 datasets. (B) Heatmap shows differences between the 22 subpopulations cell types and samples. (C) Correlation matrix of the 22 immunocyte proportions in healthy control and PAH samples. Red colors represent positive and blue colors represent negative correlations. (D) The difference of immune cells infiltration between 9 normal controls and 12 PAH (blue color represent normal controls group and red colors represent PAH group. P values < 0.05).