| Literature DB >> 36194576 |
Babajan Banaganapalli1,2, Bayan Mallah1,2, Kawthar Saad Alghamdi3, Walaa F Albaqami4, Dalal Sameer Alshaer1, Nuha Alrayes2,5, Ramu Elango1,2, Noor A Shaik1,2.
Abstract
Chronic obstructive pulmonary disease (COPD) is a multifactorial progressive airflow obstruction in the lungs, accounting for high morbidity and mortality across the world. This study aims to identify potential COPD blood-based biomarkers by analyzing the dysregulated gene expression patterns in blood and lung tissues with the help of robust computational approaches. The microarray gene expression datasets from blood (136 COPD and 6 controls) and lung tissues (16 COPD and 19 controls) were analyzed to detect shared differentially expressed genes (DEGs). Then these DEGs were used to construct COPD protein network-clusters and functionally enrich them against gene ontology annotation terms. The hub genes in the COPD network clusters were then queried in GWAS catalog and in several cancer expression databases to explore their pathogenic roles in lung cancers. The comparison of blood and lung tissue datasets revealed 63 shared DEGs. Of these DEGs, 12 COPD hub gene-network clusters (SREK1, TMEM67, IRAK2, MECOM, ASB4, C1QTNF2, CDC42BPA, DPF3, DET1, CCDC74B, KHK, and DDX3Y) connected to dysregulations of protein degradation, inflammatory cytokine production, airway remodeling, and immune cell activity were prioritized with the help of protein interactome and functional enrichment analysis. Interestingly, IRAK2 and MECOM hub genes from these COPD network clusters are known for their involvement in different pulmonary diseases. Additional COPD hub genes like SREK1, TMEM67, CDC42BPA, DPF3, and ASB4 were identified as prognostic markers in lung cancer, which is reported in 1% of COPD patients. This study identified 12 gene network- clusters as potential blood based genetic biomarkers for COPD diagnosis and prognosis.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36194576 PMCID: PMC9531836 DOI: 10.1371/journal.pone.0274629
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1B Analysis of COPD differentially expressed genes (DEGs) in comparison to corresponding controls (A) Volcano plots of log fold changes in gene expression. (B) Identification of 63 common DEGs from blood and lung tissue datasets using VENNY. The overlapped area defines the shared DEGs of lung tissue and blood. (C) Heatmap of DEGs with a LogFC > 1.5. Red: up-regulation; green: down-regulation.
Fig 2Hub genes TMEM67, IRAK2, ASB4, MECOM, DPF3 and DDX3Y with their clusters identified from the common DEGs between blood and lung tissue datasets.
Their selection is based on degree of centrality in the PPI network with the score >18.
A total of 12 significant genes with more than 17 of DC were obtained from network analysis and chosen as hub proteins.
| S.No | Name | Degree | BetweennessCentrality | ClosenessCentrality | Clustering Coefficient |
|---|---|---|---|---|---|
| 1 | SREK1 | 77 | 0.004 | 0.456 | 0.177 |
| 2 | TMEM67 | 54 | 0.012 | 0.386 | 0.0405 |
| 3 | IRAK2 | 43 | 0.004 | 0.412 | 0.129 |
| 4 | MECOM | 31 | 0.002 | 0.394 | 0.234 |
| 5 | ASB4 | 29 | 5.15E-04 | 0.390 | 0.122 |
| 6 | C1QTNF2 | 28 | 0.005 | 0.369 | 0.010 |
| 7 | CDC42BPA | 26 | 0.005 | 0.415 | 0.0289 |
| 8 | DPF3 | 24 | 2.37E-04 | 0.359 | 0.471 |
| 9 | DET1 | 23 | 3.08E-04 | 0.384 | 0.260 |
| 10 | CCDC74B | 22 | 0.001 | 0.358 | 0.835 |
| 11 | KHK | 19 | 0.002 | 0.381 | 0.073 |
| 12 | DDX3Y | 17 | 2.80E-04 | 0.413 | 0.051 |
Functional enrichment of CCDC74B, MECOM, IRAK2 and DET1 clusters, in which highlights highest functional enrichment in different molecular processes like molecular function (MF), biological process (BP), cellular components (CC) and KEGG pathways based on FDR value.
| DEG Clusters | Ontology | Term ID | Term Description | Observed Gene Count | FDR |
|---|---|---|---|---|---|
|
|
| GO:0016579 | Protein Deubiquitination | 20 | 1.64E-31 |
| GO:0006511 | Ubiquitin-Dependent Protein Catabolic Process | 19 | 2.42E-25 | ||
| GO:0043687 | Post-Translational Protein Modification | 18 | 3.09E-25 | ||
|
| GO:0036402 | Proteasome-activating ATPase activity | 6 | 1.28E-13 | |
| GO:0017025 | TBP-class protein binding | 6 | 5.02E-11 | ||
| GO:0008134 | Transcription factor binding | 8 | 6.89E-06 | ||
|
| GO:0005838 | Proteasome Regulatory Particle | 19 | 2.98E-48 | |
| GO:0000502 | Proteasome Complex | 20 | 1.65E-44 | ||
| GO:0031597 | Cytosolic Proteasome Complex | 10 | 2.16E-24 | ||
|
| hsa03050 | Proteasome | 16 | 5.94E-36 | |
| hsa05169 | Epstein-Barr virus infection | 16 | 9.78E-27 | ||
|
|
| GO:0006357 | Regulation of transcription by RNA polymerase II | 29 | 3.62E-19 |
| GO:0000122 | Negative regulation of transcription by RNA polymerase II | 21 | 8.33E-19 | ||
| GO:0045892 | Negative regulation of transcription, DNA-templated | 23 | 8.55E-19 | ||
|
| GO:0043565 | Sequence-specific DNA binding | 22 | 1.03E-18 | |
| GO:0140110 | Transcription regulator activity | 25 | 7.51E-17 | ||
| GO:1990837 | Sequence-specific double-stranded DNA binding | 18 | 7.02E-16 | ||
|
| GO:0005654 | Nucleoplasm | 27 | 4.42E-14 | |
| GO:0031981 | Nuclear Lumen | 28 | 5.76E-14 | ||
| GO:0000785 | Chromatin | 13 | 1.54E-11 | ||
|
| hsa05220 | Chronic myeloid leukemia | 7 | 7.23E-09 | |
| hsa05200 | Pathways in cancer | 11 | 2.25E-08 | ||
| hsa04068 | FoxO signaling pathway | 7 | 8.41E-08 | ||
| IRAK2 |
| GO:0070498 | Interleukin-1-Mediated Signaling Pathway | 11 | 9.63E-16 |
| GO:0071347 | Cellular Response To Interleukin-1 | 12 | 6.86E-14 | ||
| GO:0002757 | Immune Response-Activating Signal Transduction | 15 | 1.62E-13 | ||
|
| GO:0004672 | Protein Kinase Activity | 11 | 2.03E-05 | |
| GO:0016301 | Kinase Activity | 12 | 2.03E-05 | ||
| GO:0140096 | Catalytic Activity, Acting On A Protein | 17 | 5.41E-05 | ||
|
| GO:0010008 | Endosome Membrane | 8 | 0.0015 | |
| GO:0044433 | Cytoplasmic Vesicle Part | 13 | 0.0015 | ||
| GO:0044440 | Endosomal Part | 8 | 0.0015 | ||
|
| hsa04064 | NF-Kappa B Signaling Pathway | 9 | 7.07E-11 | |
| hsa04620 | Toll-Like Receptor Signaling Pathway | 9 | 7.70E-11 | ||
| hsa05133 | Pertussis | 7 | 1.19E-08 | ||
|
|
| GO:0042176 | Regulation Of Protein Catabolic Process | 11 | 1.42E-10 |
| GO:0045732 | Positive Regulation Of Protein Catabolic Process | 9 | 1.15E-09 | ||
| GO:1903362 | Regulation Of Cellular Protein Catabolic Process | 9 | 2.44E-09 | ||
|
| GO:0031625 | Ubiquitin Protein Ligase Binding | 9 | 8.27E-09 | |
| GO:0048156 | Tau Protein Binding | 3 | 0.0001 | ||
| GO:0004842 | Ubiquitin-Protein Transferase Activity | 6 | 0.00015 | ||
|
| GO:0080008 | Cul4-RING E3 Ubiquitin Ligase Complex | 7 | 2.55E-12 | |
| GO:0000151 | Ubiquitin Ligase Complex | 10 | 3.77E-11 | ||
| GO:0031464 | Cul4A-RING E3 Ubiquitin Ligase Complex | 5 | 3.07E-10 | ||
|
| hsa04120 | Ubiquitin Mediated Proteolysis | 9 | 2.32E-12 | |
| Hh | Nucleotide Excision Repair | 4 | 1.09E-05 | ||
| hsa05215 | Prostate Cancer | 4 | 0.00012 |
Fig 3GO-annotations stacked network view of (A) MECOM and (B) IRAK2 clusters. The size of the circle (left side) represents the number of genes involved in a specific GO-term. The GWAS loci of (C) MECOM (D) IRAK2 genes from the GWAS catalog.
Association of two hub genes with the lung related traits and lung cancer from GWAS catalog database.
| Gene | Variant and risk allele | P-value | Reported trait | Trait(s) | Study accession |
|---|---|---|---|---|---|
|
| rs114743735 | 6 x 10–11 | Eosinophil percentage of white cells | eosinophil percentage of leukocytes | GCST004600 |
| rs114743735 | 1 x 10–10 | Eosinophil counts | eosinophil count | GCST004606 | |
| rs115820364- | 1 x 10–26 | Eosinophil counts | eosinophil count | GCST90002298 | |
| rs115820364 | 3 x 10–24 | Eosinophil counts | eosinophil count | GCST90002302 | |
| rs115820364 | 1 x 10–24 | Eosinophil counts | eosinophil count | GCST007065 | |
| rs114743735 | 6 x 10–9 | Sum eosinophil basophil counts | basophil count, eosinophil count | GCST004624 | |
|
| rs1344555 | 3 x 10–8 | Pulmonary function | pulmonary function measurement, forced expiratory volume | GCST001251 |
| rs1344555 | 4 x 10–6 | Pulmonary function (smoking interaction) | pulmonary function measurement, forced expiratory volume, smoking behaviour measurement | GCST001784 | |
| rs11721111 | 8 x 10–6 | Chronic obstructive pulmonary disease | chronic obstructive pulmonary disease | GCST007692 | |
| rs78101726 | 5 x 10–16 | Lung function (FVC) | vital capacity | GCST007429 | |
| rs78101726 | 8 x 10–25 | FEV1 | forced expiratory volume | GCST007432 | |
| rs78101726 | 4 x 10–8 | Lung function (FEV1/FVC) | FEV/FEC ratio | GCST007431 | |
| rs17485347 | 3 x 10–9 | Asthma | asthma | GCST010043 | |
| rs191494905 | 1 x 10–11 | Lung function (FEV1/FVC) | FEV/FEC ratio | GCST007080 | |
| rs6763377 | 9 x 10–10 | Lung function (FEV1/FVC) | FEV/FEC ratio | GCST007080 | |
| rs10936584 | 3 x 10–18 | Lung function (FVC) | vital capacity | GCST007081 | |
| rs6806825 | 5 x 10–12 | Lung function (FVC) | vital capacity | GCST007081 | |
| rs419076 | 2 x 10–24 | Diastolic blood pressure (cigarette smoking interaction) | smoking status measurement, diastolic blood pressure | GCST006187 | |
| rs419076 | 4 x 10–22 | Systolic blood pressure (cigarette smoking interaction) | smoking status measurement, systolic blood pressure | GCST006188 |
Fig 4Expression levels in lung adenocarcinoma and lung squamous cell carcinoma cells in compression to normal tissues from GEPIA2.
A) SREK1. B) IRAK2. C) DDX3Y. D) C1QTNF2. The signature score is calculated by mean value of log2 (TPM + 1). The |Log2FC| cutoff of the expression of proposed biomarker was 1. The p-value cutoff of the expression of proposed biomarker was 0.01. The red box indicates the tumor samples while the gray one represents the normal tissues. E. Pathological Stage Plot of SREK1, IRAK2, DDX3Y and C1QTNF2 genes in lung cancer.
Fig 5The prognostic values (patient survival in days) of the expression status of 6 COPD-hub genes.
A) SREK1 (P<0.001). B) TMEM67 (P = 0.002). C) CDC42BPA (P = 0.003). D) DPF3 (P = 0.005). E) ASB4 (P = 0.024) F) IRAK2 (P = 0106). The correlation of survival status of patients with different lung cancer subtypes (Squamous, Adeno, Large) to all six genes expression level.
Fig 6The expression of the two hub genes from the Human Protein Atlas (HPA) in normal and cancer lung tissue.
The expression levels of the 10 hub genes in normal lung and cancer tissues: Human Protein Atlas (HPA).
| Genes | Normal Tissue Staining | Cancer Lung Tissue (Tumor cell) | ||||
|---|---|---|---|---|---|---|
| Cell | Staining | Quantity | Staining | Quantity | Type of Cancer | |
|
| Macrophage | Not detected | None | Medium | 75%-25% | LUAD |
| Pneumonocyte | Not detected | None | ||||
|
| Macrophage | Medium | >75% | Medium | 75%-25% | LUSC |
| Pneumonocyte | Medium | >75% | ||||
|
| Macrophage | Low | <25% | Low | <25% | LUSC |
| Pneumonocyte | Not Detected | None | ||||
|
| Macrophage | Medium | >75% | Medium | >75% | LUSC |
| Pneumonocyte | Low | 75%-25% | ||||
|
| Macrophage | High | >75% | High | >75% | LUSC |
| Pneumonocyte | High | >75% | ||||
|
| Macrophage | Not detected | None | High | 75%-25% | LUSC |
| Pneumonocyte | Not detected | None | ||||
|
| Macrophage | Medium | 75%-25% | High | >75% | LUSC |
| Pneumonocyte | Medium | 75%-25% | ||||
|
| Macrophage | Not Detected | <25% | Not Detected | None | LUAD |
| Pneumonocyte | Not detected | <25% | ||||
|
| Macrophage | Not detected | None | High | >75% | LAUD |
| Pneumonocyte | Medium | 75%-25% | ||||
|
| Macrophage | Medium | >75% | Low | >75% | LAUD |
| Macrophage | Not detected | None | ||||