Literature DB >> 36194576

Integrative weighted molecular network construction from transcriptomics and genome wide association data to identify shared genetic biomarkers for COPD and lung cancer.

Babajan Banaganapalli1,2, Bayan Mallah1,2, Kawthar Saad Alghamdi3, Walaa F Albaqami4, Dalal Sameer Alshaer1, Nuha Alrayes2,5, Ramu Elango1,2, Noor A Shaik1,2.   

Abstract

Chronic obstructive pulmonary disease (COPD) is a multifactorial progressive airflow obstruction in the lungs, accounting for high morbidity and mortality across the world. This study aims to identify potential COPD blood-based biomarkers by analyzing the dysregulated gene expression patterns in blood and lung tissues with the help of robust computational approaches. The microarray gene expression datasets from blood (136 COPD and 6 controls) and lung tissues (16 COPD and 19 controls) were analyzed to detect shared differentially expressed genes (DEGs). Then these DEGs were used to construct COPD protein network-clusters and functionally enrich them against gene ontology annotation terms. The hub genes in the COPD network clusters were then queried in GWAS catalog and in several cancer expression databases to explore their pathogenic roles in lung cancers. The comparison of blood and lung tissue datasets revealed 63 shared DEGs. Of these DEGs, 12 COPD hub gene-network clusters (SREK1, TMEM67, IRAK2, MECOM, ASB4, C1QTNF2, CDC42BPA, DPF3, DET1, CCDC74B, KHK, and DDX3Y) connected to dysregulations of protein degradation, inflammatory cytokine production, airway remodeling, and immune cell activity were prioritized with the help of protein interactome and functional enrichment analysis. Interestingly, IRAK2 and MECOM hub genes from these COPD network clusters are known for their involvement in different pulmonary diseases. Additional COPD hub genes like SREK1, TMEM67, CDC42BPA, DPF3, and ASB4 were identified as prognostic markers in lung cancer, which is reported in 1% of COPD patients. This study identified 12 gene network- clusters as potential blood based genetic biomarkers for COPD diagnosis and prognosis.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 36194576      PMCID: PMC9531836          DOI: 10.1371/journal.pone.0274629

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.752


1. Introduction

Chronic obstructive pulmonary disease (COPD) is a progressive airflow obstruction in the lungs which slowly becomes apparent after the 40th or 50th year of age [1]. With a global prevalence of 251 million, COPD disease is currently the fourth leading cause of global deaths and ranked fifth in terms of disease burden [2, 3]. The primary characteristics of the disease are lung inflammation, breathing difficulties, airflow blockage, emphysema, long term cough with mucus, chronic bronchitis, and refractory asthma [4]. Although cigarette smoking is the most well-known significant risk factor for COPD, other factors such as tuberculosis history and environmental exposure to lung irritants (such as indoor air pollutants and occupational dust) are also known to contribute to modifying disease causality and severity [5, 6]. Chronic inflammation is thought to be responsible for pathologic changes such as narrowing of airways in the lungs and destruction of the lung parenchyma, with an underlying role of genetic, epigenetic, and environmental factors [7]. Genetic studies of twins [8], first degree relatives [9] and sporadic COPD cases [10] have all confirmed the role of heritability, which explains at least 30% of the variation in COPD risk. For so long, the genetic basis of COPD has come from Mendelian syndromes, where rare pathogenic variants in ELN and FBLN5 genes cause cutis laxa and SERPINA1 causes α1-antitrypsin deficiency [11]. Genome-wide association studies have reported the strong association of over 20 genetic loci with COPD and a few additional loci for COPD-related phenotypes like hypoxemia, chronic bronchitis, and emphysema [12]. The molecular basis of COPD, however, could not be fully explained by candidate genetic variants alone, but also by changes in global gene expression. Besides providing an unbiased assessment of thousands of genes in the disease etiology, global gene expression could also potentially help in developing personalized medicine. However, analysis and interpretation of such massive gene expression data is so complex and challenging. A few studies have attempted to analyze gene expression changes in COPD patients’ blood samples in recent years [13-16]. However, the correlation of common gene expression dysregulations between blood and lung tissue samples from COPD patients is not well explored. Recent deployment of advanced statistics and integrative bioinformatics methods, incorporating gene network graphs, unsupervised clustering, and functional annotations of pathways, has provided a new dimension to explore the microarray gene expression datasets to discover the molecular basis of different genetic pathologies [17-19]. Therefore, the objective of this study is to expand our current understanding of COPD pathogenesis and to identify potential genetic biomarkers. By involving a series of comprehensive bioinformatics approaches, this study has identified several gene-network clusters involved in cell communication, inflammation, proliferation, and differentiation processes, are dysregulated in blood and lung tissues of COPD patients. Our findings provide an insight into understanding the mechanisms of COPD and its potential link with lung cancer, besides uncovering genetic markers with potential for disease diagnosis and therapeutic modulation.

2. Materials and methods

2.1 Microarray gene expression datasets

The NCBI-GEO and EBL-EBI Array Express databases were used to search for COPD gene expression datasets using the keywords like “COPD”, “COPD blood”, and “COPD tissue”. From the output, we selected two COPD gene expression datasets, i.e. GSE8581 and GSE54837 for our study. The first dataset (GSE8581) consists of gene expression data, from 35 lung tissues, which were collected from 16 COPD subjects (with FEV1 < 70% predicted and FEV1/FVC < 0.7) and 19 controls (with FEV1 > 80% predicted and FEV1/FVC > 0.7), generated on the Affymetrix U133 Plus 2.0 array [20]. The second dataset, GSE54837 includes the expression data generated on GPL570 platform (Affymetrix, Santa Clara, CA, USA) from the blood samples of 136 COPD patients and 6 controls (ex-smokers) [21].

2.2 Data preprocessing and analysis

The microarray gene expression data analysis was performed using R/Bioconductor (http://www.R-project.org/). The raw data extracted in.CEL format was normalized into expression values using the Bioconductor-Affy package for the standardization and background correction of the probe data [22]. The limma package was then used to select the statistical significance of the differentially expressed genes between normal and COPD samples by applying the t-test statistical method. The Benjamini-Hochberg method was used to calculate the false discovery rate (FDR) of all the statistically significant genes to enable the removal of false positive ones [24]. The cutoff value for DEGs was set as FDR < 0.01 and |log2 FC| > 1.5. A p-value of less than 0.05 was considered as statistically significant. The expression values of DEGs were divided into up- and down-regulated genes and visualized using the Heatmap online webtool (http://www.heatmapper.ca).

2.3 Gene ontology and functional enrichment analyses

Gene Ontology (GO) and KEGG pathway (https://www.genome.jp/kegg/pathway.html) enrichment analysis of DEGs was conducted using STRING database (http://string-db.org). The significant GO terms and pathways were chosen at a threshold of adjusted p< 0.05 and FDR of 0.05. The GO annotation networks were visualized in the Cytoscape network style plugin (http://www.cytoscape.org/)).

2.4 Construction of protein-protein interaction (PPI) map

The potential PPI networks from the lung and blood DEGs were constructed using Bisogenet, a cytoscape plugin (version 3.4.0). Furthermore, the network clusters from PPI interactions were identified with the help of network analyzer tool. The cut-off value of input nodes and their neighbors was up to a distance of 1 edge. During the creation of PPIM, only protein-protein interactions were selected, excluding protein-DNA interactions and microRNA silencing interactions. Each node represents a gene connected with edges which are physical or functional between the nodes. Therefore, few nodes have a large number of edges while several nodes have low connectivity [23].

2.5 Hub gene subnetwork construction

PPIM is considered to be a large-scale network. By following the network biology concepts, the PPIM complex was decomposed into significant subnetwork clusters of Significant Protein Interaction Network (SPIN). Based on degree centrality (DC) and betweenness centrality (BC) parameters, several genes were extracted. Each protein captured in the network was incorporated and standardized into Cytoscape 3.2.1 using Network Analyzer to calculate local degree centrality (DC) and global betweenness centrality (BC) parameters of the network [24].

2.6 Genome wide association study analysis

The hub genes from the above gene-network clusters were searched in the GWAS catalog database (https://www.ebi.ac.uk/gwas/) to check their association with COPD risk. Variant details like reference and alternate alleles, population frequency, genome wide association value (P-value of <5 × 10−8), reported trait, and accession number of the study were collected. We have also used another genotype-phenotype association database, PhenoScanner V2 (http://www.phenoscanner.medschl.cam.ac.uk/) to cross reference the association of hub genes with COPD risk. Each hub gene name was searched in the database generated tables, which contain trait specific associations of each gene and genome wide association values for its variants (P-value of <5 × 10−8).

2.7 Lung cancer expression analysis

We used three different databases to investigate the expression status of the COPD-hub genes in lung cancer tissues: Gene Expression Profiling Interactive Analysis (GEPIA2), Gene Expression Database of Normal and Tumor Tissues (GENT2), and Human Protein Atlas (HPA). Gene Expression Profiling Interactive Analysis (GEPIA2) (http://gepia2.cancer-pku.cn) was used to provide tumor/normal differential expression analysis. The signature score of hub genes is calculated by mean value of log2 (TPM + 1). The |Log2FC| of 1 and an expression value cutoff of 0.01 (p-value) were determined in Lung Adenocarcinoma (LUAD) and Lung Squamous Carcinoma (LUSC) tissues. The Gene Expression Database of Normal and Tumor Tissues GENT2 (http://gent2.appex.kr/gent2/) platform was used to explore the gene expression patterns across normal and tumor tissues generated from public gene expression data sets. The survival rate status of hub genes in lung cancer and its histological subtypes (adenocarcinomas, large and squamous) represented by Kaplan Meier plots at 95% confidence intervals (CI) and computed log rank p-value was determined. The human protein atlas (https://www.proteinatlas.org/) database was used to explore the expression status of each hub gene in human non-malignant and lung cancer tissues. This database takes the query gene or protein name and provides the information about that candidate protein expression based on the primary antibody staining data with a series of immunohistochemistry images of the corresponding clinical specimens.

3. Results

3.1 Differently expressed gene (DEGs) identification

A total of 54,675 probes were expressed in both datasets. In the human lung tissue dataset (E-GEOD-8581), 678 DEGs including 247 up- and 431 down-regulated genes were identified, whereas, blood dataset (GSE54837) showed the differential expression of 724 DEGs including 499 up- and 225 down-regulated genes. Comparison of both datasets revealed the shared expression of 63 genes (Fig 1A). The expression level of DEGs of COPD patient samples (both tissue and blood) is shown in the form of heatmaps and volcano plots (Fig 1B and 1C).
Fig 1

B Analysis of COPD differentially expressed genes (DEGs) in comparison to corresponding controls (A) Volcano plots of log fold changes in gene expression. (B) Identification of 63 common DEGs from blood and lung tissue datasets using VENNY. The overlapped area defines the shared DEGs of lung tissue and blood. (C) Heatmap of DEGs with a LogFC > 1.5. Red: up-regulation; green: down-regulation.

B Analysis of COPD differentially expressed genes (DEGs) in comparison to corresponding controls (A) Volcano plots of log fold changes in gene expression. (B) Identification of 63 common DEGs from blood and lung tissue datasets using VENNY. The overlapped area defines the shared DEGs of lung tissue and blood. (C) Heatmap of DEGs with a LogFC > 1.5. Red: up-regulation; green: down-regulation.

3.2 PPI network analysis and significant genes clusters

Bisogenet, a Cytoscape plugin analysis of DEGs from both datasets generated a complex PPIM network of 1072 nodes (genes) and 20079 edges (interactions). The average edge-node ratio was 18.73 (S1 Fig). In the context of the PPIM network, protein interactions within the same group of clusters are assumed to have similar functions to the less interconnected regions or different cluster groups. Therefore, the Network Analyzer plugin was applied to find significant hub genes with the highest degree of centrality. A total of 12 significant genes and clusters with a degree of centrality of >17 were identified from network analysis (Fig 2; S2 Fig) and chosen as hub proteins (Table 1).
Fig 2

Hub genes TMEM67, IRAK2, ASB4, MECOM, DPF3 and DDX3Y with their clusters identified from the common DEGs between blood and lung tissue datasets.

Their selection is based on degree of centrality in the PPI network with the score >18.

Table 1

A total of 12 significant genes with more than 17 of DC were obtained from network analysis and chosen as hub proteins.

S.NoNameDegreeBetweennessCentralityClosenessCentralityClustering Coefficient
1SREK1770.0040.4560.177
2TMEM67540.0120.3860.0405
3IRAK2430.0040.4120.129
4MECOM310.0020.3940.234
5ASB4295.15E-040.3900.122
6C1QTNF2280.0050.3690.010
7CDC42BPA260.0050.4150.0289
8DPF3242.37E-040.3590.471
9DET1233.08E-040.3840.260
10CCDC74B220.0010.3580.835
11KHK190.0020.3810.073
12DDX3Y172.80E-040.4130.051

Hub genes TMEM67, IRAK2, ASB4, MECOM, DPF3 and DDX3Y with their clusters identified from the common DEGs between blood and lung tissue datasets.

Their selection is based on degree of centrality in the PPI network with the score >18.

3.3 GO annotation analysis

Gene Ontology annotation is the process by which functional categories of genes are assigned. The GO annotations of 12 COPD gene clusters showed their enrichment in cell-cell communication, cell regulation, immune processes, transcription factors regulation and ubiquitin pathways. Four of these 12 COPD -gene clusters, CCDC74B, MECOM, IRAK2 and DET1 have shown the lowest FDR values (Table 2), which reflects their highest functional enrichment in i molecular function (MF), biological process (BP), cellular components (CC) categories and KEGG pathways. For the CCDC74B cluster, GO enrichment highlights its involvement in ‘ubiquitin pathways and protein modification’, under the biological processes category, ‘Protein Deubiquitination’ (GO:0016579) was the top GO term. The other top GO enriched terms falling into remaining categories are as follows; ‘proteasome-activating ATPase activity’ in MF, ‘Proteasome Regulatory Particle’ (GO:0005838) in CC and ‘Protein degradation’ (hsa03050) in KEGG pathways. IRAK2 cluster was highly involved in signaling pathways and Kinase activity. The GO term in BP highlighted ‘Interleukin-1-Mediated Signaling Pathway’ (GO:0070498), which mediates cytokine responses during inflammation. The MF ontology source showed ‘Protein Kinase Activity’ (GO:0004672) and ‘Catalytic Activity Acting on A Protein’ (GO:0140096) as top GO terms. The CC ontology source was mainly enriched in ‘endosome membrane’ (GO:0010008) and ‘Cytoplasmic Vesicle Part’ (GO:0044433). KEGG underlined GO terms which are responsible for cytokine production and regulating the immune response like ‘NF-Kappa B Signaling Pathway’ (hsa04064) and ‘Toll-Like Receptor Signaling Pathway’ (hsa04620). DET1 cluster was mostly reported in relation to protein degradation processes. The BP ontology source highlighted ‘Regulation of Protein Catabolic Process’ (GO:0042176) as the top GO term. MF ontology source identified ‘Ubiquitin Protein Ligase Binding’ (GO:0031625) as the significant GO term. ‘Cul4-RING E3 Ubiquitin Ligase Complex’ (GO:0080008) are the top CC terms, while ‘Ubiquitin Mediated Proteolysis’ (hsa04120) and ‘Nucleotide Excision Repair’ (hsa03420) was the significant KEGG pathways. (Fig 3A). The MECOM cluster was highly enriched in regulation of transcription by ‘RNA polymerase II’ as top BP GO term (GO:0006357). Top MF term was ‘transcription regulator activity’ (GO:0140110). ‘Nucleoplasm’ (GO:0005654) and ‘Nuclear Lumen’ (GO:0031981) are the top CC terms, and ‘Pathways in cancer’ was the significant KEGG pathway (Fig 3B).
Table 2

Functional enrichment of CCDC74B, MECOM, IRAK2 and DET1 clusters, in which highlights highest functional enrichment in different molecular processes like molecular function (MF), biological process (BP), cellular components (CC) and KEGG pathways based on FDR value.

DEG ClustersOntologyTerm IDTerm DescriptionObserved Gene CountFDR
CDC74B Biological Process (BP) GO:0016579Protein Deubiquitination201.64E-31
GO:0006511Ubiquitin-Dependent Protein Catabolic Process192.42E-25
GO:0043687Post-Translational Protein Modification183.09E-25
Molecular Function (MF) GO:0036402Proteasome-activating ATPase activity61.28E-13
GO:0017025TBP-class protein binding65.02E-11
GO:0008134Transcription factor binding86.89E-06
Cellular Component (CC) GO:0005838Proteasome Regulatory Particle192.98E-48
GO:0000502Proteasome Complex201.65E-44
GO:0031597Cytosolic Proteasome Complex102.16E-24
KEGG Pathways (KP) hsa03050Proteasome165.94E-36
hsa05169Epstein-Barr virus infection169.78E-27
MECOM Biological Process (BP) GO:0006357Regulation of transcription by RNA polymerase II293.62E-19
GO:0000122Negative regulation of transcription by RNA polymerase II218.33E-19
GO:0045892Negative regulation of transcription, DNA-templated238.55E-19
Molecular Function (MF) GO:0043565Sequence-specific DNA binding221.03E-18
GO:0140110Transcription regulator activity257.51E-17
GO:1990837Sequence-specific double-stranded DNA binding187.02E-16
Cellular Component (CC) GO:0005654Nucleoplasm274.42E-14
GO:0031981Nuclear Lumen285.76E-14
GO:0000785Chromatin131.54E-11
KEGG Pathways (KP) hsa05220Chronic myeloid leukemia77.23E-09
hsa05200Pathways in cancer112.25E-08
hsa04068FoxO signaling pathway78.41E-08
IRAK2 Biological Process (BP) GO:0070498Interleukin-1-Mediated Signaling Pathway119.63E-16
GO:0071347Cellular Response To Interleukin-1126.86E-14
GO:0002757Immune Response-Activating Signal Transduction151.62E-13
Molecular Function (MF) GO:0004672Protein Kinase Activity112.03E-05
GO:0016301Kinase Activity122.03E-05
GO:0140096Catalytic Activity, Acting On A Protein175.41E-05
Cellular Component (CC) GO:0010008Endosome Membrane80.0015
GO:0044433Cytoplasmic Vesicle Part130.0015
GO:0044440Endosomal Part80.0015
KEGG Pathways (KP) hsa04064NF-Kappa B Signaling Pathway97.07E-11
hsa04620Toll-Like Receptor Signaling Pathway97.70E-11
hsa05133Pertussis71.19E-08
DET1 Biological Process (BP) GO:0042176Regulation Of Protein Catabolic Process111.42E-10
GO:0045732Positive Regulation Of Protein Catabolic Process91.15E-09
GO:1903362Regulation Of Cellular Protein Catabolic Process92.44E-09
Molecular Function (MF) GO:0031625Ubiquitin Protein Ligase Binding98.27E-09
GO:0048156Tau Protein Binding30.0001
GO:0004842Ubiquitin-Protein Transferase Activity60.00015
Cellular Component (CC) GO:0080008Cul4-RING E3 Ubiquitin Ligase Complex72.55E-12
GO:0000151Ubiquitin Ligase Complex103.77E-11
GO:0031464Cul4A-RING E3 Ubiquitin Ligase Complex53.07E-10
KEGG Pathways (KP) hsa04120Ubiquitin Mediated Proteolysis92.32E-12
HhNucleotide Excision Repair41.09E-05
hsa05215Prostate Cancer40.00012
Fig 3

GO-annotations stacked network view of (A) MECOM and (B) IRAK2 clusters. The size of the circle (left side) represents the number of genes involved in a specific GO-term. The GWAS loci of (C) MECOM (D) IRAK2 genes from the GWAS catalog.

GO-annotations stacked network view of (A) MECOM and (B) IRAK2 clusters. The size of the circle (left side) represents the number of genes involved in a specific GO-term. The GWAS loci of (C) MECOM (D) IRAK2 genes from the GWAS catalog. The functional enrichment values of the remaining 8 gene clusters (CDC42BPA, DPF3, SREK1, TMEM67, ASB4, DDX3Y, KHK, and C1QTNF2) are shown in Fig 3A and 3B. CDC42BPA cluster predicted its participation in ‘Fc Gamma R-Mediated Phagocytosis’ (hsa04666). ‘Legionellosis’ (hsa05134) was mainly enriched by DPF3 cluster. While the TMEM67 and ASB4 clusters were mostly involved in "Proteasome" (hsa03050) and "Ubiquitin-mediated proteolysis" (hsa04120), the SREK1 cluster was mostly involved in ‘RNA binding’ (GO:0003723). The ‘Pathways in cancer’ (hsa05200) was significant GO term in DDX3Y cluster. Lastly, KHK and C1QTNF2 were mainly enriched in ‘metabolic process’ (GO:0044238) and ‘extracellular matrix-receptor interaction’ (hsa04512) respectively.

3.4 Examining the role of hub genes in data from COPD genome wide association studies (GWAS)

Both the GWAS catalog and PhenoScanner databases were used to collate the genetic association data of hub genes with the risk of COPD development. The GWAS data findings of 12 COPD-hub genes, include variant IDs, reference and alternate alleles, significance of association (P = 5 × 10−8), phenotypic traits associated with the query genes, etc. GWAS findings revealed the association of IRAK2 with eosinophil count alterations usually manifested in inflammatory conditions (Table 3). For the MECOM hub gene, the associated traits are pulmonary complications including COPD, asthma and lung function (Fig 4C and 4D No significant GWAS data linking the remaining 10 COPD-hub genes (SREK1, TMEM67, ASB4, C1QTNF2, CDC42BPA, DPF3, DET1, CCDC74B, KHK, and DDX3Y) with any kind of lung disease was found. In PhenoScanner, 6 out of 12 hub genes (IRAK2, MECOM, ASB4, CDC42BPA, DPF3 and TMEM67) have revealed an association with lung related traits and lung cancer (S1 Table). IRAK2 is associated with lung cancer and a high eosinophil count. Genotype-phenotype associations of MECOM highlighted pulmonary function interaction, lung cancer, and COPD with acute exacerbation. Both ASB4 and CDC42BPA showed an association with COPD with acute lower respiratory infection. The DPF3 gene is associated with COPD and squamous cell carcinoma, lung cancer. Lastly, TMEM67 is associated with lung cancer.
Table 3

Association of two hub genes with the lung related traits and lung cancer from GWAS catalog database.

GeneVariant and risk alleleP-valueReported traitTrait(s)Study accession
IRAK2 rs1147437356 x 10–11Eosinophil percentage of white cellseosinophil percentage of leukocytesGCST004600
rs1147437351 x 10–10Eosinophil countseosinophil countGCST004606
rs115820364-1 x 10–26Eosinophil countseosinophil countGCST90002298
rs1158203643 x 10–24Eosinophil countseosinophil countGCST90002302
rs1158203641 x 10–24Eosinophil countseosinophil countGCST007065
rs1147437356 x 10–9Sum eosinophil basophil countsbasophil count, eosinophil countGCST004624
MECOM rs13445553 x 10–8Pulmonary functionpulmonary function measurement, forced expiratory volumeGCST001251
rs13445554 x 10–6Pulmonary function (smoking interaction)pulmonary function measurement, forced expiratory volume, smoking behaviour measurementGCST001784
rs117211118 x 10–6Chronic obstructive pulmonary diseasechronic obstructive pulmonary diseaseGCST007692
rs781017265 x 10–16Lung function (FVC)vital capacityGCST007429
rs781017268 x 10–25FEV1forced expiratory volumeGCST007432
rs781017264 x 10–8Lung function (FEV1/FVC)FEV/FEC ratioGCST007431
rs174853473 x 10–9AsthmaasthmaGCST010043
rs1914949051 x 10–11Lung function (FEV1/FVC)FEV/FEC ratioGCST007080
rs67633779 x 10–10Lung function (FEV1/FVC)FEV/FEC ratioGCST007080
rs109365843 x 10–18Lung function (FVC)vital capacityGCST007081
rs68068255 x 10–12Lung function (FVC)vital capacityGCST007081
rs4190762 x 10–24Diastolic blood pressure (cigarette smoking interaction)smoking status measurement, diastolic blood pressureGCST006187
rs4190764 x 10–22Systolic blood pressure (cigarette smoking interaction)smoking status measurement, systolic blood pressureGCST006188
Fig 4

Expression levels in lung adenocarcinoma and lung squamous cell carcinoma cells in compression to normal tissues from GEPIA2.

A) SREK1. B) IRAK2. C) DDX3Y. D) C1QTNF2. The signature score is calculated by mean value of log2 (TPM + 1). The |Log2FC| cutoff of the expression of proposed biomarker was 1. The p-value cutoff of the expression of proposed biomarker was 0.01. The red box indicates the tumor samples while the gray one represents the normal tissues. E. Pathological Stage Plot of SREK1, IRAK2, DDX3Y and C1QTNF2 genes in lung cancer.

Expression levels in lung adenocarcinoma and lung squamous cell carcinoma cells in compression to normal tissues from GEPIA2.

A) SREK1. B) IRAK2. C) DDX3Y. D) C1QTNF2. The signature score is calculated by mean value of log2 (TPM + 1). The |Log2FC| cutoff of the expression of proposed biomarker was 1. The p-value cutoff of the expression of proposed biomarker was 0.01. The red box indicates the tumor samples while the gray one represents the normal tissues. E. Pathological Stage Plot of SREK1, IRAK2, DDX3Y and C1QTNF2 genes in lung cancer.

3.5 Examining the transcriptional status of COPD hub genes in lung cancer expression

In GPEIA2 analysis, boxplots of 12 hub genes were retrieved. Adenocarcinoma (LUAD) and Squamous Cell Carcinoma (LUSC) were selected with a P-value cutoff of 0.01 using The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GTEx) database. Out of 12 hub genes, only 4 (IRAK2, SREK1, C1QTNF2 and DDX3Y) have shown significant gene expression in lung cancer compared to the normal tissues (Fig 4A–4E). The boxplots of IRAK2, SREK1 and C1QTNF2 show their significant expression in LUSCs. The DDX3Y gene was significantly expressed in both LAUD and LUSC cells. The GENT2 platform is used to explore gene expression patterns across normal and lung tumor tissues. Fig 5 shows the prognostic value (patient survival in days) of the expression status of 6 COPD-hub genes. Out of the 12-COPD hub genes, 5 genes (SREK1, TMEM67, CDC42BPA, DPF3, and ASB4) showed an improvement in lung cancer survival duration up to 1500 days (P values for all the associations is <0.02) (Fig 5). The correlation of survival status of patients with different lung cancer subtypes to all five gene expression levels reveals that adenocarcinomas have a longer survival rate (0.2–0.4) than those with squamous and large cell lung cancers.
Fig 5

The prognostic values (patient survival in days) of the expression status of 6 COPD-hub genes.

A) SREK1 (P<0.001). B) TMEM67 (P = 0.002). C) CDC42BPA (P = 0.003). D) DPF3 (P = 0.005). E) ASB4 (P = 0.024) F) IRAK2 (P = 0106). The correlation of survival status of patients with different lung cancer subtypes (Squamous, Adeno, Large) to all six genes expression level.

The prognostic values (patient survival in days) of the expression status of 6 COPD-hub genes.

A) SREK1 (P<0.001). B) TMEM67 (P = 0.002). C) CDC42BPA (P = 0.003). D) DPF3 (P = 0.005). E) ASB4 (P = 0.024) F) IRAK2 (P = 0106). The correlation of survival status of patients with different lung cancer subtypes (Squamous, Adeno, Large) to all six genes expression level. The Human Protein Atlas (HPA) derived protein expression status in normal tissue and lung cancer tissues for COPD- hub genes is illustrated in Fig 6 and Table 4. Abundance of these proteins could be divided into four categories like high, medium, low and not detected by the scoring system based on the intensity of staining, whether strong, moderate, weak, or negative. The macrophage and pneumonocytes staining for ASB4 in normal lung tissues was not detected and was medium in lung cancer. Medium staining detection of CDC42BPA and IRAK2 were found in both normal and cancer lung tissue in addition to SREK1 and C1QTNF2 which was observed in medium in normal tissue but higher in lung cancer tissue. Staining of MECOM was very high in both normal and cancer lung tissue. While low to no protein detection of DET1 and KHK were observed in both normal and lung cancer tissue. Furthermore, DDX3Y and DPF3 staining in normal lung tissues were negative but DDX3Y was higher in lung cancer tissue but not DPF3. Finally, data were not available for TMEM67 and CCDC74B genes in HPA (S3 Fig).
Fig 6

The expression of the two hub genes from the Human Protein Atlas (HPA) in normal and cancer lung tissue.

Table 4

The expression levels of the 10 hub genes in normal lung and cancer tissues: Human Protein Atlas (HPA).

GenesNormal Tissue StainingCancer Lung Tissue (Tumor cell)
CellStainingQuantityStainingQuantityType of Cancer
ASB4 MacrophageNot detectedNoneMedium75%-25%LUAD
PneumonocyteNot detectedNone
CDC42BPA MacrophageMedium>75%Medium75%-25%LUSC
PneumonocyteMedium>75%
DET1 MacrophageLow<25%Low<25%LUSC
PneumonocyteNot DetectedNone
IRAK2 MacrophageMedium>75%Medium>75%LUSC
PneumonocyteLow75%-25%
MECOM MacrophageHigh>75%High>75%LUSC
PneumonocyteHigh>75%
DDX3Y MacrophageNot detectedNoneHigh75%-25%LUSC
PneumonocyteNot detectedNone
SREK1 MacrophageMedium75%-25%High>75%LUSC
PneumonocyteMedium75%-25%
DPF3 MacrophageNot Detected<25%Not DetectedNoneLUAD
PneumonocyteNot detected<25%
C1QTNF2 MacrophageNot detectedNoneHigh>75%LAUD
PneumonocyteMedium75%-25%
KHK MacrophageMedium>75%Low>75%LAUD
MacrophageNot detectedNone

4. Discussion

Massive high throughput genome wide- sequencing and expression studies have been effective in querying the molecular basis of many inherited diseases in humans. However, deciphering the molecular basis of chronic diseases is challenging, owing to the complex interplay of genes and environmental factors. The etiopathogenesis of complex diseases like COPD can be better explained by studying the global gene expression changes. The recent biomarker discoveries in intracranial aneurysm [25], Parkinson disease [26], Diabetes mellitus [27] and cancers [28] once again proves the robustness of bioinformatics methods in analyzing the huge gene expression data. Few studies have attempted to analyze the gene expression changes either in blood samples [29, 30] or tissue samples of COPD patients [31, 32]. But, none of them attempted to identify blood based genetic biomarkers. Therefore, this study tried to explore the shared gene expression changes between both blood and lung tissues to identify potential biomarkers to assist in diagnostics or prognostic aspects of COPD patients. Chronic diseases are caused by the action of dysregulation of multiple genes at different stages of the disease pathology. Hence, we constructed a protein interactome based on the differentially expressed genes in the COPD patients. The protein-protein interaction networks establish the physical contacts between two or more proteins as a result of biochemical events underlying the disease etiopathogenesis. The characteristic features of PPI is based on various connectivity between nodes and edges, where each node indicates a gene connected to its functional partners [23]. To reduce the complexity of PPI network, highest connected nodes are decomposed into a clusters or modules. The gene with the highest number of edges among group of genes within the same cluster is known as hub gene, which are basically chosen based on its degree of centrality (DC) values in the network [33]. DC also refers to node connectivity, i.e. the number of connections to the node and its interaction [34]. In context of these network principles in identifying the COPD-hub genes, we assessed the essential properties of the genes that are involved in the disease. Since clusters are characterized by extensive connectivity between a set of genes, GO annotations provides functional interpretation of them under vaiety of biological categories [35]. In the present study we identified 12 COPD hub gene clusters (SREK1, TMEM67, IRAK2, MECOM, ASB4, C1QTNF2, CDC42BPA, DPF3, DET1, CCDC74B, KHK, DDX3Y) from PPI network, which revealed their enrichment in cell regulation, immune process, transcription factors regulations and protein degradation pathways. The upregulated gene (DDX3Y) in both blood and lung tissue were enriched in functions associated with regulation of gene expression, cell cycle, cellular senescence and FoxO signaling pathway which is involved in many cellular physiological events such as apoptosis and cell-cycle control. Moreover, there were two downregulated genes (MECOM and KHK). MECOM were associated with regulation of transcription, pathways in cancers and FoxO signaling pathway. While KHK were involved in Starch and sucrose metabolic processes. However, of those 12 gene clusters, CCDC74B, MECOM, IRAK2 and DET1 clusters had shown the lowest FDR values, which reflects their highest functional enrichment in different molecular processes. The CCDC74B gene cluster was mainly enriched in proteasome pathway, which degrades unneeded proteins within the cell. The activity of proteasome can be impaired by cigarette smoke resulting in reduction of antigen presentation and lead to prolonged lung infections and COPD patients [36]. In lung tissues of COPD patients, accumulation of ubiquitylated proteins and further degradation by proteasome machinery is reported [37]. Protein catabolic processes pathway enriched in DET1 gene cluster also plays an important role in pathogenesis of COPD. The chronic inflammation in COPD contributes to the imbalance of protein degradation resulting in the loss of skeletal muscle protein, one of the characteristic features present in COPD [38]. On the other hand, MECOM gene cluster highlights the regulation of transcriptional pathway which controls the changes in gene transcription of many inflammatory substances that play a key role in the pathogenesis of COPD [39, 40]. The IRAK2 gene cluster showed its involvement in regulation of inflammatory process such as interleukin (IL)-1 pathway activation and Toll like receptor that is directly linked to the pathogenesis of COPD, is characterized by abnormal release of inflammatory cytokines, remodeling of the airways and dysregulated immune cell activity [41, 42]. Genome wide association studies reveals the association of genetic variants with risk of developing common diseases by screening genetic samples from thousands of samples. In this study, 12 hub genes were searched in GWAS databases for their association with COPD, lung function traits as well as lung cancers. The GWAS data confirmed that the variants in the 6 COPD-hub genes (IRAK2, MECOM, CDC42BPA, ASB4, DPF3 and TMEM67) shows genome wide significant association to traits that could potentially modify the risk of COPD pathology development. At least 5 variants in IRAK2 were significantly associated with variety of eosinophil count traits [43]. Eosinophilia (high eosinophil counts) causes inflammation of the lung tissue and exacerbates the lung function in the COPD patients. However, the role of eosinophils in COPD is unclear, as not all COPD patients develop eosinophilic airway inflammation [44, 45]. Interestingly, IL-1 signaling has been shown to be associated with eosinophilic inflammatory profiles in patient with COPD [43]. Moreover, in COPD patients with eosinophilic inflammation have the tendency to respond to steroid therapy. Therefore, eosinophil count is an important point of view to direct biological therapies for COPD [46]. Many variants in MECOM were strongly associated with FEV1and other traits that are directly related to lung function and COPD pathogenesis [47]. Other COPD- hub genes (CDC42BPA, ASB4, DPF3 and TMEM67) are were also associated with lung function related traits and lung cancer [48-50]. COPD is one of the significant risk factors for oncogenesis of the lung tissues, which is seen in about 1% of COPD patients every year [51]. Both COPD and lung cancer share many common pathways such as immune dysfunction and regulation of transcription factors [52]. Interestingly, pathways enriched by MECOM and IRAK2 were involved in lung cancer development. For instance, MECOM is an important transcription factor involved in oncogenesis [53, 54]. Aberrant expression of MECOM is one of the characteristic features of many malignancies including leukemia [55] and solid tumors such as breast cancer and hepatocellular carcinoma [53, 56] as well as lung cancer [57]. Moreover, frequent alterations in MECOM have been associated with primary and metastatic lung adenocarcinomas [58]. On the other hand, activation of the TLR pathway has a significant impact on cancer progression regulation including lung cancer [59, 60]. One genetic variant in IRAK2 (rs779901 C > T) in the TLR signaling pathway is suggested to be a prognostic biomarker for non-small cell lung cancer (NSCLC) [61]. Global gene expression profile analysis provides a valuable insight into the normal biological process and to disease pathogenesis [62]. To support the contribution of IRAK2 and MECOM hub genes, significant dysregulation of expression in lung cancer types were observed in HPA, GPEIA2 and GENT2 databases as well. Furthermore, differentially expression of IRAK2 and MECOM genes has been reported in many studies in cancers or COPD [63-65].

5. Conclusions

In conclusion, we identified, 12 blood based molecular biomarkers (SREK1, TMEM67, IRAK2, MECOM, ASB4, C1QTNF2, CDC42BPA, DPF3, DET1, CCDC74B, KHK, DDX3Y) for COPD diagnosis, by integrative gene expression and gene network approaches. Out of these 12 hub genes, two (MECOM and IRAK2) were over expressed in lung cancers tissues, which reflects a shared molecular lineage between COPD and lung cancers. Interestingly, we have also identified that the expression status of other COPD hub genes like SREK1, TMEM67, CDC42BPA, DPF3, and ASB4 improves the survival duration of lung cancer patients, hence they may act as potential molecular drug targets and/or biomarkers for both COPD and/or lung cancer. However, biological and clinical relevance of each COPD hub gene can be better understood, when our findings are explored through future in vitro and in vivo validation assays.

Overview of PPI network constructed from 63 common genes using cytoscape STRING database.

The PPI network at p-value >0.05 consist of 995 nodes interact with 18924 edges. (PDF) Click here for additional data file.

The 12 hub gene protein interaction network.

(A) Cluster-1 (SREK1). (B) Cluster-2 (TMEM67). (C) Cluster-3 (IRAK2). (D) Cluster-4 (MECOM). (E) Cluster-5 (ASB4). (F) Cluster-6 (CDC42BPA). (G) Cluster-7 (DPF3). (H) Cluster-8 (DET1). (I) Cluster-9 (KHK). (J) Cluster-10 (DDX3Y), the hub gene selected based on high centrality in the protein network of DEGs. (PDF) Click here for additional data file.

Histopathological images of DEGS.

Protein Pathology Atlas of 12 hug genes in normal lung and lung cancer tissues. (PDF) Click here for additional data file.

Hub genes GWAS association.

Genetic association of hub genes with the lung related traits and lung cancer from Phenoscanner database. (DOCX) Click here for additional data file. 1 May 2022
PONE-D-22-07850
Integrative Weighted Molecular Network Construction from Transcriptomics and Genome Wide Association Data Identified Shared Genetic Biomarkers for COPD and Lung Cancer
PLOS ONE Dear Dr. Banaganapalli, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by Jun 15 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript:
A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Narasimha Reddy Parine, Ph.D Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. Thank you for stating the following in the Acknowledgments Section of your manuscript: "This project was funded by the Deanship of Scientific Research (DSR) at King Abdulaziz University, under Grant no. G-593-140-1441. The authors, therefore, acknowledge the DSR for technical and financial support." We note that you have provided funding information. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: "Funding Author: BB Grant Number: G-593-140-1441 Full Name of Funder: Deanship of Scientific Research (DSR), King Abdulaziz University URL: https://dsr.kau.edu.sa/ Role: The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript." Please include your amended statements within your cover letter; we will change the online submission form on your behalf. 3. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The original article by Banaganapalli and colleagues sought to identify significant changes in COPD and lung cancer. The authors collected transcriptomics and protein-protein interaction networks, and then discovered some differentially expressed genes, that are analyzed by some bioinformatics approaches and are available in silico datasets. They further predicted these clinicopathological events and patient outcomes through the features they selected. Other comments are listed as follows • There are different stages of COPD to determine the severity of disease status, is there any difference among their gene expression? • Some figures have poor pixels and the font is too small. • They did not describe the statistical algorithm and p-value of each website and panel. • The number of DEGs in manuscript and figure 1B are different. • Some of the web links they provide cannot be entered • There is no title of Table 4. It is also better to use scoring system to quantify and present the IHC results of these genes. Reviewer #2: In the present study, the authors aim to identify potential biomarkers for COPD diagnosis and prognosis by analyzing the dysregulated gene expression patterns from blood and lung tissues. They identified many common DEGs from blood and lung tissue datasets and focused on 12 hub genes with DC score of >17. They further performed PPI analysis, GO analysis and GWAS to reveal the significance of 12 hub genes in COPD and lung cancer. They provided massive and valuable information about COPD and lung cancer. Minor Points: 1. There are many references without formal styles such as Ref #10, #12, #13, #20, #28 #31, #39 and The Ref#16 is wrongly embedded. The authors should correct them. 2. The refractive asthma should be corrected as refractory asthma. 3. There are many inconsistencies among figures, legends and contexts such as Figure 1 & 3 and table 4. 12 significant genes should be with more than 17 of DC in the legends of Table 1 and Figure 2. 5. Go-annotations should be corrected as GO-annotations in Figure 3. 6. The items in table 1 should be re-modify to a professional one and the style in the column of Term Description should be consistent. 7. The image and staining intensity of MECOM from HPA database compared to Figure S3 were wrongly embedded and the authors should correct them. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: Yes: Magesh Ramasamy [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. 16 Jun 2022 Reviewer #1: The original article by Banaganapalli and colleagues sought to identify significant changes in COPD and lung cancer. The authors collected transcriptomics and protein-protein interaction networks, and then discovered some differentially expressed genes, that are analyzed by some bioinformatics approaches and are available in silico datasets. They further predicted these clinicopathological events and patient outcomes through the features they selected. Other comments are listed as follows • There are different stages of COPD to determine the severity of disease status, is there any difference among their gene expression? Answer: Our study aimed to identify the common biomarker between COPD and lung cancer. For this reason, we ignored the disease severity variability in our study. In this study, we collected all stages of COPD (GSE54837). • Some figures have poor pixels, and the font is too small. Answer: We increased the quality of figures • They did not describe the statistical algorithm and p-value of each website and panel. Answer: WE revised as per the reviewer suggestion • The number of DEGs in manuscript and figure 1B are different. Answer: Corrected • Some of the web links they provide cannot be entered Answer: Weblinks updated • There is no title of Table 4. It is also better to use scoring system to quantify and present the IHC results of these genes. Answer: Updated in the table 4 Reviewer #2: In the present study, the authors aim to identify potential biomarkers for COPD diagnosis and prognosis by analyzing the dysregulated gene expression patterns from blood and lung tissues. They identified many common DEGs from blood and lung tissue datasets and focused on 12 hub genes with DC score of >17. They further performed PPI analysis, GO analysis and GWAS to reveal the significance of 12 hub genes in COPD and lung cancer. They provided massive and valuable information about COPD and lung cancer. Minor Points: 1. There are many references without formal styles such as Ref #10, #12, #13, #20, #28 #31, #39 and The Ref#16 is wrongly embedded. The authors should correct them. Answer: All the references have been updated with reference manger software 2. The refractive asthma should be corrected as refractory asthma. Answer: Corrected 3. There are many inconsistencies among figures, legends and contexts such as Figure 1 & 3 and table Answer: Updated 4. 12 significant genes should be with more than 17 of DC in the legends of Table 1 and Figure 2. Answer: Table 1 shows the degree centrality of 12 significant genes, whereas Figure 2 mentioned >18 centrality score genes 5. Go-annotations should be corrected as GO-annotations in Figure 3. Answer: corrected 6. The items in table 1 should be modified to a professional one and the style in the column of Term Description should be consistent. Answer: Table 1 is centrality table, all the values are consistence according to the journal style 7. The image and staining intensity of MECOM from HPA database compared to Figure S3 were wrongly embedded and the authors should correct them. Answer: Figure 6 was revised and now its exactly mapping to Figure S3 Submitted filename: Response to Reviewers.docx Click here for additional data file. 1 Sep 2022 Integrative Weighted Molecular Network Construction from Transcriptomics and Genome Wide Association Data Identified Shared Genetic Biomarkers for COPD and Lung Cancer PONE-D-22-07850R1 Dear Dr. Banaganapalli, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Narasimha Reddy Parine, Ph.D Academic Editor PLOS ONE Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #3: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #3: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #3: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #3: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #3: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: (No Response) Reviewer #3: In this study, the authors analyzed the gene expression differences between the blood and lung tissues of COPD patients to identify diagnostic and prognostic biomarkers. They have performed multiple computational biology analysis methods (differential gene expression analysis, gene networking, hub gene identification, GO annotations, and druggability analysis) to discover the genetic biomarkers. They have also queried the candidate hub genes in GWAS and in cancer expression databases to understand their contribution to lung cancer. The manuscript is well written. The number of figures and tables is sufficient. I am convinced that the new information identified in COPD may lead to developing future treatment strategies. Therefore, I sincerely recommend this article for publication in your journal. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #3: No ********** 26 Sep 2022 PONE-D-22-07850R1 Integrative Weighted Molecular Network Construction from Transcriptomics and Genome Wide Association Data to Identify Shared Genetic Biomarkers for COPD and Lung Cancer Dear Dr. Banaganapalli: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Narasimha Reddy Parine Academic Editor PLOS ONE
  62 in total

Review 1.  Introduction to the Gene Expression Analysis.

Authors:  Ignacio San Segundo-Val; Catalina S Sanz-Lozano
Journal:  Methods Mol Biol       Date:  2016

2.  Omics and the Search for Blood Biomarkers in Chronic Obstructive Pulmonary Disease. Insights from COPDGene.

Authors:  Elizabeth A Regan; Craig P Hersh; Peter J Castaldi; Dawn L DeMeo; Edwin K Silverman; James D Crapo; Russell P Bowler
Journal:  Am J Respir Cell Mol Biol       Date:  2019-08       Impact factor: 6.914

3.  Catabolic/anabolic balance and muscle wasting in patients with COPD.

Authors:  Richard Debigaré; Karine Marquis; Claude H Côté; Roland R Tremblay; Annie Michaud; Pierre LeBlanc; François Maltais
Journal:  Chest       Date:  2003-07       Impact factor: 9.410

4.  The role of interleukin-1beta in murine cigarette smoke-induced emphysema and small airway remodeling.

Authors:  Andrew Churg; Steven Zhou; Xiaoshan Wang; Rona Wang; Joanne L Wright
Journal:  Am J Respir Cell Mol Biol       Date:  2008-10-17       Impact factor: 6.914

5.  Chronic Obstructive Pulmonary Disease (COPD).

Authors:  Suzanne C Lareau; Bonnie Fahy; Paula Meek; Angela Wang
Journal:  Am J Respir Crit Care Med       Date:  2019-01-01       Impact factor: 21.405

Review 6.  IL-1 Receptor-Associated Kinase Signaling and Its Role in Inflammation, Cancer Progression, and Therapy Resistance.

Authors:  Ajay Jain; Sabina Kaczanowska; Eduardo Davila
Journal:  Front Immunol       Date:  2014-11-17       Impact factor: 7.561

7.  Bioinformatic analysis of gene expression profiling of intracranial aneurysm.

Authors:  Lijuan Bo; Bo Wei; Zhanfeng Wang; Chaohui Li; Zheng Gao; Zhuang Miao
Journal:  Mol Med Rep       Date:  2017-12-29       Impact factor: 2.952

Review 8.  Inflammatory mechanisms in patients with chronic obstructive pulmonary disease.

Authors:  Peter J Barnes
Journal:  J Allergy Clin Immunol       Date:  2016-05-27       Impact factor: 10.793

9.  Bioinformatics-based identification of potential microRNA biomarkers in frequent and non-frequent exacerbators of COPD.

Authors:  Xiao Liu; Jingge Qu; Weixiao Xue; Liangai He; Jun Wang; Xuejiao Xi; Xiaoxia Liu; Yunhong Yin; Yiqing Qu
Journal:  Int J Chron Obstruct Pulmon Dis       Date:  2018-04-16

10.  EVI‑1 acts as an oncogene and positively regulates calreticulin in breast cancer.

Authors:  Lei Wu; Tianyi Wang; Dongning He; Xiaoxi Li; Youhong Jiang
Journal:  Mol Med Rep       Date:  2018-12-24       Impact factor: 2.952

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.