Literature DB >> 31419078

Identification of differentially expressed genes and signaling pathways in chronic obstructive pulmonary disease via bioinformatic analysis.

Xinwei Huang1,2, Yunwei Li2,3, Xiaoran Guo2, Zongxin Zhu2, Xiangyang Kong2, Fubing Yu4, Qiang Wang5.   

Abstract

Chronic obstructive pulmonary disease (COPD) is a multifactorial and heterogeneous disease that creates public health challenges worldwide. The underlying molecular mechanisms of COPD are not entirely clear. In this study, we aimed to identify the critical genes and potential molecular mechanisms of COPD by bioinformatic analysis. The gene expression profiles of lung tissues of COPD cases and healthy control subjects were obtained from the Gene Expression Omnibus. Differentially expressed genes were analyzed by integration with annotations from Gene Ontology and Kyoto Encyclopedia of Genes and Genomes, followed by construction of a protein-protein interaction network and weighted gene coexpression analysis. We identified 139 differentially expressed genes associated with the progression of COPD, among which 14 Hub genes were identified and found to be enriched in certain categories, including immune and inflammatory response, response to lipopolysaccharide and receptor for advanced glycation end products binding; in addition, these Hub genes are involved in multiple signaling pathways, particularly hematopoietic cell lineage and cytokine-cytokine receptor interaction. The 14 Hub genes were positively or negatively associated with COPD by wgcna analysis. The genes CX3CR1, PTGS2, FPR1, FPR2, S100A12, EGR1, CD163, S100A8 and S100A9 were identified to mediate inflammation and injury of the lung, and play critical roles in the pathogenesis of COPD. These findings improve our understanding of the underlying molecular mechanisms of COPD.
© 2019 The Authors. Published by FEBS Press and John Wiley & Sons Ltd.

Entities:  

Keywords:  GEO data; bioinformatic analysis; chronic obstructive pulmonary disease; differentially expressed gene; epidemiology

Mesh:

Year:  2019        PMID: 31419078      PMCID: PMC6823288          DOI: 10.1002/2211-5463.12719

Source DB:  PubMed          Journal:  FEBS Open Bio        ISSN: 2211-5463            Impact factor:   2.693


arginase 1 biological process cellular component chronic obstructive pulmonary disease cigarette smoke C‐X3‐C motif chemokine receptor 1 disability‐adjusted life year differentially expressed gene early growth response 1 fold change fibrinogen gamma chain formyl peptide receptor 1 formyl peptide receptor 2 Gene Ontology gene significance Kyoto Encyclopedia of Genes and Genomes molecular function module membership orosomucoid 1 proplatelet basic protein protein‐protein interaction prostaglandin‐endoperoxide synthase 2 S100 calcium binding protein A12 S100 calcium binding protein A8 S100 calcium binding protein A9 vascular cell adhesion molecule 1 weighted gene coexpression network analysis years of life lost Chronic obstructive pulmonary disease (COPD), characterized by long‐term poorly reversible airway limitation and persistent respiratory symptoms, is a common and preventable disease 1. COPD is projected to become the third leading cause of all death by 2030 in the world 2. Globally, COPD affected 299.4 million people in 2017, with a 71.2% increase in the prevalence rate compared with 2015, ranking it as the fifth leading cause of disability‐adjusted life years (DALYs) and the seventh leading noncommunicable disease cause of years of life lost (YLLs) 3, 4, 5, 6. As shown in Fig. 1A, we observed a 12.3% increase in global all‐age deaths caused by COPD from 2.85 million in 1990 to 3.20 million in 2017 6, 7, 8, 9 and a predicted increase of 60% by 2020 compared with 1990. Figure 1B indicated that the all‐age standardized death rate of COPD in males, females, and both sexes separately decreased from 1990 to 2015 6, 8, which could be because of population growth and aging. Although the COPD death rate varies with different countries, more than 90% of COPD deaths occurred in low‐ and middle‐income countries 10. The global all‐age YLLs with COPD showed a small increase of 7.5% and 3.6% for both sexes and males, respectively, as well as a 21% decrease for females from 1990 to 2015 (Fig. 1C) 6, 8. In addition, as shown in Fig. 1D 3, 4, 5, 7, 11, 12, 13, 14, global all‐age DALYs caused by COPD had a small increase of 4.2% during 1990–2015 and was projected to decline to 57.6 million by 2020. The age‐standardized DALY rate caused by COPD in females was about twice as high as that of males, and that in low‐ and middle‐income countries was 6.7 times higher than in some high‐income countries 3. We observed that the global all‐age years lived with disability caused by COPD has grown 52.2% from 1990 to 2017. Taken together, COPD has presented a global public health challenge with high prevalence, mortality and disability rates, whereas the diagnosis of COPD is usually made based on spirometry values and clinical symptoms with a frequent underdiagnosis 15. Thus, it is important to explore the underlying molecular mechanisms and identify more effective early diagnostic methods and reliable biomarkers for this disease.
Figure 1

The global death and burden caused by COPD. (A) Global age‐related deaths (millions) caused by COPD in men and women, respectively, from 1990 to 2020 6, 7, 8, 9. (B) Global age‐related death rates (per 100 000) caused by COPD for both sexes, males and females, respectively, from 1990 to 2015 6, 8. (C) Global age‐related YLLs (millions) caused by COPD for both sexes, males and female, respectively, from 1990 to 2015 6, 8. (D) Global age‐related DALYs and years lived with disability (YLD) (millions) by COPD for both sexes from 1990 to 2020 3, 4, 5, 7, 11, 12, 13, 14.

The global death and burden caused by COPD. (A) Global age‐related deaths (millions) caused by COPD in men and women, respectively, from 1990 to 2020 6, 7, 8, 9. (B) Global age‐related death rates (per 100 000) caused by COPD for both sexes, males and females, respectively, from 1990 to 2015 6, 8. (C) Global age‐related YLLs (millions) caused by COPD for both sexes, males and female, respectively, from 1990 to 2015 6, 8. (D) Global age‐related DALYs and years lived with disability (YLD) (millions) by COPD for both sexes from 1990 to 2020 3, 4, 5, 7, 11, 12, 13, 14. As a large‐scale and efficient technique for acquiring genomic data, microarray‐based gene expression profiles have been widely used to seek new insights for biomarkers in many human diseases 16. Currently, many studies have been conducted on COPD gene expression profiles, and these studies have screened thousands of differentially expressed genes (DEGs) implicated in the development and progression of this disease 17, 18. However, the results for the identification of DEGs are discrepant among these studies due to sample heterogeneity and differences in technological detection platforms. In this study, we performed an integrated analysis on some of the gene expression profiling data based on lung tissues of COPD cases and control subjects using an unbiased approach aiming to identify the potential molecular mechanisms and biomarkers for COPD. We selected two Gene Expression Omnibus microarray datasets on COPD ( GSE27597 and GSE106986). DEGs were identified by r software (Auckland, New Zealand) and subsequently analyzed using bioinformatic methods including Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichments and construction of protein‐protein interaction (PPI) and weighted gene coexpression analysis (wgcna) networks. We screened the DEGs for potential association with the development and progression of COPD. Our work may further the understanding of the potential molecular mechanisms of COPD.

Materials and methods

Gene data

Two gene expression datasets, GSE27597 and GSE106986, were downloaded from the National Center for Biotechnology Information Gene Expression Omnibus database (https://www.ncbi.nlm.nih.gov/geo/). GSE27597 comprises the expression profile from 64 lung tissue samples from COPD cases and 16 samples from healthy donors 19. GSE106986 includes molecular profiling of 19 lung tissue samples, containing 14 samples from COPD cases and 5 from smokers 20. The experiments of GSE27597 and GSE106986 were conducted on the Affymetrix Human Exon 1.0 ST GeneChip (Affymetrix, Inc., Santa Clara, CA, USA; GPL5175 platform) and Agilent‐026652 Whole Human Genome Microarray 4x44K v2 (Agilent Technologies, Inc., Palo Alto, CA, USA; GPL13497 platform), respectively.#AuthorQuery Rep

Data preprocessing and screening for DEGs

The probe set IDs of two datasets were converted into gene symbols using the r software and annotation packages. The two datasets were merged into one array dataset and then batch normalized using r packages (sva and limma 3.40.6). The DEGs between COPD cases and control subjects were identified using the limma package in r 3.60. A P‐value <0.05 after being adjusted by false discovery rate and |log2FC > 1, where FC represents fold change, were applied together as the cutoff for DEGs screening.

GO and KEGG pathway enrichment analysis of DEGs

GO enrichment of the DEGs on the biological process (BP), molecular function (MF) and cellular component (CC) categories was performed using a david online tool (https://david.ncifcrf.gov/) 21, 22. KEGG pathway enrichment analysis was performed by using the KOBAS 3.0 online analysis database (http://kobas.cbi.pku.edu.cn/) 23.

Construction of the PPI network

STRING database (https://string-db.org/) is frequently used for identifying the protein interactions 16, 24. STRING database contains huge amounts of experimental and text mining data 25. cytoscape is an open source bioinformatic software platform used for integrating gene expression profiles and visualizing molecular interaction networks. cytoscape plugin cytoHubba provides multiple topological analysis methods on Hub genes, regulatory networks and drug targets for experimental biologists 26. In this study, we used STRING database to identify the interactions between the identified DEGs. A confidence score >0.4 was used as the cutoff criterion. In addition, Hub DEGs were identified with cytoscape version 3.6.1 (1999 Free Software Foundation, Inc., Boston, MA, USA) with cytoHubba plugin according to the rank of connection degree (number) for each gene, which is represented by the different degrees of color (from red to yellow): the role of the gene is greater in the PPI network with the darker color of the gene 26.

wgcna on COPD

wgcna may be used for screening modules (clusters) of highly correlated genes and for calculating module membership (MM) measures, in which the module eigengene or an intramodular Hub gene is used to summarize such modules, and eigengene network methodology is used for relating modules to one another and to external sample traits 27. In this study, the wgcna package was used to identify coexpression modules for the merged and standardized array datasets (GSE27597 and GSE106986). In brief, first, a weighted adjacency matrix containing pairwise connection strengths was constructed based on the selected soft threshold power (β = 11) on the matrix of pairwise correlation coefficients. Then, the connectivity measure per gene was calculated by summing the connection strengths with other genes; modules were defined as branches of a hierarchical clustering tree by using a dissimilarity measure, and each module was assigned a color. Subsequently, module preservation r function was used to assess the module structure preservation. Finally, the module eigengene was used for summarizing the gene expression profiles of each module, and each module eigengene was regressed on case trait (COPD) and smoking status by using the linear model in the limma r package.

Results

Identification of DEGs in COPD

After batch normalization on the integrated dataset from GSE27597 and GSE106986 by the sva and limma packages, 139 DEGs were identified using the limma package (corrected P < 0.05; |log2FC| > 1) (Tables 1 and 2). The cluster heatmap of the top 139 DEGs is shown in Fig. 2. Among them, 62 genes were up‐regulated and 77 genes were down‐regulated, which is shown in Fig. 3.
Table 1

Screening up‐regulated and down‐regulated DEGs in COPD by integrated microarray

DEGsGene symbol
Up‐regulated (62 genes) PIEZO2, INHBA, VCAM1, RSPO2, CD1C, HSD17B6, AQP3, APLNR, GPD1, EGR2, KCNA, SLC18A2, FRZB, PTGS2, OLR1, EDNRA, SHISA2, ELF5, LUM, CYSLTR1, BMP5, HPGDS, MS4A2, DCC, FOSB, HHIP, MSMB, CD200, AMPD1, ICOS, RTKN2, CD83, FABP4, ISLR, GPR174, SLAMF7, WIF1, CTSW, CHIT1, CPA3, BMP1, EGR3, CX3CR1, EGR1, TREM2, ATF3, ICAM4, C8B, NR4A3, CCL8, HAS2, C4BPA, ITGB6, HBEGF, AGTR2, SERPIND1, CEACAM5, SFRP2, SELE, HLA‐DRB1, HLA‐DRB5, FGG
Down‐regulated (77 genes) IL1R2, CEACAM4, ST6GALNAC3, ITGA10, MERTK, FKBP5, CLEC4E, LAMB1, MGAM, RPS6KA2, FLT1, IRAK3, SULT1B1, AOX1, SH3PXD2B, RNASE2, IL18R1, CD163, IL1RL1, GRASP, MT1A, S100A12, MMP8, GLT1D1, TMTC1, S100A8, IL4R, IL18RAP, FLT3, ANPEP, MT1M, SIGLEC10, SPARCL1, SMAP2, TIMP4, ANGPTL1, HIF3A, PKHD1L1, LILRB3, FPR1, SLED1, LDLRAD3, FAM150B, FPR2, ZBTB16, GCA, ARG1, CXCR2P1, S100A9, TMEM204, TINAGL1, ABCC8, VCAN, APOLD1, DDIT4, ARRDC2, SERPINA3, PIK3R3, ADM, PNMT, BTNL9, CRISPLD2, SLCO2A1, VNN2, TUBB1, PSAT1, PPBP, DEFA4, AQP9, TTN, PDK4, MT1L, MT1X, ORM1, CHRM2, PTX3, EIF1AY
Table 2

Screening 139 DEGs in COPD by integrated microarray. AveExpr, average expression; FDR, false discovery rate

Gene symbollogFCAveExpr t P‐valueFDR
IL1R2 −3.7241706354.910345092−14.570701965.24E−258.38E−21
CEACAM4 −1.5461900383.289938835−14.115789343.65E−242.92E−20
ST6GALNAC3 −1.5093482974.115922297−13.729343651.93E−231.03E−19
ITGA10 −1.3044524084.156071544−13.34248541.05E−224.18E−19
MERTK −1.3711006195.743922084−13.226696551.74E−224.99E−19
FKBP5 −2.5531520665.964790869−13.210106461.87E−224.99E−19
CLEC4E −2.1025923833.712865442−12.27244441.22E−202.44E−17
LAMB1 −1.2369003416.51036255−12.112821862.51E−204.31E−17
MGAM −1.9182756043.743487329−12.09718522.69E−204.31E−17
RPS6KA2 −1.0511714256.235125074−11.992872814.32E−206.28E−17
FLT1 −1.437358595.302047032−11.65882631.98E−192.43E−16
IRAK3 −1.4403629495.407265256−11.535045353.48E−193.71E−16
PIEZO2 1.1651203225.15480168911.48460694.38E−194.38E−16
SULT1B1 −1.9661285663.712292447−11.441422825.34E−195.02E−16
AOX1 −1.4980035674.380016656−10.802393921.02E−178.14E−15
SH3PXD2B −1.3600723744.796109196−10.453482545.15E−173.92E−14
RNASE2 −2.2143524883.502846585−10.417079216.11E−174.44E−14
IL18R1 −1.5527090134.479557525−10.215661151.56E−161.04E−13
CD163 −2.3533311286.483508901−10.189050441.77E−161.13E−13
IL1RL1 −2.0082546975.747865421−10.067015343.13E−161.88E−13
INHBA 1.4375657574.69842357210.064510423.17E−161.88E−13
VCAM1 1.583234484.549494879.9757656914.80E−162.74E−13
GRASP −1.0191252215.351837423−9.9539044345.32E−162.93E−13
RSPO2 1.225081854.2812796229.9435242415.59E−162.98E−13
CD1C 1.3209089184.0481385759.8478362598.75E−164.51E−13
HSD17B6 1.6113037625.5435894189.8112257631.04E−155.03E−13
MT1A −2.0038091365.341700669−9.8021064691.08E−155.10E−13
AQP3 1.4654441066.2556729889.7092152861.68E−157.66E−13
S100A12 −2.3819117893.793860618−9.6580620642.13E−159.26E−13
APLNR 1.4627617794.0607944859.5140442144.19E−151.76E−12
GPD1 1.1624081884.4253919919.4187450736.56E−152.65E−12
EGR2 1.77562425.3124931919.4154981456.66E−152.65E−12
MMP8 −1.9633763421.850422771−9.4109969456.80E−152.65E−12
KCNA3 1.1277205795.1815877949.3768728017.99E−152.98E−12
SLC18A2 1.1550766784.8612523059.3754655248.04E−152.98E−12
GLT1D1 −1.3542531874.134707596−9.3334140439.80E−153.33E−12
TMTC1 −1.2219221744.907718006−9.22680971.62E−145.28E−12
S100A8 −1.9844364866.732435285−9.2266775461.62E−145.28E−12
IL4R −1.0167376276.13438495−9.2200880631.67E−145.34E−12
IL18RAP −1.207063024.403623117−9.1819024292.00E−146.26E−12
FRZB 1.1089470754.5706213429.0542802153.64E−141.08E−11
PTGS2 1.9095995435.3857322529.0376232323.94E−141.14E−11
FLT3 −1.0687283292.955444457−9.0118754954.44E−141.24E−11
OLR1 1.5160247575.6301567569.0062805534.56E−141.24E−11
ANPEP −1.4517543114.687966342−9.0018614474.66E−141.24E−11
MT1M −2.5857088715.454959847−8.9909024544.90E−141.29E−11
SIGLEC10 −1.4642877754.29164058−8.9607795255.65E−141.46E−11
EDNRA 1.0162338525.6910955878.9392506836.25E−141.56E−11
SHISA2 1.006603725.2880527548.8863933398.02E−141.94E−11
SPARCL1 −1.0968141787.567105051−8.7608263991.45E−133.30E−11
SMAP2 −1.0307333426.515681918−8.5937153993.17E−136.73E−11
TIMP4 −1.6560763694.229601784−8.4565270936.04E−131.19E−10
ELF5 1.5216173194.6732195578.4487826956.26E−131.21E−10
ANGPTL1 −1.1110361763.340161894−8.4133212757.39E−131.37E−10
HIF3A −1.0656141584.526630285−8.3867053368.38E−131.52E−10
PKHD1L1 −1.4020919533.533210448−8.2813174361.37E−122.29E−10
LUM 1.0081257446.3888244258.2618290561.50E−122.45E−10
CYSLTR1 1.0176219954.3347133398.2310602861.74E−122.78E−10
LILRB3 −1.1820357233.930324211−8.2011106242.00E−123.11E−10
FPR1 −1.4973583395.123265733−8.1651840272.36E−123.63E−10
BMP5 1.0825724.8568669638.0390565044.26E−126.14E−10
HPGDS 1.1789767584.96614827.9710571795.85E−128.07E−10
SLED1 −1.4186430684.176703046−7.935294386.92E−129.42E−10
MS4A2 1.1275457044.5010696417.9342554266.95E−129.42E−10
DCC 1.1070549244.0663969177.8741128239.20E−121.21E−9
LDLRAD3 −1.1321835394.777010972−7.8276705941.14E−111.42E−9
FAM150B −1.3267639293.661484534−7.8014672951.29E−111.57E−9
FOSB 2.167548116.6059094347.7226773131.86E−112.20E−9
FPR2 −1.5857796164.015151074−7.7025601882.04E−112.40E−9
ZBTB16 −1.7040305745.808698383−7.6607996712.48E−112.87E−9
GCA −1.0369624744.831056628−7.6461200162.65E−113.03E−9
HHIP 1.3616814556.3383099417.6245014982.93E−113.28E−9
CD200 1.0454481114.1056585157.5798485643.61E−113.95E−9
ARG1 −1.3633943522.024018729−7.542544254.29E−114.60E−9
AMPD1 1.0437219793.1039404637.4437351136.77E−116.85E−9
ICOS 1.2225413784.2776435637.418744737.59E−117.59E−9
CXCR2P1 −1.156166154.439580145−7.3997246078.29E−118.23E−9
S100A9 −1.5456353395.747693368−7.3789202449.12E−119.01E−9
RTKN2 1.6969291996.1667052327.3443414351.07E−101.03E−8
TMEM204 −1.0485297866.26406167−7.1659749612.43E−102.11E−8
CD83 1.1346584565.714356647.1634957752.45E−102.12E−8
FABP4 1.585455634.7782549357.123199942.95E−102.51E−8
TINAGL1 −1.0263265355.532226715−7.0938475393.38E−102.84E−8
ABCC8 −1.2784732613.740007718−7.0697815063.77E−103.09E−8
VCAN −1.1621944175.676393323−7.0505636184.11E−103.29E−8
ISLR 1.0179445125.0187536677.0104201814.94E−103.80E−8
APOLD1 −1.3851705885.098238511−6.9760219045.78E−104.35E−8
DDIT4 −1.347721185.257280573−6.8085723581.23E−98.33E−8
ARRDC2 −1.0612690855.499220509−6.7624882271.52E−91.01E−7
GPR174 1.3913141284.4862804376.7477141061.63E−91.07E−7
SERPINA3 −1.6127099185.391772667−6.7455857471.64E−91.07E−7
PIK3R3 −1.1903767575.660292101−6.7284496311.77E−91.14E−7
SLAMF7 1.0836045524.3928009116.7272198311.78E−91.14E−7
WIF1 1.2371560676.2588728426.7087476671.94E−91.24E−7
ADM −1.0756624245.196278444−6.7012363972.00E−91.28E−7
PNMT −1.0516315884.068574241−6.5538489293.88E−92.30E−7
CTSW 1.079573024.7474800586.5453689494.04E−92.38E−7
BTNL9 −1.1328363495.578696855−6.4942969375.07E−92.82E−7
CHIT1 1.7742098885.5945733776.4900676245.17E−92.87E−7
CRISPLD2 −1.0784487785.7156417−6.4591481825.93E−93.26E−7
CPA3 1.0027101475.2787694066.3989044337.75E−94.12E−7
BMP1 1.0498145225.4507581416.34264569.95E−95.12E−7
SLCO2A1 −1.0517368586.743524273−6.3129936031.13E−85.69E−7
VNN2 −1.0155368914.57616008−6.2746905291.34E−86.55E−7
TUBB1 −1.1105368543.804924924−6.2641999991.41E−86.82E−7
PSAT1 −1.1900660313.005323949−6.2587371841.44E−86.94E−7
EGR3 1.1869359024.0973354916.2485734881.51E−87.22E−7
CX3CR1 1.2848289475.2121055926.1449668982.38E−81.08E−6
EGR1 1.1949055417.5383280186.085640013.08E−81.35E−6
TREM2 1.1287030863.9870818436.031631783.90E−81.67E−6
ATF3 1.0341624754.8842736985.6964108771.65E−75.92E−6
PPBP −1.4245697643.860588827−5.6469555822.04E−77.16E−6
ICAM4 1.3909114255.0196410815.6250309522.24E−77.74E−6
DEFA4 −1.2063194052.718323142−5.6143667022.34E−78.06E−6
C8B 1.1992894173.7244424415.5950439772.54E−78.65E−6
NR4A3 1.1530286064.6462537135.561553942.93E−79.79E−6
CCL8 1.5339544813.9329331515.4486233894.71E−71.48E−5
AQP9 −1.1822908054.835881555−5.4306160065.08E−71.56E−5
HAS2 1.5258649453.5621274255.3063119348.51E−72.44E−5
TTN −1.0432075073.817413024−5.2932384798.98E−72.56E−5
C4BPA 1.0551767987.3566915445.2684762099.95E−72.79E−5
ITGB6 1.1404444496.0767366555.2600916551.03E−62.86E−5
PDK4 −1.0214088156.733080202−5.1342671781.72E−64.47E−5
HBEGF 1.0328521555.6808062715.0618175862.31E−65.73E−5
MT1L −1.1829182742.526143932−4.9906975483.08E−67.38E−5
MT1X −1.1375704233.368378253−4.9745840063.29E−67.77E−5
AGTR2 1.3649300713.5244429934.8421128745.57E−60.000119643
ORM1 −1.3317930012.918107873−4.8007489676.56E−60.000138298
CHRM2 −1.0112819092.379732249−4.6894458741.01E−50.000200001
SERPIND1 1.2534856833.2274857424.6002755611.43E−50.000265554
CEACAM5 1.1218614313.162297534.4957195422.14E−50.000368719
SFRP2 1.1296565174.149611754.4868562172.21E−50.000377179
SELE 1.1380347483.8530296853.802144580.000266540.003035211
PTX3 −1.4022488444.135304651−3.5744132070.0005770090.005747801
HLA‐DRB1 2.0962641764.010154873.4968671410.0007454690.007027456
HLA‐DRB5 2.1348968413.0070616233.4738028380.0008039320.007481531
EIF1AY −1.5061380163.020677733−3.357479220.001170780.010029593
FGG 1.2860567574.9187165512.9654325060.0039058040.026040868
MSMB 1.9363391643.1461922422.9310362270.004320760.028265267
Figure 2

Hierarchical clustering heatmap of 139 DEGs screened on the basis of |FC| > 1 and a corrected P < 0.05. Red represents the up‐regulated DEGs, and green represents down‐regulated DEGs.

Figure 3

Volcano plots of differential gene expression data between two sets of samples. Red represents the up‐regulated DEGs, and green represents down‐regulated DEGs. adj.,P.Val, adjusted P‐value.

Screening up‐regulated and down‐regulated DEGs in COPD by integrated microarray Screening 139 DEGs in COPD by integrated microarray. AveExpr, average expression; FDR, false discovery rate Hierarchical clustering heatmap of 139 DEGs screened on the basis of |FC| > 1 and a corrected P < 0.05. Red represents the up‐regulated DEGs, and green represents down‐regulated DEGs. Volcano plots of differential gene expression data between two sets of samples. Red represents the up‐regulated DEGs, and green represents down‐regulated DEGs. adj.,P.Val, adjusted P‐value.

GO enrichment analysis of DEGs

GO analysis was done on the DEGs against BP, MF and CC terms. Biological annotation of the DEGs with COPD was identified using the david online analysis tool. As shown in Fig. 4, GO functional enrichments of the DEGs with a P‐value <0.05 were obtained. Significant results of the GO enrichment analysis of the DEGs associated with COPD were shown in Table 3. In the BP category, the DEGs were mainly involved in inflammatory response, immune response and response to lipopolysaccharide. In the MF category, the DEGs were mainly enriched in receptor activity and receptor for advanced glycation end products (RAGE) receptor binding. In the CC category, the DEGs were mainly involved in the extracellular region and space, the integral component of the plasma membrane, the plasma membrane and the external side of the plasma membrane (Fig. 5).
Figure 4

GO enrichment analysis of DEGs in COPD. GO analysis divided DEGs into three functional groups: BPs, cell composition and MF. Green represents BP category, blue represents cell composition category and red represents MF category.

Table 3

GO analysis of DEGs associated with COPD

Term IDCategoryDescriptionCount P‐valueBonferroni
GO:0006954BPInflammatory response189.11E−90.0000088
GO:0006955BPImmune response184.26E−80.0000412
GO:0032496BPResponse to lipopolysaccharide90.00005130.04832
GO:0002523BPLeukocyte migration involved in inflammatory response40.00007820.072795
GO:0007155BPCell adhesion140.00007870.073195
GO:0002576BPPlatelet degranulation70.0001750.155654
GO:0030198BPExtracellular matrix organization90.0001780.157584
GO:0050729BPPositive regulation of inflammatory response60.0002920.245789
GO:0006935BPChemotaxis70.0004380.344834
GO:0071294BPCellular response to zinc ion40.0004380.345355
GO:0045926BPNegative regulation of growth40.0004380.345355
GO:0045600BPPositive regulation of fat cell differentiation50.000530.400817
GO:0050832BPDefense response to fungus40.0012630.705093
GO:0060326BPCell chemotaxis50.0018010.824787
GO:0030593BPNeutrophil chemotaxis50.0019060.841641
GO:0010043BPResponse to zinc ion40.0029270.941058
GO:0001501BPSkeletal system development60.0048430.990808
GO:0007165BPSignal transduction190.005030.992337
GO:0002437BPInflammatory response to antigenic stimulus30.00620.997541
GO:0030178BPNegative regulation of Wnt signaling pathway40.0082580.999668
GO:0007263BPNitric oxide‐mediated signal transduction30.0098890.999932
GO:0071356BPCellular response to tumor necrosis factor50.0116760.999988
GO:0001666BPResponse to hypoxia60.0123120.999994
GO:0050776BPRegulation of immune response60.0141050.999999
GO:0035924BPCellular response to vascular endothelial growth factor stimulus30.0143310.999999
GO:0002035BPBrain renin‐angiotensin system20.0158971
GO:0070488BPNeutrophil aggregation20.0158971
GO:0006952BPDefense response40.0164211
GO:0032868BPResponse to insulin40.0164211
GO:0050900BPLeukocyte migration50.0165191
GO:0001816BPCytokine production30.0168181
GO:0035987BPEndodermal cell differentiation30.0194741
GO:0042476BPOdontogenesis30.0194741
GO:0032689BPNegative regulation of interferon‐gamma production30.0208641
GO:0010033BPResponse to organic substance30.0208641
GO:0071549BPCellular response to dexamethasone stimulus30.0222941
GO:0007204BPPositive regulation of cytosolic calcium ion concentration50.0224271
GO:1900625BPPositive regulation of monocyte aggregation20.0237511
GO:2001179BPRegulation of interleukin‐10 secretion20.0237511
GO:0032602BPChemokine production20.0237511
GO:0070295BPRenal water absorption20.0237511
GO:1902042BPNegative regulation of extrinsic apoptotic signaling pathway via death domain receptors30.02841
GO:0042742BPDefense response to bacterium50.0289711
GO:0030307BPPositive regulation of cell growth40.0296381
GO:0002793BPPositive regulation of peptide secretion20.0315431
GO:0032673BPRegulation of interleukin‐4 production20.0315431
GO:0032119BPSequestering of zinc ion20.0315431
GO:0072593BPReactive oxygen species metabolic process30.0316761
GO:0018108BPPeptidyl‐tyrosine phosphorylation50.0342541
GO:0045786BPNegative regulation of cell cycle30.0350931
GO:0042493BPResponse to drug70.035181
GO:0007160BPCell‐matrix adhesion40.0353161
GO:0006953BPAcute‐phase response30.0386461
GO:0002381BPImmunoglobulin production involved in immunoglobulin‐mediated immune response20.0392731
GO:0008285BPNegative regulation of cell proliferation80.0396041
GO:0043408BPRegulation of mitogen‐activated protein kinase cascade30.0423291
GO:0002548BPMonocyte chemotaxis30.0442191
GO:0045429BPPositive regulation of nitric oxide biosynthetic process30.0461391
GO:0010042BPResponse to manganese ion20.0469421
GO:0038084BPVascular endothelial growth factor signaling pathway20.0469421
GO:0005576CCExtracellular region442.35E−143.22E−12
GO:0005615CCExtracellular space404.29E−145.89E−12
GO:0005887CCIntegral component of plasma membrane322.71E−80.00000371
GO:0005886CCPlasma membrane560.000001320.000181
GO:0009897CCExternal side of plasma membrane100.00002840.003882
GO:0031093CCPlatelet alpha‐granule lumen50.0007130.093083
GO:0005578CCProteinaceous extracellular matrix80.0037040.398509
GO:0070062CCExtracellular exosome320.0119450.807247
GO:0030666CCEndocytic vesicle membrane40.0126980.826358
GO:0031012CCExtracellular matrix70.022280.954358
GO:0030669CCClathrin‐coated endocytic vesicle membrane30.036510.993875
GO:0048471CCPerinuclear region of cytoplasm100.0399310.996238
GO:0004872MFReceptor activity100.00003210.009231
GO:0050786MFRAGE receptor binding40.00006110.017515
GO:0004908MFInterleukin‐1 receptor activity30.0010970.271869
GO:0005201MFExtracellular matrix structural constituent40.0131710.97833
GO:0017147MFWnt protein binding30.021660.998215
GO:0004896MFCytokine receptor activity30.0286580.999776
GO:0035662MFToll‐like receptor 4 binding20.0290630.999801
GO:0004982MF N‐formyl peptide receptor activity20.0290630.999801
GO:0008201MFHeparin binding50.0303850.999866
GO:0008083MFGrowth factor activity50.0315970.999907
GO:0004714MFTransmembrane receptor protein tyrosine kinase activity30.0316770.999909
GO:0050544MFArachidonic acid binding20.0361960.999976
GO:0005160MFTransforming growth factor‐beta receptor binding30.038070.999987
GO:0005178MFIntegrin binding40.0422330.999996
GO:0042803MFProtein homodimerization activity110.0425650.999997
GO:0004875MFComplement receptor activity20.0432780.999997
Figure 5

Distribution of DEGs in COPD for the most significant GO‐enriched functions.

GO enrichment analysis of DEGs in COPD. GO analysis divided DEGs into three functional groups: BPs, cell composition and MF. Green represents BP category, blue represents cell composition category and red represents MF category. GO analysis of DEGs associated with COPD Distribution of DEGs in COPD for the most significant GO‐enriched functions.

KEGG pathway analysis of DEGs

We analyzed the DEGs using the KOBAS 3.0 online analysis database. As shown in Table 4, the DEGs were enriched in 48 pathways, especially hematopoietic cell lineage and cytokine‐cytokine receptor interaction. The genes and pathway nodes are represented by cytoscape version 3.6.1 software that was used to calculate the topological characteristics of the network and determine each node (Fig. 6).
Table 4

KEGG pathway analysis of DEGs associated with COPD

IDPathwayGene countCorrected P‐valueDEGs
hsa04640Hematopoietic cell lineage74.07E−6 FLT3, IL4R, HLA‐DRB1, HLA‐DRB5, IL1R2, CD1C, ANPEP
hsa04060Cytokine‐cytokine receptor interaction104.07E−6 FLT3, FLT1, CCL8, IL4R, INHBA, CX3CR1, IL1R2, IL18RAP, PPBP, IL18R1
hsa05150 Staphylococcus aureus infection50.000147 HLA‐DRB1, FPR2, FGG, FPR1, HLA‐DRB5
hsa05166Human T‐lymphotropic virus type 1 infection80.000174 PIK3R3, HLA‐DRB1, ATF3, HLA‐DRB5, VCAM1, IL1R2, EGR2, EGR1
hsa05321Inflammatory bowel disease50.000174 HLA‐DRB1, IL18RAP, IL4R, HLA‐DRB5, IL18R1
hsa04514Cell adhesion molecules60.000456 SELE, HLA‐DRB1, ICOS, HLA‐DRB5, VCAM1, VCAN
hsa04151Phosphoinositide 3‐kinase‐Akt signaling pathway80.000732 CHRM2, PIK3R3, ITGA10, IL4R, LAMB1, DDIT4, ITGB6, FLT1
hsa05146Amoebiasis50.000732 C8B, PIK3R3, ARG1, LAMB1, IL1R2
hsa04668Tumor necrosis factor signaling pathway50.001006 VCAM1, PIK3R3, SELE, PTGS2, IL18R1
hsa04080Neuroactive ligand‐receptor interaction70.00112 CHRM2, FPR2, AGTR2, APLNR, EDNRA, CYSLTR1, FPR1
hsa05200Pathways in cancer80.001401 DCC, FLT3, ZBTB16, LAMB1, PIK3R3, HHIP, PTGS2, EDNRA
hsa04614Renin‐angiotensin system30.00144 CPA3, AGTR2, ANPEP
hsa04610Complement and coagulation cascades40.002727 FGG, C8B, C4BPA, SERPIND1
hsa05310Asthma30.003037 HLA‐DRB1, HLA‐DRB5, MS4A2
hsa00750Vitamin B6 metabolism20.003805 PSAT1, AOX1
hsa05202Transcriptional misregulation in cancer50.005115 FLT3, ZBTB16, FLT1, NR4A3, IL1R2
hsa04933AGE‐RAGE signaling pathway in diabetic complications40.005115 VCAM1, PIK3R3, SELE, EGR1
hsa04510Focal adhesion50.007653 PIK3R3, ITGA10, ITGB6, LAMB1, FLT1
hsa04672Intestinal immune network for IgA production30.007653 HLA‐DRB1, ICOS, HLA‐DRB5
hsa05145Toxoplasmosis40.007715 HLA‐DRB1, PIK3R3, LAMB1, HLA‐DRB5
hsa04978Mineral absorption30.007715 MT1X, MT1A, MT1M
hsa04923Regulation of lipolysis in adipocytes30.009035 PIK3R3, PTGS2, FABP4
hsa05221Acute myeloid leukemia30.009074 FLT3, ZBTB16, PIK3R3
hsa04380Osteoclast differentiation40.009515 LILRB3, PIK3R3, FOSB, TREM2
hsa01100Metabolic pathways120.00997 PSAT1, HPGDS, ARG1, ST6GALNAC3, AOX1, PNMT, HSD17B6, AMPD1, PTGS2, CHIT1, MGAM, ANPEP
hsa04145Phagosome40.015537 HLA‐DRB1, TUBB1, HLA‐DRB5, OLR1
hsa05140Leishmaniasis30.015831 HLA‐DRB1, PTGS2, HLA‐DRB5
hsa04512Extracellular matrix‐receptor interaction30.02016 ITGA10, ITGB6, LAMB1
hsa05410Hypertrophic cardiomyopathy30.02016 TTN, ITGA10, ITGB6
hsa05222Small‐cell lung cancer30.021463 PIK3R3, PTGS2, LAMB1
hsa05414Dilated cardiomyopathy30.023455 TTN, ITGA10, ITGB6
hsa05323Rheumatoid arthritis30.023455 HLA‐DRB1, FLT1, HLA‐DRB5
hsa04062Chemokine signaling pathway40.023572 CX3CR1, PIK3R3, CCL8, PPBP
hsa04915Estrogen signaling pathway30.027527 FKBP5, PIK3R3, HBEGF
hsa04024cAMP signaling pathway40.027527 CHRM2, PIK3R3, HHIP, EDNRA
hsa04810Regulation of actin cytoskeleton40.032602 CHRM2, PIK3R3, ITGA10, ITGB6
hsa00350Tyrosine metabolism20.032602 AOX1, PNMT
hsa05020Prion diseases20.032602 C8B, EGR1
hsa05143African trypanosomiasis20.032602 VCAM1, SELE
hsa05330Allograft rejection20.038438 HLA‐DRB1, HLA‐DRB5
hsa04722Neurotrophin signaling pathway30.038438 IRAK3, PIK3R3, RPS6KA2
hsa05332Graft‐versus‐host disease20.042328 HLA‐DRB1, HLA‐DRB5
hsa04940Type I diabetes mellitus20.04503 HLA‐DRB1, HLA‐DRB5
hsa04973Carbohydrate digestion and absorption20.047746 MGAM, PIK3R3
hsa05322Systemic lupus erythematosus30.048708 HLA‐DRB1, C8B, HLA‐DRB5
hsa04930Type II diabetes mellitus20.049139 ABCC8, PIK3R3
hsa05144Malaria20.049139 VCAM1, SELE
hsa05030Cocaine addiction20.049139 SLC18A2, FOSB
Figure 6

The significant KEGG pathways enrichment of DEGs. Green represents down‐regulated DEGs, blue represents up‐regulated DEGs and red represents the signaling pathway.

KEGG pathway analysis of DEGs associated with COPD The significant KEGG pathways enrichment of DEGs. Green represents down‐regulated DEGs, blue represents up‐regulated DEGs and red represents the signaling pathway.

PPI network analysis of DEGs

The 139 DEGs were applied for construction of PPI networks using STRING. After removing the discrete and partially connected nodes, the PPI network data of DEGs were imported into the cytoHubba of cytoscape version 3.6.1, and a complex network of the DEGs was constructed. As shown in Figs 7 and 8, 14 Hub DEGs were obtained, including C‐X3‐C motif chemokine receptor 1 (CX3CR1), proplatelet basic protein (PPBP), prostaglandin‐endoperoxide synthase 2 (PTGS2), formyl peptide receptor 1 (FPR1), formyl peptide receptor 2 (FPR2), vascular cell adhesion molecule 1 (VCAM1), S100 calcium binding protein A12 (S100A12), arginase 1 (ARG1), early growth response 1 (EGR1), CD163, fibrinogen gamma chain (FGG), orosomucoid 1 (ORM1), S100 calcium binding protein A8 (S100A8) and S100 calcium binding protein A9 (S100A9).
Figure 7

PPI network and Hub DEGs. Hub DEGs were identified with cytoscape version 3.6.1 with cytoHubba plugin, according to the rank of connection degree (number) for each gene, which is represented by the different degrees of color (from red to yellow): the role of the gene is greater in the PPI network with the darker color of the gene. Red, saffron yellow and yellow represent Hub DEGs.

Figure 8

PPI network identified Hub DEGs. Numbers represent connection points of the 14 Hub genes identified by the cytoHubba plugin.

PPI network and Hub DEGs. Hub DEGs were identified with cytoscape version 3.6.1 with cytoHubba plugin, according to the rank of connection degree (number) for each gene, which is represented by the different degrees of color (from red to yellow): the role of the gene is greater in the PPI network with the darker color of the gene. Red, saffron yellow and yellow represent Hub DEGs. PPI network identified Hub DEGs. Numbers represent connection points of the 14 Hub genes identified by the cytoHubba plugin.

wgcna network construction in lung tissues

A wgcna network was first constructed using lung tissue expression data from cohorts GSE27597 and GSE106986, independent of COPD status and smoking status (ever/current smoking versus nonsmoking). A total of 2942 DEGs were selected (a corrected P < 0.05) and subsequently used to identify modules of coexpressed genes using a hierarchical clustering procedure. The corresponding branches of the resulting dynamic clustering tree and module are shown as colored bands underneath the cluster tree. We then merged the highly similar dynamic clustering modules into the merged dynamic modules (cut height = 0.25) (Fig. 9). We identified nine modules ranging in size from 113 genes in the Purple module to 1081 in the Grey module. A module eigengene, a weighted average of the module gene expression profiles, was used to summarize the expression profiles of transcripts in a given module through their first principal component.
Figure 9

Network construction identifies distinct modules of coexpressed genes. The network was constructed using the lung tissue expression dataset of GSE27597 and GSE106986. The cluster dendrogram was produced by average linkage hierarchical clustering of genes using 1 − topological overlap as dissimilarity measure. Modules (Dynamic Tree Cut) and similarly merged modules (Merged dynamic) of coexpressed genes were assigned colors corresponding to the branches indicated by the horizontal bar beneath the dendrogram (merged cut height = 0.25).

Network construction identifies distinct modules of coexpressed genes. The network was constructed using the lung tissue expression dataset of GSE27597 and GSE106986. The cluster dendrogram was produced by average linkage hierarchical clustering of genes using 1 − topological overlap as dissimilarity measure. Modules (Dynamic Tree Cut) and similarly merged modules (Merged dynamic) of coexpressed genes were assigned colors corresponding to the branches indicated by the horizontal bar beneath the dendrogram (merged cut height = 0.25).

Coexpression modules associated with COPD

To pinpoint modules associated with COPD and smoking status, we analyzed the association of each of the nine module eigengenes with the two traits. As shown in Fig. 10 and Table 5, all nine modules were significantly correlated with COPD and smoking status. Four modules were negatively associated with COPD and smoking status, marked Tan, Brown, Blue and Cyan, indicating that genes in these modules were predominantly down‐regulated in COPD cases and those who had a history of smoking. However, five modules, in Green yellow, Purple, Black, Red and Grey, were positively associated with COPD cases and smoking status, showing that genes in these modules are predominantly up‐regulated with the traits.
Figure 10

wgcna heatmap. Using the default parameter setting and all DEGs (n = 2942), we identified nine gene modules using wgcna that were positively or negatively associated with COPD and smoking trait. Each row corresponds to a module eigengene and each column to a clinical trait (COPD and smoking status). Positive associations are red, and negative associations are green. HCS, history of smoking.

Table 5

Correlation of module eigengene with COPD and smoking status traits

wgcna modulesGene numberMerged COPD datasetSmoking status
Correlation P‐valueCorrelation P‐value
Tan353−0.698E−13−0.330.002
Brown244−0.52E−6−0.369E−4
Blue202−0.541E−7−0.589E−9
Cyan467−0.611E−9−0.511E−6
Green yellow1050.589E−90.435E−5
Purple1130.572E−80.485E−6
Black2560.632E−100.519E−7
Red1210.632E−100.384E−4
Grey10810.823E−210.674E−12
wgcna heatmap. Using the default parameter setting and all DEGs (n = 2942), we identified nine gene modules using wgcna that were positively or negatively associated with COPD and smoking trait. Each row corresponds to a module eigengene and each column to a clinical trait (COPD and smoking status). Positive associations are red, and negative associations are green. HCS, history of smoking. Correlation of module eigengene with COPD and smoking status traits Four of these nine gene modules, in Cyan, Purple, Red and Grey, attracted our attention in that 14 Hub genes were identified as DEGs from the PPI analysis, including CX3CR1, PPBP, PTGS2, FPR1, FPR2, S100A12, ARG1, EGR1, CD163, VCAM1, FGG, ORM1, S100A8 and S100A9. We calculated gene significance (GS) versus each MM. We found that the 14 Hub genes were also either positively or negatively associated with COPD (Table 6). CX3CR1, PPBP, PTGS2, VCAM1, S100A12, ARG1, EGR1, CD163, S100A8 and S100A9 were significantly associated with each MM, whereas FPR1, FPR2 and ORM1 were correlated with each MM except Red MM, and FGG was correlated to each MM except the Purple MM. In addition, we found that the Purple (CX3CR1), Red (EGR1, VCAM1 and PTGS2) and Grey (ARG1, FGG, and PPBP) genes most significantly correlated with GS for COPD were also the important MM elements (Fig. 11).
Table 6

Fourteen Hub genes positively or negatively associated with COPD and each MM. P, P‐value for COPD or each MM

GeneLocated moduleGS.COPD P GS.COPD P.MM Cyan P.MM Grey P.MM Purple P.MM Red
CD163 Cyan−0.7399740231.33E−154.23E−218.88E−100.002492.57E−5
FPR1 Cyan−0.6622235389.25E−127.92E−235.84E−76.61E−50.07339
FPR2 Cyan−0.639830637.43E−111.37E−214.51E−76.73E−50.07508
ORM1 Cyan−0.4597214441.23E−53.81E−200.000140.001960.70155
S100A12 Cyan−0.7215778721.41E−141.50E−142.82E−74.13E−50.00278
S100A8 Cyan−0.705989899.04E−146.83E−184.01E−70.000260.01347
S100A9 Cyan−0.6234680373.07E−107.84E−204.68E−60.000490.04290
ARG1 Grey−0.6324800571.42E−100.023742.47E−100.000542.19E−9
FGG Grey0.3040972610.005190.006460.0059690.24253.39E−8
PPBP Grey−0.5203148854.61E−75.44E−60.0018420.03880.01072
CX3CR1 Purple0.5532126755.84E−80.001339.38E−101.81E−134.46E−5
EGR1 Red0.5497548357.33E−80.001272.09E−60.039031.80E−14
PTGS2 Red0.6986848082.07E−130.003412.42E−114.99E−51.10E−28
VCAM1 Red0.73444824902.75E−150.009472.66E−125.70E−133.14E−16
Figure 11

COPD absolute GS versus MM. wgcna calculation of GS to COPD versus MM. In oversimplified terms, MM is a measure of how ‘tight’ genes cluster within the module, or mathematically, how close gene expression is to the module eigenvalue. A gene with high MM and GS identifies Hub genes that are both key components to the underlying BP and highly associated with the trait of interest. The GS for COPD was plotted. (A) Cyan represents MM. (B) Grey represents MM. (C) Purple represents MM. (D) Red represents MM.

Fourteen Hub genes positively or negatively associated with COPD and each MM. P, P‐value for COPD or each MM COPD absolute GS versus MM. wgcna calculation of GS to COPD versus MM. In oversimplified terms, MM is a measure of how ‘tight’ genes cluster within the module, or mathematically, how close gene expression is to the module eigenvalue. A gene with high MM and GS identifies Hub genes that are both key components to the underlying BP and highly associated with the trait of interest. The GS for COPD was plotted. (A) Cyan represents MM. (B) Grey represents MM. (C) Purple represents MM. (D) Red represents MM.

Discussion

In this study, we performed an integrated analysis on the gene expression profiles from lung tissues with or without COPD, aiming to identify the DEGs and related key signaling pathways for the disease. We identified 139 DEGs, including 62 up‐regulated genes and 77 down‐regulated genes. In addition, GO analysis showed that the 139 DEGs associated with COPD were involved in 60 BPs, 16 MFs and 12 CCs. Among these categories, the most important BPs included inflammatory response, immune response and response to lipopolysaccharide; the most important MFs included receptor activity and RAGE receptor binding; and the most important CCs included extracellular region and space, integral component of plasma membrane, plasma membrane and external side of plasma membrane. This finding accords with the knowledge that COPD is characterized by chronic inflammation in the lung and airways 28, 29; immune response mediates the development of COPD caused by the harmful stimuli 30, 31, 32; lipopolysaccharide may lead to increased airway and systemic inflammation, and contribute to the progressive deterioration of lung function 33, 34; and RAGE is a ‘driving force’ for cigarette smoke (CS)‐induced airway inflammation in COPD 35. KEGG pathway analysis indicated that 48 pathways corresponded to these DEGs associated with COPD. Two pathways including hematopoietic cell lineage and cytokine‐cytokine receptor interaction were most important. This finding is in line with the results from previous studies 18, 32. The PPI network of proteins encoded by DEGs identified 14 Hub DEGs associated with COPD, including CX3CR1, PPBP, PTGS2, FPR1, FPR2, VCAM1, S100A12, ARG1, EGR1, CD163, FGG, ORM1, S100A8 and S100A9. All of these Hub genes were involved in the most important two BPs, two MFs or five CCs revealed by GO analysis, and were mainly implicated in multiple pathways identified by KEGG analysis in this study. Those results indicate that these Hub DEGs are involved in the development and progression of COPD by playing important biological roles in multiple signaling pathways. Using wgcna on the merged expression profile from two cohorts of lung tissues with COPD and healthy controls, we identified a set of gene signatures based on the 14 Hub genes. The increased expression of CX3CR1, FGG, EGR1, VCAM1 and PTGS2 is positively associated with COPD, and the underexpression of PPBP, FPR1, FPR2, S100A12, ARG1, CD163, ORM1, S100A8 and S100A9 is negatively associated with COPD. CX3CR1 plays an important role in the development of chronic inflammatory lung diseases, such as COPD and emphysema, by contributing to structural destruction and remodeling. Chemoattractant inflammatory cells releasing CX3CR1, such as CD8−/CD4, dendritic cells, γδ T lymphocytes, natural killer cells and monocytes/macrophages, may lead to mononuclear cell accumulation in the parenchyma and lung vessel walls, release mediators to induce injury, stimulate proliferation and chemoattractant inflammatory cells 36. In addition, CX3CR1 + mononuclear phagocytes may induce an innate immune response to CS via producing interleukin‐6 and tumor necrosis factor‐α, and contribute to emphysema 37. PTGS2 (COX‐2), an important mediator of inflammation, was shown to be involved in inflammation response and associated with COPD pathogenesis 38, 39, 40, 41. The decreased activity of PTGS2 may protect smokers against the development of COPD 40. Furthermore, FPR1 and FPR2 were reported to be involved in recruitment and activation of inflammatory cells induced by CS, and play important roles in lung inflammation, injury and the pathogenesis of COPD 42, 43, 44, 45. S100A8, S100A9, and S100A12 might induce neutrophil and monocyte chemotaxis, adhesion to fibrinogen and diapedesis, and neutrophil migration to inflammatory sites 46, 47, and have been identified as key biomarkers in inflammatory diseases including COPD and neutrophil‐dominated infections 35, 48, 49. The mRNA levels of S100A8, S100A9 and S100A12 may be regulated by RAGE, which was shown to contribute to CS‐induced airway inflammation in COPD 35. This is consistent with our result in this study that the RAGE pathway including S100A8, S100A9 and S100A12 is important in the development of COPD. EGR1 is an autophagy regulator gene that plays important roles in cellular homeostasis, airway remodeling and control of inflammatory immune response; it is also a significant risk factor for susceptibility to COPD 50, 51, 52. EGR1 may be induced by CS and involved in proinflammatory mechanisms that are likely associated with the development of COPD 51, whereas Egr‐1−/− mice were observed to resist CS‐induced autophagy, apoptosis and emphysema 53. These findings exhibit a critical role for EGR1 in CS‐induced inflammatory immune response and COPD, and effective inhibition of EGR1 may attenuate airway remodeling and inflammation associated with the pathology of COPD. CD163, a carefully regulated component of the innate immune response to infection and a macrophage receptor for bacteria, was shown to play important roles in functional pulmonary defense elements and the inflammatory immune response of the respiratory system 54, 55. Overexpression of CD163 on lung alveolar macrophages may be implicated in the pathogenesis of COPD and poor lung function 56. ARG1 was shown to contribute to asthma pathogenesis by inhibiting nitric oxide production, modulating fibrosis, regulating arginine metabolism and inhibiting T cell proliferation, and it involves the initiation of adaptive T helper 2 cell‐mediated allergic lung inflammation by regulating group 2 innate lymphoid cells 57, 58, 59, 60, whereas ARG1 ablation in the lung may enhance peripheral lung function but have little effect on airway inflammation 61. The role of ARG1 in COPD needs to be studied in the future. ORM1 appears to function in regulating the activity of the immune system during the acute‐phase reaction and has been identified as an acute exacerbation of COPD‐specific immunomodulatory mediator 62. PPBP serves as a potent neutrophil chemoattractant and activator, and its elevated expression in the bronchial mucosa might be involved in the pathogenesis of COPD 63, 64. In addition, VCAM1 was shown to express in endothelial cells of atopic asthma cases, but not COPD cases 65, and present an association with lung function 66. FGG was found to be involved in blast lung injury resistance via promoting tissue‐protective adenosine signaling 67. In the lung tissues of COPD cases, its mRNA expression was reported to correlate with the burden of particulate matter in total lung and lung parenchyma 68. In conclusion, we have identified 139 candidate DEGs associated with the progression of COPD. The results from bioinformatic analysis are in agreement with those from previous cell and animal models and human studies. Our results showed that nine Hub genes, CX3CR1, PTGS2, FPR1, FPR2, S100A12, EGR1, CD163, S100A8 and S100A9, potentially mediated inflammation and injury of the lung, and play critical roles in the pathogenesis of COPD. The roles of five Hub genes, including PPBP, ARG1, FGG, VCAM1 and ORM1, identified to be associated with COPD in this study need to be confirmed in the future. These findings could improve our understanding of the underlying molecular mechanisms of COPD and provide us with insights for drug target discovery for the disease.

Conflict of interest

The authors declare no conflict of interest.

Author contributions

XK, FY and QW conceived and designed the project. XH and YL acquired the data. XG and ZZ analyzed and interpreted the data. XH and YL wrote the paper.
  68 in total

1.  Selection of AECOPD-specific immunomodulatory biomarkers by integrating genomics and proteomics with clinical informatics.

Authors:  Lin Shi; Bijun Zhu; Menglin Xu; Xiangdong Wang
Journal:  Cell Biol Toxicol       Date:  2017-08-04       Impact factor: 6.691

2.  CX3CR1+ lung mononuclear phagocytes spatially confined to the interstitium produce TNF-α and IL-6 and promote cigarette smoke-induced emphysema.

Authors:  Zeyu Xiong; Adriana S Leme; Prabir Ray; Steven D Shapiro; Janet S Lee
Journal:  J Immunol       Date:  2011-01-28       Impact factor: 5.422

3.  Asthma and bronchodilator responsiveness are associated with polymorphic markers of ARG1, CRHR2 and chromosome 17q21.

Authors:  Hing Yee Sy; Fanny W S Ko; Hong Yin Chu; Iris H S Chan; Gary W K Wong; David S C Hui; Ting Fan Leung
Journal:  Pharmacogenet Genomics       Date:  2012-07       Impact factor: 2.089

4.  Association of Egr-1 and autophagy-related gene polymorphism in men with chronic obstructive pulmonary disease.

Authors:  Chiung-Zuei Chen; Chih-Ying Ou; Ru-Hsueh Wang; Cheng-Hung Lee; Chien-Chung Lin; Han-Yu Chang; Tzuen-Ren Hsiue
Journal:  J Formos Med Assoc       Date:  2013-09-04       Impact factor: 3.282

5.  Rapamycin decreases airway remodeling and hyperreactivity in a transgenic model of noninflammatory lung disease.

Authors:  Elizabeth L Kramer; William D Hardie; Elizabeth M Mushaben; Thomas H Acciani; Patricia A Pastura; Thomas R Korfhagen; Gurjit Khurana Hershey; Jeffrey A Whitsett; Timothy D Le Cras
Journal:  J Appl Physiol (1985)       Date:  2011-09-08

6.  Different expression ratio of S100A8/A9 and S100A12 in acute and chronic lung diseases.

Authors:  Eva Lorenz; Marianne S Muhlebach; Philippe A Tessier; Neil E Alexis; R Duncan Hite; Michael C Seeds; David B Peden; Wayne Meredith
Journal:  Respir Med       Date:  2007-12-27       Impact factor: 3.415

7.  Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980-2017: a systematic analysis for the Global Burden of Disease Study 2017.

Authors: 
Journal:  Lancet       Date:  2018-11-08       Impact factor: 79.321

Review 8.  Global, regional, and national incidence, prevalence, and years lived with disability for 301 acute and chronic diseases and injuries in 188 countries, 1990-2013: a systematic analysis for the Global Burden of Disease Study 2013.

Authors: 
Journal:  Lancet       Date:  2015-06-07       Impact factor: 202.731

9.  cytoHubba: identifying hub objects and sub-networks from complex interactome.

Authors:  Chia-Hao Chin; Shu-Hwa Chen; Hsin-Hung Wu; Chin-Wen Ho; Ming-Tat Ko; Chung-Yen Lin
Journal:  BMC Syst Biol       Date:  2014-12-08

10.  Global, regional, and national age-sex specific mortality for 264 causes of death, 1980-2016: a systematic analysis for the Global Burden of Disease Study 2016.

Authors: 
Journal:  Lancet       Date:  2017-09-16       Impact factor: 79.321

View more
  10 in total

Review 1.  α-Arrestins and Their Functions: From Yeast to Human Health.

Authors:  Kacper Zbieralski; Donata Wawrzycka
Journal:  Int J Mol Sci       Date:  2022-04-30       Impact factor: 6.208

2.  EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models.

Authors:  Xiangju Liu; Yu Zhang; Chunli Fu; Ruochi Zhang; Fengfeng Zhou
Journal:  Front Genet       Date:  2021-04-27       Impact factor: 4.599

3.  Integrated analysis reveals lung fibrinogen gamma chain as a biomarker for chronic obstructive pulmonary disease.

Authors:  Hai Zhang; Chenfei Li; Xiaomin Song; Lei Cheng; Qi Liu; Na Zhang; Liangyu Wei; Kianfan Chung; Ian M Adcock; Chunhua Ling; Feng Li
Journal:  Ann Transl Med       Date:  2021-12

4.  Differences of core genes in liver fibrosis and hepatocellular carcinoma: Evidence from integrated bioinformatics and immunohistochemical analysis.

Authors:  Yue Li; Shou-Li Yuan; Jing-Ya Yin; Kun Yang; Xin-Gang Zhou; Wen Xie; Qi Wang
Journal:  World J Gastrointest Oncol       Date:  2022-07-15

5.  Identification of Inflammation-Related Biomarker Lp-PLA2 for Patients With COPD by Comprehensive Analysis.

Authors:  Mingming Deng; Yan Yin; Qin Zhang; Xiaoming Zhou; Gang Hou
Journal:  Front Immunol       Date:  2021-05-21       Impact factor: 7.561

6.  Identification of the hub genes and prognostic indicators of gastric cancer and correlation of indicators with tumor-infiltrating immune cell levels.

Authors:  Yun Ji; Lu Gao; Can Zhang; Xu Sun; Liping Dai; Zhenyu Ji; Jianying Zhang; Zhida Zhang; Wei Cao; Yang Zhao; Liguo Zhang
Journal:  J Cancer       Date:  2021-05-13       Impact factor: 4.207

7.  m6A RNA methylation regulators could contribute to the occurrence of chronic obstructive pulmonary disease.

Authors:  Xinwei Huang; Dongjin Lv; Xiao Yang; Min Li; Hong Zhang
Journal:  J Cell Mol Med       Date:  2020-09-22       Impact factor: 5.310

8.  Breast Cancer Case Identification Based on Deep Learning and Bioinformatics Analysis.

Authors:  Dongfang Jia; Cheng Chen; Chen Chen; Fangfang Chen; Ningrui Zhang; Ziwei Yan; Xiaoyi Lv
Journal:  Front Genet       Date:  2021-05-17       Impact factor: 4.599

9.  Identification of Hub Genes Associated with COPD Through Integrated Bioinformatics Analysis.

Authors:  Lin Chen; Donglan Zhu; Jinfu Huang; Hui Zhang; Guang Zhou; Xiaoning Zhong
Journal:  Int J Chron Obstruct Pulmon Dis       Date:  2022-03-03

Review 10.  The S100 Protein Family as Players and Therapeutic Targets in Pulmonary Diseases.

Authors:  Zeeshan Sattar; Alnardo Lora; Bakr Jundi; Christopher Railwah; Patrick Geraghty
Journal:  Pulm Med       Date:  2021-06-18
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.