| Literature DB >> 29016655 |
Dominik Reinhold1, Jarrett D Morrow2, Sean Jacobson3, Junxiao Hu1, Benjamin Ringel4, Max A Seibold4,5, Craig P Hersh2, Katerina J Kechris1, Russell P Bowler3.
Abstract
Chronic obstructive pulmonary disease (COPD) occurs typically in current or former smokers, but only a minority of people with smoking history develops the disease. Besides environmental factors, genetics is an important risk factor for COPD. However, the relationship between genetics, environment and phenotypes is not well understood. Sample sizes for genome-wide expression studies based on lung tissue have been small due to the invasive nature of sample collection. Increasing evidence for the systemic nature of the disease makes blood a good alternative source to study the disease, but there have also been few large-scale blood genomic studies in COPD. Due to the complexity and heterogeneity of COPD, examining groups of interacting genes may have more relevance than identifying individual genes. Therefore, we used Weighted Gene Co-expression Network Analysis to find groups of genes (modules) that are highly connected. However, module definitions may vary between individual data sets. To alleviate this problem, we used a consensus module definition based on two cohorts, COPDGene and ECLIPSE. We studied the relationship between the consensus modules and COPD phenotypes airflow obstruction and emphysema. We also used these consensus module definitions on an independent cohort (TESRA) and performed a meta analysis involving all data sets. We found several modules that are associated with COPD phenotypes, are enriched in functional categories and are overrepresented for cell-type specific genes. Of the 14 consensus modules, three were strongly associated with airflow obstruction (meta p ≤ 0.0002), and two had some association with emphysema (meta p ≤ 0.06); some associations were stronger in the case-control cohorts, and others in the cases-only subcohorts. Gene Ontology terms that were overrepresented included "immune response" and "defense response." The cell types whose type-specific genes were overrepresented in modules (p < 0.05) included natural killer cells, dendritic cells, and neutrophils. Together, this is the largest investigation of gene blood expression in COPD with 469 cases in COPDGene, ECLIPSE and TESRA combined, with 6267 genes common to all data sets. Additional, we have 42 and 83 controls in COPDGene and ECLIPSE, respectively.Entities:
Mesh:
Year: 2017 PMID: 29016655 PMCID: PMC5633174 DOI: 10.1371/journal.pone.0185682
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Demographics.
| COPDGene cases | ECLIPSE cases | TESRA cases | COPDGene control | ECLIPSE control | |
|---|---|---|---|---|---|
| n | 83 | 138 | 248 | 42 | 83 |
| Age (mean (sd)) | 64.70 (8.28) | 64.92 (5.35) | 66.4 (7.9) | 60.42 (9.07) | 61.29 (8.46) |
| Sex (number (%) males) | 49 (59.04) | 93 (67.4) | 170 (68.6) | 22 (52.38) | 53 (63.86) |
| BMI (mean (sd)) | 27.77 (6.07) | 26.93 (5.27) | 26 (4.7) | 27.98 (5.00) | 28.02 (3.89) |
| Ethnicity/Race | |||||
| White | 82 (98.80) | 137 (99.28) | 248 (100) | 42 (100) | 81 (97.59) |
| Black | 1 (1.20) | 0 (0) | 0 (0) | 0 (0) | 2 (2.41) |
| Other | 0 (0) | 1 (0.72) | 0 (0) | 0 (0) | 0 (0) |
| Current smoker (%) | 17 (20.48) | 0 (0) | 0 (0) | 13 (30.95) | 0 (0) |
| Pack Years (mean (sd)) | 49.58 (24.64) | 47.14 (29.33) | 47 (24.3) | 44.47 (28.49) | 29.96 (26.97) |
| Percent Emph; -950 HU (mean (sd)) | 13.58 (11.12) | 18.59 (11.95) | NA | 1.36 (1.51) | 4.31 (3.79) |
| Percent Emph; -910 HU (mean (sd)) | 39.80 (18.26) | NA | 41.65 (16.2) | 20.86 (11.74) | NA |
| FEV1% (mean (sd)) | 50.62 (21.13) | 50.31 (16.68) | 48.8 (9.3) | 97.87 (13.80) | 110.29 (14.58) |
| FEV1/FVC (mean (sd)) | 0.49 (0.14) | 0.44 (0.12) | 0.43 (0.09) | 0.77 (0.04) | 0.80 (0.06) |
| GOLD stage | |||||
| GOLD 0 | 0 (0) | 0 (0) | 0 (0) | 42 (100) | 83 (100) |
| GOLD 1 | 6 (7.23) | 3 (2.17) | 0 (0) | 0 (0) | 0 (0) |
| GOLD 2 | 35 (42.17) | 67 (48.55) | 105 (42.3) | 0 (0) | 0 (0) |
| GOLD 3 | 25 (30.12) | 55 (39.85) | 139 (56.0) | 0 (0) | 0 (0) |
| GOLD 4 | 17 (20.48) | 13 (9.42) | 4 (1.6) | 0 (0) | 0 (0) |
| Exacerbation Freq. (mean (sd)) | 0.75 (1.16) | 0.91 (1.57) | 0.60 (0.65) | 0.29 (0.86) | 0 (0) |
† Seven and nine cases in COPDGene and ECLIPSE, respectively, did not have emphysema data. In the controls of ECLIPSE, 13 individuals had no emphysema data.
†† Cases of COPD all have FEV1/FVC < 0.7, and are further classified in GOLD Stages 1-4, corresponding to mild (FEV1% ≥80), moderate (50% ≤ FEV1% < 80%), severe (30% ≤ FEV1% <50%), and very severe (FEV1% < 30%) COPD [22].
Fig 1Correspondence of individual and consensus modules.
Left (right, respectively) panel shows the correspondence between consensus modules and COPDGene (ECLIPSE, respectively) modules. The row labels give the color (label) of the consensus modules and the number of genes in the module. The columns represent in an analogous way the modules for COPDGene and ECLIPSE. The number in each cell gives the number of genes common to the modules in the corresponding row and column; the heatmap colors represent the −log10 transformed p-values (truncated at 50), which are based on Fisher’s exact test based on the hypergeometric distribution. Note that the module colors have no meaning, they simply represent consensus modules or modules in the individual data sets.
Fig 2Consensus module−trait relationships across COPDGene and ECLIPSE for cases and controls.
Meta-analysis Z-scores and p-values were based on cases and controls from COPDGene and ECLIPSE. The sign and magnitude of the Z-score give information about the overall direction and magnitude of association.
Fig 3Consensus module-trait relationships across COPDGene, ECLIPSE, and TESRA for cases only.
Z-scores and meta p-values for FEV1% and FEV1/FVC were based on COPDGene, ECLIPSE, and TESRA cases. The meta p-value for emphysema was based on COPDGene and ECLIPSE only, since the definition of emphysema in TESRA differed (see Materials and methods). As in Fig 2, the relationship information is given in terms of Z-scores rather than correlations.
Top 10 module membership genes.
The module membership of a gene (kME) is the correlation between the gene and the eigengene of the module. For the black, salmon, red and greenyellow modules, we list the top 10 kME genes based on the sum of ranks in the COPDGene and ECLIPSE cohort. The ranks were based on kME. For COPDGene and ECLIPSE, the correlation direction between the gene and the phenotypes are represented by “+” and “-”, if the correlation is significant at α = 0.05.
| Module | Genes | Rank | COPDGene | ECLIPSE | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| COPDGene | ECLIPSE | Sum | FEV1% | FEV1/FVC | Emph. | FEV1% | FEV1/FVC | Emph. | ||
| red | FCGRT | 5 | 4 | 9 | - | - | - | - | ||
| TALDO1 | 8 | 10 | 18 | - | - | |||||
| NFAM1 | 11 | 15 | 26 | - | - | |||||
| TYROBP | 2 | 28 | 30 | - | - | - | - | |||
| PFKFB4 | 18 | 14 | 32 | - | - | - | - | |||
| ZNF467 | 21 | 11 | 32 | - | - | |||||
| FES | 12 | 21 | 33 | - | - | - | - | |||
| CFP | 3 | 32 | 35 | - | - | - | - | |||
| LTBR | 6 | 29 | 35 | - | - | - | - | |||
| TKT | 20 | 17 | 37 | - | - | - | - | |||
| black | KLF12 | 2 | 1 | 3 | + | + | + | |||
| PLEKHA1 | 9 | 2 | 11 | + | + | + | + | |||
| OXCT1 | 10 | 7 | 17 | + | + | + | ||||
| SLAIN1 | 3 | 20 | 23 | + | + | + | + | |||
| PSIP1 | 23 | 3 | 26 | + | + | + | + | |||
| RASGRP1 | 32 | 4 | 36 | + | + | + | + | |||
| CDC42SE2 | 27 | 11 | 38 | + | + | + | + | |||
| DDHD2 | 36 | 12 | 48 | + | + | + | + | |||
| ITK | 25 | 23 | 48 | + | + | + | + | |||
| TC2N | 13 | 37 | 50 | |||||||
| greenyellow | EIF4EBP1 | 7 | 1 | 8 | - | - | ||||
| PYCARD | 2 | 6 | 8 | - | - | - | - | |||
| TMEM205 | 8 | 2 | 10 | - | - | |||||
| FAM127A | 10 | 5 | 15 | - | ||||||
| AGTRAP | 4 | 12 | 16 | - | - | - | - | |||
| AP2S1 | 13 | 3 | 16 | - | ||||||
| KIAA0930 | 5 | 15 | 20 | - | - | - | - | |||
| NRM | 17 | 7 | 24 | - | ||||||
| TSPO | 3 | 22 | 25 | - | - | |||||
| ZNF385A | 9 | 16 | 25 | - | - | |||||
| salmon | IFI35 | 3 | 2 | 5 | - | - | ||||
| XAF1 | 5 | 1 | 6 | - | ||||||
| IFIT3 | 7 | 3 | 10 | - | - | |||||
| HERC5 | 4 | 7 | 11 | |||||||
| RSAD2 | 2 | 9 | 11 | - | - | |||||
| MX1 | 13 | 5 | 18 | |||||||
| IFI6 | 6 | 13 | 19 | - | - | |||||
| UBE2L6 | 1 | 18 | 19 | |||||||
| IFI44L | 9 | 11 | 20 | |||||||
| OASL | 14 | 6 | 20 | - | - | |||||
Panther results for biological processes.
For each of the salmon, red, greenyellow, and black module, the top 10 enriched biological processes GO terms are presented. The reference list consists of the 6322 genes that were mapped by PANTHER and were common to all data sets and used for module construction. The # Hits denotes the number of genes in the module that are in the GO term. Fold Enrichment gives the overrepresentation. The p-values were calculated by the PANTHER software based on the binomial distribution. They were adjusted based on the Benjamini-Hochberg procedure on a per module basis, considering all tests for biological processes and molecular function (see S2 Table for molecular function GO term results).
| Module | Top Biological Processes | Ref. list | # Hits | Fold | p-value | p.adjust |
|---|---|---|---|---|---|---|
| red | immune response (GO:0006955) | 441 | 33 | 2.82 | 6.1E-08 | 3.6E-05 |
| defense response (GO:0006952) | 454 | 31 | 2.57 | 1.2E-06 | 3.4E-04 | |
| actin polymerization or depolymerization (GO:0008154) | 21 | 7 | 12.54 | 1.9E-06 | 3.6E-04 | |
| single organism signaling (GO:0044700) | 1613 | 70 | 1.63 | 3.6E-06 | 3.6E-04 | |
| signaling (GO:0023052) | 1615 | 70 | 1.63 | 3.8E-06 | 3.6E-04 | |
| signal transduction (GO:0007165) | 1555 | 68 | 1.65 | 4.2E-06 | 3.6E-04 | |
| actin filament organization (GO:0007015) | 75 | 11 | 5.52 | 6.4E-06 | 4.7E-04 | |
| immune system process (GO:0002376) | 864 | 44 | 1.92 | 1.3E-05 | 7.7E-04 | |
| actin filament polymerization (GO:0030041) | 11 | 5 | 17.1 | 1.3E-05 | 7.7E-04 | |
| cell communication (GO:0007154) | 1649 | 69 | 1.57 | 1.7E-05 | 9.0E-04 | |
| black | mitotic cell cycle checkpoint (GO:0007093) | 78 | 8 | 3.95 | 1.1E-03 | 2.6E-02 |
| regulation of natural killer cell activation (GO:0032814) | 10 | 3 | 11.56 | 2.4E-03 | 3.6E-02 | |
| mitotic DNA integrity checkpoint (GO:0044774) | 54 | 6 | 4.28 | 3.0E-03 | 4.0E-02 | |
| DNA integrity checkpoint (GO:0031570) | 78 | 7 | 3.46 | 4.5E-03 | 4.3E-02 | |
| cell cycle checkpoint (GO:0000075) | 103 | 8 | 2.99 | 5.8E-03 | 4.3E-02 | |
| negative regulation of mitotic cell cycle (GO:0045930) | 107 | 8 | 2.88 | 7.2E-03 | 4.3E-02 | |
| snoRNA localization (GO:0048254) | 5 | 2 | 15.42 | 7.7E-03 | 4.3E-02 | |
| pos. regul. of lymphocyte mediated immunity (GO:0002708) | 31 | 4 | 4.97 | 9.0E-03 | 4.3E-02 | |
| regulation of interleukin-10 production (GO:0032653) | 17 | 3 | 6.8 | 1.0E-02 | 4.3E-02 | |
| neural tube patterning (GO:0021532) | 6 | 2 | 12.85 | 1.1E-02 | 4.3E-02 | |
| greenyellow | multicellular organismal response to stress (GO:0033555) | 17 | 4 | 14.88 | 1.7E-04 | 4.5E-03 |
| hexose metabolic process (GO:0019318) | 77 | 7 | 5.75 | 2.4E-04 | 6.0E-03 | |
| monosaccharide metabolic process (GO:0005996) | 81 | 7 | 5.46 | 3.2E-04 | 6.6E-03 | |
| hexose catabolic process (GO:0019320) | 21 | 4 | 12.04 | 3.7E-04 | 6.6E-03 | |
| glutathione derivative biosynthetic process (GO:1901687) | 9 | 3 | 21.07 | 4.2E-04 | 6.6E-03 | |
| glutathione derivative metabolic process (GO:1901685) | 9 | 3 | 21.07 | 4.2E-04 | 6.6E-03 | |
| monosaccharide catabolic process (GO:0046365) | 22 | 4 | 11.49 | 4.4E-04 | 6.6E-03 | |
| glucose metabolic process (GO:0006006) | 62 | 6 | 6.12 | 4.8E-04 | 6.8E-03 | |
| regulation of endopeptidase activity (GO:0052548) | 146 | 9 | 3.9 | 5.4E-04 | 7.3E-03 | |
| regulation of peptidase activity (GO:0052547) | 151 | 9 | 3.77 | 6.8E-04 | 7.7E-03 | |
| salmon | type I interferon signaling pathway (GO:0060337) | 38 | 19 | 46.49 | 1.6E-26 | 2.3E-24 |
| cellular response to type I interferon (GO:0071357) | 38 | 19 | 46.49 | 1.6E-26 | 2.3E-24 | |
| response to type I interferon (GO:0034340) | 42 | 19 | 42.06 | 1.0E-25 | 9.9E-24 | |
| cytokine-mediated signaling pathway (GO:0019221) | 195 | 27 | 12.87 | 3.2E-23 | 2.3E-21 | |
| innate immune response (GO:0045087) | 239 | 28 | 10.89 | 3.4E-22 | 1.9E-20 | |
| defense response to virus (GO:0051607) | 80 | 20 | 23.24 | 5.1E-22 | 2.4E-20 | |
| response to virus (GO:0009615) | 125 | 22 | 16.36 | 5.4E-21 | 2.2E-19 | |
| immune response (GO:0006955) | 441 | 33 | 6.96 | 1.6E-20 | 5.9E-19 | |
| response to cytokine (GO:0034097) | 316 | 29 | 8.53 | 3.7E-20 | 1.2E-18 | |
| cellular response to cytokine stimulus (GO:0071345) | 264 | 27 | 9.51 | 7.3E-20 | 2.1E-18 |
Overrepresentation of cell-type specific genes in modules.
Only overrepresentations significant at level α = 0.05 are displayed for readability. For a complete table of p-values, see S3 Table. The 11 groups of distinct immune cell types considered consist of eosinophils (eos), basophils/mast cells (mast_baso), dendritic cells (dendr), neutrophils (neut), b-cells, t-cells, NK-cells, t-helper cells (thelp), monocyte Lipopolysaccharides (LPS) day 0 stimulation, monocyte LPS day 1 stimulation, monocyte LPS day 7 stimulation (mono_d0, mono_d1, mono_d7, respectively).
| mono_d0 | mono_d1 | mono_d7 | NK-cell | t-cell | b-cell | neut | dendr | eos | mast_baso | thelp | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| turquoise | 6.3E-03 | 3.0E-02 | |||||||||
| blue | 1.5E-02 | ||||||||||
| midnightblue | |||||||||||
| tan | 2.4E-03 | ||||||||||
| black | 9.1E-03 | ||||||||||
| cyan | |||||||||||
| salmon | 2.1E-02 | ||||||||||
| lightcyan | |||||||||||
| green | 1.7E-02 | ||||||||||
| pink | |||||||||||
| magenta | 1.8E-02 | ||||||||||
| yellow | |||||||||||
| red | 7.9E-06 | ||||||||||
| greenyellow | 1.3E-03 | ||||||||||
| grey | 1.8E-02 | 2.7E-05 |