| Literature DB >> 25032995 |
Jörg Menche, Amitabh Sharma, Michael H Cho, Ruth J Mayer, Stephen I Rennard, Bartolome Celli, Bruce E Miller, Nick Locantore, Ruth Tal-Singer, Soumitra Ghosh, Chris Larminie, Glyn Bradley, John H Riley, Alvar Agusti, Edwin K Silverman, Albert-László Barabási.
Abstract
BACKGROUND: An important step toward understanding the biological mechanisms underlying a complex disease is a refined understanding of its clinical heterogeneity. Relating clinical and molecular differences may allow us to define more specific subtypes of patients that respond differently to therapeutic interventions.Entities:
Mesh:
Year: 2014 PMID: 25032995 PMCID: PMC4101699 DOI: 10.1186/1752-0509-8-S2-S8
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Summary of the characteristics of 140 subjects with sputum gene expression data from the ECLIPSE Cohort.
| Demographics and clinical data | |
|---|---|
| Age, yrs. | 65 ± 5.5 |
| Males, % | 66 |
| Body mass index, Kg/m2 | 26.8 ± 5.2 |
| Smoking exposure, pack-yrs. | 48.3 ± 29.1 |
| Annual Exacerbation rate, year-1 | 0.98 ± 1.6 |
| FEV1, L | 1.26 ± 0.45 |
| FEV1, % revers. | 9.5 ± 10.4 |
| FEV1/FVC, % | 43.2 ± 11.5 |
| Emphysema, -950HU % | 19.2 ± 12.2 |
| Emphysema, extent code | 2.8 ± 1.8 |
| hsCRP (mg/L) | 8.24 ± 15.0 |
| IL6 (pg/mL) | 7.8 ± 36 |
| IL8 (pg/mL) | 9.3 ± 5.2 |
| CCL18 (ng/mL) | 121.7 ± 46 |
| Fibronogen (mg/dL) | 481.9 ± 107.6 |
| TNFA (ng/mL) | 103.2 ± 624 |
| SPD (ng/mL) | 120.6 ± 78 |
| Total cell count, × 106 | 7.5 ± 1.78 |
| Neutrophils, % | 64.8 ± 8.5 |
| Eosinophils, % | 3.1 ± 2.04 |
| Lymphocytes, % | 25.4 ± 7.9 |
Note that all subjects are COPD patients and former smokers. The values represent mean ± standard deviation, frequency or proportion, as appropriate.
Summary of the clinical characteristics of COPD patients identified as most relevant by clinical experts.
| Category | Continuous Variable for Quantitative Analysis | Discrete Variable | Bins | Characteristics | Differentially expressed genes at FDR < 0.05 |
|---|---|---|---|---|---|
| Chronic Bronchitis | Not applicable | Cough with Phlegm for at least 3 mos/yr for at least 2 years | low extreme (Q1 = 64) | neither chronic cough nor chronic phlegm | 0 |
| high extreme (Q4 = 46) | both chronic cough and chronic phlegm | ||||
| History of Exacerbations | Number of exacerbations per year | 2 or more per year and less than 2 per year | low extreme (Q1 = 26) | 0 - Never | 0 |
| high extreme (Q4 = 17) | 3 - Always | ||||
| Body Mass Index (Kg/m2) | BMI | BMI < 21, 21-30, > 30 | low extreme (Q1 = 18) | BMI < 21 | 0 |
| high extreme (Q4 = 35) | BMI > 30 | ||||
| Airflow Limitation severity | FEV1 (% predicted) | GOLD Stage | low extreme (Q1 = 69) | < 2-GOLD stage | 6,049 |
| high extreme (Q4 = 13) | >4 GOLD stage | ||||
| 6 Minute Walk Distance | Quantitative 6MWD | < 350 meters and > 350 meters | low extreme (Q1 = 38) | >350 meters | 0 |
| high extreme (Q4 = 101) | >350 meters | ||||
| Radiologist | Emphysema severity category: | low extreme (Q1 = 40) | 0-1.5 -No emphysema | 0 | |
| assessment | Not affected (N): 0 | Yes/No/Uncertain | high extreme (Q4 = 45) | 4-5 - severe | |
| Trivial (T): 1 | |||||
| Mild (M) 5-25%: 2 | |||||
| Moderate (O) 25-50%: 3 | |||||
| Severe (S) 50-75%: 4 | |||||
| Very Severe (V) > 75%: 5 | |||||
| Densitometric Emphysema | Emphysema at -950 HU | Emphysema >10% (Yes/No) | low extreme (Q1 = 37) | Emphysema >10% = No | 0 |
| high extreme (Q4 = 95) | Emphysema >10% = yes | ||||
| CT Airway Disease | Pi10 (Square root of wall area of 10 mm internal perimeter airways) | GOLD Stages 2-4 with Emphysema < 5% (Yes) or > 5% (No) | low extreme (Q1 = 63) | Trivial (< %5) | 0 |
| high extreme (Q4 = 33) | Severe (50-75%, very severe (>75%)) | ||||
Columns 4-6 show the results of the differential gene expression analysis comparing the subjects of the defined bins or extremes for each characteristic. Q1/Q4 refer to the number of patients in the respective group (1st and 4th "quartile"). The only single characteristic yielding significantly differentially expressed genes is the degree of airflow limitation as given by GOLD stage (GOLDCD).
Figure 1Schematic representation of the diVIsive Shuffling Approach (VIStA). A Initially the subjects are divided randomly into three groups; gene expression differences are calculated between group 1 & 2, the third group serves as a reservoir for the subsequent shuffling steps. At each shuffling step, a subject from group 1 or 2 is randomly exchanged with a subject from the reservoir. If the number of differentially expressed genes increases thereby, the swap is accepted, otherwise rejected. B 20 exemplary time series of the number of differentially expressed genes between group 1 & 2 as a function of the number of attempted shuffles. The different curves correspond to different random initial divisions. After approximately 1000 shuffles the groups converge and present a large, stationary number of differentially expressed genes. C For each of the obtained divisions (500 in total), clinical characteristics in group 1 & 2 are compared.
Figure 2Combination of clinical characteristics associated with groups from VIStA. A Number of times the characteristics were found significantly different between group 1 & 2 in a total of 500 divisions. Severity of airflow limitation (GOLDCD) is the single most important determinant of differential gene expression, being statistically significant in 95% of all VIStA outputs. B Summary of the individual and pairwise number of significant occurrences of the clinical characteristics. Node size is proportional to the number of times a measure was found significant, the width of a link indicates how often two measures appeared significant in the same VIStA division. The core group contains severity of airflow limitation (GOLDCD) and the two emphysema measures EMPHETCD and FV950. C, Number of times that pairwise combinations of clinical characteristics co-occurred in the 500 VIStA outcomes. The most significant pair (as compared to a Null model of independent occurrence) is EMPHETCD and FV950, which are both measures of emphysema. D The most frequent and significant triplet is a combination of GOLDCD and EMPHETCD and FV950, measuring disease severity. E We find significant combinations of the disease severity triplet in B with four clinical characteristics: BMI, PHLEGM, DWALK and AGE.
Summary of the clinical measures, biomarkers, and cell counts among the four groups of COPD patients identified from the results of Figure 2: each group combines GOLDCD, EMPHETCD and FV950, with either BMI (Group I), DWALK (Group II), AGE (Group III) or Phlegm (Group IV).
| Age | 65.4 | 65.4 | - | 63.9 | 65.4 | - | 58.73 | 68.7 | *** | 63 | 65.96 | - |
| Lung Function | - | - | - | - | ||||||||
| FEV1 | 1.72 | 0.89 | *** | 1.7 | 0.9 | *** | 1.72 | 0.93 | ** | 1.79 | 0.9 | *** |
| FEV1/FVC (%) | 57.88 | 32.43 | *** | 55.0 | 32.5 | *** | 56.53 | 33.93 | *** | 57.1 | 33.04 | *** |
| FEV1 reversibility (%) | 7.64 | 4.73 | *** | 11.5 | 7.2 | *** | 11.62 | 5.93 | *** | 10.4 | 5.5 | *** |
| Radiologist Emphysema | - | - | - | - | ||||||||
| Emphysema severity | 1.2 | 4.2 | *** | 1.3 | 4.2 | *** | 1.336 | 3.7 | *** | 1.275 | 4.2 | *** |
| Densitometric Emphysema | - | - | - | - | ||||||||
| Emphysema at -950 HU | 6.98 | 33.42 | *** | 6.6 | 31.7 | *** | 7.71 | 28.09 | *** | 7.06 | 33.42 | *** |
| Airflow Obstruction | - | - | - | - | ||||||||
| GOLD Stage | 2 | 3.3 | *** | 2.0 | 3.3 | *** | 2 | 3.2 | *** | 2 | 3.3 | *** |
| Body Mass Index | 30.76 | 21.21 | *** | 27.3 | 24.2 | * | 29.87 | 25.8 | - | 28.67 | 24.42 | * |
| Chronic Bronchitis (ATS_CB) | 1 = 24% | 1 = 30.4% | - | 1 = 85.7% | 1 = 62.5% | - | 1 = 6.7% | 1 = 37% | - | 1-100% | 1 = 46.2% | *** |
| 1 = no chronic phlegm | 1 = 56% | 1 = 35% | - | 1 = 62% | 1 = 41% | - | 1 = 66.6% | 1 = 33% | - | 1-100% | 1 = 0% | *** |
| 6 Minute Walk Distance | 428.32 | 330.02 | ** | 508.8 | 273.9 | *** | 438.97 | 321.83 | ** | 462.59 | 322.9 | ** |
| Exacerbations 0 = no-Exacerbations | 0 = 68% | 0 = 34.8% | - | 0 = 71.4% | 0 = 28.1% | ** | 0 = 60% | 0 = 37% | - | 0 = 70% | 0 = 38.5% | * |
| CCL6 | 7.3 | 6.33 | - | 7.0 | 6.8 | - | 6.19 | 6.73 | - | 8.64 | 6.9 | - |
| IL6 | 5.65 | 20.2 | - | 4.3 | 18.6 | - | 2.79 | 6.89 | 3.72 | 18.53 | - | |
| IL8 | 8.88 | 10.77 | - | 8.3 | 9.4 | - | 7.5 | 10.28 | * | 9.8 | 10.65 | - |
| TNFa | 26.99 | 160.32 | - | 31.7 | 162.9 | - | 2.35 | 60.44 | - | 24.14 | 142.3 | - |
| CCL18 | 130.3 | 117.59 | - | 126.9 | 124.4 | - | 115.8 | 117.94 | - | 134 | 126.19 | - |
| CRPHS | 10.4 | 9.6 | - | 9.9 | 9.5 | - | 5.7 | 9.72 | - | 510 | 8.5 | - |
| FIBRINOG | 494.9 | 499.1 | - | 481.0 | 506.2 | - | 456.8 | 498.58 | - | 510.8 | 489.84 | - |
| SPD | 129.64 | 110.94 | - | 124.9 | 119.1 | - | 79.73 | 116.3 | * | 138.76 | 109.7 | - |
| mMRC | 3 | 2.09 | ** | 1.0 | 2.4 | *** | 1.21 | 2.04 | * | 1.1 | 2.04 | * |
| SGRQ | 43.29 | 55.55 | ** | 35.9 | 56.0 | *** | 41.8 | 52.87 | * | 36 | 57.73 | *** |
| FFMI | 19.53 | 16.13 | *** | 18.5 | 17.1 | * | 18.48 | 17.8 | - | 18.83 | 17.15 | * |
| % Fat (Tissue) | 34.92 | 29.08 | ** | 31.0 | 31.7 | - | 35.96 | 31.94 | - | 32.69 | 31.25 | - |
| - | - | - | - | |||||||||
| Neutrophils, % Neut_Blq | 61.38 | 64.87 | - | 60.7 | 67.0 | ** | 61.34 | 66.69 | * | 62.08 | 65.55 | - |
| Eosinophils, % Eos blq | 3 | 3.1 | - | 3.5 | 3.1 | - | 2.48 | 2.92 | - | 3.26 | 3.3 | - |
| Lymphocytes, % lymhblq | 28.63 | 24.84 | - | 28.6 | 23.2 | * | 29.19 | 23.55 | * | 27.77 | 23.688 | - |
* = p-value < 0.05; ** = p-value < 0.01; *** = p-value < 0.0001; - = not significant
Figure 3Four subtypes and differentially expressed genes. A The combinations of phenotypic measures that define the subtypes predicted by the VIStA method: all four subtypes share a common core of high values of GOLDCD, FV950 and EMPHETCD, reflecting disease severity. Each of the individual subtypes I-IV presents one additional clinical characteristic: BMI (subtype I), DWALK (II), AGE (III) or PHLEGM (IV). B Venn diagram showing the number of differentially expressed genes unique to each subtype, as well as common to all four subtypes. The common genes show a large overlap with the genes differentially expressed between subjects with GOLDCD 2 and subjects with GOLDCD 3&4, indicating that these genes reflect mostly disease severity.
The 10 most strongly enriched pathways in the set of genes common among all four groups described in table 3, as well as in the individual gene sets of each group.
| Top ten pathways among Common Genes | |||
|---|---|---|---|
| REACTOME_GENE_EXPRESSION | 235 | 425 | |
| REACTO ME_DIABETES_PATHWAYS | 1.91E-33 | 214 | 383 |
| REACTOME_METABOLISM_OF_PROTEINS | 9.48E-28 | 134 | 215 |
| REACTOME_CELL_CYCLE_MITOTIC | 7.34E-25 | 167 | 306 |
| REACTOME_GLUCOSE_REGULATION_OF_ | 1.24E-23 | 104 | 161 |
| KEGG_HUNTINGTONS_DISEASE | 3.16E-23 | 114 | 185 |
| REACTOME_INTEGRATION_OF_ENERGY_METABOLISM | 1.09E-21 | 130 | 229 |
| REACTOME_ELECTRON_TRANSPORT_CHAIN | 1.11E-21 | 60 | 75 |
| REACTOME_RNA_POLYMERASE_I_III_AND_MITOCHONDRIAL_TRANSCR.PT.ON | 2.72E-21 | 82 | 120 |
| REACTOME_INFLUENZA_LIFE_CYCLE | 1.11E-20 | 89 | 137 |
| REACTOME_INORGANIC_CATION_ANION_SLC_TRANSPORTERS | 0.00133586 | 7 | 94 |
| KEGG_SMALL_CELL_LUNG_CANCER | 0.00359651 | 6 | 84 |
| KEGG_FC_GAMMA_R_MEDIATED_PHAGOCYTOSIS | 0.00723812 | 6 | 97 |
| KEGG_MATURITY_ONSET_DIABETES_OF_THE_YOUNG | 0.00921957 | 3 | 25 |
| REACTOME_AM.NO_ACID_AND_OLIGOPEPTIDE_SLC_TRANSPORTERS | 0.00984371 | 4 | 48 |
| REACTOME_SLC_MEDIATED_TRANSMEMBRANE_TRANSPORT | 0.01009928 | 8 | 169 |
| KEGG_B_CELL_RECEPTOR_SIGNALING_PATHWAY | 0.01020969 | 5 | 75 |
| KEGG_GLYCOSPHINGOLIPID_BIOSYNTHESIS_LACTO_AND_NEOLACTO_SERIES | 0.0102887 | 3 | 26 |
| REACTOME_NUCLEAR_RECEPTOR_TRANSCRIPTION_PATHWAY | 0.01133769 | 4 | 50 |
| REACTOME_CDC6_ASSOCIATION_WITH_THE_ORC:ORIGIN_COMPLEX | 0.01516009 | 2 | 11 |
| REACTOME_REG ULATION_OF_GENE_EXPRESSIO N_IN_B ETA_CELLS | 0.00552 | 5 | 101 |
| REACTOME_HDL_MEDIATED_LIPID_TRANSPORT | 0.00637 | 2 | 11 |
| REACTOME_GTP_HYDROLYSIS_AND_JOINING_OF_THE_60S_RIBOSOMAL_SUBUNIT | 0.00675 | 5 | 106 |
| REACTOME_FACILITATIVE_NA_INDEPENDENT_GLUCOSE_TRANSPORTERS | 0.00759 | 2 | 12 |
| REACTOME_REGULATION_OF_BETA_CELLDEVELOPMENT | 0.00911 | 5 | 114 |
| REACTOME_TRANSLATION | 0.01121 | 5 | 120 |
| REACTOME_TRANSMEMBRANE_TRANSPORT_OF_SMALL_MOLECULES | 0.01148 | 7 | 218 |
| REACTOME_IRS_RELATED_EVENTS | 0.01182 | 4 | 79 |
| REACTOME_INFLUENZA_LIFE_CYCLE | 0.01889 | 5 | 137 |
| REACTOME_DEADENYLATION_OF_MRNA | 0.02469 | 2 | 22 |
| REACTOME_DOWN_STREAM_SIGNAL_TRANSDUCTION | 0.00302075 | 5 | 35 |
| REACTOME_GAB1_SIGNALOSOME | 0.00324484 | 3 | 11 |
| REACTOME_SIGNALING_IN_IMMUNE_SYSTEM | 0.00470548 | 20 | 366 |
| REACTOME_MITOTIC_PROMETAPHASE | 0.00489389 | 8 | 92 |
| REACTOME_INNATE_IMMUNITY_SIGNALING | 0.00584887 | 10 | 136 |
| REACTOME_SIGNALLING_TO_RAS | 0.0060289 | 4 | 26 |
| REACTOME_FORMATION_OF_PLATELET_ | 0.00753191 | 12 | 186 |
| REACTOME_GRB2_SOS_PROVIDES_LINKAGE_TO_MAPK_SIGNALING_FOR_INTERGRINS | 0.00821656 | 3 | 15 |
| REACTOME_MYOGENESSIS | 0.00895252 | 4 | 29 |
| REACTOME_HEMOSTASIS | 0.01301061 | 15 | 274 |
| REACTOME_PEPTIDE_LIGAND_BINDING_RECEPTORS | 0.00059 | 12 | 173 |
| REACTOME_GAP_JUNCTION_ASSEMBLY | 0.00076 | 4 | 19 |
| KEGG_CHEMOKINE_SIGNALING_PATHWAY | 0.00133 | 12 | 190 |
| REACTOME_GAP_JUNCTION_TRAFICKING | 0.00340 | 4 | 28 |
| REACTOME_CHEMOKINE_RECEPTORS_BIND_CHEMOKINES | 0.00787 | 5 | 55 |
| REACTOME_ACTIVATION_OF_ATR_IN_RESPONSE_TO_REPLICATION_STRESS | 0.00936 | 4 | 37 |
| REACTOME_SIGNALING_IN_IMMUNE_SYSTEM | 0.00943 | 16 | 366 |
| KEGG_T_CELL_RECEPTOR_SIGNALING_PATHWAY | 0.01126 | 7 | 108 |
| REACTOME_CELL_CYCLE_CHECKPOINTS | 0.01237 | 7 | 110 |
| KEGG_NATURAL_KILLER_CELL_MEDIATED_CYTOTOXICITY | 0.01263 | 8 | 137 |
Top ten upregulated and downregulated unique genes and their fold-change (FC) in each group (In group II, only five unique genes are downregulated).
| Group 1 | Group II | Group III | Group IV | ||||
|---|---|---|---|---|---|---|---|
| LOC100127940 | 2.8 | RP-3377H14.5 | 2.4 | DDX3Y | 4.6 | IL1F9 | 2.5 |
| PDCD6 | 2.4 | ZFYVE16 | 2.2 | EIF1AY | 3.2 | IL23A | 2.5 |
| AHRR | 2.4 | TGFBR1 | 2.2 | HELB | 3 | TUB | 2.4 |
| CD1B | 2.4 | MARCH6 | 2.2 | LOC100130224 | 2.9 | GJB2 | 2.3 |
| KIT | 2.4 | CAPZA1 | 2.2 | UTY | 2.9 | CD22 | 2.3 |
| CADM1 | 2.3 | KIAA0319 | 2.2 | ADORA3 | 2.9 | FAF1 | 2.3 |
| MMP7 | 2.3 | DHX36 | 2.2 | ARNT2 | 2.9 | MB0AT7 | 2.3 |
| C20orf197 | 2.3 | DLGAP4 | 2.1 | CXCL14 | 2.6 | SULT2A1 | 2.3 |
| RNF144A | 2.2 | RIF1 | 2.1 | TMEM61 | 2.6 | TMEM88 | 2.3 |
| MYO1B | 2.2 | NT5C2 | 2.1 | PPARGC1B | 2.6 | CHST7 | 2.3 |
| SGK493 | -2.0 | TIFAB | -2.0 | C1orf201 | -2.5 | VASH1 | -2.3 |
| ALS2CR4 | -2.1 | CCDC42 | -2.2 | ST3GAL3 | -2.5 | LINC00607 | -2.3 |
| ENPP5 | -2.2 | HBE1 | -2.2 | APOOL | -2.6 | KLHDC7B | -2.3 |
| FLJ14082 | -2.2 | NAPSB | -2.2 | IL28RA | -2.6 | DHODH | -2.3 |
| LOC1441204 | -2.2 | C4orf7 | -3.8 | ZNF624 | -2.6 | CDDC113 | -2.3 |
| L0C100134569 | -2.2 | SMAD5 | -2.6 | IGF2BP3 | -2.3 | ||
| FAM101A | -2.7 | NRP1 | -2.6 | C3orf27 | -2.3 | ||
| LOC92270 | -2.8 | LOC654342 | -2.6 | ZNF618 | -2.3 | ||
| HPR | -2.9 | TSIX | -3.3 | AKR1C4 | -2.4 | ||
| HP | -2.9 | XIST | -4.1 | LOC401321 | -2.4 | ||