| Literature DB >> 29058630 |
Yunguan Wang1, Jaswanth Yella1, Jing Chen1, Francis X McCormack2, Satish K Madala3,4, Anil G Jegga5,6,7.
Abstract
BACKGROUND: Idiopathic Pulmonary Fibrosis (IPF) is a fatal fibrotic lung disease occurring predominantly in middle-aged and older adults. The traditional diagnostic classification of IPF is based on clinical, radiological, and histopathological features. However, the considerable heterogeneity in IPF presentation suggests that differences in gene expression profiles can help to characterize and distinguish disease severity.Entities:
Keywords: Gene expression analysis; Gene signature; IPF subtyping; Idiopathic pulmonary fibrosis; Ipf
Mesh:
Substances:
Year: 2017 PMID: 29058630 PMCID: PMC5649521 DOI: 10.1186/s12890-017-0472-9
Source DB: PubMed Journal: BMC Pulm Med ISSN: 1471-2466 Impact factor: 3.317
Patient demographics and clinical characteristics of the LGRC IPF cohort
| Disease Group | UIP/IPF | Control |
|
|---|---|---|---|
| Number | 131 | 12 | |
| Age—mean (SD) | 62.6 (12.2) | 64.1 (8.2) | 0.5631 a |
| %predicted FEV1 (SD) | 71.37 (19.00) | 94.33 (9.86) | 6.3E-5 a |
| %predicted FVC (SD) | 64.78 (17.41) | 91.75 (7.44) | 4.3E-7 a |
| %predicted DLCO (SD) | 49.33 (18.14) | 97.00 (21.30) | 1.1E-11 a |
| Gender—% male | 67.2 | 25 | 0.0375 b |
IPF idiopathic pulmonary fibrosis; LGRC Lung Genome Research Consortium; UIP usual interstitial pneumonia; FVC forced vital capacity; FEV1 forced expiratory volume in 1 s; DCLO diffusing capacity of the lung for carbon monoxide; SD standard deviation
aBy two-tailed Student’s t-Test
bBy χ2 test
Fig. 1UIP/IPF patient subgroups stratified by disease severity by FEV1, FVC and DLCO have different gene expression profiles. Patient subgroups were identified using hierarchical clustering with Euclidean distance metric and Ward’s linkage (panels a, b, and c). Average DLCO (a), FEV1 (b), and FVC (c) in six UIP/IPF patient subgroups. Panel d shows heat map representation of 2968 DEG (rows) in 143 control and UIP/IPF patients (columns). Genes and patients were ordered using hierarchical clustering. Color bar represents patents subgroups (1st row in heat map), DLCO (2nd row), FEV1 (3rd row), FVC (4th row) and smoking status (5th row). Data are expressed as the mean ± SD. Differential expression analysis was performed using Limma. Comparison of lung function measures was carried out using two-tailed Student’s T-test
Fig. 2Comparison of DEGs of each patient cluster revealed genes commonly dysregulated in IPF and genes associated with severe lung function decline. DEG were divided into six groups based on the number of patient clusters where a gene was differentially expressed. Panel a shows schematic representation of 2968 DEG in six IPF patient clusters. DEGs along with their group designation are shown in the same order along the outer rim of each circular plot. The center of each circular represents patient cluster with the color intensity representing average % predicted DLCO in that cluster. Each colored edge (red: up-regulated; green: down-regulated) from a patient cluster to a gene in the rim indicates differential expression of that gene in the connected patient cluster. Panel b is a heat map representation of the 2968 DEG. Up- or down- regulated genes in each group that are involved or implicated in IPF were highlighted
Fig. 3Enriched biological processes in each gene category revealed commonly and high-severity-associated biological pathways perturbed in IPF. Selected enrichment terms derived from gene lists in DEG groups were shown. Connection from a gene (rectangular node) to a biological process (purple oval node) indicates involvement of that gene in the connected process. Differential expression status of a gene in each patient subgroup was shown as a mini heat map (orange: up-regulation; turquoise: down-regulation; gray: not differentially expressed, patient subgroup order: C1, C2, C3, C4, C5 and C6). Network was made in Cytoscape 3.5, and layout was performed using AllegroLayout v2 Professional with manual curation
Fig. 4The core IPF gene set robustly differentiated IPF patients from normal controls in three independent validation cohorts. Logistic regression models were trained on the core IPF gene sets using the training cohort with 2-fold cross-validation, and tested with each validation cohort. The decision threshold was set to provide at least 90% sensitivity for IPF discovery. ROC curves were shown in the left column, and classification scatter plots of IPF and control samples were shown in the right column
Top 10% prioritized genes in the core and advanced IPF gene sets
Genes in red font color represent genes that have been reported to be related to IPF (PubMed-based literature mining). Genes in core IPF and advanced IPF (patient clusterC6) gene sets were prioritized using ToppGene application of the ToppGene Suite and their absolute fold change in IPF compared to controls