| Literature DB >> 32900903 |
Debajyoti Ghosh1, Lili Ding2, Jonathan A Bernstein1, Tesfaye B Mersha3.
Abstract
An integrative analysis focused on multi-tissue transcriptomics has not been done for asthma. Tissue-specific DEGs remain undetected in many multi-tissue analyses, which influences identification of disease-relevant pathways and potential drug candidates. Transcriptome data from 609 cases and 196 controls, generated using airway epithelium, bronchial, nasal, airway macrophages, distal lung fibroblasts, proximal lung fibroblasts, CD4+ lymphocytes, CD8+ lymphocytes from whole blood and induced sputum samples, were retrieved from Gene Expression Omnibus (GEO). Differentially regulated asthma-relevant genes identified from each sample type were used to identify (a) tissue-specific and tissue-shared asthma pathways, (b) their connection to GWAS-identified disease genes to identify candidate tissue for functional studies, (c) to select surrogate sample for invasive tissues, and finally (d) to identify potential drug candidates via connectivity map analysis. We found that inter-tissue similarity in gene expression was more pronounced at pathway/functional level than at gene level with highest similarity between bronchial epithelial cells and lung fibroblasts, and lowest between airway epithelium and whole blood samples. Although public-domain gene expression data are limited by inadequately annotated per-sample demographic and clinical information which limited the analysis, our tissue-resolved analysis clearly demonstrated relative importance of unique and shared asthma pathways, At the pathway level, IL-1b signaling and ERK signaling were significant in many tissue types, while Insulin-like growth factor and TGF-beta signaling were relevant in only airway epithelial tissue. IL-12 (in macrophages) and Immunoglobulin signaling (in lymphocytes) and chemokines (in nasal epithelium) were the highest expressed pathways. Overall, the IL-1 signaling genes (inflammatory) were relevant in the airway compartment, while pro-Th2 genes including IL-13 and STAT6 were more relevant in fibroblasts, lymphocytes, macrophages and bronchial biopsies. These genes were also associated with asthma in the GWAS catalog. Support Vector Machine showed that DEGs based on macrophages and epithelial cells have the highest and lowest discriminatory accuracy, respectively. Drug (entinostat, BMS-345541) and genetic perturbagens (KLF6, BCL10, INFB1 and BAMBI) negatively connected to disease at multi-tissue level could potentially repurposed for treating asthma. Collectively, our study indicates that the DEGs, perturbagens and disease are connected differentially depending on tissue/cell types. While most of the existing literature describes asthma transcriptome data from individual sample types, the present work demonstrates the utility of multi-tissue transcriptome data. Future studies should focus on collecting transcriptomic data from multiple tissues, age and race groups, genetic background, disease subtypes and on the availability of better-annotated data in the public domain.Entities:
Keywords: Asthma transcriptome; Connectivity Map; GWAS Catalog; ilincs; machine learning; pathways/networks; tissue-specific analysis
Mesh:
Substances:
Year: 2020 PMID: 32900903 PMCID: PMC7642926 DOI: 10.1534/g3.120.401718
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Asthma transcriptome data sources. Case and control gene expression datasets obtained from nasal epithelial cells, airway epithelial cells, bronchial biopsies, peripheral blood, CD4+ and CD8+ lymphocytes, airway macrophages, proximal and distal fibroblasts and induced sputum were used for the current study.
Summary of transcriptomics datasets used for the present study. NCBI Gene Expression Omnibus (GEO) accession number, tissue/cell sample type, platform, sample size (excluding smokers and subjects using inhaled corticosteroids) have been shown for each study
| Sample type | GEO ID | Tissues/cell types | Platform | Case | Control | |
|---|---|---|---|---|---|---|
| Airway Epithelium | GSE4302 | Airway epithelial cells | Affy HG-U133 | 10 | 10 | |
| Airway Epithelium | GSE18965 | Airway epithelial cells | Affy HG-U13A | 9 | 7 | |
| Bronchial Biopsy | GSE15823 | Endobronchial biopsies | HG_U95 | 4 | 4 | |
| Bronchial Biopsy | GSE41649 | Endobronchial biopsies | HG-U133 | 4 | 4 | |
| Macrophages | GSE22528 | Alveolar macrophages | Affy HG-U133 | 4 | 4 | |
| Macrophages | GSE2125 | Alveolar macrophages | Affy HG-U133 | 15 | 15 | |
| Fibroblasts | GSE27335 | Proximal airway fibroblasts | Agilent | 8 | 4 | |
| Fibroblasts | GSE27335 | Distal lung fibroblasts | Agilent | 8 | 4 | |
| Lymphocytes | GSE31773 | CD4+ T-cells | Affy HG-U133 | 12 | 8 | |
| Lymphocytes | GSE31773 | CD8+ T-cells | Affy HG-U133 | 12 | 8 | |
| Nasal Epithelial | GSE44037 | Nasal epithelial cells | Affy HG-U133 | 6 | 6 | |
| Whole blood | GSE69683 | Peripheral Blood | Affy HG U133 | 411 | 84 | |
| Induced sputum | GSE76262 | Induced sputum (whole) | Affy HG U133 | 118 | 21 |
Figure 2Major steps to identify and analyze tissue specific asthma gene expression data. Individual datasets were obtained from GEO. Differentially expressed asthma-relevant genes identified from each tissue types were used to find tissue/ cell-specific genes and networks, and discriminatory gene-sets to classify samples into normal vs. asthma classes and to predict asthma disease state. In addition, differentially expressed genes identified from each sample/ tissue type were linked with GWAS catalog data and Connectivity Map resources to identify novel drug candidates.
Figure 3Gene-level overlap: Venn diagram showing overlap of differentially regulated genes identified from tissue samples (A; airway epithelial, bronchial, nasal and whole blood samples) and isolated cellular samples (B; lymphocytes, fibroblasts and macrophages). There is considerable overlap at the gene-level between airway epithelial and bronchial biopsy samples. However, each sample type shows a unique asthma-relevant gene expression pattern; DEG sharing is not observed between all samples. Top significant tissue/ cell-specific DEGs (red up-regulated, blue down-regulated) have been shown for each sample type.
Figure 4Pathway-level overlap in tissue (A) and isolated cellular level (B) samples: Argenase pathway, Th1/Th2 signaling, VEGF signaling and Inflammation and Aryl hydrocarbon pathways were most predominant on blood, biopsy, airway epithelial and nasal samples respectively. EIF2 signaling was relevant for macrophages and proximal fibroblasts, while lymphocyte extravasation (adhesion and diapedesis), IL6 pathway and IL-8 signaling pathways were very relevant for CD4+, distal lung fibroblasts and CD8+ cells respectively. In addition, Glucocorticoid Receptor Signaling, Clathrin-mediated Endocytosis Signaling were top significantly enriched pathways in all tissue types. IL-1beta and ERK signaling pathways were common across a wide range of tissue types. Chemokine signaling (in nasal epithelium) were the most significant pathways.
Figure 5Tissue-based expression of asthma-relevant genes identified by (A) GWAS retrieved from GWAS-catalog (European Bioinformatics Institute) and by (B) literature mining identified by using Literature Lab (Accumenta, USA). The results showed that asthma-relevant genes can be clustered by their expression in target cells/ tissues. Macrophage samples are closely associated with induced sputum samples, whereas CD4+ and CD8+ lymphocyte samples cluster with peripheral blood samples in both (A) and (B) samples. Bronchial biopsy samples cluster together. The tissue-based clustering of asthma-relevant genes has been very clearly demonstrated in case of genes identified by literature-mining (B). A gene-level circular heatmap demonstrating tissue-wise expression of GWAS-identified genes have been shown in panel C.
SVM-generated classifiers (Asthma Gene Expression signatures) for sample types (blood, Macrophage, induced sputum) that are widely used to study the pathology of asthma. Percent predictive accuracy for each classifier has also been indicated
| Tissue Type | Asthma Gene Signature (for >80% predictive accuracy) | % correct prediction |
|---|---|---|
| MYD88, MYL12A, BOLA2, HBXIP, ACSL1, VPS24, GPM6B, F5, PDE7A, AB11FIP3, ZNF785, TCF4, RAB11A, GPR109B, GNAQ, DIS3L2,, SLC7A5P1/P2, TSR2, AMICA1, CR1, QRICH1, SERBP1, ZNF37A, DERL2, ZBTB20, CDC42SE1, NSMAF, KIAA0562, NT5C2, DAPP1, ATP6V0E1, LOC203274, TMEM59, NAPEPLD, ANXA3, WIPI2, ATAD5, LARS, PTBP1, SPG7, DNAJC16, IL6ST, CLN8, SAT1, EVI2B, MRPS5, AQP9, RPP14, ZNF747, GAPT | 83 | |
| KBTBD2, UPF3A, MLLT10, PSPC1, ST3GAL1, ARFRP1, TRIM14, EIF4G1, BM25, CDK11A/B, | 83 | |
| IL18R1, TLR7, GPR85, LIMA1, SPINT1, EIF1AY, PDCD6, MGAT4A, CSTA, LOC100129845 | 96 |
Perturbagens associated with at least three asthma-relevant asthma sample types. occurrence (connection to number of asthma tissues), associated asthma tissue/ sample type and their known association with allergy/ asthma/ lung diseases (known/ unknown) have been Mentioned
| Perturbagen | Occurrences | Associated samples | Indicated in allergy/ asthma/ lung disease |
|---|---|---|---|
| KLF6 (Kruppel like factor 6) | 5 | AirEpi, Macrophages, FibroDistal, CD8+, Nasal | Yes |
| BCL10 (B-cell lymphoma/ leukemia 10) | 4 | Macrophages, FibroDistal, CD4+, Blood | Yes |
| HOXB13 Homeobox protein Hox-B13 | 4 | Macrophages, Sputum, FibroDistal, Nasal | No |
| IFNB1(Interferon beta 1) | 4 | Sputum, FibroDistal, CD8+, Blood | Yes |
| Entinostat | 4 | Macrophages, FibroDistal, CD8, Nasal | No |
| ATOX1(Antioxidant 1 Copper Chaperone) | 3 | FibroDistal, CD4, FibroProximal | No |
| BAMBI(BMP and activin membrane bound inhibitor) | 3 | Macrophages, Sputum, Blood | Yes |
| BMS-345541 | 3 | FibroDistal, CD8+, Nasal | Yes |
| CCNL1 (cyclin L1) | 3 | Macrophages, FibroDistal, CD8+ | No |
| CDCA8 (cell division cycle associated 8) | 3 | AirEpi, CD8+, Blood | No |
| DHX8 (DEAH-box helicase 8) | 3 | Sputum, Bronchial, Blood | Yes |
| DTX2 (deltex E3 ubiquitin ligase) | 3 | AirEpi, Sputum, Nasal | No |
| KLF3 (Kruppel like factor 3) | 3 | Bronchial, FibroDistal, CD8+ | Yes |
| LASP1 (LIM and SH3 protein 1) | 3 | AirEpi, Macrophages, CD8+ | Yes |
| LOXL1 (lysyl oxidase like 1) | 3 | AirEpi, Sputum, Blood | Yes |
| PPP2R3C (protein phosphatase 2 regulatory subunit B’’gamma) | 3 | AirEpi, Macrophages, Blood | Yes |
| PREB(prolactin regulatory element binding) | 3 | Macrophages, FibroDistal, FibroProximal | No |
| PUF60 (poly-U binding splicing factor 60) | 3 | Macrophages, CD8+, Nasal | No |
| SORBS3 (sorbin and SH3 domain containing 3) | 3 | Macrophages, FibroProximal, Nasal | No |
| TRIP10 (Thyroid Hormone Receptor Interactor 10) | 3 | Macrophages, Bronchial, FibroDistal | No |
| XPO7 (Exportin 7) | 3 | AirEpi, Sputum, Bronchial | Yes |
| YWHAZ | 3 | AirEpi, Bronchial, Nasal | No |
| Calyculin | 3 | Macrophages, FibroDistal, Blood | No |
Figure 6Asthma tissue transcriptome linked to drug repurposing. The blue connected dots (in A) indicate overlap between sample types (gray tracks) while the asterisk showing perturbagen connecting maximum tissue types. The bar chart (B) indicates the number of unique perturbagens for each sample type with overlapped portion in orange. Clustered heatmap (C) connecting asthma-relevant tissue/ samples to perturbagens has been generated using respective connectivity S1 Table scores) in each tissue. Asthma associated up and down-regulated genes identified form different tissue/ sample types were used to identify perturbagens that can potentially reverse asthma signature in respective sample type.
Figure 7Diagramatic summary of the current study starting from the discovery of differentially regulated genes to drug repositioning in asthma using multi-tissue transcriptomic analysis. Using asthma transcriptome data identified from multiple sample types such as blood, lung, isolated fibroblasts, nasal epithelium, lymphocytes, and airway macrophages we identified molecular signatures of asthma. The signatures were further connected with GWAS-identified asthma genes. The tissue-resolved molecular signatures were further evaluated for their utility in drug repurposing (i.e. identifying perturbagens via connectivity map analysis to reverse asthma signature).