| Literature DB >> 27097888 |
Michela Riba1, Jose Manuel Garcia Manteiga1, Berislav Bošnjak2, Davide Cittaro1, Pavol Mikolka2, Connie Le2, Michelle M Epstein2, Elia Stupka1.
Abstract
Systems biology provides opportunities to fully understand the genes and pathways in disease pathogenesis. We used literature knowledge and unbiased multiple data meta-analysis paradigms to analyze microarray datasets across different mouse strains and acute allergic asthma models. Our combined gene-driven and pathway-driven strategies generated a stringent signature list totaling 933 genes with 41% (440) asthma-annotated genes and 59% (493) ignorome genes, not previously associated with asthma. Within the list, we identified inflammation, circadian rhythm, lung-specific insult response, stem cell proliferation domains, hubs, peripheral genes, and super-connectors that link the biological domains (Il6, Il1ß, Cd4, Cd44, Stat1, Traf6, Rela, Cadm1, Nr3c1, Prkcd, Vwf, Erbb2). In conclusion, this novel bioinformatics approach will be a powerful strategy for clinical and across species data analysis that allows for the validation of experimental models and might lead to the discovery of novel mechanistic insights in asthma.Entities:
Mesh:
Year: 2016 PMID: 27097888 PMCID: PMC4838989 DOI: 10.1038/srep24647
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Gene expression studies of asthma used for the bioinformatics analysis.
| No. | Study | Affimerix Mouse Array | Mice | Antigen (Mice number) | No. of challenges | Sampling time (hr after last challenge) | |||
|---|---|---|---|---|---|---|---|---|---|
| GSE | GPL | PMID | Comparison | ||||||
| 1 | 13032 | 1261 | 19491150 | A | Genome 430 2.0 | A/J | PBS (3) | 3 | 3 |
| OVA (3) | |||||||||
| B | B6 | PBS (3) | 3 | 3 | |||||
| OVA (3) | |||||||||
| 2 | 6858 | 1261 | 17437023 | C | Genome 430 2.0 | BALB/c | PBS (4) | 7 | 24 |
| OVA (4) | |||||||||
| 3 | 9465 | 1261 | 19057703 | D | Genome 430 2.0 | A/J | PBS (3) | 1 | 96 |
| OVA (3) | |||||||||
| 4 | 1301 | 339–340 | – | E | U74 Version 2 | BALB/c | PBS (3) | 2 | 72 |
| HDM (3) | |||||||||
| 5 | 3184 | 339 | – | F | U74 Version 2 | C3H | PBS (5) | 1 | 6 |
| OVA (5) | |||||||||
| G | C3H | PBS (5) | 1 | 24 | |||||
| OVA (5) | |||||||||
| H | A/J | PBS (5) | 1 | 6 | |||||
| OVA (5) | |||||||||
| I | A/J | PBS (5) | 1 | 24 | |||||
| OVA (5) | |||||||||
| 6 | 18010 | 1261 | 19770271 | J | Genome 430 2.0 | B6.Il4raQ576/Q576 | -(5) | 7 | 24 |
| B6.Il4raQ576/Q576/IL-13tg | IL-13 (8) | ||||||||
Six GEO Datasets (http://www.ncbi.nlm.nih.gov/gds/) studies were selected for meta-analysis on the basis of the microarray platform and in vivo experimental protocol. From those 6 studies, 10 asthmatic vs. healthy control comparisons (A to J) have been extracted and further analyzed.
Figure 1Schematic overview of analysis pipeline.
Using experiment selection criteria, we selected six studies that were further subdivided into 10 comparisons of control and asthmatic mice according to mouse strain and time of analysis. After initial analysis with liner models to obtain differential gene expression, data from each comparison were re-analyzed using pathway- and gene-driven approaches (for details please refer to supplementary methods). Lists of differentially regulated genes generated in the 2 approaches were merged into a final asthma signature list of 933 genes, which was used for literature-coverage searches in PubMed.
Figure 2Microarray data for 59 randomly selected genes from 10 comparisons of control and asthmatic mice correlate to quantitative PCR data from our independent mouse asthma model.
For qPCR, OVA-sensitized BALB/c mice received PBS (controls) or OVA challenge 24 h or 72 h before extraction of total lung RNA. Data are shown as mean ± SEM. Pearson r coefficients and p values for each correlation are indicated.
Figure 3An acute asthma ignorome.
(A) PubMed literature coverage of 933 genes from acute asthma signature list related to “asthma”. The x-axis represents the number of asthma-annotated literature for each gene in PubMed in November 2014. The left y-axis shows the gene number (shown as bars), while the right y-axis indicates the relative annotation to gene-disease association databases (Comparative Toxicogenomics Database (CTD) or Malacards; shown as lines). (B) List of MGI gene symbols for 933 asthma signature genes according to their number of asthma-annotated literature in PubMed according to the method described in supplementary methods and exact number of publications is listed in Supplementary Table 2.
Figure 4Network representation of 759 human ortholog genes in 7 clusters from our asthma-signature list.
(A) Within the network, we have detected 7 main clusters using STRINGdb library in Bioconductor that were further functionally annotated to reveal biological functions with online tool EnrichR (for details please refer to supplementary material). Genes are coloured according to literature number associating each gene to (B) “asthma” or (C) “inflammation OR immunity”. For each gene, we have retrieved number of publications in PubMed in November 2014. Organic layout algorithm was used to produce this figure.
Biological processes, pathways and tissue expression enrichment analysis of main clusters in asthma-signature genes list, examples of hubs, peripheral genes, super-connectors and genes present in Malacards, a gene-disease association database.
| Cluster | 1A | 1B | 2 | 3A | 3B | 3C | 4 | |
|---|---|---|---|---|---|---|---|---|
| Description | Cytokine-cytokine receptor siganling | Leukocyte transendothelial migration | Circadian rhythm | Extracellular matrix remodeling | Adherens and tight junctions | Mucus secretion | Stem cell proliferation | |
| Gene number | 239 | 114 | 15 | 282 | 19 | 8 | 82 | |
| Gene Ontology Biological Processes | Immune system process (GO:0002376); Defense response (GO:0006952); Immune response (GO:0006955); Chemotaxis (GO:0006935); Response to external stimulus (GO:0009605) | Immune system process (GO:0002376); Defense response (GO:0006952); Immune response (GO:0006955); Innate immune response (GO:0045087); Respiratory burst (GO:0045730) | Regulation of RNA metabolic process (GO:0051252); Regulation of transcription, DNA-dependent (GO:0006355), Heme biosynthetic process (GO:0006783), Regulation of transcription from RNA polymerase II promoter (GO:0006357), Regulation of transcription (GO:0045449) | Regulation of cell proliferation (GO:0042127); Negative regulation of apoptosis (GO:0043066); Negative regulation of programmed cell death (GO:0043069); Regulation of cell migration (GO:0030334); Regulation of cell motion (GO:0051270) | Establishment or maintenance of cell polarity (GO:0007163) | Monosaccharide metabolic process (GO:0005996); Cellular alcohol metabolic process (GO:0006066); Ventricular cardiac muscle morphogenesis (GO:0055010); Cardiac muscle contraction (GO:0060048); Carbohydrate metabolic process (GO:0005975) | Organelle organization (GO:0006996); Chromosome organization (GO:0051276); Regulation of cell cycle (GO:0051726); DNA replication (GO:0006260); DNA metabolic process (GO:0006259) | |
| KEGG Pathways | Cytokine cytokine receptor interaction (HSA04060); Hematopoietic cell lineage (HSA04640); Toll like receptor signaling pathway (HSA04620); Cell adhesion molecules (HSA04514); JAK STAT signaling pathway (HSA04630) | Leukocyte transendothelial migration (HSA04670); Fc epsilon ri signaling pathway (HSA04664); Phosphatidylinositol signaling system (HSA04070); Natural killer cell mediated cytotoxicity (HSA04650); Inositol phosphate metabolism (HSA00562) | Circadian rhythm (HSA04710); Glycine serine and threonine metabolism (HSA00260); Arginine and proline metabolism (HSA00330); Porphyrin and chlorophyll metabolism (HSA00860) | ECM receptor interaction (HSA04512); Focal adhesion (HSA04510); Complement and coagulation cascades (HSA04610); p53 signaling pathway (HSA04115); Arachidonic acid metabolism (HSA00590) | Tight junction (HSA04530) | Keratan sulfate biosynthesis (HSA00533); Glycan structures biosynthesis 1 (HSA01030); Olfactory transduction (HSA04740); Galactose metabolism (HSA00052) | Cell cycle (HSA04110); Purine metabolism (HSA00230); Pyrimidine metabolism (HSA00240) | |
| Human Atlas Enriched Tissues | CD33+ Myeloid cells CD19+ B cells CD14+ Monocytes Whole blood Smooth muscle | CD14+ Monocytes CD33+ Myeloid cells Whole blood Lung | CD71+ early erythoid cells Lung CD56+ NK cells CD8+ T cells CD4+ T cells | Smooth muscle Lung | Bronchial epithelial cells CD105+ Endothelial cells CD34+ cells Whole blood CD71+ early erythoid cells | Trachea CD71+ early erythoid cells CD56+ NK cells CD14+ Monocytes Lung | B lymphocytes CD105+ Endothelial cells CD71+ early erythoid cells CD34+ cells | |
| Selected genes | Hubs | |||||||
| Peripheral | – | – | ||||||
| Super-connectors | – | – | – | – | – | |||
| Malacards Asthma | – | – | – | – | ||||
| Percentage of genes in PubMed related to | “asthma” | 75.7% | 48.2% | 13.3% | 48.6% | 42.1% | 62.5% | 14.6% |
| “Inflammation or immunity” | 96.7% | 84.2% | 73.3% | 84.8% | 63.2% | 100.% | 56.1% | |
1Maximum of 5 enriched categories ordered by Enrichr Combined score showing an adjusted P-value < 0.05.
2Maximum of 5 tissues within a list of 17 tissues potentially found in the whole lung tissue samples in datasets under study (See
3Supplementary Figure S4 for details on Human Atlas Gene expression for all genes in each cluster).
4Top 5 hub genes ordered by Betweenness Centrality.
5Peripheral genes defined as 1-degree nodes connected to a clique.
6Malacards overlap for Asthma (119 genes, as for Sept 2014).
Figure 5Schematic diagram depicting connectivity between 4 domains and individual clusters in acute asthma signature list.
Bars indicate number of asthma-related (blue) and –ignorome (red) genes in each cluster or domain. Arrows indicating connections between clusters or domains are scaled according to relative connection strength between clusters and domains (see supplementary methods for details).
Figure 6Schematic diagram depicting 12 super-connector genes in the asthma-signature gene list.
Gene circle colour indicates the cluster of origin and connected to the pertaining topological cluster. Topological clusters are grouped into 4 biological domains by dotted rectangles.
Figure 7Expression profiles of 8 from 12 super-connectors were confirmed with quantitative PCR.
Data are presented as mean log2 fold changes of gene expression by quantitative real-time PCR and microarray relative to control mice. Quantitative real-time PCR data were determined in whole lung extracts and are pooled from 2 independent experiments (n = 6). Microarray data are from 6 publicly available datasets broken down into 10 direct comparisons of asthmatic and control mice (please refer to Supplementary data and Fig. 1 for details). *p < 0.05 compared with PBS challenged mice (unpaired t-test).
Figure 8Expression of super-connectors in acute allergic asthma after DEX treatment.
Total lung RNA was extracted from mice intranasally challenged with PBS (controls) and mice that received vehicle or DEX before and after OVA challenge to induce allergic asthma. Samples were collected at 72 h after allergen challenge and super-connector expression was determined with quantitative PCR. Super-connectors are grouped according to their change to DEX treatment into (A) completely reverted, (B) partially reverted, (C) further down regulated, (D) not affected with DEX treatment, and (E) not changed in comparison to PBS controls. Data are presented as mean ± SEM log2 fold changes of gene expression relative to control (PBS) mice and are pooled from 2 independent experiments (n = 6). *p < 0.05 compared with PBS challenged mice; #p < 0.05 compared with vehicle-treated group (unpaired t-test).