| Literature DB >> 31290545 |
Susie S Y Huang1, Fatima Al Ali1, Sabri Boughorbel1, Mohammed Toufiq1, Damien Chaussabel1, Mathieu Garand1.
Abstract
Prevalence of allergies has reached ~20% of population in developed countries and sensitization rate to one or more allergens among school age children are approaching 50%. However, the combination of the complexity of atopic allergy susceptibility/development and environmental factors has made identification of gene biomarkers challenging. The amount of publicly accessible transcriptomic data presents an unprecedented opportunity for mechanistic discoveries and validation of complex disease signatures across studies. However, this necessitates structured methodologies and visual tools for the interpretation of results. Here, we present a curated collection of transcriptomic datasets relevant to immunoglobin E-mediated atopic diseases (ranging from allergies to primary immunodeficiencies). Thirty-three datasets from the Gene Expression Omnibus, encompassing 1860 transcriptome profiles, were made available on the Gene Expression Browser (GXB), an online and open-source web application that allows for the query, visualization and annotation of metadata. The thematic compositions, disease categories, sample number and platforms of the collection are described. Ranked gene lists and sample grouping are used to facilitate data visualization/interpretation and are available online via GXB (http://ige.gxbsidra.org/dm3/geneBrowser/list). Dataset validation using associated publications showed good concordance in GXB gene expression trend and fold-change.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31290545 PMCID: PMC6616200 DOI: 10.1093/database/baz066
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1Workflow of search strategies for datasets retrieval from the GEO database. Datasets and search terms obtained from the seven independent search strategies were merged and with duplicates removed. The results were then filtered for human studies and microarray or NGS planforms. The studies were manually confirmed for IgE-related pathogenesis and only datasets with appropriate study design, sample type and disease groups were kept. The final collection consists of 33 datasets. Brackets indicate the number of datasets obtained after each search step.
Descriptive summary of the dataset collection
| Title | GEO ID | Platform name | Platform GEO accession number (GPL) | Disease/treatment/therapy | Sample type | Number of sample | Expt designa | Default group comparisonb | Pubmed IDc | Trend validation | FC validation | Comments for validation | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Allergen-specific immunotherapy modulates the balance of circulating Tfh and Tfr cells | GSE87399 | Illumina HiSeq 2500 | GPL16791 | Allergy | PBMC/T cells | 17 |
| Low allergic vs High allergic | 28506846 | CXCR5, FOXP3 | Good validation | |
| 2 | Comparison of two sets of microarray experi ments to define allergic asthma expression pattern | GSE41649 | Affymetrix Human Genome U133A Array | GPL96 | Allergic asthma | Bronchial biopsy | 8 |
| Allergic asthma vs Healthy | 1984284 | SERPINB2, CX3CR1, C7 | Strong validation | |
| 3 | Distinct epithelial gene expression phenotypes in childhood respiratory allergy—disease state | GSE19190 | Affymetrix Human Gene 1.0 ST Array [transcript(gene) version] | GPL6244 | Rhinitis, allergy, asthma | Human nasal epithelium cells | 38 |
| Rhinitis vs Healthy | 22005912 | Validated with stimulation | ||
| 4 | Distinct epithelial gene expression phenotypes in childhood respiratory allergy—Stimulation | GSE19190 | Affymetrix Human Gene 1.0 ST Array [transcript(gene) version | GPL6244 | Rhinitis, allergy, asthma | Human nasal epithelium cells | 21 |
| IL4 vs Control | 22005912 | DHX58, PRIC285, STARD5 | Good validaton | |
| 5 | Effect of intraderma immunotherapy (IDIT) injections on gene expression profiles of activated T cells derived from skin biopsy explants | GSE72324 | Illumina HumanHT-12 V4.0 | GPL10558 | Grass pollen allergy | T cells | 15 |
| IDIT vs Control | 27773851 | TNFSF8, TNIP3, HDAC1 | Strong validation | |
| 6 | Expression data for human epithelium from subjects with atopic dermatitis, psoriasis and nonatopic controls | GSE26952 | Sentrix HumanRef-8 Expression BeadChip | GPL2700 | Atopic dermatitis, psoriasis | Epidermis | 16 |
| Atopic dermatitis vs Control | 21163515 | GJA1, TGM1, OCLN | Strong validation | |
| 7 | Expression data from irritable bowel syndrome (IBS) patients before and after treatment | GSE14842 | Affymetrix Human Genome U133 Plus 2.0 Array | GPL570 | Diarrhea IBS | Epidermis | 14 |
| Oral cromoglycate vs Control | N/A | No associated publication | ||
| 8 | Functional classes of bronchial mucosa genes that are differentially expressed in asthma | GSE15823 | Affymetrix Human Genome U95 Version 2 Array | GPL8300 | Asthma | Bronchial biopsy | 12 |
| Allergic asthma vs Healthy | 15038835 | SFRP1, Alox15, LDB1 | Strong validation | |
| 9 | Gene expression analysis related to olive pollen allergy | GSE37157 | Affymetrix Human Genome U133 Plus 2.0 Array | GPL570 | Olive pollen allergy | PBMC | 28 |
| Polination asymptomatic vs Polinaztion non-allergic | 23830385 | No associated publication | ||
| 10 | Gene expression changes in early phase of venom immunotherapy | GSE92866 | Affymetrix Human Gene Expression Array [Brainarray ENTREZG Version 20] | GPL22841 | Venom allergy | Blood | 59 |
| Beekeeper vs Control | N/A | CD99, SDC1 | GATA3, FoxP3 | No associated publication, but some articles mentioned up-regulated genes that matches with GXB data |
| 11 | Gene expression pattern of alveolar macrophages of allergic asthmatics in comparison with control subjects | GSE22528 | Affymetrix Human Genome U133A Array | GPL96 | Asthma | Alveolar macrophage | 10 |
| Asthma vs. Control | 19913588 | CCR1, HSPD1, SERPINH1 | Strong validation | |
| 12 | Gene expression patterns in house dust mite (HDM) stimulated CD4 T cells and IgG to IgE ratios | GSE70760 | Affymetrix Human Gene 1.0 ST Array [hugene10st_Hs_ENTREZG_19.0.0] | GPL20171 | HDM | T cells | 90 |
| HDM vs Control | 26518094 | IL4,IL13,IL17RB | Data not available or reported differently in associated publication | |
| 13 | Gene expression patterns in PBMC associated with asthma exacerbation attack | GSE19301 | Affymetrix Human Genome U133A Array | GPL96 | Asthma | PBMC | 685 |
| Exacerbation vs Quiet | 21779351 | Data reported differently | ||
| 14 | Gene expression profile of patients with moderate and severe chronic spontaneous urticaria | GSE72542 | Agilent-039494 SurePrint G3 Human GE v2 8x60K Microarray 039381 (Feature Number version) | GPL16699 | Chronic spontaneous urticaria | Skin/whole blood | 61 |
| Patient wheal vs Healthy | 28407332 | FOSB, S100A9, ADAMTS4 | Data not available or reported differently in associated publication | |
| 15 | Gene expression profiling of patients with allergy to latex and/or vegetable food | GSE13619 | Affymetrix Human HG-Focus Target Array | GPL201 | Latex/plant-derived food allergy | PBMC | 21 |
| Latex fruit vs Healthy | N/A | IFNG, STAT4, IL10RA | No associated publication, but some articles mentioned up-regulated genes that matches with GXB data | |
| 16 | Genome-wide expression analysis demonstrates a dominant role of TLR4 for activation of human phagocytes by the alarmin MRP8 | GSE56681 | Affymetrix Human Genome U133 Plus 2.0 Array | GPL96 GPL570 | Alarmins myeloid-related protein 8 and 14 signaling | Monocytes | 19 |
| MRP8 vs Control | 25505274 | TRIP10, NFBK1, RNASE6 | Good Validation | |
| 17 | Genome-wide expression profiling of B Lymphocytes reveals IL4R increase in allergic asthma | GSE52742 | Illumina HiSeq 2000 | GPL11154 | HDM | B cells | 6 |
| Allergy vs Control | 24975796 | IL4R, TCL1A, SESTD1 | Strong validation | |
| 18 | Human basophil expression profiles–Atopic vs Non-atopic | GSE64639 | Illumina HumanHT-12 V4.0 | GPL10558 | Healthy subjet | Pheripheral blood | 16 |
| Atopic vs Non-atopic | 25962139 | Data not available or reported differently in associated publication | ||
| 19 | Identification of IL-21-induced STAT3 dependent genes in human B cells | GSE51587 | Affymetrix Human Gene 1.0 ST Array [transcript (gene) version] | GPL6244 | Job’s syndrome | B cells | 14 |
| AD-HIES CD40L vs Control CD40L | 24159173 | PRDM1, IL2RA, SGK1 | Gene expression values not available in associated publication | |
| 20 | Illumina Bead expression array data from Human IgE+ and IgG+ B cell subsets | GSE99948 | Illumina HumanHT-12 V4.0 | GPL10558 | Tonsillectomy/Heathly subject | B cells | 24 |
| Plasma Blast IgE+ vs B Cell IgE+ | N/A | CD99, SDC1 | No associated publication, but some articles mentioned up-regulated genes that matches with GXB data | |
| 21 | Influence of olive pollen stimuli on the geneexpression profile in healthy controls and allergic patients | GSE54522 | Affymetrix Human Genome U133 Plus 2.0 Array | GPL570 | Olive pollen allergy | PBMC | 46 |
| Allergic basal vs Control basal | 25553522 | Data not available or reported differently in associated publication | ||
| 22 | Integrated genomic and prospective clinical studies show the importance of modular pleiotropy for disease susceptibility, diagnosis and treatment (dataset 1) | GSE44956 | Agilent-028004 SurePrint G3 Human GE 8x60K Microarray (Probe Name version) | GPL14550 | Seasonal allergic rhinitis | T cells | 48 |
| Low allergen GCR vs Low diluent | 24571673 | Data not available or reported differently in associated publication | ||
| 23 | Lack of allergy of timothy grass pollen is not a passive phenomenon but associated with allergen specific modulation of immune reactivity | GSE70050 | Illumina HiSeq 2500 | GPL16791 | Pollen allergy | PBMC/T cells | 76 |
| Allergic_IL5 vs NonAllergic_IL5 | 4846575 | Discrepancy in sample names between GEO and SOFT files. Reported group comparison could not be replicated | ||
| 24 | Leukotriene E4 is a full functional agonist for human cysteinyl leukotriene type 1 receptor—Cell Comparison | GSE75603 | Affymetrix Human Gene 1.0 ST Array [transcript (gene) version] | GPL6244 | Leukotriene E4 response | Mast cells | 6 |
| LUVA vs LAD2 | 26830450 | Validated with stimulation | ||
| 25 | Leukotriene E4 is a full functional agonist for human cysteinyl leukotriene type 1 receptor—Stimulation | GSE75603 | Affymetrix Human Gene 1.0 ST Array [transcript (gene) version] | GPL6244 | Leukotriene E4 response | Mast cells | 9 |
| LTD4 vs Control | 26830450 | STX3, GGR3, NFKBID | Good validaton | |
| 26 | Neutrophil and PBMC gene expression data from Job’s Syndrome—Neutrophils | GSE8507 | Affymetrix Human Genome U133 Plus 2.0 Array | GPL570 | Job’s syndrome | Neutrophils | 90 |
| HIES control 3 vs Healthy control 3 | 17881745 | Validated with PBMC | ||
| 27 | Neutrophil and PBMC gene expression data from Job’s Syndrome - PBMC | GSE8507 | Affymetrix Human Genome U133 Plus 2.0 Array | GPL570 | Job’s syndrome | PBMC | 51 |
| HIES control 0 vs Healthy control 0 | 17881745 | CD151, CLEC12A, CoL6A2 | Good validaton | |
| 28 | Novel mediators of eicosanoid and epithelial nitric oxide production in asthma | GSE13785 | Affymetrix Human Genome U133 Plus 2.0 Array | GPL96 + GPL570 | Asthma | Sputum | 22 |
| EIB pos healthy pre vs EIB neg pre | 20052409 | TGM2, THBS1, CD24 | Good validaton | |
| 29 | Progressive activation of Th2/Th22 characterizes acute and chronic atopic dermatitis | GSE36842 | Affymetrix Human Genome U133 Plus 2.0 Array | GPL570 | Atopic dermatitis | Skin | 39 |
| Chronic lesion vs Normal | 22951056 | PI3, S100A8, S100A7 | Strong validation | |
| 30 | The effect of a dexamethasone and a FK506 on the induction of chemokines in human mast cells | GSE15174 | Affymetrix Human Genome U133 Plus 2.0 Array | GPL570 | Allergy | Mast cells | 5 |
| AlgE vs DMSO | 19454720 | CCL1, CCL2, CCL4 | Good validaton | |
| 31 | Transcriptional profiling of egg allergy and relationship to disease phenotype | GSE88796 | Illumina HumanHT-12 V4.0 | GPL10558 | Egg allergy | PBMC | 132 |
| Allergic control vs Healthy control | 27 788 149 | CEACAM1, CLC, CCL17 | Good validaton | |
| 32 | Transcriptome profiling of (i) nasal polyp derived human IL17RB positive and negative T-helper cells and (ii) T-helper cells from normal nasal mucosa and matched peripheral blood | GSE70898 GSE70900 | Illumina HumanHT-12V4.0 | GPL10558 | Chronic rhinosinusitis with nasal polyposis | T cells | 24 |
| P Nasal Neg Act vs P Nasal Neg Rest | 26684290 | PCID2, VPS13C, GPR87 | Strong validation | |
| 33 | Upper airway gene expression is an effective surrogate biomarker for Th2-driven inflammation in the lower airway | GSE41861 | Affymetrix Human Genome U133 Plus 2.0 Array | GPL570 | Allergic asthma | Human nasal epithelium/ Bronchial biopsies | 138 |
| Asthma Severe_N vs Healthy Control_N | N/A | No associated publication |
aN/A = Not available.
bMeaning, the ranklist displayed by default in GXB for the dataset.
c Ex vivo: Experimentation or measurements done in or on tissue from an organism in an external environment with minimal alteration of natural conditions; e.g. purification of specific cell types. In vitro: Experimentation or measurements done in or on altered tissue from an organism in an altered external environment; e.g. stimulation of cultured bilological specimen.
Figure 2Examples of dataset validation. Differentially expressed genes from two datasets were compared with the results presented in the respective publications. (A) When comparing between Job’s syndrome patients (HIES_control_0) and healthy controls (Healthy_control_0), the mean fold-change of CD151 was 1.7 on the GXB (GSE8507-PBMC). The reported value in Holland et al. (2007) (39) was 2.0. (B) When comparing between PBMC samples from egg allergic patients (Allergic_Egg) and allergic egg-tolerant controls (Allergic_Control), the mean fold-change of CEACAM1 was 1.69 on the GXB (GSE88796). Kosoy et al. (2016) (40) reported a mean fold-change of 1.6 for the same gene. The overall trends of gene expression are conserved between GXB and published data.
Figure 3Comparison of gene expression profile in different tissues from HDM-sensitized individuals. Gene expression data from two datasets present in our collection are shown: (i) gene expression patterns in HDM-stimulated CD4 T cells and IgG to IgE ratios—GSE70760, and (ii) distinct epithelial gene expression phenotypes in childhood respiratory allergy—GSE19190—disease state. SERPINB2 (A and B) and IL1β (C and D) gene expression. HDM = house dust mite; Control = Healthy = healthy individuals; and Uncontrolled asthma = individuals with rhinitis and uncontrolled asthma (further definition can be found in the original study description (33)).
Figure 4Annotated gene list function. Number of unique and shared differentially expressed genes from the disease asthma (A) and pathways-cytokine and chemokine receptors (B) gene lists in allergic patients and healthy controls from three different studies (GSE22528, GSE19190 and GSE15823) are shown.