Guy P Hunt, Luigi Grassi, Rafael Henkin, Fabrizio Smeraldi, Thomas P Spargo, Renata Kabiljo, Sulev Koks, Zina Ibrahim, Richard J B Dobson, Ammar Al-Chalabi, Michael R Barnes, Alfredo Iacoangeli.
Abstract
Gene Expression Omnibus (GEO) is a database repository hosting a substantial proportion of publicly available high-throughput gene expression data. Gene expression analysis is a powerful tool to gain insight into the mechanisms and processes underlying the biological and phenotypic differences between sample groups. Despite the wide availability of gene expression datasets, their access, analysis, and integration are not trivial and require specific expertise and programming proficiency. We developed the GEOexplorer webserver to allow scientists to access, integrate and analyse gene expression datasets without requiring programming proficiency. Via its user-friendly graphical interface, users can easily apply GEOexplorer to perform interactive and reproducible gene expression analysis of microarray and RNA-seq datasets, while producing a wealth of interactive visualisations to facilitate data exploration and interpretation, and generating a range of publication-ready figures. The webserver allows users to search and retrieve datasets from GEO, to upload user-generated data, and to combine and harmonise two datasets to perform joint analyses. GEOexplorer, available at https://geoexplorer.rosalind.kcl.ac.uk, provides a solution for performing interactive and reproducible analyses of microarray and RNA-seq gene expression data, empowering life scientists to perform exploratory data analysis and differential gene expression analysis on-the-fly without informatics proficiency.
Year: 2022 PMID: 35609980 PMCID: PMC9252785 DOI: 10.1093/nar/gkac364
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1. GEOexplorer workflow overview. (A) GEOexplorer's workflow begins with users selecting the source of their gene expression dataset, either GEO or user upload. GEOexplorer automatically sources GEO microarray datasets and several formats of GEO RNA-seq datasets, and it lets users search for GEO datasets. Users can also upload their own gene expression datasets. (B) Users can choose to combine two gene expression datasets and then apply batch correction so that they are comparable. (C) Log2 transformation and k-nearest neighbour (KNN) imputation can be selected before analysing microarray data. Log2 and counts per million transformations can be selected before analysing RNA-seq data. (D) Dataset details, including information about the study and experiment, can be reviewed. (E) Results of exploratory data analysis (EDA) can be reviewed. (F) Options for differential gene expression analysis (DGEA) can be set based on the outputs of EDA. Subsequently, the outputs of DGEA can be reviewed. (G) Options for gene enrichment analysis (GEA) can be set. Subsequently, the outputs of GEA can be reviewed.
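The RNA-seq transformations named in panel (C) can be illustrated with a minimal sketch. This is not GEOexplorer's implementation: the function names `counts_per_million` and `log2_transform`, the toy count table, and the pseudocount of 1 (added so that zero counts have a defined logarithm) are all assumptions made for illustration.

```python
import math


def counts_per_million(counts):
    """Scale raw counts to counts per million (CPM), sample by sample.

    counts: dict mapping gene name -> list of raw counts, one per sample.
    (Hypothetical helper, not part of GEOexplorer.)
    """
    n_samples = len(next(iter(counts.values())))
    # Library size = total counts observed in each sample (column sum).
    library_sizes = [sum(row[j] for row in counts.values())
                     for j in range(n_samples)]
    if any(size == 0 for size in library_sizes):
        raise ValueError("a sample with zero total counts cannot be scaled")
    return {gene: [row[j] * 1e6 / library_sizes[j] for j in range(n_samples)]
            for gene, row in counts.items()}


def log2_transform(values, pseudocount=1.0):
    """Apply log2(x + pseudocount); the pseudocount is an assumed choice."""
    return {gene: [math.log2(v + pseudocount) for v in row]
            for gene, row in values.items()}


# Toy data: three genes measured in two samples (made-up numbers).
raw = {"geneA": [10, 0], "geneB": [30, 50], "geneC": [60, 50]}
cpm = counts_per_million(raw)
log_cpm = log2_transform(cpm)
```

Scaling to CPM before taking logarithms removes differences in sequencing depth between samples, so the log-transformed values are comparable across columns.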
Figure 2. GEOexplorer data collection, harmonisation, and transformation settings, study and experiment information, and EDA outputs. (A) GEOexplorer data collection, harmonisation, and transformation settings. (B) Experiment information. (C) Experimental conditions information. (D) Gene expression dataset. (E) Mean-variance plot. (F) Expression density plot pre-batch correction. (G) Expression density plot post-empirical Bayes batch correction. (H) Heatmap plot. (I) Box-and-whisker plot pre-batch correction. (J) Box-and-whisker plot post-empirical Bayes batch correction. (K) 3D PCA variables.
Figure 3. GEOexplorer DGEA settings and DGEA outputs. (A) Sample selection for group 1. (B) Sample selection for group 2. (C) DGEA options. (D) Table of the differentially expressed probes. (E) Histogram plot of adjusted P-values. (F) Quantile-quantile (QQ) plot. (G) Volcano plot. (H) Mean difference plot. (I) Heatmap plot.
Figure 4. GEOexplorer GEA settings and GEA outputs. (A) Selecting the gene symbols. (B) GEA options. (C) Table of enriched terms. (D) Bar chart of the top enriched terms. (E) Volcano plot. (F) Manhattan plot.