| Literature DB >> 31978081 |
Abstract
Gene expression data have been archived as microarray and RNA-seq datasets in two public databases, Gene Expression Omnibus (GEO) and ArrayExpress (AE). In 2018, the DNA DataBank of Japan started a similar repository called the Genomic Expression Archive (GEA). These databases are useful resources for the functional interpretation of genes, but have been separately maintained and may lack RNA-seq data, while the original sequence data are available in the Sequence Read Archive (SRA). We constructed an index for those gene expression data repositories, called All Of gene Expression (AOE), to integrate publicly available gene expression data. The web interface of AOE can graphically query data in addition to the application programming interface. By collecting gene expression data from RNA-seq in the SRA, AOE also includes data not included in GEO and AE. AOE is accessible as a search tool from the GEA website and is freely available at https://aoe.dbcls.jp/.Entities:
Year: 2020 PMID: 31978081 PMCID: PMC6980531 DOI: 10.1371/journal.pone.0227076
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Comparison of EBI ArrayExpress and NCBI Gene Expression Omnibus.
The number of overlapping data-series entries in ArrayExpress (left) and Gene Expression Omnibus (right).
Fig 2How to make AOE index.
Three levels of AOE data flow are required to make the AOE index. Level 1 from ArrayExpress data, level 2 from GEO data in SRA via DBCLS SRA API, and level 3 from RNA-seq data in SRA, but not in GEO via DBCLS SRA API.
Fig 3AOE web interface.
Users can retrieve data of interest graphically in the AOE web interface.