| Literature DB >> 26380270 |
Pankaj Kumar1, Anna Halama1, Shahina Hayat1, Anja M Billing1, Manish Gupta1, Noha A Yousri1, Gregory M Smith1, Karsten Suhre2.
Abstract
The number of RNA-Seq studies has grown in recent years. The design of RNA-Seq studies varies from very simple (e.g., two-condition case-control) to very complicated (e.g., time series involving multiple samples at each time point with separate drug treatments). Most of these publically available RNA-Seq studies are deposited in NCBI databases, but their metadata are scattered throughout four different databases: Sequence Read Archive (SRA), Biosample, Bioprojects, and Gene Expression Omnibus (GEO). Although the NCBI web interface is able to provide all of the metadata information, it often requires significant effort to retrieve study- or project-level information by traversing through multiple hyperlinks and going to another page. Moreover, project- and study-level metadata lack manual or automatic curation by categories, such as disease type, time series, case-control, or replicate type, which are vital to comprehending any RNA-Seq study. Here we describe "MetaRNA-Seq," a new tool for interactively browsing, searching, and annotating RNA-Seq metadata with the capability of semiautomatic curation at the study level.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26380270 PMCID: PMC4561952 DOI: 10.1155/2015/318064
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1The MetaRNA-Seq web interface. On the left it has the search functionality for RNA-Seq studies. Below the search, the table contains all RNA-Seq study details, including name, title, number of samples, number of experiments, and number of runs, allowing one to quickly scroll through all of the studies. The table is filtered based on the search. The table can be sorted by double clicking any column. Upon clicking any study in the table, the study details are populated at the upper right. A tree-like data structure containing biosamples, experiments, and runs for the selected study is populated in the lower right.
Figure 2RNA-Seq metadata annotations in MetaRNA-Seq. Suggestions are based on a program-assisted search of all data available for a particular study. Custom annotation fields can be used in cases when the annotator feels that additional information is important and it cannot be stored using default options. The annotator can use as many custom annotation fields as required.
Figure 3Guided search using annotated fields. The search is performed to identify RNA-Seq studies involving breast cancer with cell line as the sample type and annotation status as completed. The result output is a filtered table, with rows highlighted in green because it searched for completed annotation. The user can obtain additional details about any of the filtered studies by simply clicking on them.