| Literature DB >> 28222698 |
Sergio Gonzalez1, Bernardo Clavijo2, Máximo Rivarola3,4, Patricio Moreno5, Paula Fernandez6,7,8, Joaquín Dopazo9, Norma Paniego6,7.
Abstract
BACKGROUND: In the last years, applications based on massively parallelized RNA sequencing (RNA-seq) have become valuable approaches for studying non-model species, e.g., without a fully sequenced genome. RNA-seq is a useful tool for detecting novel transcripts and genetic variations and for evaluating differential gene expression by digital measurements. The large and complex datasets resulting from functional genomic experiments represent a challenge in data processing, management, and analysis. This problem is especially significant for small research groups working with non-model species.Entities:
Keywords: Data integration; De novo transcriptomics; Ontology storage; Web application
Mesh:
Year: 2017 PMID: 28222698 PMCID: PMC5320735 DOI: 10.1186/s12859-017-1494-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 2Screenshot of ATGC views. a Drop down menu displaying all available options. b Search output by gene name. The Table shows the transcript inferred functional annotation. Tools to define Table setting and internal searches are provided at the Table header. c Search output by ontology terms; the Table displays direct and indirect annotated features and contains a link to deploy the members associated to the feature names. d Pie charts and structured graph to explore annotation from any loaded ontology. The pie chart shows the distribution of annotation in direct children terms. The sequence counts associated to each classification take into account the possible multiple annotations; users have links to obtain the lists of sequences annotated under each term
Fig. 1Application schema. Schema illustrating how the four main components interact with the Chado database through the pychado module. Each component contains a set of functions (controllers) and views which enable users to access to the database content
Fig. 3Feature details. a Graphic representation of a transcript with a detailed description of exons, CDS, mRNA, UTRs, molecular markers and protein domains. The table below the graphic shows the features relationships with reference positions. b Dynamic expression graphics, taking into account the experiment structure; users can select the conditions to plot and generate graphics divided by statistical value and genotype. The charts display the mean and standard deviation values of replicates in each condition. c GO acyclic graph shows functional annotation terms related to the feature annotations. The lists contain Direct and Inherited associated ontology terms with links to detailed information about each term
Tools capability comparison
| ATGC | Tripal | ||
|---|---|---|---|
| Complete software installation | ✓✓✓ | ✓ | |
| Access to personalize whole application | ✓ | ✓✓ | |
| Access control for external (non-administrative) users | ✓✓ | ✓✓✓ | |
| Out of the box ready querying of ontology terms (GO, SO) in database | ✓✓✓ | ✓ | |
| Expression data integrated in schema (ready-to-use) | ✓✓✓ | Ni | |
| Functional Annotation Loading | Blast2Go (annot files) | ✓✓✓ | ✓ |
| InterProScan (raw or xml files) | ✓✓✓ | ✓✓✓ | |
| Kegg Pathways (KAAS output files) | Ni | ✓✓✓ | |
GMODWeb, ChadoOnRails, and Badger cannot be included in the comparison because could not be installed and tested for lack of documentation or support. ✓ - computer expertise required; ✓✓ - moderately easy to complete; ✓✓✓ - straightforward; Ni Not implemented