| Literature DB >> 28185543 |
Daniela Evangelista1, Kumar Parijat Tripathi2, Mario Rosario Guarracino2.
Abstract
BACKGROUND: RNA sequencing takes advantage of the Next Generation Sequencing (NGS) technologies for analyzing RNA transcript counts with an excellent accuracy. Trying to interpret this huge amount of data in biological information is still a key issue, reason for which the creation of web-resources useful for their analysis is highly desiderable.Entities:
Keywords: Annotations; Database; Hydra vulgaris; MySQL; PHP; Transcriptome
Mesh:
Year: 2016 PMID: 28185543 PMCID: PMC5046195 DOI: 10.1186/s12859-016-1172-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The bioinformatics pipeline of the Atlas of Hydra vulgaris
Fig. 4Data retrieval path of the HAAC10000596 transcript
The Atlas content at a glance
| ENA | BlastX Hits | Proteins obtained | DAVID annotation | |
|---|---|---|---|---|
| Trans >200 | 45.269 | 31.988 | 18.133 | 13.761 |
| Trans <200 | 3.640 | n/a | n/a | n/a |
| LeftOver | n/a | 13.281 | n/a | 4.372 |
| TOT | 48.909 | 45.269 | 18.133 | 18.133 |
Fig. 2Species distribution
Fig. 3Functional annotation distribution of Hydra vulgaris transcriptome in the Atlas. a Domain annotation category shows Interpro and PFAM as the more highly enriched domains. b Miscellaneous category shows that SP-PIR keywords annotation occupies the half of the terms enriched within the databases. c Gene Ontology category shows that biological processes (BP) and molecular functions (MF) share the similar level of enrichment. d Protein interaction category shows that Mint and Bind databases are almost equally distributed. e Pathway category shows that KEGG and Panther are the most over expressed terms in relation with the stored transcripts