| Literature DB >> 19906712 |
Rasko Leinonen1, Ruth Akhtar, Ewan Birney, James Bonfield, Lawrence Bower, Matt Corbett, Ying Cheng, Fehmi Demiralp, Nadeem Faruque, Neil Goodgame, Richard Gibson, Gemma Hoad, Christopher Hunter, Mikyung Jang, Steven Leonard, Quan Lin, Rodrigo Lopez, Michael Maguire, Hamish McWilliam, Sheila Plaister, Rajesh Radhakrishnan, Siamak Sobhany, Guy Slater, Petra Ten Hoopen, Franck Valentin, Robert Vaughan, Vadim Zalunin, Daniel Zerbino, Guy Cochrane.
Abstract
The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena) is Europe's primary nucleotide sequence archival resource, safeguarding open nucleotide data access, engaging in worldwide collaborative data exchange and integrating with the scientific publication process. ENA has made significant contributions to the collaborative nucleotide archival arena as an active proponent of extending the traditional collaboration to cover capillary and next-generation sequencing information. We have continued to co-develop data and metadata representation formats with our collaborators for both data exchange and public data dissemination. In addition to the DDBJ/EMBL/GenBank feature table format, we share metadata formats for capillary and next-generation sequencing traces and are using and contributing to the NCBI SRA Toolkit for the long-term storage of the next-generation sequence traces. During the course of 2009, ENA has significantly improved sequence submission, search and access functionalities provided at EMBL-EBI. In this article, we briefly describe the content and scope of our archive and introduce major improvements to our services.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19906712 PMCID: PMC2808951 DOI: 10.1093/nar/gkp998
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
ENA-Annotation and ENA-Assembly data classes
| Data class | Description |
|---|---|
| Expressed sequence tag (EST) | High-throughput short transcribed cDNA (mRNA) sequences |
| Genome survey sequence (GSS) | High-throughput short genomic sequences |
| High throughput cDNA sequencing (HTC) | Unfinished cDNA (mRNA) sequences |
| High throughput genome sequencing (HTG) | Unfinished genomic sequences |
| Patent sequence (PAT) | Patent sequences |
| Sequenced tagged site (STS) | Short unique genomic sequences |
| Standard sequence (STD) | High-quality annotated sequences |
| Third party annotation sequence (TPA) | Re-annotated and re-assembled sequences |
| Transcriptome shotgun assembly (TSA) | Computationally assembled sequences |
| Whole genome shotgun (WGS) | Shotgun sequences |
| Constructed sequences (CON) | Sequence assemblies primarily from WGS sequences |
ENA-Reads data classes
| Data class | Description |
|---|---|
| Trace Archive | Sequence traces with base, quality and intensity information from capillary sequencing instruments |
| Sequence read archieve (SRA) | Sequence traces with base, quality and intensity information from next-generation sequencing instruments |
Figure 1.Selection of the sequence submission type.
Figure 2.Selection of the fields to include in the submission.
Figure 3.Submission summary page.
Figure 4.ENA Browser project page.
Figure 5.ENA Browser taxonomy page.