| Literature DB >> 22080548 |
Clara Amid1, Ewan Birney, Lawrence Bower, Ana Cerdeño-Tárraga, Ying Cheng, Iain Cleland, Nadeem Faruque, Richard Gibson, Neil Goodgame, Christopher Hunter, Mikyung Jang, Rasko Leinonen, Xin Liu, Arnaud Oisel, Nima Pakseresht, Sheila Plaister, Rajesh Radhakrishnan, Kethi Reddy, Stephane Rivière, Marc Rossello, Alexander Senf, Dimitriy Smirnov, Petra Ten Hoopen, Daniel Vaughan, Robert Vaughan, Vadim Zalunin, Guy Cochrane.
Abstract
The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena), Europe's primary nucleotide sequence resource, captures and presents globally comprehensive nucleic acid sequence and associated information. Covering the spectrum from raw data to assembled and functionally annotated genomes, the ENA has witnessed a dramatic growth resulting from advances in sequencing technology and ever broadening application of the methodology. During 2011, we have continued to operate and extend the broad range of ENA services. In particular, we have released major new functionality in our interactive web submission system, Webin, through developments in template-based submissions for annotated sequences and support for raw next-generation sequence read submissions.Entities:
Mesh:
Year: 2011 PMID: 22080548 PMCID: PMC3245037 DOI: 10.1093/nar/gkr946
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(A) Growth of assembled sequences (ENA:EMBL-Bank); see http://www.ebi.ac.uk/ena/about/statistics#embl_growth for dynamically updated growth chart. (B) Growth of raw data from next generation sequencing platforms (ENA: SRA); see http://www.ebi.ac.uk/ena/about/statistics#sra_growth for dynamically updated growth chart.
Figure 2.Usage of the different web-based interactive submission systems for annotated sequences at ENA between 2009 and 2011.
Names and definitions of templates currently available for sequence submissions to EMBL-bank
| Template name | Definition |
|---|---|
| Intergenic Spacer, IGS | For intergenic spacer (IGS) sequences between neighbouring genes (e.g. psbA-trnH IGS, 16S-23S rRNA IGS). Inclusion of the flanking genes is allowed |
| ITS region | For the 18S rRNA, ITS1, 5.8S rRNA, ITS2, 28S rRNA region, where the locations of the boundaries are not known |
| D-Loop | For mitochondrial D-loop (control region) sequences. All D-loops are considered partial |
| trnK-matK locus | For complete or partial matK gene within the chloroplast trnK gene |
| COI gene | For mitochondrial cytochrome oxidase subunit 1 genes |
| MHC gene 1 exon | For partial MHC class I or II antigens containing one exon |
| MHC gene 2 exons | For partial MHC class I or II antigens containing two exons |
| Single CDS genomic DNA | For complete or partial single non-segmented coding sequence (CDS) derived from genomic DNA |
| Single viral CDS genomic RNA | For complete or partial single coding sequence (CDS) derived from viral genomic RNA. Please do not use for viral DNA, peptides processed from polyproteins, viral cRNAs, or proviral sequences, as these are all annotated differently |
| Single CDS mRNA | For complete or partial single coding sequence (CDS) derived from mRNA (via cDNA) |
| rRNA gene | For ribosomal RNA genes from prokaryotic, nuclear or mitochondrial DNA. All rRNAs are considered partial |
| EST | For EST (expressed sequence tag) submissions |
| WGS (unannotated) | For unannotated Whole Genome Shotgun (WGS) sequences |
| MIMARKS-Survey 16S rRNA sequences | For the submission of 16S rRNA sequence compliant with the MIMARKS Minimal Information about a MARKer gene Sequence Standard |
| Soil sample MIMARKS-Survey using 16S rRNA sequences | For the submission of 16S rRNA sequence compliant with the MIMARKS Minimal Information about a MARKer gene Sequence Standard, specific to soil metagenomes |
Figure 3.Screenshot of raw data definition page in SRA Webin.