Carola Kanz, Philippe Aldebert, Nicola Althorpe, Wendy Baker, Alastair Baldwin, Kirsty Bates, Paul Browne, Alexandra van den Broek, Matias Castro, Guy Cochrane, Karyn Duggan, Ruth Eberhardt, Nadeem Faruque, John Gamble, Federico Garcia Diez, Nicola Harte, Tamara Kulikova, Quan Lin, Vincent Lombard, Rodrigo Lopez, Renato Mancuso, Michelle McHale, Francesco Nardone, Ville Silventoinen, Siamak Sobhany, Peter Stoehr, Mary Ann Tuli, Katerina Tzouvara, Robert Vaughan, Dan Wu, Weimin Zhu, Rolf Apweiler.
Abstract
The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl), maintained at the European Bioinformatics Institute (EBI) near Cambridge, UK, is a comprehensive collection of nucleotide sequences and annotation from available public sources. The database is part of an international collaboration with DDBJ (Japan) and GenBank (USA). Data are exchanged daily between the collaborating institutes to achieve swift synchrony. Webin is the preferred tool for individual submissions of nucleotide sequences, including Third Party Annotation (TPA) and alignments. Automated procedures are provided for submissions from large-scale sequencing projects and data from the European Patent Office. New and updated data records are distributed daily and the whole EMBL Nucleotide Sequence Database is released four times a year. Access to the sequence data is provided via ftp and several WWW interfaces. With the web-based Sequence Retrieval System (SRS) it is also possible to link nucleotide data to other specialist molecular biology databases maintained at the EBI. Other tools are available for sequence similarity searching (e.g. FASTA and BLAST). Changes over the past year include the removal of the sequence length limit, the launch of the EMBLCDSs dataset, extension of the Sequence Version Archive functionality and the revision of quality rules for TPA data.
Year: 2005 PMID: 15608199 PMCID: PMC540052 DOI: 10.1093/nar/gki098
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
SRS data libraries
| Library | Content |
|---|---|
| EMBL | Entire EMBL Nucleotide Sequence Database apart from Contig and expanded Contig data |
| EMBL (Release) | The latest public release of the EMBL Nucleotide Sequence Database |
| EMBL (Updates) | All entries that are new or updated since the latest public release |
| EMBL (Third Party Annotation) | TPA data |
| EMBL (Contig) | CON entries |
| EMBL (Contigs Expanded) | Expanded CON entries |
| EMBL (Coding Sequences) | CDS data |
| EMBLALIGN (under ‘Nucleotide related databases’) | Alignment data |
Figure 1. A sample entry from the EMBLCDSs dataset.
| AH | TPA_SPAN | PRIMARY_IDENTIFIER | PRIMARY_SPAN | COMP |
|---|---|---|---|---|
| AS | 1-251 | BE529226.1 | 1-251 | |
| AS | 68-450 | BE524624.1 | 1-383 | |
| AS | 394-1086 | AJ420881.1 | 1-693 | |
| AS | 826-1211 | AV561543.1 | 1-386 | c |
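The assembly rows above can be read programmatically. The following is a minimal sketch, not part of the original record, that parses the four `AS` rows as they appear in this table and checks that each TPA span covers the same number of bases as its primary span. The names `AssemblySpan`, `parse_span` and `parse_as_row` are hypothetical, and treating equal span lengths as a consistency check is an assumption that happens to hold for this sample, not a rule stated in the text.

```python
from typing import NamedTuple, Tuple


class AssemblySpan(NamedTuple):
    """One AS row: a TPA span mapped onto a span of a primary entry."""
    tpa_start: int
    tpa_end: int
    primary_id: str
    primary_start: int
    primary_end: int
    complement: bool  # True when the COMP column holds 'c'


def parse_span(text: str) -> Tuple[int, int]:
    """Turn a span such as '68-450' into the pair (68, 450)."""
    start, end = text.split("-")
    return int(start), int(end)


def parse_as_row(row: str) -> AssemblySpan:
    """Parse one pipe-delimited AS row as laid out in the table above."""
    # Drop the empty strings produced by the leading and trailing pipes.
    cells = [cell.strip() for cell in row.split("|")[1:-1]]
    tag, tpa_span, primary_id, primary_span, comp = cells
    if tag != "AS":
        raise ValueError(f"not an AS row: {row!r}")
    tpa_start, tpa_end = parse_span(tpa_span)
    primary_start, primary_end = parse_span(primary_span)
    return AssemblySpan(tpa_start, tpa_end, primary_id,
                        primary_start, primary_end, comp == "c")


# The four rows copied from the table above.
ROWS = [
    "| AS | 1-251 | BE529226.1 | 1-251 | |",
    "| AS | 68-450 | BE524624.1 | 1-383 | |",
    "| AS | 394-1086 | AJ420881.1 | 1-693 | |",
    "| AS | 826-1211 | AV561543.1 | 1-386 | c |",
]

spans = [parse_as_row(row) for row in ROWS]

for span in spans:
    tpa_length = span.tpa_end - span.tpa_start + 1
    primary_length = span.primary_end - span.primary_start + 1
    # Assumption of this sketch: both spans should cover the same number of bases.
    status = "ok" if tpa_length == primary_length else "length mismatch"
    strand = "complement" if span.complement else "forward"
    print(f"{span.primary_id}: {tpa_length} bases, {strand}, {status}")
```

Keeping each row as a named tuple makes follow-up checks, such as finding overlaps between consecutive TPA spans, a matter of comparing integer fields.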