| Literature DB >> 18039715 |
Guy Cochrane1, Ruth Akhtar, Philippe Aldebert, Nicola Althorpe, Alastair Baldwin, Kirsty Bates, Sumit Bhattacharyya, James Bonfield, Lawrence Bower, Paul Browne, Matias Castro, Tony Cox, Fehmi Demiralp, Ruth Eberhardt, Nadeem Faruque, Gemma Hoad, Mikyung Jang, Tamara Kulikova, Alberto Labarga, Rasko Leinonen, Steven Leonard, Quan Lin, Rodrigo Lopez, Dariusz Lorenc, Hamish McWilliam, Gaurab Mukherjee, Francesco Nardone, Sheila Plaister, Stephen Robinson, Siamak Sobhany, Robert Vaughan, Dan Wu, Weimin Zhu, Rolf Apweiler, Tim Hubbard, Ewan Birney.
Abstract
The Ensembl Trace Archive (http://trace.ensembl.org/) and the EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/), known together as the European Nucleotide Archive, continue to see growth in data volume and diversity. Selected major developments of 2007 are presented briefly, along with data submission and retrieval information. In the face of increasing requirements for nucleotide trace, sequence and annotation data archiving, data capture priority decisions have been taken at the European Nucleotide Archive. Priorities are discussed in terms of how reliably information can be captured, the long-term benefits of its capture and the ease with which it can be captured.Entities:
Mesh:
Year: 2007 PMID: 18039715 PMCID: PMC2238915 DOI: 10.1093/nar/gkm1018
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.EMBL submission portal. The screenshot shows the submission portal top page, where submission options are presented to the user.
Points of entry to the European Nucleotide Archive—submissions, retrieval and support
| Tool | Point of entry | |
|---|---|---|
| Submission | Project registration | |
| Trace submission | ||
| Sequence/annotation new data and updates | ||
| Retrieval | SRS | |
| Homology search—EMBL-Bank | ||
| Homology search—ETA | ||
| Sequence Version Archive | ||
| EMBL-Bank FTP | ||
| ETA FTP | ftp://ftp.ensembl.org/pub/traces/ | |
| Genomes | ||
| Dbfetch | ||
| Wsdbfetch | ||
| Custom EMBL-Bank datasets | datasubs@ebi.ac.uk | |
| EMBL World | ||
| Custom ETA datasets | trace-request@ensembl.org | |
| Support | Help | datasubs@ebi.ac.uk |
| General Information | ||
| News | ||
| Forthcoming Changes | ||
| XML Documentation | ||
| EMBL-Bank User Manual | ||
| EMBL-Bank Feature Table Definition Document | ||
| ETA Format Documentation |
Figure 2.Input pipelines into ENA. Data input pipelines into ENA are shown; the left-hand side of the diagram represents the work of the data generating group and the right-hand side the work of ENA; validation refers to the semantic content of data transferred, particularly the biological content, and loading refers to syntactic checking of data and upload into the appropriate database.
Description of source metadata in EMBL-Bank
| Qualifier type | Qualifiers |
|---|---|
| Molecule-related | /mol_type*, /chromosome, /segment, /clone, /clone_lib, /map, /PCR_primers***, /sub_clone, /tissue_library |
| ‘Parent’ setting | /virion, /proviral, /germline, /rearranged, /focus, /macronuclear, /organelle**, /plasmid |
| ‘Parent’ organism | /organism*, /strain, /variety**, /cultivar, /sub_species**, /sub_strain, /transgenic, /ecotype, /identified_by, /pop_variant, /serotype, /serovar, /sex, speciment_voucher***, /culture_collection***, /bio_material*** |
| ‘Parent’ resolution | /dev_stage, /tissue_type, /cell_line |
| Sampling | /environmental_sample, /lat_lon**, /collected_by, /isolate, /collection_date, /country***, /isolation_source, /lab_host |
| Integrative | /citation**, /db_xref, /label, /specific_host |
| Miscellaneous | /note |
The table shows a classification of source qualifiers used in the description of source molecules in EMBL-Bank. Those marked with * are mandatory, those marked with ** take controlled vocabularies and those marked with *** take additional granular structure.