| Literature DB >> 17981842 |
Konstantinos Liolios1, Konstantinos Mavromatis, Nektarios Tavernarakis, Nikos C Kyrpides.
Abstract
The Genomes On Line Database (GOLD) is a comprehensive resource that provides information on genome and metagenome projects worldwide. Complete and ongoing projects and their associated metadata can be accessed in GOLD through pre-computed lists and a search page. As of September 2007, GOLD contains information on more than 2900 sequencing projects, out of which 639 have been completed and their sequence data deposited in the public databases. GOLD continues to expand with the goal of providing metadata information related to the projects and the organisms/environments towards the Minimum Information about a Genome Sequence' (MIGS) guideline. GOLD is available at http://www.genomesonline.org and has a mirror site at the Institute of Molecular Biology and Biotechnology, Crete, Greece at http://gold.imbb.forth.gr/Entities:
Mesh:
Year: 2007 PMID: 17981842 PMCID: PMC2238992 DOI: 10.1093/nar/gkm884
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Metadata types available from GOLD
| Project metadata fields | Number of projects | Organism/ environment metadata | Number of projects |
|---|---|---|---|
| 1. GOLD Project ID | 2905 | 1. Domain | 2905 |
| 2. GCAT ID | 2905 | 2. Phylum | 2905 |
| 3. NCBI Project ID | 1903 | 3. Class | 2905 |
| 4. IMG OID | 829 | 4. Order | 2905 |
| 5. Sequencing method | 797 | 5. Family | 2905 |
| 6. Sequencing coverage | 401 | 6. Genus | 2905 |
| 7. Project type | 2905 | 7. Species | 2905 |
| 8. Sequencing status | 2905 | 8. Strain | 2113 |
| 9. Project status | 1375 | 9. Serovar | 177 |
| 10. Country | 2905 | 10. Taxon ID | 2806 |
| 11. Availability | 2905 | 11. StrainInfo ID | 320 |
| 12. Sequencing center | 2896 | 12. Greengenes ID | 707 |
| 13. Project relevance | 2241 | 13. Culture Collection ID | 595 |
| 14. Funding center | 2108 | 14. Size | 1717 |
| 15. Sequence data | 1160 | 15. Gene number | 991 |
| 16. Database | 1983 | 16. Chromosome number | 793 |
| 17. Publication | 448 | 17. Plasmid number | 777 |
| 18. Release date | 664 | 18. GC% | 1184 |
| 19. Contact name | 2158 | 19. Phenotype | 2123 |
| 20. Contact email | 2150 | 20. Habitat | 1962 |
| 21. Disease | 983 | ||
| 22. Temperature | 626 | ||
| 23. pH | 69 | ||
| 24. Isolation | 1023 | ||
| 25. Symbiont | 122 |
Figure 1.Statistical information available in GOLD. (A) Distribution of the 2995 genome projects across the major sequencing centers. Abbreviations are for, JGI: Joint Genome Institute, TIGR: The Institute for Genome Research, JCVI: J. Craig Venter Institute, WashU: Washington University and WORLD: all other sequencing centers. (B) Distribution of the 1949 bacterial and archaeal genome projects across the major sequencing centers. (C) Phylogenetic distribution of the 790 bacterial genome projects in January 2005. (D) Phylogenetic distribution of the 1832 bacterial genome projects in September 2007.