| Literature DB >> 21994619 |
James Rodney Brister1, Yiming Bao1, Carla Kuiken2, Elliot J Lefkowitz3, Philippe Le Mercier4, Raphael Leplae5, Ramana Madupu6, Richard H Scheuermann7, Seth Schobel6, Donald Seto8, Susmita Shrivastava6, Peter Sterk9, Qiandong Zeng10, William Klimke1, Tatiana Tatusova1.
Abstract
Improvements in DNA sequencing technologies portend a new era in virology and could possibly lead to a giant leap in our understanding of viral evolution and ecology. Yet, as viral genome sequences begin to fill the world's biological databases, it is critically important to recognize that the scientific promise of this era is dependent on consistent and comprehensive genome annotation. With this in mind, the NCBI Genome Annotation Workshop recently hosted a study group tasked with developing sequence, function, and metadata annotation standards for viral genomes. This report describes the issues involved in viral genome annotation and reviews policy recommendations presented at the NCBI Annotation Workshop.Entities:
Keywords: annotation; genome; virus
Year: 2010 PMID: 21994619 PMCID: PMC3185566 DOI: 10.3390/v2102258
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.818
Figure 1Cumulative number of validated full-length virus genome sequence records deposited in GenBank from 1993 to 2010 1,2
1 Only GenBank sequence records validated by RefSeq curators as full-length genomes are included.
2 Individual viral segments are included in tabulations, not complete constellations.
Total number of full-length virus genome sequence records in GenBank.
| Total virus genomes | 27059 |
| dsDNA virus genomes | 2419 |
| ssDNA virus genomes | 3607 |
| dsRNA virus genomes | 4861 |
| ssRNA virus genomes | 10665 |
| ssRNA negative-strand virus genomes | 1858 |
| ssRNA positive-strand virus genomes | 8801 |
| unassigned/unclassified ssRNA genomes | 6 |
| Retro-transcribing virus genomess | 4553 |
| Deltavirus genomes | 134 |
| Satellite genomes | 820 |
| Virus genomes without classification | 42 |
Only GenBank sequence records validated by RefSeq curators as full-length genomes are included.
Individual viral segments are included in tabulations, not complete constellations.