| Literature DB >> 22669909 |
Shiliang Wang1, Jaideep P Sundaram, Timothy B Stockwell.
Abstract
A gene prediction program, VIGOR (Viral Genome ORF Reader), was developed at J. Craig Venter Institute in 2010 and has been successfully performing gene calling in coronavirus, influenza, rhinovirus and rotavirus for projects at the Genome Sequencing Center for Infectious Diseases. VIGOR uses sequence similarity search against custom protein databases to identify protein coding regions, start and stop codons and other gene features. Ribonucleicacid editing and other features are accurately identified based on sequence similarity and signature residues. VIGOR produces four output files: a gene prediction file, a complementary DNA file, an alignment file, and a gene feature table file. The gene feature table can be used to create GenBank submission. VIGOR takes a single input: viral genomic sequences in FASTA format. VIGOR has been extended to predict genes for 12 viruses: measles virus, mumps virus, rubella virus, respiratory syncytial virus, alphavirus and Venezuelan equine encephalitis virus, norovirus, metapneumovirus, yellow fever virus, Japanese encephalitis virus, parainfluenza virus and Sendai virus. VIGOR accurately detects the complex gene features like ribonucleicacid editing, stop codon leakage and ribosomal shunting. Precisely identifying the mat_peptide cleavage for some viruses is a built-in feature of VIGOR. The gene predictions for these viruses have been evaluated by testing from 27 to 240 genomes from GenBank.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22669909 PMCID: PMC3394299 DOI: 10.1093/nar/gks528
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Comparison data between VIGOR predictions and GenBank annotations of the designated viruses
| Virus name | No. of genomes | No. of GenBank Annotations | No. of VIGOR predictions | No. of same predictions | Partial agreements | Missing gene | New gene | Special features | Mat_peptide prediction |
|---|---|---|---|---|---|---|---|---|---|
| MPV | 27 | 243 | 243 | 235 | 8 | 0 | 0 | Gene overlapping | N/A |
| Measles and mumps viruses | 101 | 759 | 774 | 754 | 5 | 0 | 15 | RNA editing | N/A |
| Norovirus | 98 | 294 | 294 | 294 | 0 | 0 | 0 | No | Yes |
| Parainfluenza virus and Sendai virus | 36 | 249 | 311 | 249 | 0 | 0 | 62 | RNA editing, Ribosomal shunting | N/A |
| RSV | 50 | 535 | 568 | 522 | 13 | 0 | 33 | Gene overlapping | N/A |
| Rubella virus | 46 | 92 | 92 | 91 | 1 | 0 | 0 | No | Yes |
| VEEV and alphavirus | 240 | 478 | 478 | 478 | 0 | 0 | 0 | Stop codon leakage, | Yes |
| YFV and JEV | 170 | 170 | 170 | 170 | 0 | 0 | 0 | No | Yes |
aNot applicable.