Francesco Marass, Chris Upton.
Abstract
BACKGROUND: The volume of viral genomic sequence data continues to increase rapidly. This is especially true for the smaller RNA viruses, which are relatively easy to sequence in large numbers. The data volumes cause a number of significant problems for research applications that require large multiple alignments of essentially complete genomes, which are of the order of 10 kb.
Year: 2009 PMID: 19457259 PMCID: PMC2694821 DOI: 10.1186/1756-0500-2-91
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Figure 1. Use of tags to create an alignment. To create this alignment, the query sequence is read against the reference sequence until after amino acid #8 (reference sequence numbering), then a gap of 3 residues is inserted in the query (tag 8-3; deletion of length 3 in query). The sequences continue until amino acid #21 (reference sequence numbering), then a gap of 1 residue is inserted in the reference sequence (tag 21g1; insertion of 1 amino acid in query sequence). No substitution tags are needed in this example because both amino acid sequences have been stored.
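The procedure described in the Figure 1 caption can be sketched in a few lines of Python. This is an illustration only, not the KISSa implementation: the function name `align_pair`, the toy sequences, and the reading of a tag position as "after reference residue N" (taken from the caption wording) are all assumptions.

```python
import re

# Assumed grammar for this sketch: <reference position><op><length>,
# where "-" puts a gap in the query and "g" puts a gap in the reference.
_GAP_TAG = re.compile(r"(\d+)([-g])(\d+)")


def align_pair(reference, query, tags):
    """Rebuild a gapped pairwise alignment from ungapped sequences and gap tags.

    Positions are interpreted as "after reference residue N" (an assumption
    based on the Figure 1 caption).
    """
    parsed = []
    for tag in tags:
        match = _GAP_TAG.fullmatch(tag)
        if match is None:
            raise ValueError(f"unsupported tag: {tag!r}")
        parsed.append((int(match.group(1)), match.group(2), int(match.group(3))))
    parsed.sort(key=lambda item: item[0])

    ref_row, query_row = [], []
    ref_i = query_i = 0
    for pos, op, length in parsed:
        span = pos - ref_i
        if span < 0:
            raise ValueError("tags overlap")
        # Copy the stretch where both sequences run in parallel.
        ref_row.append(reference[ref_i:pos])
        query_row.append(query[query_i:query_i + span])
        query_i += span
        ref_i = pos
        if op == "-":
            # Deletion in the query: reference residues face gap characters.
            ref_row.append(reference[ref_i:ref_i + length])
            query_row.append("-" * length)
            ref_i += length
        else:
            # Insertion in the query: gap characters go into the reference row.
            ref_row.append("-" * length)
            query_row.append(query[query_i:query_i + length])
            query_i += length
    # Copy whatever remains after the last tag.
    ref_row.append(reference[ref_i:])
    query_row.append(query[query_i:])
    return "".join(ref_row), "".join(query_row)


# Made-up toy sequences (letters stand in for residues).
reference = "ABCDEFGHIJKLMNOPQRSTUVWXY"
query = "ABCDEFGHLMNOPQRSTUZVWXY"
ref_row, query_row = align_pair(reference, query, ["8-3", "21g1"])
print(ref_row)
print(query_row)
```

Because the query sequence itself is stored, only the two gap tags are needed to recover both aligned rows, which matches the caption's remark about substitution tags.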
Figure 2. Web interface for the demonstration of KISSa. Four menus are used to select polyproteins from different dengue virus genotypes. Results can be viewed in a browser window or exported in mFASTA format for loading into an MSA viewer/editor such as Base-By-Base.
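For readers unfamiliar with mFASTA (multi-FASTA), the export step mentioned in the Figure 2 caption amounts to writing each aligned row under a `>` header line. The sketch below is a generic illustration, not KISSa's export code; the function name `to_mfasta`, the 60-character line width, and the sample names are assumptions.

```python
def to_mfasta(records, width=60):
    """Format (name, aligned_sequence) pairs as multi-FASTA text.

    Sequences are wrapped by plain slicing so that gap characters ("-")
    never influence where a line is broken.
    """
    lines = []
    for name, sequence in records:
        lines.append(">" + name)
        for start in range(0, len(sequence), width):
            lines.append(sequence[start:start + width])
    return "\n".join(lines) + "\n"


# Hypothetical aligned rows; real input would come from the alignment step.
print(to_mfasta([("reference", "ABCDEF"), ("query", "AB--EF")], width=4), end="")
```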
Examples of KISSa tags.
| Tag | Consequence |
| 3-1 | Deletion of length 1, starting at position 3 in the reference sequence |
| 17sT | Replace nucleotide at position 17 in the reference sequence with T |
| 5+ME | Insert two amino acids (ME) starting at position 5 in the reference sequence |
| 6g3 | Insert a gap of length 3 at position 6 |
| 0-0 | The no-operation tag, a deletion of zero characters |
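The five example tags share a visible pattern: a reference position, a one-character operator, and an argument. A minimal parser for that pattern might look like the following. The grammar is inferred from the table alone, and the names `Tag` and `parse_tag` are hypothetical; the actual KISSa tag format may allow forms not shown here.

```python
import re
from typing import NamedTuple, Union


class Tag(NamedTuple):
    position: int          # position in the reference sequence
    op: str                # "-" deletion, "s" substitution, "+" insertion, "g" gap
    arg: Union[int, str]   # length for "-" and "g", residues for "s" and "+"


# Assumed grammar, inferred from the example table: <digits><op><argument>.
_TAG_RE = re.compile(r"(\d+)([-sg+])(\w+)")


def parse_tag(text):
    """Split a tag such as '17sT' into position, operator, and argument."""
    match = _TAG_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"not a recognised tag: {text!r}")
    position = int(match.group(1))
    op = match.group(2)
    raw = match.group(3)
    if op in ("-", "g"):
        # Deletions and gaps carry a numeric length.
        if not raw.isdigit():
            raise ValueError(f"expected a length in tag: {text!r}")
        return Tag(position, op, int(raw))
    # Substitutions and insertions carry one or more residue letters.
    if not raw.isalpha():
        raise ValueError(f"expected residues in tag: {text!r}")
    return Tag(position, op, raw.upper())


for example in ["3-1", "17sT", "5+ME", "6g3", "0-0"]:
    print(example, parse_tag(example))
```

Keeping the argument typed (integer length versus residue string) makes it easy for later code to tell gap-producing tags from tags that carry sequence data.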
Example KISSa alignments. Number of differences refers to the total number of insertions and deletions in all sequences.
| Genotypes | Number of sequences | Time for KISSa MSA (s) | Number of differences |
| DENV 1 | 528 | 3.970 | 8 deletions, 4 insertions |
| DENV 2 | 558 | 0.284 | 3 deletions |
| DENV 3 | 236 | 0.697 | 1 insertion |
| DENV 4 | 31 | 0.164 | 3 deletions |
| DENV 1,2 | 1086 | 10.607 | 1127 deletions, 562 insertions |
| DENV 1,2,3 | 1322 | 20.289 | 1599 deletions, 1035 insertions |
| DENV 1,2,3,4 | 1354 | 25.022 | 1755 deletions, 1129 insertions |