| Literature DB >> 28061810 |
Louis J Taylor1,2, Klaus Strebel3.
Abstract
BACKGROUND: Gene knockouts are a common tool used to study gene function in various organisms. However, designing gene knockouts is complicated in viruses, which frequently contain sequences that code for multiple overlapping genes. Designing mutants that can be traced by the creation of new or elimination of existing restriction sites further compounds the difficulty in experimental design of knockouts of overlapping genes. While software is available to rapidly identify restriction sites in a given nucleotide sequence, no existing software addresses experimental design of mutations involving multiple overlapping amino acid sequences in generating gene knockouts.Entities:
Keywords: Bioinformatics; Cloning; Knockout; Knockout virus; Mutation; Overprinting; Pyviko; Viral mutant; Virus
Mesh:
Substances:
Year: 2017 PMID: 28061810 PMCID: PMC5219722 DOI: 10.1186/s12866-016-0920-3
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Fig. 1Examples of gene overprinting in viruses. a Outline showing different types of gene overprinting and associated nucleotide and amino acid sequences. Gene B overprints into gene A in the same direction but a different frame. Genes C and D showcase overprinting in different frames and directions. Start codons in nucleotide sequences are shown in green. b Genome of human immunodeficiency virus type I [7] with annotated genes. tat and rev splicing is indicated by a solid line
Fig. 2Usage of Pyviko in stand-alone scripts and web interface. a Schematic of sample target (Y) and overlapping (X) genes used in code examples. b Minimal Python commands for generating gene knockouts. In this interactive example, “>>>” denotes input into a Python interpreter and output is displayed directly below the input on an unindented line. c Pyviko input interface [14] including sample gene sequences. d Results of analysis for input shown in (c)
Fig. 3Large-scale analysis of viral overprinted genes from the NCBI Nucleotide database. a and b Show counts of the first position at which a directed point mutation can insert a premature stop codon in the overprinted gene without changing the amino acid sequence of the overlapped gene in each gene pair surveyed. Full-size graphs show mutations over the full length of all genes surveyed with a log2 scale to ensure visibility of bins with low counts. Insets show the first 100 codons of each gene with a linear y axis. a Shows counts without requiring a restriction site change and (b) shows counts with a required restriction site change. c and d Show positions of the first stop codon mutants from (a) and (b) expressed as the ratio of the position of the first codon relative to the total length of the gene in codons. c, as in (a), shows counts without requiring a restriction site change and (d), as in (b), shows counts with a required restriction site change