| Literature DB >> 28348827 |
Lauren A Esposito1, Swati Gupta1, Fraida Streiter1, Ashley Prasad1, John J Dennehy1,2.
Abstract
In an genomics course sponsored by the Howard Hughes Medical Institute (HHMI), undergraduate students have isolated and sequenced the genomes of more than 1,150 mycobacteriophages, creating the largest database of sequenced bacteriophages able to infect a single host, Mycobacterium smegmatis, a soil bacterium. Genomic analysis indicates that these mycobacteriophages can be grouped into 26 clusters based on genetic similarity. These clusters span a continuum of genetic diversity, with extensive genomic mosaicism among phages in different clusters. However, little is known regarding the primary hosts of these mycobacteriophages in their natural habitats, nor of their broader host ranges. As such, it is possible that the primary host of many newly isolated mycobacteriophages is not M. smegmatis, but instead a range of closely related bacterial species. However, determining mycobacteriophage host range presents difficulties associated with mycobacterial cultivability, pathogenicity and growth. Another way to gain insight into mycobacteriophage host range and ecology is through bioinformatic analysis of their genomic sequences. To this end, we examined the correlations between the codon usage biases of 199 different mycobacteriophages and those of several fully sequenced mycobacterial species in order to gain insight into the natural host range of these mycobacteriophages. We find that UPGMA clustering tends to match, but not consistently, clustering by shared nucleotide sequence identify. In addition, analysis of GC content, tRNA usage and correlations between mycobacteriophage and mycobacterial codon usage bias suggests that the preferred host of many clustered mycobacteriophages is not M. smegmatis but other, as yet unknown, members of the mycobacteria complex or closely allied bacterial species.Entities:
Keywords: UPGMA clustering; bacteriophages; codon bias; host range; mycobacteria; viral tRNAs
Mesh:
Substances:
Year: 2016 PMID: 28348827 PMCID: PMC5359403 DOI: 10.1099/mgen.0.000079
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Histogram of tRNA-encoding genes found in the 199 mycobacteriophages analyzed in this study.
Fig. 2.The synonymous codon usage order (SCUO) was calculated for each genome. SCUO values for each mycobacteriophage were compared with the quantity of encoded tRNA genes. tRNA abundance does not predict SCUO. However, if a phage does have tRNA genes, it tends to have a more biased genome.
Fig. 3.Relative synonymous codon usage (RSCU) was calculated for each mycobacteriophage and mycobacterial species and used to reconstruct this UPGMA-dendrogram. Colors correspond to the cluster designations assigned by phagesdb.org.