| Literature DB >> 29132309 |
Oriah Mioduser1, Eli Goz1,2, Tamir Tuller3,4,5.
Abstract
BACKGROUND: Viruses undergo extensive evolutionary selection for efficient replication which effects, among others, their codon distribution. In the current study, we aimed at understanding the way evolution shapes the codon distribution in early vs. late viral genes in terms of their expression during different stages in the viral replication cycle. To this end we analyzed 14 bacteriophages and 11 human viruses with available information about the expression phases of their genes.Entities:
Keywords: Bacteriophage genome evolution; Coding regions; Codon usage bias (CUB); Synthetic virology; Viral evolution; Viral life cycle
Mesh:
Substances:
Year: 2017 PMID: 29132309 PMCID: PMC5683454 DOI: 10.1186/s12864-017-4248-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1The research outline of the study. The details can be found in the main text: Our analysis was based on coding sequences of 14 bacteriophages and 11 human viruses (A.), and on the ribo-seq measurements of bacteriophage Lambda and its E.coli host (B.). Basing on the existing literature, classification of the viral genes to early and late (with respect to the beginning of the lytic phase) was derived (C.). A., B., and C, were used to perform a comprehensive comparative genomics analysis of differential synonymous codon usage in early and late genes (D.), as well as of additional genomic features possibly related to codon bias (E.), such as: ribo-seq based codon typical decoding rates (TDR), Transfer Ribonucleic Acid (tRNA) adaptation indexes (tAI), effective number of codons (ENC), codon pairs bias (CPB), amino acids bias (AAB), dinucleotide bias (DNTB), nucleotide bias (NTB), GC content, number of genes in each temporary group and their length
Fig. 2Principal component analysis (PCA) of RSCF vectors for bacteriophages with significant separation in codon usage between early (blue circles) and late (red circles) genes. In order to visualize the clustering, PCA was applied to project the RSCF vectors to a plane spanned by their first two principal components. In order to visualize the separation between clusters a maximum margin separation line, a line for which the Euclidian distance between it and the nearest point from either of the groups is maximized, was calculated and plotted. The significance of cluster separation was assessed by comparing the Davies-Bouldin cluster score to the randomized scores obtained from 100 permutations of gene temporary (early or late) labels. The variances % of the first two principal components are mentioned in the figures axis
Fig. 3Comparative analysis of early and late genes in 14 different bacteriophages. Details can be found in the main text. a A phylogenetic tree built from complete phage proteomes using ARS distance (see Materials and Methods). Phages with significant differences in temporary codon usage are marked by blue. b Viruses with significant (p-value <0.05) separation between early and late genes w.r.t synonymous codons or AA are marked by yellow stars. c Significance of separation between early and late genes w.r.t additional genomic features estimated by Wilcoxon ranksum p-value. Features/viruses with significant (p-value <0.05) separation between the two temporal groups are marked by yellow stars; green is related to higher mean in the case of the early genes and red is related to higher mean in the case of the late genes