| Literature DB >> 29642587 |
Darius Kazlauskas1,2, Arvind Varsani3,4, Mart Krupovic5.
Abstract
Numerous metagenomic studies have uncovered a remarkable diversity of circular replication-associated protein (Rep)-encoding single-stranded (CRESS) DNA viruses, the majority of which are uncultured and unclassified. Unlike capsid proteins, the Reps show significant similarity across different groups of CRESS DNA viruses and have conserved domain organization with the N-terminal nuclease and the C-terminal helicase domain. Consequently, Rep is widely used as a marker for identification, classification and assessment of the diversity of CRESS DNA viruses. However, it has been shown that in certain viruses the Rep nuclease and helicase domains display incongruent evolutionary histories. Here, we systematically evaluated the co-evolutionary patterns of the two Rep domains across classified and unclassified CRESS DNA viruses. Our analysis indicates that the Reps encoded by members of the families Bacilladnaviridae, Circoviridae, Geminiviridae, Genomoviridae, Nanoviridae and Smacoviridae display largely congruent evolutionary patterns in the two domains. By contrast, among the unclassified CRESS DNA viruses, 71% appear to have chimeric Reps. Such massive chimerism suggests that unclassified CRESS DNA viruses represent a dynamic population in which exchange of gene fragments encoding the nuclease and helicase domains is extremely common. Furthermore, purging of the chimeric sequences uncovered six monophyletic Rep groups that may represent new families of CRESS DNA viruses.Entities:
Keywords: CRESS DNA viruses; HUH endonuclease domain; recombination; rolling-circle replication initiation proteins; ssDNA viruses; superfamily 3 helicase domain; virus evolution
Mesh:
Substances:
Year: 2018 PMID: 29642587 PMCID: PMC5923481 DOI: 10.3390/v10040187
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Tanglegram of maximum likelihood phylogenetic trees of the nuclease (left) and helicase (right) domains of Reps encoded by circular, Rep-encoding single-stranded (CRESS) DNA viruses. Putative recombinant proteins are joined with red lines. Clades forming distinct groups are marked with colored rectangles. Branches with support lower than 70% were collapsed. The nuclease and helicase phylogenies were inferred using PhyML [32] with the VT + G (VT matrix; G, gamma shape parameter) and rtREV + G + I + F (rtREV amino acid model; G, gamma shape parameter: fixed; I, proportion of invariable sites: fixed; F, equilibrium frequencies: empirical) substitution models, respectively. The alignments contained 267 and 206 aa positions, respectively. Rec, recombinant Rep group.
Figure 2Sequence motifs of Rep proteins from classified and possibly recombinant CRESS DNA viruses. Motifs are presented as sequence logos and those containing predicted recombinations are joined with grey rectangles. Residues are colored according to their chemical properties (polar, green; basic, blue; acidic, red; hydrophobic, black; neutral, purple).
Figure 3Phylogenetic tree of CRESS viruses generated after removing possible recombinant proteins. The tree is based on full-length Rep amino acid sequences. Clades belonging to the same group have the same colors as in Figure 1. Edges having support lower than 70% were contracted. The phylogeny was inferred using PhyML [32] with the rtREV+G+F (rtREV amino acid model; G, gamma shape parameter: fixed; F, equilibrium frequencies: empirical) substitution model using the alignment containing 412 positions.