| Literature DB >> 29583117 |
Abstract
In Bacteria, a working consensus of species circumscription may have been reached and one of the most prominent criteria is high average nucleotide identity (ANI). ANI in effect groups strains that may recombine more or less frequently, depending on their biology, as opposed to rare interspecies gene transfer. For bacteriophages, which show various lifestyles, the nature of the fundamental natural unit, if it exists in a biological sense, is not well understood and defined. The approaches based on dot-plots are useful to group similar bacteriophages, yet are not quantitative and use arbitrarily set cut-offs. Here, we focus on lytic Myoviridae and test the ANI metric for group delineation. We show that ANI-based groups are in agreement with the International Committee on Taxonomy of Viruses (ICTV) classification and already established dot-plot groups, which are occasionally further refined owing to higher resolution of ANI analysis. Furthermore, these groups are separated among themselves by clear ANI discontinuities. Their members readily exchange core genes with each other while they do not with bacteriophages of other ANI groups, not even with the most similar. Thus, ANI-delineated groups may represent the natural units in lytic Myoviridae evolution with features that resemble those encountered in bacterial species.Entities:
Keywords: Myoviridae; average nucleotide identity; evolution; lytic; recombination
Mesh:
Substances:
Year: 2018 PMID: 29583117 PMCID: PMC5885020 DOI: 10.1099/mgen.0.000169
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.ANI analysis of the 186 lytic Myovridae bacteriophages. (a) Histogram of ANI values showing a discontinuity centred on 85 %. (b) ANIb versus ANIb coverage (the proportion of the bacteriophage genome on which the ANI was actually calculated, the shared DNA).
Evidence for recombination in members of the more populous ANIb-delineated bacteriophage groups
The CADM test measures the similarity among distance matrices obtained for all core genes of a sample. It is given as the Kendall coefficient, where 1 indicates complete agreement among distance matrices of all genes and 0 the opposite. A low P-value indicates that at least two matrices are similar and the total incongruence is rejected.
| Group name (number of phages) | All genes (50 % AAI, 50 % coverage) | Core genome | Genes exhibiting recombination (pairwise homoplasy test) | Clonal frame, portion of core DNA (range for all tree branches) | CADM, all core genes W, |
|---|---|---|---|---|---|
| T4 (32) | 434 | 183 | 123 | 0.86–0.98 | 0.16, |
| Felix (17) | 224 | 88 | 48 | 0.80–0.97 | 0.27, |
| FV3 (11) | 314 | 175 | 74 | 0.81–0.94 | 0.32, |
| RB69 (9) | 334 | 203 | 73 | 0.90–0.97 | 0.12, |
| Phax (9) | 383 | 135 | 55 | 0.71–0.91 | 0.06, |
| KP15 (6) | 335 | 207 | 40 | 0.89–0.96 | 0.09, |
| Ea35-70 (5) | 341 | 307 | 55 | 0.93–0.97 | 0.03, |
| Marshall (5) | 268 | 148 | 17 | 0.66–0.88 | 0.18, |
Fig. 2.The phylogenetic congruence in samples containing bacteriophages of different ANI-delineated groups and putatively recently diverged bacteriophages (‘satellites’). The groups and bacteriophages were chosen to cover the transition zone from highly similar genomes to clearly separated groups displaying 75 % ANI. Starting with the group IME08, successive designated satellite phages and finally whole SP18 group phages were added. (a) The CADM test applied to shared genes of each sample. W: the Kendall coefficient of concordance among gene distance matrices. W may range from 1 (distance matrices for every gene are the same) to 0 (no similarity in matrices). The boxplots above represent Mantel correlations among rank-transformed distance matrices for all shared genes. By adding progressively more distant bacteriophages, the distance matrices are becoming more similar, i.e. the same bacteriophage clusters are recorded in more and more matrices/genes. Below is the significance of the correlations where the red dotted line indicates P=0.05. For sample IME08, most matrix correlations are expected to arise by chance, while for the sample containing the SP18 group, the correlations are significant. (b) Consensus network constructed for a sample containing all of the bacteriophages in (a), illustrating the CADM results. There is clear clustering between ANIb-delineated groups but a relative lack of it at the intragroup level.