| Literature DB >> 28458915 |
Alison T M Lima1,2, José C F Silva2, Fábio N Silva1,2, Gloria P Castillo-Urquiza1,2, Fabyano F Silva3, Yee M Seah4, Eduardo S G Mizubuti1, Siobain Duffy4, F Murilo Zerbini1,2.
Abstract
Begomoviruses (single-stranded DNA plant viruses) are responsible for serious agricultural threats. Begomovirus populations exhibit a high degree of within-host genetic variation and evolve as quickly as RNA viruses. Although the recombination-prone nature of begomoviruses has been extensively demonstrated, the relative contribution of recombination and mutation to the genetic variation of begomovirus populations has not been assessed. We estimated the genetic variability of begomovirus datasets from around the world. An uneven distribution of genetic variation across the length of the cp and rep genes due to recombination was evident from our analyses. To estimate the relative contributions of recombination and mutation to the genetic variability of begomoviruses, we mapped all substitutions over maximum likelihood trees and counted the number of substitutions on branches which were associated with recombination (ηr) and mutation (ημ). In addition, we also estimated the per generation relative rates of both evolutionary mechanisms (r/μ) to express how frequently begomovirus genomes are affected by recombination relative to mutation. We observed that the composition of genetic variation in all begomovirus datasets was dominated by mutation. Additionally, the low correlation between the estimates indicated that the relative contributions of recombination and mutation are not necessarily a function of their relative rates. Our results show that, although a considerable fraction of the genetic variation levels could be assigned to recombination, it was always lower than that due to mutation, indicating that the diversification of begomovirus populations is predominantly driven by mutational dynamics.Entities:
Keywords: evolution; geminivirus; genetic variability; phylogeny.
Year: 2017 PMID: 28458915 PMCID: PMC5399926 DOI: 10.1093/ve/vex005
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Figure 1.Statistical significance of the differences amongst the average pairwise number of nucleotide differences per site (nucleotide diversity, π) calculated for full-length genomes of begomoviruses. Ninety-five percent bootstrap confidence intervals (CIs) for the difference between π values were estimated from 15,000 nonparametric simulations in R software (R Development Core Team 2007) using the Simpleboot statistical package (Peng 2008). Confidence intervals which include the value ‘zero’ denote no statistically significant difference between the means.
Mean dN/dS values estimated for the cp and rep genes of begomovirus datasets retrieved from the GenBank database.
| Species dataset | Acronym | ||
|---|---|---|---|
| ACMV | 0.216 | 0.218 | |
| AgEV | 0.068 | 0.291 | |
| BhYVIV | 0.164 | 0.274 | |
| BhYVMV | 0.133 | 0.228 | |
| CLCuBuV | 0.684 | 0.615 | |
| CLCuGV | 0.059 | 0.178 | |
| CLCuMV | 0.124 | 0.312 | |
| EACMV | 0.136 | 0.191 | |
| MYMIV | 0.141 | 0.209 | |
| SPLCV | 0.073 | 0.186 | |
| ToLCNDV | 0.093 | 0.172 | |
| ToLCTV | 0.144 | 0.229 | |
| TYLCTV | 0.102 | 0.260 | |
| ToYLCCNV | 0.103 | 0.236 | |
| TYLCV | 0.198 | 0.270 |
Figure 2.Statistical significance of the differences amongst the average pairwise number of nucleotide differences per site (nucleotide diversity, π) calculated for the cp and rep genes of begomoviruses. Ninety-five percent bootstrap confidence intervals (CIs) for the difference between π values (πcp–πrep) were estimated from 15,000 nonparametric simulations in R software (R Development Core Team 2007) using the Simpleboot statistical package (Peng 2008). Confidence intervals which include the value ‘zero’ denote no statistically significant difference between the means.
Figure 3.Statistical significance of the differences amongst the average pairwise number of nucleotide differences per site (nucleotide diversity, π) calculated for the 5′-terminal, central and 3′-terminal regions of the cp and rep genes of begomoviruses. The whole sequences of the cp and rep genes were partitioned into three segments of identical lengths and the π values were calculated using an algorithm written in Python programming language. Ninety-five percent bootstrap confidence intervals (CIs) for the difference between π values were estimated from 15,000 nonparametric simulations in R software (R Development Core Team 2007) using the Simpleboot statistical package (Peng 2008). Confidence intervals which include the value ‘zero’ denote no statistically significant difference between the means.
Figure 4.Average pairwise number of nucleotide differences per site (nucleotide diversity, π) at synonymous (a, c) and nonsynonymous (b, d) sites calculated for three distinct regions (5′-terminal, central and 3′-terminal) of the cp (a, b) and rep (c, d) genes of begomoviruses using DnaSP v.5.
Figure 5.Midpoint-rooted maximum likelihood trees based on the EACMV cp (a) and rep (b) gene nucleotide sequences. Nodes to the right of branches with bootstrap support ≥90% are indicated by filled circles, and those with support between 70% and 89% by empty circles. The unique recombination events detected within the cp (event 1) and rep sequences (events 2 and 3) are shown as diagrams close to the branches where the substitutions due to each recombination event were mapped. Branches in red, blue and green colors were assigned to events 1, 2 and 3, respectively.
Figure 6.Statistical significance of the differences between branch lengths associated with recombination (BLr) and mutation (BLμ) in ML phylogenetic trees for the cp (a) and rep (b) genes. Ninety-five percent bootstrap confidence intervals (CIs) for the difference between branch lengths were estimated from 15,000 nonparametric simulations in R software (R Development Core Team 2007) using the Simpleboot statistical package (Peng 2008). Confidence intervals which include the value ‘zero’ denote no statistically significant difference between the means.
The relative contribution (ηr/ημ) and relative rates (ρ/θ) of recombination and mutation for the cp and rep genes of begomovirus datasets retrieved from the GenBank database.
| Dataset | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ηr | ημ | ηr/ημ | ρ | θw | ρ/θw | ηr | ημ | ηr/ημ | ρ | θw | ρ/θw | |
| ACMV | 0 | 312 | 0.000 | 5.101 | 35.994 | 0.142 | 0 | 694 | 0.000 | 5.101 | 48.147 | 0.106 |
| AgEV | 55 | 93 | 0.591 | 1.020 | 26.128 | 0.039 | 70 | 103 | 0.680 | 1.020 | 32.242 | 0.032 |
| BhYVIV | 105 | 503 | 0.209 | 12.242 | 52.560 | 0.233 | 360 | 527 | 0.683 | 4.081 | 81.912 | 0.050 |
| BhYVMV | 184 | 192 | 0.958 | 7.141 | 40.907 | 0.175 | 511 | 775 | 0.659 | 7.141 | 86.986 | 0.082 |
| CLCuBuV | 99 | 152 | 0.651 | 0.000 | 42.089 | 0.000 | 145 | 190 | 0.763 | 4.081 | 47.377 | 0.086 |
| CLCuGV | 98 | 138 | 0.710 | 1.020 | 29.092 | 0.035 | 56 | 317 | 0.177 | 3.061 | 40.446 | 0.076 |
| CLCuMV | 114 | 189 | 0.603 | 2.040 | 41.577 | 0.049 | 54 | 433 | 0.125 | 2.040 | 59.920 | 0.034 |
| EACMV | 104 | 541 | 0.192 | 1.020 | 15.825 | 0.064 | 61 | 1372 | 0.044 | 5.101 | 13.692 | 0.373 |
| MYMIV | 11 | 238 | 0.046 | 2.040 | 32.768 | 0.062 | 13 | 342 | 0.038 | 3.061 | 39.973 | 0.077 |
| SPLCV | 166 | 290 | 0.572 | 4.081 | 33.656 | 0.121 | 164 | 985 | 0.166 | 13.263 | 73.623 | 0.180 |
| ToLCNDV | 128 | 644 | 0.199 | 14.283 | 42.452 | 0.336 | 160 | 1136 | 0.141 | 11.222 | 77.347 | 0.145 |
| ToLCTV | 1 | 163 | 0.006 | 3.061 | 23.889 | 0.128 | 146 | 301 | 0.485 | 1.020 | 59.131 | 0.017 |
| TYLCTV | 46 | 132 | 0.348 | 2.040 | 29.867 | 0.068 | 37 | 309 | 0.120 | 2.040 | 49.779 | 0.041 |
| ToYLCCNV | 40 | 444 | 0.090 | 2.040 | 46.384 | 0.044 | 263 | 406 | 0.648 | 1.020 | 84.907 | 0.012 |
| TYLCV | 45 | 420 | 0.107 | 5.101 | 5.315 | 0.960 | 41 | 1040 | 0.039 | 2.040 | 29.320 | 0.070 |
ACMV, African cassava mosaic virus; AgEV, Ageratum enation virus; BhYVIV, Bhendi yellow vein India virus; BhYVMV, Bhendi yellow vein mosaic virus; CLCuBuV, Cotton leaf curl Burewala virus; CLCuGV, Cotton leaf curl Gezira virus; CLCuMV, Cotton leaf curl Multan virus; EACMV, East African cassava mosaic virus; MYMIV, Mungbean yellow mosaic India virus; SPLCV, Sweet potato leaf curl virus; ToLCNDV, Tomato leaf curl new Delhi virus; ToLCTV, Tomato leaf curl Taiwan virus; TYLCTV, Tomato yellow leaf curl Thailand virus; ToYLCCNV, Tomato yellow leaf curl China virus; TYLCV, Tomato yellow leaf curl virus.
Number of substitutions due to recombination over the ML phylogenetic tree.
Number of substitutions due to mutation over the ML phylogenetic tree.
Population-scaled recombination rate estimated using a likelihood lookup table for a per site mutation rate of 0.001.
Watterson’s infinite-sites estimator of the population-scaled mutation rate (θ).
Equivalent to r/μ.