| Literature DB >> 24465629 |
Ryosuke Yasaka1, Huy D Nguyen2, Simon Y W Ho3, Sebastián Duchêne3, Savas Korkmaz4, Nikolaos Katis5, Hideki Takahashi6, Adrian J Gibbs7, Kazusato Ohshima2.
Abstract
Cauliflower mosaic virus (CaMV) is a plant pararetrovirus with a double-stranded DNA genome. It is the type member of the genus Caulimovirus in the family Caulimoviridae. CaMV is transmitted by sap inoculation and in nature by aphids in a semi-persistent manner. To investigate the patterns and timescale of CaMV migration and evolution, we sequenced and analyzed the genomes of 67 isolates of CaMV collected mostly in Greece, Iran, Turkey, and Japan together with nine published sequences. We identified the open-reading frames (ORFs) in the genomes and inferred their phylogeny. After removing recombinant sequences, we estimated the substitution rates, divergence times, and phylogeographic patterns of the virus populations. We found that recombination has been a common feature of CaMV evolution, and that ORFs I-V have a different evolutionary history from ORF VI. The ORFs have evolved at rates between 1.71 and 5.81×10(-4) substitutions/site/year, similar to those of viruses with RNA or ssDNA genomes. We found four geographically confined lineages. CaMV probably spread from a single population to other parts of the world around 400-500 years ago, and is now widely distributed among Eurasian countries. Our results revealed evidence of frequent gene flow between populations in Turkey and those of its neighboring countries, with similar patterns observed for Japan and the USA. Our study represents the first report on the spatial and temporal spread of a plant pararetrovirus.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24465629 PMCID: PMC3897471 DOI: 10.1371/journal.pone.0085641
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Multidimensional scaling of tree-to-tree patristic distances.
ORF I vs ORF III isolates (A); ORF I vs ORF VI isolates (B); ORFs I–V vs ORF VI isolates (C); ORFs I–V Group A vs ORF VI isolates (D); and ORFs I–V Group B vs ORF VI isolates (E).
Tentative and clear recombination sites in Cauliflower mosaic virus genomes.
| Isolate | Position (nt) | ORF | Parental isolate | Recombination detection program |
| |
| Major | Minor | |||||
| B29 | 3296-3946 | IV–V | TUR50 | Unknown (TUR4) | B | 3.81×10−9 |
| 5996-7341 | VI | Unknown (TUR50) | TUR4 | RGBMC | 2.14×10−31 | |
| BBC | 3259-3946 | IV–V | TUR50 | Unknown (TUR4) | B | 6.48×10−10 |
| 4214-5995 (UD) | V | Unknown (TUR263) | TUR50 | R | 3.41×10−17 | |
| Cabbage S | 3298-4078 | IV–V | TUR50 | Unknown (TUR4) | GB | 2.02×10−9 |
| 6239-74 | VI–VII | TUR285 | CM1841 | RGBMC | 7.43×10−31 | |
| CM1841 | 3259-4071 | IV–V | TUR50 | Unknown (TUR4) | B | 3.80×10−10 |
| 4214-5995 | V–VI | Unknown (TUR263) | TUR50 | R | 3.42×10−15 | |
| CMV-1 | 3259-4031 | IV–V | TUR50 | Unknown (TUR4) |
| 2.66×10−10 |
| 5887-195 | VI–VII | Unknown (TUR4) | TUR50 | RGBMC | 7.84×10−34 | |
| CRO180A | 5996-7362 | VI | TUR50 | Unknown (TUR4) | R | 3.10×10−31 |
| D/H | 5957-82 | VI–VII | Unknown (TUR50) | TUR4 | RGBMC | 4.21×10−35 |
| GRC83 | 7240-15 | VI–VII | GRC86D | BBC | RG | 1.32×10−26 |
| GRC84B | 7240-15 | VI–VII | GRC86D | BBC | RG | 1.37×10−24 |
| GRC86B | 4318-7239 | VI | GRC84B | TUR216 | RBM | 3.21×10−10 |
| GRC86D | 7348-615 | VI–VII | TUR94 | Unknown (CM1841) | RG | 7.28×10−17 |
| GRC87E | 7348-615 | VI–VII | TUR94 | Unknown (CM1841) | RG | 3.18×10−13 |
| GRC87G | 7348-615 | VI–VII | TUR94 | Unknown (CM1841) | RG | 4.21×10−14 |
| GRC91B | 7348-615 | VI–VII | TUR94 | Unknown (CM1841) | RG | 2.63×10−15 |
| GRC92A | 7348-615 | VI–VII | TUR94 | Unknown (CM1841) | RG | 7.99×10−15 |
| GRC92C | 7348-615 | VI–VII | TUR94 | Unknown (CM1841) | RG | 1.83×10−14 |
| GRC92D | 7348-504- | VI–VII | TUR94 | Unknown (CM1841) | RG | 1.15×10−17 |
| IRN1 | 5969-102 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 2.07×10−35 |
| IRN2 | 5969-102 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 7.68×10−35 |
| IRN3 | 5969-102 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 2.07×10−35 |
| IRN4 | 5996-195 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 4.08×10−34 |
| IRN5 | 5996-208 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 1.58×10−34 |
| IRN6 | 5944-180 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 4.70×10−34 |
| IRN7 | 5969-76 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 8.64×10−34 |
| IRN8 | 5962-208 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 4.48×10−35 |
| IRN9 | 5965-64 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 1.53×10−33 |
| IRN10 | 5967-7342 | VI | TUR50 | Unknown (TUR4) | RGBMC | 3.77×10−34 |
| IRN11 | 5969-42 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 1.44×10−34 |
| IRN12 | 5965-7342 | VI | TUR50 | Unknown (TUR4) | RGBMC | 2.59×10−34 |
| IRN13 | 5965-180 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 1.45×10−33 |
| IRN14 | 5952-99 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 2.46×10−33 |
| IRN18 | 5969-212 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 1.57×10−36 |
| IRN19 | 5965-64 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 1.18×10−35 |
| IRN21 | 5996-180 | VI–VII | TUR50 | Unknown (TUR4) | RGBMC | 2.93×10−34 |
| JPNHGB340 | 3259-3946 | IV–V | TUR50 | Unknown (TUR4) | B | 5.12×10−9 |
| 5996-7341 | VI | Unknown (TUR4) | TUR50 | RGBMC | 5.81×10−33 | |
| JPNKWB778 | 3265-3946 | IV–V | TUR50 | Unknown (TUR4) | B | 3.72×10−9 |
| 5965-7341 | VI | Unknown (TUR4) | TUR50 | RGBMC | 3.35×10−33 | |
| JPNM | 4214-5964 | V–VI | Unknown (TUR263) | TUR50 | R | 1.26×10−15 |
| JPNN | 5996-7361 | VI | Unknown (TUR4) | TUR50 | RGBMC | 1.35×10−34 |
| JPNS1 | 3259-3946 | IV–V | TUR50 | Unknown (TUR4) |
| 9.43×10−9 |
| 5996-269 | VI–VII | Unknown (TUR50) | TUR4 | RGBMC | 1.19×10−36 | |
| JPNS2 | 3259-3946 | IV–V | TUR50 | Unknown (TUR4) |
| 5.74×10−9 |
| 5996-269 | VI–VII | Unknown (TUR50) | TUR4 | RGBMC | 1.19×10−36 | |
| JPNUV1 | 4214-5964 | V–VI | Unknown (TUR263) | TUR50 | R | 1.26×10−15 |
| JPNUV26 | 4214-5964 | V–VI | Unknown (TUR263) | TUR50 | R | 1.15×10−16 |
| JPNTKD762 | 3242-3989 | IV–V | TUR50 | Unknown (TUR4) |
| 2.17×10−8 |
| 5881-210 | VI–VII | Unknown (TUR50) | TUR4 | RGBMC | 5.12×10−35 | |
| NY8153 | 3296-3946 | IV–V | TUR50 | Unknown (TUR4) |
| 3.20×10−11 |
| 2104 (UD) - 5896 (UD) | IV–VI | Unknown (TUR263) | TUR50 |
| 8.95×10−13 | |
| 5909-164 | VI–VII | Unknown (TUR4) | TUR50 | RGBMC | 5.70×10−35 | |
| TUR2 | 399-1261 | I–II | TUR249 | TUR59 | RB | 2.87×10−5 |
| TUR34 | 4438-5876 | V–VI | Unknown (TUR285) | TUR278 |
| 1.39×10−6 |
| TUR59 | 4511-5948 | V–VI | TUR278 | Unknown (TUR285) | M | 4.61×10−7 |
| 5996-164 | VI–VII | TUR4 | Unknown (TUR50) | RGBMC | 8.65×10−34 | |
| TUR214 | 1772-2108 | III–IV | TUR2 | TUR12 | B | 2.49×10−6 |
| TUR216 | 2832-4937 (UD) | IV–V | TUR249 | Unknown (TUR2) | B | 2.03×10−16 |
| 5324-7347 | VI | Unknown (TUR306) | GRC92D |
| 1.90×10−5 | |
| TUR220 | 5539-6357 | VI | TUR81 | Unknown (TUR285) |
| 5.22×10−5 |
| TUR239 | 34 (UD) -1034 | I | Unknown (TUR4) | TUR244 | RGB | 3.09×10−10 |
| 1857 (UD) -2799 | V | GRC83 | Unknown (IRN2) |
| 3.71×10−5 | |
| 4365-5326 (UD) | V–VI | TUR50 | TUR4 | BM | 6.57×10−11 | |
| TUR289 | 471 (UD) -2485 | I–IV | TUR84 | Unknown (TUR306) | RGBMC | 2.00×10−9 |
| TUR306 | 1831-2512 | III–IV | Unknown (TUR94) | TUR84 | B | 4.52×10−9 |
| W260 | 3259-3946 | IV–V | TUR50 | Unknown (TUR4) | B | 2.54×10−9 |
| Xinjing | 627-1661 | I–III | Unknown (IRN19) | IRN21 | R | 1.96×10−5 |
Recombination sites detected in the CaMV genomes by the recombination detection programs (listed in column 6), from the aligned sequences of the likely recombinant and its ‘parental isolates’. The nucleotide position shows locations of individual genes numbered as in Xinjing genome (AF140604). UD; Undetermined.
Recombinant isolates identified by the recombination detection programs: R (RDP), G (GENECONV), B (BOOTSCAN), M (MAXCHI), C (CHIMAERA) and SR (SISCAN) programs in RDP4, and SO (SISCAN total nucleotide site analysis) in original SISCAN version 2 and P (PHYLPRO) programs. The analyses were done using default settings and a Bonferroni-corrected P-value cut-off of 0.01 in RDP4.
The reported P-value is for the program in bold type and underlined in RDP4 and is the smallest P-value among the isolates calculated for the region in question. P-values smaller than 1.0×10−5 are listed.
Figure 2Cluster-based analysis of population subdivision using Structure.
The results are grouped by population of origin for each individual. Each individual is represented by a column. The number of clusters is indicated by the value of K: ORFs I–V, K = 6 (A), ORF VI, K = 5 (B). The colour proportion for each bar represents the posterior probability of assignment of each individual to one of six clusters (A) and one of five clusters (B) of genetic similarity. Clusterings correspond to those shown in Figure S1 in File S1.
Details of the data sets used for estimation of nucleotide substitution rate and time to the most recent common ancestor for Cauliflower mosaic virus.
| Parameter | Open reading frame | |
| I–V | VI | |
| Best-fit substitution model | GTR+I+Γ4 | GTR+I+Γ4 |
| Best-fit molecular clock model | Relaxed Uncorrelated Exponential | Relaxed Uncorrelated Exponential |
| Best-fit population growth model | Exponential growth | Constant size |
| Sequence length (nt) | 5106 | 1269 |
| No. of sequences | 66 | 97 |
| Sampling date range | 1960–2010 | 1960–2012 |
| Chain length (in millions) | 100 | 100 |
| TMRCA | 491 (86–1270) | 431 (113–886) |
| Substitution rate (nt/site/year) | 1.71×10−4 (1.45×10−5–3.87×10−4) | 5.81×10−4 (2.47×10−4–9.47×10−4) |
| dN/dS | 0.069 | 0.201 |
| No. of variable sites | 1074 | 448 |
Time to the most recent common ancestor.
Nonsynomymous (dN) and synonymous (dS) substitution (dN/dS) ratios were calculated for seven ORFs using the Pamilo-Bianchi-Li (PBL) method in MEGA v5 [56].
Figure 3Bayesian phylogenetic estimates from ORFs I–V and ORF VI of Cauliflower mosaic virus.
Maximum-clade–credibility trees from BEAST analyses of 66 and 97 isolates of ORFs I–V (A) and ORF VI (B), respectively. Branch colours correspond to the most probable geographic location of their descendent nodes.
Figure 4Patterns of Cauliflower mosaic virus migration jointly estimated across the two ORF regions.
ORFs I–V and ORF VI migrations are shown by solid and dashed lines. Lines connecting discrete regions indicate statistically supported ancestral state changes and their thicknesses denote statistical support. There are five categories of support. In increasing order, line thicknesses indicate 6≤BF<10 (positive support); 10≤BF<30 (strong support); 30≤BF<100 (very strong support); and BF≥100 (decisive support). Migration line was not shown when they were represented by only a single sample.