| Literature DB >> 23186303 |
Alexandre De Bruyn1, Julie Villemot, Pierre Lefeuvre, Emilie Villar, Murielle Hoareau, Mireille Harimalala, Anli L Abdoul-Karime, Chadhouliati Abdou-Chakour, Bernard Reynaud, Gordon W Harkins, Arvind Varsani, Darren P Martin, Jean-Michel Lett.
Abstract
BACKGROUND: Cassava (Manihot esculenta) is a major food source for over 200 million sub-Saharan Africans. Unfortunately, its cultivation is severely hampered by cassava mosaic disease (CMD). Caused by a complex of bipartite cassava mosaic geminiviruses (CMG) species (Family: Geminivirideae; Genus: Begomovirus) CMD has been widely described throughout Africa and it is apparent that CMG's are expanding their geographical distribution. Determining where and when CMG movements have occurred could help curtail its spread and reveal the ecological and anthropic factors associated with similar viral invasions. We applied Bayesian phylogeographic inference and recombination analyses to available and newly described CMG sequences to reconstruct a plausible history of CMG diversification and migration between Africa and South West Indian Ocean (SWIO) islands.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23186303 PMCID: PMC3560262 DOI: 10.1186/1471-2148-12-228
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Repartition map of the seven African CMG’s in East Africa, the Comoros archipelago and the Seychelles archipelago. The map on the left describes the general repartition of the seven DNA-A species in the area. The map at the bottom right zooms in on the archipelagos with the number and composition of samples from each island represented with pie charts. The colour code is available on the top right of the figure with FS indicating full genome sequences and PS indicating partial genome sequences.
List of recombination events inferred in EACMV-like sequences
| DNA-A | 1 | EACMV-UG | 545 | 1008 | ACMV | EACMV | RG | 1.2x10-71 |
| 2 | EACMCV | 1052 | 1872 | Unknown | EACMV | RGBMCS | 1.3x10-89 | |
| 3 | EACMKV, EACMZV, SACMV | 1835 | 9 | ToLCV-like | EACMV | RG | 6.7x10-53 | |
| 4 | EACMV-KE | 1878 $ | 2083 | EACMZV | EACMV | RGBMCS | 1.4x10-24 | |
| 5 | EACMMV, SACMV | 178 | 999 $ | ToLCV/TYLCV-like | EACMV | RGBMCS | 4.9x10-20 | |
| 6 | EACMMV | 1843 $ | 2311 | EACMKV | EACMV | RG | 6.0x10-15 | |
| 7 | EACMZV | 1840 $ | 2184 | EACMKV | Unknown | RGBMC | 6.7x10-22 | |
| 8 | EACMV, EACMMV | 2052 $ | 2360 | Unknown | Unknown | RGBMC | 7.2x10-16 | |
| 9 | EACMV | 1821 | 1935 | ACMV | EACMV | 1.0x10-10 | ||
| 10 | EACMKV, SACMV | 2625 | 2742 $ | EACMV-like | Unknown | RGBMC | 1.3x10-11 | |
| 11 | ACMV | 1788 | 1856 | EACMV | ACMV | 3.3x10-5 | ||
| 12 | ToLCCV | 2591 | 2646 | EACMV | ToLCCV | Rg | 9.1x10-5 | |
| 13 | EACMV-like | 190 | 451 | Unknown | Unknown | rBc | 5.7x10-12 | |
| 14 | EACMCV | 1056 $ | 1126 | EACMCV | Unknown | rgB | 6.0x10-5 | |
| 15 | EACMV | 2662 | 2684 | Unknown | EACMV | 3.7x10-2 | ||
| DNA-B | 1 | CMMGV | 1564 | 2688 | Unknown | EAC | RGBMCS | 1.6x10-78 |
| 2 | EACMV-[SC] | 1124 | 1454 | EACMCV-like | EAC | 5.8x10-21 | ||
| 3 | EAC | 2461 | 2704 | EAC | EAC | 3.3x10-12 | ||
| 4 | EAC | 2672 | 5 | Unknown | EAC | RG | 5.6x10-13 | |
| 5 | CAM | 1496 | 2503 | EACMV-[SC]-like | Unknown | RGB | 2.2x10-9 | |
| 6 | EAC | 2612 | 37 | EAC | EAC | RGBMC | 1.3x10-11 | |
| 7 | EAC | 422 | 1647 | Unknown | EAC | RMC | 1.3x10-6 | |
| 8 | EAC | 723 | 1143 | Unknown | EAC | 3.2x10-6 | ||
| 9 | EAC | 1718 | 2751 | EAC | EAC | BMC | 1.7x10-15 | |
* EAC corresponds to B components associated with EACMV, EACMKV, EAMCZV; CAM corresponds to EACMCV-B; CMMGV, Cassava mosaic Madagascar virus[MG:Tol:06]; EACMV-[SC] corresponds to East African cassava mosaic virus isolated in Seychelles; ToLCV, Tomato leaf curl virus; TYLCV, Tomato yellow leaf curl virus; ToLCCV: Tomato leaf curl Comoros virus.
$ Breakpoints not inferred by RDP.
α R: RDP; G: GENECONV; B: Bootscan; M: MaxChi; C: Chimaera; S: SiScan; T: 3Seq. Upper-case letters imply that a method detected recombination with a multiple comparison corrected P-value < 0.05, lower-case letters imply that the method detected recombination with a multiple comparison corrected P-value > 0.05
β The reported P-value is for the method in bold type and is the best P-value calculated for the region in question.
Figure 2Maximum clade credibility trees constructed from the EACMV-like capsid protein (CP) dataset. Branches are coloured according to the most probable location state of the node on their right (i.e. the likely geographical location of the ancestral sequence represented by this node). The large black circle around one of the nodes indicates that the state probability at this node is less than 0.5 (i.e. there is less than 50% confidence in the indicated location being the actual place where this ancestral sequence existed). The time-scale of evolutionary changes represented in the tree is indicated by the scale bar below it. Whereas filled circles associated with nodes indicate > 95% posterior probability support for the branches to their left, open circles indicate nodes with > 70% posterior support for these branches. Nodes to the right of branches with < 70% support are left unlabelled. The bar graph on the left corner indicates location probabilities of the node at the root of the tree (which is the node representing the last common ancestor of all the sequences represented in the tree). Grey bars represent the probabilities obtained with randomisation of the tip locations. Probable introduction events from Africa to the SWIO islands are indicated with red arrows.
Figure 3Maximum clade credibility trees constructed from the EACMV-like DNA-B Full genome (FG-B) dataset. Branches are coloured according to the most probable location state of their descendant nodes, with black circled nodes indicating nodes with state probabilities < 0.5. The time-scale of evolutionary changes represented in the tree is indicated by the scale bar below it. Whereas filled circles associated with nodes indicate > 95% posterior probability support for the branches to their left, open circles indicate nodes with > 70% posterior support for these branches. Nodes to the right of branches with < 70% support are left unlabelled. The bar graph on the left corner indicates the location probabilities of the node at the root of the tree that represents the most recent common ancestor of all the sequences represented in the tree. Grey bars represent the probabilities obtained with randomisation of the tip locations. Probable introduction events from Africa to the SWIO islands are indicated with red arrows.
Figure 4CMG migrations from East Africa and between SWIO islands. CMG migration events inferred using the capsid protein (CP, in green); full genome DNA-A (FG-A, in red) and full genome DNA-B (FG-B, in blue) datasets. Arrow colours represent the dataset used to infer each migration event. The thickness of arrows is proportional to the number of independent movements inferred between islands.
List of pseudorecombination events inferred in concatenated DNA-A and DNA-B sequences
| 1 | EACMV-like | 2858 | 6017 | EACMV-UG | EACMV | Full B | RGBMcS | 1.9x10-71 |
| 2 | EACMV-like | 2883 $ | 6017 | EACMKV | EACMV | Full B | RG | 9.9x10-24 |
| 3 | EACMV-like | 2883 $ | 6017 | EACMKV | EACMV | Full B | 6.7x10-61 | |
| 4 | EACMV-like | 2883 | 6017 $ | EACMV | EACMV | Full B | RBGMC | 1.4x10-28 |
| 5 | EACMV-like | 2883 $ | 6017 | EACMKV | EACMV | Full B | RG | 1.7x10-63 |
| 6 | EACMV-like | 3635 | 5950 | EACMV-UG | EACMZV | pCR+ORFs | RG | 2.7x10-38 |
| 7 | EACMV-like | 2883 $ | 5973 $ | EACMV | EACMV-UG | Full B | R | 2.2x10-36 |
| 8 | EACMV-like | 2883 $ | 6017 | EACMV-UG | EACMKV | Full B | RG | 8.1x10-51 |
| 9 | EACMV-like | 2820 | 5924 | EACMV | EACMV | Full B | GBMC | 7.3x10-26 |
| 10 | EACMV-like | 2883 | 6017 | EACMV | EACMV | Full B | RGbMCS | 2.9x10-12 |
| 11 | EACMV-like | 2883 | 6017 | EACMV | EACMV | Full B | G | 8.7x10-8 |
| 12 | EACMV-like | 3711 | 6017 | EACMZV | EACMZV | pCR+ORFs | GMC | 8.0x10-25 |
| 13 | EACMV-like | 3689 | 6017 $ | EACMV | EACMZV | pCR+ORFs | RGBMc | 8.6x10-13 |
| 14 | EACMV-like | 3789 | 5501 $ | EACMKV | EACMZV | pCR+pORFs | RB | 2.1x10-6 |
| 15 | EACMV-like | 1 | 2201 $ | EACMV | EACMV | Full B | rbMC | 1.3x10-4 |
* EACMV-like comprises B components associated with EACMV, EACMKV, EACMZV and SACMV.
$ Breakpoints not inferred by RDP.
α R: RDP; G: GENECONV; B: Bootscan; M: MaxChi; C: Chimaera; S: SiScan; T: 3Seq. Upper-case letters imply that a method detected recombination with a multiple comparison corrected P-value < 0.05, lower-case letters imply that the method detected recombination with a multiple comparison corrected P-value > 0.05
β The reported P-value is for the method in bold type and is the best P-value calculated for the region in question.
ϕ The portion of the DNA-B “transferred” between the DNA-A sequences during each re-assortment event is indicated (p: partial; CR: Common Region; ORFs: Open-Reading Frames).
Figure 5Analysis of EACMV-like component re-assortments. Analysis of component re-assortment through counts of DNA-B component “migrations” between the DNA-A sequences of the major CMG lineages. In the analysis DNA-A sequences were classified into six discrete groups. Branches are coloured according to which of these six groups the DNA-B sequence represented by the node to their right was associated with. Black circled nodes indicate ancestral DNA-B sequences where there is less than a < 0.5 state probability for all six of DNA-A lineages (i.e. it is particularly uncertain which DNA-A these ancestral DNA-B sequences were associated with). Whereas filled circles associated with nodes to the right of branches indicate > 95% posterior probability support for these branches, open circles indicate branches with > 70% posterior support. Nodes to the right of branches with < 70% support are left unlabelled. DNA-B “migrations” between different DNA-A lineages that could be explained by genome re-assortment events inferred in an independent RDP3 analysis are indicated by the corresponding RDP3 event number listed in Table 2. The underlined event number (event 6) corresponds to a partial DNA-B”migration” that likely involved overprinting of part of the DNA-B CR with that of the DNA-A sequence that captured it.