| Literature DB >> 34072594 |
Yves Kwibuka1,2, Espoir Bisimwa2, Arnaud G Blouin1, Claude Bragard3, Thierry Candresse4, Chantal Faure4, Denis Filloux5,6, Jean-Michel Lett7, François Maclot1, Armelle Marais4, Santatra Ravelomanantsoa8, Sara Shakir9, Hervé Vanderschuren9,10, Sébastien Massart1.
Abstract
Cassava is one of the most important staple crops in Africa and its production is seriously damaged by viral diseases. In this study, we identify for the first time and characterize the genome organization of novel ampeloviruses infecting cassava plants in diverse geographical locations using three high-throughput sequencing protocols [Virion-Associated Nucleotide Acid (VANA), dsRNA and total RNA], and we provide a first analysis of the diversity of these agents and of the evolutionary forces acting on them. Thirteen new Closteroviridae isolates were characterized in field-grown cassava plants from the Democratic Republic of Congo (DR Congo), Madagascar, Mayotte, and Reunion islands. The analysis of the sequences of the corresponding contigs (ranging between 10,417 and 13,752 nucleotides in length) revealed seven open reading frames. The replication-associated polyproteins have three expected functional domains: methyltransferase, helicase, and RNA-dependent RNA polymerase (RdRp). Additional open reading frames code for a small transmembrane protein, a heat-shock protein 70 homolog (HSP70h), a heat shock protein 90 homolog (HSP90h), and a major and a minor coat protein (CP and CPd respectively). Defective genomic variants were also identified in some cassava accessions originating from Madagascar and Reunion. The isolates were found to belong to two species tentatively named Manihot esculenta-associated virus 1 and 2 (MEaV-1 and MEaV-2). Phylogenetic analyses showed that MEaV-1 and MEaV-2 belong to the genus Ampelovirus, in particular to its subgroup II. MEaV-1 was found in all of the countries of study, while MEaV-2 was only detected in Madagascar and Mayotte. Recombination analysis provided evidence of intraspecies recombination occurring between the isolates from Madagascar and Mayotte. No clear association with visual symptoms in the cassava host could be identified.Entities:
Keywords: Ampelovirus; Central Africa; Closteroviridae; Indian Ocean islands; Manihot esculenta; high-throughput sequencing
Mesh:
Substances:
Year: 2021 PMID: 34072594 PMCID: PMC8226816 DOI: 10.3390/v13061030
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Identity and origin of cassava landraces used and indication on the high-throughput sequencing (HTS) protocol performed on these samples.
| Cassava | Country of Origin | Collection Date DD/MM/YY | Symptoms | Sequencing Strategy and Target | Isolates Detected | Other Virus Detected |
|---|---|---|---|---|---|---|
| Menatana | Madagascar | 18/10/2017 | Asymptomatic in greenhouse | VANA | MG-Men-241, MG-Men-9 | - |
| Miandrazaka | Madagascar | 18/10/2017 | Asymptomatic in greenhouse | VANA | MG-Mia-10, MG-Mia-362, MG-Mia-403, MG-Mian 2-2 | - |
| Long Java | Reunion | 25/02/2015 | Asymptomatic in field | dsRNAs | RE-Ljv | - |
| Reunion | Mayotte | 26/03/2015 | Symptoms of CMD and CBSD | dsRNAs | MY-Ren-5, MY-Ren-8 | EACMV+UCBSV |
| 6 Mois Blanc | Mayotte | 26/03/2015 | Symptoms of CMD | dsRNAs | MY−6mb-4, MY−6mb-6 | EACMV |
| Nambiyombiyo | DR Congo | 13/01/2016 | Asymptomatic in greenhouse | Total RNA | CG-Nmb | CBSV |
| Kahunde | DR Congo | 13/01/2016 | Asymptomatic in greenhouse | Total RNA | CG-Kah | - |
VANA: virion-associated nucleic acid; dsRNAs: double-stranded RNAs; EACMV: East African cassava mosaic virus; UCBSV: Ugandan cassava brown streak virus; CBSV: cassava brown streak virus; CMD: cassava mosaic disease; CBSD: cassava brown streak disease.
RT-PCR primer pairs used for the confirmation of the viral sequences in the samples.
| PRIMER | TARGET ORF | SEQUENCES (5′==> 3′) | Tm |
|---|---|---|---|
| 12.495 F cp | CP ORF5 | AATTTGGGAGGAGTGCGACC | 60.3 |
| 12.715 R cp | CP ORF5 | AGACCGACTTGTGCTACTCTTG | 60.0 |
| 11.735 F hsp90 | HSP90 ORF4 | GTCCCGGCCTACATGCAATT | 60.8 |
| 11.959 R hsp90 | HSP90 ORF4 | ACCGCCAGTCAACTCTCGTA | 60.9 |
| 9.021 F hsp70 | HSP70 ORF4 | TATGGTTGTTGCTCGCGACT | 60.0 |
| 9.292 R hsp70 | HSP70 ORF4 | CTGACAAACCAGCAGCAGTTG | 60.3 |
| 7.244 F rdrp | RdRp ORF1B | GGACAACCTCCGAAACCGTAT | 60.1 |
| 7.544 R rdrp | RdRp ORF1B | CTTTCGCTGCCATTGGTGTC | 60.1 |
RT-PCR primer pairs used for the characterization of the defective RNA molecules.
| Primers Name | Sequences (5’==>3’) | Expected Amplicon Size |
|---|---|---|
| MEaV-1-RE-Ljv-D-RNA1-F | ACATCTAAATGCTAACGAACGAAGAG | 380 bp |
| MEaV-1-RE-Ljv-D-RNA1-R | CAACGCCAGAATCTTCGTACA | |
| MEaV-1-RE- Ljv-D-RNA2-F | AGGCTTTCGACAGTGAAGAAGTG | 330 bp |
| MEaV-1-RE- Ljv-D-RNA2-R | CGTAGCCATACTGAAGGATAGCA | |
| MEaV-2-RE- Ljv-D-RNA3-F | TGGAAGCCGCTGGTAAACTACA | 400 bp |
| MEaV-2-RE-Ljv-D-RNA3-R | CAAGCACGTTCAATATTAGGAATAGTAC | |
| MEaV-2-MG-Mena-D-RNA4-F | AGACATATGAAAGAGTTGCATTGGTG | 500 bp |
| MEaV-2-MG-Mena-D-RNA4-R | ACCTACAAATAATTTCGCTCGTCTG |
Lengths, number of reads integrated, percent of total reads and average coverage for Closteroviridae contigs from the various analyzed cassava samples.
| Cassava Landraces | Isolate Name 1 | Accession Number | Virus Name | Contig Length [Nt] | Number of Reads Mapped | Percent Of Total Reads | Average Genome Coverage | Internal Undetermined Nucleotides | Internal Gaps |
|---|---|---|---|---|---|---|---|---|---|
| Menatana | MG-Men-241 | MT773584 | MEaV-1 | 13,528 | 38,475 | 12.1 | 506.8X | 0 | 0 |
| MG-Men-9 | MT773591 | MEaV-2 | 13,752 | 10,945 | 4.10 | 153.6X | 0 | 0 | |
| Miandrazaka | MG-Mia-10 | MT773592 | MEaV-2 | 13,770 | 26,987 | 4.8 | 368X | 377 | 1 |
| MG-Mia-362 | MT773594 | MEaV-2 | 12,007 | 19,421 | 8.24 | 322X | 0 | 0 | |
| MG-Mia-403 | MT773590 | MEaV-1 | 10,518 | 26,917 | 11.4 | 480.9X | 0 | 0 | |
| MG-Mian 2-2 | MT773596 | MEaV-2 | 10,417 | 7676 | 1.4 | 136.4X | 843 | 1 | |
| Long java | RE-Ljv | MT773586 | MEaV-1 | 13,640 | 21,294 | 18.5 | 312.5X | 0 | 0 |
| Reunion | MY-Ren-5 | MT773589 | MEaV-1 | 13,172 | 1976 | 3.4 | 29.6X | 72 | 1 |
| MY-Ren-8 | MT773595 | MEaV-2 | 13,762 | 377 | 0.7 | 5.4X | 2889 | 14 | |
| 6 Mois blanc | MY-6mb-4 | MT773585 | MEaV-1 | 13,615 | 4992 | 12.7 | 73.3X | 288 | 4 |
| MY-6mb-6 | MT773593 | MEaV-2 | 13,764 | 2631 | 6.7 | 39.2X | 990 | 9 | |
| Nambiyombiyo | CG-Nmb | MT773587 | MEaV-1 | 13,612 | 7502 | 0.12 | 42X | 0 | 0 |
| Kahunde | CG-Kah | MT773588 | MEaV-1 | 13,616 | 15,351 | 0.23 | 75X | 11 | 1 |
Isolates were named considering two letters of the code of the country where the landraces were collected—three letters of the landrace short acronym—and a number (when multiple contigs from the same landrace were obtained)
Figure 1Schematic representation of the genomic organization of representative isolates CG-Nmb (MEaV-1) and MG-Men-9 (MEaV-2) (top) and structure of the defective variants (dRNA) identified (bottom). The RNA genome is drawn as a black line and the predicted open reading frames (ORFs) are represented by colored rectangles. Annotations and ORF numbers are given inside and above rectangles, respectively. Abbreviations: MTR—methyltransferase, HEL—helicase, RdRp—RNA-dependent RNA polymerase, HSP70h—heat shock protein 70 homolog, HSP90h— heat shock protein 90 homolog, CP—coat protein, CPd—minor coat protein.
Genome length, position of ORFs and size of the encoded proteins for the representative isolates of the two species CG-Nmb (MEaV-1) and MG-Men-9 (MEaV-2).
| Isolates | Genome Length (Nt) | Length of Open Reading Frames (Orfs) (Nt) and Molecular Mass of Encoded Proteins (Kda) | ||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| ORF6 | ||
| 13,616 nt | 6789 nt | 1503 nt | 204 nt | 1707 nt | 1686 nt | 774 nt | 717 nt | |
| 254.6 kDa | 56.6 kDa | 7.6 kDa | 62.2 kDa | 63.4 kDa | 27.7 kDa | 27.1 kDa | ||
| 13,752 nt | 6975 nt | 1593 nt | 147 nt | 1608 nt | 1698 nt | 816 nt | 639 nt | |
| 258.8 kDa | 60.4 kDa | 5.3 kDa | 58.4 kDa | 64.3 kDa | 29.3 kDa | 24 kDa | ||
Figure 2Phylogenetic analysis of the aligned amino acid sequences of the HP70h (ORF3) of the thirteen isolates from cassava and of selected members of the family Closteroviridae (see the Supplementary Material Table S2 for detailed information on these viruses). Green arrows indicate isolate sequences obtained in this study. Maximum likelihood phylogenetic tree was reconstructed in Mega X using Poisson model with uniform distribution for amino acid sequence alignments. Bootstrap values are indicated at the main branch nodes. The bar represents the number of amino acid substitution per site.
Amino acid sequence identity of the RdRp, HSP70h and CP of MEaV-2 (isolate MG-Men-9) with those of other selected Closteroviridae members.
| Genus | Representative Members | Proteins | ||
|---|---|---|---|---|
|
|
|
| ||
|
| MEaV-1 (isolate CG-Nmb) | 66.5% | 69.9% | 80.1% |
|
| Grapevine leafroll-associated virus 4 (NC_016416) | 41.7% | 51.6% | 47.6% |
|
| Citrus tristeza virus (NC_001661) | 27.2% | 25% | 12.5% |
|
| Potato yellow vein virus (NC_006062 and NC_006063) | 25.7% | 26.5% | 17.2% |
|
| Grapevine leafroll-associated virus 7 (NC_016436) | 27.5% | 27.7% | 13.6% |
Figure 3Phylogenetic trees reconstructed using the amino acid sequences of the following three taxonomically relevant proteins for the family Closteroviridae: (a) RdRp; (b) HSP70h; (c) CP; and (d) the whole genome nucleotide sequences. Maximum likelihood phylogenetic trees were reconstructed in Mega X using the GTR+GI model for nucleotide sequences alignments and a Poisson model with uniform distribution for amino acid sequence alignments. Only bootstrap values of more than 70% are mentioned on nodes. The scales provide branch distance for the given number of substitutions per site.
Intergroup and intragroup average pairwise divergence and standard deviation calculated for the three taxonomically relevant proteins (RdRP, HSP70h and CP) and for the nearly complete genomes.
| Within MEaV-2 with Mian 2-2 | Within MEaV-2 without Mian2-2 | Within MEaV-1 with Divergents | Within MEaV-1 without Divergents | Within MEaV-1 Divergents | Between MEaV-1 and MEaV-2 | ||
|---|---|---|---|---|---|---|---|
|
| aa divergences | 7.3 +/− 0.7% | 1.1 +/− 0.3% | 7.1 +/− 0.7% | 0.8 +/− 0.4% | 0.8% | 32.3 +/− 1.7% |
|
| aa divergences | 2.8 +/− 0.4% | 1.0 +/− 0.3% | 6.0 +/− 0.6% | 2.3 +/− 0.4% | 0.7% | 30.8 +/− 2.0% |
|
| aa divergences | na | 3.1 +/− 0.8% | 7.4 +/− 1.0% | 3.2 +/− 0.6% | na | 19 +/− 2.2% |
| Genome | nt divergences | 12.3 +/− 0.1% | 3.8 +/− 0.1% | 17.1 +/− 0.2% | 7.9 +/− 0.2% | 2.3% | 41.4 +/− 0.3% |
MEaV-1 divergents: MY-Ren-5 and MG-Mia-403. na: information not available as the contig/scaffold does not extend to the region concerned.
Figure 4Geographic repartition of the two novel cassava ampelovirus species (MEaV-1 and MEaV-2) according to the country of origin of positive samples.
Figure 5Recombination event detected among isolates from MeaV-2 by the RDP4 program.
Figure 6Gel red-stained 2% agarose gel showing RT-PCR products from: (a) D-RNA1 expected amplicon size 380 bp; (b) D-RNA2 and D-RNA3, expected amplicon size 380 bp and 400 bp, respectively; and (c) D-RNA4, expected amplicon size 500 bp. M1: GeneRuler 1 kb plus ladder (Thermo Scientific, SM1343). M2: fast DNA ladder 10 kb (NEB N3238S).