| Literature DB >> 30376573 |
Shaheen Nourinejhad Zarghani1,2, Jean Michel Hily3, Miroslav Glasa4, Armelle Marais5, Thierry Wetzel1, Chantal Faure5, Emmanuelle Vigne3, Amandine Velt3, Olivier Lemaire3, Jean Michel Boursiquot6,7, Arnela Okic8, Ana Belén Ruiz-Garcia9, Antonio Olmos9, Thierry Lacombe6,7, Thierry Candresse5.
Abstract
RNASeq or double-stranded RNA based approaches allowed the reconstruction of a total of 9 full-length or near full-length genomes of the recently discovered grapevine virus T (GVT). In addition, datamining of publicly available grapevine RNASeq transcriptome data allowed the reconstruction of a further 14 GVT genomes from five grapevine sources. Together with four GVT sequences available in Genbank, these novel sequences were used to analyse GVT diversity. GVT shows a very limited amount of indels variation but a high level of nucleotide and aminoacid polymorphism. This level is comparable to that shown in the closely related grapevine rupestris stem pitting-associated virus (GRSPaV). Further analyses showed that GVT mostly evolves under conservative selection pressure and that recombination has contributed to its evolutionary history. Phylogenetic analyses allow to identify at least seven clearly separated groups of GVT isolates. Analysis of the only reported PCR GVT-specific detection primer pair indicates that it is likely to fail to amplify some GVT isolates. Taken together these results point at the distinctiveness of GVT but also at the many points it shares with GRSPaV. They constitute the first pan-genomic analysis of the diversity of this novel virus.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30376573 PMCID: PMC6207325 DOI: 10.1371/journal.pone.0206010
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
GVT isolate, grapevine variety, origin and tissue sampled for the grapevine plants analyzed here and viruses identified in addition to GVT.
| GVT isolate | Variety | Origin | Country | Tissue sampled | Other viruses present | HTS template |
|---|---|---|---|---|---|---|
| Limniona (2288Mtp2) | Germplasm collection | France (Greece) | Phloem scrapings | GRSPaV, GLRaV2, GLRaV3, GFkV, GVA, GRVFV | dsRNA | |
| Saperavi (1734Mtp2) | Germplasm collection | France (Romania) | Phloem scrapings | GRSPaV, GLRaV3, GVA, GVF, GRVFV | dsRNA | |
| Sauvignon gris (301Mtp4) | Germplasm Francetion | France (Chile) | Phloem scrapings | GRSPaV, GPgV, GRGV | dsRNA | |
| Sauvignon | MulFranceation block | France | Phloem scrapings | GRSPaV, HSVd, GYSVd1 | Total RNA | |
| Phloem scrapings | GRSPaV, HSVd, GYSVd-1 | Total RNA | ||||
| Leaves | GRSPaV, HSVd, GYSVd-1 | Total RNA | ||||
| Welshriesling | Commercial vineyard | Slovak Republic | Phloem scrapings | GLRaV-1, GRVFV, HSVd | rRNA depleted total RNA | |
| Riesling | Commercial vineyard | Germany | Leaves | GRGV | na |
1: the code of the accession is given in parentheses for samples originating from The INRA Vassal-Montpellier grapevine biological resource center.
2: the country of origin of the accession is given in parentheses for samples originating from a biological resource center.
3: INRA Grapevine biological resource center of Vassal-Montpellier (https://www6.montpellier.inra.fr/vassal_eng/)
4: potential presence of viroids not efficiently detected through the double-stranded RNA sequencing strategy used.
GVT contig length, average coverage and representation in the dsRNA derived high-throughput sequencing reads derived from the various grapevine samples.
| Total reads | GVT Mapped reads | % mapped reads | Average coverage | Initial contig length | Final sequence length | Accession number | |
|---|---|---|---|---|---|---|---|
| 716,447 | 9,890 | 1.38% | 188 x | 7.9 kb | 8,670 nt | MH674173 | |
| 427,125 | 9,389 | 2.2% | 205 x | 7,5 kb | 8,670 nt | MH674171 | |
| 5,318 | 1.25% | 112 x | 7,6 kb | 8,670 nt | MH674172 | ||
| 3,025,587 | 13,630 | 0.45% | 322 x | 7,3 kb | 8,669 nt | MH674174 |
GVT contig length, average coverage and representation in the total RNA derived high-throughput sequencing reads derived from the various grapevine samples.
| Isolate | Total reads | Reads after grapevine genome substraction | GVT mapped reads | % mapped reads | Average coverage | Accession number |
|---|---|---|---|---|---|---|
| 70,808,936 | 15,942,747 | 575 | 0.003% | 9.9 x | MH674170 | |
| 57,440,410 | 10,894,109 | 2,459 | 0.023% | 42.3 x | MH674175 | |
| 73,210,304 | 9,338,200 | 4,336 | 0.046% | 74.8 x | MH674176 | |
| 1,065,468 | na | 2,532 | 0.23% | 36.6 x | MH638265 |
1: does not apply.
Dataset, grapevine variety and length of the GVT contigs assembled with datamining publicly available grapevine transcriptomic data.
| Isolate | Dataset | Grapevine species/ variety | Length of contig | % of GVT genome | Accession number |
|---|---|---|---|---|---|
| Pinot noir | SRR520387 | 8,679 nt | 99.7% | MH674167 | |
| Barbera | SRR1631833 | 8,665 nt | 99.6% | MH674169 | |
| Moscato rosa | ERR923384 | 8,667 nt | 99.6% | MH674169 | |
| CF-1 | ERR922632 | 8,672 nt | 99.7% | MH674178 | |
| CF-2 | 8,674 nt | 99.7% | MH674179 | ||
| CF-3 | 8,657 nt | 99.5% | MH674180 | ||
| CF-4 | 8,637 nt | 99.3% | MH674181 | ||
| Lambrusco-1 | ERR922631 | 8,687 nt | 99.8% | MH674182 | |
| Lambrusco-2 | 8,635 nt | 99.2% | MH674183 | ||
| Lambrusco-3 | 8,319 nt | 95.6% | MH674184 | ||
| Lambrusco-4 | 8,683 nt | 99.8% | MH674185 | ||
| Lambrusco-5 | 8,651 nt | 99.4% | MH674186 | ||
| Lambrusco-6 | 8,557 nt | 98.3% | MH674187 | ||
| Vitis amurensis | SRR922004, SRR922126, SRR922136, SRR922138 | 8,668 nt | 99.6% | MH674168 |
Fig 1Schematic representation of the genomic organization of GRSPaV (A) and of GVT (B) and plotting of their respective nucleotide diversity (pi) along their genomes.
Open reading frames are indicated by boxes. REP: replication-associated protein; TGB1, 2, 3: Triple gene block protein 1, 2, 3; CP: coat protein; 6: ORF6. The overall average diversity calculated for the complete genome is indicated by a doted line. Pi plots were constructed using DNASP v6, with a window of 100 nt and a step of 25 nt.
Average pairwise nucleotide (nt) and amino acid (aa) divergence computed for all genomic regions of GVT.
The value computed individually for the different codon positions are also shown.
| Full genome | 5’ NCR | REP | TGB1 | TGB2 | TGB3 | CP | 3’NCR | |
|---|---|---|---|---|---|---|---|---|
| Average pairwise divergence (nt) | 16.6 +/- 0.2% | 7.6 +/- 2.3% | 17.0 +/- 0.2% | 17.3 +/- 0.8% | 16.8 +/- 1.2% | 15.7 +/- 1.5% | 14.8 +/- 0.7% | 11.6 +/- 1.1% |
| Avegage pairwise divergence (aa) | na | na | 9.8 +/- 0.4% | 11.9 +/- 1.2% | 13.9 +/- 2.0% | 17.9 +/- 2.6% | 6.1 +/- 0.9% | na |
| Average pairwise divergence (1st codon position) | na | na | 8.7 +/- 0.3% | 9.7 +/- 1.1% | 13.3 +/- 1.8% | 9.1 +/- 1.6% | 5.7 +/- 0.9% | na |
| Average pairwise divergence (2nd codon position) | na | na | 4.8 +/- 0.2% | 6.2 +/- 0.9% | 4.3 +/- 1.1% | 10.2 +/- 1.7% | 3.5 +/- 0.5% | na |
| Average pairwise divergence (3rd codon position) | na | na | 37.5 +/- 0.3% | 36.0 +/- 1.1% | 32.8 +/- 1.9% | 27.8 +/- 2.4% | 35.1 +/- 1.1% | na |
Fig 2Phylogenetic tree reconstructed using the complete or near-complete genome sequence of GVT and GRSPaV isolates.
The Neighbor joining tree was reconstructed using a strict nucleotide distance in Mega7. Bootstrap analysis (1,000 replicates) was performed to evaluate the solidity of branches. Only bootstrap values >70% are shown. The scale bar represents 10% nucleotide divergence. Groups of GRSPaV or GVT are indicated on the right of the figure.
Potential recombination events detected by RDP4 in the genome of GVT isolates.
| Predicted recombinant | Predicted major parent | Predicted minor parent | Detecting programs | Predicted breakpoints | Overall corrected probability |
|---|---|---|---|---|---|
| 486–1 | CF-3 | Moscato rosa | 7 | 1538–2838 | 9.0 e-26 |
| 7 | 3772–4104 | 1.0 e-7 | |||
| 7 | 5888–6824 | 2.5 e-10 | |||
| 7 | 7952–8624 | 3.3 e-13 | |||
| SK809 | Lambrusco-1 | Moscato rosa | 7 | 438–7770 | 1.3 e-37 |
| unknown | Moscato rosa | 5 | 1952–2403 | 2.5 e-5 | |
| 6 | 3139–5299 | 1.3 e-20 |
1: number of programs that detected a particular potential recombination event out of the nine tested algorithms.
Fig 3Comparison of the sequence of the various GVT isolates at the binding sites of the GVT_7632F/GVT_8534R detection primer pair [24].
The sequences are shown in a 5' to 3' orientation (left to right). Missing sequence information is indicated by n. Mutations are indicated in gray-shaded, bold type. The sequence of the primers is shown in bold italics at the bottom of the alignement (for the GVT_8534R reverse primer, the reverse complement is shown).