| Literature DB >> 34207047 |
Norsazilawati Saad1,2, James W Olmstead3, Arvind Varsani4,5, Jane E Polston1, Jeffrey B Jones1, Svetlana Y Folimonova1, Philip F Harmon1.
Abstract
Southern highbush blueberry (interspecific hybrids of Vaccinium corymbosum L.) is cultivated near wild V. corymbosum as well as closely related species in Florida, USA. The expansion of blueberry cultivation into new areas in Florida and deployment of new cultivars containing viruses can potentially increase the diversity of viruses in wild and cultivated V. corymbosum. In this study, viral diversity in wild and cultivated blueberries (V. corymbosum) is described using a metagenomic approach. RNA viromes from V. corymbosum plants collected from six locations (two cultivated and four wild) in North Central Florida were generated by high throughput sequencing (HTS) and analyzed using a bioinformatic analysis pipeline. De novo assembled contigs obtained from viromes of both commercial and wild sites produced sequences with similarities to plant virus species from a diverse range of families (Amalgaviridae, Caulimoviridae, Endornaviridae, Ophioviridae, Phenuiviridae, and Virgaviridae). In addition, this study has enabled the identification of blueberry latent virus (BlLV) and blueberry mosaic associated ophiovirus (BlMaV) for the first time in Florida, as well as a tentative novel tepovirus (blueberry virus T) (BlVT) in blueberry. To the best of our knowledge, this is the first study that compares viral diversity in wild and cultivated blueberry using a metagenomic approach.Entities:
Keywords: Florida; blueberry; metagenomics; viromes
Mesh:
Year: 2021 PMID: 34207047 PMCID: PMC8234961 DOI: 10.3390/v13061165
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Map showing the locations of the collected blueberry samples. Adapted from [5]. (A) O’leno State Park, High Springs; (B) Morning Side Nature Center, Gainesville; (C) Island Grove; (D) Interlachen.
Figure 2Virome analysis pipeline used for identification of viruses. Raw reads from metagenome data were processed by filtering the reads based on quality and trimming the adapter sequences. These reads were de novo assembled to produce contigs that were then subjected to a two-step BLASTx analysis to identify plant viruses, which include a comparison to a local plant virus database followed by comparison to a non-redundant GenBank protein database. Contigs with homology to the same virus species were then assembled to produce scaffolds. Apart from de novo assembly, a reference-based mapping approach was also used to obtain complete or partial viral genomes. Published viral genomes and scaffolds were used as reference sequences for known and novel viruses, respectively. Adapted from [5].
The number of processed reads and scaffolds, and the percentage (%) of putative plant virus scaffolds for each RNA library corresponding to different sampling sites. Adapted from [5].
| Libraries | No. of Reads | No. of Scaffolds ≥500 | No. of Putative Plant Virus Scaffolds | % of Putative Plant Virus Scaffolds | |
|---|---|---|---|---|---|
| Raw | Processed | ||||
| GV | 52,166,912 | 48,198,079 | 4256 | 52 | 1.22 |
| HS | 50,757,234 | 46,322,605 | 3485 | 60 | 1.72 |
| IL | 44,181,866 | 41,424,158 | 4890 | 91 | 1.86 |
| IG | 50,931,730 | 47,527,410 | 3899 | 24 | 0.62 |
| ILC | 44,632,114 | 41,527,161 | 3361 | 20 | 0.60 |
| IGC | 30,729,018 | 27,504,778 | 2790 | 23 | 0.82 |
GV, Gainesville; HS, High Springs; IL, Interlachen; IG, Island Grove; ILC, Interlachen cultivated site; IGC, Island Grove cultivated site.
Figure 3Viral populations in the RNA viromes of wild and cultivated V. corymbosum. (a) Diversity of virus sequences represented by the different range of viral genera of closely related viruses as identified by BLASTx analyses. The overlapping region indicates viral genera present in both sites. (b) Putative viral scaffolds with similarity to plant virus species from the different range of viral genera from each sampling location. GV, Gainesville; HS, High Springs; IL, Interlachen; IG, Island Grove; ILC, Interlachen cultivated site; IGC, Island Grove cultivated site. Unclassified viruses refer to species that have not been approved by the ICTV. Adapted from [5].
The consensus sequence of complete viral genomes assembled from each plant virome using a virome analysis pipeline. The tick mark indicated the complete viral genomes that were successfully assembled from each RNA virome. Adapted from [5].
| Virus | Genus | Vector | Transmission | GV | HS | IL | IG | ILC | IGC |
|---|---|---|---|---|---|---|---|---|---|
| BlLV |
| No | Transmitted through seed | ✓ | ✓ | ||||
| BlMaV |
| UK | UK | ✓ | ✓ | ||||
| BRRV |
| UK | Vegetative propagation | ✓ | |||||
| BlVT |
| UK | UK | ✓ | |||||
| TMV |
| No | Mechanical | ✓ | ✓ | ✓ | ✓ |
BlMaV, blueberry mosaic associated ophiovirus; BlLV, blueberry latent virus; BRRV, blueberry red ringspot virus; BlVT, blueberry virus T; TMV, tobacco mosaic virus; UK, unknown vector; GV, Gainesville; HS, High Springs; IG, Island Grove; IL, Interlachen; ILC, Interlachen cultivated site; IGC, Island Grove cultivated site.
Figure 4The pairwise comparison analysis in SDT and phylogenetic analysis in MEGA7 using nucleotide and amino acid sequences of different genome regions of the respective viruses. Pairwise sequence identities between the viruses are represented in different colors. The bootstrap consensus phylogenetic tree was constructed by neighbor-Joining using the Maximum Composite Likelihood and Poisson correction method for nucleotide and amino acid sequences, respectively, based on 1000 replicates, showing branch nodes more than 75% bootstrap values. (a) Pairwise identity and phylogenetic analysis using the full genome of BlLV isolates and selected members of the genus Amalgavirus. (b) Pairwise identity and phylogenetic analysis using the NP gene of BlMaV isolates and selected members of the genus Ophiovirus. (c) Pairwise identity and phylogenetic analysis using the full genome of BRRV isolates and selected members of the genus Soymovirus. (d) Pairwise identity and phylogenetic analysis using the RdRp and CP protein of the putative novel Tepovirus and selected members representing different genera in the family Betaflexiviridae. Accession numbers are shown in the figure. ACLSV, apple chlorotic leaf spot virus; ASGV, apple stem grooving virus; ASPV, apple stem pitting virus; AVCaV, apricot vein clearing associated virus; BanMMV, banana mild mosaic virus; BanVX, banana virus X; BBLV, blueberry latent virus; BlMaV, blueberry mosaic associated ophiovirus; BlVT, blueberry virus T; BRRV, blueberry red ringspot virus; CarChV1, carrot Ch virus 1; CLBV, citrus leaf blotch virus; CNRMV, cherry necrotic rusty mottle virus; CPsV, citrus psorosis virus; DVA—diuris virus A; GarCLV, garlic common latent virus; GVA, grapevine virus A; LRNV, lettuce ring necrosis virus; PCSV, peanut chlorotic streak virus; PrVT, prunus virus T; PVT, potato virus T; RVA, rhododendron virus A; SbCMV, soybean chlorotic mottle virus; SCSMaV, sugarcane striate mosaic-associated virus; STV, southern tomato virus. Adapted from [5].
The percentage (%) and average coverage of mapped reads to each RNA segment of BlMaV obtained from virome of cultivated V. corymbosum from Island Grove and wild V. corymbosum from High Springs and Interlachen, and the percentage of pairwise nt identity between each RNA to the corresponding reference sequences. Adapted from [5].
| RNA Segments | % of Mapped Reads (Average Coverage) | % Pairwise nt Identity to Ref. Seq. | ||||
|---|---|---|---|---|---|---|
| IGC | HS | IL | IGC | HS | IL | |
| 1 | 0.01 (47×) | 0.013 (102×) | - | 81.2 | 79.9 | - |
| 2 | 0.004 (83×) | 0.0005 (16×) | - | 79.5 | 79.7 | - |
| 3 | 0.003 (61×) | 0.007 (252×) | 0.01 (368x) | 83.4 | 82.7 | 84.6 |
Reference sequence (Ref. seq.): Accession number: KJ_04366-8; IGC, cultivated V. corymbosum from Island Grove; HS, High Springs; IL, wild V. corymbosum from Interlachen; k, kilo.
The length of each RNA segment and the encoded ORFs of the reference sequence and BlMaV obtained from virome of cultivated V. corymbosum from Island Grove and wild V. corymbosum from High Springs and Interlachen. Adapted from [5].
| RNA Segments | ORFs | Length of RNA (nt) | Length of ORFs (nt) | 5′/3′ UTRs | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RS | IGC | HS | IL | RS | IGC | HS | IL | RS | IGC | HS | IL | ||
| 1 | RdRp/23 kDa | 7963 | 7747 | 7946 | - | 7014/585 | 7014/585 | 7014/576 | - | 238/19 | -/25 | 190/43 | - |
| 2 | MP | 1934 | 1981 | 1939 | - | 1548 | 1545 | 1545 | - | 317/69 | 334/60 | 337/99 | - |
| 3 | NP | 1570 | 1543 | 1602 | 1591 | 1368 | 1368 | 1368 | 1368 | 128/77 | 111/64 | 165/127 | 163/61 |
Reference sequence (RS), Accession number: KJ704366-KJ704368; IGC, cultivated V. corymbosum from Island Grove; HS, High Springs; IL, wild V. corymbosum from Interlachen; k, kilo.
Nucleotide length of each ORFs in different BRRV isolates from Florida and other regions. Adapted from [5].
| Isolates | ORFs | Total Length | |||||||
|---|---|---|---|---|---|---|---|---|---|
| I (MP) | A | B | C | IV (CP) | V (RT) | VI (TA) | VII | ||
| CZ | 1101 | 312 | 561 | 600 | 1488 | 2004 | 1284 | 462 | 8302 |
| IGC-FL | 1197 | 369 | 561 | 597 | 1488 | 2007 | 1284 | 522 | 8392 |
| NJ | 939 | 369 | 561 | 600 | 1461 | 1974 | 1287 | 429 | 8303 |
| PL | 939 | 369 | 561 | 594 | 1455 | 1974 | 1284 | 462 | 8265 |
| SL | 1110 | 369 | 561 | 588 | 1476 | 2043 | 1284 | 462 | 8299 |
CZ, Czech Republic; IGC-FL, Island Grove, Florida; NJ, New Jersey; PL, Polish; SL, Slovenia.
The nucleotide length and pairwise nucleotide comparison of each ORF and the UTR of BlVT and PrVT (NC_024686). Adapted from [5].
| Type of Region | Region | Length (nt) | % Pairwise nt Identity | |
|---|---|---|---|---|
| PrVT | BlVT | |||
| ORFs | RdRp | 5337 | 5457 | 61 |
| MP | 1152 | 1146 | 70 | |
| CP | 666 | 663 | 65 | |
| UTRs | 5′ | 46 | 109 | 68 |
| 3′ | 79 | 264 | 59 | |
Figure 5Validation of the putative new tepovirus in extracted RNA of different V. corymbosum cultivars from Island Grove by PCR, producing expected amplicons at ~1 kB. Cultivar’s name was shown at the top of the gel. M, 1 kB DNA ladder.