| Literature DB >> 26848679 |
Karyna Rosario1, Christian Marr2, Arvind Varsani3,4,5, Simona Kraberger6, Daisy Stainton7, Enrique Moriones8, Jane E Polston9, Mya Breitbart10.
Abstract
Monopartite begomoviruses (Geminiviridae), which are whitefly-transmitted single-stranded DNA viruses known for causing devastating crop diseases, are often associated with satellite DNAs. Since begomovirus acquisition or exchange of satellite DNAs may lead to adaptation to new plant hosts and emergence of new disease complexes, it is important to investigate the diversity and distribution of these molecules. This study reports begomovirus-associated satellite DNAs identified during a vector-enabled metagenomic (VEM) survey of begomoviruses using whiteflies collected in various locations (California (USA), Guatemala, Israel, Puerto Rico, and Spain). Protein-encoding satellite DNAs, including alphasatellites and betasatellites, were identified in Israel, Puerto Rico, and Guatemala. Novel alphasatellites were detected in samples from Guatemala and Puerto Rico, resulting in the description of a phylogenetic clade (DNA-3-type alphasatellites) dominated by New World sequences. In addition, a diversity of small (~640-750 nucleotides) satellite DNAs similar to satellites associated with begomoviruses infecting Ipomoea spp. were detected in Puerto Rico and Spain. A third class of satellite molecules, named gammasatellites, is proposed to encompass the increasing number of reported small (<1 kilobase), non-coding begomovirus-associated satellite DNAs. This VEM-based survey indicates that, although recently recovered begomovirus genomes are variations of known genetic themes, satellite DNAs hold unexplored genetic diversity.Entities:
Keywords: Ipomoea satellites; alphasatellites; begomovirus; begomovirus-associated satellites; betasatellites; gammasatellites; metagenomics; ssDNA; vector; whitefly
Mesh:
Substances:
Year: 2016 PMID: 26848679 PMCID: PMC4776191 DOI: 10.3390/v8020036
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Whitefly samples used for vector-enabled metagenomic (VEM) survey of begomovirus-associated satellite DNAs [37].
| Crop/Plant (Dataset) 1 | Location | Collection Date | Whiteflies 2 |
|---|---|---|---|
| Squash | Imperial Valley, California, USA | June 2012 | |
| Tomato | Salamá, Baja Verapaz, Guatemala | January 2012 | |
| Tomato (Guat_T2) | El Progreso, Jutiapa, Guatemala | January 2012 | |
| Squash | Teculután, Zacapa, Guatemala | January 2012 | |
| Bean | Torrox, Málaga, Spain | September 2011 | |
| Squash (Spain_S) | Torrox, Málaga, Spain | September 2011 | |
| Tomato (Spain_T) | Torrox, Málaga, Spain | September 2011 | |
| European black nightshade | Algarrobo-Costa, Málaga, Spain | September 2011 | |
| European black nightshade | Algarrobo-Costa, Málaga, Spain | September 2011 | |
| Tomato | Bet Dagan, Israel | March 2011 | |
| Squash (Israel_S) | Bet Dagan, Israel | March 2011 | |
| Tomato (PR_T1) | Santa Isabel, Puerto Rico | November 2010 | |
| Tomato (PR_T2) | Santa Isabel, Puerto Rico | November 2010 | |
| Pumpkin (PR_P) | Santa Isabel, Puerto Rico | November 2010 | |
| Eggplant (PR_E) | Santa Isabel, Puerto Rico | November 2010 |
1 Indicates the crop or weed species from which whitefly specimens were collected. Dataset IDs for metagenomic libraries containing satellite DNA sequences are specified in parentheses. Highlighted samples indicate locations where begomovirus-associated satellite DNAs were identified. Colors distinguish if novel (green) or known (grey) satellite DNAs were detected; 2 Whitefly species (Bemisia tabaci or Trialeurodes vaporariorum) used for VEM were classified based on phylogenetic groups including the Middle East-Asia Minor 1 (MEAM1), Mediterranean (MED), and New World (NW) clades.
Satellite DNA molecules detected in whiteflies.
| Dataset | Genome a (Accession Number) | Best Match b (% Identity)/Accession Number | Satellite Size (nt) |
|---|---|---|---|
| Guat_T2 | VEM alphasatellite 1a (KT099170) | MeCMV-associated alphasatellite, Venezuela (76%)/KF670648 | 1370 |
| VEM alphasatellite 1b (KT099171) | MeCMV-associated alphasatellite, Venezuela (74%)/KF670648 | 1329 | |
| VEM alphasatellite 2 (KT099172) | Ageratum enation virus associated DNA 1, India (71%)/FN543100 | 1359 | |
| Israel_S | VEM Cotton leaf curl Gezira betasatellite * (KT099176) | Cotton leaf curl Gezira betasatellite, Egypt(95%)/AF397215 | 1324 |
| VEM Cotton leaf curl Gezira betasatellite * (KT099177) | Cotton leaf curl Gezira betasatellite, Egypt (99%)/AF397217 | 1348 | |
| VEM Cotton leaf curl Gezira betasatellite (KU095847) | Cotton leaf curl Gezira betasatellite, Egypt(98%)/AF397215 | 1310 | |
| VEM Cotton leaf curl Gezira virus alphasatellite * (KT099178) | Cotton leaf curl Gezira virus DNA 1, Burkina Faso (96%)/FN554581 | 1359 | |
| PR_T1 | VEM Ipomoea begomovirus satellite 1 (KT099179) | Ipomoea begomovirus satellite DNA beta isolate, Spain (70%)/FJ914403 | 731 |
| VEM Ipomoea begomovirus satellite 2 (KT099180) | Ipomoea begomovirus satellite DNA beta isolate, Spain (70%)/FJ914403 | 713 | |
| VEM Ipomoea begomovirus satellite 3 (KT099181) | SPLCLaV replication-associated protein, Spain (95%) **/EU839579 | 743 | |
| VEM alphasatellite 3a (KT099173) | Cuban alphasatellite 1, Cuba (86%)/HE806451 | 1300 | |
| PR_P | VEM alphasatellite 3b (KT099174) | Cuban alphasatellite 1, Cuba (85%)/HE806451 | 1307 |
| VEM alphasatellite 3c (KT099175) | Cuban alphasatellite 1, Cuba (86%)/HE806451 | 1303 | |
| PR_T1, T2, P, and E | VEM Ipomoea begomovirus satellite (Contigs, n = 49) * | Ipomoea begomovirus satellite DNA beta isolate, Spain (< 80%)/Various | |
| Spain T and S | VEM Ipomoea begomovirus satellite (Contigs, n = 10) * | Ipomoea begomovirus satellite DNA beta isolate, Spain (> 90%)/Various |
a Genomes that were not PCR-verified are highlighted with an asterisk (*). Genomes highlighted in grey summarize multiple Ipomoea satellites identified in datasets from Puerto Rico and Spain (Contig sequences representing complete Ipomoea satellites are provided in Supplemental File S1). b Genome-wide pairwise identities for best match in GenBank. Abbreviations refer to melon chlorotic mosaic virus (MeCMV) and sweet potato leaf curl Lanzarote virus (SPLCLaV). ** The best GenBank match to VEM Ipomoea begomovirus satellite 3 was a portion of the Rep-encoding region of the SPCLaV genome and the reported identity is only within this small region (8% coverage).
Figure 1Schematics depicting general genome organization and features observed in novel alphasatellites (A) and Ipomoea satellites (B) detected during this study. All genomes are characterized by a putative origin of replication (ori) marked by a stem-loop structure containing a conserved nonanucleotide motif. Alphasatellites encode a replication-associated protein (Rep) and exhibit an adenine-rich (A-rich) region. Ipomoea satellites do not exhibit any coding regions; however, these gammasatellites contain an A-rich region as well as a conserved region ~100 nt long shared among various gammasatellites (γSCR). The γSCR contains a 23 nt stretch where all gammasatellites reported to date share high similarities with betasatellites, named here the satellite common region (SCR) core. Some Ipomoea satellites from Puerto Rico share high identities with a 66 nt stretch (highlighted in black) found within the rep gene of sweet potato leaf curl virus (SPLCV) genomes detected in the same region.
Figure 2Midpoint-rooted maximum likelihood phylogenetic tree of predicted alphasatellite and nanovirus replication-associated protein sequences. Alphasatellites detected during this study are highlighted with a red star. Three groups of alphasatellites based on amino acid pairwise-identities are highlighted in shades of grey, including DNA-1-type alphasatellites from the Old World, DNA-2-type, and DNA-3-type alphasatellites mainly identified in the New World. Branches exhibiting less than 70% branch support were collapsed. Branches with approximate likelihood ratio test (aLRT) support > 91% are indicated with black circles, whereas branches exhibiting 85%–90% support are marked with a white circle. A list of sequences used for phylogenetic analysis and Rep pairwise identities are provided in Supplemental File S2.
Figure 3(A) Two-dimensional color-coded matrix depicting genome-wide pairwise identities observed among gammasatellites and (B) consensus sequences observed within a conserved satellite common region shared among various gammasatellites (γSCR). Mixed bases in the consensus sequences include K (G/T), R (A/G), W (A/T), and H (A/C/T). The γSCR was observed in Ipomoea satellites (Ipomoea sats) as well as tomato leaf curl virus sat (ToLCV sat) and malvastrum leaf curl satellite (MLC sat). A conserved 23 nt stretch (positions 663–686; labelled SCR core) found within the γSCR (positions 613–711) shares high pairwise identities with betasatellites and all gammasatellites reported to date, including novel New World satellites (NW sats) identified in Cuba and satellites identified in whiteflies from Florida, USA (VEM sats). Nucleotide positions are based on VEM Ipomoea begomovirus satellite 1. A list of sequences used for calculating identity scores among gammasatellites as well as betasatellites used for investigating the SCR core is provided in Supplemental File S3.