| Literature DB >> 29466388 |
Steven M Valles1, Sanford D Porter1, Luis A Calcaterra2.
Abstract
Metagenomics and next generation sequencing were employed to discover new virus natural enemies of the fire ant, Solenopsis invicta Buren in its native range (i.e., Formosa, Argentina) with the ultimate goal of testing and releasing new viral pathogens into U.S. S. invicta populations to provide natural, sustainable control of this ant. RNA was purified from worker ants from 182 S. invicta colonies, which was pooled into 4 groups according to location. A library was created from each group and sequenced using Illumina Miseq technology. After a series of winnowing methods to remove S. invicta genes, known S. invicta virus genes, and all other non-virus gene sequences, 61,944 unique singletons were identified with virus identity. These were assembled de novo yielding 171 contiguous sequences with significant identity to non-plant virus genes. Fifteen contiguous sequences exhibited very high expression rates and were detected in all four gene libraries. One contig (Contig_29) exhibited the highest expression level overall and across all four gene libraries. Random amplification of cDNA ends analyses expanded this contiguous sequence yielding a complete virus genome, which we have provisionally named Solenopsis invicta virus 5 (SINV-5). SINV-5 is a positive-sense, single-stranded RNA virus with genome characteristics consistent with insect-infecting viruses from the family Dicistroviridae. Moreover, the replicative genome strand of SINV-5 was detected in worker ants indicating that S. invicta serves as host for the virus. Many additional sequences were identified that are likely of viral origin. These sequences await further investigation to determine their origins and relationship with S. invicta. This study expands knowledge of the RNA virome diversity found within S. invicta populations.Entities:
Mesh:
Year: 2018 PMID: 29466388 PMCID: PMC5821328 DOI: 10.1371/journal.pone.0192377
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Map of collection locations (capital letters) in Formosa Province, Argentina.
Dashed ovals show how locations were grouped for RNA library preparation. We sampled 2–9 colonies at each site. See Table 1 for colony numbers, GPS coordinates, and habitat descriptions.
Summary of collection information for Solenopsis invicta colonies used in RNA library preparation.
Library designation represents South American Library (SAL).
| Site | Date | Latitude | Longitude | Habitat | Number of colonies | Library designation |
|---|---|---|---|---|---|---|
| R | 23-ix-14 | -25.2875 | -57.7054 | Dike along river | 7 | SAL_1 |
| S | 22-ix-14 | -25.3598 | -57.6631 | Dike | 5 | SAL_1 |
| T | 22-ix-14 | -25.2948 | -57.7445 | Road right of way | 8 | SAL_1 |
| U | 24-ix-14 | -25.3553 | -58.2796 | Low grass | 9 | SAL_1 |
| V | 24-ix-14 | -25.1432 | -58.2314 | Roadside | 9 | SAL_1 |
| W | 24-ix-14 | -25.2508 | -57.9907 | Roadside | 8 | SAL_1 |
| A | 16-ix-14 | -26.6609 | -58.6297 | River bank, Costanera park | 8 | SAL_2 |
| B | 16-ix-14 | -26.6222 | -58.6653 | Semi urban | 9 | SAL_2 |
| C | 17-ix-14 | -26.5173 | -58.2818 | Field adjacent to road | 9 | SAL_2 |
| D | 17-ix-14 | -26.4806 | -58.2737 | Marshy road area | 7 | SAL_2 |
| E | 17-ix-14 | -26.4795 | -58.3071 | City park, | 9 | SAL_2 |
| F | 17-ix-14 | -26.3944 | -58.3442 | Gas station | 7 | SAL_2 |
| X | 25-ix-14 | -26.5412 | -58.5081 | Roadside | 1 | SAL_2 |
| Y | 25-ix-14 | -26.6474 | -58.6236 | Roadside | 3 | SAL_2 |
| G | 18-ix-14 | -25.8786 | -58.0872 | Abandoned parking area and periphery | 9 | SAL_3 |
| H | 18-ix-14 | -25.6733 | -58.2561 | Grassy roadside | 6 | SAL_3 |
| N | 20-ix-14 | -26.1182 | -58.2241 | Urban, highly disturbed | 5 | SAL_3 |
| O | 22-ix-14 | -26.1975 | -58.2144 | Adjacent to gas station | 7 | SAL_3 |
| P | 22-ix-14 | -26.2281 | -58.2405 | Abandoned picnic area | 9 | SAL_3 |
| Q | 22-ix-14 | -26.0416 | -58.0589 | Rural road side | 8 | SAL_3 |
| I | 19-ix-14 | -26.2391 | -58.6303 | Rough grassy area | 9 | SAL_4 |
| J | 19-ix-14 | -26.205 | -59.0708 | City park | 8 | SAL_4 |
| K | 19-ix-14 | -25.7411 | -59.1021 | Mowed road edge, west side | 9 | SAL_4 |
| L | 20-ix-14 | -25.9357 | -58.5124 | Road edge | 5 | SAL_4 |
| M | 20-ix-14 | -25.8637 | -58.8858 | Residential area | 8 | SAL_4 |
Contiguous sequences (Contigs) with significant viral identity by BLASTX analysis of the GenBank database and considered high likelihood viral prospects from RNA libraries created from Solenopsis invicta worker ants.
Contigs were first sorted in descending order based on the number of the sequences comprising it, followed by the libraries represented.
| Designation | Sequences comprising contig | Size (nt) | e-score | Query coverage:identity (%) | Sequence identity with | Number of singletons in | Genome | Virus family | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| SAL_1 | SAL_2 | SAL_3 | SAL_4 | ||||||||
| Contig_29 | 21294 | 9030 | 0 | 57: 67 | Israeli acute paralysis virus | 274 | 10312 | 4312 | 6396 | ssRNA | Dicistroviridae |
| Contig_66 | 13980 | 2626 | 2E-154 | 76: 39 | Aphid lethal paralysis virus | 3823 | 1704 | 5241 | 3212 | ssRNA | Dicistroviridae |
| Contig_30 | 6197 | 1869 | 1E-139 | 96: 40 | Aphid lethal paralysis virus | 2165 | 779 | 1833 | 1420 | ssRNA | Dicistroviridae |
| Contig_16 | 2104 | 2371 | 0 | 83:48 | Solenopsis invicta virus 1 | 2 | 133 | 1815 | 154 | ssRNA | Dicistroviridae |
| Contig_70 | 641 | 1068 | 6E-64 | 82: 43 | Alber virus | 117 | 180 | 141 | 203 | RNA | Unclassified |
| Contig_58 | 607 | 1453 | 1E-67 | 88: 34 | Nasonia vitripennis virus | 18 | 311 | 37 | 241 | RNA | Unclassified |
| Contig_13 | 510 | 1568 | 2E-102 | 87: 41 | Aphid lethal paralysis virus | 206 | 98 | 7 | 199 | ssRNA | Dicistroviridae |
| Contig_21 | 486 | 3211 | 0 | 85: 61 | Acute bee paralysis virus | 108 | 51 | 9 | 318 | ssRNA | Dicistroviridae |
| Contig_83 | 428 | 2623 | 0 | 97: 62 | Mosinovirus | 90 | 58 | 110 | 170 | RNA | Unclassified |
| Contig_55 | 202 | 2639 | 3E-147 | 91: 35 | Kashmir bee virus | 12 | 153 | 20 | 17 | ssRNA | Dicistroviridae |
| Contig_82 | 114 | 1203 | 0 | 82: 94 | Acute bee paralysis virus | 16 | 7 | 4 | 87 | ssRNA | Dicistroviridae |
| Contig_15 | 100 | 1042 | 0 | 92: 83 | Hubei orthoptera virus 1 | 28 | 14 | 3 | 55 | RNA | Unclassified |
| Contig_27 | 69 | 680 | 6E-135 | 91: 97 | Acute bee paralysis virus | 1 | 6 | 2 | 60 | ssRNA | Dicistroviridae |
| Contig_17 | 58 | 796 | 4E-162 | 99: 91 | Acute bee paralysis virus | 15 | 7 | 5 | 31 | ssRNA | Dicistroviridae |
| Contig_80 | 55 | 1173 | 0 | 98: 79 | Drosophila C virus | 44 | 4 | 3 | 4 | ssRNA | Dicistroviridae |
| Contig_75 | 36 | 589 | 6E-26 | 83: 39 | Hubei picorna-like virus 46 | 8 | 13 | 15 | 0 | RNA | Unclassified |
| Contig_47 | 734 | 1045 | 7E-82 | 89: 44 | Solenopsis invicta virus 2 | 0 | 667 | 6 | 61 | ssRNA | Unclassified |
| Contig_78 | 216 | 702 | 7E-111 | 99: 85 | Israeli acute paralysis virus | 153 | 42 | 21 | 0 | ssRNA | Dicistroviridae |
| Contig_57 | 146 | 1028 | 5E-116 | 93: 57 | Wuhan insect virus 11 | 0 | 93 | 28 | 25 | RNA | Unclassified |
| Contig_18 | 44 | 664 | 2E-84 | 97: 60 | Hubei Orthoptera virus 1 | 10 | 2 | 0 | 32 | RNA | Unclassified |
| Contig_88 | 44 | 1500 | 0 | 99: 87 | Rhopalosiphum padi virus | 12 | 0 | 7 | 25 | ssRNA | Dicistroviridae |
| Contig_53 | 39 | 522 | 1E-50 | 98: 52 | Wuhan insect virus 11 | 0 | 27 | 3 | 9 | RNA | Unclassified |
| Contig_9 | 9 | 471 | 3E-68 | 87: 95 | Acute bee paralysis virus | 3 | 0 | 3 | 3 | ssRNA | Dicistroviridae |
| Contig_85 | 8 | 542 | 2E-53 | 91: 56 | Solenopsis invicta virus 1 | 4 | 1 | 3 | 0 | ssRNA | Dicistroviridae |
| Contig_51 | 321 | 1465 | 0 | 99: 76 | Nodamura virus | 0 | 274 | 0 | 47 | ssRNA | Nodaviridae |
| Contig_49 | 182 | 1269 | 0 | 99: 95 | Aphid lethal paralysis virus | 0 | 153 | 29 | 0 | ssRNA | Dicistroviridae |
| Contig_12 | 95 | 438 | 2E-79 | 99: 77 | Big Sioux River virus | 49 | 0 | 0 | 46 | ssRNA | Unclassified |
| Contig_50 | 69 | 768 | 9E-68 | 99: 46 | Israeli acute paralysis virus | 0 | 68 | 0 | 1 | ssRNA | Dicistroviridae |
| Contig_60 | 69 | 926 | 0 | 99: 93 | Aphid lethal paralysis virus | 0 | 55 | 14 | 0 | ssRNA | Dicistroviridae |
| Contig_28 | 67 | 830 | 1E-46 | 97: 37 | Israeli acute paralysis virus | 0 | 0 | 13 | 54 | ssRNA | Dicistroviridae |
| Contig_52 | 66 | 356 | 1E-35 | 99: 94 | Shuangao insect virus 8 | 0 | 20 | 0 | 46 | RNA | Unclassified |
| Contig_25 | 58 | 869 | 8E-10 | 37: 40 | Hubei picorna-like virus 50 | 0 | 0 | 20 | 38 | RNA | Unclassified |
| Contig_141_3101 | 3101 | 2351 | 0 | 97: 55 | Solenopsis invicta virus 1 | 0 | 0 | 0 | 3101 | ssRNA | Dicistroviridae |
| Contig_19 | 1075 | 1080 | 2E-159 | 89: 71 | Solenopsis invicta virus 1 | 0 | 0 | 0 | 1075 | ssRNA | Dicistroviridae |
| Contig_19_480 | 480 | 623 | 8E-114 | 99: 88 | Solenopsis invicta virus 1 | 0 | 0 | 480 | 0 | ssRNA | Dicistroviridae |
| Contig_110_361 | 361 | 1214 | 0 | 99: 82 | Kashmir bee virus | 0 | 0 | 0 | 361 | ssRNA | Dicistroviridae |
| Contig_81_84 | 84 | 1103 | 6E-161 | 99: 67 | Wuhan arthropod virus 2 | 0 | 0 | 84 | 0 | RNA | Unclassified |
| Contig_108_79 | 79 | 715 | 5E-131 | 98: 83 | Shuangao insect virus 8 | 0 | 79 | 0 | 0 | RNA | Unclassified |
Fig 2Graphical summary of broad taxonomic assignments of RNA sequences from the four combined gene libraries (Left) which did not match the Solenopsis invicta fire ant genome.
These libraries were derived from 182 S. invicta colonies sampled from the 25 locations shown in Fig 1. Phage sequences were not of interest because they infect bacteria. The expanded pie chart (Right) shows the percentage of non-phage virus sequences matching known S. invicta viruses and those remaining sequences with significant viral identity. Note that both SINV-3 and SiDNV only accounted for a very small percentage of matched contigs. The dashed line in the unmatched section shows the proportion of sequences belonging to the new fire ant virus (SINV-5) described in this paper.
Fig 3Geographic variation of singleton expression levels of sequences for each RNA library (see Fig 1 for map of collection locations and RNA library groupings).
Sequences were binned according to broader taxonomic categories (Top) and known or newly described Solenopsis invicta viruses (Bottom). Note that expression levels are shown in log scales.
Fig 4SINV-5 genome architecture (A), detection of replicative genome strand (B), and phylogenetic analysis of the RNA-dependent RNA polyermase of SINV-5 and known dicistroviruses (C). (A) The SINV-5 genome is represented by the center black line and open reading frames by rectangles. Proteins with identity to an RNA helicase, protease, and RNA-dependent RNA polymerase (non-structural proteins) were detected within ORF 1 and those with identity to virus capsid proteins were detected in ORF 2. A short, overlapping ORF3 at the 5' end of ORF2 provides support for Aparavirus placement. (B) RT-PCR amplification of each SINV-5 genome strand. Replicative strand (lane 1), corresponding control (lane 2), positive strand (lane 3) and corresponding control (lane 4). (C) The evolutionary relationship of SINV-5 with known dicistroviruses by comparison of the RNA-dependent RNA polymerase from each virus inferred using the Neighbor-Joining method [43]. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (500 replicates) are shown next to the branches [44]. Only the most conserved region of the RdRp was aligned and a total of 243 positions were included in the final dataset (exact positions for the translated ORF1 are indicated in the phylogenetic tree within parentheses by each taxa). Evolutionary analyses were conducted in MEGA7 [45]. Abbreviations and GenBank Accession numbers for regions of the translated ORF1 (shown in graph) used for analysis include SINV-5 (Solenopsis invicta virus 5; MF593921), ABPV (Acute bee paralysis virus; NC002548), IAPV (Israeli acute bee paralysis virus; NC009025), KBV (Kashmir bee virus; NC004807), SINV-1 (Solenopsis invicta virus 1; NC006559), CPV (Cricket paralysis virus; NC003924), DCV (Drosophila C virus; NC001834), MCV (Mud crab virus; NC014793), TaSV (Taura syndrome virus; NC003005), ALPV (Aphid lethal paralysis virus; NC004365), RhPV (Rhopalosiphum padi virus; NC001874), HoCV (Homalodisca coagulata virus 1; NC008029), PSIV (Plautia stali intestine virus; NC003779), HiPV (Himetobi P virus; NC003782), BQCV (Black queen cell virus; NC003784), TrV (Triatoma virus; NC003783).
gs composed of sequences from all four gene libraries evaluated by RT-PCR and PCR.
Conti Reverse transcription was required for successful amplification of all contigs. RT-PCR was also conducted with RNA obtained from S. invicta colonies (n = 27) collected in the U.S. to determine if the contig was present in the introduced range.
| Contig designation | PCR amplification with cDNA synthesis | PCR amplification without cDNA synthesis | Presence in |
|---|---|---|---|
| Contig_29 | Yes | No | Not detected |
| Contig_66 | Yes | No | 56 |
| Contig_30 | Yes | No | 56 |
| Contig_16 | Yes | No | 56 |
| Contig_70 | Yes | No | 26 |
| Contig_58 | Yes | No | Not detected |
| Contig_13 | Yes | No | 15 |
| Contig_21 | Yes | No | Not detected |
| Contig_83 | Yes | No | Not detected |
| Contig_55 | Yes | No | Not detected |
| Contig_82 | Yes | No | Not detected |
| Contig_15 | Yes | No | Not detected |
| Contig_27 | Yes | No | Not detected |
| Contig_17 | Yes | No | Not detected |
| Contig_80 | Yes | No | Not detected |