Jane Pascar, Christopher H Chandler.
Abstract
Wolbachia is the most widespread endosymbiont, infecting >20% of arthropod species, and capable of drastically manipulating the host's reproductive mechanisms. Conventionally, diagnosis has relied on PCR amplification; however, PCR is not always a reliable diagnostic technique due to primer specificity, strain diversity, degree of infection and/or tissue sampled. Here, we look for evidence of Wolbachia infection across a wide array of arthropod species using a bioinformatic approach to detect the Wolbachia genes ftsZ, wsp, and the groE operon in next-generation sequencing samples available through the NCBI Sequence Read Archive. For samples showing signs of infection, we attempted to assemble entire Wolbachia genomes, and in order to better understand the relationships between hosts and symbionts, phylogenies were constructed using the assembled gene sequences. Out of the 34 species with positively identified infections, eight species of arthropod had not previously been recorded to harbor Wolbachia infection. All putative infections cluster with known representative strains belonging to supergroup A or B, which are known to only infect arthropods. This study presents an efficient bioinformatic approach for post-sequencing diagnosis and analysis of Wolbachia infection in arthropods.
Keywords: Anopheles; Bioinformatics; Insects; NCBI SRA; Wolbachia
Year: 2018 PMID: 30202647 PMCID: PMC6126470 DOI: 10.7717/peerj.5486
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Reference Wolbachia genes.
Gene sequences from Wolbachia-infected hosts used to create the reference database for Magic-BLAST searches of SRA accessions to diagnose novel Wolbachia infections.
| Host Classification | ||||||||
|---|---|---|---|---|---|---|---|---|
| Phylum | Class | Order | Species | Supergroup | Strain | Accession number | Gene | Citation |
| Arthropoda | Arachnida | Trombidiformes | B | – |
| |||
| A | wDiacir3 |
| ||||||
| Insecta | Coleoptera | – | NFR114 |
| D Kageyama, S Narita, T Imamura and A Miyanoshita (2008, unpublished data) | |||
| – | – |
| ||||||
| A | wDic |
| ||||||
| Diptera | B | – |
| ND Djadid, N Daneshinia, S Gholizadeh and S Zakeri (2006, unpublished data) | ||||
| B | – |
| ||||||
| A | wMel |
| YF Wang and Y Zheng (2008, unpublished data) | |||||
| A | wMel |
| ||||||
| B | wProtPA |
| ||||||
| B | 22 |
| JC Rondan-Duenas, A Blanco, and CN Gardenal (2010, unpublished data) | |||||
| A | Canton-S |
| ||||||
| A | – |
| ||||||
| A | wHa |
| JWO Ballard (2003, unpublished data) | |||||
| A | wMa(Ma) |
| JWO Ballard (2003, unpublished data) | |||||
| A | wAlbA |
| ||||||
| B | wFusc |
| S Wiwatanaratanabutr and P Kittayapong (2002, unpublished data) | |||||
| A | – |
| ||||||
| A | – |
| ||||||
| A | – |
| ||||||
| Hymenoptera | B | – |
| F Lu and MX Jiang (2012, unpublished data) | ||||
| A | – |
| ||||||
| – | wNGirVA |
| ||||||
| – | wNvi-2 |
| F Lu and MX Jiang (2012, unpublished data) | |||||
| B | Sib |
| ||||||
| B | Type I |
| ||||||
| B | Type I |
| ||||||
| B | – |
| ||||||
| – | – |
| ||||||
| Lepidoptera | – | – |
| |||||
| B | – |
| ||||||
| A | Type II |
| ||||||
| B | – |
| ||||||
| A | – |
| ||||||
| B | – |
| ||||||
| Orthoptera | B | – |
| |||||
| Malacostraca | Isopoda | – | – |
| ||||
| Nematoda | Secernentea | Spirurida | C | – |
| |||
| D | – |
| ||||||
| C | – |
| ||||||
| D | – |
| ||||||
| D | – |
| ||||||
| C | – |
| ||||||
| D | – |
| S Salahuddeen and TB Nutman (2000, unpublished data) | |||||
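
The search described in the caption above (reference Wolbachia genes used as a Magic-BLAST database against SRA accessions) could be set up roughly as follows. This is a minimal sketch, not the authors' pipeline: the file name `wolbachia_refs.fasta`, the database name `wolbachia_ref`, the output path, and the accession `SRR0000000` are hypothetical placeholders, and the functions only build command lines for the `makeblastdb` and `magicblast` programs rather than running them.

```python
import shlex


def makeblastdb_cmd(fasta_path, db_name):
    """Command to build a nucleotide BLAST database from a reference FASTA."""
    return [
        "makeblastdb",
        "-in", fasta_path,
        "-dbtype", "nucl",
        "-parse_seqids",
        "-out", db_name,
    ]


def magicblast_cmd(sra_accession, db_name, out_path, threads=1):
    """Command to align the reads of one SRA accession against the database."""
    return [
        "magicblast",
        "-sra", sra_accession,      # reads are fetched from the SRA by accession
        "-db", db_name,
        "-outfmt", "tabular",
        "-num_threads", str(threads),
        "-out", out_path,
    ]


# Hypothetical file names and accession, for illustration only.
build = makeblastdb_cmd("wolbachia_refs.fasta", "wolbachia_ref")
search = magicblast_cmd("SRR0000000", "wolbachia_ref", "SRR0000000_hits.tsv")
print(shlex.join(build))
print(shlex.join(search))
```

The printed commands can be inspected before being handed to a shell or to `subprocess.run`; how hits are then thresholded to call an infection is not specified here and would need to follow the paper's methods.
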
Wolbachia sequences of known origin for phylogenetic analysis.
Wolbachia genes used as controls, listed with the host species from which each was isolated and the supergroup of the Wolbachia strain. These genes served as controls when constructing the phylogeny.
| Host Classification | Gene & Accession Number | |||||
|---|---|---|---|---|---|---|
| Phylum | Order | Species | Supergroup | Citation | ||
| Arthropoda | Isopoda |
|
| B | ||
| Hemiptera |
|
| B | |||
| Hemiptera |
|
| B | |||
| Diptera |
|
| A | |||
| Diptera |
|
| A | |||
| Diptera |
|
| A | |||
Species showing evidence of Wolbachia infection.
List of unique species (class; order) that tested positive for the presence of Wolbachia genes.
| Phylum | Class | Order | Species | Supergroup (this study) | Supergroup (previous study) | Citation | GenBank Accession numbers |
|---|---|---|---|---|---|---|---|
| Arthropoda | Insecta | Coleoptera | B | – | – | – | |
| A (but possible double infection) | A/B |
| |||||
| A | A |
| |||||
| B | – | – | – | ||||
| B | – | – | – | ||||
| Diptera | B | A |
| ||||
| A | A |
| |||||
| A/B | A/B |
| |||||
| A | A |
| |||||
| A | A |
| |||||
| A | A |
| |||||
| A | A | – | |||||
| Hemiptera | B | A/B |
| ||||
| B | B |
| |||||
| B | – | – | – | ||||
| B | B |
| |||||
| B | B |
| |||||
| A | A |
| |||||
| – | – | C Fromont, M Riegler and JM Cook (2015, unpublished data) |
| ||||
| – | – | C Fromont, M Riegler and JM Cook (2015, unpublished data) |
| ||||
| Hymenoptera | Acromyrmex echinatior | A | A |
| |||
| A | – |
| |||||
| – | – | – | – | ||||
| Cynipini sp. | A | A |
| ||||
| A | – | – | – | ||||
| B | A |
| |||||
| B | – | – | – | ||||
| A | A | – | |||||
| A | – |
| |||||
| B | B |
| |||||
| Lepidoptera | B | – | – | – | |||
| B | B |
| |||||
| B | – |
| |||||
| B | B |
|
Notes.
Species marked with an asterisk (*) have not previously been identified, according to our literature search, as harboring Wolbachia strains. The supergroup classification of the Wolbachia strain according to this study and to previous studies is listed if known.
Wolbachia genome assemblies
Information on Wolbachia draft genome assemblies. Expected host coverage is calculated as (total sequence data/host genome size). “Evidence of multiple infections” indicates whether or not the assembly contains signs pointing to multiple, distinct Wolbachia strains within the same biological host sample used for generating the sequence data (though some of these consisted of pooled individuals). BUSCO comp., BUSCO dup., BUSCO frag., and BUSCO missing refer to the number of BUSCO orthologs that were found to be complete and single copy, duplicated, fragmented, and missing from the Wolbachia assembly, out of 148 BUSCOs present in the Bacteria odb9 reference gene set. Grey rows at the bottom of the table were omitted from the whole-genome phylogenetic analysis because the assemblies appeared less complete (as indicated by missing BUSCO genes) or showed evidence of being chimeric or a mixture of two independent strains.
| Host species/ID | Description/common name | BioSample accession number | SRA accession numbers | Total seq. data (Gb) | Host genome size (ref.) | Expected host coverage (x) | | | | BUSCO comp. | BUSCO dup. | BUSCO frag. | BUSCO missing | Evidence of multiple infections? | Sample notes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | Mosquito | | | 9.1 | 280 Mb | 32 | 9.0 | 1.212 | 23.12 | 125 | 0 | 2 | 21 | No | |
| | Wasp | | | 8.7 | | | 16 | 1.249 | 9.5 | 125 | 0 | 5 | 18 | No | |
| | Wasp | | | 8.3 | | | 16 | 1.246 | 10.29 | 128 | 0 | 3 | 17 | No | |
| | Butterfly | | | 13.2 | | | 26 | 1.207 | 13.35 | 124 | 0 | 4 | 20 | No | Wild caught whole insect |
| | Western corn rootworm | | | 98.9 | 2.4 Gb (GCA_003013835.1) | 41 | 900 | 1.505 | 31 | 128 | 0 | 3 | 17 | No | 5 animals |
| | Western corn rootworm | | | 95.1 | 2.4 Gb (GCA_003013835.1) | 40 | 750 | 1.487 | 35.01 | 128 | 0 | 3 | 17 | No | 5 animals |
| | Western corn rootworm | | | 88.9 | 2.4 Gb (GCA_003013835.1) | 37 | 660 | 1.376 | 32.16 | 128 | 0 | 3 | 17 | No | 5 animals |
| | Wasp | | | 56.8 | 390 Mb (GCA_001412515.1) | 150 | 830 | 1.377 | 21.39 | 127 | 1 | 2 | 18 | No | Adults collected from |
| | Asian citrus psyllid | | | 25 | 490 Mb (GCA_000475195.1) | 51 | 180 | 1.379 | 25.7 | 128 | 0 | 2 | 18 | No | |
| | Asian citrus psyllid | | | 27.1 | 490 Mb (GCA_000475195.1) | 55 | 250 | 1.425 | 25.7 | 128 | 0 | 2 | 18 | No | |
| | Beetle | | | 24.8 | | | 63 | 1.698 | 10.59 | 124 | 0 | 5 | 19 | No | |
| | Gall wasp | | | 6.8 | | | 57 | 1.398 | 12.93 | 127 | 0 | 3 | 18 | No | |
| | Gall wasp | | | 7.1 | | | 53 | 1.382 | 12.13 | 121 | 0 | 3 | 24 | No | |
| | Fruit fly | | | 18.1 | ∼175 Mb | 100 | 1600 | 1.208 | 19.19 | 127 | 0 | 3 | 18 | No | FM7a-23229-hemizygous |
| | Fruit fly | | | 3.5 | ∼175 Mb | 20 | 21 | 1.198 | 13.83 | 125 | 0 | 4 | 19 | No | Haploid embryos; natural population |
| | Fruit fly | | | 25.3 | ∼150 Mb | 170 | 390 | 1.265 | 14.67 | 130 | 0 | 2 | 16 | No | |
| | Fruit fly | | | 23.4 | ∼150 Mb | 160 | 1100 | 1.294 | 15.57 | 130 | 0 | 2 | 16 | No | |
| | Fruit fly | | | 37 | ∼150 Mb | 250 | 1600 | 1.313 | 15.72 | 130 | 0 | 2 | 16 | No | |
| | Fruit fly | | | 22.6 | ∼170 for other | 130 | 150 | 1.306 | 22.94 | 129 | 0 | 3 | 16 | No | Female |
| | Fruit fly | | | 8 | ∼170 Mb | 47 | 42 | 1.254 | 14.75 | 128 | 0 | 2 | 18 | No | Pooled isofemale lines (11 lines, 4 females per line) |
| | Fruit fly | | | 4.4 | ∼170 Mb | 26 | 35 | 1.26 | 10.24 | 127 | 0 | 2 | 19 | No | Pooled isofemale lines (11 lines, 4 females per line) |
| | Rove beetle | | | 22.7 | | | 170 | 1.437 | 42.92 | 127 | 1 | 3 | 17 | No | Whole insect |
| | Water striders | | | 27.4 | 990 Mb (GCA_001010745.2) | 28 | 69 | 1.538 | 13.14 | 127 | 0 | 3 | 18 | No | Female, whole individuals, adults |
| | Water striders | | | 27.8 | 990 Mb (GCA_001010745.2) | 28 | 39 | 1.537 | 13.14 | 127 | 0 | 3 | 18 | No | Male, whole individuals, adults |
| | Glassy-winged sharpshooter (leafhopper) | | | 107.7 | 1.45 Gb (GCA_000696855.2) | 74 | 260 | 1.675 | 14.19 | 121 | 9 | 3 | 15 | Maybe: BUSCO duplications | Lab reared Florida-strain female |
| | Mealybug | | | 9.8 | 160 Mb (GCA_900064465.1) | 61 | 80 | 1.415 | 27.94 | 130 | 0 | 2 | 16 | No | |
| | Winter moth | | | 22.2 | 640 Mb (GCA_001266575.1) | 35 | 28 | 1.35 | 33.34 | 129 | 0 | 1 | 18 | No | Female; adult; head and thorax; wild caught individual |
| | Speckled wood butterfly | | | 9.8 | | | 138 | 1.282 | 83.56 | 129 | 0 | 2 | 17 | No | Whole adult, lab culture |
| | Gall wasp | | | 3.8 | | | 12 | 1.188 | 8.62 | 119 | 0 | 5 | 24 | No | |
| | Gall wasp | | | 3.8 | | | 12 | 1.174 | 8.05 | 114 | 0 | 9 | 25 | No | |
| | Comma butterfly | | | 6 | | | 61 | 1.463 | 22.09 | 129 | 0 | 1 | 18 | No | Whole adult, lab cultured |
| | Ant | | | 36.9 | 280 Mb (GCA_002006095.1 for congeners) | 130 | 195 | 1.245 | 15.91 | 127 | 0 | 3 | 18 | No | Adult female worker ant |
| | Apple maggot fly | | | 23.3 | 0.97 (C-value) | 25 | 270 | 1.314 | 13.52 | 127 | 0 | 3 | 18 | No | Single adult female fly |
| | Wasp | | | 68.6 | 190 Mb (GCA_000599845.3) | 360 | 50 | 1.097 | 51.37 | 127 | 0 | 3 | 18 | No | |
| | Ant | | | 13.2 | 300 Mb (GCA_000204515.1) | 44 | 56 | 1.611 | 4.66 | 104 | 0 | 8 | 36 | No | 1 male |
| | Beetle | | | 8.5 | | | 11 | 1.151 | 1.79 | 70 | 0 | 13 | 65 | No | Adult, whole body |
| | Wasp | | | 4.1 | | | 4.2 | 0.645 | 0.7 | 3 | 0 | 8 | 137 | No | |
| | Bean weevil | | | 32.1 | 0.75 (C-value) | 44 | 340 | 2.894 | 4.06 | 78 | 6 | 12 | 52 | Yes: bimodal coverage distribution; assembly size; BUSCO duplications | Male, head, thorax, feet |
| | Carpenter bee | | | 16 | 200 Mb (GCA_001652005.1) | 80 | 11 | 1.053 | 1.96 | 55 | 0 | 12 | 81 | No | 1 haploid male |
| Cynipini 1 | Oak gall wasp | | | 9.4 | | | 20 | 1.216 | 6.57 | 96 | 0 | 13 | 39 | No | |
| Cynipini 2 | Oak gall wasp | | | 7.3 | | | 17 | 1.182 | 3.32 | 82 | 0 | 21 | 45 | No | |
| | Domestic cochineal | SAMN02725055 | | 6.2 | 21.1 Mb (estimate given in from NCBI BioSample entry) | 290 | 110 | 2.563 | 5.95 | 47 | 80 | 2 | 19 | Yes: assembly size; BUSCO duplications; bimodal coverage distribution | Bulk sample of 50 Oaxacan Mexican grana |
| | Fruit fly | | | 3.4 | ∼175 Mb | 19 | 6.5 | 1.171 | 6.14 | 102 | 0 | 15 | 31 | No | Adult male whole body, wild caught from Africa |
| | Glassy-winged sharpshooter (leafhopper) | | | 39.5 | 1.45 Gb (GCA_000696855.2) | 27 | 64 | 1.803 | 17.02 | 109 | 23 | 3 | 13 | Maybe: BUSCO duplications | Lab-reared Florida-strain male |
| | Gall wasp | | | 3.4 | | | 7.9 | 0.998 | 2.19 | 63 | 0 | 16 | 69 | No | |
| | Gall wasp | | | 3.4 | | | 7.7 | 0.965 | 2.31 | 63 | 0 | 14 | 71 | No | |
| | Stink bug | | | 5.7 | | | 31 | 2.097 | 1.74 | 75 | 4 | 13 | 56 | Yes: assembly size; BUSCO duplications; possible bimodal coverage distribution | |
| | Fig psyllid | | | 0.9 | | | 9.9 | 1.171 | 2.32 | 76 | 0 | 6 | 66 | No | |
| | Fig psyllid | | | 0.8 | | | 12 | 1.238 | 2.68 | 77 | 0 | 14 | 57 | No | |
| | Psyllid | | | 1.1 | | | 5.7 | 0.364 | 0.76 | 0 | 0 | 3 | 145 | No | |
| | Tephritid fly | SAMN04977950 | | 132.9 | 1.1 Gb (GCA_001687245.1) | 120 | 1200 | 1.881 | 11.19 | 80 | 51 | 2 | 15 | Yes: bimodal coverage distribution; assembly size; BUSCO duplications | Single adult female fly |
Figure 1. Concatenated phylogeny.
Molecular phylogenetic analysis by maximum likelihood based on the concatenated dataset containing ftsZ and the groE operon (total of 1,381 nucleotide positions). Bold letters next to host species names indicate supergroup relationships of Wolbachia isolates identified in previous studies for each host. Asterisks indicate reference sequences (see also Table 2).
Figure 2. Whole-genome phylogeny.
Maximum likelihood phylogeny based on whole-genome sequence data of Wolbachia isolates assembled here and previously sequenced reference Wolbachia genomes (indicated by samples with associated accession numbers), with a total of 133,744 nucleotide positions. Numbers by nodes indicate bootstrap support based on 200 replicates. Bold letters next to host species names indicate supergroup relationships of Wolbachia isolates identified in previous studies for each host. Isolates with accession numbers listed represent reference genome sequences from other studies.