| Literature DB >> 27194801 |
Amelia R I Lindsey1, John H Werren2, Stephen Richards3, Richard Stouthamer4.
Abstract
Wolbachia is an intracellular symbiont of invertebrates responsible for inducing a wide variety of phenotypes in its host. These host-Wolbachia relationships span the continuum from reproductive parasitism to obligate mutualism, and provide a unique system to study genomic changes associated with the evolution of symbiosis. We present the genome sequence from a parthenogenesis-inducing Wolbachia strain (wTpre) infecting the minute parasitoid wasp Trichogramma pretiosum The wTpre genome is the most complete parthenogenesis-inducing Wolbachia genome available to date. We used comparative genomics across 16 Wolbachia strains, representing five supergroups, to identify a core Wolbachia genome of 496 sets of orthologous genes. Only 14 of these sets are unique to Wolbachia when compared to other bacteria from the Rickettsiales. We show that the B supergroup of Wolbachia, of which wTpre is a member, contains a significantly higher number of ankyrin repeat-containing genes than other supergroups. In the wTpre genome, there is evidence for truncation of the protein coding sequences in 20% of ORFs, mostly as a result of frameshift mutations. The wTpre strain represents a conversion from cytoplasmic incompatibility to a parthenogenesis-inducing lifestyle, and is required for reproduction in the Trichogramma host it infects. We hypothesize that the large number of coding frame truncations has accompanied the change in reproductive mode of the wTpre strain.Entities:
Keywords: Rickettsiales; Trichogramma; gene truncations; genome content; symbiosis
Mesh:
Substances:
Year: 2016 PMID: 27194801 PMCID: PMC4938664 DOI: 10.1534/g3.116.028449
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Wolbachia strains used in comparative and phylogenetic analyses
| Strain | Host | Supergroup | Size (bp) | ORFs | Reference | Accession Number | BUSCO Score |
|---|---|---|---|---|---|---|---|
| A | 1,019,687 | 1378 | AWUH00000000 | C: 77.5% [D: 6.4%], F: 5%, M: 17.5%, n: 40 | |||
| A | 1,295,804 | 1342 | CP003884 | C: 85% [D: 2.9%], F: 5%, M: 10%, n: 40 | |||
| A | 1,267,782 | 1401 | AE017196 | C: 87.5% [D: 2.9%], F: 2.5%, M: 10%, n: 40 | |||
| A | 1,445,873 | 1493 | CP001391 | C: 82.5% [D: 3%], F: 5%, M: 12.5%, n: 40 | |||
| A | 1,415,350 | 1528 | CAOU00000000 | C: 87.5% [D: 2.9%], F: 2.5%, M: 10%, n: 40 | |||
| B | 1,162,431 | 1187 | CAGB00000000 | C: 82.5% [D: 3%], F: 2.5%, M: 15%, n: 40 | |||
| B | 1,377,933 | 1369 | CAOH00000000 | C: 80% [D: 3.1%], F: 5%, M: 15%, n: 40 | |||
| B | 1,240,904 | 1250 | AMZJ00000000 | C: 80% [D: 3.1%], F: 2.5%, M: 17.5%, n: 40 | |||
| B | 1,301,823 | 1317 | CP003883 | C: 82.5% [D: 3%], F: 2.5%, M: 15%, n: 40 | |||
| B | 1,482,355 | 1461 | AM999887 | C: 80% [D: 3.1%], F: 5%, M: 15%, n: 40 | |||
| B | 1,542,137 | 1556 | ABZA00000000 | C: 75% [D: 3.3%], F: 2.5%, M: 22.5%, n: 40 | |||
| B | 1,340,443 | 1340 | HG428761 | C: 80% [D: 3.1%], F: 2.5%, M: 17.5%, n: 40 | |||
| B | 1,133,709 | 1405 | This study | LKEQ00000000 | C: 77.5% [D: 3.2%], F: 5%, M: 17.5%, n: 40 | ||
| B | 1,107,643 | 1245 | AERW00000000 | C: 77.5% [D: 3.2%], F: 2.5%, M: 20%, n: 40 | |||
| C | 957,990 | 1272 | HE660029 | C: 75% [D: 3.3%], F: 2.5%, M: 22.5%, n: 40 | |||
| D | 1,080,084 | 1339 | AE017321 | C: 82.5% [D: 3%], F: 5%, M: 12.5%, n: 40 | |||
| D | 1,052,327 | 2144 | ADHD00000000 | C: 45% [D: 0%], F: 20%, M: 35%, n: 40 | |||
| F | 1,250,060 | 1357 | AP013028 | C: 72.5% [D: 3.4%], F: 2.5%, M: 25%, n: 40 |
ORFs, open reading frames; BUSCO, benchmarking universal single-copy orthologs; C, complete; D, duplicated; F, fragmented; M, missing; n, number of genes used.
BUSCO scores in standard BUSCO notation.
Single-scaffold assembly.
Complete assembly.
Figure 1Phylogenetic relationships of Wolbachia. (A) Phylogeny inferred with RAxML from a nucleotide supermatrix of the five Wolbachia MLST (multi locus sequence typing) genes using 1000 bootstrap replicates. Supergroups are shown in colored boxes, and labeled in the top right corner of each box. Symbols next to taxa denote Wolbachia host and phenotypes. Colors at nodes indicate bootstrap values. Anaplasma marginale str. Florida “Ama” is the outgroup. (B) Cladogram of Wolbachia inferred with RAxML using the same methods as in Figure 1A, but analyzing only the strains with sequenced genomes. Numbers in parentheses next to taxon names represent, on the left, the number of genes in the genome, and on the right, the number of genes unique to that genome. Numbers corresponding to points on internodes represent, on the left, the number of core cluster of orthologous genes (COGs) for that clade, and on the right, the number of COGs unique to that clade. Numbers in square brackets represent alternative core and unique genome sizes for the respective clade, calculated without wGmm. Colored boxes denote supergroups, with labels in the top right corner.
Figure 2Gene content of Wolbachia. (A) The numbers of genes in each role category, for each Wolbachia genome are plotted with open circles and correspond to the left axis. Role categories are sorted by standard deviation, represented by the red triangles, and the right axis. (B) Number of ankyrin repeat-containing genes per genome, by supergroup. The B supergroup has a significantly higher number of ankyrin genes than the A supergroup (Mann–Whitney U-test, P = 0.003).
Figure 3Principal components (PC) analysis of Wolbachia genomes based on proportion of annotated genes devoted to each role category, with wTpre and closest neighbor circled. (A) All annotated role categories analyzed. The strongest factor loadings along PC1 (46.3% of total variance) and PC2 (34.0% of total variance), respectively, are energy metabolism and regulatory functions. (B) Mobile and extrachromosomal elements functions category excluded. The strongest factor loadings along PC1 (38.0% of total variance) and PC2 (24.0% of total variance), respectively, are cellular processes and DNA metabolism.
Classification of wTpre “unique genes”
| “Unique Genes” with Evidence of Truncation | “Unique Genes” Without Evidence of Truncation | ||
|---|---|---|---|
| Nonsense mutation | 26 | No match to other | 115 |
| Postnonsense | 76 | Low identity score of alignment | 7 |
| Frameshift mutation | 30 | Homolog is shorter than | 11 |
| Postframeshift | 139 | No up/downstream homology | 68 |
| Poststart codon mutation | 10 | ||
| Total truncations | 281 | Total excluded | 201 |
Figure 4Evidence for truncation in wTpre genes. (A) Length of wTpre “unique genes” and their homologous genes from other Wolbachia genomes. There is a significant difference in the size of the wTpre unique gene set as compared to their homologous counterparts (Mann–Whitney U-test, P < 0.0001). (B) Schematic representation of wTpre coding frame truncation and fragmentation. The wTpre “unique genes,” wTpre_380, wTpre_381, and wTpre_382, are homologous to sequential locations in the WD0152 gene from wPip_Pel. A frameshift mutation at base pair 421 in wTpre_380 resulted in a premature stop codon and the subsequent annotation of downstream ORFs (open reading frames), or “postframeshift” ORFs.