Literature DB >> 32386316

The Protector within: Comparative Genomics of APSE Phages across Aphids Reveals Rampant Recombination and Diverse Toxin Arsenals.

Jeff Rouïl1, Emmanuelle Jousselin1, Armelle Coeur d'acier1, Corinne Cruaud2, Alejandro Manzano-Marín3.   

Abstract

Phages can fundamentally alter the physiology and metabolism of their hosts. Although these phages are ubiquitous in the bacterial world, they have seldom been described among endosymbiotic bacteria. One notable exception is the APSE phage that is found associated with the gammaproteobacterial Hamiltonella defensa, hosted by several insect species. This secondary facultative endosymbiont is not necessary for the survival of its hosts but can infect certain individuals or even whole populations. Its infection in aphids is often associated with protection against parasitoid wasps. This protective phenotype has actually been linked to the infection of the symbiont strain with an APSE, which carries a toxin cassette that varies among so-called "types." In the present work, we seek to expand our understanding of the diversity of APSE phages as well as the relations of their Hamiltonella hosts. For this, we assembled and annotated the full genomes of 16 APSE phages infecting Hamiltonella symbionts across ten insect species. Molecular and phylogenetic analyses suggest that recombination has occurred repeatedly among lineages. Comparative genomics of the phage genomes revealed two variable regions that are useful for phage typing. Additionally, we find that mobile elements could play a role in the acquisition of new genes in the toxin cassette. Altogether, we provide an unprecedented view of APSE diversity and their genome evolution across aphids. This genomic investigation will provide a valuable resource for the design and interpretation of experiments aiming at understanding the protective phenotype these phages confer to their insect hosts.
© The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  zzm321990 Hamiltonella defensazzm321990 ; APSE phage; endosymbiont; parasitoid defense; toxin

Mesh:

Year:  2020        PMID: 32386316      PMCID: PMC7313666          DOI: 10.1093/gbe/evaa089

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

Bacteriophages, the viruses that infect and replicate in bacteria, exhibit genomes that are highly dynamic (reviewed by Dion et al. [2020]). They undergo rampant recombination and constant rearrangements. They are also known to play a prominent role in horizontal gene transfers between bacteria (reviewed by Touchon et al. [2017]). As such, they can be a source of innovation for the bacteria that are infected as well as for the eukaryotic organisms that carry these bacteria. Aphids (Hemiptera: Aphididae) can host a wide variety of facultative endosymbiotic bacteria (Guo et al. 2017), in addition to their obligate nutritional endosymbiont Buchnera aphidicola. These bacteria are not necessary for reproduction nor survival but can endow their host with a variety of beneficial effects, ranging from survival after heat stress to defense against pathogenic fungi and parasitoid wasps (reviewed by Oliver et al. [2010]). Although bacteriophages are ubiquitous wherever bacteria exist, their discovery is scarce among the endosymbiotic bacteria of insects. One such case is the APSE (for  secondary endosymbiont) phage: a lambdoid phage with an isometric head and a short tail (resembling species in the family Podoviridae) which was originally isolated from an Acyrthosiphon pisum aphid infected by an unidentified secondary endosymbiont (van der Wilk et al. 1999). This phage has been found so far associated with the gammaproteobacterial symbiont species Hamiltonella defensa: an endosymbiont lineage that infects several hemipteran species. In the case of aphids, it is clear that toxin genes carried by this phage confer a protective phenotype against parasitoid wasps by disabling wasp development (Oliver et al. 2009; Brandt et al. 2017). Microscopic analysis and genome sequencing of this phage, first termed APSE-1 (for bacteriophage 1 from APSE), revealed a circularly permuted and terminally redundant double-stranded DNA molecule of 36,524 base pairs (bp) (van der Wilk et al. 1999). APSE-1 encodes for a protein product showing low sequence similarity with the shiga-like toxin B subunit of several bacteriophages. Further whole-genome sequencing of a second APSE genome from A. pisum (termed APSE-2) as well as specific regions of other APSE phages revealed variation across the so-called “types” (Moran et al. 2005; Degnan and Moran 2008). These “types” are characterized by carrying a unique set of genes including (putative) toxins from three protein families: shiga-like toxin, cytolethal distending toxin (CdtB), and YD-repeat toxin. So far, seven types of APSE have been described (table 1). They are associated with Hamiltonella strains from six aphid species (from two subfamilies) and the whitefly Bemisia tabaci.
Table 1

Previously Characterized APSE Toxins

Insect Host Hamiltonella StrainAPSE TypeToxinReference
Acyrthosiphon pisum NAAPSE-1Shigga-like van der Wilk et al. (1999)
Acyrthosiphon pisum 5ATAPSE-2CdtB Moran et al. (2005); Degnan and Moran (2008)
Acyrthosiphon pisum NY26APSE-2CdtB Brandt et al. (2017)
Acyrthosiphon pisum 82BAPSE-2CdtB Moran et al. (2005); Degnan and Moran (2008)
Acyrthosiphon pisum ZA17APSE-2aCdtB Martinez et al. (2014)
Acyrthosiphon pisum WA4APSE-2CdtB Martinez et al. (2014)
Acyrthosiphon pisum A1A, A2F, A2HAPSE-3YD-repeat Degnan and Moran (2008)
Acyrthosiphon pisum AS3APSE-3YD-repeat Oliver et al. (2009)
Acyrthosiphon pisum AS5APSE-3YD-repeat Oliver et al. (2009)
Acyrthosiphon pisum R7APSE-3YD-repeat Oliver et al. (2009)
Aphis fabae H76APSE-3YD-repeat Dennis et al. (2017)
Aphis craccivora 5ATacAPSE-4bShigga-like Degnan and Moran (2008)
Uroleucon rudbeckiae NAAPSE-5bShigga-like Degnan and Moran (2008)
Chaitophorus sp.N4APSE-6CdtB Degnan and Moran (2008)
Aphis fabae H402APSE-6CdtB Dennis et al. (2017)
Bemisia tabaci NAAPSE-7CdtB Degnan and Moran (2008)

Note.—Summary of the previous works characterizing APSE phage toxins. NA, not available.

Later referred to as APSE-8 (Brandt et al. 2017).

Variants of APSE-1, see Degnan and Moran (2008).

Previously Characterized APSE Toxins Note.—Summary of the previous works characterizing APSE phage toxins. NA, not available. Later referred to as APSE-8 (Brandt et al. 2017). Variants of APSE-1, see Degnan and Moran (2008). Differences in the level of protection have been observed in different associations of aphids species/genotypes and Hamiltonella–APSE combinations. For example, experimental work has shown that Hamiltonella endosymbionts can confer varying levels of resistance against different parasitoid wasp species, and even no defense at all (Asplen et al. 2014; Lenhart and White 2017; Hopper et al. 2018; Leybourne et al. 2020). It has been shown that Hamiltonella carrying APSE-3 confer a strong defensive phenotype, whereas the ones carrying APSE-2 generally confer moderate defense against parasitoids (Oliver et al. 2009; Martinez et al. 2014; Brandt et al. 2017). This protection is indeed largely dependent on the APSE “type” (Degnan and Moran 2008; Martinez et al. 2014), and experiments have conclusively demonstrated that APSE-3, carried by Hamiltonella defensa strain AS3, can confer its protective phenotype when transplanted to the naturally occurring APSE-free and nonprotective strain A2C (Brandt et al. 2017). The beneficial fitness effect of APSE-bearing Hamiltonella has been shown to be conditional to the presence of APSE and the environmental pressure of parasitoid infection. In one study, the authors found that, although apparently no cost to infection with Hamiltonella could be detected in aphids not exposed to parasitism, a significant decline in the frequency of Hamiltonella was observed (Oliver et al. 2008). In another study, the loss of APSE led to an increase in intra-aphid Hamiltonella abundance and was associated to “severe” deleterious effects on aphid fitness (Weldon et al. 2013). In the current work, we sought to explore and expand our knowledge on the diversity of APSE phages across aphid species. For this purpose, we assembled and annotated full genomes for 16 APSE phages that infect Hamiltonella endosymbionts across ten insect species, including several aphids (from five subfamilies) and the whitefly B. tabaci. We performed phylogenetic analyses on Hamiltonella and its associated APSE phages in order to understand the diversity and evolutionary trajectory of this defensive phage. Through comparative genomics, we investigated genome rearrangement and more specifically look at the evolution of the toxin cassette. We found evidence suggesting that recombination takes place among APSE types in aphids and that most variation in gene content is observed in two main regions of their genomes. Finally, analysis of the toxin cassettes revealed that mobile elements might be involved in at least some of the variation observed in this symbiotically relevant genomic region.

Materials and Methods

Aphid Collection, DNA Extraction, and Sequencing

Complete genome sequences of APSE phages were retrieved from the NCBI database for APSE-1 (also known as Hamiltonella virus APSE1) and Hamiltonella strain 5AT. We then used the genome data available for five Hamiltonella strains from the aphid A. pisum and two from the whitefly B. tabaci to extract integrated APSE phages. Additionally, we gathered sequencing data from eight different aphid species (from four genera belonging to four subfamilies) collected between the years 2008 and 2016: some of which were previously known to host Hamiltonella (Meseguer et al. 2017). Specimens were kept in 70% ethanol at 6 °C. For whole-genome sequencing, we prepared DNA samples enriched with bacteria following a slightly modified version of the protocol by Charles and Ishikawa (1999) as described in Jousselin et al. (2016). For this filtration protocol, ∼15 aphids from one colony were pooled together. Extracted DNA was used to prepare custom paired-end libraries in Genoscope as in Manzano-Marín et al. (2018). These libraries were sequenced using either 151- or 251-bp paired-end reads chemistry on a HiSeq2500 Illumina sequencer. For full details on specimen collection, species identification, and accession numbers for the samples from which APSE phages were extracted, see supplementary table S1, Supplementary Material online.

Hamiltonella and APSE Genome Assembly and Annotation

First, we scanned the aforementioned Hamiltonella assembled genomes using the sequence of the phage attachment site (also known as attP) and the last 62 bp with a 90% identity threshold in UGENE v1.29.0 (Okonechnikov et al. 2012). For noncircularized genomes, we first scanned the contigs using BlastN (Altschul 1997) for these sequences in order to identify the contig or scaffold where the putative APSE phage resided in. Illumina sequences from the eight newly sampled aphid species were first right-tail clipped (requiring a minimum quality threshold of 20 and a minimum length of 75 bp) using FASTX-Toolkit v0.0.14 (http://hannonlab.cshl.edu/fastx_toolkit/, last accessed March 4, 2020). Additionally, PRINSEQ v0.20.4 (Schmieder and Edwards 2011) was used to remove reads containing undefined nucleotides as well as those left without a pair after the filtering and clipping process. The resulting reads were assembled using SPAdes v3.11.1 (Bankevich et al. 2012) with the --only-assembler option and k-mer sizes of 33, 55, 77, 99, and 127. From the resulting contigs, those that were shorter than 200 bp were dropped. The remaining contigs were binned using results from a BlastX (Altschul 1997) search (best hit per contig) against a database consisting of the Pea aphid’s proteome and a selection of aphid’s symbiotic bacteria proteomes (supplementary table S2, Supplementary Material online). When no genome was available for a specific bacterial lineage, closely related bacteria were used. The assigned contigs were manually screened using the BlastX web server (searching against the nr database) to ensure correct assignment. This binning process confirmed the presence of Buchnera, Hamiltonella, and its corresponding APSE phage. APSE was confidently assigned to Hamiltonella by checking for chimeric APSE–Hamiltonella chromosome contig ends in the corresponding insertion sites and/or by checking that no other facultative symbionts were sequenced in the sample. The resulting contigs were then used as reference for read mapping and individual genome assembly using SPAdes, as described above. Hamiltonella draft assemblies included APSE-assigned reads. APSE individual assemblies were visually screened for inconsistencies using Tablet (Milne et al. 2013). The resulting APSE genomes underwent a draft annotation using Prokka 1.14.4 (Seemann 2014). This was followed by noncoding RNA prediction using infernal v1.1.2 (Nawrocki and Eddy 2013) (against the Rfam v14.1 database, Kalvari, Argasinska, et al. 2018; Kalvari, Nawrocki, et al. 2018) and tRNAscan-SE v2.0.5 (Chan et al. 2019). We then performed manual curation of the annotations on UGENE (Okonechnikov et al. 2012) through online BlastX searches of the intergenic regions, open reading frame (ORF) finding feature in UGENE, and through DELTA-BLAST (Boratyn et al. 2012) searches of the predicted ORFs against NCBI’s nr database and the InterProScan v5 web server (Jones et al. 2014; Mitchell et al. 2019). SignalP v5.0 (Almagro Armenteros et al. 2019) and Phobius v1.01 (Kall et al. 2005) were used to predict signal peptides. ORFs were considered to be putative functional proteins (and thus not pseudogenes) if seemingly essential domains for the function were found or if the ORFs displayed truncations but retained identifiable domains. Short ORFs (≤300) that were conserved but consistently pseudogenized in all but one APSE genome were annotated as miscellaneous features. The origin of replication was determined using originX (Worning et al. 2006). Once the first APSE genome was curated, we used these proteins as a “genus”-specific database in Prokka to annotate the subsequent genomes and iteratively added novel proteins to the database. Codon usage was calculated separately for closed high-quality Hamiltonella–APSE pairs using the Cousin web server (Bourret et al. 2019). Finally, phage-like regions were searched in Arsenophonus spp. genomes using the web server for PHASTER (Zhou et al. 2011; Arndt et al. 2016). The retrieved phage-like regions were then manually inspected in UGENE, and identity of genes versus those of APSE was assessed using the online web server of BLAST. The EMBOSS suite program primersearch (Rice et al. 2000) was used to predict possible amplification targets in the Arsenophonus genomes using the primers reported in Duron (2014).

Phylogenetic and Recombination Analyses

In order to reconstruct the phylogeny of Hamiltonella endosymbionts, we used seven gene sequences (accD, dnaA, gyrB, hrpA, murE, ptsI, and recJ) following Degnan and Moran (2008). Genes sequences were gathered from the Supplementary data for Manzano-Marín et al. (2020) (https://doi.org/10.5281/zenodo.2566355, last accessed March 4, 2020) then used to identify orthologs in newly sequenced Hamiltonella strains using BlastX. Due to the low coverage for Hamiltonella draft genomes for two samples (3702 and 3692), we were unable to recover sequences for all seven genes used for phylogenetic inference, and thus these specimens were excluded from the Hamiltonella phylogenetic analysis. For each gene, sequence alignments were conducted using MUSCLE v.3.8.31 (Edgar 2004) and visually checked using SeaView v4 (Gouy et al. 2010). We then removed divergent and ambiguously aligned blocks using Gblocks v0.91b (Talavera and Castresana 2007). A partitioned scheme was selected using PartitionFinder v2.1.1 (Lanfear et al. 2017) with one data block defined for each codon position (first, second, and third) in each gene. The best model of evolution of each partition was selected among those available in MrBayes, using the Bayesian Information Criterion metric under a greedy algorithm. Finally, we concatenated the resulting alignments and ran a Bayesian phylogenetic inference using the GTR + I + G model in MrBayes v3.2.7 (Ronquist et al. 2012) running two independent analyses with four chains each for 1,500,000 generations and checked for convergence. For performing both phylogenetic inferences and analyzing the genetic differences across APSE, we first ran an orthologous protein clustering analysis using OrthoMCL v2.0.9 (Li 2003; Chen et al. 2007) on the full predicted proteome for the phage genomes (supplementary table S3, Supplementary Material online). We then extracted the single-copy core proteins for phylogenetic reconstruction (30 protein groups). Sequences were aligned using MAFFT v7.450 (maxiterate 1,000 localpair) (Katoh and Standley 2013). Divergent and ambiguously aligned blocks were removed using Gblocks. Substitution model was selected using ModelTest-NG v0.1.6 (Darriba et al. 2020) using the Akaike information criterion criterium. Bayesian phylogenetic reconstruction was conducted with MrBayes as described above by running chains for 300,000 generations and checking for convergence. Given prior evidence for intragenic recombination in APSE proteins (Degnan and Moran 2007), we constructed a second data set removing all genes where recombination has putatively occurred. Recombination was tested for running PhiPack (Bruen et al. 2006) on the aligned proteins and results of these analyses, for the putative nonrecombining proteins based on , can be found in supplementary table S4, Supplementary Material online. Bayesian phylogenetic reconstruction was ran as described above. To infer the relationships of the 14 toxin-cassette- and lytic-region-proteins (lysozyme and holins) in APSE genomes, we searched for similar sequences using the online BlastP web server versus the NCBI’s nr database and selected the top 50 nonredundant hits. These were filtered for a minimum of 70% query coverage and a 35% sequence identity. We aligned each of them using MAFFT and manually removed divergent and ambiguously aligned blocks. For each gene, model selection was done with ModelTest-NG. Bayesian phylogenetic inference was run for 300,000 generations in MrBayes as described above. All resulting trees were visualized and exported with FigTree v1.4.1 (http://tree.bio.ed.ac.uk/software/figtree/, last accessed March 4, 2020). All files used for phylogenetic analyses as well as PHASTER phage search results and comparative genomics files used in this study are available in https://doi.org/10.5281/zenodo.3764739 (last accessed March 5, 2020).

Results and Discussion

APSE Phage Genomes

We successfully de novo assembled genomes for eight complete APSE phages from eight different aphid species and extracted an additional eight from both closed and draft Hamiltonella endosymbionts from the pea aphid A. pisum and the whitefly B. tabaci (table 2). Additionally, we recovered draft genomes for all eight Hamiltonella strains hosting the newly sequenced APSE phages. All recovered phage genomes have a genome size between 33,476 and 41,343 bp. They have a very conserved G + C content ranging between 42.81% and 45.04% and code for an average of 38 coding sequences (CDSs) with 3 to no pseudogenes (table 2). Most APSE phages preserve an intact tRNA-Lys-(UC)UU, which has been lost in 5D and ZA17, both of which still keep an internally degraded tRNA pseudogene. An analysis of 17 bacterial hosts and 37 phage genomes found that the presence of tRNAs in these phages tended to correspond to codons that were highly used by the phage’s protein coding genes but were rare in the host’s (Bailly-Bechet et al. 2007). However, we found virtually no difference in the lysin codon usage between pairs of Hamiltonella–APSE genomes (supplementary table S5, Supplementary Material online). Nonetheless, we found that the Hamiltonella strain 5AT only codes for a tRNA-Lys-CUU, following the pseudogenization of the conserved tRNA-Lys-UUU, which recognizes the most commonly used codon for lysine. Conversely, its associated APSE preserves its own tRNA-Lys-UUU. This suggests that the presence of a tRNA-Lys-UUU in the APSE phage could drive the loss of this tRNA in the bacterial host, which could in turn provide a selective advantage to carrying this phage.
Table 2

Genome Assembly and Annotation Statistics

Insect Host Hamiltonella
APSE
StrainCoverageGenome SizeCoverageG + C ContentCDSs(ψ)ncRNAs
Bemisia tabaci MEAM1aNA38,988NA43.3639(2)4
Bemisia tabaci MED-Q1aNA38,949NA43.3439(2)4
Cinara confinis 2801b178×38,9491,638×43.9438(2)3
Cinara sp. 30463046b37,53138×43.9638(2)3
Cinara pinimaritimae 2836b27×35,191597×43.8438(0)3
Cinara cuneomaculata 2628b777×36,0946,498×43.8837(1)3
Drepanosiphum platanoidis 3702b36,931111×43.8638(2)3
Eriosoma grossulariae 3692b38,42038×42.9042(3)3
Acyrthosiphon pisum 5ATa28.7×39,867NA42.9141(2)3
Acyrthosiphon pisum NY26a643×39,887NA42.8641(2)3
Acyrthosiphon pisum 5DaNA39,146NA42.8141(2)3
Acyrthosiphon pisum ZA17a567×39,145NA42.8141(2)3
Acyrthosiphon pisum APSE1cNA36,524NA43.8937(2)3
Acyrthosiphon pisum MI47aNA36,522NA43.8837(2)3
Periphyllus testudinaceus 2671b41,34357×44.3738(0)3
Acyrthosiphon pisum AS3a535×38,992NA45.0436(0)3
Cinara watanabei 3293b194×33,47625,300×44.4634(2)3

Note.—Assembly and genome statistics for complete APSE genomes and their Hamiltonella hosts. NA, not available.

Extracted from publicly available Hamiltonella genomes.

Sequenced in this study.

Phage isolated from an unidentified secondary symbiont of Acyrthosiphon pisum (van der Wilk et al. 1999).

Genome Assembly and Annotation Statistics Note.—Assembly and genome statistics for complete APSE genomes and their Hamiltonella hosts. NA, not available. Extracted from publicly available Hamiltonella genomes. Sequenced in this study. Phage isolated from an unidentified secondary symbiont of Acyrthosiphon pisum (van der Wilk et al. 1999). All APSE have two noncoding RNAs (ncRNAs) that code for small bacterial RNAs (sRNA): a c4 antisense family sRNA and an isrK Hfq binding family sRNA. In the case of the former, a c4 sRNA has been identified in P1 and P7 phages from Escherichia coli as a regulator of the ant gene, an antirepressor, through the binding to complementary regions in the upstream region of this gene (Citron and Schuster 1990). Through the analysis of the putative so-called a′ and b′ antisense regions of the c4 sRNA, we found that they consistently matched the 3′-end region of the distant DNA polymerase gene and the upstream noncoding region of the adjacent BRO family, N-terminal domain-containing protein. Three BRO family proteins (BRO-A, BRO-C, and BRO-D) from the Bombyx mori nucleopolyhedrovirus (BmNPV) have been suggested to influence host DNA replication and/or transcription, following the finding that they are able to bind DNA from the host with a stronger affinity for single-stranded than for double-stranded DNA (Zemskov et al. 2000). Regarding IsrK, its expression pattern in Salmonella typhimurium suggested that this sRNA might be involved in the regulation of virulence (Padalon-Brauch et al. 2008). Lastly, both APSE phages harbored by Hamiltonella infecting B. tabaci MEAM1 and MED-Q1 biotypes code for an additional sRNA-Xcc1 family sRNA upstream of a protein of unknown function (ORF6N domain-containing protein) also unique to these APSE. In Xanthomonas campestris pv. vesicatoria, sRNA-Xcc1 is under the positive control of two important virulence regulators, suggesting that they might play a role in pathogenesis (Chen et al. 2011). All this evidence points toward an important regulatory function of these sRNAs in APSE and highlight them as targets to study their downstream regulatory effects in both APSE and Hamiltonella gene expression.

APSE Gene Content Variation and Toxin Cassettes

APSE genomes were highly conserved in both gene content, order, and sequence identity (supplementary fig. S1, table S3, and file S3, Supplementary Material online). The most variable genes, in terms of sequence identity, across APSE were those of a BRO family, N-terminal domain-containing protein (orthogroup APSEcp_004), the DNA polymerase (APSEcp_008), the lyzozyme (APSElp_003), the small subunit of the terminase (APSEcp_020), and the tail proteins (needle, spike, and fiber assembly; APSEcp_028 and APSEcp_033-34). Regarding the DNA polymerase gene, we observed differential loss of the intein domain (a protein intron). The lyzozyme gene was by far the one that showed the most sequence divergence. A phylogenetic analysis suggested two well supported lyzozyme phylogroups (so-called “P13” and “F,” supplementary fig. S2, Supplementary Material online), consistent with previous findings (Degnan and Moran 2008). From our analyses, “P13” is restricted to APSE-7 and APSE-1 (including the APSE-1 variants APSE-4 and APSE-5). One marked difference in CDS content between APSE harbored by Hamiltonella symbionts of aphids and those harbored by the whitefly B. tabaci was in the protein flanked by the BRO family, N-terminal domain-containing protein and the putative P-loop NTPase. Although all APSE from aphid-infecting Hamiltonella code for a putative phage regulatory protein (Rha family), those infecting the whitefly code for an AntA/AntB antirepressor family protein. To our knowledge, the function of both of these proteins is not well understood, and thus the significance of this difference remains to be studied. Altogether, we identified two main regions of genomic variation in APSE phages and propose an APSE classification based on the composition of these two variable sequence stretches based on Degnan and Moran (2008), and keeping the main type numbering (X), with a slight revision (X.Yz) based on our findings (fig. 1): The first variable region (X) is the toxin cassette, with small variations within the types (Y), and the second one is found around the vicinity of the DNA polymerase gene (z). In regards to the toxin cassettes, we found five different “types” or “gene sets” represented in the newly assembled APSE genomes. We found seven phages that could be classified within APSE-7. These were associated with Hamiltonella from two whiteflies and five species of aphids (from two subfamilies). We found that although the CdtB family protein was conserved across APSE-7, its companion hypothetical protein was not. Both proteins in the cassette showed identifiable signal peptides, suggesting their export. The CdtB family protein is also present in APSE-2; however, the structure of the cassette is unlike that of APSE-7. It codes for an additional two putative toxin genes, and in the case of APSE-2.2 subtypes, two flanking hypothetical proteins, all with signal peptides. In the case of APSE-2.1a, we found that the hypothetical proteins flanking CdtB were missing. Nonetheless, we found a different hypothetical protein (containing a signal peptide) flanked by insertion sequence pseudogenes. This suggests that, at least in this case, the new member of the toxin cassette was likely mobilized by the action of transposable elements. As in a previous study by Degnan and Moran (2008), we found that all APSE-2 and -7 types had both holin family genes (lambda and HP1) followed by a different lysozyme gene phylotype, so-called “F” and “P13,” respectively. The APSE-1 type was represented by two nearly identical APSE genomes, both associated with Hamiltonella strains from A. pisum. Both presented the same four hypothetical proteins, including two previously identified as putative subunits of a shiga-like toxin (van der Wilk et al. 1999; Degnan and Moran 2008). APSE-3 types were found to be associated with Hamiltonella hosted by two distantly related aphid species, A. pisum and Periphyllus testudinaceus. They showed a very conserved cassette, encoding for a hypothetical protein, with no identifiable signal peptide, and an RHS-repeat putative toxin (previously referred to as “YD-repeat-containing” by Degnan and Moran [2008]) with a signal peptide. Finally, the cassette found in 3293 (Hamiltonella from Cinara watanabei) represents a completely novel type, which we classified as APSE-8. This toxin cassette encodes for a single putative virulence-associated protein with no recognizable signal peptide. Previous studies have classified the APSE-2 variant found in Hamiltonella strain ZA17 as APSE-8 (Brandt et al. 2017; Doremus and Oliver 2017; Chevignon et al. 2018; Patel et al. 2019). However, we found that the only difference in its toxin cassette was the pseudogenization of the first putative toxin gene within this region when compared with APSE-2 5AT and NY26. Therefore, we have considered it to be a subtype of APSE-2, in agreement with Martinez et al. (2014).
Fig. 1

Variable regions in APSE genomes. Genome synteny plots of the two variable regions across APSE phages. On the bottom left, a color key for the features shown in the plot. On the right, proposed classification of APSE in types and subtypes based on the variable regions composition.

Variable regions in APSE genomes. Genome synteny plots of the two variable regions across APSE phages. On the bottom left, a color key for the features shown in the plot. On the right, proposed classification of APSE in types and subtypes based on the variable regions composition. Regarding the second variable region, the main differences laid in the presence/absence of the KilA-N domain-containing protein and its companion hypothetical protein, a putative transcriptional regulator upstream of the DNA polymerase gene. How this changes impact the phage fitness and phenotype remains to be investigated. Although most genes in the cassette had only very distant or no affiliations to genes in the databases, both the RHS-repeat and CdtB toxins had many close hits in the NCBI’s nr database (see Materials and Methods). In both cases, they formed a well-supported monophyletic group (fig. 2). As aforementioned, the RHS-repeat toxin is, to the best of our knowledge, restricted to APSE-3 types. On the contrary, the CdtB protein is actually present in several APSE phages, namely APSE-7 and APSE-2. The CdtB proteins encoded in the two different APSE types do not form distinct subclades, rather the proteins of APSE-2 are nested within CdtB from APSE-7 (fig. 2). Due to previous evidence for intragenic recombination (Degnan and Moran 2007), we ran a recombination test using PhiPack (Bruen et al. 2006) and found no significant evidence for recombination (P value = 7.02e-02). However, NSS- and Max -tests did found significant recombination among CdtB genes (P value = 4.00e-03 and P value = 2.00e-03, respectively), and thus, the relationships among APSE-encoded CdtB proteins should be interpreted with caution.
Fig. 2

APSE toxin gene phylogenies. Bayesian phylogeny of (A) RHS-repeat toxins and (B) CdtB proteins. Numbers at nodes represent Bayesian posterior probabilities. “*” = 1. Phylogenies were midpoint rooted.

APSE toxin gene phylogenies. Bayesian phylogeny of (A) RHS-repeat toxins and (B) CdtB proteins. Numbers at nodes represent Bayesian posterior probabilities. “*” = 1. Phylogenies were midpoint rooted. In both cases, these proteins were closely related to CdtB encoded by Arsenophonus spp. hosted by two whitefly species: Aleurodicus floccissimus and B. tabaci. Additionally, another four proteins in the toxin cassettes had high similarity and a close phylogenetic relation to proteins in Arsenophonus spp. (orthogroups APSEtp_002, APSEtp_007, APSEtp_013, and APSEtp_014). Previous analyses have identified APSE-like DNA polymerase genes in Arsenophonus symbionts of whiteflies, aphids, louse flies, bat flies, and psyllids (Hansen et al. 2007; Duron 2014). The diagnostic polymerase chain reactions used in these studies targeted the so-called P45 (DNA polymerase) and P3 (virulence-associated protein E family protein). Analysis of three available high-quality Arsenophonus genomes (Arsenophonus nasoniae strain FIN [CP038613.1CP038621.1], Arsenophonus endosymbiont of Nilaparvata lugens strain Hangzhou [JRLH01000001.1], and Arsenophonus endosymbiont of Aleurodicus floccissimus [OUND01000000.1]), revealed that these reported hits actually corresponded to prophage regions in their genomes (PHASTER annotations available in https://doi.org/10.5281/zenodo.3764739, last accessed March 5, 2020). On closer inspection, we found that only one region in the chromosome of Arsenophonus nasoniae strain FIN contained both of these proteins. This region, as did others, preserved high sequence identity to one or a few APSE proteins. However, the regions were not in fact APSE phages: that is, they consistently showed a different gene order, insertion of proteins not characteristic of fully sequenced APSE phages, and/or pseudogenized proteins that are otherwise conserved across APSE analyzed in this study (including the DNA polymerase). Although there are currently no full genome sequences for the strains analyzed by Hansen et al. (2007) or Duron (2014), this genomic scan, along with the presence of putative toxins in Arsenophonus spp. closely related to those present in APSE, strongly suggests that Arsenophonus bacteria have indeed been historically infected by close relatives of the Hamiltonella–associated APSE phages, and that these phages have left an imprint in the bacterial genomes. The retention of RHS-repeat- and CdtB-APSE-like toxin genes in at least two different strains of Arsenophonus suggests that these proteins could in fact endow the Arsenophonus symbiont with a protective phenotype. This could in turn provide a positive “APSE-free” fitness effect of carrying the endosymbiont when confronted with the environmental pressure of parasitoid infection. So far Arsenophonus, while quite widespread and very diversified in insects (Nováková et al. 2009), has not been credited with conferring protection against parasitoids of its insect hosts.

Phylogenetic History of Hamiltonella and APSE

Phylogenetic analyses supported previous observations (Degnan and Moran 2007) that closely related Hamiltonella strains were harbored by distantly related aphid species (i.e., aphids belonging to distinct subfamilies) and hosted different APSE types (fig. 3). A noteworthy case is that of Hamiltonella strains ZA17 and 5D: They belong to two different clades but host nearly identical APSE phages. Conversely, the closely related Hamiltonella strain 5D and MI47 host rather different APSE types. APSE-7 is notably present in both Hamiltonella clade C and in the distantly related lineages of this symbiont hosted by the whitefly B. tabaci. This provides strong evidence and supports previous findings, based on single-gene phylogenies (Degnan and Moran 2007, 2008), of horizontal transfer of these viral entities across Hamiltonella strains and lineages. The horizontal transfer of phages across lineages of endosymbiotic bacteria has been suggested for Wolbachia, where horizontal transfers of the WO phage seem to occur across both related and divergent Wolbachia (Masui et al. 2000; Bordenstein 2004). This horizontal transfer could be facilitated by the coexistence of two divergent Hamiltonella lineages within the same host, similarly to what has been observed for Wolbachia (Sasaki et al. 2005; Raychoudhury and Werren 2012). Indeed, multiple symbiont species and strains of the same species can coexist within the same population of an aphid species (Sandström et al. 2001; Tsuchida et al. 2002; Haynes et al. 2003; Russell et al. 2013; Meseguer et al. 2017), offering an opportunity for coinfection and transfer or recombination of their phages. Through the reconstruction of a phylogenetic network using the concatenated single-copy shared proteins of APSE, we observe evidence for recombination across APSE types (fig. 3). This is a feature that has previously been observed for these phages (Degnan and Moran 2007) and is a general feature of phage genomes (reviewed by Dion et al. [2020]). In fact, only 18 out of 30 single-copy shared orthologous groups of proteins showed no significant evidence of intragenic recombination based on a test (supplementary table S4, Supplementary Material online). From these 18, nine showed significant evidence of recombination as judged by two additional tests implemented in PhiPack: NSS and Max . A concatenated nucleotide alignment of the nine putative nonrecombinant genes revealed significant intergenic recombination (P value = 0.00e + 00). These results suggest genome-wide recombination for APSE phages, which is expected to complicate the inference of phylogenetic relationships. Regardless, Bayesian phylogenies were reconstructed for the three sets of single-copy shared proteins and can be found in supplementary figure S3, Supplementary Material online. Given the rampant recombination of their genomes, it is therefore hard to conclude on the phylogenetic history of these viruses.
Fig. 3

Phylogenetic relations of Hamiltonella and APSE phages. (A) Dendrogram of phylogenetic relationships among Hamiltonella endosymbionts based on single-copy shared proteins. Numbers at nodes represent Bayesian posterior probabilities. “*” = 1. (B) Phylogenetic network of single-copy shared proteins as calculated by SplitsTree.

Phylogenetic relations of Hamiltonella and APSE phages. (A) Dendrogram of phylogenetic relationships among Hamiltonella endosymbionts based on single-copy shared proteins. Numbers at nodes represent Bayesian posterior probabilities. “*” = 1. (B) Phylogenetic network of single-copy shared proteins as calculated by SplitsTree. One common feature of all phylogenetic analyses was that the APSE phages infecting B. tabaci-associated Hamiltonella strains were consistently highly divergent from those of aphid-associated ones. Additionally, the SplitsTree network reconstruction suggests little recombination (relative to that observed among aphid-associated APSE) between these two groups of APSE infecting different hosts (aphids and B. tabaci). This is congruent with the Hamiltonella symbiont phylogeny which positions those infecting the whitefly B. tabaci as a separate sister clade to that made up of all currently available aphid-hosted ones (fig. 3). Previous experimental studies of Hamiltonella infections in Sitobion avenae aphids have shown that strains of this symbiont establish infections more easily when transferred from the same host species (Łukasik et al. 2015). Additionally, these infections were found to be more stable. The same study also found that infection success was higher when the transferred symbiont strain was more closely related to the one that was originally present in the host. All this strongly suggests that these two lineages infecting different host groups can follow separate evolutionary histories, possibly driven by both ecological factors and other barriers limiting the horizontal transfer of the two distinct symbiont lineages.

Conclusion

Facultative symbionts in insects can confer conditional fitness benefits to their hosts (Polin et al. 2015; Cass et al. 2016; Doremus et al. 2018), and Hamiltonella and its APSE-conferred defense against parasitoid wasps is one of the best studied cases (Oliver and Higashi 2019). The results presented in this study support previous studies reporting a variable toxin cassette across APSE as well as intragenic recombination. Additionally, the whole-genome sequencing, annotation, and phylogenetic analyses of three new APSE types conducted here have revealed important features of these protective entities: 1) their genomes are highly conserved in terms of gene content with most of the variation localized to the toxin cassette; 2) alterations can occur within a cassette, with pseudogenization events affecting the putative toxin genes (i.e., APSE-2.2a) or IS-mediated insertion of novel genes; 3) APSE harbored by whitefly-associated Hamiltonella seem to have undergone a separate evolutionary history, in line with their hosts; and 4) Arsenophonus spp. might have historically had association with APSE-like phages, with some of these having left an imprint in their genomes. This study provides the first genome-wide overview of the diversity of five APSE phage types hosted by Hamiltonella endosymbionts, improving our understanding of these phages. The features highlighted in this study will be of great value in the design and interpretation of future experiments of APSE-mediated Hamiltonella parasitoid defense. Click here for additional data file.
  72 in total

1.  An HMM posterior decoder for sequence feature prediction that includes homology information.

Authors:  Lukas Käll; Anders Krogh; Erik L L Sonnhammer
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

2.  Unipro UGENE: a unified bioinformatics toolkit.

Authors:  Konstantin Okonechnikov; Olga Golosova; Mikhail Fursov
Journal:  Bioinformatics       Date:  2012-02-24       Impact factor: 6.937

3.  SignalP 5.0 improves signal peptide predictions using deep neural networks.

Authors:  José Juan Almagro Armenteros; Konstantinos D Tsirigos; Casper Kaae Sønderby; Thomas Nordahl Petersen; Ole Winther; Søren Brunak; Gunnar von Heijne; Henrik Nielsen
Journal:  Nat Biotechnol       Date:  2019-02-18       Impact factor: 54.908

4.  Conditional fitness benefits of the Rickettsia bacterial symbiont in an insect pest.

Authors:  Bodil N Cass; Anna G Himler; Elizabeth C Bondy; Jacquelyn E Bergen; Sierra K Fung; Suzanne E Kelly; Martha S Hunter
Journal:  Oecologia       Date:  2015-09-16       Impact factor: 3.225

5.  Diversity and geographic distribution of secondary endosymbiotic bacteria in natural populations of the pea aphid, Acyrthosiphon pisum.

Authors:  Tsutomu Tsuchida; Ryuichi Koga; Harunobu Shibao; Tadao Matsumoto; Takema Fukatsu
Journal:  Mol Ecol       Date:  2002-10       Impact factor: 6.185

6.  Buchnera has changed flatmate but the repeated replacement of co-obligate symbionts is not associated with the ecological expansions of their aphid hosts.

Authors:  A S Meseguer; A Manzano-Marín; A Coeur d'Acier; A-L Clamens; M Godefroid; E Jousselin
Journal:  Mol Ecol       Date:  2016-12-19       Impact factor: 6.185

7.  Evidence for nucleic acid binding ability and nucleosome association of Bombyx mori nucleopolyhedrovirus BRO proteins.

Authors:  E A Zemskov; W Kang; S Maeda
Journal:  J Virol       Date:  2000-08       Impact factor: 5.103

8.  InterPro in 2019: improving coverage, classification and access to protein sequence annotations.

Authors:  Alex L Mitchell; Teresa K Attwood; Patricia C Babbitt; Matthias Blum; Peer Bork; Alan Bridge; Shoshana D Brown; Hsin-Yu Chang; Sara El-Gebali; Matthew I Fraser; Julian Gough; David R Haft; Hongzhan Huang; Ivica Letunic; Rodrigo Lopez; Aurélien Luciani; Fabio Madeira; Aron Marchler-Bauer; Huaiyu Mi; Darren A Natale; Marco Necci; Gift Nuka; Christine Orengo; Arun P Pandurangan; Typhaine Paysan-Lafosse; Sebastien Pesseat; Simon C Potter; Matloob A Qureshi; Neil D Rawlings; Nicole Redaschi; Lorna J Richardson; Catherine Rivoire; Gustavo A Salazar; Amaia Sangrador-Vegas; Christian J A Sigrist; Ian Sillitoe; Granger G Sutton; Narmada Thanki; Paul D Thomas; Silvio C E Tosatto; Siew-Yit Yong; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

9.  ModelTest-NG: A New and Scalable Tool for the Selection of DNA and Protein Evolutionary Models.

Authors:  Diego Darriba; David Posada; Alexey M Kozlov; Alexandros Stamatakis; Benoit Morel; Tomas Flouri
Journal:  Mol Biol Evol       Date:  2020-01-01       Impact factor: 16.240

10.  Serial horizontal transfer of vitamin-biosynthetic genes enables the establishment of new nutritional symbionts in aphids' di-symbiotic systems.

Authors:  Alejandro Manzano-Marı N; Armelle Coeur d'acier; Anne-Laure Clamens; Céline Orvain; Corinne Cruaud; Valérie Barbe; Emmanuelle Jousselin
Journal:  ISME J       Date:  2019-10-17       Impact factor: 10.302

View more
  6 in total

Review 1.  Diversity and function of arthropod endosymbiont toxins.

Authors:  Jonathan H Massey; Irene L G Newton
Journal:  Trends Microbiol       Date:  2021-07-09       Impact factor: 17.079

Review 2.  Genetic innovations in animal-microbe symbioses.

Authors:  Julie Perreau; Nancy A Moran
Journal:  Nat Rev Genet       Date:  2021-08-13       Impact factor: 59.581

Review 3.  Evolution in small steps and giant leaps.

Authors:  Noah K Whiteman
Journal:  Evolution       Date:  2022-01-29       Impact factor: 4.171

4.  Horizontal Transfer of Microbial Toxin Genes to Gall Midge Genomes.

Authors:  Kirsten I Verster; Rebecca L Tarnopol; Saron M Akalu; Noah K Whiteman
Journal:  Genome Biol Evol       Date:  2021-09-01       Impact factor: 3.416

5.  Evolutionary genomics of APSE: a tailed phage that lysogenically converts the bacterium Hamiltonella defensa into a heritable protective symbiont of aphids.

Authors:  Bret M Boyd; Germain Chevignon; Vilas Patel; Kerry M Oliver; Michael R Strand
Journal:  Virol J       Date:  2021-11-10       Impact factor: 4.099

6.  Bacteriophages Roam the Wheat Phyllosphere.

Authors:  Laura Milena Forero-Junco; Katrine Wacenius Skov Alanin; Amaru Miranda Djurhuus; Witold Kot; Alex Gobbi; Lars Hestbjerg Hansen
Journal:  Viruses       Date:  2022-01-26       Impact factor: 5.048

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.