| Literature DB >> 34758862 |
Bret M Boyd1,2, Germain Chevignon3, Vilas Patel4, Kerry M Oliver4, Michael R Strand5.
Abstract
BACKGROUND: Most phages infect free-living bacteria but a few have been identified that infect heritable symbionts of insects or other eukaryotes. Heritable symbionts are usually specialized and isolated from other bacteria with little known about the origins of associated phages. Hamiltonella defensa is a heritable bacterial symbiont of aphids that is usually infected by a tailed, double-stranded DNA phage named APSE.Entities:
Keywords: Aphid; Bacteria; Mutualism; Parasitoid; Virus
Mesh:
Year: 2021 PMID: 34758862 PMCID: PMC8579659 DOI: 10.1186/s12985-021-01685-y
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
APSE haplotypes with fully sequenced genomes examined in this study
| Haplotype | Bacterial host | Insect host | Toxin | Data type | Assembly type | References | NCBI identifier |
|---|---|---|---|---|---|---|---|
| APSE1 | Phage genome | Whole genome | [ | NC_000935.1 | |||
| APSE3 AS3 | Contig/scaffold | Whole genome | [ | NZ_CP017610.1 | |||
| APSE2 5AT | Contig/scaffold | Whole genome | [ | NZ_CP001277.1 | |||
| APSE2 NY26 | Contig/scaffold | Whole genome | [ | NZ_CP017605.1 | |||
| APSE8 ZA17 | Contig/scaffold | Whole genome | [ | NZ_CP17613.1 | |||
| APSE MEAM | Contig/scaffold | Whole genome | [ | CCVN0000000 | |||
| APSE MED | Contig/scaffold | Whole genome | [ | GCF_000258345.2 | |||
| APSE1 MI47 | Contig/scaffold | Whole genome | This study | NZ_CP022932.1 | |||
| APSE9 MI12 | Contig/scaffold | Whole genome | This study | CP023987.1 | |||
| APSE8 5D | Contig/scaffold | Whole genome | This study | NZ_CP021663.1 |
Fig. 1Genome alignment of sequenced APSE haplotypes. The upper part of the figure schematically shows the genome for each haplotype with total size in base pairs (bp) indicated to the left, the boundaries for each module indicated at the top and the boundary for the two predicted functional units indicated at the bottom. Arrows identify predicted genes and their orientation on the positive (right) or negative strand (left) and color (yellow, blue, red, lavender) indicating module assignment. Predicted functions for each gene are summarized in Table 2. The lower part of the figure illustrates the boundaries for the attL and attR sites for each haplotype. The purple bars indicate the left and right boundaries for the H. defensa tRNA-Arg gene, the black bars correspond to the position of the anticodon in the tRNA-loop, the light blue bars correspond to the H. defensa chromosome as identified by alignment to the H. defensa A2C strain, and the dark blue bars correspond to the left and right boundaries for the integrated APSE genome
Best hits identified to APSE3 AS3 coding sequences in other sequenced viruses or bacteria
| Module | Locus | Alternative locus | Protein | Translation | Translation | BLASTP target: sequenced | Viral | Amino acid | BLASTP Target: sequenced γ-proteobacteria | Amino acid | BLASTP Target: all other | Amino acid |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Tag | Tag | Description | Start | Stop | viruses | family | Identity | That are insect symbionts | Identity | sequenced γ-proteobacteria | Identity | |
| 1 | P38 | APACPISMAS3_01 | Integrase | 1243 | 71 | Proteus phage NV18 | Podoviridae | 76.68 | 96.15 | 73.78 | ||
| 1 | P40 | APACPISMAS3_02 | Excisionase | 1479 | 1099 | Salmonella phage epsilon34 | Podoviridae | 44.72 | 25 | 60 | ||
| 1 | P41 | APACPISMAS3_03 | Dead-box helicase | 2993 | 1611 | Yersinia phage YeP4 | Podoviridae | 66.09 | 93.9 | 81.4 | ||
| 1 | P42 | APACPISMAS3_04 | DNA binding protein Roi | 3744 | 3013 | Cronobacter phage ENT47670 | Siphoviridae | 40.16 | 33.33 | 54.58 | ||
| 1 | P43 | APACPISMAS3_05 | Antirepressor | 4591 | 3809 | Salmonella phage SPN3UB | Siphoviridae | 36.78 | 87.31 | 48.47 | ||
| 1 | P44 | APACPISMAS3_08 | Nuclease | 5166 | 4885 | Xylella phage Xfas53 | Podoviridae | 43.01 | 95.7 | 78.65 | ||
| 1 | P45 | APACPISMAS3_10 | DNA polymerase I | 7464 | 5401 | Yersinia phage YeP4 | Podoviridae | 62.64 | 92.87 | 76.79 | ||
| 1 | P46 | APACPISMAS3_11 | Transcriptional regulator | 7809 | 7477 | Escherichia phage D6 | Myoviridae | 30.53 | 90 | 72.46 | ||
| 1 | P47 | APACPISMAS3_12 | Phage protein | 7900 | 8394 | Xylella phage Xfas53 | Podoviridae | 37.3 | 76.22 | 46.76 | ||
| 1 | P49 | APACPISMAS3_14 | Phage protein | 8685 | 8413 | NA | NA | NA | 76.47 | NA | NA | |
| 1 | P50 | APACPISMAS3_15 | Phage protein | 9308 | 8754 | Yersinia phage YeP4 | Podoviridae | 59.02 | 90.76 | 68.31 | ||
| 1 | P51 | APACPISMAS3_16 | Phage protein | 10,622 | 9324 | Yersinia phage YeP4 | Podoviridae | 54.93 | 89.58 | 62.3 | ||
| 1 | P53 | APACPISMAS3_17 | Phage protein | 11,538 | 10,615 | Yersinia phage YeP4 | Podoviridae | 34.05 | 75.16 | 40.6 | ||
| 2 | P1 | APACPISMAS3_18 | Repressor | 12,804 | 12,106 | Yersinia phage YeP4 | Podoviridae | 47.11 | 89.66 | 50.22 | ||
| 2 | P2 | N/A | Transcriptional regulator | 12,900 | 13,184 | Yersinia phage YeP4 | Podoviridae | 57.89 | 80.85 | 42.05 | ||
| 2 | P3 | APACPISMAS3_20 | ATPase | 13,188 | 15,461 | Yersinia phage YeP4 | Podoviridae | 67.89 | 91.15 | 70.13 | ||
| 2 | P5 | APACPISMAS3_21 | Antitermination protein Q | 15,889 | 16,335 | Stx2a-converting phage Stx2 | Siphoviridae | 61.67 | 95.95 | 58.22 | ||
| 3 | Toxin | APACPISMAS3_23* | YD-repeat protein | 17,764 | 22,185 | NA | NA | NA | 65.77 | 56.18 | ||
| 3 | Toxin | APACPISM5AT_28* | 19,883 | 20,872 | Enterobacteria phage cdtl | Siphoviridae | 27.91 | 48.6 | 29.91 | |||
| 3 | Toxin | APACPISM1_23* | 5398 | 6498 | NA | NA | NA | NA | NA | 23.58 | ||
| 3 | K | APACPISMAS3_24 | Group II holin | 22,837 | 23,028 | Morganella phage IME1369_02 | Siphoviridae | 34.29 | 75.44 | 52.63 | ||
| 3 | F | APACPISMAS3_25 | Lysozyme | 23,018 | 23,491 | Salmonella virus BTP1 | Podoviridae | 52.15 | 85.99 | 56.34 | ||
| 3 | P14 | APACPISMAS3_26* | Exported protein | 23,487 | 23,882 | Enterobacteria phage SfV | Myoviridae | 12.5 | 88.2 | 32.82 | ||
| 3 | P16 | N/A | Endolysin | 23,911 | 24,030 | Klebsiella phage ST16-OXA48phi5.4 | Myoviridae | 48.72 | 97.44 | 70.27 | ||
| 4 | P17 | APACPISMAS3_27* | Terminase, small | 24,104 | 24,517 | Providencia phage PSTNGR2lys | Siphoviridae | 79.05 | 89.71 | 80.95 | ||
| 4 | P18 | APACPISMAS3_28 | Terminase, large | 24,534 | 25,934 | Shigella virus Sf6 | Podoviridae | 38.7 | 91.24 | 54.2 | ||
| 4 | P19 | APACPISMAS3_29 | Portal protein | 25,940 | 28,105 | Shigella virus Sf6 | Podoviridae | 70.9 | 88.97 | 87.99 | ||
| 4 | P23 | APACPISMAS3_30 | Scaffolding protein | 28,156 | 29,052 | Shigella virus Sf6 | Podoviridae | 58.94 | 79.53 | 72.24 | ||
| 4 | P24 | APACPISMAS3_31 | Major capsid protein | 29,063 | 30,334 | Salmonella virus HK620 | Podoviridae | 79.2 | 93.6 | 85.07 | ||
| 4 | P27 | APACPISMAS3_33 | DNA stabilization protein | 30,554 | 31,036 | Salmonella virus HK620 | Podoviridae | 58.13 | 82.5 | 77.78 | ||
| 4 | P28 | APACPISMAS3_34 | DNA stabilization protein | 31,008 | 32,423 | Salmonella phage SPN9CC | Podoviridae | 60.81 | 83.65 | 63.56 | ||
| 4 | P30 | APACPISMAS3_35 | DNA stabilization protein | 32,423 | 32,779 | Proteus phage NV18 | Podoviridae | 38.79 | 49.38 | 42.86 | ||
| 4 | P31 | APACPISMAS3_36 | Hypothetical protein | 32,779 | 33,246 | Proteus phage NV18 | Podoviridae | 36.3 | 84.52 | 35.95 | ||
| 4 | P32 | APACPISMAS3_37 | DNA transfer protein | 33,224 | 33,853 | Escherichia phage vB EcoP Kapi1 | Podoviridae | 63.24 | 82.16 | 63.55 | ||
| 4 | P33 | APACPISMAS3_38 | DNA transfer protein | 33,866 | 35,254 | Salmonella virus BTP1 | Podoviridae | 33.13 | 83.69 | 65.34 | ||
| 4 | P35 | APACPISMAS3_39 | DNA transfer protein | 35,254 | 37,131 | Salmonella virus P22 | Podoviridae | 64.03 | 87.28 | 68.43 | ||
| 4 | P36 | APACPISMAS3_40 | Tail fiber protein | 37,160 | 38,179 | Proteus phage NV18 | Podoviridae | 72.27 | 67.34 | 74.36 | ||
| 4 | P37 | APACPISMAS3_41* | Tail fiber assembly protein | 38,158 | 38,625 | Shigella phage SfIV | Myoviridae | 32.37 | 72.26 | 50.34 |
Each predicted coding sequence in the APSE3 AS3 genome, along with stxB and cdtB genes from APSE1 and APSE8 5AT, were used to search using BLASTp all submitted genomes for Caudoviruses and γ-proteobacteria that are either insect symbionts or not insect symbionts
*Denotes differences in predicted transcriptional start and stop exist between annotations of APSE
Fig. 2Comparison of the APSE3 AS3 genome to: A Salmonella virus HK620 and Shigella phage Sf6, B Xylella phage Xfas53 and Salmonella enterica phage P22, and C Burkholderia phage BcepC6B and Bordella phage BPP1. For APSE3, color-coded arrows indicate orientation of predicted coding sequences and module assignment as shown in Fig. 1, while coding sequences for the other phages are indicated by light blue arrows. Shaded bars connecting linear genomes define similar regions with the scale bar shown in the lower right of the figure defining TBLASTX identity
Fig. 3Comparison of the APSE2 5AT or APSE3 AS3 genome to phage elements present in the genomes of: A Arsenophonus spp. endosymbiont in of Bemesia tabacii, B Arsenophonus endosymbiont in the whitefly Aleurodicus floccissimus, C Arsenophonus nasoniae present in the wasp Nasonia vitripennis, D Arsenophonus spp. ENCA and Arsenophonus triatominarum, and E Sodalis glossinidius. Predicted coding sequences and shaded bars connecting APSE genomes to identified phage elements are defined as described in Fig. 2. Annotations were not available for Arsenophonus triatominarum
Fig. 4Comparison of the APSE3 AS3 genome to phage elements present in the genomes of: A Morganella morganii and Providencia alcalifaciens, B Morganella morganii and Proteus mirabilis, C Providencia rettgeri, and Providencia alcalifaciens, and D Providencia sneebia. Predicted coding sequences and shaded bars connecting APSE genomes to identified phage elements are defined as described in Fig. 2
Fig. 5Maximum-likelihood phylograms depicting evolutionary relationships for two genes in module 1 (p41 and p45) and two genes in module 4 (p19 and p24) of APSEs, other phages and other phage elements from bacteria in the order Enterobacteriales. Numbers at nodes indicate percent of 1000 bootstrap replicates that recovered the same node. For each gene, tip labels indicate the APSE haplotype, sequenced phage, or bacterium containing a prophage element in which the ortholog resided. Scale bars indicates nucleotide substitutions per site. Tick marks indicate branches lengths at the root have been reduced