| Literature DB >> 35056644 |
Colin Buttimer1, Francesca Bottacini2, Andrey N Shkoporov1, Lorraine A Draper1, Paul Ross1, Colin Hill1,3.
Abstract
Eggerthella lenta is an anaerobic, high GC, Gram-positive bacillus commonly found in the human digestive tract that belongs to the class Coriobacteriia of the phylum Actinobacteria. This species has been of increasing interest as an important player in the metabolism of xenobiotics and dietary compounds. However, little is known regarding its susceptibility to bacteriophage predation and how this may influence its fitness. Here, we report the isolation of seven novel E. lenta strains using cefotaxime and ceftriaxone as selective agents. We conducted comparative and pangenome analyses of these strains and those publicly available to investigate the diversity of prophages associated with this species. Prophage gene products represent a minimum of 5.8% of the E. lenta pangenome, comprising at least ten distantly related prophage clades that display limited homology to currently known bacteriophages. All clades possess genes implicated in virion structure, lysis, lysogeny and, to a limited extent, DNA replication. Some prophages utilise tyrosine recombinases and diversity generating retroelements to generate phase variation among targeted genes. The prophages have differing levels of sensitivity to the CRISPR/cas systems of their hosts, with spacers from 44 E. lenta isolates found to target only five out of the ten identified prophage clades. Furthermore, using a PCR-based approach targeting the prophage attP site, we were able to determine that several of these elements can excise from the host chromosome, thus supporting the notion that these are active prophages. The findings of this study provide further insights into the diversity of prophages infecting species of the phylum Actinobacteria.Entities:
Keywords: Actinobacteria; Eggerthella lenta; bacterial isolation; diversity generating retroelements; prophage diversity
Year: 2022 PMID: 35056644 PMCID: PMC8778435 DOI: 10.3390/microorganisms10010195
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Accession numbers (DDBJ/ENA/GenBank) for genomes/plasmids of E. lenta isolated in this study.
| No. | Isolate | Accession (Genome; Plasmids) |
|---|---|---|
| 1 | APC055-529-1D | CP089331; CP089332 |
| 2 | APC055-539-5C | CP089333 |
| 3 | APC055-920-1E | CP089334 |
| 4 | APC055-924-7B | CP089335 |
| 5 | APC055-928-H3-3 | CP089336 |
| 6 | APC055-943-4 | CP089337 |
| 7 | APC-F2-3 | CP089338; CP089339, CP089340 |
| 8 | DSM2243_R211 | JAJQIW000000000 |
| 9 | DSM2243_R611 | JAJQIX000000000 |
| 10 | DSM2243_R1011 | JAJQKW000000000 |
Figure 1Dot plot illustrating whole genome alignment (MUMmer) of seven E. lenta strains isolated in this study (denoted with the APC prefix) and those publicly available (C592 and DSM2243) with the genome of the type strain DSM2243.
Figure 2(A) Distribution of genes in the pangenome among each of the examined 57 genomes of cultivated E. lenta isolates. (B) Analysis of dereplicated genomes of E. lenta (n = 57) shows the genus possesses a pan and core genome of 8273 and 1425 genes, respectively. (C) OGs of the pangenome (core, accessory and unique genes) of 57 E. lenta isolates and their functional assignment using the COG database, the fraction of OGs assigned no function is excluded from illustration.
General details of identified E. lenta prophage genomes among host genomes.
| No. | Prophage | Host (Contig Accession) | Left Boundary | Right Boundary | Size (bp) | GC% | Integration Locus (Locus Tag) Relative to Strains DSM2243 or C529 | Clade |
|---|---|---|---|---|---|---|---|---|
| 1 | 14Aphi1 | 14A (NZ_PPUR01000006.1) | 29,937 | 71,301 | 41,365 | 67 | tRNA-Leu (ELEN_RS15020) | 1 |
| 2 | MGYG-HGUT-02544phi1 | MGYG-HGUT-02544 (NZ_CABMOO010000006.1) | 29,956 | 71,302 | 41,347 | 67 | tRNA-Leu (ELEN_RS15020) | 1 |
| 3 | AB8_2phi1 | AB8 #2 (NZ_PPUJ01000004.1) | 191,735 | end of contig | >38,510 | 67 | tRNA-Leu (ELEN_RS15020) | 1 |
| 4 | APC055-529-1Dphi1 | APC055-529-1D (CP089331) | 3,240,976 | 3,282,082 | 41,106 | 67 | tRNA-Leu (ELEN_RS15020) | 1 |
| 5 | ResAG49phi1 | ResAG49 (NZ_WPON01000034.1) | full contig | full contig | >30,988 | 67 | unknown | 1 |
| 6 | MR1_12phi1 | MR1_#12 (NZ_PPTX01000022.1) | 3020 | 45,178 | 42,159 | 67 | tRNA-Leu (ELEN_RS02880) | 1 |
| 7 | 1001095IJ_161003_H5phi1 | 1001095IJ_161003_H5 (JADMUV010000007.1) | 137,112 | 178,204 | 41,092 | 67 | tRNA-Leu (ELEN_RS02880) | 1 |
| 8 | BSD2780120875_150330_C12phi1 | BSD2780120875_150330_C12 (JADMOT010000003.1) | 209,825 | 251,107 | 41,282 | 66 | tRNA-Leu (ELEN_RS15020) | 1 |
| 9 | Valenciaphi2 | Valencia (NZ_PPTV01000006.1) | 55,915 | 93,457 | 37,543 | 69 | tRNA-Ser (ELEN_RS00500) | 2 |
| 10 | 32-6-I_6_NAphi2 | 32-6-I_6_NA (NZ_PPUM01000011.1) | 26,284 | 63,686 | 37,403 | 69 | tRNA-Ser (ELEN_RS00500) | 2 |
| 11 | DSM15644phi3 | DSM15644 (NZ_PPUB01000019.1) | 32,924 | 69,743 | 36,820 | 67 | tRNA-Ala (ELEN_RS00055) | 3 |
| 12 | DSM15644phi4 | DSM15644 (NZ_PPUB01000037.1) | full contig | full contig | >36,317 | 63 | unknown | 4 |
| 13 | DSM2243phi4 | DSM2243 (NC_013204) | 3,031,719 | 3,068,586 | 36,618 | 63 | tRNA-Ala (C592_00654) | 4 |
| 14 | CC82_BHI2phi5 | CC82_BHI2 (NZ_PPUF01000005.1) | 22,245 | 56,042 | 33,798 | 65 | hypothetical protein (ELEN_RS14245) | 5 |
| 15 | C592phi5 | C592 (NZ_CP021140) | 500,864 | 534,671 | 33,808 | 65 | hypothetical protein (ELEN_RS14245) | 5 |
| 16 | J1101437_171009_A1phi5 | J1101437_171009_A1 (JADNJK010000003.1) | 299,175 | 333,022 | 33,847 | 65 | hypothetical protein (ELEN_RS14245) | 5 |
| 17 | J1101653_170612_H2phi5 | J1101653_170612_H2 (JADPDY010000015.1) | 27,299 | 61,125 | 33,826 | 65 | hypothetical protein (ELEN_RS14245) | 5 |
| 18 | 1-1-60AFAAphi6 | 1-1-60AFAA (NZ_KN214093.1) | 487,345 | 528,047 | 40,703 | 64 | unknown | 6 |
| 19 | APC055-529-1Dphi7 | APC055-529-1D (CP089331) | 3,337,687 | 3,371,357 | 33,670 | 59 | tRNA-Arg (ELEN_RS15245) | 7 |
| 20 | APC055-924-7Bphi7 | APC055-924-7B (CP089335) | 3,608,848 | 3,642,543 | 33,695 | 59 | tRNA-Arg (ELEN_RS15245) | 7 |
| 21 | 14Aphi7 | 14A (NZ_PPUR01000011.1) | 37,521 | 71,198 | 33,678 | 59 | tRNA-Arg (ELEN_RS15245) | 7 |
| 22 | MGYG-HGUT-02544phi7 | MGYG-HGUT-02544 (NZ_CABMOO010000011.1) | 37,518 | 71,195 | 33,678 | 59 | tRNA-Arg (ELEN_RS15245) | 7 |
| 23 | APC055-920-1Ephi7 | APC055-920-1E (CP089334) | 3,189,344 | 3,223,015 | 33,671 | 59 | tRNA-Arg (ELEN_RS15245) | 7 |
| 24 | MR1-F37phi7 | MRI-F37 (NZ_WPOI01000001.1) | 69,802 | 103,502 | 33,701 | 59 | tRNA-Arg (ELEN_RS15245) | 7 |
| 25 | MRI-F36phi7 | MRI-F36 (NZ_WPOJ01000009.1) | 49,797 | 83,505 | 33,709 | 59 | tRNA-Arg (ELEN_RS15245) | 7 |
| 26 | 32-6-1-6_NAphi7 | 32-6-I_6_NA (NZ_PPUM01000009.1) | 46,614 | 46,614 | 33,684 | 59 | tRNA-Arg (ELEN_RS15245) | 7 |
| 27 | MRI-F40phi7 | MRI-F40 (NZ_WPOH01000001.1) | 69,804 | 103,501 | 33,448 | 59 | tRNA-Arg (ELEN_RS15245) | 7 |
| 28 | J1100102_180507_G10phi7 | J1100102_180507_G10 (JADOZP010000001.1) | 109,609 | 145,440 | 35,831 | 58 | tRNA-Arg (ELEN_RS15245) | 7 |
| 29 | SECO-MT75m2phi8 | SECO-MT75m2 (NZ_VEVP01000036.1) | 3653 | end of contig | >28,901 | 63 | unknown | 8 |
| 30 | TF05-26B-4phi9 | TF05-26B-4 | 16,687 | end of contig | >31,589 | 64 | unknown | 9 |
| 31 | J1100102_180507_G10phi9 | J1100102_180507_G10 | 10,288 | 46,514 | 36,226 | 65 | unknown | 9 |
| 32 | 1001302B_160321_A1phi9 | 1001302B_160321_A1 | 165,588 | 202,064 | 36,476 | 65 | unknown | 9 |
| 33 | 1001302B_160321_A1phi10 | 1001302B_160321_A1 | 220,405 | 252,147 | 31,742 | 58 | unknown | 10 |
Figure 3BLASTn heat map generated using Gegenees with accurate parameters—fragment length: 200 bp; step size: 100 bp; threshold: 0%. The map includes 33 identified prophage genomes of E. lenta, members of the ten different prophage clades are highlighted.
Putative insertion sites among clade 1 E. lenta prophages.
| Prophage | Predicted attP-Site | attB Relative to DSM2243 (Locus) |
|---|---|---|
| 14Aphi1 | CAACCCCATGGAGGTTCAAGTCCTCTCGCCCGCACCATCTGAA | tRNA-Leu (ELEN_RS15020) |
| MGYG-HGUT-02544phi1 | AACCCCATGGAGGTTCAAGTCCTCTCGCCCGCACCATCTGAA | tRNA-Leu (ELEN_RS15020) |
| APC055-529-1Dphi1 | TTCAGATGGTGCGGGCGAGAGGACTTGAACCTCCATGGGGTT | tRNA-Leu (ELEN_RS15020) |
| 1001095IJ_161003_H5phi1 | ACTTAAAATCTTCCGGCTTCGGCCTTGCGGGTTCGAGTCCCGCCGCCCCTACCA | tRNA-Leu (ELEN_RS02880) |
| BSD2780120875_150330_C12phi1 | TTCAGATGGTGCGGGCGAGAGGACTTGAACCTCCATGGGGTT | tRNA-Leu (ELEN_RS15020) |
Figure 4(A) Heatmap showing the distribution (presence–absence of OGs among prophages of E. lenta. (B) Heatmap illustrating genes detected among E. lenta prophages found with a particular function.
Figure 5Characterisation of Rin shufflon among E. lenta prophages. (A) Locus map showing the alignment of the Rin shufflon among prophages 16 phi1 and 1001095IJ_161003_H5phi1, genes (arrows) relative to the direction of transcription and labelled accordingly to function. Brown triangles indicate repeat regions and shaded regions represent the degree of sequence homology in BLASTn alignments. (B) Unrooted maximum likelihood tree of invertases and intergrases among E. lenta clade 1 prophages.
Figure 6(A) Locus map of the architecture of the DGR element found among different prophage types of E. lenta, showing genes (arrows in orange) and sequence repeat regions (arrows/arrow segments in yellow) relative to the direction of transcription and labelled accordingly to predicted function. (B) Alignment of a VR segment of the mtd gene forming part of the DGR element of a clade 4 prophage associated with strain DSM2243, to isolates of the bacterium (R211, R611 and R1011) grown after fermentation in a chemostat (24 h as batch, then 72 h as continuous) illustrating locations of nucleotide substitutions.
Figure 7Network-based analysis of shared protein clusters with 33 E. lenta prophages with 12,892 phages visualised with Cytoscape. Each node represents a phage genome, lines (edges) represent the strength of connectivity (edge weight) between genomes. (A) Overview of network with nodes coloured to highlight family to which a phage belongs and a close-in view of the network of where E. lenta prophages are situated with nodes coloured to highlight (B) phage genus or (C) or host bacterium genus.
Figure 8Heatmap illustrating spacer sequence hits among the CRISPR array of different E. lenta isolates (y-axis) and E. lenta prophage clades (x-axis) with the number of representatives for each clade identified in brackets. The analysis utilised 283 spacers from CRISPR arrays of 44 E. lenta strains identified to target prophages of the species.
Figure 9Phylogram constructed from concatenated core genes as predicted by ROARY with protein alignment using PRANK and tree constructed by FASTTree; bootstrap branch support indicated with values from 0 to 1. The table indicates the presence or absence of representatives of a particular prophage clade integrated into the genome of the bacterial isolate described.