| Literature DB >> 30692672 |
Audra E Devoto1, Joanne M Santini2, Matthew R Olm3, Karthik Anantharaman1,4, Patrick Munk5, Jenny Tung6, Elizabeth A Archie7, Peter J Turnbaugh8,9, Kimberley D Seed3,9, Ran Blekhman10, Frank M Aarestrup5, Brian C Thomas1, Jillian F Banfield11,12,13.
Abstract
Bacteriophages (phages) dramatically shape microbial community composition, redistribute nutrients via host lysis and drive evolution through horizontal gene transfer. Despite their importance, much remains to be learned about phages in the human microbiome. We investigated the gut microbiomes of humans from Bangladesh and Tanzania, two African baboon social groups and Danish pigs; many of these microbiomes contain phages belonging to a clade with genomes >540 kilobases in length, the largest yet reported in the human microbiome and close to the maximum size ever reported for phages. We refer to these as Lak phages. CRISPR spacer targeting indicates that Lak phages infect bacteria of the genus Prevotella. We manually curated to completion 15 distinct Lak phage genomes recovered from metagenomes. The genomes display several interesting features, including use of an alternative genetic code, large intergenic regions that are highly expressed and up to 35 putative transfer RNAs, some of which contain enigmatic introns. Different individuals have distinct phage genotypes, and shifts in variant frequencies over consecutive sampling days reflect changes in the relative abundance of phage subpopulations. Recent homologous recombination has resulted in extensive genome admixture of nine baboon Lak phage populations. We infer that Lak phages are widespread in gut communities that contain the Prevotella species, and conclude that megaphages, with fascinating and underexplored biology, may be common but largely overlooked components of human and animal gut microbiomes.Entities:
Mesh:
Year: 2019 PMID: 30692672 PMCID: PMC6784885 DOI: 10.1038/s41564-018-0338-9
Source DB: PubMed Journal: Nat Microbiol ISSN: 2058-5276 Impact factor: 30.964
Complete megaphage genomes (see also Supplementary Table 1)
| Phage | Sample of origin | Guanine-cytosine, % | Length (bp) | No. tRNAs | No. tRNA introns | No. predicted open reading frames (code 15) |
|---|---|---|---|---|---|---|
| A1-i | As cohort 22, no. 2 | 25.9 | 541,643 | 33 | 3 | 581 |
| A1-ii | As cohort 22, no. 3 | 25.9 | 541,664 | 33 | 3 | 581 |
| A1-iii | As cohort 22, no. 4 | 25.9 | 541,664 | 33 | 3 | 584 |
| A1-iv | As cohort 22, no. 5 | 25.9 | 541,664 | 33 | 3 | 581 |
| A2 | As cohort 20, no. 3 | 26.0 | 541,299 | 34 | 4 | 581 |
| C1 | Cholera CH_A02_001D1 | 25.8 | 540,217 | 32 | 2a | 591 |
| B1 | Baboon F22 (V) | 26.0 | 547,991 | 30 | 1 | 591 |
| B2 | Baboon F3 (V) | 26.0 | 549,839 | 31 | 1 | 594 |
| B3 | Baboon M09 (V) | 26.0 | 546,746 | 30 | 1 | 590 |
| B4 | Baboon F30 (V) | 26.0 | 550,552 | 31 | 1 | 594 |
| B5 | Baboon F18 (V) | 26.7 | 543,529 | 31 | 1 | 583 |
| B6 | Baboon F16 (V) | 25.8 | 546,689 | 30 | 1 | 588 |
| B7 | Baboon F11 (M) | 26.0 | 550,702 | 31 | 1 | 599 |
| B8 | Baboon F4 (V) | 26.0 | 551,627 | 31 | 1 | 600 |
| B9 | Baboon F01 (V) | 26.0 | 550,053 | 30 | 1 | 593 |
aVariants within incomplete genomes have tRNA introns not found in the C1 genome. Baboons are from two social groups, V (Viola’s) and M (Mica’s) (see Tungi et al.[16]).
Fig. 1Alignment of the CRISPR arrays on four Prevotella scaffolds containing repeat GGTTTAATCGTACCTTTATGGAATTGAAAT.
The green rods indicate repeats, the coloured rods indicate spacers. The same colour indicates the same spacer sequence, except for black rods, which indicate spacers different between individuals 26 and 28 (probably added to the diversifying locus ends). The red arrows indicate spacers targeting megaphages (also see Supplementary Fig. 3).
Fig. 2Genomic features and variation in Lak phage genomes.
a, Frequency of use of the TAG repurposed stop codon overlying the alignment of the A1 and A2 genomes. The red bars indicate regions >5 kb without TAG codon use. Structural proteins (dark blue) are mostly encoded in regions with high TAG use. CDS, coding sequence. The boxed region is shown in detail in b. b, Alignments of each distinct Lak phage genome against the A1 genome. A subset of this region was used in the pig Lak phage genome fragment alignments (Supplementary Fig. 6).
Fig. 3Comparison of B-Lak phage genomes reveals identical sequence blocks in a subset of the B-Lak phage.
a, Sequence variation in a ~20 kb region of the aligned B-Lak phage genomes with B1 as the reference. The coloured bars underline blocks with a shared sequence. Note evidence of admixture of sequence blocks, indicative of extensive homologous recombination among phages sampled from individual baboons. For the full alignment of the nine complete B-Lak phage genomes, see Supplementary Fig. 13. The box indicates the B9 region examined in detail (Supplementary Fig. 14). b, Relatively conserved and divergent regions in A1, A2 and C1. The open box indicates the region shown in a.
Fig. 4Alignment of sequences with a possible tRNA Thr (TGT).
Some tRNAs are predicted to contain an intron that itself may encode a tRNA (dashed box with superimposed secondary structure). With the exception of the Hadza sequence, all introns are identical.