| Literature DB >> 34386733 |
Marco A Crisci1, Lin-Xing Chen2,3, Audra E Devoto3, Adair L Borges3, Nicola Bordin1, Rohan Sachdeva2,3, Adrian Tett4, Allison M Sharrar3, Nicola Segata4, Francesco Debenedetti5, Mick Bailey5, Rachel Burt5, Rhiannon M Wood6, Lewis J Rowden7, Paula M Corsini8, Steven van Winden9, Mark A Holmes6, Shufei Lei3, Jillian F Banfield2,3,10, Joanne M Santini1.
Abstract
Lak phages with alternatively coded ∼540 kbp genomes were recently reported to replicate in Prevotella in microbiomes of humans that consume a non-Western diet, baboons, and pigs. Here, we explore Lak phage diversity and broader distribution using diagnostic polymerase chain reaction and genome-resolved metagenomics. Lak phages were detected in 13 animal types, including reptiles, and are particularly prevalent in pigs. Tracking Lak through the pig gastrointestinal tract revealed significant enrichment in the hindgut compared to the foregut. We reconstructed 34 new Lak genomes, including six curated complete genomes, all of which are alternatively coded. An anomalously large (∼660 kbp) complete genome reconstructed for the most deeply branched Lak from a horse microbiome is also alternatively coded. From the Lak genomes, we identified proteins associated with specific animal species; notably, most have no functional predictions. The presence of closely related Lak phages in diverse animals indicates facile distribution coupled to host-specific adaptation.Entities:
Keywords: Microbiome; Omics; Virology
Year: 2021 PMID: 34386733 PMCID: PMC8346664 DOI: 10.1016/j.isci.2021.102875
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Lak phages detected in various animal microbiome samples by PCR
| Animal | Sample type | Details | PCR positive |
|---|---|---|---|
| Cow ( | Rumen Fluid | 1 Individual, female, ~10 years | 1/3 |
| Warthog ( | Feces | Group of 2, pooled | 1/3 |
| White-naped mangabey ( | Feces | Group of 7, pooled | 1/3 |
| Galapagos giant tortoise ( | Feces | Adult group of 3 and juvenile group of 3, pooled separately | 2/2 |
| Fallow deer ( | Feces | 1 Individual, wild | 1/1 |
| (Epsom) horse ( | Feces | 6 individuals over 3 days, various sexes and ages | 12/18 |
| (Hersham) horse ( | Feces | 1 Individual, female, ~20 years | 1/1 |
| (Royal Veterinary College) horse ( | Feces | 3 Individuals, various sexes and ages | 3/3 |
| (Cambridge) Pig ( | Feces | 19 Individuals, various sexes and ages | 12/19 |
| (Bristol) Pig ( | Lumen digesta and mucosal scrapings | 6 Individuals, finisher pigs, various sexes | 70/70 |
| (Bristol) Pig ( | Vaginal and rectal samples | 5 Individuals, finisher pigs, female | 8/10 |
| (Royal Veterinary College) Pig ( | Feces | Various groups, pooled | 5/5 |
A subset of PCR products from each cohort were sequenced to confirm the presence of Lak. See details in Tables S1 and S2.
Post-mortem pig samples from: foregut (jejunum and ileum) and hindgut (cecum, proximal spiral, distal spiral and rectum).
Lak was detected in all rectums (n = 5) and 3/5 vaginal mucosa, but not in lungs of the same animals.
Figure 1Lak phage and Prevotella abundance differs across the pig gastrointestinal tract
(A) Schematic of pig GIT with labels indicating the sites sampled: Blue labels = foregut, Red labels = hindgut (main sites of microbial fiber fermentation). For both (B) and (C), Lak phage major capsid and Prevotella 16S rRNA gene copy numbers determined by absolute quantification qPCR, with 10 ng pooled DNA from each GIT site from 6 finisher pigs; for all sites except ileal lumens, where digesta was only present in 4/6 pigs. Top and bottom whiskers = minimum and maximum values. Box width = Interquartile range (IQR). Significant differences in Lak, Prevotella and Lak: Prevotella ratio means were determined by Tukey's HSD test.
(B) Difference in Lak phage abundance across pig lumen and mucosal sites coincides with Prevotella 16S rRNA gene abundance. Solid green and pink lines represent differences in abundance deemed statistically significant (P < 0.001). Standard errors ranged from 0.17–0.21 (Lak), and 0.28–0.34 (Prevotella).
(C) Difference in ratios of Lak phage to Prevotella 16S rRNA gene copies (P < 0.05). Standard errors ranged from 0.28–0.35.
Newly reconstructed Lak phage genomes reveal expanded size range and genomic diversity
| Animal source | Microbiome type | No. of genomes/complete genomes | No. of scaffolds/genome length | GC (%) |
|---|---|---|---|---|
| Human | International Human Microbiomes – fecal samples from China | 3/0 | 2–18/445–540 kbp | ~26 |
| International Human Microbiomes – fecal samples from Denmark | 2/0 | 2–17/478–537 kbp | ~26 | |
| International Human Microbiomes – fecal samples from Spain | 9/0 | 4–20/408–528 kbp | ~26 | |
| Human Gut Microbiome – fecal samples from China, Israel, Italy, Liberia | 4/0 | 1–8/499–544 kbp | ~26 | |
| Baboon | Yellow Baboon fecal samples | 3/0 | 1–6/544–546 kbp | ~26 |
| Olive Baboon fecal samples | 4/0 | 3–30/537–545 kbp | ~26 | |
| Pig | Fecal | 4/3 | 1–26/517–541 kbp | ~26 |
| Fecal or | 4/2 | 1–9/463–479 kbp | ~31 | |
| Horse | Individual fecal sample | 1/1 | 1/660 kbp | ~29 |
All new Lak genomes reconstructed in this study are listed, which were included for protein family analyses, along with the 15 published Lak genomes (Devoto et al., 2019) and all the 181 circular huge phage genomes reported recently (Al-Shayeb et al., 2020). See details in Table S1.
Figure 2Lak phages from diverse animals are phylogenetically related
Phylogeny was based on sequences from PCR, genomes, and metagenomes. The nucleotide sequences encoding the major capsid protein (MCP) were aligned and trimmed so that all lengths corresponded with that of the PCR-derived sequences. The capsid of the ~660-kbp phage is very divergent from others, thus was excluded from the tree to enable resolution of the other sequences. The tree was rooted between the GC31 group and GC26 group, according to the full phylogeny of all Lak (including the ~660-kbp one) and some other published huge phages (Figure S6). Three partial Lak phage genomes do not contain the MCP sequences thus were excluded. The names of the complete Lak genomes reported in this study are in bold. Bristol pig sequences obtained from the vaginal mucosa were identical to those found in the digestive tract. Corresponding trees for portal vertex and tail sheath monomer genes are shown in Figures S2 and S3.
Figure 3Alternative coding is a persistent feature of the expanded Lak phage clade
The maximum likelihood phylogenetic tree (iqtree (v1.6.12) using the "LG+G4" model (-bb = 1000)) was constructed using sequences for the large terminase protein sequence (see STAR Methods). The genome sizes shown are based on those of complete Lak phages in each clade. Nodes with ≥90% bootstrap support values are indicated by filled black circles and nodes with 70–90% support by open circles. Recoding of the TAG stop codon was detected through the Lak lineages but not in phages represented by the deepest branch.
Figure 4Compressed version of the large terminase protein sequence alignment in which all positions except those with in-frame TAG codons (represented by ∗) have been deleted
Background shading indicates different Lak phage lineages, as shown in Figure 3). Colors superimposed on ∗ indicate positions in which there is within-clade consensus as to the identity of the aligned amino acid. In the Lak clades with ~26% GC (bottom three groups), Q is the aligned amino acid in 77%, 75% and 85% of cases. There is insufficient information in other groups to predict how TAG is translated.
Figure 5Lak phage genomes exhibit conserved, lineage-specific and animal-specific protein families
Phylogenomic analyses of the 51 Lak phage genomes were performed. The phylogenetic tree (left) was built based on concatenated sequences of 49 single copy protein families detected in all Lak genomes and re-rooted using the sequence of the ~660-kbp horse-associated Lak phage. The protein family content heatmap (right), aligned with the phylogenetic tree, shows the presence/absence of protein families that could be detected in at least 4 genomes. The names of the 6 complete Lak genomes reported in this study are in bold. A total of 6 blocks of protein families with group-specific or animal-specific distribution patterns are highlighted in boxes and numbered.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| See | This study | NA |
| DSMZ | DSM 18205 | |
| QIAquick PCR purification kit | Qiagen | 28,106 |
| QIAquick gel extraction kit | Qiagen | 28,706 |
| QIAamp PowerFecal DNA kit | Qiagen | 51,804 |
| QuantiNova SYBR green PCR kit | Qiagen | 208054 |
| BIOTAQ™ DNA Polymerase | Bioline | BIO-21060 |
| Lak genomes | This paper | |
| Lak genomes | This paper | |
| Lak genomes | This paper | |
| See | This paper | |
| See | ||
| JMP® Pro 14.1 | SAS Institute Inc., NC, USA, 2019 | |
| MEGA X | ||
| BLASTN 2.10.0+ | ||
| Primer-BLAST | ||
| Geneious Prime | Geneious | |
| Bowtie2 V2.3.5.1 | ||
| Prodigal v2.6.3 | ||
| MEGAHIT v1.2.9 | ||
| IDBA_UD | ||
| tRNAscan-SE | ||
| MMseqs2 | ||
| IQ-TREE 2 | Minh et al., 2020 | |
| HHpred | ||
| OligoAnalyzer | Integrated DNA Technologies Inc., Iowa, USA | |
| Sanger sequencing | LightRUN | Eurofins, Germany |
| Published Lak genomes | ||