| Literature DB >> 35880895 |
Gamaliel López-Leal1, Laura Carolina Camelo-Valera1, Juan Manuel Hurtado-Ramírez2, Jérôme Verleyen2, Santiago Castillo-Ramírez3, Alejandro Reyes-Muñoz1.
Abstract
Phages and prophages are one of the principal modulators of microbial populations. However, much of their diversity is still poorly understood. Here, we extracted 33,624 prophages from 13,713 complete prokaryotic genomes to explore the prophage diversity and their relationships with their host. Our results reveal that prophages were present in 75% of the genomes studied. In addition, Enterobacterales were significantly enriched in prophages. We also found that pathogens are a significant reservoir of prophages. Finally, we determined that the prophage relatedness and the range of genomic hosts were delimited by the evolutionary relationships of their hosts. On a broader level, we got insights into the prophage population, identified in thousands of publicly available prokaryotic genomes, by comparing the prophage distribution and relatedness between them and their hosts. IMPORTANCE Phages and prophages play an essential role in controlling their host populations either by modulating the host abundance or providing them with genes that benefit the host. The constant growth in next-generation sequencing technology has caused the development of powerful computational tools to identify phages and prophages with high precision. Making it possible to explore the prophage populations integrated into host genomes on a large scale. However, it is still a new and under-explored area, and efforts are still required to identify prophage populations to understand their dynamics with their hosts.Entities:
Keywords: archaea; bacteria; bacteriophages; eubacteria; prophages; viral diversity
Year: 2022 PMID: 35880895 PMCID: PMC9426530 DOI: 10.1128/msystems.00326-22
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 7.324
FIG 1Distribution of the number of prophages (A) and the average of prophages (B) per host genome based on the genome size (Mb). The total number of prophages (green line) and prophages ≥30 Kb (brown line) are shown in each plot.
FIG 2Bacteria and Archaea prophage distribution. The phylogenetic tree was generated using Lifemap at order level. The phyla are shown in different colors. The Archaea and Bacteria branches are shown in dashed and solid lines, respectively.
FIG 3Boxplots of P-values for different genera. P-values were calculated by collecting and bootstrapping (100 times randomly) the number of prophages in 10 genomes for each genus, using the Wilcoxon test (see methods). The blue dotted line indicates the significant Wilcoxon tests (p <0.05). The boxplots are colored with respect to the Order taxonomic level.
Genomic host range of the viral clusters
| Viral cluster (VC) Id | Present in | Host range taxonomic level | ANI between hosts |
|---|---|---|---|
| vOTU_1233 | 7 | Genus | 79.49% |
| vOTU_94 | 3 | Phylum (Proteobacteria, Actoinobacteria) | 71.18% |
| vOTU_1254 | 2 | Family ( | 73.20% |
| vOTU_1158, vOTU_1237, vOTU_51, vOTU_831 | 1 | Genus | 83.11% |
| vOTU_1201 | 1 | Family ( | 77.11% |
| vOTU_1442 | 1 | Family ( | 74.60% |
The viral clusters were assigned by 90% nucleotide similarity and 80% coverage using cd-hit (see Materials and Methods). The number of prophage identified in the host genome for specific viral cluster are shown. Level of the genomic host range and the Average Nucleotide Identity (ANI) between host are displayed.
FIG 4(A) Prophage network generated with prophages with ≥80 AAI (2,485 prophages), visualization produced with Cytoscape (see methods). Nodes represent prophages and edges represent their weighted pairwise similarities of AAI. Nodes (prophages) are depicted with different colors according to their phylum host. (B) Prophage fraction which shared an AAI value ≥80 between prophages of different lysogens at all taxonomic levels.