| Literature DB >> 34305853 |
Hyunjin Shim1, Haridha Shivram2, Shufei Lei1, Jennifer A Doudna2, Jillian F Banfield1,2,3,4,5,6.
Abstract
Prokaryote mobilome genomes rely on host machineries for survival and replication. Given that mobile genetic elements (MGEs) derive their energy from host cells, we investigated the diversity of ATP-utilizing proteins in MGE genomes to determine whether they might be associated with proteins that could suppress related host proteins that consume energy. A comprehensive search of 353 huge phage genomes revealed that up to 9% of the proteins have ATPase domains. For example, ATPase proteins constitute ∼3% of the genomes of Lak phages with ∼550 kbp genomes that occur in the microbiomes of humans and other animals. Statistical analysis shows the number of ATPase proteins increases linearly with genome length, consistent with a large sink for host ATP during replication of megaphages. Using metagenomic data from diverse environments, we found 505 mobilome proteins with ATPase domains fused to diverse functional domains. Among these composite ATPase proteins, 61.6% have known functional domains that could contribute to host energy diversion during the mobilome infection cycle. As many have domains that are known to interact with nucleic acids and proteins, we infer that numerous ATPase proteins are used during replication and for protection from host immune systems. We found a set of uncharacterized ATPase proteins with nuclease and protease activities, displaying unique domain architectures that are energy intensive based on the presence of multiple ATPase domains. In many cases, these composite ATPase proteins genomically co-localize with small proteins in genomic contexts that are reminiscent of toxin-antitoxin systems and phage helicase-antibacterial helicase systems. Small proteins that function as inhibitors may be a common strategy for control of cellular processes, thus could inspire future biochemical experiments for the development of new nucleic acid and protein manipulation tools, with diverse biotechnological applications.Entities:
Keywords: ATPase protein; antibiotic; genome editing; host energy; metagenome; mobilome
Year: 2021 PMID: 34305853 PMCID: PMC8297831 DOI: 10.3389/fmicb.2021.691847
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Phage infection cycle with a list of proteins known to have an ATPase domain (highlighted in red), potentially involved in the energy consuming activities of each stage.
Summary statistics of 353 huge phage genomes and associated ATPase proteins.
| Summary statistics | Genome length (kbp) | Total proteins | GC content (%) | ATPase domains | ATPase proteins | Multiple ATPase proteins |
| Mean | 291 | 397.2 | 39.20 | 14.29 | 12.59 | 1.70 |
| Standard deviation | 89 | 160.68 | 8.99 | 6.14 | 5.28 | 1.30 |
| Min | 104 | 106 | 21.98 | 3 | 3 | 0 |
| Max | 735 | 1144 | 63.98 | 60 | 47 | 13 |
FIGURE 2Survey of ATPase proteins on the complete genome of a huge phage (PHAGE-A1–js4906-22-3_S10_HugePhage_Circular_26_80_closed).
FIGURE 3Analysis of the functional domains of composite ATPase proteins from the mobilome metagenomes. The bar chart shows the predicted functions of the 221 composite ATPase proteins are nucleic acid and amino acid interacting enzymes. For each category, a representative example of a composite ATPase protein is shown, with all the known domains drawn to scale. The red rectangles indicate the domains predicted to have energy-utilizing functions such as AAA+ and ABC.
FIGURE 4A phylogenetic tree of the ATPase domains from diverse environments is built using the multiple sequence alignment and the structural reconstruction is shown for each class of ATPase domains.
FIGURE 5Examples of genomic neighborhood of composite ATPase proteins, with the proteins localized ±3 of each composite ATPase protein indicated in the middle. Most proteins are unannotated and hypothetical proteins, except those with labels. Identical to the genomic neighborhood heatmap, the color represents the number of blast hits in the ggKbase database. (A) Composite ATPase proteins that are found to have highly conserved operons (indicated with green stars in Supplementary Figure 5B). (B) Composite ATPase proteins that are found to have highly conserved pairs with a sequence overlap (indicated with blue stars in Supplementary Figure 5B).
Percentage of ATPase per protein by category in the mobilome metagenomes.
| Virus | Phage | Plasmid | Other | |
| Number of proteins | 44,243 | 731,372 | 112,101 | 35,695 |
| Number of ATPases | 361 | 2,654 | 2,216 | 266 |
| ATPase/protein | 0.82% | 0.36% | 1.98% | 0.75% |