| Literature DB >> 35487995 |
Sonali Tayal1, Venugopal Bhatia2, Tanya Mehrotra1, Sonika Bhatnagar3,4.
Abstract
Molecular mimicry of host proteins by pathogens constitutes a strategy to hijack the host pathways. At present, there is no dedicated resource for mimicked domains and motifs in the host-pathogen interactome. In this work, the experimental host-pathogen (HP) and host-host (HH) protein-protein interactions (PPIs) were collated. The domains and motifs of these proteins were annotated using CD Search and ScanProsite, respectively. Host and pathogen proteins with a shared host interactor and similar domain/motif constitute a mimicry pair exhibiting global structural similarity (domain mimicry pair; DMP) or local sequence motif similarity (motif mimicry pair; MMP). Mimicry pairs are likely to be co-expressed and co-localized. 1,97,607 DMPs and 32,67,568 MMPs were identified in 49,265 experimental HP-PPIs and organized in a web-based resource, ImitateDB ( http://imitatedb.sblab-nsit.net ) that can be easily queried. The results are externally integrated using hyperlinked domain PSSM ID, motif ID, protein ID and PubMed ID. Kinase, UL36, Smc and DEXDc were frequent DMP domains whereas protein kinase C phosphorylation, casein kinase 2 phosphorylation, glycosylation and myristoylation sites were frequent MMP motifs. Novel DMP domains SANT, Tudor, PhoX and MMP motif microbody C-terminal targeting signal, cornichon signature and lipocalin signature were proposed. ImitateDB is a novel resource for identifying mimicry in interacting host and pathogen proteins.Entities:
Keywords: Domain; Host–pathogen protein–protein interaction; Infectious diseases; Molecular mimicry; Motif
Mesh:
Substances:
Year: 2022 PMID: 35487995 PMCID: PMC9054641 DOI: 10.1007/s00726-022-03163-3
Source DB: PubMed Journal: Amino Acids ISSN: 0939-4451 Impact factor: 3.789
Table depicting the different types of molecular mimicry
| S. no | Types of molecular mimicry | Examples | PDB structure ID |
|---|---|---|---|
| a | Similarity in both sequence and structure of a full-length protein or a functional domain | K3L protein of Vaccinia Virus mimics the S1 domain of human eIF-2-α to bind to human PRK (Sharp et al. | K3L S1 domain: PDB ID: 1luz |
| eIF-2-α S1 domain: PDB ID: 1kl9 | |||
| b | Structural similarity without an apparent sequence similarity | Viral proteins (US28 protein of human cytomegalovirus and vMIP-II protein of Kaposi Sarcoma virus) mimics both the structures and interactions of their host counterparts (CXCR4 and CX3CL1, respectively) although they have a very low sequence similarity (Standfuss | US28-CX3CL1 (PDB ID: 4xt1) |
| vMIP-II-CXCR4) PDB ID: 4rws) | |||
| c | Similarity in the sequence of a short linear motif | The LMP1 protein of Epstein–Barr Virus mimics the motif PxQxT of CD40 to interact with TRAF3(Wu et al. | LMP1-TRAF3(PDB ID: 1zms) |
| TRAF3-CD40(PDB ID: 1fll) | |||
| d | Similarity of only the binding site architectures (interface mimicry) without sequence homology | Map of | Cdc42-Map (PDB ID: 3gcg), |
| Cdc42-SopE (PDB ID: 1gzs), | |||
| Cdc42-Intersectin (PDB ID: 1ki1) |
Fig. 1Panel (a) Domain Mimicry Pair (DMPs) and Motif Mimicry Pairs in Host–pathogen protein–protein interactions and Host–host interaction network: a pathogen protein(red) interacts with many host proteins(blue) that in turn interacts with many other host proteins (yellow). The pathogen protein(red) and host first interactor protein(yellow) with a common interacting host protein (blue) are compared for similar domains (green) and motifs (purple) to determine DMPs and MMPs respectively. Panel (b) A Schematic of individual DMP: a host first interactor protein (yellow) and pathogen protein (red) that interacts with the same host protein (blue) share the same domain ‘A’ (green). B Schematic of individual MMP: a host first interactor protein (yellow) and pathogen protein (red) that interacts with the same host protein (blue) share the same motif ‘A’ (purple)
Fig. 2a Schematic of DMP through allosteric binding: a host first interactor protein (yellow) and pathogen protein (red) that interact with the same host protein (blue) share the same domain ‘A’ (green). Here, the pathogen protein and host first interactor protein bind to the same host through the mimicked domain but at different sites i.e., allosterically, thus leading to a change in shape of the host protein binding site. b Schematic of MMP through allosteric binding: a host first interactor protein (yellow) and pathogen protein (red) that interacts with the same host protein (blue) share the same motif ‘A’ (purple). Here, the pathogen protein and host first interactor protein bind to the same host through the mimicked motif but at different sites, i.e., allosterically thus leading to a change in shape of the host protein binding site
Fig. 3Database schematic: the basic pipeline for search options is represented on the left and the basic workflow as well as the count of entities in the database are shown on the right
Number of entitiesa for pathogen proteins and host first interactor proteins
| Entities | Pathogen proteins | Host first interactor proteins |
|---|---|---|
| Total count of entities | ||
| Total proteins in HP-PPIs and HHIs | 5568 from 629 pathogens | 11,657 |
| Total domains | 68,838 | 4,78,710 |
| Unique domains | 17,465 | 25,245 |
| Total motifs | 31,594 | 79,944 |
| Unique motifs | 1046 | 1661 |
| Count of entities involved in molecular mimicry | ||
| Total proteins involved in Molecular mimicry | 5254 from 611 pathogens | 11,633 |
| Proteins involved in domain mimicry | 607 from 146 pathogens | 1,558 |
| Proteins involved in motif mimicry | 5254 from 611 pathogens | 11,633 |
| Unique domains mimicked/shared between pathogen and host interactor proteins | 4300 | |
| Unique motifs mimicked/ Shared between pathogen and host interactor proteins | 96 | |
aSince every protein has a unique UniProt accession ID that makes that protein unique and different from other, the count for different proteins is done based on the UniProt accession of each protein. Every pathogen strain (having a unique NCBI taxon identifier) as a different organism and every protein of that organism (having a unique UniProt accession) is considered as a distinct entity. The count of different domains is based on the PSSM-ID of each domain. The count of different motifs is based on the ScanProsite Motif-ID of each motif
Number of DMPs, MMPs, total HP-PPIs and HP-PPIs characterized by mimicked domains and motifs for each pathogen category
| Pathogen category | Total number of HP-PPIs | Number of HP-PPIs in mimicked domains | Number of HP-PPIs in mimicked motifs | Number of DMPs | Number of MMPs |
|---|---|---|---|---|---|
| Virus | 49,249 | 822 | 39,694 | 1,07,325 | 21,99,131 |
| Bacteria | 10,066 | 221 | 7708 | 5254 | 5,76,195 |
| Fungi | 1869 | 496 | 1833 | 75,729 | 4,88,705 |
| Others | 30 | 12 | 30 | 9299 | 3537 |
| Total | 61,214 | 1551 | 49,265 | 1,97,607 | 32,67,568 |
Fig. 4a The ImitateDB web interface: expanded view of the search panel of the web interface showing the steps to query the ImitateDB database. b Receive large result files by email: expanded view of the mailer popped up on the ImitateDB interface