| Literature DB >> 34718738 |
Manjeet Kumar1, Sushama Michael1, Jesús Alvarado-Valverde1,2, Bálint Mészáros1, Hugo Sámano-Sánchez1,3,4, András Zeke5, Laszlo Dobson1,5, Tamas Lazar6,7, Mihkel Örd8, Anurag Nagpal9, Nazanin Farahi6,7, Melanie Käser1,10, Ramya Kraleti1,11, Norman E Davey8, Rita Pancsa1,5, Lucía B Chemes12, Toby J Gibson1.
Abstract
Almost twenty years after its initial release, the Eukaryotic Linear Motif (ELM) resource remains an invaluable source of information for the study of motif-mediated protein-protein interactions. ELM provides a comprehensive, regularly updated and well-organised repository of manually curated, experimentally validated short linear motifs (SLiMs). An increasing number of SLiM-mediated interactions are discovered each year and keeping the resource up-to-date continues to be a great challenge. In the current update, 30 novel motif classes have been added and five existing classes have undergone major revisions. The update includes 411 new motif instances mostly focused on cell-cycle regulation, control of the actin cytoskeleton, membrane remodelling and vesicle trafficking pathways, liquid-liquid phase separation and integrin signalling. Many of the newly annotated motif-mediated interactions are targets of pathogenic motif mimicry by viral, bacterial or eukaryotic pathogens, providing invaluable insights into the molecular mechanisms underlying infectious diseases. The current ELM release includes 317 motif classes incorporating 3934 individual motif instances manually curated from 3867 scientific publications. ELM is available at: http://elm.eu.org.Entities:
Mesh:
Substances:
Year: 2022 PMID: 34718738 PMCID: PMC8728146 DOI: 10.1093/nar/gkab975
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Data growth progression in the ELM resource. (A) Cumulative number of annotated motif classes and instances added in ELM. (B) Cumulative number of structures and papers added to the database. (C) Distribution of newly added instances between novel and already existing motif classes (lighter and darker shades in the outer circle, respectively) belonging to five different functional categories. The pie chart only includes the 35 classes where new instances have been added: for visual clarity, classes with less than five instance additions are not labelled in the pie-chart. The plotted instances in the pie-chart also include remapped instances from previously existing classes in the database. The plots were produced using ggplot2 in R-studio (http://www.rstudio.com/).
Concise summary of data captured in the ELM resource
| Functional sites | ELM classes | ELM instances | GO terms | PDB structures | PubMed links | ||||
|---|---|---|---|---|---|---|---|---|---|
| Total | 190 | 317 | 3934 | 884 | 616 | 3867 | |||
| By Category | LIG | 178 | Human | 2222 | Biological Process | 489 | |||
| MOD | 38 | Mouse | 373 | ||||||
| DOC | 38 | Rat | 171 | Cellular Component | 184 | ||||
| DEG | 26 | Yeast | 330 | ||||||
| TRG | 26 | Viruses | 277 | Molecular Function | 211 | ||||
| CLV | 11 | Others | 561 | ||||||
New and revised ELM entries in the current ELM release
| New ELM classes | ||
|---|---|---|
| ELM class identifier | #Instances | ELM class (short) description |
| DEG_SCF_FBXO31_1 | 3 | The C-terminal degron of cyclin D proteins is bound by the FBXO31 F-box E3 ligase of the SCF (Skp1-Cullin-Fbox) complex. |
| DOC_CDC14_PxL_1 | 10 | The PxL substrate docking motif enhances the Cdc14 phosphatase–substrate interaction and promotes subsequent dephosphorylation. |
| DOC_CYCLIN_D_Helix_1 | 3 | The Cyclin D Helical docking motif mediates binding of substrates to a site on Cyclin D different from the hydrophobic pocket and enhances substrate phosphorylation by CyclinD/Cdk4-6 complexes. |
| DOC_CYCLIN_yCln2_LP_2 | 18 | The budding yeast G1/S cyclins Cln1 and 2 bind a specific leucine- and proline-rich (LP) docking motif on G1-specific target proteins. |
| DOC_CYCLIN_yClb3_PxF_3 | 4 | The hydrophobic patch (hp) of the G2 phase cyclin from budding yeast, Clb3, binds a specific PxF docking motif on regulators and target proteins. |
| DOC_CYCLIN_yClb1_LxF_4 | 13 | The LxF motif found in budding yeasts serves as a docking site for mitotic cyclin-CDK complexes (M-CDK). It is found in both regulators and mitotic phosphorylation target proteins. |
| DOC_CYCLIN_yClb5_NLxxxL_5 | 5 | Cyclin hydrophobic patch docking motif NLxxxL specific for S-phase cyclins Clb5 and Clb6 in budding yeasts. |
| DOC_MIT_MIM_1 | 5 | C-terminal LxxR[FL]xxL based type 1 MIT interacting motif (MIM1) that docks at the MIT domain present in some ESCRT-III proteins. |
| LIG_ActinCP_CPI_1 | 15 | The conserved capping protein interaction (CPI) motif is employed by a diverse set of proteins to allosterically down-regulate actin filament capping by CP and thereby fine-tune actin assembly dynamics. |
| LIG_ActinCP_TwfCPI_2 | 4 | The highly conserved twinfilin-type actin capping protein interaction (CPI) motif is employed by twinfilins to maintain the dynamic actin capping/decapping cycles of CP and to counterbalance the effects of negative regulators. |
| LIG_DLG_GKlike_1 | 14 | The guanylate kinase-like domain of DLG family membrane-associated scaffolding proteins binds phosphorylated motifs in SAPAPs and other protein partners. |
| LIG_Integrin_RGD_TGFB_3 | 5 | A C-terminally extended subtype of the canonical RGD motif strongly binding to integrins αvβ6 and αvβ8. |
| LIG_Integrin_RGDW_4 | 18 | A C-terminally extended subtype of the canonical RGD motif strongly binding to integrins αIIbβ3 and αvβ3. |
| LIG_Integrin_KxxGD_FGGC_5 | 5 | An αIIbβ3 integrin-specific, C-terminal variant of the RGD motif where a displaced lysine substitutes for the canonical arginine. |
| LIG_KLC1_Yacidic_2 | 3 | A kinesin cargo motif binding to the TPR domain of KLC1 found in JIP1 and TorsinA. |
| LIG_LSD1_SNAG_1 | 11 | A repressor motif found in some zinc finger transcription factors binds to the amine oxidase domain of LSD1. |
| LIG_LYPXL_yS_3 | 2 | The yeast short version of the LYPxL motif binds the V-domain of Bro1 and Rim20, proteins involved in endosomal sorting and pH signalling. |
| LIG_LYPXL_SIV_4 | 3 | The SIV helical version of the LYPxL motif binds the V-domain of Alix, a protein involved in endosomal sorting. |
| LIG_NRP_CendR_1 | 12 | The CendR motif has a carboxy-terminal arginine, which binds to the Neuropilin b1 domain binding site. CendR motifs are either located at the protein C-terminus or are generated by internal cleavage by a polybasic protease, such as Furin |
| LIG_PCNA_TLS_4 | 3 | The PCNA binding motifs include the PIP Box, PIP degron, the APIM and the TLS motif. These motifs are found in proteins involved in DNA replication, repair, methylation and cell cycle control. |
| LIG_RuBisCO_WRxxL_1 | 20 | The WRxxL RuBisCO-binding motif present in Pyrenoid proteins promotes the assembly of this algal organelle and its different compartments. |
| LIG_SH3_CIN85_PxpxPR_1 | 60 | The non-canonical SH3-binding motif is recognized primarily by adaptor proteins CIN85 and CD2AP, which are involved in RTK regulation, endocytosis, lysosomal degradation, actin cytoskeleton dynamics regulation, and signal transduction |
| LIG_WRC_WIRS_1 | 22 | WRC interacting receptor sequence (WIRS) is a highly conserved and widespread interaction motif that is employed by diverse membrane proteins to recruit the WRC to initiate the dynamic rearrangements of the actin cytoskeleton. |
| MOD_CDC14_SPxK_1 | 48 | A subset of Cdk phosphorylation sites conform to the (S)PxK/r pattern that serves as an optimal Cdc14 dephosphorylation site, allowing high catalytic efficiency. |
| TRG_DiLeu_BaEn_1 | 23 | Classical adaptin sigma subunit-binding acidic dileucine motifs sorting in Endosomal-Basolateral trafficking. |
| TRG_DiLeu_BaEn_2 | 4 | Phe-containing variant adaptin sigma subunit-binding acidic dileucine motifs sorting in Endosomal-Basolateral trafficking |
| TRG_DiLeu_BaEn_3 | 4 | Diglutamate-containing variant Adaptin sigma subunit-binding acidic dileucine motifs sorting in Endosomal-Basolateral trafficking. |
| TRG_DiLeu_BaEn_4 | 4 | Acidic dileucine motifs with a monoleucine preference and extra glutamate sorting in Endosomal-Basolateral trafficking. |
| TRG_DiLeu_LyEn_5 | 18 | Dileucine motifs lacking Glu + 1 with Pro-Arg preference at + 4 sorting in Endosomal-Basolateral-Lysosomal trafficking. |
| TRG_DiLeu_BaLyEn_6 | 18 | Acidic dileucine motifs with Arg or Pro preference at position 4 interacting with AP-3 and sorting in Endosomal-Basolateral-Lysosomal trafficking. |
| Extensively revised ELM classes | ||
| DOC_CYCLIN_RxL_1 | 31 | Both fungal and mammalian S-phase Cyclin/CDK complexes recognize specific RxL docking motifs in their target proteins. |
| LIG_LIR_Gen_1 | 54 | Canonical LIR motif that binds to Atg8/LC3 protein family members to mediate processes involved in autophagy. |
| LIG_LIR_Nem_3 | 11 | Nematode-specific variant of the canonical LIR motif that binds to Atg8 protein family members to mediate processes involved in autophagy. |
| LIG_LYPXL_S_1 | 18 | The short version of the LYPxL motif binds the V-domain of Alix, a protein involved in endosomal sorting. |
| LIG_LYPXL_L_2 | 4 | The long version of the LYPxL motif binds the V-domain of Alix, a protein involved in endosomal sorting. |
Figure 2.Structural information on motif-mediated regulation by and of the Cyclin-dependent kinases. (A) Representative structure of the cyclin:Cdk:Cks holoenzyme showing the three structurally characterised motif-recognising regions of the complex: two docking pockets on the cyclin and CKS subunits and the active site of the holoenzyme on the Cdk subunit. The image is constructed from two structures: a crystal structure of human Cdk2:Cyclin A2 in complex with a peptide containing both the SP phosphosite substrate and RxL docking motif of human Cdc6 [PDB ID:2CCI], and a crystal structure of a yeast Cdc6 phosphopeptide in complex with yeast Cks1 [PDB ID:4LPA]. Cks1 was placed on the holoenzyme based on the Cks2:Cdk1 interface in a Cdk:cyclin B1:Cks2 structure [PDB ID:4YC3]. (B) Table of the diverse motifs that allow docking and modification by the cyclin:Cdk:Cks holoenzyme split by their pocket on the the complex. Bold and underlined residues are the defined residues from the ELM class for the motif. (C) Close-up structure of the interface of yeast dual specificity-protein phosphatase CDC14 (grey) in complex with the PxL Cdc14 docking motif of yeast SIC1 (orange) [PDB ID:6G86]. (D) Close-up of the structure of the interface of human F-box only protein 31 (FBXO31) (grey) in complex with the FBXO31 degron of human cyclin D1 (orange) [PDB ID:5VZU].
Figure 3.SLiM-mediated interactions in the endocytosis network (KEGG:hsa04144). Proteins with ELM instance annotations are coloured according to motif class: docking—light brown; ligand binding—light blue; modification—salmon; targeting—yellow; multiple classes—pink. Pathogens and their proteins hijacking the endocytosis pathway components are overlaid on the network (light green background rectangles with rounded corners). A red arrow from the rectangle points to the hijacked protein in the network. See ELM for more details such as the binding domains being hijacked by the pathogens. Reproduced from https://www.kegg.jp/pathway/hsa04144 with permission.
Figure 4.Setting up the ELM server correctly to query viral proteins for SLiM candidates. The example uses IDP-rich UL135 transmembrane protein of Cytomegalovirus for which the natural host is human. UL135 has an N-terminal transmembrane helix and the following sequence faces into the host cell and therefore the correct cell compartment to use is cytosol. The correct species is the host Homo sapiens. For the illustration, the motif probability score has been set stringently (0.001) to exclude many matches to motifs with low sequence complexity. In the output, the two recently added PxpxPR motif instances (50) are shown as red ovals. All other motif matches are hypothetical. Three motif matches have conflicting compartment information and are therefore highly improbable. One of the PxpxPR motifs is conserved in a related Cytomegalovirus which infects Chimpanzee (Lower right box).