| Literature DB >> 29136216 |
Marc Gouw1, Sushama Michael1, Hugo Sámano-Sánchez1, Manjeet Kumar1, András Zeke2, Benjamin Lang1, Benoit Bely3, Lucía B Chemes4,5,6, Norman E Davey7, Ziqi Deng1, Francesca Diella1, Clara-Marie Gürth8, Ann-Kathrin Huber8, Stefan Kleinsorg8, Lara S Schlegel8, Nicolás Palopoli9, Kim V Roey1, Brigitte Altenberg1, Attila Reményi2, Holger Dinkel1,10, Toby J Gibson1.
Abstract
Short linear motifs (SLiMs) are protein binding modules that play major roles in almost all cellular processes. SLiMs are short, often highly degenerate, difficult to characterize and hard to detect. The eukaryotic linear motif (ELM) resource (elm.eu.org) is dedicated to SLiMs, consisting of a manually curated database of over 275 motif classes and over 3000 motif instances, and a pipeline to discover candidate SLiMs in protein sequences. For 15 years, ELM has been one of the major resources for motif research. In this database update, we present the latest additions to the database including 32 new motif classes, and new features including Uniprot and Reactome integration. Finally, to help provide cellular context, we present some biological insights about SLiMs in the cell cycle, as targets for bacterial pathogenicity and their functionality in the human kinome.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29136216 PMCID: PMC5753338 DOI: 10.1093/nar/gkx1077
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The number of SLiMs (motif classes and motif instances) created and modified in ELM. For the past 15 years, ELM has been steadily growing, and the total number of motif classes (dark orange) and motif instances (dark purple) continues to grow each year. As of September 2017, there are over 275 motif classes defined, and over 3000 motif instances annotated. Besides contributing new content, curators also updated existing annotations to include new findings (last modification dates for motif classes in dashed light purple and for motif instances in dashed light orange).
New ELM motif classes
| ELM motif class identifier | # Instances | ELM motif class description |
|---|---|---|
| DEG_COP1_1 | 12 | A destruction motif interacts with the COP1 WD 40 domain for target ubiquitination and degradation. |
| DOC_MAPK_DCC_7 | 11 | A kinase docking motif mediating interaction toward the ERK1/2 and p38 subfamilies of MAP kinases. |
| DOC_MAPK_HePTP_8 | 10 | A kinase docking motif that interacts with the ERK1/2 and p38 subfamilies of MAP kinases. |
| DOC_MAPK_JIP1_4 | 29 | A shorter D site specifically recognized by the JNK kinases |
| DOC_MAPK_MEF2A_6 | 24 | A kinase docking motif that mediates interaction toward the ERK1/2 and p38 subfamilies of MAP kinases. |
| DOC_MAPK_NFAT4_5 | 17 | An extended D site specifically recognized by the JNK kinases. |
| DOC_MAPK_RevD_3 | 6 | Reverse (C to N direction) of the classical MAPK docking motif ELM:DOC_MAPK_gen_1 with an often extended linker region of the bipartite motif. |
| DOC_PP2A_B56_1 | 18 | Docking site required for the regulatory subunit B56 of PP2A for protein dephosphorylation. |
| LIG_14-3-3_CanoR_1 | 62 | Canonical Arg-containing phospho-motif mediating a strong interaction with 14-3-3 proteins. |
| LIG_14-3-3_ChREBP_3 | 1 | 14-3-3 protein binding to a nonphosphorylated helical peptide in ChREBP is promoted by adenosine monophosphate. |
| LIG_14-3-3_CterR_2 | 5 | C-terminal Arg-containing phospho-motif mediating a strong interaction with 14-3-3 proteins. |
| LIG_ANK_PxLPxL_1 | 10 | The consensus PxLPxI/L motif, which can be found in diverse proteins, binds to the ankyrin repeat domains of ANKRA2 and its close paralog RFXANK. |
| LIG_APCC_ABBA_1 | 11 | Amphipathic motif that is involved in APC/C inhibition by binding of CDH1/CDC20. In metazoan cyclin A, the motif also acts as a degron, enabling the cyclin’s degradation in prometaphase. |
| LIG_APCC_ABBAyCdc20_2 | 2 | Amphipathic motif that binds to yeast Cdc20 and acts as an APC/C degron enabling cyclin Clb5 degradation during mitosis. |
| LIG_BH_BH3_1 | 19 | The BH3 motif is found in pro-apoptotic proteins and interacts with BH domains of the anti-apoptotic Bcl-2 family members to regulate apoptosis. |
| LIG_CSK_EPIYA_1 | 13 | Csk Src Homology 2 domain-binding EPIYA motif. |
| LIG_CSL_BTD_1 | 18 | The motif mediates the interaction between a notch-like protein and the transcription factor CSL by placing two amino acids (W and P) into a hydrophobic pocket of the Beta-trefoil DNA-binding (BTD) domain of CSL. |
| LIG_G3BP_FGDF_1 | 9 | The FGDF motif binds to a hydrophobic binding cleft within the N-terminal NTF2-like domain of the stress granule protein G3BP. |
| LIG_GBD_Chelix_1 | 12 | Amphipatic α-helix that binds the GTPase-binding domain (GBD) in WASP and N-WASP. |
| LIG_GSK3_LRP6_1 | 8 | PPPSP motif present on the cytosolic tails of the transmembrane receptors LRP5 and LRP6, responsible for GSK3 binding and inhibition when phosphorylated. |
| LIG_IRF3_LxIS_1 | 5 | A binding site for IRF-3 protein present in various innate adaptor proteins and the viral protein NSP1to trigger the innate immune responsive pathways. |
| LIG_KLC1_WD_1 | 22 | This short WD or WE motif is found in cargo proteins and mediates kinesin-1-dependent microtubule transport by binding to the KLC TPR region. |
| LIG_LRP6_Inhibitor_1 | 3 | Short motif present in extracellular of some Wnt antagonists recognized by the N-terminal β-propeller domain of LRP5/6 and thus inhibits the Wnt pathway. |
| LIG_PALB2_WD40_1 | 1 | A motif present in the BRCA2 protein which binds to the WD 40 repeat (blade 4,5) domain of PALB2 which is required for the recognition of DNA double strand breaks and repair. |
| LIG_Rrp6Rrp47_Mtr4_1 | 6 | The motif enables the interaction of Mtr4 like helicaes with the Rrp6-Rrp47 heterodimer and thus the formation of the exosome-binding complex. |
| LIG_UFM1_UFIM_1 | 1 | UFIM is a motif present in the E1 enzyme UBA5 required to bind ubiquitin-like protein UFM1. UFIM overlaps with a LIR motif binding LC3/GABARAP family proteins. |
| LIG_Vh1_VBS_1 | 12 | An amphipathic α-helix recognized by the head domain of vinculin that is required for vinculin activation and actin filament attachment. |
| MOD_CDK_SPK_2 | 18 | Short version of the cyclin-dependent kinases (CDK) phosphorylation site which shows specificity toward a lysine/arginine residue at the [ST] +2 position. |
| MOD_CDK_SPxxK_3 | 25 | Longer version of the CDK phosphorylation site which shows specificity toward a lysine/arginine residue at position +4 after the phospho-Ser/Thr. |
| MOD_Plk_1 | 23 | Ser/Thr residue phosphorylated by the Plk1 kinase. |
| MOD_Plk_2-3 | 3 | Ser/Thr residue phosphorylated by Plk2 and Plk3. |
| MOD_Plk_4 | 7 | Ser/Thr residue phosphorylated by Plk4. |
Since the last NAR database issue publication 32 motif classes have been annotated to the database. (13)
New ELM motif classes and instances
| Motif type | Motif classes added | Motif classes modified | Taxon | Motif instances added | Motif instances modified |
|---|---|---|---|---|---|
| DEG | 1 | 1 | Human | 315 | 10 |
| CLV | 0 | 1 | other Animal | 87 | 2 |
| TRG | 0 | 0 | Fungi | 17 | 0 |
| LIG | 19 | 9 | Plant | 10 | 3 |
| MOD | 5 | 2 | Bacteria | 23 | 0 |
| DOC | 7 | 5 | Virus | 39 | 0 |
Since the last NAR database issue publication in 2016 (13) a total of 32 motif classes and 491 motif instances have been added to the database. Most of the new motifs added are either Ligand (LIG) or Docking (DOC) motifs. Most of the new motif instances are Human, although motif instances for many other branches of life have also been added.
Figure 2.SLiMs play major roles in many biological pathways, including those involved in the progression of the cell cycle. Using the Reactome pathway annotations downloaded from ELM and using Cytoscape (42) to visualize the data, we can see that many SLiMs (especially Ligand and Degradation motifs) are involved in the cell cycle (A) and specifically in the mitotic spindle checkpoint (B).
Figure 3.Surface representation showing two MAPK docking motifs bound to the MAPK docking groove. Negative charges and positive charges are shown in red and blue, respectively, on MAPK and the docking motif is rendered in yellow. (A) The N- to C-terminal orientation of the MAPK docking motif shown for the DOC_MAPK_NFAT4_5 motif (with regular expression: [RK][∧P][∧P][LIM].L.[LIVMF]). Here, charged amino acids [RK] are followed by hydrophobic residues (PDB:2XS0; (43)). (B) The reverse MAPK docking motif shown for the DOC_MAPK_RevD_3 motif (with regular expression: [LIVMPFA].[LIV].1,2[LIVMP].4,6[LIV]..[RK][RK]), where the N-terminus has hydrophobic amino acids followed by charged residues (PDB:2Y9Q; (43)). Figures produced using Chimera (46).