| Literature DB >> 29140456 |
Kenneth F Lyon1, Xingyu Cai2, Richard J Young1, Abdullah-Al Mamun2, Sanguthevar Rajasekaran2, Martin R Schiller1.
Abstract
Minimotif Miner (MnM) is a database and web system for analyzing short functional peptide motifs, termed minimotifs. We present an update to MnM growing the database from ∼300 000 to >1 000 000 minimotif consensus sequences and instances. This growth comes largely from updating data from existing databases and annotation of articles with high-throughput approaches analyzing different types of post-translational modifications. Another update is mapping human proteins and their minimotifs to know human variants from the dbSNP, build 150. Now MnM 4 can be used to generate mechanistic hypotheses about how human genetic variation affect minimotifs and outcomes. One example of the utility of the combined minimotif/SNP tool identifies a loss of function missense SNP in a ubiquitylation minimotif encoded in the excision repair cross-complementing 2 (ERCC2) nucleotide excision repair gene. This SNP reaches genome wide significance for many types of cancer and the variant identified with MnM 4 reveals a more detailed mechanistic hypothesis concerning the role of ERCC2 in cancer. Other updates to the web system include a new architecture with migration of the web system and database to Docker containers for better performance and management. Weblinks:minimotifminer.org and mnm.engr.uconn.edu.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29140456 PMCID: PMC5753208 DOI: 10.1093/nar/gkx1085
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
A decade of growth of the MnM database
| Category | MnM | MnM 2 | MnM 3 | MnM 4 |
|---|---|---|---|---|
| Total minimotifs | 462 | 5089 | 294 933 | 1 060 436 |
| Consensus | 312 | 858 | 880 | 894 |
| Instance | 44 | 4229 | 294 053 | 1 059 542 |
| Activity classes | ||||
| Post-translational modifications | 116 | 663 | 210 949 | 912 735 |
| Binding | 162 | 4689 | 4922 | 147 654 |
| Trafficking | 34 | 195 | 228 | 229 |
| Required for cell process | 47 | 47 | ||
| Unique minimotif | ||||
| Sequences | 312 | 2224 | 185 833 | 590 589 |
| Proteins | <312 | 1211 | 49 671 | 182 868 |
| Targets | <312 | 687 | 2620 | 4586 |
| C-Terminal minimotifs | ND | ND | ND | 12 808 |
ND = not determined.
Enrichment of Gene Ontology terms for human proteins with minimotifs1
| Description |
|
| Type |
|---|---|---|---|
| Olfactory receptor activity | 2.89E-265 | 1.14E-261 | function |
| Detection of chemical stimulus involved in perception of smell | 2.89E-265 | 3.95E-261 | process |
| Detection of stimulus involved in sensory perception | 8.09E-232 | 2.76E-228 | process |
| G-protein coupled receptor activity | 2.84E-213 | 5.6E-210 | function |
| Detection of stimulus | 1.82E-207 | 4.98E-204 | process |
| Transmembrane signaling receptor activity | 1.17E-190 | 1.53E-187 | function |
| Transmembrane receptor activity | 2.07E-189 | 2.05E-186 | function |
| G-protein coupled receptor signaling pathway | 8.6E-187 | 1.96E-183 | process |
| Signaling receptor activity | 6.33E-182 | 5E-179 | function |
| Molecular transducer activity | 4.6E-161 | 2.6E-158 | function |
1 Supplementary Table S1 is a list of the top 50 enriched GO functions.
Figure 1.Binplots representing counts of minimotif activity classes. Instances of major minimotif modification activities (A) 14 most common modification activity subclasses in MnM 4 (B), and the 12 new manually annotated modification activity subclasses for MnM 4 (C).
Figure 2.Outputs of SNP functional analysis for excision repair cross-complementing 2 (ERCC2) with MnM. (A) MnM 3 output of ERCC2 with minimotifs highlighted magenta and SNPs highlighted dark blue or green. The table shows a list of the eight SNPs that are indicated in the sequence window. (B) Example rows of MnM 3 output showing minimotifs introduced (red font) or eliminated (green font) by an SNP that is selected in the sequence window. (C) MnM 4 output of ERCC2 with minimotifs highlighted magenta and SNPs highlighted dark blue or green. The table shows a list of the 29 SNPs that are indicated in the sequence window.