| Literature DB >> 19477961 |
Darby Tien-Hao Chang1, Ting-Ying Chien, Chien-Yu Chen.
Abstract
Sequence motifs are important in the study of molecular biology. Motif discovery tools efficiently deliver many function related signatures of proteins and largely facilitate sequence annotation. As increasing numbers of motifs are detected experimentally or predicted computationally, characterizing the functional roles of motifs and identifying the potential synergetic relationships between them are important next steps. A good way to investigate novel motifs is to utilize the abundant 3D structures that have also been accumulated at an astounding rate in recent years. This article reports the development of the web service seeMotif, which provides users with an interactive interface for visualizing sequence motifs on protein structures from the Protein Data Bank (PDB). Researchers can quickly see the locations and conformation of multiple motifs among a number of related structures simultaneously. Considering the fact that PDB sequences are usually shorter than those in sequence databases and/or may have missing residues, seeMotif has two complementary approaches for selecting structures and mapping motifs to protein chains in structures. As more and more structures belonging to previously uncharacterized protein families become available, combining sequence and structure information gives good opportunities to facilitate understanding of protein functions in large-scale genome projects. Available at: http://seemotif.csie.ntu.edu.tw,http://seemotif.ee.ncku.edu.tw or http://seemotif.csbb.ntu.edu.tw.Entities:
Mesh:
Year: 2009 PMID: 19477961 PMCID: PMC2703912 DOI: 10.1093/nar/gkp439
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Examples of motifs found for an interested protein (UniProt entry P35835)
| Motif sources | Motifs | Database ID |
|---|---|---|
| MAGIIC-PRO | H-G-T-x(3)-G-x(77,101)-A-x-G-N-x(57,78)-G-T-S-x(3)-P | |
| GLAM2 | 165-214 or QDGSSHGTHVAGTIAALNNSIGVLGVAPSASLYAVKVLDSTGSGQYSWII | |
| PROSITE | [STAIV]-{ERDL}-[LIVMF]-[LIVM]-D-[DSTA]-G-[LIVMFC]-x(2,3)-[DNH] | PS00136 |
| PROSITE | H-G-[STM]-x-[VIC]-[STAGC]-[GS]-x-[LIVMA]-[STAGCLV]-[SAGM] | PS00137 |
| PROSITE | G-T-S-x-[SA]-x-P-x-{L}-[STAVC]-[AG] | PS00138 |
| ELM | [RK].[AILMFV][LTKF] | CLV_PCSK_SKI1_1 |
| ELM | Y[QDEVAIL][DENPYHI][IPVGAHS] | LIG_SH2_SRC |
Figure 1.An example of using seeMotif by combining two motifs from different computational tools. This figure comprises five parts: (1) motifs; (2) position mappings between the motifs and the reference sequence; (3) reference sequence with the matched positions highlighted; (4) a snapshot of the structure panel; and (5) local alignment of the reference sequence and the selected protein chain. In this example, motifs are plotted on the PDB structure 3PRK:E. Protein chains are shown in strands style. The residues matched by any of the motifs are illustrated as sticks with distinct colors corresponding to their sequence expression form in the sequence panel. Ligands (the inhibitor in this structure) are displayed in spacefill and colored in CPK mode.
Figure 2.An example of using seeMotif by combining five motifs from both computational tools and existing motif databases. This figure comprises five parts: (1) motifs; (2) position mappings between the motifs and the reference sequence; (3) reference sequence with matched positions highlighted; (4) a snapshot of the structure panel; and (5) the motif selection panel in the intermediate page. In this example, motifs are plotted on the PDB structure 1SCJ. Protein chains are shown in strands style, where each chain has its own color. The residues matched by any of the motifs are illustrated as sticks with distinct colors corresponding to their sequence expression form in the sequence panel. CA ions are displayed in spacefill.