| Literature DB >> 28077065 |
Zhi-Ping Liu1, Shutang Liu1, Ruitang Chen2, Xiaopeng Huang3,4,5, Ling-Yun Wu6,7,8.
Abstract
BACKGROUND: Many critical biological processes are strongly related to protein-RNA interactions. Revealing the protein structure motifs for RNA-binding will provide valuable information for deciphering protein-RNA recognition mechanisms and benefit complementary structural design in bioengineering. RNA-binding events often take place at pockets on protein surfaces. The structural classification of local binding pockets determines the major patterns of RNA recognition.Entities:
Keywords: Local structure classification; Network clustering; RNA-binding pocket; Structural alignment; Structure motif
Mesh:
Substances:
Year: 2017 PMID: 28077065 PMCID: PMC5225598 DOI: 10.1186/s12859-016-1410-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The framework for identifying structural RNA-binding motifs via pocket similarity network. 1 The RNA-binding pockets in proteins are extracted and collected from RBPs. 2 The pocket similarity network is constructed based on the similarities among these pockets calculated by our structure alignment algorithm SAMO. 3 The RNA-binding pockets are clustered into small groups with similar structures by decomposing the pocket similarity network into subnetwork modules via a community detection method. 4 The RNA-binding structure motifs and sequence patterns underlying the similar pocket groups are identified and analyzed
Fig. 2The global view of pocket similarity and the identified pocket groups in the RNA-binding pocket similarity network. aThe heatmap of the structure similarities of RNA-binding pockets on the protein surface. b The pocket similarity network with different colored and numbered groups. A 3D pocket structure is displayed for a representative pocket of each of six major groups
Fig. 3The sequence motif logos of six pocket groups and the ratios of active RNA-binding residues in each corresponding position. The sequence motif logos and bar graphs are drawn based on the results of the multiple structure alignment shown in Additional file 1: Figure S1
Fig. 4The corresponding classifications between protein groups and pocket groups. The RBP groups in the frames were clustered based on a protein similarity network, similarly to the pocket groups. When a pocket is contained in a protein, it is placed on the same row to the right of the corresponding protein. The pockets in the same pocket group are shown in the same color.
Functional analysis of the GO molecular functions of the six pocket groups. ‘Group’ is the pocket group ID. ‘Term’ refers to the GO molecular functions with their descriptions. Some pockets in the group are listed as ‘Representative pockets’. ‘P-value’ is the enrichment significance. Note that the functions of the parent proteins are used to implement the calculations because there are no GO annotations of these pockets thus far. For each term, ‘Count’ refers to the number of proteins containing the GO term in the ‘Population total’ background RBPs. ‘List total’ refers to the number of proteins in one group. ‘Population hits’ is the number of proteins annotated with the term. (In DAVID, there are slightly different numbers in ‘List total’ and ‘Population total’ for one group when calculating a specific GO term). For conciseness, we list only the top 5 terms in each group; the complete tables are available from our website
| Group | Term | Count | List total | Population hits | Population total |
| Representative pockets |
|---|---|---|---|---|---|---|---|
| Group 1 | GO:0046914 ~ transition metal ion binding | 10 | 32 | 17 | 123 | 0.032977 | 1U0B:40:B, 1UVL:183:A, 2BH2:32:A, 2IPY:137:A, 3EPH:130:A |
| GO:0043167 ~ ion binding | 11 | 30 | 27 | 119 | 0.071152 | 1U0B:40:B, 1UVL:183:A, 2BH2:32:A, 3EPH:130:A | |
| GO:0046872 ~ metal ion binding | 11 | 30 | 27 | 119 | 0.071152 | 1U0B:40:B, 1UVL:183:A, 2BH2:32:A, 3EPH:130:A | |
| GO:0043169 ~ cation binding | 11 | 30 | 27 | 119 | 0.071152 | 1U0B:40:B, 1UVL:183:A, 2BH2:32:A, 3EPH:130:A | |
| GO:0016779 ~ nucleotidyltransferase activity | 5 | 30 | 24 | 123 | 0.097784 | 1H38:166:A, 1UVL:183:A | |
| Group 2 | GO:0000166 ~ nucleotide binding | 27 | 52 | 44 | 119 | 0.005401 | 1A9N:58:B, 1ASY:38:A, 1B23:24:P, 1C0A:13:A, 1CVJ:57:A, 1QTQ:16:A, 1UVL:129:A, 2AKE:13:A, 2F8K:8:A, 3EPH:97:A, 3MOJ:8:B, 3RW6:56:A |
| GO:0032559 ~ adenyl ribonucleotide binding | 20 | 52 | 30 | 119 | 0.008157 | 1ASY:38:A, 1C0A:13:A, 1QTQ:16:A, 2AKE:13:A, 3EPH:97:A, 3MOJ:8:B | |
| GO:0005524 ~ ATP binding | 20 | 52 | 30 | 119 | 0.008157 | 1ASY:38:A, 1C0A:13:A, 1QTQ:16:A, 2AKE:13:A, 3EPH:97:A, 3MOJ:8:B | |
| GO:0004812 ~ aminoacyl-tRNA ligase activity | 11 | 52 | 15 | 119 | 0.043756 | 1ASY:38:A, 1C0A:13:A, 1QTQ:16:A, 2AKE:13:A | |
| GO:0046914 ~ transition metal ion binding | 14 | 52 | 21 | 119 | 0.045022 | 1UVL:129:A, 1ZBL:22:A, 2B3J:46:A, 3EPH:97:A | |
| Group 3 | GO:0030554 ~ adenyl nucleotide binding | 18 | 51 | 30 | 119 | 0.04838 | 1F7U:34:A, 1QTQ:25:A, 3EPH:112:A |
| GO:0001882 ~ nucleoside binding | 18 | 51 | 30 | 119 | 0.04838 | 1F7U:34:A, 1QTQ:25:A,2A8V:16:B3EPH:112:A | |
| GO:0001883 ~ purine nucleoside binding | 18 | 51 | 30 | 119 | 0.04838 | 1F7U:34:A, 1QTQ:25:A,2A8V:16:B3EPH:112:A | |
| GO:0032553 ~ ribonucleotide binding | 20 | 51 | 35 | 119 | 0.061405 | 1F7U:34:A, 2A8V:16:B, 3EPH:112:A | |
| GO:0032555 ~ purine ribonucleotide binding | 20 | 51 | 35 | 119 | 0.061405 | 1F7U:34:A, 2A8V:16:B, 3EPH:112:A | |
| Group 4 | GO:0046914 ~ transition metal ion binding | 6 | 14 | 21 | 119 | 0.052374 | 1U0B:78:B, 1UVL:73:A, 2BH2:89:A, 3EPH:50:A |
| GO:0043167 ~ ion binding | 6 | 14 | 27 | 119 | 0.13911 | 1UVL:73:A, 2BH2:89:A | |
| GO:0043169 ~ cation binding | 6 | 14 | 27 | 119 | 0.13911 | 1UVL:73:A, 2BH2:89:A | |
| GO:0046872 ~ metal ion binding | 6 | 14 | 27 | 119 | 0.13911 | 1UVL:73:A, 2BH2:89:A | |
| GO:0000166 ~ nucleotide binding | 8 | 14 | 44 | 119 | 0.151441 | 1CVJ:135:A, 1 N78:58:A, 2A8V:9:B | |
| Group 5 | GO:0016876 ~ ligase activity, forming aminoacyl-tRNA and related compounds | 9 | 42 | 15 | 119 | 0.08969 | 1C0A:17:A, 1EIY:142:B, 1J1U:24:A,2AKE:32:A |
| GO:0004812 ~ aminoacyl-tRNA ligase activity | 9 | 42 | 15 | 119 | 0.08969 | 1C0A:17:A, 1EIY:142:B, 1J1U:24:A, 2AKE:32:A | |
| GO:0046872 ~ metal ion binding | 12 | 42 | 27 | 119 | 0.287759 | 1EIY:142:B, 1YTU:82:A | |
| GO:0032553 ~ ribonucleotide binding | 14 | 42 | 35 | 119 | 0.422562 | 1C0A:17:A, 1EIY:142:B, 1J1U:24:A | |
| GO:0030554 ~ adenyl nucleotide binding | 12 | 42 | 30 | 119 | 0.46644 | 1C0A:17:A, 1EIY:142:B, 1J1U:24:A | |
| Group 6 | GO:0046914 ~ transition metal ion binding | 10 | 41 | 21 | 119 | 0.229275 | 1U0B:49:B, 1UVL:243:A, 2BH2:56:A, 2IPY:165:A, 3EPH:116:A |
| GO:0000166 ~ nucleotide binding | 18 | 41 | 44 | 119 | 0.245078 | 1A9N:73:B, 1CVJ:120:A | |
| GO:0008173 ~ RNA methyltransferase activity | 4 | 41 | 6 | 119 | 0.32238 | 2BH2:56:A, 3BT7:93:A | |
| GO:0046872 ~ metal ion binding | 11 | 41 | 27 | 119 | 0.416938 | 1YTU:76:A | |
| GO:0043167 ~ ion binding | 11 | 41 | 27 | 119 | 0.416938 | 1YTU:76:A |
Fig. 5Details of an example local structure motif in RBPs. a The multiple sequence alignment of five RBPs with the locations of RNA-binding pockets and residues. b The similarities among global protein structures, sequence-based domains and local pockets. Significant similarity is represented by an edge in different colors corresponding to different levels, i.e., green for pocket, yellow for domain and red for global structure. Some proteins contain similar domains, while others do not. All five proteins contain similar RNA-binding pockets serving as local structure motifs, which determine the similar RNA-binding functionality