| Literature DB >> 25374612 |
Hengwu Li1, Daming Zhu2, Caiming Zhang2, Huijian Han3, Keith A Crandall4.
Abstract
BACKGROUND: With the continuous discovery of novel RNA molecules with key cellular functions and of novel pathways and interaction networks, the need for structural information of RNA is still increasing. In order to predict structure of long RNA and understand its natural folding mechanism, exploring the characteristic of RNA structure is an important issue.Entities:
Year: 2014 PMID: 25374612 PMCID: PMC4202180 DOI: 10.1186/1753-6561-8-S6-S3
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Structures of different domains and sub-domains. (a) Structure of three domains. (b) Structure of four domains. (c) Structure of five domains. (d) Structure of six domains. (e) Structure of two parallel domains formed by helixes. (f) Structure of two parallel domains formed by pseudokonts. (g) Structure of one domain with no sub-domains. (h) Structure of one domain with one sub-domain. (i) Structure of one domain with multiple sub-domains.
Figure 2Distribution of ratios for synthetic RNA with two domains. (a) For all validated by NMR or X-Ray, non-fragment and non-redundant 48 synthetic RNA sequences with two domains from RNA STRAND, the length ratio of 3'-end of domains to its sequence is computed and the summarization is shown. The first ratio centres on 0.5, and the second ratio is 1.0. (b) The x-axis represents the sequence, and the y-axis represents length ratio of the 3'-end of domain to the sequence.
Distribution of domains for synthetic RNA with more than two domains
| Sequences | L | D 1 | D 2 | D 3 | D 4 | D 5 | D 6 | R 1 | R 2 | R 3 | R4 | R 5 | R 6 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PDB_00195 | 84 | 1-28 | 29-56 | 57-84 | 0.33 | 0.66 | 1.0 | ||||||
| PDB_00262 | 48 | 1-16 | 17-32 | 33-48 | 0.33 | 0.66 | 1.0 | ||||||
| PDB_00754 | 55 | 1-17 | 18-37 | 38-54 | 0.32 | 0.69 | 1.0 | ||||||
| PDB_01250 | 156 | 15-52 | 53-104 | 105-156 | 0.32 | 0.69 | 1.0 | ||||||
| PDB_01060 | 64 | 17-32 | 33-48 | 49-64 | 0.5 | 0.75 | 1.0 | ||||||
| PDB_00175 | 96 | 1-24 | 25-48 | 49-64 | 65-96 | 0.25 | 0.5 | 0.75 | 1.0 | ||||
| PDB_00873 | 96 | 1-24 | 25-48 | 49-64 | 65-96 | 0.25 | 0.5 | 0.75 | 1.0 | ||||
| PDB_00447 | 120 | 1-30 | 41-60 | 71-90 | 101-120 | 0.25 | 0.5 | 0.75 | 1.0 | ||||
| PDB_00340 | 140 | 1-35 | 36-70 | 71-105 | 106-140 | 0.25 | 0.5 | 0.75 | 1.0 | ||||
| PDB_01061 | 80 | 1-16 | 33-48 | 49-64 | 65-80 | 0.2 | 0.4 | 0.6 | 0.8 | 1.0 | |||
| PDB_00370 | 175 | 1-35 | 36-70 | 71-105 | 106-140 | 141-175 | 0.2 | 0.4 | 0.6 | 0.8 | 1.0 | ||
| PDB_01249 | 150 | 1-25 | 26-50 | 51-75 | 76-100 | 101-125 | 126-150 | 0.17 | 0.33 | 0.5 | 0.67 | 0.83 | 1.0 |
L is the length of the sequence, D1-D6 are the domains of the sequence, R1-R6 are the ratios of 3'-end of domains to the length of sequence. The domain is expressed as the 5'-end and 3'-end.
Distribution of domains for synthetic RNA with two domains
| sequences | L | H1 | H2 | D1 | D2 | R 1 | R2 | sequences | L | H1 | H2 | D1 | D2 | R 1 | R2 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PDB_00483 | 58 | 1-29 | 30-58 | 1-29 | 30-58 | 0.5 | 1.0 | PDB_00123 | 92 | 1-46 | 49-92 | 1-46 | 47-92 | 0.5 | 1.0 |
| PDB_00816 | 46 | 1-23 | 24-46 | 1-23 | 24-46 | 0.5 | 1.0 | PDB_00196 | 57 | 1-18 | 20-57 | 1-19 | 20-57 | 0.33 | 1.0 |
| PDB_00924 | 86 | 1-43 | 44-86 | 1-43 | 44-86 | 0.5 | 1.0 | PDB_00236 | 48 | 1-24 | 25-48 | 1-24 | 25-48 | 0.5 | 1.0 |
| PDB_00942 | 31 | 1-15 | 17-31 | 1-15 | 16-31 | 0.48 | 1.0 | PDB_00254 | 24 | 1-12 | 13-24 | 1-12 | 13-24 | 0.5 | 1.0 |
| PDB_00963 | 28 | 1-15 | 16-28 | 1-15 | 16-28 | 0.54 | 1.0 | PDB_00264 | 48 | 1-24 | 26-48 | 1-24 | 25-48 | 0.5 | 1.0 |
| PDB_00964 | 36 | 1-19 | 22-34 | 1-19 | 20-36 | 0.53 | 1.0 | PDB_00709 | 40 | 1-16 | 17-32 | 1-16 | 17-40 | 0.4 | 1.0 |
| PDB_00965 | 21 | 2-10 | 13-21 | 1-11 | 12-21 | 0.52 | 1.0 | PDB_00710 | 40 | 1-16 | 17-32 | 1-16 | 17-40 | 0.4 | 1.0 |
| PDB_00966 | 23 | 2-10 | 13-23 | 1-12 | 13-23 | 0.52 | 1.0 | PDB_00868 | 52 | 1-19 | 21-39 | 1-20 | 21-52 | 0.39 | 1.0 |
| PDB_00970 | 34 | 1-17 | 20-32 | 1-17 | 18-34 | 0.5 | 1.0 | PDB_00663 | 44 | 2-19 | 26-41 | 1-22 | 23-44 | 0.5 | 1.0 |
| PDB_00971 | 30 | 1-17 | 18-30 | 1-17 | 18-30 | 0.57 | 1.0 | PDB_00682 | 32 | 1-16 | 17-32 | 1-16 | 17-32 | 0.5 | 1.0 |
| PDB_00973 | 27 | 2-14 | 15-27 | 1-14 | 15-27 | 0.52 | 1.0 | PDB_00688 | 40 | 1-20 | 21-40 | 1-20 | 21-40 | 0.5 | 1.0 |
| PDB_00974 | 28 | 1-15 | 16-28 | 1-15 | 16-28 | 0.54 | 1.0 | PDB_00724 | 66 | 1-32 | 34-65 | 1-33 | 34-66 | 0.5 | 1.0 |
| PDB_00979 | 154 | 3-77 | 80-154 | 1-77 | 78-154 | 0.5 | 1.0 | PDB_00866 | 19 | 2-9 | 11-19 | 1-10 | 11-19 | 0.53 | 1.0 |
| PDB_01035 | 56 | 1-26 | 27-56 | 1-26 | 27-56 | 0.46 | 1.0 | PDB_00871 | 48 | 1-24 | 25-48 | 1-24 | 25-48 | 0.5 | 1.0 |
| PDB_01130 | 60 | 1-30 | 31-60 | 1-30 | 31-60 | 0.5 | 1.0 | PDB_00874 | 44 | 1-20 | 23-42 | 1-22 | 23-44 | 0.5 | 1.0 |
| PDB_01135 | 29 | 1-15 | 17-29 | 1-15 | 17-29 | 0.52 | 1.0 | PDB_00886 | 92 | 1-46 | 47-92 | 1-46 | 47-92 | 0.5 | 1.0 |
| PDB_01136 | 30 | 2-14 | 17-29 | 2-14 | 15-29 | 0.5 | 1.0 | PDB_00892 | 72 | 1-36 | 48-61 | 1-36 | 37-72 | 0.5 | 1.0 |
| PDB_01138 | 26 | 1-15 | 18-25 | 1-15 | 16-26 | 0.58 | 1.0 | PDB_00929 | 40 | 1-20 | 21-40 | 1-20 | 21-40 | 0.5 | 1.0 |
| PDB_01145 | 40 | 1-20 | 21-40 | 1-20 | 21-40 | 0.5 | 1.0 | PDB_00960 | 29 | 2-13 | 18-29 | 2-15 | 16-29 | 0.52 | 1.0 |
| PDB_01164 | 56 | 1-26 | 27-56 | 1-26 | 27-56 | 0.46 | 1.0 | PDB_01017 | 30 | 2-15 | 17-30 | 2-15 | 16-30 | 0.5 | 1.0 |
| NDB_00010 | 8 | 1-4 | 5-8 | 1-4 | 5-8 | 0.5 | 1.0 | PDB_01019 | 30 | 2-15 | 17-30 | 2-15 | 16-30 | 0.5 | 1.0 |
| NDB_00037 | 8 | 1-4 | 5-8 | 1-4 | 5-8 | 0.5 | 1.0 | PDB_01156 | 88 | 1-43 | 45-87 | 1-44 | 45-88 | 0.5 | 1.0 |
| NDB_00048 | 56 | 1-28 | 29-56 | 1-28 | 29-56 | 0.5 | 1.0 | PDB_01219 | 44 | 1-21 | 23-44 | 1-22 | 23-44 | 0.5 | 1.0 |
| PDB_00104 | 36 | 1-18 | 19-36 | 1-18 | 19-36 | 0.5 | 1.0 | PDB_00945 | 31 | 2-14 | 15-30 | 1-14 | 15-31 | 0.48 | 1.0 |
L is the length of the sequence, H1-H2 are the helixes of the sequence, D1-D2 are the domains of the sequence, R1-R2 are the ratios of 3'-end of domains to the length of sequence. The helix data is the closed base pair in helix or the start and end bases in pseudokont. The domain is expressed as 5'-end and 3'-end.
Distribution of multiple sub-domains for synthetic RNA with one domain
| Sequences | L | H1 | H2 | H3 | SD1 | SD2 | SD3 | D | R 1 | R 2 | R 3 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| PDB_01111 | 75 | 7-38 | 40-68 | 7-38 | 39-68 | 7-68 | 0.52 | 0.48 | |||
| PDB_01112 | 75 | 7-39 | 41-68 | 7-39 | 40-68 | 7-68 | 0.53 | 0.47 | |||
| PDB_01114 | 76 | 7-39 | 41-69 | 7-39 | 41-69 | 7-69 | 0.52 | 0.48 | |||
| PDB_01115 | 74 | 7-36 | 40-66 | 7-36 | 37-67 | 7-67 | 0.49 | 0.51 | |||
| PDB_01116 | 69 | 7-37 | 39-62 | 7-37 | 38-62 | 7-62 | 0.55 | 0.45 | |||
| PDB_00522 | 48 | 8-18 | 19-32 | 34-43 | 6-18 | 19-32 | 33-44 | 6-44 | 0.33 | 0.36 | 0.31 |
| PDB_00750 | 155 | 5-19 | 21-32 | 35-151 | 5-20 | 21-34 | 35-151 | 5-151 | 0.11 | 0.1 | 0.79 |
L is the length of the sequence, H1-H3 are the helixes of the sequence, D is the domain of the sequence, SD1-SD3 are the sub-domains of the domain, R1-R3 are the length ratios of SD1-SD3 to D. The helix data is the closed base pair or the start and end bases in pseudokont. The domain is expressed as 5'-end and 3'-end of the internal section except of closed helix or pseudokont.
Distribution of sub-domains for tRNA with one domain and multiple sub-domains
| Sequences | L | H1 | H2 | H3 | SD1 | SD2 | SD3 | D | R 1 | R 2 | R 3 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| NDB_00051 | 75 | 10-25 | 26-44 | 48-64 | 8-25 | 26-44 | 45-64 | 8-64 | 0.32 | 0.33 | 0.35 |
| PDB_00045 | 76 | 10-25 | 26-44 | 48-65 | 8-25 | 26-45 | 46-65 | 8-65 | 0.31 | 0.34 | 0.34 |
| PDB_00070 | 76 | 10-25 | 26-44 | 48-65 | 8-25 | 26-45 | 46-65 | 8-65 | 0.31 | 0.34 | 0.34 |
| PDB_00095 | 75 | 10-25 | 26-43 | 48-64 | 8-25 | 26-44 | 45-64 | 8-64 | 0.32 | 0.33 | 0.35 |
| PDB_00229 | 75 | 10-24 | 25-43 | 48-64 | 8-24 | 25-44 | 45-64 | 8-64 | 0.30 | 0.35 | 0.35 |
| PDB_00244 | 73 | 10-25 | 26-44 | 48-64 | 8-25 | 26-44 | 45-64 | 8-64 | 0.32 | 0.33 | 0.35 |
| PDB_00259 | 72 | 10-25 | 26-44 | 48-64 | 8-25 | 26-44 | 45-64 | 8-64 | 0.32 | 0.33 | 0.35 |
| PDB_00313 | 74 | 10-24 | 26-42 | 47-63 | 8-25 | 26-44 | 45-63 | 8-63 | 0.32 | 0.34 | 0.34 |
| PDB_00376 | 73 | 9-23 | 24-42 | 46-62 | 7-23 | 24-42 | 43-62 | 7-62 | 0.30 | 0.34 | 0.36 |
| PDB_00426 | 74 | 9-23 | 24-41 | 47-63 | 7-23 | 24-43 | 44-63 | 7-63 | 0.30 | 0.35 | 0.35 |
| PDB_00903 | 76 | 10-25 | 26-44 | 49-65 | 8-25 | 26-45 | 46-65 | 8-65 | 0.31 | 0.34 | 0.34 |
| PDB_00999 | 70 | 10-26 | 27-45 | 50-64 | 5-26 | 27-47 | 48-67 | 5-67 | 0.35 | 0.33 | 0.32 |
L is the length of the sequence, H1-H3 are the helixes of the sequence, D is the domain of the sequence, SD1-SD3 are the sub-domains of the domain, R1-R3 are the length ratios of SD1-SD3 to D. The helix data is the closed base pair or the start and end bases in pseudokont. The domain is expressed as 5'-end and 3'-end of the internal section except of closed helix or pseudokont.
Distribution of domains for tRNA with multiple domains
| Sequence ID | L | N | D1 | D2 | D3 | D4 | D5 | R1 | R2 | R3 | R4 | R5 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PDB_00307 | 150 | 2 | 1-75 | 76-150 | 0.5 | 1.0 | ||||||
| PDB_00421 | 152 | 2 | 1-75 | 76-152 | 0.49 | 1.0 | ||||||
| PDB_00472 | 147 | 2 | 1-73 | 74-147 | 0.5 | 1.0 | ||||||
| PDB_00475 | 148 | 2 | 1-74 | 75-148 | 0.5 | 1.0 | ||||||
| PDB_00593 | 152 | 2 | 1-75 | 76-152 | 0.5 | 1.0 | ||||||
| PDB_00648 | 42 | 2 | 1-20 | 21-42 | 0.48 | 1.0 | ||||||
| PDB_00649 | 39 | 2 | 1-21 | 22-39 | 0.54 | 1.0 | ||||||
| PDB_00681 | 51 | 2 | 1-17 | 18-51 | 0.33 | 1.0 | ||||||
| PDB_00722 | 152 | 2 | 1-76 | 77-152 | 0.5 | 1.0 | ||||||
| PDB_00891 | 44 | 2 | 1-23 | 24-44 | 0.52 | 1.0 | ||||||
| PDB_00904 | 152 | 2 | 1-76 | 77-152 | 0.5 | 1.0 | ||||||
| PDB_00980 | 150 | 2 | 1-75 | 76-150 | 0.5 | 1.0 | ||||||
| PDB_00981 | 150 | 2 | 1-74 | 75-150 | 0.49 | 1.0 | ||||||
| PDB_00994 | 146 | 2 | 1-72 | 73-146 | 0.49 | 1.0 | ||||||
| PDB_01054 | 154 | 2 | 2-77 | 78-154 | 0.5 | 1.0 | ||||||
| PDB_01074 | 75 | 2 | 1-37 | 38-75 | 0.49 | 1.0 | ||||||
| PDB_01162 | 56 | 2 | 1-34 | 35-56 | 0.61 | 1.0 | ||||||
| PDB_00637 | 69 | 3 | 1-26 | 27-46 | 47-69 | 0.38 | 0.67 | 1.0 | ||||
| PDB_00732 | 78 | 3 | 1-25 | 26-51 | 52-78 | 0.32 | 0.65 | 1.0 | ||||
| PDB_00733 | 77 | 3 | 1-26 | 27-51 | 52-77 | 0.34 | 0.66 | 1.0 | ||||
| PDB_00998 | 145 | 4 | 1-26 | 27-49 | 50-73 | 74-145 | 0.18 | 0.34 | 0.5 | 1.0 | ||
| PDB_00398 | 380 | 5 | 1-76 | 77-152 | 153-228 | 229-304 | 305-380 | 0.2 | 0.4 | 0.6 | 0.8 | 1.0 |
| PDB_01000 | 148 | 6 | 1-26 | 27-50 | 51-74 | 75-100 | 101-124 | 0.18 | 0.34 | 0.5 | 0.68 | 0.83 |
| 125-148 | 1.0 |
L is the length of the sequence, N is the number of domains in the sequence, D1-D5 are the domains of the sequence, R1-R5 are the ratios of 3'-end of domains to the length of sequence. The domain is expressed as 5'-end and 3'-end.
Distribution of domains for Other RNA, Viral Phage and Ham Ribozyme with multiple domains
| Sequence ID | Type | L | N | D1 | D2 | D3 | D4 | R1 | R2 | R3 | R4 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| PDB_00308 | Other RNA | 150 | 2 | 1-75 | 76-150 | 0.5 | 1.0 | ||||
| PDB_00358 | Other RNA | 146 | 2 | 1-73 | 74-146 | 0.5 | 1.0 | ||||
| PDB_00419 | Other RNA | 150 | 2 | 1-75 | 76-150 | 0.5 | 1.0 | ||||
| PDB_00804 | Other RNA | 176 | 2 | 1-88 | 89-176 | 0.5 | 1.0 | ||||
| PDB_00967 | Other RNA | 156 | 2 | 1-78 | 79-156 | 0.5 | 1.0 | ||||
| PDB_00983 | Other RNA | 152 | 2 | 1-75 | 76-152 | 0.5 | 1.0 | ||||
| PDB_01274 | Other RNA | 152 | 2 | 1-76 | 77-152 | 0.5 | 1.0 | ||||
| PDB_00626 | Other RNA | 225 | 3 | 1-75 | 76-150 | 151-225 | 0.33 | 0.67 | 1.0 | ||
| PDB_00739 | Other RNA | 228 | 3 | 1-76 | 77-152 | 153-228 | 0.33 | 0.67 | 1.0 | ||
| PDB_01261 | Other RNA | 968 | 3 | 1-484 | 485-726 | 727-968 | 0.5 | 0.75 | 1.0 | ||
| PDB_00985 | Other RNA | 248 | 4 | 1-62 | 63-124 | 125-186 | 187-248 | 0.25 | 0.5 | 0.75 | 1.0 |
| PDB_01161 | Other RNA | 272 | 4 | 1-72 | 73-144 | 145-210 | 211-272 | 0.26 | 0.52 | 0.77 | 1.0 |
| PDB_00743 | Viral Phage | 33 | 2 | 1-17 | 18-33 | 0.52 | 1.0 | ||||
| PDB_00157 | Ham. Ribozyme | 82 | 2 | 3-39 | 44-80 | 1-41 | 42-82 | 0.5 | 1.0 |
L is the length of the sequence, N is the number of domain in the sequence, D1-D4 are the domains of the sequence, R1-R4 are the ratios of 3'-end of domains to the length of sequence. The domain is expressed as 5'-end and 3'-end.
Distribution of domains for Other Ribozyme
| Sequence ID | L | N | D1 | D 2 | D3 | D4 | R1 | R2 | R3 | R4 |
|---|---|---|---|---|---|---|---|---|---|---|
| PDB_00078 | 316 | 1 | 1-115 | 116-151 | 152-273 | 1-311 | 0.36 | 0.48 | 1.0 | |
| PDB_00851 | 98 | 2 | 1-49 | 50-98 | 0.5 | 1.0 | ||||
| PDB_00856 | 98 | 2 | 1-49 | 50-98 | 0.5 | 1.0 | ||||
| PDB_00893 | 61 | 2 | 1-25 | 26-61 | 0.41 | 1.0 | ||||
| PDB_00956 | 61 | 2 | 1-25 | 26-61 | 0.41 | 1.0 | ||||
| PDB_01068 | 142 | 2 | 1-71 | 72-142 | 0.5 | 1.0 | ||||
| PDB_01069 | 141 | 2 | 1-70 | 71-141 | 0.5 | 1.0 | ||||
| PDB_01092 | 143 | 2 | 1-72 | 73-143 | 0.5 | 1.0 | ||||
| PDB_01255 | 159 | 2 | 1-112 | 113-159 | 0.7 | 1.0 | ||||
| PDB_01300 | 142 | 2 | 4-72 | 73-142 | 0.5 | 1.0 | ||||
| PDB_01301 | 141 | 2 | 2-70 | 71-141 | 0.5 | 1.0 | ||||
| PDB_01302 | 139 | 2 | 2-70 | 71-138 | 0.5 | 1.0 | ||||
| PDB_00176 | 96 | 4 | 2-24 | 26-48 | 50-72 | 74-96 | 0.25 | 0.5 | 0.75 | 1.0 |
| PDB_01187 | 142 | 4 | 1-51 | 52-71 | 72-121 | 124-142 | 0.36 | 0.5 | 0.85 | 1.0 |
| PDB_00805 | 968 | 8 | 1-242 | 243-484 | 485-726 | 727-968 | 0.25 | 0.5 | 0.75 | 1.0 |
L is the length of the sequence, N is the number of domain in the sequence, D1-D4 are the domains of the sequence, R1-R4 are the ratios of 3'-end of domains to the length of sequence. The domain is expressed as 5'-end and 3'-end.