| Literature DB >> 24223923 |
Jun-Jie Hong1, Tzong-Yuan Wu, Tsair-Yuan Chang, Chung-Yung Chen.
Abstract
The internal ribosomal entry site (IRES) functions as cap-independent translation initiation sites in eukaryotic cells. IRES elements have been applied as useful tools for bi-cistronic expression vectors. Current RNA structure prediction programs are unable to predict precisely the potential IRES element. We have designed a viral IRES prediction system (VIPS) to perform the IRES secondary structure prediction. In order to obtain better results for the IRES prediction, the VIPS can evaluate and predict for all four different groups of IRESs with a higher accuracy. RNA secondary structure prediction, comparison, and pseudoknot prediction programs were implemented to form the three-stage procedure for the VIPS. The backbone of VIPS includes: the RNAL fold program, aimed to predict local RNA secondary structures by minimum free energy method; the RNA Align program, intended to compare predicted structures; and pknotsRG program, used to calculate the pseudoknot structure. VIPS was evaluated by using UTR database, IRES database and Virus database, and the accuracy rate of VIPS was assessed as 98.53%, 90.80%, 82.36% and 80.41% for IRES groups 1, 2, 3, and 4, respectively. This advance useful search approach for IRES structures will facilitate IRES related studies. The VIPS on-line website service is available at http://140.135.61.250/vips/.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24223923 PMCID: PMC3818432 DOI: 10.1371/journal.pone.0079288
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Flowchart of VIPS.
The FASTA/plain text RNA primary sequence ran by RNAL fold and compared with known IRES structure by RNA Align and pknotsRG programs. The IRES structures are displayed by RnaViz software and alignment results can be edited by any text editor. The eight private programs are pointed beside arrow symbols. The sort codes are shown in Supplementary file (Methods S1).
The accuracy of the predicted IRES elements for IRES group 1, 2, 3 and 4 by VIPS.
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
| Cut-off value | 1.61 | 1.98 | 1.87 | 1.58 |
| Average R score of positive group | 1.90±0.29 | 2.42±0.62 | 2.05±0.34 | 1.68±0.25 | |
| Average R score of negative group | 1.29±0.18 | 1.53±0.07 | 1.53±0.07 | 1.49±0.05 | |
|
| <0.001 | <0.001 | <0.001 | <0.001 | |
| Sensitivity | 87.50% | 80.52% | 64.71% | 56.34% | |
| Specificity | 97.06% | 100% | 100% | 98.85% | |
| Accuracy rate | 92.28% | 90.26% | 82.36% | 77.60% | |
|
| Positive group contains pseudoknot structure | 81.25% | 15.70% | 11.76% | 40.85% |
| Negative group contains pseudoknot structure | 16.18% | 14.70% | 9.52% | 35.94% | |
|
| Sensitivity | 100% | 81.59% | 64.71% | 62.44% |
| Specificity | 97.06% | 100% | 100% | 98.37% | |
| Accuracy rate | 98.53% | 90.8% | 82.36% | 80.41% |
All values are expressed as means ±SEM. P values were calculated by t-test.
Figure 2The distribution of R and pseudoknot parameters from positive and negative group of four IRES groups by VIPS analysis.
The distribution of four IRES groups calculated by R and pseudoknot parameters in positive group and negative group. The R scores and pseudoknot scores were demonstrated according to IRES group 1 (a and e), IRES group 2 (b and f), IRES group 3 (c and g) and IRES group 4 (d and h). Linear discriminant analysis was applied to determine the cut-off line of the R score in each group shown as (a), (b), (c) and (d). Distribution of positive and negative IRES elements that were calculated and summarized from R and pseudoknot values are shown as (e), (f), (g) and (h).
Statistical results of the predicted IRES elements from Rfam, UTR and virus databases by VIPS.
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
|
| 1.96±0.28 (n=16/16) | 2.64±0.53 (n=2526/3096) | 2.28±0.15 (n=11/17) | 1.82±0.20 (n=133/213) |
|
| 100% | 81.59% | 64.47% | 62.44% | |
|
|
| 1.95±0.28 (n=14/16) | 2.61±0.54 (n=2439/3096) | 2.28±0.15 (n=11/17) | 1.84±0.20 (n=120/213) |
|
| 87.50% | 78.78% | 64.47% | 56.34% | |
|
|
| 1.64±0.03 (n=2622) | 2.00±0.02 (n=44) | 0.00±0.00 (n=0) | 1.61±0.02 (n=220) |
|
|
| 1.63±0.02 (n=542) | 0.00±0.00 (n=0) | 0.00±0.00 (n=0) | 1.60±0.02 (n=145) |
|
|
| 1.64±0.06 (n=743) | 2.51±0.57 (n=37) | 2.10±0.12 (n=19) | 1.62±0.05 (n=172) |
|
|
| 1.66±0.11 (n=201) | 2.69±0.60 (n=26) | 2.10±0.12 (n=19) | 1.62±0.06 (n=145) |
All values are expressed as means±SEM.
Top 15 records of VIPS predicted potential IRES elements from human 5’UTR of UTR database.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 165 | NM_152377 | 1-165 | 165 | 1.76|N | 5'UTR in Homo sapiens chromosome 1 open reading frame 87 (C1orf87), mRNA. |
| 337 | NM_001080551 | 169-337 | 169 | 1.73|N | 5'UTR in Homo sapiens chromosome 9 open reading frame 84 (C9orf84), transcript variant 2, mRNA. |
| 265 | NM_002803 | 75-265 | 191 | 1.73|Y | 5'UTR in Homo sapiens proteasome (prosome, macropain) 26S subunit, ATPase, 2 (PSMC2), mRNA. |
| 288 | NM_015239 | 127-285 | 159 | 1.72|Y | 5'UTR in Homo sapiens ATP/GTP binding protein 1 (AGTPBP1), mRNA. |
| 470 | NM_030571 | 251-465 | 215 | 1.72|Y | 5'UTR in Homo sapiens Nedd4 family interacting protein 1 (NDFIP1), mRNA. |
| 205 | NM_001135811 | 28-199 | 172 | 1.72|Y | 5'UTR in Homo sapiens family with sequence similarity 60, member A (FAM60A), transcript variant 1, mRNA. |
| 359 | NM_015454 | 165-359 | 195 | 1.71|Y | 5'UTR in Homo sapiens La ribonucleoprotein domain family, member 7 (LARP7), transcript variant 2, mRNA. |
| 282 | NM_001007022 | 107-278 | 172 | 1.71|N | 5'UTR in Homo sapiens outer dense fiber of sperm tails 2-like (ODF2L), transcript variant 2, mRNA. |
| 177 | NM_001003790 | 1-169 | 169 | 1.71|Y | 5'UTR in Homo sapiens ER lipid raft associated 2 (ERLIN2), transcript variant 2, mRNA. |
| 338 | NM_001076785 | 171-337 | 167 | 1.70|N | 5'UTR in Homo sapiens solute carrier family 7 (cationic amino acid transporter, y+ system), member 6 (SLC7A6), transcript variant 2, mRNA. |
| 215 | NM_001003927 | 10-211 | 202 | 1.70|N | 5'UTR in Homo sapiens ecotropic viral integration site 2A (EVI2A), transcript variant 1, mRNA. |
| 977 | NM_032779 | 808-976 | 169 | 1.70|N | 5'UTR in Homo sapiens coiled-coil domain containing 142 (CCDC142), mRNA. |
| 442 | NM_024650 | 217-436 | 220 | 1.70|Y | 5'UTR in Homo sapiens chromosome 11 open reading frame 80 (C11orf80), mRNA. |
| 537 | NM_024650 | 312-531 | 220 | 1.70|Y | 5'UTR in Homo sapiens chromosome 11 open reading frame 80 (C11orf80), mRNA. |
| 686 | NM_002270 | 500-682 | 183 | 1.70|N | 5'UTR in Homo sapiens transportin 1 (TNPO1), transcript variant 1, mRNA. |
* length of sequence fragments inputted into VIPS.
predicted IRES sequence length.
Top 15 records of VIPS predicted potential IRES elements from viral IRES database.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 416 | AJ277947 | 241-427 | 187 | 2.40|N | Hepatitis GB virus B genomic RNA. |
| 383 | M67463 | 140-363 | 224 | 2.35|Y | Hepatitis C virus subtype 1a, strain H, complete genome. |
| 1040 | M16020 | 678-906 | 229 | 2.34|N | Theiler's murine encephalomyelitis virus (TMEV) RNA polyprotein, complete genome. |
| 192 | NC_003924 | 6047-6214 | 168 | 2.20|N | Cricket paralysis virus (Dicistroviridae) nonstructural polyprotein and structural polyprotein genes, complete genome. |
| 363 | NC_001461 | 173-404 | 232 | 2.17|Y | Bovine viral diarrhea virus 1, complete genome. |
| 461 | AJ133357 | 661-875 | 215 | 2.16|N | Foot-and-mouth disease virus (FMDV) strain C, isolate c-s8c1, genomic RNA. |
| 712 | L43052 | 430-651 | 222 | 2.00|N | Equine rhinitis A virus, genome incomplete at the 5'-end. |
| 748 | DQ060149 | 456-655 | 200 | 1.80|N | Human enterovirus 71 strain pinf7-54A from Taiwan, complete genome. |
| 604 | X02316 | 425-613 | 189 | 1.74|N | Human rhinovirus (HRV) 5'-UTR. |
| 145 | NC_003779 | 6004-6145 | 142 | 1.74|N | Plautia stali intestine virus (Dicistroviridae), complete genome. |
| 742 | K01392 | 404-640 | 237 | 1.73|N | Poliovirus P3/Leon/37 (type 3), complete genome. |
| 390 | NC_005092 | 160-380 | 221 | 1.70|Y | Ectropis obliqua picorna-like virus, complete genome. |
| 750 | AY752946 | 438-650 | 213 | 1.67|N | Human coxsackievirus B3 strain 20, complete genome. |
| 261 | AF033821 | 635-781 | 147 | 1.63|N | Drosophila melanogaster gypsy LTR-transposable element, full-lenght RNA. |
| 330 | AF033821 | 322-516 | 195 | 1.62|N | Drosophila melanogaster gypsy LTR-transposable element, full-lenght RNA. |
* length of sequence fragments inputted into VIPS.
predicted IRES sequence length.