| Literature DB >> 28476023 |
Wang-Ren Qiu1,2,3, Shi-Yu Jiang2, Zhao-Chun Xu2, Xuan Xiao2,3, Kuo-Chen Chou3,4,5.
Abstract
Occurring at <span class="Chemical">cytosine (C) of RNA, <span class="Chemical">5-methylcytosine (m5C) is an important post-transcriptional modification (PTCM). The modification plays significant roles in biological processes by regulating RNA metabolism in both eukaryotes and prokaryotes. It may also, however, cause cancers and other major diseases. Given an uncharacterized RNA sequence that contains many C residues, can we identify which one of them can be of m5C modification, and which one cannot? It is no doubt a crucial problem, particularly with the explosive growth of RNA sequences in the postgenomic age. Unfortunately, so far no user-friendly web-server whatsoever has been developed to address such a problem. To meet the increasingly high demand from most experimental scientists working in the area of drug development, we have developed a new predictor called iRNAm5C-PseDNC by incorporating ten types of physical-chemical properties into pseudo dinucleotide composition via the auto/cross-covariance approach. Rigorous jackknife tests show that its anticipated accuracy is quite high. For most experimental scientists' convenience, a user-friendly web-server for the predictor has been provided at http://www.jci-bioinfo.cn/iRNAm5C-PseDNC along with a step-by-step user guide, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved. It has not escaped our notice that the approach presented here can also be used to deal with many other problems in genome analysis.Entities:
Keywords: RNA 5-methylcytosine sites; auto/cross-covariance; physical-chemical property matrix; pseudo dinucleotide composition; web-server
Mesh:
Substances:
Year: 2017 PMID: 28476023 PMCID: PMC5522291 DOI: 10.18632/oncotarget.17104
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Schematic drawing to show the m5C modification in RNA: an important post-transcriptional modification (PTCM) in RNA [4, 5]
During the modification process, a methyl group is attached to the 5th atom of the 6-atom ring. SAM and SAH are the abbreviations of S-adenosylmethionine and S-adenosylhomocysteine, respectively. The former is the source of the methyl group; while the latter, the byproduct.
A cohort comparison with some existing web-server predictors for different purposes
| Predictor's name | Purpose | Acca | MCCa | Sna | Spa |
|---|---|---|---|---|---|
| iRSpot-PseDNCb | DNA recombination spot | 0.8204 | 0.6380 | 0.7306 | 0.8949 |
| iSNO-PseAACc | Cysteine S-nitrosylation site | 0.6762 | 0.3515 | 0.6701 | 0.6815 |
| iPro54-PseKNCd | Sigma-54 promoter | 0.8043 | 0.6101 | 0.7702 | 0.8385 |
| iRSpot-TNCPseAACe | DNA recombination spot | 0.8372 | 0.6710 | 0.8714 | 0.7959 |
| iNitro-Tyrf | Nitrotyrosine site | 0.8452 | 0.4905 | 0.8176 | 0.8598 |
| iRNAm5C-PseDNCg | RNA 5-methylcytosine site | 0.9237 | 0.7935 | 0.6989 | 0.9986 |
a See Eq.13 for the definition.
b See ref. [12].
c See ref. [13].
d See ref. [14].
e See ref. [15].
f See ref. [16].
g The web-server predictor developed in this paper.
Figure 2A graphical illustration to show the proposed predictor's performances via the ROC (receiver operating characteristic) curves [29, 30]
The area under the ROC curve is called AUC (area under the curve). The greater the AUC value is, the better the performance will be. See the text for further explanation.