| Literature DB >> 25161221 |
Jocelyn Brayet1, Farida Zehraoui2, Laurence Jeanson-Leh2, David Israeli2, Fariza Tahi2.
Abstract
MOTIVATION: Piwi-interacting RNA (piRNA) is the most recently discovered and the least investigated class of Argonaute/Piwi protein-interacting small non-coding RNAs. The piRNAs are mostly known to be involved in protecting the genome from invasive transposable elements. But recent discoveries suggest their involvement in the pathophysiology of diseases, such as cancer. Their identification is therefore an important task, and computational methods are needed. However, the lack of conserved piRNA sequences and structural elements makes this identification challenging and difficult.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25161221 PMCID: PMC4147894 DOI: 10.1093/bioinformatics/btu441
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.A chromosome with telomeric and centromeric regions
Cross-validation results obtained by our method (using different combinations of kernels) and by Zhang et al. method in Human and Drosophila training datasets
| Method | ||||||||
|---|---|---|---|---|---|---|---|---|
| ACC | SP | SE | PPV | ACC | SP | SE | PPV | |
| 0.76 ± 0.03 | 0.75 ± 0.01 | 0.81 ± 0.01 | 0.75 ± 0.02 | 0.67 ± 0.01 | 0.70 ± 0.02 | 0.65 ± 0.01 | 0.66 ± 0.02 | |
| 0.61 ± 0.02 | 0.55 ± 0.02 | 0.72 ± 0.03 | 0.59 ± 0.01 | 0.86 ± 0.02 | 0.88 ± 0.03 | 0.83 ± 0.01 | 0.86 ± 0.02 | |
| 0.74 ± 0.01 | 0.82 ± 0.02 | 0.67 ± 0.03 | 0.80 ± 0.02 | 0.83 ± 0.03 | 0.82 ± 0.01 | 0.83 ± 0.04 | 0.82 ± 0.01 | |
| 0.81 ± 0.03 | 0.82 ± 0.02 | 0.78 ± 0.03 | 0.81 ± 0.02 | 0.87 ± 0.02 | 0.93 ± 0.01 | 0.81 ± 0.03 | 0.91 ± 0.02 | |
| Zhang | 0.58 ± 0.05 | 0.82 ± 0.01 | 0.30 ± 0.04 | 0.63 ± 0.03 | 0.69 ± 0.02 | 0.92 ± 0.01 | 0.45 ± 0.02 | 0.85 ± 0.01 |
Note: ACC, accuracy; SP, specificity; SE, sensitivity; PPV, positive predictive value. In bold: The highest value in each column.
Predictive performance of our method (piRPred) on Human and Drosophila sequences in comparison with the Zhang et al. method
| Method | ||||
|---|---|---|---|---|
| TP | SE | TP | SE | |
| Zhang | 1953 | 0.78 | 1636 | 0.65 |
| Zhang | 849 | 0.34 | 1568 | 0.63 |
Note: TP, true-positive predictions; SE, sensitivity. In bold: The highest value in each column.
Fig. 2.Results obtained by the k-nearest neighbors kernel with different values of k on Human training datasets
Fig. 3.Results obtained by the k-nearest neighbors kernel with different values of k on Drosophila training datasets