| Literature DB >> 20041294 |
Kao Lin1, Ziliang Qian, Lin Lu, Lingyi Lu, Lihui Lai, Jieyi Gu, Zhenbing Zeng, Haipeng Li, Yudong Cai.
Abstract
We used a machine learning method, the nearest neighbor algorithm (NNA), to learn the relationship between miRNAs and their target proteins, generating a predictor which can then judge whether a new miRNA-target pair is true or not. We acquired 198 positive (true) miRNA-target pairs from Tarbase and the literature, and generated 4,888 negative (false) pairs through random combination. A 0/1 system and the frequencies of single nucleotides and di-nucleotides were used to encode miRNAs into vectors while various physicochemical parameters were used to encode the targets. The NNA was then applied, learning from these data to produce a predictor. We implemented minimum redundancy maximum relevance (mRMR) and properties forward selection (PFS) to reduce the redundancy of our encoding system, obtaining 91 most efficient properties. Finally, via the Jackknife cross-validation test, we got a positive accuracy of 69.2% and an overall accuracy of 96.0% with all the 253 properties. Besides, we got a positive accuracy of 83.8% and an overall accuracy of 97.2% with the 91 most efficient properties. A web-server for predictions is also made available at http://app3.biosino.org:8080/miRTP/index.jsp.Entities:
Mesh:
Substances:
Year: 2009 PMID: 20041294 DOI: 10.1007/s11030-009-9216-y
Source DB: PubMed Journal: Mol Divers ISSN: 1381-1991 Impact factor: 2.943