Literature DB >> 31681416

CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks.

Huiqing Wang1, Yue Ma1, Chunlin Dong2, Chun Li1, Jingjing Wang1, Dan Liu1.   

Abstract

MicroRNAs (miRNAs) are the major class of gene-regulating molecules that bind mRNAs. They function mainly as translational repressors in mammals. Therefore, how to identify miRNAs is one of the most important problems in medical treatment. Many known pre-miRNAs have a hairpin ring structure containing more structural features, and it is difficult to identify mature miRNAs because of their short length. Therefore, most research focuses on the identification of pre-miRNAs. Most computational models rely on manual feature extraction to identify pre-miRNAs and do not consider the sequential and spatial characteristics of pre-miRNAs, resulting in a loss of information. As the number of unidentified pre-miRNAs is far greater than that of known pre-miRNAs, there is a dataset imbalance problem, which leads to a degradation of the performance of pre-miRNA identification methods. In order to overcome the limitations of existing methods, we propose a pre-miRNA identification algorithm based on a cascaded CNN-LSTM framework, called CL-PMI. We used a convolutional neural network to automatically extract features and obtain pre-miRNA spatial information. We also employed long short-term memory (LSTM) to capture time characteristics of pre-miRNAs and improve attention mechanisms for long-term dependence modeling. Focal loss was used to improve the dataset imbalance. Compared with existing methods, CL-PMI achieved better performance on all datasets. The results demonstrate that this method can effectively identify pre-miRNAs by simultaneously considering their spatial and sequential information, as well as dealing with imbalance in the datasets.
Copyright © 2019 Wang, Ma, Dong, Li, Wang and Liu.

Entities:  

Keywords:  convolutional neural network; deep learning; imbalanced learning; long short-term memory network; pre-miRNA identification

Year:  2019        PMID: 31681416      PMCID: PMC6798641          DOI: 10.3389/fgene.2019.00967

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


  30 in total

1.  Identification of virus-encoded microRNAs.

Authors:  Sébastien Pfeffer; Mihaela Zavolan; Friedrich A Grässer; Minchen Chien; James J Russo; Jingyue Ju; Bino John; Anton J Enright; Debora Marks; Chris Sander; Thomas Tuschl
Journal:  Science       Date:  2004-04-30       Impact factor: 47.728

2.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

Review 3.  MicroRNAS and their regulatory roles in plants.

Authors:  Matthew W Jones-Rhoades; David P Bartel; Bonnie Bartel
Journal:  Annu Rev Plant Biol       Date:  2006       Impact factor: 26.379

Review 4.  Drug target miRNAs: chances and challenges.

Authors:  Marco F Schmidt
Journal:  Trends Biotechnol       Date:  2014-10-08       Impact factor: 19.536

5.  Human microRNA genes are frequently located at fragile sites and genomic regions involved in cancers.

Authors:  George Adrian Calin; Cinzia Sevignani; Calin Dan Dumitru; Terry Hyslop; Evan Noch; Sai Yendamuri; Masayoshi Shimizu; Sashi Rattan; Florencia Bullrich; Massimo Negrini; Carlo M Croce
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-18       Impact factor: 11.205

Review 6.  MicroRNA and cancer.

Authors:  Martin D Jansson; Anders H Lund
Journal:  Mol Oncol       Date:  2012-10-09       Impact factor: 6.603

7.  Prediction of novel precursor miRNAs using a context-sensitive hidden Markov model (CSHMM).

Authors:  Sumeet Agarwal; Candida Vaz; Alok Bhattacharya; Ashwin Srinivasan
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

8.  fRNAdb: a platform for mining/annotating functional RNA candidates from non-coding RNA sequences.

Authors:  Taishin Kin; Kouichirou Yamada; Goro Terai; Hiroaki Okida; Yasuhiko Yoshinari; Yukiteru Ono; Aya Kojima; Yuki Kimura; Takashi Komori; Kiyoshi Asai
Journal:  Nucleic Acids Res       Date:  2006-11-11       Impact factor: 16.971

9.  DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences.

Authors:  Daniel Quang; Xiaohui Xie
Journal:  Nucleic Acids Res       Date:  2016-04-15       Impact factor: 16.971

10.  MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features.

Authors:  Peng Jiang; Haonan Wu; Wenkai Wang; Wei Ma; Xiao Sun; Zuhong Lu
Journal:  Nucleic Acids Res       Date:  2007-06-06       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.