Literature DB >> 29668310

Human Splice-Site Prediction with Deep Neural Networks.

Tatsuhiko Naito1.   

Abstract

Accurate splice-site prediction is essential to delineate gene structures from sequence data. Several computational techniques have been applied to create a system to predict canonical splice sites. For classification tasks, deep neural networks (DNNs) have achieved record-breaking results and often outperformed other supervised learning techniques. In this study, a new method of splice-site prediction using DNNs was proposed. The proposed system receives an input sequence data and returns an answer as to whether it is splice site. The length of input is 140 nucleotides, with the consensus sequence (i.e., "GT" and "AG" for the donor and acceptor sites, respectively) in the middle. Each input sequence model is applied to the pretrained DNN model that determines the probability that an input is a splice site. The model consists of convolutional layers and bidirectional long short-term memory network layers. The pretraining and validation were conducted using the data set tested in previously reported methods. The performance evaluation results showed that the proposed method can outperform the previous methods. In addition, the pattern learned by the DNNs was visualized as position frequency matrices (PFMs). Some of PFMs were very similar to the consensus sequence. The trained DNN model and the brief source code for the prediction system are uploaded. Further improvement will be achieved following the further development of DNNs.

Entities:  

Keywords:  deep learning; deep neural networks; splice-site prediction; splicing

Mesh:

Substances:

Year:  2018        PMID: 29668310     DOI: 10.1089/cmb.2018.0041

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  4 in total

1.  Predicting the impact of single nucleotide variants on splicing via sequence-based deep neural networks and genomic features.

Authors:  Tatsuhiko Naito
Journal:  Hum Mutat       Date:  2019-06-23       Impact factor: 4.878

2.  Assessing predictions of the impact of variants on splicing in CAGI5.

Authors:  Stephen M Mount; Žiga Avsec; Liran Carmel; Rita Casadio; Muhammed Hasan Çelik; Ken Chen; Jun Cheng; Noa E Cohen; William G Fairbrother; Tzila Fenesh; Julien Gagneur; Valer Gotea; Tamar Holzer; Chiao-Feng Lin; Pier Luigi Martelli; Tatsuhiko Naito; Thi Yen Duong Nguyen; Castrense Savojardo; Ron Unger; Robert Wang; Yuedong Yang; Huiying Zhao
Journal:  Hum Mutat       Date:  2019-08-19       Impact factor: 4.878

3.  Oligonucleotide correction of an intronic TIMMDC1 variant in cells of patients with severe neurodegenerative disorder.

Authors:  Raman Kumar; Mark A Corbett; Nicholas J C Smith; Daniella H Hock; Zoya Kikhtyak; Liana N Semcesen; Atsushi Morimoto; Sangmoon Lee; David A Stroud; Joseph G Gleeson; Eric A Haan; Jozef Gecz
Journal:  NPJ Genom Med       Date:  2022-01-28       Impact factor: 6.083

Review 4.  Principles and correction of 5'-splice site selection.

Authors:  Florian Malard; Cameron D Mackereth; Sébastien Campagne
Journal:  RNA Biol       Date:  2022-01       Impact factor: 4.766

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.