Literature DB >> 29931149

SpliceRover: interpretable convolutional neural networks for improved splice site prediction.

Jasper Zuallaert1,2, Fréderic Godin2, Mijung Kim1,2, Arne Soete3,4, Yvan Saeys4,5, Wesley De Neve1,2.   

Abstract

Motivation: During the last decade, improvements in high-throughput sequencing have generated a wealth of genomic data. Functionally interpreting these sequences and finding the biological signals that are hallmarks of gene function and regulation is currently mostly done using automated genome annotation platforms, which mainly rely on integrated machine learning frameworks to identify different functional sites of interest, including splice sites. Splicing is an essential step in the gene regulation process, and the correct identification of splice sites is a major cornerstone in a genome annotation system.
Results: In this paper, we present SpliceRover, a predictive deep learning approach that outperforms the state-of-the-art in splice site prediction. SpliceRover uses convolutional neural networks (CNNs), which have been shown to obtain cutting edge performance on a wide variety of prediction tasks. We adapted this approach to deal with genomic sequence inputs, and show it consistently outperforms already existing approaches, with relative improvements in prediction effectiveness of up to 80.9% when measured in terms of false discovery rate. However, a major criticism of CNNs concerns their 'black box' nature, as mechanisms to obtain insight into their reasoning processes are limited. To facilitate interpretability of the SpliceRover models, we introduce an approach to visualize the biologically relevant information learnt. We show that our visualization approach is able to recover features known to be important for splice site prediction (binding motifs around the splice site, presence of polypyrimidine tracts and branch points), as well as reveal new features (e.g. several types of exclusion patterns near splice sites). Availability and implementation: SpliceRover is available as a web service. The prediction tool and instructions can be found at http://bioit2.irc.ugent.be/splicerover/. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2018        PMID: 29931149     DOI: 10.1093/bioinformatics/bty497

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  23 in total

1.  NmSEER V2.0: a prediction tool for 2'-O-methylation sites based on random forest and multi-encoding combination.

Authors:  Yiran Zhou; Qinghua Cui; Yuan Zhou
Journal:  BMC Bioinformatics       Date:  2019-12-24       Impact factor: 3.169

2.  Explainable artificial intelligence models using real-world electronic health record data: a systematic scoping review.

Authors:  Seyedeh Neelufar Payrovnaziri; Zhaoyi Chen; Pablo Rengifo-Moreno; Tim Miller; Jiang Bian; Jonathan H Chen; Xiuwen Liu; Zhe He
Journal:  J Am Med Inform Assoc       Date:  2020-07-01       Impact factor: 4.497

3.  Noncanonical Splice Site and Deep Intronic FRMD7 Variants Activate Cryptic Exons in X-linked Infantile Nystagmus.

Authors:  Junwon Lee; Han Jeong; Dongju Won; Saeam Shin; Seung-Tae Lee; Jong Rak Choi; Suk Ho Byeon; Helen J Kuht; Mervyn G Thomas; Jinu Han
Journal:  Transl Vis Sci Technol       Date:  2022-06-01       Impact factor: 3.048

Review 4.  Decoding disease: from genomes to networks to phenotypes.

Authors:  Aaron K Wong; Rachel S G Sealfon; Chandra L Theesfeld; Olga G Troyanskaya
Journal:  Nat Rev Genet       Date:  2021-08-02       Impact factor: 53.242

5.  Explainable deep neural networks for novel viral genome prediction.

Authors:  Chandra Mohan Dasari; Raju Bhukya
Journal:  Appl Intell (Dordr)       Date:  2021-06-25       Impact factor: 5.019

6.  Splice2Deep: An ensemble of deep convolutional neural networks for improved splice site prediction in genomic DNA.

Authors:  Somayah Albaradei; Arturo Magana-Mora; Maha Thafar; Mahmut Uludag; Vladimir B Bajic; Takashi Gojobori; Magbubah Essack; Boris R Jankovic
Journal:  Gene X       Date:  2020-05-13

7.  Nonsense-associated altered splicing of MAP3K1 in two siblings with 46,XY disorders of sex development.

Authors:  Maki Igarashi; Yohei Masunaga; Yuichi Hasegawa; Kenichi Kinjo; Mami Miyado; Hirotomo Saitsu; Yuko Kato-Fukui; Reiko Horikawa; Yomiko Okubo; Tsutomu Ogata; Maki Fukami
Journal:  Sci Rep       Date:  2020-10-15       Impact factor: 4.379

8.  Mining influential genes based on deep learning.

Authors:  Lingpeng Kong; Yuanyuan Chen; Fengjiao Xu; Mingmin Xu; Zutan Li; Jingya Fang; Liangyun Zhang; Cong Pian
Journal:  BMC Bioinformatics       Date:  2021-01-22       Impact factor: 3.169

Review 9.  Learning the Regulatory Code of Gene Expression.

Authors:  Jan Zrimec; Filip Buric; Mariia Kokina; Victor Garcia; Aleksej Zelezniak
Journal:  Front Mol Biosci       Date:  2021-06-10

10.  SpliceFinder: ab initio prediction of splice sites using convolutional neural network.

Authors:  Ruohan Wang; Zishuai Wang; Jianping Wang; Shuaicheng Li
Journal:  BMC Bioinformatics       Date:  2019-12-27       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.