Literature DB >> 28961695

An introduction to deep learning on biological sequence data: examples and solutions.

Vanessa Isabell Jurtz1, Alexander Rosenberg Johansen2, Morten Nielsen1,3, Jose Juan Almagro Armenteros1, Henrik Nielsen1, Casper Kaae Sønderby4, Ole Winther2,4, Søren Kaae Sønderby4.   

Abstract

MOTIVATION: Deep neural network architectures such as convolutional and long short-term memory networks have become increasingly popular as machine learning tools during the recent years. The availability of greater computational resources, more data, new algorithms for training deep models and easy to use libraries for implementation and training of neural networks are the drivers of this development. The use of deep learning has been especially successful in image recognition; and the development of tools, applications and code examples are in most cases centered within this field rather than within biology.
RESULTS: Here, we aim to further the development of deep learning methods within biology by providing application examples and ready to apply and adapt code templates. Given such examples, we illustrate how architectures consisting of convolutional and long short-term memory neural networks can relatively easily be designed and trained to state-of-the-art performance on three biological sequence problems: prediction of subcellular localization, protein secondary structure and the binding of peptides to MHC Class II molecules.
AVAILABILITY AND IMPLEMENTATION: All implementations and datasets are available online to the scientific community at https://github.com/vanessajurtz/lasagne4bio. CONTACT: skaaesonderby@gmail.com. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28961695      PMCID: PMC5870575          DOI: 10.1093/bioinformatics/btx531

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  22 in total

1.  Protein secondary structure prediction based on position-specific scoring matrices.

Authors:  D T Jones
Journal:  J Mol Biol       Date:  1999-09-17       Impact factor: 5.469

Review 2.  Where did the BLOSUM62 alignment score matrix come from?

Authors:  Sean R Eddy
Journal:  Nat Biotechnol       Date:  2004-08       Impact factor: 54.908

Review 3.  MHC class II epitope predictive algorithms.

Authors:  Morten Nielsen; Ole Lund; Søren Buus; Claus Lundegaard
Journal:  Immunology       Date:  2010-04-12       Impact factor: 7.397

4.  MultiLoc: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition.

Authors:  Annette Höglund; Pierre Dönnes; Torsten Blum; Hans-Werner Adolph; Oliver Kohlbacher
Journal:  Bioinformatics       Date:  2006-01-20       Impact factor: 6.937

5.  Locating proteins in the cell using TargetP, SignalP and related tools.

Authors:  Olof Emanuelsson; Søren Brunak; Gunnar von Heijne; Henrik Nielsen
Journal:  Nat Protoc       Date:  2007       Impact factor: 13.491

Review 6.  Antigen presentation by MHC class II molecules: invariant chain function, protein trafficking, and the molecular basis of diverse determinant capture.

Authors:  F Castellino; G Zhong; R N Germain
Journal:  Hum Immunol       Date:  1997-05       Impact factor: 2.850

7.  Automatic Segmentation of MR Brain Images With a Convolutional Neural Network.

Authors:  Pim Moeskops; Max A Viergever; Adrienne M Mendrik; Linda S de Vries; Manon J N L Benders; Ivana Isgum
Journal:  IEEE Trans Med Imaging       Date:  2016-03-30       Impact factor: 10.048

8.  Urinary bladder segmentation in CT urography using deep-learning convolutional neural network and level sets.

Authors:  Kenny H Cha; Lubomir Hadjiiski; Ravi K Samala; Heang-Ping Chan; Elaine M Caoili; Richard H Cohan
Journal:  Med Phys       Date:  2016-04       Impact factor: 4.071

9.  Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.

Authors:  W Kabsch; C Sander
Journal:  Biopolymers       Date:  1983-12       Impact factor: 2.505

10.  Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields.

Authors:  Sheng Wang; Jian Peng; Jianzhu Ma; Jinbo Xu
Journal:  Sci Rep       Date:  2016-01-11       Impact factor: 4.379

View more
  24 in total

1.  Deep learning in biomedicine.

Authors:  Michael Wainberg; Daniele Merico; Andrew Delong; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2018-09-06       Impact factor: 54.908

2.  Development of Machine-Learning Model to Predict COVID-19 Mortality: Application of Ensemble Model and Regarding Feature Impacts.

Authors:  Seung-Min Baik; Miae Lee; Kyung-Sook Hong; Dong-Jin Park
Journal:  Diagnostics (Basel)       Date:  2022-06-14

3.  Deep learning tools are top performers in long non-coding RNA prediction.

Authors:  Tea Ammunét; Ning Wang; Sofia Khan; Laura L Elo
Journal:  Brief Funct Genomics       Date:  2022-05-21       Impact factor: 4.840

Review 4.  Computational Strategies for Dissecting the High-Dimensional Complexity of Adaptive Immune Repertoires.

Authors:  Enkelejda Miho; Alexander Yermanos; Cédric R Weber; Christoph T Berger; Sai T Reddy; Victor Greiff
Journal:  Front Immunol       Date:  2018-02-21       Impact factor: 7.561

Review 5.  A Review of Deep Learning Methods for Antibodies.

Authors:  Jordan Graves; Jacob Byerly; Eduardo Priego; Naren Makkapati; S Vince Parish; Brenda Medellin; Monica Berrondo
Journal:  Antibodies (Basel)       Date:  2020-04-28

6.  Off-target predictions in CRISPR-Cas9 gene editing using deep learning.

Authors:  Jiecong Lin; Ka-Chun Wong
Journal:  Bioinformatics       Date:  2018-09-01       Impact factor: 6.937

7.  Uncovering the prognostic gene signatures for the improvement of risk stratification in cancers by using deep learning algorithm coupled with wavelet transform.

Authors:  Yiru Zhao; Yifan Zhou; Yuan Liu; Yinyi Hao; Menglong Li; Xuemei Pu; Chuan Li; Zhining Wen
Journal:  BMC Bioinformatics       Date:  2020-05-19       Impact factor: 3.169

Review 8.  Learning the Regulatory Code of Gene Expression.

Authors:  Jan Zrimec; Filip Buric; Mariia Kokina; Victor Garcia; Aleksej Zelezniak
Journal:  Front Mol Biosci       Date:  2021-06-10

9.  Development and evaluation of a deep learning model for protein-ligand binding affinity prediction.

Authors:  Marta M Stepniewska-Dziubinska; Piotr Zielenkiewicz; Pawel Siedlecki
Journal:  Bioinformatics       Date:  2018-11-01       Impact factor: 6.937

Review 10.  Probing lncRNA-Protein Interactions: Data Repositories, Models, and Algorithms.

Authors:  Lihong Peng; Fuxing Liu; Jialiang Yang; Xiaojun Liu; Yajie Meng; Xiaojun Deng; Cheng Peng; Geng Tian; Liqian Zhou
Journal:  Front Genet       Date:  2020-01-31       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.