Literature DB >> 33660783

iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Zhen Chen1, Pei Zhao2, Chen Li3, Fuyi Li3,4,5, Dongxu Xiang3,4, Yong-Zi Chen6, Tatsuya Akutsu7, Roger J Daly3, Geoffrey I Webb4, Quanzhi Zhao1,8, Lukasz Kurgan9, Jiangning Song3,4.   

Abstract

Sequence-based analysis and prediction are fundamental bioinformatic tasks that facilitate understanding of the sequence(-structure)-function paradigm for DNAs, RNAs and proteins. Rapid accumulation of sequences requires equally pervasive development of new predictive models, which depends on the availability of effective tools that support these efforts. We introduce iLearnPlus, the first machine-learning platform with graphical- and web-based interfaces for the construction of machine-learning pipelines for analysis and predictions using nucleic acid and protein sequences. iLearnPlus provides a comprehensive set of algorithms and automates sequence-based feature extraction and analysis, construction and deployment of models, assessment of predictive performance, statistical analysis, and data visualization; all without programming. iLearnPlus includes a wide range of feature sets which encode information from the input sequences and over twenty machine-learning algorithms that cover several deep-learning approaches, outnumbering the current solutions by a wide margin. Our solution caters to experienced bioinformaticians, given the broad range of options, and biologists with no programming background, given the point-and-click interface and easy-to-follow design process. We showcase iLearnPlus with two case studies concerning prediction of long noncoding RNAs (lncRNAs) from RNA transcripts and prediction of crotonylation sites in protein chains. iLearnPlus is an open-source platform available at https://github.com/Superzchen/iLearnPlus/ with the webserver at http://ilearnplus.erc.monash.edu/.
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2021        PMID: 33660783      PMCID: PMC8191785          DOI: 10.1093/nar/gkab122

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  111 in total

1.  An efficient algorithm for large-scale detection of protein families.

Authors:  A J Enright; S Van Dongen; C A Ouzounis
Journal:  Nucleic Acids Res       Date:  2002-04-01       Impact factor: 16.971

2.  Clustering by passing messages between data points.

Authors:  Brendan J Frey; Delbert Dueck
Journal:  Science       Date:  2007-01-11       Impact factor: 47.728

3.  Combinatorial Targeting by MicroRNAs Co-ordinates Post-transcriptional Control of EMT.

Authors:  Joseph Cursons; Katherine A Pillman; Kaitlin G Scheer; Philip A Gregory; Momeneh Foroutan; Soroor Hediyeh-Zadeh; John Toubia; Edmund J Crampin; Gregory J Goodall; Cameron P Bracken; Melissa J Davis
Journal:  Cell Syst       Date:  2018-07-11       Impact factor: 10.304

4.  PseKRAAC: a flexible web server for generating pseudo K-tuple reduced amino acids composition.

Authors:  Yongchun Zuo; Yuan Li; Yingli Chen; Guangpeng Li; Zhenhe Yan; Lei Yang
Journal:  Bioinformatics       Date:  2016-08-26       Impact factor: 6.937

Review 5.  Machine learning techniques for protein function prediction.

Authors:  Rosalin Bonetta; Gianluca Valentino
Journal:  Proteins       Date:  2019-11-14

6.  Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.

Authors:  Zhen Chen; Pei Zhao; Fuyi Li; Yanan Wang; A Ian Smith; Geoffrey I Webb; Tatsuya Akutsu; Abdelkader Baggag; Halima Bensmail; Jiangning Song
Journal:  Brief Bioinform       Date:  2019-11-11       Impact factor: 11.622

7.  SUMOhydro: a novel method for the prediction of sumoylation sites based on hydrophobic properties.

Authors:  Yong-Zi Chen; Zhen Chen; Yu-Ai Gong; Guoguang Ying
Journal:  PLoS One       Date:  2012-06-14       Impact factor: 3.240

8.  iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition.

Authors:  Bin Liu; Jinghao Xu; Xun Lan; Ruifeng Xu; Jiyun Zhou; Xiaolong Wang; Kuo-Chen Chou
Journal:  PLoS One       Date:  2014-09-03       Impact factor: 3.240

9.  Selene: a PyTorch-based deep learning library for sequence data.

Authors:  Kathleen M Chen; Evan M Cofer; Jian Zhou; Olga G Troyanskaya
Journal:  Nat Methods       Date:  2019-03-28       Impact factor: 28.547

10.  A deep learning method for lincRNA detection using auto-encoder algorithm.

Authors:  Ning Yu; Zeng Yu; Yi Pan
Journal:  BMC Bioinformatics       Date:  2017-12-06       Impact factor: 3.169

View more
  25 in total

1.  Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction.

Authors:  Meng Zhang; Cangzhi Jia; Fuyi Li; Chen Li; Yan Zhu; Tatsuya Akutsu; Geoffrey I Webb; Quan Zou; Lachlan J M Coin; Jiangning Song
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

2.  MathFeature: feature extraction package for DNA, RNA and protein sequences based on mathematical descriptors.

Authors:  Robson P Bonidia; Douglas S Domingues; Danilo S Sanches; André C P L F de Carvalho
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

3.  BioAutoML: automated feature engineering and metalearning to predict noncoding RNAs in bacteria.

Authors:  Robson P Bonidia; Anderson P Avila Santos; Breno L S de Almeida; Peter F Stadler; Ulisses N da Rocha; Danilo S Sanches; André C P L F de Carvalho
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

4.  PASSer2.0: Accurate Prediction of Protein Allosteric Sites Through Automated Machine Learning.

Authors:  Sian Xiao; Hao Tian; Peng Tao
Journal:  Front Mol Biosci       Date:  2022-07-11

5.  iPro-WAEL: a comprehensive and robust framework for identifying promoters in multiple species.

Authors:  Pengyu Zhang; Hongming Zhang; Hao Wu
Journal:  Nucleic Acids Res       Date:  2022-10-14       Impact factor: 19.160

6.  Geographic encoding of transcripts enabled high-accuracy and isoform-aware deep learning of RNA methylation.

Authors:  Daiyun Huang; Kunqi Chen; Bowen Song; Zhen Wei; Jionglong Su; Frans Coenen; João Pedro de Magalhães; Daniel J Rigden; Jia Meng
Journal:  Nucleic Acids Res       Date:  2022-10-14       Impact factor: 19.160

7.  EMDLP: Ensemble multiscale deep learning model for RNA methylation site prediction.

Authors:  Honglei Wang; Hui Liu; Tao Huang; Gangshen Li; Lin Zhang; Yanjing Sun
Journal:  BMC Bioinformatics       Date:  2022-06-08       Impact factor: 3.307

8.  nhKcr: a new bioinformatics tool for predicting crotonylation sites on human nonhistone proteins based on deep learning.

Authors:  Yong-Zi Chen; Zhuo-Zhi Wang; Yanan Wang; Guoguang Ying; Zhen Chen; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

9.  BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models.

Authors:  Hong-Liang Li; Yi-He Pang; Bin Liu
Journal:  Nucleic Acids Res       Date:  2021-12-16       Impact factor: 16.971

10.  Porpoise: a new approach for accurate prediction of RNA pseudouridine sites.

Authors:  Fuyi Li; Xudong Guo; Peipei Jin; Jinxiang Chen; Dongxu Xiang; Jiangning Song; Lachlan J M Coin
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 13.994

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.