Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Literature DB >> 33660783

iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Zhen Chen¹, Pei Zhao², Chen Li³, Fuyi Li^3,4,5, Dongxu Xiang^3,4, Yong-Zi Chen⁶, Tatsuya Akutsu⁷, Roger J Daly³, Geoffrey I Webb⁴, Quanzhi Zhao^1,8, Lukasz Kurgan⁹, Jiangning Song^3,4.

Abstract

Sequence-based analysis and prediction are fundamental bioinformatic tasks that facilitate understanding of the sequence(-structure)-function paradigm for DNAs, RNAs and proteins. Rapid accumulation of sequences requires equally pervasive development of new predictive models, which depends on the availability of effective tools that support these efforts. We introduce iLearnPlus, the first machine-learning platform with graphical- and web-based interfaces for the construction of machine-learning pipelines for analysis and predictions using nucleic acid and protein sequences. iLearnPlus provides a comprehensive set of algorithms and automates sequence-based feature extraction and analysis, construction and deployment of models, assessment of predictive performance, statistical analysis, and data visualization; all without programming. iLearnPlus includes a wide range of feature sets which encode information from the input sequences and over twenty machine-learning algorithms that cover several deep-learning approaches, outnumbering the current solutions by a wide margin. Our solution caters to experienced bioinformaticians, given the broad range of options, and biologists with no programming background, given the point-and-click interface and easy-to-follow design process. We showcase iLearnPlus with two case studies concerning prediction of long noncoding RNAs (lncRNAs) from RNA transcripts and prediction of crotonylation sites in protein chains. iLearnPlus is an open-source platform available at https://github.com/Superzchen/iLearnPlus/ with the webserver at http://ilearnplus.erc.monash.edu/.

Entities: CellLine Chemical Disease Gene Species

Mesh：

Year: 2021 PMID： 33660783 PMCID： PMC8191785 DOI： 10.1093/nar/gkab122

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

111 in total

1. An efficient algorithm for large-scale detection of protein families.

Authors: A J Enright; S Van Dongen; C A Ouzounis
Journal: Nucleic Acids Res Date: 2002-04-01 Impact factor: 16.971

2. Clustering by passing messages between data points.

Authors: Brendan J Frey; Delbert Dueck
Journal: Science Date: 2007-01-11 Impact factor: 47.728

3. Combinatorial Targeting by MicroRNAs Co-ordinates Post-transcriptional Control of EMT.

Authors: Joseph Cursons; Katherine A Pillman; Kaitlin G Scheer; Philip A Gregory; Momeneh Foroutan; Soroor Hediyeh-Zadeh; John Toubia; Edmund J Crampin; Gregory J Goodall; Cameron P Bracken; Melissa J Davis
Journal: Cell Syst Date: 2018-07-11 Impact factor: 10.304

4. PseKRAAC: a flexible web server for generating pseudo K-tuple reduced amino acids composition.

Authors: Yongchun Zuo; Yuan Li; Yingli Chen; Guangpeng Li; Zhenhe Yan; Lei Yang
Journal: Bioinformatics Date: 2016-08-26 Impact factor: 6.937

Review 5. Machine learning techniques for protein function prediction.

Authors: Rosalin Bonetta; Gianluca Valentino
Journal: Proteins Date: 2019-11-14

6. Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.

Authors: Zhen Chen; Pei Zhao; Fuyi Li; Yanan Wang; A Ian Smith; Geoffrey I Webb; Tatsuya Akutsu; Abdelkader Baggag; Halima Bensmail; Jiangning Song
Journal: Brief Bioinform Date: 2019-11-11 Impact factor: 11.622

7. SUMOhydro: a novel method for the prediction of sumoylation sites based on hydrophobic properties.

Authors: Yong-Zi Chen; Zhen Chen; Yu-Ai Gong; Guoguang Ying
Journal: PLoS One Date: 2012-06-14 Impact factor: 3.240

8. iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition.

Authors: Bin Liu; Jinghao Xu; Xun Lan; Ruifeng Xu; Jiyun Zhou; Xiaolong Wang; Kuo-Chen Chou
Journal: PLoS One Date: 2014-09-03 Impact factor: 3.240

9. Selene: a PyTorch-based deep learning library for sequence data.

Authors: Kathleen M Chen; Evan M Cofer; Jian Zhou; Olga G Troyanskaya
Journal: Nat Methods Date: 2019-03-28 Impact factor: 28.547

10. A deep learning method for lincRNA detection using auto-encoder algorithm.

Authors: Ning Yu; Zeng Yu; Yi Pan
Journal: BMC Bioinformatics Date: 2017-12-06 Impact factor: 3.169

25 in total

1. Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction.

Authors: Meng Zhang; Cangzhi Jia; Fuyi Li; Chen Li; Yan Zhu; Tatsuya Akutsu; Geoffrey I Webb; Quan Zou; Lachlan J M Coin; Jiangning Song
Journal: Brief Bioinform Date: 2022-03-10 Impact factor: 11.622

2. MathFeature: feature extraction package for DNA, RNA and protein sequences based on mathematical descriptors.

Authors: Robson P Bonidia; Douglas S Domingues; Danilo S Sanches; André C P L F de Carvalho
Journal: Brief Bioinform Date: 2022-01-17 Impact factor: 11.622

3. BioAutoML: automated feature engineering and metalearning to predict noncoding RNAs in bacteria.

Authors: Robson P Bonidia; Anderson P Avila Santos; Breno L S de Almeida; Peter F Stadler; Ulisses N da Rocha; Danilo S Sanches; André C P L F de Carvalho
Journal: Brief Bioinform Date: 2022-07-18 Impact factor: 13.994

4. PASSer2.0: Accurate Prediction of Protein Allosteric Sites Through Automated Machine Learning.

Authors: Sian Xiao; Hao Tian; Peng Tao
Journal: Front Mol Biosci Date: 2022-07-11

5. iPro-WAEL: a comprehensive and robust framework for identifying promoters in multiple species.

Authors: Pengyu Zhang; Hongming Zhang; Hao Wu
Journal: Nucleic Acids Res Date: 2022-10-14 Impact factor: 19.160

6. Geographic encoding of transcripts enabled high-accuracy and isoform-aware deep learning of RNA methylation.

Authors: Daiyun Huang; Kunqi Chen; Bowen Song; Zhen Wei; Jionglong Su; Frans Coenen; João Pedro de Magalhães; Daniel J Rigden; Jia Meng
Journal: Nucleic Acids Res Date: 2022-10-14 Impact factor: 19.160

7. EMDLP: Ensemble multiscale deep learning model for RNA methylation site prediction.

Authors: Honglei Wang; Hui Liu; Tao Huang; Gangshen Li; Lin Zhang; Yanjing Sun
Journal: BMC Bioinformatics Date: 2022-06-08 Impact factor: 3.307

8. nhKcr: a new bioinformatics tool for predicting crotonylation sites on human nonhistone proteins based on deep learning.

Authors: Yong-Zi Chen; Zhuo-Zhi Wang; Yanan Wang; Guoguang Ying; Zhen Chen; Jiangning Song
Journal: Brief Bioinform Date: 2021-11-05 Impact factor: 11.622

9. BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models.

Authors: Hong-Liang Li; Yi-He Pang; Bin Liu
Journal: Nucleic Acids Res Date: 2021-12-16 Impact factor: 16.971

10. Porpoise: a new approach for accurate prediction of RNA pseudouridine sites.

Authors: Fuyi Li; Xudong Guo; Peipei Jin; Jinxiang Chen; Dongxu Xiang; Jiangning Song; Lachlan J M Coin
Journal: Brief Bioinform Date: 2021-11-05 Impact factor: 13.994