Literature DB >> 24603751

Predicting essential genes in prokaryotic genomes using a linear method: ZUPLS.

Kai Song1, Tuopong Tong, Fang Wu.   

Abstract

An effective linear method, ZUPLS, was developed to improve the accuracy and speed of prokaryotic essential gene identification. ZUPLS only uses the Z-curve and other sequence-based features. Such features can be calculated readily from the DNA/amino acid sequences. Therefore, no well-studied biological network knowledge is required for using ZUPLS. This significantly simplifies essential gene identification, especially for newly sequenced species. ZUPLS can also select necessary features automatically by embedding the uninformative variable elimination tool into the partial least squares classifier. No optimized modelling parameters are needed. ZUPLS has been used, herein, to predict essential genes of 12 remotely related prokaryotes to test its performance. The cross-organism predictions yielded AUC (Area Under the Curve) scores between 0.8042 and 0.9319 by using E. coli genes as the training samples. Similarly, ZUPLS achieved AUC scores between 0.8111 and 0.9371 by using B. subtilis genes as the training samples. We also compared it with the best available results of the existing approaches for further testing. The improvement of the AUC score in predicting B. subtilis essential genes using E. coli genes was 0.13. Additionally, in predicting E. coli essential genes using P. aeruginosa genes, the significant improvement was 0.10. Similarly, the exceptional improvement of the average accuracy of M. pulmonis using M. genitalium and M. pulmonis genes was 14.7%. The combined superior feature extraction and selection power of ZUPLS enable it to give reliable prediction of essential genes for both Gram-positive/negative organisms and rich/poor culture media.

Entities:  

Mesh:

Year:  2014        PMID: 24603751     DOI: 10.1039/c3ib40241j

Source DB:  PubMed          Journal:  Integr Biol (Camb)        ISSN: 1757-9694            Impact factor:   2.192


  7 in total

1.  Prediction of essential genes in prokaryote based on artificial neural network.

Authors:  Luo Xu; Zhirui Guo; Xiao Liu
Journal:  Genes Genomics       Date:  2019-11-17       Impact factor: 1.839

2.  Selection of key sequence-based features for prediction of essential genes in 31 diverse bacterial species.

Authors:  Xiao Liu; Bao-Jin Wang; Luo Xu; Hong-Ling Tang; Guo-Qing Xu
Journal:  PLoS One       Date:  2017-03-30       Impact factor: 3.240

Review 3.  A Comprehensive Overview of Online Resources to Identify and Predict Bacterial Essential Genes.

Authors:  Chong Peng; Yan Lin; Hao Luo; Feng Gao
Journal:  Front Microbiol       Date:  2017-11-27       Impact factor: 5.640

4.  Sequence-based information-theoretic features for gene essentiality prediction.

Authors:  Dawit Nigatu; Patrick Sobetzko; Malik Yousef; Werner Henkel
Journal:  BMC Bioinformatics       Date:  2017-11-09       Impact factor: 3.169

5.  Network-based features enable prediction of essential genes across diverse organisms.

Authors:  Karthik Azhagesan; Balaraman Ravindran; Karthik Raman
Journal:  PLoS One       Date:  2018-12-13       Impact factor: 3.240

6.  A Computational Framework Based on Ensemble Deep Neural Networks for Essential Genes Identification.

Authors:  Nguyen Quoc Khanh Le; Duyen Thi Do; Truong Nguyen Khanh Hung; Luu Ho Thanh Lam; Tuan-Tu Huynh; Ngan Thi Kim Nguyen
Journal:  Int J Mol Sci       Date:  2020-11-28       Impact factor: 5.923

7.  An Approach for Predicting Essential Genes Using Multiple Homology Mapping and Machine Learning Algorithms.

Authors:  Hong-Li Hua; Fa-Zhan Zhang; Abraham Alemayehu Labena; Chuan Dong; Yan-Ting Jin; Feng-Biao Guo
Journal:  Biomed Res Int       Date:  2016-08-30       Impact factor: 3.411

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.