Literature DB >> 31987913

Using extreme gradient boosting to identify origin of replication in Saccharomyces cerevisiae via hybrid features.

Duyen Thi Do1, Nguyen Quoc Khanh Le2.   

Abstract

DNA replication is a fundamental task that plays a crucial role in the propagation of all living things on earth. Hence, the accurate identification of its origin could be the key to giving an insightful understanding of the regulatory mechanism of gene expression. Indeed, with the robust development of computational techniques and the abundant biological sequencing data, it has become possible for scientists to identify the origin of replication accurately and promptly. This growing concern has drawn a lot of attention among experts in this field. However, to gain better outcomes, more work is required. Therefore, this study is designed to explore the combination of state-of-the-art features and extreme gradient boosting learning system in classifying DNA sequences. Our hybrid approach is able to identify the origin of DNA replication with achieved sensitivity of 85.19%, specificity of 93.83%, accuracy of 89.51%, and MCC of 0.7931. Evidence is presented to show that our proposed method is superior to the state-of-the-art methods on the same benchmark dataset. Moreover, the research results represent a further step towards developing the prediction models for DNA replication in particular and DNA sequences in general.
Copyright © 2020 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Continuous bag of words; DNA replication; DNA sequencing; FastText; Prediction model; PseKNC; XGBoost

Mesh:

Year:  2020        PMID: 31987913     DOI: 10.1016/j.ygeno.2020.01.017

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  14 in total

1.  Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Gwang Lee
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

2.  Development and validation of a novel survival model for acute myeloid leukemia based on autophagy-related genes.

Authors:  Li Huang; Lier Lin; Xiangjun Fu; Can Meng
Journal:  PeerJ       Date:  2021-08-12       Impact factor: 2.984

3.  A Pyroptosis-Related Gene Signature Associated with Prognosis and Tumor Immune Microenvironment in Gliomas.

Authors:  Zunjie Zhou; Jing Xu; Ning Huang; Jun Tang; Ping Ma; Yuan Cheng
Journal:  Int J Gen Med       Date:  2022-05-06

4.  Construction of Molecular Subtypes and Related Prognostic and Immune Response Models Based on M2 Macrophages in Glioblastoma.

Authors:  Kai Xiao; Shushan Zhao; Jian Yuan; Yimin Pan; Ya Song; Lanhua Tang
Journal:  Int J Gen Med       Date:  2022-01-26

5.  A deep learning framework combined with word embedding to identify DNA replication origins.

Authors:  Feng Wu; Runtao Yang; Chengjin Zhang; Lina Zhang
Journal:  Sci Rep       Date:  2021-01-12       Impact factor: 4.379

6.  A Computational Framework Based on Ensemble Deep Neural Networks for Essential Genes Identification.

Authors:  Nguyen Quoc Khanh Le; Duyen Thi Do; Truong Nguyen Khanh Hung; Luu Ho Thanh Lam; Tuan-Tu Huynh; Ngan Thi Kim Nguyen
Journal:  Int J Mol Sci       Date:  2020-11-28       Impact factor: 5.923

7.  In-Silico Multi-Omics Analysis of the Functional Significance of Calmodulin 1 in Multiple Cancers.

Authors:  Maolin Yao; Lanyi Fu; Xuedong Liu; Dong Zheng
Journal:  Front Genet       Date:  2022-01-12       Impact factor: 4.599

8.  Identification of patients at risk of new onset heart failure: Utilizing a large statewide health information exchange to train and validate a risk prediction model.

Authors:  Son Q Duong; Le Zheng; Minjie Xia; Bo Jin; Modi Liu; Zhen Li; Shiying Hao; Shaun T Alfreds; Karl G Sylvester; Eric Widen; Jeffery J Teuteberg; Doff B McElhinney; Xuefeng B Ling
Journal:  PLoS One       Date:  2021-12-10       Impact factor: 3.240

9.  Machine Learning Revealed Ferroptosis Features and a Novel Ferroptosis-Based Classification for Diagnosis in Acute Myocardial Infarction.

Authors:  Dan Huang; Shiya Zheng; Zhuyuan Liu; Kongbo Zhu; Hong Zhi; Genshan Ma
Journal:  Front Genet       Date:  2022-01-25       Impact factor: 4.599

10.  Analysis of DNA Sequence Classification Using CNN and Hybrid Models.

Authors:  Hemalatha Gunasekaran; K Ramalakshmi; A Rex Macedo Arokiaraj; S Deepa Kanmani; Chandran Venkatesan; C Suresh Gnana Dhas
Journal:  Comput Math Methods Med       Date:  2021-07-15       Impact factor: 2.238

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.