Literature DB >> 32065211

A computational platform to identify origins of replication sites in eukaryotes.

Fu-Ying Dao1, Hao Lv1, Hasan Zulfiqar1, Hui Yang1, Wei Su1, Hui Gao1, Hui Ding1, Hao Lin1.   

Abstract

The locations of the initiation of genomic DNA replication are defined as origins of replication sites (ORIs), which regulate the onset of DNA replication and play significant roles in the DNA replication process. The study of ORIs is essential for understanding the cell-division cycle and gene expression regulation. Accurate identification of ORIs will provide important clues for DNA replication research and drug development by developing computational methods. In this paper, the first integrated predictor named iORI-Euk was built to identify ORIs in multiple eukaryotes and multiple cell types. In the predictor, seven eukaryotic (Homo sapiens, Mus musculus, Drosophila melanogaster, Arabidopsis thaliana, Pichia pastoris, Schizosaccharomyces pombe and Kluyveromyces lactis) ORI data was collected from public database to construct benchmark datasets. Subsequently, three feature extraction strategies which are k-mer, binary encoding and combination of k-mer and binary were used to formulate DNA sequence samples. We also compared the different classification algorithms' performance. As a result, the best results were obtained by using support vector machine in 5-fold cross-validation test and independent dataset test. Based on the optimal model, an online web server called iORI-Euk (http://lin-group.cn/server/iORI-Euk/) was established for the novel ORI identification.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  classification algorithm; eukaryote; feature extraction; origins of replication site; webserver

Year:  2021        PMID: 32065211     DOI: 10.1093/bib/bbaa017

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  19 in total

1.  Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Gwang Lee
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

2.  Identification and Classification of Enhancers Using Dimension Reduction Technique and Recurrent Neural Network.

Authors:  Qingwen Li; Lei Xu; Qingyuan Li; Lichao Zhang
Journal:  Comput Math Methods Med       Date:  2020-10-18       Impact factor: 2.238

3.  iPTT(2 L)-CNN: A Two-Layer Predictor for Identifying Promoters and Their Types in Plant Genomes by Convolutional Neural Network.

Authors:  Ang Sun; Xuan Xiao; Zhaochun Xu
Journal:  Comput Math Methods Med       Date:  2021-01-05       Impact factor: 2.238

4.  Predicting Cell Wall Lytic Enzymes Using Combined Features.

Authors:  Xiao-Yang Jing; Feng-Min Li
Journal:  Front Bioeng Biotechnol       Date:  2021-01-06

5.  ApoPred: Identification of Apolipoproteins and Their Subfamilies With Multifarious Features.

Authors:  Ting Liu; Jia-Mao Chen; Dan Zhang; Qian Zhang; Bowen Peng; Lei Xu; Hua Tang
Journal:  Front Cell Dev Biol       Date:  2021-01-08

6.  Prevention and Control of Pathogens Based on Big-Data Mining and Visualization Analysis.

Authors:  Cui-Xia Chen; Li-Na Sun; Xue-Xin Hou; Peng-Cheng Du; Xiao-Long Wang; Xiao-Chen Du; Yu-Fei Yu; Rui-Kun Cai; Lei Yu; Tian-Jun Li; Min-Na Luo; Yue Shen; Chao Lu; Qian Li; Chuan Zhang; Hua-Fang Gao; Xu Ma; Hao Lin; Zong-Fu Cao
Journal:  Front Mol Biosci       Date:  2021-02-25

7.  A Mendelian Randomization Analysis to Expose the Causal Effect of IL-18 on Osteoporosis Based on Genome-Wide Association Study Data.

Authors:  Ni Kou; Wenyang Zhou; Yuzhu He; Xiaoxia Ying; Songling Chai; Tao Fei; Wenqi Fu; Jiaqian Huang; Huiying Liu
Journal:  Front Bioeng Biotechnol       Date:  2020-03-20

8.  SDN2GO: An Integrated Deep Learning Model for Protein Function Prediction.

Authors:  Yideng Cai; Jiacheng Wang; Lei Deng
Journal:  Front Bioeng Biotechnol       Date:  2020-04-29

9.  Identifying Heat Shock Protein Families from Imbalanced Data by Using Combined Features.

Authors:  Xiao-Yang Jing; Feng-Min Li
Journal:  Comput Math Methods Med       Date:  2020-09-23       Impact factor: 2.238

10.  Predicting Gram-Positive Bacterial Protein Subcellular Location by Using Combined Features.

Authors:  Feng-Min Li; Xiao-Wei Gao
Journal:  Biomed Res Int       Date:  2020-08-02       Impact factor: 3.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.