Literature DB >> 25050475

Genome-wide characterization and prediction of Arabidopsis thaliana replication origins.

Yong-Qiang Xing1, Guo-Qing Liu2, Xiu-Juan Zhao2, Hong-Yu Zhao3, Lu Cai4.   

Abstract

Identification of replication origins is crucial for the faithful duplication of genomic DNA. The frequencies of single nucleotides and dinucleotides, GC/AT bias and GC/AT profile in the vicinity of Arabidopsis thaliana replication origins were analyzed in the present work. The guanine content or cytosine content is higher in origin of replication (Ori) than in non-Ori. The SS (S=G or C) dinucleotides are favoured in Ori whereas WW (W=A or T) dinucleotides are favoured in non-Ori. GC/AT bias and GC/AT profile in Ori are significantly different from that in non-Ori. Furthermore, by inputting DNA sequence features into support vector machine, we distinguished between the Ori and non-Ori regions in A. thaliana. The total prediction accuracy is about 69.5% as evaluated by the 10-fold cross-validation. This result suggested that apart from DNA sequence, deciphering the selection of replication origin must integrate many other factors including nucleosome positioning, DNA methylation, histone modification, etc. In addition, by comparing predictive performance we found that the predictive accuracy of SVM using sequence features on the context of WS language is significantly better than that of RY language. Furthermore, the same conclusion was also obtained in S. cerevisiae and D. melanogaster.
Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

Entities:  

Keywords:  A. thaliana; Compositional bias; Predictive performance; Replication origin; Sequence information; Support vector machine

Mesh:

Substances:

Year:  2014        PMID: 25050475     DOI: 10.1016/j.biosystems.2014.07.001

Source DB:  PubMed          Journal:  Biosystems        ISSN: 0303-2647            Impact factor:   1.973


  3 in total

1.  Arabidopsis DNA Replication Initiates in Intergenic, AT-Rich Open Chromatin.

Authors:  Emily Wheeler; Ashley M Brooks; Lorenzo Concia; Daniel L Vera; Emily E Wear; Chantal LeBlanc; Umamaheswari Ramu; Matthew W Vaughn; Hank W Bass; Robert A Martienssen; William F Thompson; Linda Hanley-Bowdoin
Journal:  Plant Physiol       Date:  2020-03-23       Impact factor: 8.340

2.  Sequence analysis of origins of replication in the Saccharomyces cerevisiae genomes.

Authors:  Wen-Chao Li; Zhe-Jin Zhong; Pan-Pan Zhu; En-Ze Deng; Hui Ding; Wei Chen; Hao Lin
Journal:  Front Microbiol       Date:  2014-11-18       Impact factor: 5.640

3.  Identification of Proteins of Tobacco Mosaic Virus by Using a Method of Feature Extraction.

Authors:  Yu-Miao Chen; Xin-Ping Zu; Dan Li
Journal:  Front Genet       Date:  2020-10-09       Impact factor: 4.599

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.