| Literature DB >> 29233729 |
Vinod Kumar Singh1, Vipin Kumar2, Annangarachari Krishnamachari3.
Abstract
Autonomous replication sequences (ARS) are essential for the replication of Saccharomyces cerevisiae genome. The content and context of ARS sites are distinct from other segments of the genome and these factors influence the conformation and thermodynamic profile of DNA that favor binding of the origin recognition complex proteins. Identification of ARS sites in the genome is a challenging task because of their organizational complexity and degeneracy present across the intergenic regions. We considered a few properties of DNA segments and divided them into multiple subsets (views) for computational prediction of ARS sequences. Our approach utilized these views for learning classification models in an ensemble manner and accordingly predictions were made. This approach maximized the prediction accuracy over the traditional way where all features are selected at once. Our study also revealed that major groove width and major groove depth are the most prominent properties that distinguished ARS from other segments of the genome. Our investigation also provides clue about the most suitable classifier for a given feature set, and this strategy may be useful for finding ARS in other closely related species.Entities:
Mesh:
Year: 2017 PMID: 29233729 DOI: 10.1016/j.biosystems.2017.12.005
Source DB: PubMed Journal: Biosystems ISSN: 0303-2647 Impact factor: 1.973