| Literature DB >> 34624074 |
Zheng Jiang1, Si-Rui Xiao1, Rong Liu1.
Abstract
The biological functions of DNA and RNA generally depend on their interactions with other molecules, such as small ligands, proteins and nucleic acids. However, our knowledge of the nucleic acid binding sites for different interaction partners is very limited, and identification of these critical binding regions is not a trivial work. Herein, we performed a comprehensive comparison between binding and nonbinding sites and among different categories of binding sites in these two nucleic acid classes. From the structural perspective, RNA may interact with ligands through forming binding pockets and contact proteins and nucleic acids using protruding surfaces, while DNA may adopt regions closer to the middle of the chain to make contacts with other molecules. Based on structural information, we established a feature-based ensemble learning classifier to identify the binding sites by fully using the interplay among different machine learning algorithms, feature spaces and sample spaces. Meanwhile, we designed a template-based classifier by exploiting structural conservation. The complementarity between the two classifiers motivated us to build an integrative framework for improving prediction performance. Moreover, we utilized a post-processing procedure based on the random walk algorithm to further correct the integrative predictions. Our unified prediction framework yielded promising results for different binding sites and outperformed existing methods.Entities:
Keywords: DNA binding site; RNA binding site; ensemble learning; integrative algorithm; structural homology
Mesh:
Substances:
Year: 2022 PMID: 34624074 PMCID: PMC8769709 DOI: 10.1093/bib/bbab411
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622