| Literature DB >> 36139026 |
Abstract
Single-stranded DNA (ssDNA) binding proteins (SSBs) are critical in maintaining genome stability by protecting the transient existence of ssDNA from damage during essential biological processes, such as DNA replication and gene transcription. The single-stranded region of telomeres also requires protection by ssDNA binding proteins from being attacked in case it is wrongly recognized as an anomaly. In addition to their critical roles in genome stability and integrity, it has been demonstrated that ssDNA and SSB-ssDNA interactions play critical roles in transcriptional regulation in all three domains of life and viruses. In this review, we present our current knowledge of the structure and function of SSBs and the structural features for SSB binding specificity. We then discuss the machine learning-based approaches that have been developed for the prediction of SSBs from double-stranded DNA (dsDNA) binding proteins (DSBs).Entities:
Keywords: SSB; binding specificity; single-stranded DNA; single-stranded DNA binding protein; ssDNA
Mesh:
Substances:
Year: 2022 PMID: 36139026 PMCID: PMC9496475 DOI: 10.3390/biom12091187
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1OB fold of SSBs and binding specificity. (A) Specific OB fold domain of human POT1 bound to telomeric single-stranded DNA (TTAGGGTTAG), PDB ID: 1xjv, chain: A, domain: 151–299. (B) Non-specific OB fold of Bacillus subtilis SsbB, PDB ID: 3vdy, chain: A. Black dashed lines, sidechain-base hydrogen bonds; red dashed lines, non-sidechain-base hydrogen bonds.
Figure 2Comparison of the distribution of the percentage of the sidechain-base hydrogen bonds in each protein chain–ssDNA complex between the specific (SP) and the non-specific (NS) protein–ssDNA complexes. Statistical analysis of the comparison between the NS and SP groups was done with Wilcoxon rank sum test. *** = p-value ≤ 0.001.
Figure 3The flowchart for machine learning-based prediction of SSBs.
Summary of machine learning-based SSB prediction methods.
| References | Predictor (If Any) | Features | Classifiers |
|---|---|---|---|
| Wang et al. [ | NA | -OAAC | SVM |
| Ali et al. [ | SDBP-Pred | -PSSM | SVM |
| Tan et al. [ | PredPSD | -OAAC | GTB |
| Sharma et al. [ | NA | -HMM | SVM |
| -normalized profile-monogram | RF | ||
| -normalized profile-bigram | KNN |