| Literature DB >> 35054954 |
Martin Bartas1, Kristyna Slychko1, Václav Brázda2, Jiří Červeň1, Christopher A Beaudoin3, Tom L Blundell3, Petr Pečinka1.
Abstract
Z-DNA and Z-RNA are functionally important left-handed structures of nucleic acids, which play a significant role in several molecular and biological processes including DNA replication, gene expression regulation and viral nucleic acid sensing. Most proteins that have been proven to interact with Z-DNA/Z-RNA contain the so-called Zα domain, which is structurally well conserved. To date, only eight proteins with Zα domain have been described within a few organisms (including human, mouse, Danio rerio, Trypanosoma brucei and some viruses). Therefore, this paper aimed to search for new Z-DNA/Z-RNA binding proteins in the complete PDB structures database and from the AlphaFold2 protein models. A structure-based similarity search found 14 proteins with highly similar Zα domain structure in experimentally-defined proteins and 185 proteins with a putative Zα domain using the AlphaFold2 models. Structure-based alignment and molecular docking confirmed high functional conservation of amino acids involved in Z-DNA/Z-RNA, suggesting that Z-DNA/Z-RNA recognition may play an important role in a variety of cellular processes.Entities:
Keywords: Z-DNA; Z-RNA; Zα domain; bioinformatics; protein binding
Mesh:
Substances:
Year: 2022 PMID: 35054954 PMCID: PMC8775963 DOI: 10.3390/ijms23020768
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Schematic diagram of classical right-handed B-DNA, left-handed Z-DNA/Z-RNA, and Zα domain consisting of three α-helices and two β-strands. This domain is known to specifically interact with left-handed nucleic acids, mainly through its α-helix 3 and some amino acid residues of beta-strands.
Known Z-DNA/RNA binding proteins containing experimentally solved Zα or Zβ domain(s) (PDB IDs are provided). UniProtKB IDs of all proteins are provided as well.
| Protein Symbol/ID | Protein Name | Organism | Protein Length | Function | PDB ID | Method/ | Domain | Ref. |
|---|---|---|---|---|---|---|---|---|
| ADAR (P55265) | Double-stranded RNA-specific adenosine deaminase |
| 1226 | Hydrolytic deamination of adenosine to inosine in dsRNA (A-to-I RNA editing) | 1XMK | XRC/0.97 Å | Zβ | [ |
| 1QGP | NMR | Zα | [ | |||||
| 3F21 | XRC/2.20 Å | Zα | [ | |||||
| 3F22 | XRC/2.50 Å | Zα | ||||||
| 3F23 | XRC/2.70 Å | Zα | ||||||
| 2GXB | XRC/2.25 Å | Zα | [ | |||||
| ZBP1 (Q9H171) | Z-DNA-binding protein 1 |
| 429 | Innate sensor recognising viral | 2L4M | NMR | Zβ | [ |
| Zbp1/DAI | Z-DNA-binding protein 1 |
| 411 | 1J75 | XRC/1.85 Å | Zα | [ | |
| PKZ (Q5NE12) | Protein kinase-containing Z-DNA-binding domains |
| 511 | Defence response to virus | 4LB5 | XRC/2.00 Å | Zα | [ |
| 4LB6 | XRC/1.80 Å | |||||||
| ORF112 (A4FTK7) | Protein ORF112 | 278 | Double-stranded RNA adenosine deaminase activity; RNA binding | 4WCG | XRC/1.50 Å | Zα | [ | |
| E3L (P21605) | Protein E3 |
| 190 | Double-stranded RNA adenosine deaminase activity; inhibition of multiple cellular antiviral responses activated by dsRNA | 7C0I | XRC/2.40 Å | Zα | [ |
| 34L (Q9DHS8) | 34L protein |
| 185 | Same as E3L | 1SFU | XRC/2.00 Å | Zα | [ |
Predicted Z-DNA/RNA binding proteins based on structural similarity to the experimentally validated Zα domain (3f21). Proteins are sorted according to their decreasing similarity score (Q-score); HOP2 is the best hit. UniProtKB IDs of all proteins are provided as well.
| Protein Symbol/ID | Protein Name | Organism | Domain | Protein Length | Cellular Localisation/Known Function |
|---|---|---|---|---|---|
| HOP2 (O35047) | Homologous-pairing protein 2 homolog |
| Eukarya | 217 | Nucleus/DNA binding, meiotic recombination, double-strand break repair, positive regulation of transcription by RNA pol II [ |
| DsvD (Q46582) | DsvD |
| Bacteria | 78 | Role in dissimilatory sulfite reduction, Possible Interaction with B- and Z-DNA by Its Winged-Helix Motif [ |
| D2PEW5 | Uncharacterised DNA binding protein |
| Archaea | 59 | DNA binding |
| feoC | Probable [Fe-S]-dependent transcriptional repressor |
| Bacteria | 79 | DNA binding may function as a transcriptional regulator that controls feoABC expression [ |
| pefI | FaeA-like protein |
| Bacteria | 70 | Regulation of transcription [ |
| RPA2 (P15927) | Replication protein A 32 kDa subunit |
| Eukarya | 270 | Nucleus/DNA binding, multifunctional protein (DNA repairs, DNA replication, telomere maintenance, preventing G-quadruplex formation) [ |
| CDC53 (Q12018) | Cell division control protein 53 |
| Eukarya | 815 | Nucleus & Cytoplasm/DNA replication origin binding, cell division, protein ubiquitination [ |
| CUL1 (Q13616) | Cullin-1 |
| Eukarya | 776 | Nucleus & Cytoplasm/Protein ubiquitination, cell division, transcription regulation [ |
| ANC2 (Q9UJX6) | Anaphase-promoting complex subunit 2 |
| Eukarya | 822 | Nucleus & Cytoplasm/Component of the anaphase promoting complex/cyclosome (APC/C) [ |
| SCC1 (Q12158) | Sister chromatid cohesion protein 1 |
| Eukarya | 566 | Nucleus/Mitotic sister chromatid cohesion, double-strand break repair [ |
| APC2 (Q12440) | Anaphase-promoting complex subunit 2 |
| Eukarya | 853 | Nucleus & cytoplasm/Component of the anaphase promoting complex/cyclosome (APC/C) [ |
| Rpc34 (Q921X6) | DNA-directed RNA polymerase III subunit RPC6 |
| Eukarya | 316 | Nucleus/Nuclear and cytosolic DNA sensor involved in innate immune response, defence response to the virus [ |
| PBP2 (A0A0E3GTJ4) | Archaeal DNA polymerase holoenzyme (PBP2 subunit) |
| Archaea | 76 | Enhances DNA synthesis [ |
| Reut_B4095 (Q46TT3) | Putative DNA-binding protein |
| Bacteria | 95 | DNA binding |
Figure 2Comparison of the reference Zα domain (PDB: 3f21) (upper left corner) with the experimentally solved proteins (or their corresponding domains) having significant structural similarity (structures are ordered according to their similarity score to the reference structure (HOP2 best, DsvD second best, etc.).
Figure 3Sequence alignment is constructed from the structural superposition of the Zα domain of human ADAR1 protein (PDB: 3f21) and the 14 possible Z-DNA/Z-RNA binding proteins. The default colour of fully populated columns is light red, in addition, helices are coloured in yellow and strands in green. Letter colours correspond to the ClustalX colouring scheme.
Figure 4Position of Zα domain in ADAR1 and within 14 newly described possible Z-DNA/Z-RNA binding proteins. The exact position of Zα and its structural homologs is always highlighted in yellow.
Figure 5Representative molecular docking of human RPA2 Zα structural homolog to Z-DNA (A) and Z-RNA (B). Protein alpha helices are in red, beta-strands in green, coiled-coil regions in azure. Highlighting of Z-DNA/Z-RNA follows classic NDB colouring (guanines in green, cytosines in yellow).
Figure 6STRING interaction network of newly identified human possible Z-DNA/Z-RNA binding proteins (in bold and higher letter size), together with 2 previously known human Z-DNA/Z-RNA binding proteins (ZBP1 and ADAR), and also with 50 first shell interactors. Clustering was made using MCL inflation parameter (3), the resulting five clusters are highlighted in distinct colours. Line thickness indicates the strength of data support and edges between different clusters are dotted.