| Literature DB >> 35682854 |
Richard P Bowater1, Natália Bohálová2,3, Václav Brázda2.
Abstract
Cruciforms occur when inverted repeat sequences in double-stranded DNA adopt intra-strand hairpins on opposing strands. Biophysical and molecular studies of these structures confirm their characterization as four-way junctions and have demonstrated that several factors influence their stability, including overall chromatin structure and DNA supercoiling. Here, we review our understanding of processes that influence the formation and stability of cruciforms in genomes, covering the range of sequences shown to have biological significance. It is challenging to accurately sequence repetitive DNA sequences, but recent advances in sequencing methods have deepened understanding about the amounts of inverted repeats in genomes from all forms of life. We highlight that, in the majority of genomes, inverted repeats are present in higher numbers than is expected from a random occurrence. It is, therefore, becoming clear that inverted repeats play important roles in regulating many aspects of DNA metabolism, including replication, gene expression, and recombination. Cruciforms are targets for many architectural and regulatory proteins, including topoisomerases, p53, Rif1, and others. Notably, some of these proteins can induce the formation of cruciform structures when they bind to DNA. Inverted repeat sequences also influence the evolution of genomes, and growing evidence highlights their significance in several human diseases, suggesting that the inverted repeat sequences and/or DNA cruciforms could be useful therapeutic targets in some cases.Entities:
Keywords: DNA base sequence; DNA structure; DNA supercoiling; cruciform; epigenetics; genome stability; inverted repeat; replication; transcription
Mesh:
Substances:
Year: 2022 PMID: 35682854 PMCID: PMC9180970 DOI: 10.3390/ijms23116171
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Figure 1Inverted repeat sequences can form different types of double-stranded conformations. (A) Transition of inverted repeat in a linear conformation to a double hairpin, cruciform state. For the sequence indicated, the cruciform structure consists of four branchpoints and two 7 bp-long stems, each with 4 nt loops. (B) Decisive factors for the resulting thermodynamic stability and genomic occurrence of cruciform structures are: (1) stem size indicated in blue; (2) loop length indicated in purple; (3) possible mismatches in base pairing indicated in red. The arrows at the top and bottom of part (B) highlight the relative stability and occurrence of the represented cruciforms, with the larger and darker part of the arrows indicating those that are most stable and are most likely to occur in genomes. For all schematic molecules, the arrow indicates the 3′-end of the DNA strand.
Figure 2High-resolution structure of a cruciform (four-way junction) formed by an inverted repeat DNA sequence. Images show the X-ray crystallographic structure determined at 2.10 Å for DNA with the sequence 5′-CCGGTACCGG-3′ (1DCW) [37]. The DNA alone forms a four-way junction in a stacked-X conformation, in which duplexes coaxially stack, with each pair of stacked duplexes related by +30° to +60° (right-handed) rotation. The continuous (least distorted relative to B-DNA) strands are coloured as green and red, while those of the crossing strands (making a tight U-turn) are coloured blue and cyan. In each panel, the images show the structure visualised via different axes viewpoints as indicated by the coloured squares. (A) The upper image provides a schematic view of the molecule, the distinct strands (in different colours), and their sequences, with arrows indicating the 3′-ends of the DNA strands. The lower image presents the high-resolution structure of 1DCW, illustrating its arrangement of base pairs. (B) The upper image views the structure down the helix axis of one pair of stacked duplexes, while the lower image views it from a rotational shift of approximately 90°. (C) The images zoom in on the central part of the structure (dashed bracketed region in (B)) to highlight the electrostatic interactions, particularly close to the Na+ ion at its centre. The lower image views the same face of the dyad axis shown in (B), and the upper image shows the opposite face of the axis, viewed from a rotational shift of approximately 180°.
Figure 3The occurrence of inverted repeat sequences in gene features as determined by bioinformatic analyses. An idealised gene and its regulatory sequences are shown, with UTR referring to “untranslated regions”. A relative abundance (+) or depletion (−) of inverted repeats in the indicated genomes is highlighted above and below the idealised gene, respectively. For E. coli and S. cerevisiae, inverted repeats with a stem length from 5 bp and a spacer length up to 8 bp were considered [50,51], while for H. sapiens and viruses from the Nidovirales order, inverted repeats with the stem length from 6–30 bp and spacer length up to 10 bp were taken into account [18,42].
Proteins involved in interactions with cruciform structures. TF = transcription factor, chromatin AP = chromatin-associated protein. Adapted from [15]. * If no reference is listed for an entry, see [15] for further details.
| Protein | Source | Function | Reference * |
|---|---|---|---|
|
| Eukaryotes | Replication, DNA repair, TF | [ |
|
| Coccinia virus | Junction-resolving enzyme | |
|
|
| TF | |
|
|
| Replication, DNA repair, TF | |
|
| Mammals | Chromatin AP, DNA repair, TF | [ |
|
| Archaea, Bacteria | Endonuclease, defence response to virus | [ |
|
| Yeast | Junction-resolving enzyme | [ |
|
|
| DNA repair | [ |
|
| Mammals | Chromatin AP, replication, DNA repair | [ |
|
| Eukaryotes | DNA repair | |
|
|
| DNA repair, stress response | [ |
|
| Phage T7 | Junction-resolving enzyme | [ |
|
| Phage T4 | Junction-resolving enzyme | |
|
| Mammals | TF | |
|
| Vertebrates | Junction-resolving enzyme | [ |
|
| Plants | Replication, stress response | |
|
| all | Replication | [ |
|
| Archaea | Junction-resolving enzymes | |
|
| all | Chromatin AP, DNA repair, TF | |
|
|
| DNA Repair | |
|
|
| Replication | [ |
|
|
| Viral DNA recognition | [ |
|
| all | Junction-resolving enzyme | |
|
| Vertebrates | Junction-resolving enzyme | [ |
|
|
| Replication | |
|
| Mammals | Junction-resolving enzyme | [ |
|
| Eukaryotes | Junction-resolving enzyme | |
|
|
| Junction-resolving enzyme | [ |
|
| Eukaryotes | Junction-resolving enzyme | |
|
| DNA repair, TF | [ | |
|
| DNA repair, TF | [ | |
|
| DNA repair, TF | [ | |
|
| Eukaryotes | Chromatin AP | [ |
|
| Eukaryotes | Chromatin AP | [ |
|
| Eukaryotes | Chromatin AP | [ |
|
| G+ bacteria | Junction-resolving enzyme | |
|
| Bacteria | Replication | [ |
|
| Mammals | DNA repair, TF | [ |
|
| Yeast | DNA repair, TF | |
|
|
| Junction-resolving enzyme | |
|
|
| Junction-resolving enzyme | |
|
|
| Replication | |
|
|
| Junction-resolving enzyme | [ |
|
|
| DNA repair, TF | |
|
| Eukaryotes | Chromatin AP | |
|
| Eukaryotes | Chromatin AP | [ |
|
|
| Junction-resolving enzyme | |
|
| Baculoviruses | Replication | |
|
|
| Replication | |
|
| Eukaryotes | Junction-resolving enzyme | [ |
|
|
| Junction-resolving enzyme | |
|
|
| Junction-resolving enzyme | [ |
Figure 4Cellular processes influenced by cruciform structures.