| Literature DB >> 28180276 |
Elisé P Wright1, Julian L Huppert2, Zoë A E Waller1,3.
Abstract
i-Motifs are alternative DNA secondary structures formed in cytosine-rich sequences. Particular examples of these structures, traditionally assumed to be stable only at acidic pH, have been found to form under near-physiological conditions. To determine the potential impact of these structures on physiological processes, investigation of sequences with the capacity to fold under physiological conditions is required. Here we describe a systematic study of cytosine-rich DNA sequences, with varying numbers of consecutive cytosines, to gain insights into i-motif DNA sequence and structure stability. i-Motif formation was assessed using ultraviolet spectroscopy, circular dichroism and native gel electrophoresis. We found that increasing cytosine tract lengths resulted in increased thermal stability; sequences with at least five cytosines per tract folded into i-motif at room temperature and neutral pH. Using these results, we postulated a folding rule for i-motif formation, analogous to (but different from) that for G-quadruplexes. This indicated that thousands of cytosine-rich sequences in the human genome may fold into i-motif structures under physiological conditions. Many of these were found in locations where structure formation is likely to influence gene expression. Characterization of a selection of these identified i-motif forming sequences uncovered 17 genomic i-motif forming sequence examples which were stable at neutral pH.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28180276 PMCID: PMC5605235 DOI: 10.1093/nar/gkx090
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Library of oligonucleotides sequences and data for their melting temperature (), annealing temperature () and transitional pH (pH)
| Sequence 5΄ - 3΄ | Bases | Notation | pH 5.5 | pH 7.4 |
| ||
|---|---|---|---|---|---|---|---|
|
|
|
|
| ||||
| TAA(C3)4 | 24 | hTeloC | 43 ± 0.0 | 41 ± 0.6 | nd | nd | 6.5 |
| C(T3C)3 | 13 | C1T3 | nd | nd | nd | nd | nd |
| C2(T3C2)3 | 17 | C2T3 | 27.1 ± 1.0 | 26.2 ± 0.6 | nd | nd | 6.1 |
| C3(T3C3)3 | 21 | C3T3 | 46.3 ± 1.8 | 45.5 ± 0.6 | 7 ± 0.6 | 6 ± 0.0 | 6.7 |
| C4(T3C4)3 | 25 | C4T3 | 56.4 ± 1.2 | 55.6 ± 0.6 | 15.8 ± 0.7 | 8.4 ± 0.2 | 7.1 |
| C5(T3C7)5 | 29 | C5T3 | 61.4 ± 0.6 | 60.6 ± 0.0 | 26.2 ± 1.8 | 6.7 ± 0.6 | 7.2 |
| C6(T3C6)3 | 33 | C6T3 | 65.5 ± 0.6 | 63.6 ± 0.0 | 9.1 ± 3.1/31.3 ± 0.6 | 9.7 ± 0.5 | 6.8 |
| C7(T3C7)3 | 37 | C7T3 | 68.5 ± 0.6 | 64.7 ± 0.0 | 12.8 ± 1.2/32 ± 1.2 | 8.7 ± 1.0 | 7.4 |
| C8(T3C8)3 | 41 | C8T3 | 72.6 ± 0.0 | 64.7 ± 0.0 | 18.8 ± 0.8/35 ± 2.0 | 13.8 ± 0.7 | 7.1 |
| C9(T3C9)3 | 45 | C9T3 | 75.6 ± 1.8 | 65.7 ± 0.6 | 20.8 ± 1.8/35 ± 1.7 | 12.1 ± 0.6 | 7.3 |
| C10(T3C10)3 | 49 | C10T3 | 66 ± 0.7/77 ± 2.7 | 65 ± 0.4 | 21.9 ± 1.9/41.1 ± 0.2 | 18.2 ± 0.0 | 7.3 |
| C5(T1C5)3 | 23 | C5T1 | 60.6 ± 1.5 | 59.6 ± 0.0 | 15.8 ± 0.2 | 14.1 ± 0.6 | 6.9 |
| C5(T2C5)3 | 26 | C5T2 | 60.6 ± 0.6 | 60.6 ± 0.0 | 25.2 ± 0.6 | 16.1 ± 0.0 | 7.1 |
| C5(T4C5)3 | 32 | C5T4 | 63.3 ± 0.2 | 60.6 ± 0.6 | 29.3 ± 0.0 | 5.3 ± 0.6 | 6.7 |
Thermal and pH stability of the genomic i-motif candidate oligonucleotides used in this study
| pH 7.0 | |||||
|---|---|---|---|---|---|
| Notation | Bases | Sequence 5′ - 3′ |
|
|
|
| AC017019.1 | 28 | CCC-CCC-TCC-CCC-CCT-CCC-CCC-TCC-CCC-C | 27.9 ± 0.6 | 18.0 ± 0.2 | 7.1 |
| AC018878.3 | 26 | CCC-CCA-CCC-CCA-GCC-CCC-TTT-CCC-CC | 18.1 ± 0.2 | 7.5 ± 0.5 | 7.1 |
| ATXN2L | 24 | CCC-CCC-CCC-CCC-CCC-CCC-CCC-CCC | 23.7 ± 1.0 | 22.5 ± 0.6 | 7.0 |
| CAMK2G | 50 | CCC-CCA-GGC-CCC-GCC-AGT-CCC-CCC-CCC-CGC-CCG-GCC-CCC-GGC-CCG-CCC-CC | 13.0 ± 1.2 | 11.3 ± 0.7 | 6.9 |
| DAP | 29 | CCC-CCG-CCC-CCG-CCC-CCG-CCC-CCG-CCC-CC | 24.7 ± 0.5 | 22.0 ± 0.4 | 7.0 |
| DRP2 | 70 | CCC-CCT-CTT-CCC-CTC-TCC-CCC-TCT-CCC-CCT-CTC-TCC-CTC-TTC-CCC-CTC-TCC-TTG-TCT-CCTTCT-CTC-CCC-C | 4.5 ± 0.7 | 6.5 ± 0.7 | 6.0 |
| DUX4L22 | 41 | CCC-CCG-AAA-CGC-GCC-CCC-CTC-CCC-CCT-CCC-CCC-TCT-CCC-CC | 29.2 ± 0.2 | 14.2 ± 0.2 | 7.1 |
| GH2 | 42 | CCC-CCA-CCC-CCA-CCC-CCA-TCC-CCA-CGC-CCC-GCC-CCC-GCC-CCC | 22.7 ± 1.0 | 15.2 ± 0.9 | 7.1 |
| HIC2 | 74 | CCC-CCG-GGA-CAG-GGA-CCC-TGG-CCC-CCC-CCG-ACA-GGC-TGA-CGC-CCA-CCC-CCT-CAA-ACT-CTG-GTG-GAC-TTA-CCC-CC | 7.5 ± 2.1 | 7.8 ± 2.0 | 6.4 |
| HOXC10 | 24 | CCC-CCA-CCC-CCA-CCC-CCA-CCC-CCC | 17.7 ± 0.8 | 14.0 ± 0.2 | 7.1 |
| HOXD10 | 26 | CCC-CCC-CCC-CCT-CCC-CCG-CGG-CCC-CC | 10.2 ± 0.9 | 5.2 ± 0.2 | 7.1 |
| JAZF1 | 31 | CCC-CCC-CCG-CCC-CCG-CCC-CCG-CCC-TCC-CCC-C | 20.4 ± 0.4 | 18.5 ± 0.6 | 7.1 |
| MSMO1 | 23 | CCC-CCG-CCC-CCG-CCC-CCG-CCC-CC | 16.6 ± 0.5 | 15.9 ± 0.4 | 6.7 |
| NFATC1 | 45 | CCC-CCG-TTT-CCC-CCG-CCA-GCC-CCA-GCG-CCC-CCC-TGC-CCG-GCC-CCC | 23.2 ± 0.0/28.8 ± 0.5 | 18.8 ± 0.6 | 7.1 |
| PIM1 | 45 | CCC-CCG-ACG-CGC-CCC-CCA-ACA-CAC-AAA-CCC-CCA-GAA-TCC-GCC-CCC | 29.4 ± 1.9 | 5.1 ± 0.2 | 7.0 |
| PLCB2 | 36 | CCC-CCG-CCT-CTT-CTG-GAG-GCC-CCC-GCC-CCC-ACC-CCC | 15.0 ± 0.2 | 13.2 ± 0.2 | 7.0 |
| QSOX1 | 25 | CCC-CCG-CCC-CCG-AGC-CCC-CGC-CCC-C | 20.1 ± 0.2 | 11.9 ± 0.4 | 7.1 |
| RAE1 | 116 | CCC-CCC-GCC-CCC-CCC-GCC-CCC-CCG-CGC-CGC-CCC-CCC-CCG-CCC-CCC-GCC-CCC-GTC-CCC-CCG-CCC-CCC-CCG-CCC-CCC-CCG-CCC-CCC-GTC-CCC-CCG-CCC-CCC-CGC-CCC-CCC-GTC-CCC-CC | 27.1 ± 7.4 | 13.6 ± 6.3 | 6.8 |
| RUNX1–1 | 31 | CCC-CCC-CCG-CAC-CCC-TTC-CCC-CGG-CCC-CCC-C | 14.3 ± 0.9/25.0 ± 0.4 | 9.8 ± 0.4 | 6.7 |
| RUNX1–2 | 36 | CCC-CCC-TCC-CCC-TGC-CTC-TCC-CTC-CCC-CCT-TTC-CCC | 13.2 ± 0.2/24.4 ± 0.2 | 10.1 ± 0.0 | 6.5 |
| RUNX1–3 | 32 | CCC-CCC-TTT-CCC-CTG-CCC-CCC-CTG-CCT-CCC-CC | 10.7 ± 0.6/26.2 ± 0.0 | 9.7 ± 0.6 | 6.7 |
| SHANK1b | 28 | CCC-CCC-TCC-CCC-CAC-CCC-CCA-CCC-CCC-C | 22.5 ± 0.6 | 12.0 ± 0.0 | 7.1 |
| SHANK3 | 80 | CCC-CCG-CCT-CCG-GCG-CAG-CCC-CCT-CGC-CAC-CCC-CGC-TTC-CCT-CCC-GTC-TCA-GGC-CCC-CTC-CCC-CCG-CCG-CCC-CCG-CCC-CC | 18.3 ± 1.9 | 5.6 ± 1.0 | 6.6 |
| SHANK3b | 79 | CCC-CCC-GCA-CCG-AGG-CCT-AGG-ACT-CCC-CCC-CCC-AAC-CCC-GTC-ACA-GCC-CCC-CAG-ACC-CCC-GCC-CCG-TGG-CTC-GGC-CCC-C | 12.6 ± 3.6 | 4.8 ± 0.7 | 6.5 |
| SNORD112 | 36 | CCC-CCC-CCC-GCC-CCC-CAC-CCC-CCC-ACC-CCC-CCC-CCC | 25.8 ± 0.5 | 15.9 ± 0.4 | 7.2 |
| SOX1 | 57 | CCC-CCT-GCA-GGC-CCC-CCT-GCG-CCT-CCC-CCC-CCC-CGC-CAC-TGG-CGC-CTG-GCT-TCC-CCC | 9.0 ± 0.2 | 6.5 ± 0.5 | 6.9 |
| STX17 | 33 | CCC-CCG-CCC-CCG-CCC-CCG-CCC-CGC-AGG-GCC-CCC | 19.5 ± 0.6 | 15.0 ± 0.2 | 7.0 |
| Tandem Repeat (LA16c-OS12.2) | 57 | CCC-CCC-GTG-TCG-CTG-TTC-CCC-CCG-TGT-CGC-TGT-TCC-CCC-CGT-GTC-GCT-GTT-CCC-CCC | 9.4 ± 1.5/31.6 ± 0.6 | 6.7 ± 0.6 | 6.6 |
| TRABD | 23 | CCC-CCG-CCC-CCC-CCC-CCC-CCC-CC | 21.3 ± 0.2 | 19.3 ± 0.2 | 6.9 |
| WNT7A | 48 | CCC-CCG-CCC-CTC-CCT-CCT-TTC-CCC-CGT-CCC-TCC-CCC-GCC-CCC-TCC-CCC | 22.7 ± 2.5 | 16.1 ± 0.0 | 7.1 |
| ZBTB7B | 58 | CCC-CCC-ATC-CCT-CCC-CTC-CCT-CCC-CCC-GCC-CCT-GCC-ACC-CCC-CAA-ACT-CCC-CCC-CCC-C | 25.5 ± 1.3 | 10.0 ± 0.2 | 7.1 |
| ZFP41 | 52 | CCC-CCA-GCC-CCC-GCC-GAC-CCC-CAG-CTC-CCG-CCT-CCG-CCG-ACC-CCC-AGC-CCC-C | 21.7 ± 0.7/35.6 ± 0.4 | 17.4 ± 0.4 | 7.0 |
| ZNF480 | 23 | CCC-CCG-CCC-CCG-CCC-CCG-CCC-CC | 17.1 ± 0.0 | 15.2 ± 0.2 | 6.7 |
Figure 1.ODN stability with increasing cytosine tract length. ODNs (2.5 μM) were annealed in 10 mM sodium cacodylate with 100 mM sodium chloride at the indicated pH. + pH 5.5 Tm1; Δ pH 5.5 Tm2 (10 cytosine tract only); o pH 5.5 Ta; pH 7.4 Tm1; × pH 7.4 Tm2; and ■ pH 7.4 Ta.
Figure 2.The effects of cytosine tract length (A) and loop length (B) on hysteresis. ODNs (2.5 μM) were annealed in 10 mM sodium cacodylate with 100 mM sodium chloride at the indicated pH. (A) × Hysteresis at pH 5.5; o Secondary hysteresis at pH 5.5 (10 cytosine tract only); ■ Hysteresis at pH 7.4; Secondary hysteresis at pH 7.4. (B) Hysteresis at pH 5.5 (▪) and at pH 7.4 (•) in ODNs containing tracts of 5 cytosines with increasing lengths of thymine loops.
Figure 3.The thermal difference spectra calculated between 95 and 4°C of each of the oligonucleotides (ODNs) at pH 5.5 (A) and 7.4 (B). ODNs (2.5 μM) were annealed in 10 mM sodium cacodylate with 100 mM sodium chloride at the indicated pH.
Figure 4.Room temperature native PAGE of the model ODNs (10 μM in 10 mM sodium cacodylate at pH 7.4) with increasing tract lengths at pH 7.4. Acrylamide gels (20%) were buffered with 50 mM tris, 50 mM HEPES at pH 7.4. ODNs were annealed at pH 7.4 as described above. The samples were loaded onto the gel for electrophoresis at 50 V for 5 h. Lane contents left to right are as follows: (1) C1T3; (2) C2T3; (3) C3T3; (4) C4T3; (5) C5T3; (6) Base pair ladder standard; (7) C6T3; (8) C7T3; (9) C8T3; (10) C9T3; (11) C10T3; (12) Base pair ladder standard.
Figure 5.Circular dichroism of (A) human telomeric i-motif; (B) C2T3; (C) C5T3 and (D) C10T3. pH 4.0; pH 4.5; pH 5.0; pH 5.5; pH 6.0; pH 6.5; pH 7.0; pH 7.5; pH 8.0. All ODN concentrations were 10 μM in 10 mM sodium cacodylate buffer with 100 mM sodium chloride buffer at the required pH.
Figure 6.The relationship between total loop length (the sum of all loop bases) and transitional pH. Transitional pH was calculated from fitting the CD data for all pH at 288 nm and identifying the inflection point.
Figure 7.The relationship between total loop length (the sum of all loop bases) and melting temperature (Tm). Tm was calculated from the maxima of the first derivative of UV-melt data.