| Literature DB >> 29123983 |
Dhanusha Yesudhas1, Muhammad Ayaz Anwar1, Suresh Panneerselvam1, Han-Kyul Kim1, Sangdun Choi1.
Abstract
Transcription factors (TFs) are gene expression regulators that bind to DNA in a sequence-specific manner and determine the functional characteristics of the gene. It is worthwhile to study the unique characteristics of such specific TF-binding pattern in DNA. Sox2 recognizes a 6- to 7-base pair consensus DNA sequence; the central four bases of the binding site are highly conserved, whereas the two to three flanking bases are variable. Here, we attempted to analyze the binding affinity and specificity of the Sox2 protein for distinct DNA sequence patterns via steered molecular dynamics, in which a pulling force is employed to dissociate Sox2 from Sox2-DNA during simulation to study the behavior of a complex under nonequilibrium conditions. The simulation results revealed that the first two stacking bases of the binding pattern have an exclusive impact on the binding affinity, with the corresponding mutant complexes showing greater binding and longer dissociation time than the experimental complexes do. In contrast, mutation of the conserved bases tends to reduce the affinity, and mutation of the complete conserved region disrupts the binding. It might pave the way to identify the most likely binding pattern recognized by Sox2 based on the affinity of each configuration. The α2-helix of Sox2 was found to be the key player in the Sox2-DNA association. The characterization of Sox2's binding patterns for the target genes in the genome helps in understanding of its regulatory functions.Entities:
Keywords: Sox2; molecular dynamics; protein–DNA interaction; umbrella sampling
Year: 2017 PMID: 29123983 PMCID: PMC5666385 DOI: 10.1002/2211-5463.12316
Source DB: PubMed Journal: FEBS Open Bio ISSN: 2211-5463 Impact factor: 2.693
Figure 1Representation of bp positions for Sox2 binding patterns. The distinct DNA binding patterns for Sox2 protein obtained from Discovery Studio with bp positions numbered. The orange represents the DNA strands with its associated Sox2 indicated in cyan. The bp positions of positive control and mutant complexes are mentioned in black and green, respectively, with mutated bps positions marked in red.
Comparison of dissociation time, force, PMF, and the interactions restrained during dissociation of the complexes
| Complex | Dissociation time (ps) | Force, kcal·(mol·nm) −1 | PMF, kcal·mol−1 | Major interactions restrained during dissociation | C‐terminal interactions restrained during dissociation |
|---|---|---|---|---|---|
|
| 180 | 2000 | 126.815 | Arg15–DA19, Arg19–DT20, Ser34–DT4, Lys35–DT4 | Arg2–DA17, Lys4–DA18, Arg5–DT8, Arg5–DA9, Lys71–DT10, Arg75–DG11, Lys79–DA15 |
|
| 162 | 2150 | 113.65 | Arg2–DA17, Arg5–DT9, Asn8–DT6, Ser31–DT4, Ser34–DT4, Lys35–DT5, Lys71–DT10, Tyr72–DC16, Arg75–DC16 | Arg73‐DG11, Arg75‐DC16, Arg76‐DA15, Lys77‐DC16 |
|
| 160 | 1800 | 103.151 | Lys4–DC17, Arg5–DC8, Phe10–DA7, Arg19–DT19, Asn30–DG20, Ser31–D21 | Lys71–DT10, Tyr72–DA15, Arg76–DC14, Arg76–DT10, Lys77–DC16, Thr80–DT1 |
| F‐Mut‐GA | 190 | 1750 | 124.78 | Arg2–DA17, Arg5–DC16, Asn8–DT6, Phe10–DT5Ser34–DT5, Trp41–DT6 | Arg2–DC16, Arg2–DA17, Lys71–DT10, Arg75–DG11, Arg76–DA15, Lys77–DT2, Thr80–DT1 |
| C‐Mut‐AC | 150 | 1650 | 98.518 | Asn8–DT6, Arg18–DT5, Ser31–DC3, Ser31–DA4, Ser34–DT5, Trp41–DA7 | Arg5–DC8, Trp41–DA7, Tyr72–DT16, Arg73–DG11, Lys77–DA17 |
| C‐Mut‐GG | 150 | 1600 | 87.46 | Arg5–DT8, Arg5–DT8, Arg5–DG9, Trp41–DG7, Lys42–DG6, Lys71‐DT10 | Tyr70–DG9, Tyr72–DC16, Arg75–DC16, Lys77–DC16, Thr80–DT2 |
| C‐Mut‐GGAC | 180 | 2100 | 46.008 | Arg5–DC8, Asn5–DC8, Asn5–DA7, Trp41–DG6 | Lys71–DT10, Arg75–DT10, Arg75–DG11, Thr80–DT1, Thr80–DT2 |
| F‐Mut‐TT | 200 | 2000 | 123.7 | Arg2–DA17, Arg5–DT8, Arg5–DA9, Arg5–DC16, Arg15–DA18, Arg15–DA19, Arg15–DA20, Trp41–DT6 | Lys4–DA18, Arg60–DA9, Tyr72–DA18, Arg76–DT8, Arg76–DA9 |
Figure 2The dissociation pathway along with structural deformation during SMD analysis. (A) The dissociation pathway of Sox2 in the positive control complexes, showing the time and force required for dissociation. The increase in the distances between the protein and DNA during dissociation is also shown. (B) The dissociation pathway of Sox2 in complex with a mutant sequence showing the time and force required for dissociation. The increase in the distances between the protein and DNA during dissociation is also indicated. (C) General representation of the dissociation process showing a Sox2 dissociated from DNA and the distance at different stages of separation.
Figure 3Hydrogen‐bonded and nonbonded interactions between the protein and DNA during the dissociation. The changes in the number of bonded (hydrogen bond) and nonbonded interactions between the Sox2 and DNA for all the complexes during the process of dissociation. Black and red represent the numbers of hydrogen‐bonded and nonbonded interactions, respectively.
Figure 4Binding free energy profiles obtained from the PMF calculation. (A) A comparison of binding free energies of experimentally validated complexes. (B) A comparison of binding free energies of mutant complexes.
Figure 5Radial distribution functions of water molecules around DNA. (A) RDF values of the experimental complexes showing hydration shells. (B) RDF values of mutant complexes showing hydration shells.
Figure 6Stacking energies for DNA and its stability. (A) Stacking energies for experimentally validated DNA patterns showing higher values at fourth and fifth stacking bps. (B) Stacking energies for mutated DNA patterns showing increased stacking energies at fourth and fifth stacking bps in comparison with the positive control patterns. The stacking energies with its standard deviations are plotted in this figure.
Sox2‐binding patterns in the human and mouse genomes identified in searches based on our DNA patterns
| Complex | Pattern | Human | Mouse | ||
|---|---|---|---|---|---|
| Total number of genes matched | Genes with hit count greater than 1 | Total number of genes matched | Genes with hit count greater than 1 | ||
|
| ATTTGTA | 36 |
| 15 |
|
|
| CTTTGTG | 36 |
| 16 | – |
|
| CATTGTT | 36 | – | 6 | – |
| F‐Mut‐GA | GATTGTC | 12 | – | 7 | – |
| C‐Mut‐AC | CATTACG | 5 | – | – | – |
| C‐Mut‐GG | CAGGGTG | 36 | – | 9 | – |
| C‐Mut‐GGAC | CAGGAC | 2 | – | 2 | – |
| F‐Mut‐TT | TTTTGTA | 108 |
| 13 | – |