| Literature DB >> 23418514 |
Tomoko Sunami1, Hidetoshi Kono.
Abstract
When a protein binds to DNA, a conformational change is often induced so that the protein will fit into the DNA structure. Therefore, quantitative analyses were conducted to understand the conformational changes in proteins. The results showed that conformational changes in DNA interfaces are more frequent than in non-interfaces, and DNA interfaces have more conformational variations in the DNA-free form. As expected, the former indicates that interaction with DNA has some influence on protein structure. The latter suggests that the intrinsic conformational flexibility of DNA interfaces is important for adjusting their conformation for DNA. The amino acid propensities of the conformationally changed regions in DNA interfaces indicate that hydrophilic residues are preferred over the amino acids that appear in the conformationally unchanged regions. This trend is true for disordered regions, suggesting again that intrinsic flexibility is of importance not only for DNA binding but also for interactions with other molecules. These results demonstrate that fragments destined to be DNA interfaces have an intrinsic flexibility and are composed of amino acids with the capability of binding to DNA. This information suggests that the prediction of DNA binding sites may be improved by the integration of amino acid preference for DNA and one for disordered regions.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23418514 PMCID: PMC3571985 DOI: 10.1371/journal.pone.0056080
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Data set preparation.
(a) Workflow for obtaining DBbound, DBfree, and DBfree>2. (b) Schematic diagram for the preparation of a set of non-redundant clusters.
Figure 2Structure of the eleven fragments.
(a) Ten fragments (assigned to A to J) of the 4-residue-long structural alphabet library and a fragment with missing coordinates (assigned to Y). (b) Similarity of the ten alphabet structures calculated using cRMS as the distance with the Fitch in Phylip package. Fragment structures were drawn with Pymol [42].
Database composition.
| Cluster ID | Representative PDB ID(chain) | Representative PDB ID(chain) | Molecular name | Used for DBfree(≥2)
|
|
| 1aqi(A) | 2ih2(A) | Modification methylase TaqI | ✓ |
|
| 1aro(P) | 1cez(A) | Bacteriophage T7 RNA polymerase | |
|
| 1az3(A) | 1sx5(A) | Type II restriction enzyme EcoRV | ✓ |
|
| 1b24(A) | 2vs7(A) | Homing endonuclease I-DmoI | |
|
| 1bam(A) | 3bam(A) | Restriction endonuclease BamHI | |
|
| 1baz(A) | 1bdt(A) | Gene-regulating protein Arc | ✓ |
|
| 1bjt(A) | 3l4j(A) | DNA topoisomerase 2 | ✓ |
|
| 1bm9(A) | 1f4k(A) | Replication termination protein | ✓ |
|
| 1bpe(A) | 2fmp(A) | DNA polymerase β | |
|
| 1ci4(A) | 2bzf(A) | Barrier-to-autointegration factor | ✓ |
|
| 1ci6(B) | 1h89(A) | CAAT/enhancer binding protein β | |
|
| 1cmb(A) | 1mjo(A) | Methionine repressor | ✓ |
|
| 1eaq(A) | 1h9d(A) | Core-binding factor α subunit1 | ✓ |
|
| 1es8(A) | 1dfm(A) | Endonuclease BglII | |
|
| 1eto(A) | 3jrh(A) | DNA-binding protein fis | ✓ |
|
| 1ev7(A) | 1iaw(A) | Type II Restriction enzyme NAEI | ✓ |
|
| 1evx(A) | 1a73(A) | INTRON 3 (I-Ppo) ENCODED ENDONUCLEASE | ✓ |
|
| 1f9f(A) | 1jj4(A) | Regulatory protein E2 | ✓ |
|
| 1fc3(A) | 1lq1(A) | Stage 0 sporulation protein A | ✓ |
|
| 1fr2(B) | 1v15(A) | Colicin E9 | ✓ |
|
| 1fvi(A) | 2q2t(A) | Chlorella virus DNA ligase | ✓ |
|
| 1fx7(A) | 1u8r(A) | Iron-dependent repressor IdeR | ✓ |
|
| 1gxq(A) | 1gxp(A) | Phosphate regulon transcriptional regulatory protein | |
|
| 1hmy(A) | 2c7p(A) | Modification methylase HhaI | ✓ |
|
| 1hw5(A) | 1zrf(A) | Catabolite gene activator | ✓ |
|
| 1ih7(A) | 3nae(A) | DNA polymerase | ✓ |
|
| 1ii7(A) | 3dsd(B) | DNA double-strand break repair protein Mre11 | ✓ |
|
| 1ikn(A) | 2ram(A) | Transcription factor NF-κB p65 | ✓ |
|
| 1jbg(A) | 1r8d(A) | Transcription activator MtaN | |
|
| 1jeq(A) | 1jey(A) | Ku70 | |
|
| 1jeq(B) | 1jey(B) | Ku80 | |
|
| 1jg7(A) | 1m5r(A) | DNA β-glucosyltransferase | ✓ |
|
| 1jhg(A) | 1tro(A) | Trp repressor | ✓ |
|
| 1jih(A) | 3mfi(A) | DNA polymerase η | ✓ |
|
| 1jjh(A) | 2bop(A) | E2 | ✓ |
|
| 1jye(A) | 1efa(A) | Lac repressor | ✓ |
|
| 1k0z(A) | 3pvi(A) | PvuII endonuclease | ✓ |
|
| 1ko9(A) | 1m3q(A) | 8-Oxoguanine DNA glycosylase | |
|
| 1ku3(A) | 1rio(H) | Sigma factor SigA | |
|
| 1mij(A) | 1xpx(A) | Protein prospero | |
|
| 1mml(A) | 3fsi(A) | MMLV Reverse transcriptase domain | ✓ |
|
| 1mpg(A) | 3cw7(A) | DNA-3-methyladenine glycosylase 2 | ✓ |
|
| 1mug(A) | 1mwi(A) | G/U mismatch-specific DNA glycosylase | |
|
| 1okr(A) | 1sax(A) | Methicillin resistance regulatory protein mecI | ✓ |
|
| 1ouo(A) | 1oup(A) | Vibrio vulnificus nuclease | |
|
| 1owl(A) | 1tez(A) | Deoxyribodipyrimidine photolyase | ✓ |
|
| 1p7i(A) | 2hdd(A) | Engreiled homeodomain | ✓ |
|
| 1q0s(A) | 1yf3(A) | DNA adenine methylase | |
|
| 1q3b(A) | 1k3x(A) | Endonuclease VIII | ✓ |
|
| 1q8i(A) | 3k59(A) | DNA polymerase II | ✓ |
|
| 1qht(A) | 2vwj(A) | Thermococcus gorgonarius DNA polymerase | ✓ |
|
| 1qtw(A) | 2nq9(A) | Endonuclease 4 | ✓ |
|
| 1r69(A) | 1per(L) | 434 repressor | |
|
| 1sdo(A) | 2p0j(A) | BstYI | |
|
| 1tzy(A) | 1kx5(C) | Histone H2A.1 | ✓ |
|
| 1tzy(B) | 1kx5(D) | Histone H2B.2 | ✓ |
|
| 1tzy(C) | 1kx5(A) | Histone H3 | ✓ |
|
| 1tzy(D) | 1kx5(B) | Histone H4 | ✓ |
|
| 1vhi(A) | 1b3t(A) | Nuclear protein EBNA1 | ✓ |
|
| 1vok(A) | 1qna(A) | Transcription initiation factor TFIID-1 | ✓ |
|
| 1vsr(A) | 1odg(A) | DNA mismatch endonuclease | |
|
| 1w9h(A) | 2w42(A) | Archaeal Piwi protein | |
|
| 1wtd(A) | 1wte(A) | EcoO109IR | ✓ |
|
| 1xhx(A) | 2pyj(A) | phi29 DNA polymerase | ✓ |
|
| 1xv5(A) | 1y8z(A) | DNA α-glucosyltransferase | |
|
| 1xwl(A) | 2hhv(A) | DNA Polymerase I | |
|
| 1ynm(A) | 2fkc(A) | R.HinP1I restriction endonuclease | |
|
| 1z91(A) | 1z9c(A) | Organic hydroperoxide resistance transcriptional regulator | |
|
| 1zbf(A) | 3ey1(A) | Ribonuclease H | |
|
| 2a40(B) | 2dnj(A) | Deoxyribonuclease I | ✓ |
|
| 2a6m(A) | 2vih(A) | Transposase ORFA | ✓ |
|
| 2aud(A) | 2gig(A) | Type II restriction enzyme HincII | |
|
| 2bnk(A) | 2c5r(A) | Early protein p16.7 | ✓ |
|
| 2ckx(A) | 2qhb(A) | Telomere binding protein TBP1 | |
|
| 2cpg(A) | 1b01(A) | Transcriptional repressor CopG | ✓ |
|
| 2d3y(A) | 2dp6(A) | Uracil-DNA glycosylase | |
|
| 2dt5(A) | 3ikt(A) | Redox-sensing transcriptional repressor rex | ✓ |
|
| 2end(A) | 2fcc(A) | Endonuclease V | ✓ |
|
| 2f4q(A) | 3m4a(A) | Deinococcus radiodurans Type IB DNA topoisomerases | |
|
| 2fip(A) | 2fio(A) | Late genes activator | ✓ |
|
| 2fok(A) | 1fok(A) | FokI restriction endonuclease | ✓ |
|
| 2frh(A) | 1fzp(B) | Staphylococcal accessory regulator A | ✓ |
|
| 2fuf(A) | 2itl(A) | Large T antigen | ✓ |
|
| 2gpe(A) | 2rbf(A) | Bifunctional protein putA | ✓ |
|
| 2gxg(A) | 3gfi(A) | ST1710 | ✓ |
|
| 2hts(A) | 3hts(B) | Kluyveromyces lactis heat shock transcription factor | |
|
| 2iru(A) | 2r9l(A) | Polymerase Domain from Mycobacterium tuberculosis Ligase D | ✓ |
|
| 2nov(A) | 3k9f(A) | DNA topoisomerase 4 subunit A | ✓ |
|
| 2oa9(A) | 2oaa(A) | R.MvaI | ✓ |
|
| 2odh(A) | 2odi(B) | R.BcnI | |
|
| 2ofk(A) | 2ofi(A) | 3-Methyladenine DNA glycosylase I, constitutive | ✓ |
|
| 2ore(D) | 2g1p(A) | DNA adenine methylase | ✓ |
|
| 2p5k(A) | 2p5l(C) | Arginine repressor | |
|
| 2po4(A) | 3c2p(A) | Virion RNA polymerase | |
|
| 2qsf(A) | 2qsh(A) | DNA repair protein RAD4 | |
|
| 2rdi(A) | 1jx4(A) | DNA polymerase IV (family Y) | ✓ |
|
| 2v1x(A) | 2wwy(A) | ATP-dependent DNA helicase Q1 | ✓ |
|
| 2ve8(A) | 2ve9(A) | DNA translocase FtsK | ✓ |
|
| 2vke(A) | 1qpi(A) | Tetracycline repressor | ✓ |
|
| 2wcw(A) | 2wiw(B) | Hjc | ✓ |
|
| 2wiu(B) | 3dnv(B) | HTH-type transcriptional regulator HipB | ✓ |
|
| 2x6u(A) | 2x6v(A) | T-box transcription factor TBX5 | |
|
| 2yve(A) | 2yvh(A) | Transcriptional regulator | ✓ |
|
| 2zd1(A) | 3kk1(A) | Reverse transcriptase p66 subunit | ✓ |
|
| 2zkg(A) | 3fde(A) | E3 ubiquitin-protein ligase UHRF1 | ✓ |
|
| 2znz(A) | 2e1c(A) | Putative HTH-type transcriptional regulator PH1519 | ✓ |
|
| 3a45(A) | 3a46(A) | Formamidopyrimidine-DNA glycosylase | ✓ |
|
| 3bqz(A) | 1jt0(A) | Hypothetical transcriptional regulator in QACA 5′region | ✓ |
|
| 3bvq(A) | 3c25(A) | NotI restriction endonuclease | ✓ |
|
| 3bvs(A) | 3jxy(A) | Alkylpurine DNA glycosylase AlkD | |
|
| 3d06(A) | 3igk(A) | Cellular tumor antigen p53 | ✓ |
|
| 3d1g(A) | 3bep(A) | DNA polymerase III subunit β | ✓ |
|
| 3e5u(A) | 3e6c(C) | Cyclic nucleotide-binding protein | ✓ |
|
| 3ei3(B) | 3ei2(B) | DNA damage-binding protein 2 | |
|
| 3f0z(A) | 3i0w(A) | 8-Oxoguanine-DNA-glycosylase | ✓ |
|
| 3fci(A) | 1emh(A) | Uracil-DNA glycosylase | ✓ |
|
| 3fhf(A) | 3knt(A) | N-glycosylase/DNA lyase | |
|
| 3g91(A) | 3g00(A) | Exodeoxyribonuclease | ✓ |
|
| 3gn5(A) | 3o9x(A) | Uncharacterized HTH-type transcriptional regulator ygiT | ✓ |
|
| 3gva(A) | 3gx4(X) | Alkyltransferase-like protein 1 | ✓ |
|
| 3gz5(A) | 3gz6(A) | MutT/nudix family protein | ✓ |
|
| 3hd0(A) | 2w36(A) | Endonuclease V | ✓ |
|
| 3i3q(A) | 3o1t(A) | Alpha-ketoglutarate-dependent dioxygenase AlkB | ✓ |
|
| 3iao(A) | 1r8e(A) | Multidrug-efflux transporter regulator | |
|
| 3lsj(A) | 3lsr(A) | DesT | ✓ |
|
| 3mx1(A) | 3mx4(A) | Eco29kIR |
A member which is reported with the highest resolution is shown as a representative. Entire lists of PDB ID (chain ID) of the members were shown in Table S2.
The molecular names were extracted from the PDB headers of DNA-bound forms. If the molecular names in the headers do not describe the molecule (e.g. Putative protein), the molecular names were extracted from the literatures.
✓ indicates that the clusters were used for DBfree(≥2).
Figure 3Conformational changes and variations of fragments.
(a) Frequencies of conformational change upon DNA binding for DNA interfaces and non-interfaces. (b) Frequencies of conformational variation in the DNA-free forms for DNA interfaces and non-interfaces. Error bar indicates the 85% bootstrap confidence interval.
Figure 4Propensities of the structural alphabets in the DNA interfaces vs. non-interfaces.
(a) Conformational change upon DNA binding. (b) Conformational variation in the DNA-free form. Error bar indicates the 85% bootstrap confidence interval.
Figure 5Propensities of the amino acid residues.
(a) Conformationally changed fragments vs. conformationally unchanged fragments in DNA interfaces (filled bar) and non-interfaces (open). A positive value indicates that frequency that amino acid is observed in conformationally changed fragments is higher than that in conformationally unchanged fragments. (b) DNA interfaces vs. non-interfaces for conformationally changed fragments (filled) and conformationally unchanged fragments (open). A positive value indicates that frequency that amino acid is observed in DNA interfaces is higher than that in non-interfaces. Error bar indicates the 85% bootstrap confidence interval. The background colors denote the physico-chemical property of amino acids: hydrophobic is shown in pale green; polar in pink; basic in blue; acidic in red.
Figure 6Propensities of the structural alphabets with various conditions.
(a) Propensities of the structural alphabets that undergo a conformational change upon DNA binding. (b) Propensities of the structural alphabets that undergo an order-to-order conformational change upon DNA binding. (c) Frequencies of the structural alphabets that undergo a disorder-to-order conformational change (Y to one of A-J) upon DNA binding. (d) Propensities of the structural alphabets that undergo a disorder-to-order conformational change (Y to one of A-J) upon DNA binding. In (a) through (d), fragments located within DNA interfaces and those within non-interfaces are shown by filled and open bar, respectively. A positive value of the propensities indicates that frequency that alphabet is observed in DNA-bound forms is higher than that in free forms. Error bar indicates the 85% bootstrap confidence interval.