| Literature DB >> 11104519 |
N M Luscombe1, S E Austin, H M Berman, J M Thornton.
Abstract
On the basis of a structural analysis of 240 protein-DNA complexes contained in the Protein Data Bank (PDB), we have classified the DNA-binding proteins involved into eight different structural/functional groups, which are further classified into 54 structural families. Here we present this classification and review the functions, structures and binding interactions of these protein-DNA complexes.Entities:
Mesh:
Substances:
Year: 2000 PMID: 11104519 PMCID: PMC138832 DOI: 10.1186/gb-2000-1-1-reviews001
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
The groups of protein structures found in the dataset, the number of families within each group and the number of PDB files each family contains
| No. of proteins (PDB files) in group | |||||
| Protein group | No families in group | Prokaryote | Eukaryote | Viral | Total |
| 1. Helix-turn-helix | 16 * | 32 | 28 | - | 60 |
| 2. Zinc-coordinating | 4 | 0 | 23 | - | 23 |
| 3. Zipper-type | 2 | 0 | 10 | - | 10 |
| 4. Other α-helix | 7 | 1 | 5 | 2 | 8 |
| 5. β-sheet | 1 | 0 | 8 | - | 8 |
| 6. β-hairpin/ribbon | 6 | 10 | 1 | - | 11 |
| 7. Other | 2 | 0 | 8 | - | 8 |
| 8. Enzyme | 16 | 43 | 68 | 2 | 113 |
| Total | 54 | 86 | 151 | 4 | 241 † |
*Includes the two 'winged' helix-turn-helix families. †PDB entry 1a02 contains proteins belonging to the families 'zipper-type' and 'other'.
List of the 240 structures of protein-DNA complexes in the dataset.
| PDB code | DNA-binding subunits | Protein name | Species | Resolution /Å | DNA sequence |
| 1lmb* | 3,4 | Repressor | Phage λ | 1.8 | -AATACCACTGGCGGTGATATTATAT-CACCGCCAGTGGTAT- |
| 1lli | A,B | Repressor mutant | Phage λ | 2.1 | -AATACCACTGGCGGTGATATTATAT-CACCGCCAGTGGTAT- |
| 1per | L,R | Repressor | Phage 434 | 2.5 | AAGTACAGTTTTTCTTG-TATTATA--CAAGAAAAACTGTACT |
| 1rpe | L,R | Repressor | Phage 434 | 2.5 | -TATACAATGTATCTTG-TTTGACAAACAAGATACATTGTAT- |
| 2or1 | L,R | Repressor | Phage 434 | 2.5 | AAGTACAAACTTTCTTG-TATTATA--CAAGAAAGTTTGTACT |
| 3cro | L,R | Cro | Phage 434 | 2.5 | AAGTACAAACTTTCTTG-TATTATA--CAAGAAAGTTTGTACT |
| 6cro | A | Cro | Phage λ | 3.0 | AAGTACAAACTTTCTTG-TATTATA-CAAGAAAGTTTGTACT |
| 3orc | A | Cro | Phage λ | 3.0 | AAGTACAAACTTTCTTG-TATTATA--CAAGAAAGTTTGTACT |
| 1fjl*† | A,B,C,G | Paired protein | 2.0 | AATAATCTGATTACTGTAATCAGATTAT | |
| 1hdd | C,D | Engrailed homeodomain | 2.8 | --TTTTGCCATGTAATTACCTAAATTAGGTAATTACATGGCAAA | |
| 1apl | C,D | Mat α-2 | 2.7 | --ACATGTAATTCATTTACACGCTGCGTGTAAATGAATTACATG | |
| 1yrn | A,B | Mat a-1/α-2 | 2.5 | -TACATGTAATTTATT-ACATCATATGATGTAATAAATTACATG | |
| 1au7 | A1(5-76), | Pit-1 POU domain | 2.3 | TTGCGTAGCGTTACGTATATTCCGCTAATCGAT | |
| A2(103-160), | |||||
| B1(5-75), | |||||
| B2(102-161) | |||||
| 1oct | C1(5-75) | Oct-1 POU domain | 3.0 | TGTATTGCAAATAAGGACCTTATTTGCATAC | |
| C1(102-161) | |||||
| 2hdd | A,B | Engrailed homeodomain | 1.9 | TGTTTTTGATAAGATCTTATCAAAAAC | |
| 3hdd | A,B | Engrailed homeodomain | 2.2 | --TTTTGCCATGTAATTACCTAAATTAGGTAATTACATGGCAAA | |
| 9ant | A,B | Antennapedia homeodomain | 2.4 | --TTTTGCCATGTAATTACCTAAATTAGGTAATTACATGGCAAA | |
| 6pax | A | Paired domain | 2.5 | GATGACCTAATAGGGAAAATAACACGAA | |
| 1akh | A,B | Mat a-1/α-2 | 2.5 | TACATGTAAAAATTACATCATATGA | |
| 1b72 | A,B | Homeodomain | 2.35 | ACTCTATGATTGATCGTAATGCGCAAACG | |
| 1b8I | A,B | Homeodomain | 2.4 | TTTACGTTTAAAAGCTTAATAAACG | |
| 1mnm | C,D | Mat α-2 | 2.25 | GATTACCTAATAGGGAAATTTACACG | |
| *** | |||||
| 1wet*† | A,C | Purine repressor | 2.6 | AACGAAAACGTTTTCGT | |
| 1bdh† | A,C | Purine repressor | 2.7 | TACGCAAACGTTTGCGT | |
| 1bdi† | A,C | Purine repressor | 3.0 | TACGCAAACGTTTGCGT | |
| 1pnr† | A,C | Purine repressor | 2.7 | AACGAAAACGTTTTCG- | |
| 2pua† | A,C | Purine repressor | 2.9 | TACGCAAACGTTTGCGT | |
| 2pub† | A,C | Purine repressor | 2.7 | TACGCAAACGTTTGCGT | |
| 2puc† | A,C | Purine repressor | 2.6 | TACGCAAACGTTTGCGT | |
| 2pud† | A,C | Purine repressor | 2.6 | TACGCAAACGTTTGCGT | |
| 2pue† | A,C | Purine repressor | 2.7 | TACGCAAACGTTTGCGT | |
| 2puf† | A,C | Purine repressor | 3.0 | TACGCAAACGTTTGCGT | |
| 2pug† | A,C | Purine repressor | 2.7 | AACGAAAATTTTTCGT- | |
| 1vpw† | A,C | Purine repressor | 2.7 | TACGCAAACGTTTGCGT | |
| 1qpz† | A,C | Purine repressor | 2.5 | TACGCAAATTTTTCGT- | |
| 1zay† | A,C | Purine repressor | 2.7 | TACGCAAACGTTTGCGT | |
| ********* | |||||
| 1fok* | A | Endonuclease FokI | 2.8 | TCGGATGATAACGCTAGTCA | |
| 1gdt* | A,B | γδ-resolvase | 3.0 | GCAGTGTCCGATAATTTATAAA | |
| 1hcr* | A | Artificial Hin recombinase | Artificial | 1.8 | TGTTTTTGATAAGA |
| 1ign* | A,B | RAP1 | 2.25 | CCGCACACCCACACACCAG | |
| 1pdn* | C | Prd paired domain | 2.5 | AACGTCACGGTTGAC | |
| 1tc3* | C | Transposase TC3 | 2.45 | AGGGGGGGTCCTATAGAACTT | |
| 1trr* | A,B,D,E, | Trp repressor | 2.4 | AGCGTACTAGTACGCT | |
| G,H,J,K | |||||
| 1ddn* | A,B,C,D | Diptheria Tox repressor | 3.0 | ATATAATTAGGATAGCTTTACCTAATTATTTTA | |
| 1d3u* | B | TF IIB | 2.4 | AGAGTAAAGTTTAAATACTTATAT | |
| 1vol | A | TF IIB | 2.7 | ------ GGCTATAAAAGGGCTG-- | |
| 1c9b | A,E,I,M,Q | TF IIB | 2.65 | --GGGCGCCTATAAAAGGGG---- | |
| ****** | |||||
| 2irf* | G,H,I,J, K, L | Interferon regulatory factor-2 | 2.2 | AAGTGAAAGCGA | |
| 1if1 | A,B | Interferon regulatory factor | 3.0 | AACTGTAAGCTTT | |
| 2cgp* | C | Catabolite gene activator | 2.2 | -------------------------GTCACATTAAT-------------------- | |
| 1ber | A,B | Catabolite gene activator | 2.5 | --------GCGAAAAGTGTGACATATGTCACACTTTTCGGCGAAAAGTGTGACATATGTCACACTT | |
| 1cgp | A,B | Catabolite gene activator | 3.0 | ACTTTTCGGCGAAAAGTGTGACATATGTCACACTTTTCGGCGAAAAGTGTGACATAT--------- | |
| 1run | A,B | Catabolite gene activator | 2.7 | --------GCGAAAAATGTGATCTAGATCACATTTTTCGGCGAAAAATGTGATCTAGATCACATTT | |
| 1ruo | A,B | Catabolite gene activator | 2.7 | --------GCGAAAAATGTGATCTAGATCACATTTTTCGGCGAAAAATGTGATCTAGATCACATTT | |
| ************************************ | |||||
| 3hts*† | B | Heat shock transcription factor | 1.75 | GGTTCTAGAACC---- | |
| 1cf7 | A,B | E2F-DP | 2.6 | -ATTTTCGCGCGGTTT | |
| ***** | |||||
| 1bc8* | C | Sap-1 | 1.93 | TACCGGAAGTAACTTCCGGT | |
| 1bc7 | C | Sap-1 | 2.01 | TACCGGAAGTAACTTCCGGT | |
| 1pue | E,F | TF PU.1 | 2.1 | -AAAAAGGGGAAGTGGG--- | |
| 1awc | A,B | GABP-α | 2.15 | -AACGACCGGAAGTACACCGGA | |
| ***** | |||||
| 1aay* | A1(103-133), A2(134-161), A3(162-187) | Zif268 zinc finger | 1.6 | --AGCGTGGGCGTTACGCC-CACGC | |
| 1zaa | C1(3-33), C2(34-61), C3(62-87) | Zif268 zinc finger | 2.1 | --AGCGTGGGCGTTACGCC-CACGC------------- | |
| 2drp | A1(103-135), A2(137-164), D1(102-135), D2(137-165) | Tramtrack protein | 2.8 | AATAAGGATAACGTCCGTCGGACGTTATCCTTATTA-- | |
| 1ubd | C1(295-323), C2(324-351), C3(352-381), C4(382-408) | Zinc finger | 2.5 | -AGGGTCTCCAATTTTGAAGCGCGCTTCAAAATGG--- | |
| 1mey | C1(1-31), C2(32-59), C3(60-84), F1(1-31), F2(32-59), F3(60-84), G | Consensus zinc finger | Artificial | 2.2 | ---ATGAGGCAGAACTTAGTTCTGCCTCA-------- |
| 1a1g | A1(103-132), A2(133-159), A3(160186) | DSNR (Zif268 variant) | 1.9 | ---AGCGTGGGCGTTACGCCCACGC | |
| 1a1h | " | QSGR (Zif268 variant) | 1.6 | ---AGCGTGGGCGTTACGCCCACGC | |
| 1a1I | " | RADR (Zif268 variant) | 1.9 | ---AGCGTGGGCGTTACGCCCACGC | |
| 1a1j | " | " | 1.6 | ---AGCGTGGGCGTTACGCCCACGC | |
| 1a1k | " | " | 2.0 | ---AGCGTGGGCGTTACGCCCACGC | |
| 1a1l | " | Zif268 zinc finger | 2.3 | ---AGCGTGGGCGTTACGCCCACGC | |
| 2gli | A1(103-135) | Five-zinc finger | 2.6 | ---AGCGTGGGCGTTACGCCCACGC | |
| A2(136-167) | " | " | " | ---TTTCGTCTTGGGTGGTCCACG | |
| A3(168-196) | " | " | " | ||
| A4(197-228) | " | " | " | ||
| A5(229-257) | " | " | " | ||
| 2nll* | A,B | Retinoic acid receptor | 1.9 | ---CAGGTCAT-TTCAGGTCAGCTGACCTGAAATGACCTG-- | |
| 1hcq | A,B,E,F | Estrogen receptor | 2.4 | --CCAGGTCAC-AGTGACCTGCCAGGTCACTG-TGACCTG-- | |
| 1glu | A,B | Glucococorticoid receptor | 2.9 | --CCAGAACATCGATGTTCTGCCAGAACATCGATGTTCTG-- | |
| 1lat | A,B | Glucococorticoid receptor | 1.9 | TTCCAGAACATGTTCTGGATTCCAGAACAT----GTTCTGGA | |
| 1by4 | A,B,C,D | Retinoic acid receptor | 2.1 | ---CAGGACATCTAGTAAATTCCAGATCTTACGTTGTCTG-- | |
| 1cit | A | Orphan nuclear receptor | 2.7 | --CCAGAACATCGAGCCTCTGCCAGAACATCGTTGTTCTG-- | |
| 1a6y | A,B | Orphan nuclear receptor | 2.3 | -TCCAGGACATCGCTAAGCTTGCTGGTCATTGCGGTTCTG | |
| *********** | |||||
| 1tsr* | A,B,C | p53 tumor suppressor | 2.2 | TTTCCTAGACTTGCCCAATTAATAATTGGGCAAGTCTAGGAA | |
| 1tup | A,B,C | p53 tumor suppressor | 2.2 | TTTCCTAGACTTGCCCAATTAATAATTGGGCAAGTCTAGGAA | |
| 1zme* | C,D | Proline utilization | 2.5 | -ACGGGAAGCCAACTCCG- | |
| 1d66 | A,B | GAL4 | 2.7 | CCGGAGGACAGTCCTCCGG | |
| ******** | |||||
| 2dgc*† | A,C | GCN4 | 2.2 | ---TGGAGATGACGTCATCTCC-- | |
| 1dgc† | A,C | GCN4 | 3.0 | ---TGGAGATGACGTCATCTCC-- | |
| 1ysa | C,D | GCN4 | 2.9 | ---TTCCTATGAGCTCATCCAGTT | |
| 1a02 | F | C-Fos | 2.7 | TTGGGAAATTTCTTTCATAG---- | |
| J | C-Jun | ||||
| ***** | |||||
| 1am9* | A,B,C,D | Srebp-1A | 2.3 | -TTGCAAGTGGGGTGATCCATGA--------------- | |
| 1hlo | A,B | Max | 2.8 | -CACCACGTGGTGTGGTGCACCA--------------- | |
| 1an4 | A,B | USF | 2.9 | AGGCCACGTGACCGG-GGTACATCCGGTGCACT----- | |
| 1an2 | A,C | Max | 2.9 | AGGTCACGTGACCTACACCACATCCAGTGCACTGGATG | |
| 1mdy | A,B,C,D | Myod | 3.0 | TCAACAGCTGTTGA---TCAACAGCTGTTGAC------ | |
| 1a0a | A,B | Pho4 | S | 2.8 | |
| ********* | |||||
| 2bop*† | A,C | Papillomavirus-1 E2 | Bovine | 1.7 | CCGACCGACGTCGGTCG |
| papillomavirus | |||||
| 1aoi* | A,B,C,D,E,F,G,H | Histone | 2.8 | ATCAATATCCACCTGCAGATTCTACCAAAAGTGTATTTGGAAACTGCTC | |
| CATCAAAAGGCATGTTCAGCTGAATTCAGCTGAACATGCCTTTTGATGG | |||||
| AGCAGTTTCCAAATACACTTTTGGTAGAATCTGCAGGTGGATATTGAT | |||||
| 1b3t* | A,B | Ebna-1 | 2.2 | GGGAAGCATAGCTTCCC | |
| 1skn* | P | Skn-1 | 2.5 | TGACAATGTCATCCC | |
| 1crx* | A,B | Cre recombinase | Bacteriophage P1 | 2.4 | TATAACTTCGTATAG |
| 1qrv* | A,B | High mobility group-1 | 2.2 | GCGATATCGC------ | |
| 1ckt | A | High mobility group-1 | 2.5 | CCCCTCTGGACCTTCC | |
| ** | |||||
| 1mnm* | A,B | Pheromone transcription | 2.25 | GATTACCTAATAGGGAAATTTACACG | |
| factor MCM-1 | |||||
| 1ytb* | B | TATA box-binding protein | 1.8 | --GTATATAAAACGGGTGGCGTTTTATATAC----- | |
| 1ytf | A | TATA box-binding protein | 2.5 | ----TGTATGTATATAAAACGTTTTATATACATACA | |
| 1ais | A | TATA box-binding protein | 2.1 | AACTTACTTTIIAAAGCTTGAATGAAAAATTTCA-- | |
| 1cdw | A | TATA box-binding protein | 1.9 | CTGCTATAAAAGGCTGCAGCCTTTTATAGCAG---- | |
| 1tgh | A | TATA box-binding protein | 2.0 | --------CGTATATATACGCGTATATATACG---- | |
| 1vol | B | TATA-box-BP | 2.7 | ----TGATCCCTTAAACTCGCTTGTATATGA----- | |
| 1d3u | A | TATA-box-BP | 2.4 | ---CTTAATCGCTATATCCGTTTCTATAGCTTTCA- | |
| 1c9b | B,F,J,N,R | TATA-box-BP | 2.65 | --GCTATAACGGTTAACGTTATTGTATAGCCAA--- | |
| ***** | |||||
| 1cma* | A,B | Met repressor | 2.8 | TTAGACGTCTAGACGTCTA | |
| 1ecr* | A | Tus replication terminator | 2.7 | TTAGTTACAACATACT | |
| 1ihf* | A,B | Integration host factor | 2.5 | CGGTGCAACAAATTGATAAGCAATGCTTTTTTGGC | |
| 1xbr* | A,B | T-domain | 2.5 | AATTTCACACCTAGGTGTGAAAATT | |
| 1azp* | A | Sac7D | 1.6 | GCGATCGC | |
| 1azq | A | Sac7D | 1.94 | GCGATCGC | |
| 1bnz | A | 7A | 2.0 | GTAATTAC | |
| 1bf4 | A | Ss07D | 1.6 | GTAATTAC | |
| **** | |||||
| 1bdt* | A,B,C,D | Arc repressor | Bacteriophage P22 | 2.5 | TATAGTAGAGTGCTTCTATCAT |
| 1bdv | A,B,C,D | Arc repressor | Bacteriophage P22 | 2.8 | TATAGTAGAGTGCTTCTATCAT |
| 1par | A,B,C,D | Arc repressor | Bacteriophage P22 | 2.6 | TATAGTAGAGTGCTTCTATCAT |
| 1a3q* | A, B | NF κB p52 | 2.1 | GATTCCCCGGGGAATTCCCC------ | |
| 1nfk | A,B | NF-κB p50 | 2.3 | GAATTCCCTGGGAATTCCC------- | |
| 1svc | P,T | NF-κB p50 | 2.6 | ----AGATGGGGAATCCCCTAGA--- | |
| 1vkx | A,B | NF κB p50/p65 | 2.9 | ------CTGGGGAATTTCCCAG---- | |
| 1ram | A,B | NF κB p65 | 2.7 | GGGACTTTCCGAAATTCCCC------ | |
| 1bvo | A | Gambif1 TF | 2.7 | ---CGGGCTGGAAATTTCCAGCCG-- | |
| 1a02 | N | Nfat | 2.7 | ------TTGGGAAATTTCTTTCATAG | |
| ***** | |||||
| 1bf5* | A | Stat-1 | 2.9 | ACAGTTTCCCGTAAATGC | |
| 6mht* | A | Hhal methyltransferase | 2.05 | ---GATAGCGCTATCTGATAGCGCTATC---------- | |
| 4mht | A | Hhal methyltransferase | 2.7 | ---GATAG-GCTATC-GATAGCGCTATC---------- | |
| 1mht | A | Hhal methyltransferase | 2.8 | ---GATAGCGCTATCTGATAGCGCTATC---------- | |
| 3mht | A | Hhal methyltransferase | 2.7 | ---GATAGCGCTATCTGATAGCGCTATC---------- | |
| 5mht | A | HhaI methyltransferase | 2.7 | ---GATAGCGCTATCTGATAGCGCTATC---------- | |
| 7mht | A | HhaI methyltransferase | 2.87 | ---GATAGCGCTATCTGATAGCGCTATC---------- | |
| 8mht | A | HhaI methyltransferase | 2.76 | ---GATAGCGCTATCTGATAGCGCTATC---------- | |
| 9mht | A | HhaI methyltransferase | 2.87 | ---GATAG-GCTATC-GATAGCGCTATC---------- | |
| 10mh | A | HhaI methyltransferase | 2.55 | ---GATAGCGCTATCTGATAGCGCTATC---------- | |
| 1dct | A,B | HaeIII methyltransferase | 2.8 | ---ACCAGCAG-GCCACC-AGTGTCACTGGTGGCCTGCTGG | |
| ********** | |||||
| 3pvi* | A,B | Endonuclease PvuII | 1.59 | TGACCAGCTGGTC | |
| 2pvi | A,B | Endonuclease PvuII | 1.76 | TGACCAGCTGGTC | |
| 1pvi | A,B | Endonuclease PvuII | 2.8 | TGACCAGCTGGTC | |
| 1rva* | A,B | Endonuclease EcoRV | 2.0 | ------AAAGATATCTTAAA---GATATCTT- | |
| 1rvb | A,B | Endonuclease EcoRV | 2.1 | ------AAAGATATCTTAAA---GATATCTT- | |
| 1rvc | A,B | Endonuclease EcoRV | 2.1 | ------AAAGATATCTTAAA---GATATCTT- | |
| 2rve | A,B | Endonuclease EcoRV | 3.0 | CGAGCTCGCGAGCTCGCGAGCTCGCGAGCTCG | |
| 4rve† | A,B,C,G | Endonuclease EcoRV | 3.0 | -------GGGATATCCCGG----GATATCCC- | |
| 1rve | A,B | Endonuclease EcoRV | 2.5 | ------AAAGATATCTTAAA----GATATCTT- | |
| 1rv5 | A,B | Endonuclease EcoRV | 2.1 | ------CGGGATATCCC CGG---GATATCCC- | |
| 1bgb | A,B | Endonuclease EcoRV | 2.0 | ------AAAGATATCTTAAA---GATATCTT- | |
| 1bss | A,B | Endonuclease EcoRV | 2.0 | ------AAAGATATCTTAAA---GATATCTT- | |
| 1bua | A,B | Endonuclease EcoRV | 2.15 | ------AAAGACATCTT--------------- | |
| 1bsu | A,B | Endonuclease EcoRV | 2.0 | ------AAAGACATCTT--------------- | |
| ******* | |||||
| 1qps*† | A,M | Endonuclease EcoRI | 2.5 | TCGCGAATTCGCG | |
| 1eri† | A,C | Endonuclease EcoRI | 2.7 | TCGCGAATTCGCG | |
| 1qrh† | A | Endonuclease EcoRI | 2.5 | TCGCGAATTCGCG | |
| 1qri† | A | Endonuclease EcoRI | 2.6 | TCGCGAATTCGCG | |
| 3bam* | A,B | Endonuclease BamHI | 1.8 | TATGGATCCATATATG | |
| 1bhm | A,B | Endonuclease BamHI | 2.2 | TATGGATCCATA---- | |
| ************ | |||||
| 1vas* | A | Endonuclease V | Phage T4 | 2.75 | ATCGCGTTGCGCTTAGCGCAACGCCGA |
| 2dnj* | A | Deoxyribonuclease I | 2.0 | GCGATCGCGCGATC-- | |
| 1dnk | A | Deoxyribonuclease I | 2.3 | GGTATACGGCTATACC | |
| ****** | |||||
| 1cw0* | A | DNA mismatch endonuclease | 2.3 | ACGTACCTGGCTAGCTAGGTACGT | |
| 1bpy* | A | DNA polymerase-β | 2.2 | CCGACCACGCATCAGC | |
| 1bpx | A | " | " | 2.4 | CCGACCACGCATCAGC |
| 1bpz | A | " | " | 2.6 | CCGACCACGCATCAGC |
| 1zqa | A | " | " | 2.7 | CATTAGAATCTAATG- |
| 1zqf | A | " | " | 2.9 | CATTAGAATCTAATG- |
| 1zqi | A | " | " | 2.7 | CATTAGAATCTAATG- |
| 1zqn | A | " | " | 3.0 | CATTAGAATCTAATG- |
| 1zqp | A | " | " | 2.8 | CATCTG--TCAGATG- |
| 7ice | A | " | " | 2.8 | CATCTG--TCAGATG- |
| 7icg | A | " | " | 3.0 | CATCTG--TCAGATG- |
| 7ich | A | " | " | 2.9 | CATCTG--TCAGATG- |
| 7ici | A | " | " | 2.8 | CATCTG--TCAGATG- |
| 7ick | A | " | " | 2.9 | CATCTG--TCAGATG- |
| 7icm | A | " | " | 3.0 | CATCTG--TCAGATG- |
| 7icn | A | " | " | 2.8 | CATCTG--TCAGATG- |
| 7icp | A | " | " | 3.0 | CATCTG--TCAGATG- |
| 7icq | A | " | " | 2.9 | CATCTG--TCAGATG- |
| 7icr | A | " | " | 3.0 | CATCTG--TCAGATG- |
| 7ics | A | " | " | 2.8 | CATCTG--TCAGATG- |
| 7ict | A | " | " | 2.8 | CATCTG--TCAGATG- |
| 7icv | A | " | " | 2.8 | CATCTG---CAGATG- |
| 8ica | A | " | " | 3.0 | CATTAGAATCTAATG- |
| 8icc | A | " | " | 2.8 | CATTAGAATCTAATGA |
| 8icf | A | " | " | 2.9 | CATTAGAATCTAATG- |
| 8ici | A | " | " | 2.8 | CATTAGAATCTAATGA |
| 8ick | A | " | " | 2.7 | CATTAGAATCTAATG- |
| 8icm | A | " | " | 2.9 | CATTAGAATCTAATGA |
| 8icn | A | " | " | 2.8 | CATTAGAATCTAATG- |
| 8ico | A | " | " | 2.7 | CATTAGAATCTAATGA |
| 8icp | A | " | " | 2.9 | CATTAGAATCTAATG- |
| 8icq | A | " | " | 3.0 | CATTAGAATCTAATGA |
| 8icr | A | " | " | 2.9 | CATTAGAATCTAATGA |
| 8ics | A | " | " | 2.9 | CATTAGAATCTAATGA |
| 8icu | A | " | " | 3.0 | CATTAGAATCTAATG- |
| 8icx | A | " | " | 3.0 | CATTAGAATCTAATG- |
| 9ica | A | " | " | 3.0 | CATTAGAA-CTAATG- |
| 9icf | A | " | " | 3.0 | CATTAGAT-CTAATG- |
| 9icg | A | " | " | 3.0 | CATTAGAT-CTAATG- |
| 9ich | A | " | " | 2.9 | CATTAGAT-CTAATG- |
| 9ick | A | " | " | 2.7 | CATTAGAT-CTAATG- |
| 9icl | A | " | " | 2.8 | CATTAGAT-CTAATG- |
| 9icm | A | " | " | 2.9 | CATTAGAT-CTAATG- |
| 9icn | A | " | " | 3.0 | CATTAGAATCTAATG- |
| 9ico | A | " | " | 2.9 | CATTAGAATCTAATG- |
| 9icq | A | " | " | 2.9 | CATTAGAATCTAATG- |
| 9icr | A | " | " | 3.0 | CATTAGAA-CTAATG- |
| 9ics | A | " | " | 2.9 | CATTAGAT-CTAATG- |
| 9ict | A | " | " | 3.0 | CATTAGAT-CTAATG- |
| 9icu | A | " | " | 2.9 | CATTAGAATCTAATG- |
| 9icv | A | " | " | 2.7 | CATTAGAATCTAATG- |
| 9icw | A | " | " | 2.6 | CATTAGAATCTAATG- |
| 9icx | A | " | " | 2.6 | CATTAGAATCTAATG- |
| 9icy | A | " | " | 3.0 | CATTAGAATCTAATG- |
| 2bpf | A | " | 2.9 | GGGCGCCGCGGCGCC- | |
| *** | |||||
| 2bdp* | A | DNA polymerase I | 1.8 | --GCATGATGCAGCATCATGC--- | |
| 4bdp | A | DNA polymerase I | 1.8 | --GCATGATGCAGCATCATGC--- | |
| 1qsy | A | DNA polymerase I | 2.3 | GACCACGGCGCAGGGCGCCGTGGT | |
| 1qss | A | DNA polymerase I | 2.3 | GACCACGGCGCAGGGCGCCGTGGT | |
| 2ktq | A | DNA polymerase I | 2.3 | GACCACGGCGCGGGCGCCGTAATC | |
| 3ktq | A | DNA polymerase I | 2.3 | GACCACGGCGCGGGCGCCGTAATC | |
| 4ktq | A | DNA polymerase I | 2.5 | GACCACGGCGCGGGCGCCGTAATC | |
| 1tau | A | DNA polymerase I | 3.0 | -GCCATGCGGCAGTCGC------- | |
| 1d8y | A | DNA polymerase I | 2.08 | TTTTTTTTTTTTTTTTT | |
| ****** | |||||
| 1t7p* | A | DNA polymerase | Bacteriophage | 2.2 | GCCAGTGCCAACCTTGGCACTGGC |
| 1clq | A | DNA polymerase | Bacteriophage | 2.7 | GCGGAACTACTAGTAGTTCCGAG |
| ****** | |||||
| 1hmi* | A,B | HIV reverse transcriptase | HIV type 1 | 2.8 | ATGGCGCCCGAACAGGAC |
| 2hmi | A,B | HIV reverse transcriptase | HIV type 1 | 2.8 | ATGGCGCCCGAACAGGAC |
| 1ssp* | E | Uracil-DNA glycosylase | 1.9 | CTGTATCTTAAAGATAACAG | |
| 2ssp | E | Uracil-DNA glycosylase | 2.25 | CTGTATCTTAAAGATAACAG | |
| 4skn | A | Uracil-DNA glycosylase | 2.9 | TGGGGGCTTAAAGCCGCCC- | |
| ********* | |||||
| 1bnk* | A | 3-Methyladenine DNA glycosylase H. | 2.7 | GACATGTTGCCTGGCAATCATGTCA | |
| 1a73* | A,B, | Intron-encoded | 1.8 | -TTGACTCTCTTAAGCGAGTCA | |
| Endonuclease I-Ppoi | |||||
| 1a74 | A,B, | Intron-encoded | 2.2 | -TTGACTCTCTTAAGCGAGTCA | |
| Endonuclease I-Ppoi | |||||
| 1cyq | A,B, | Intron-encoded | 1.93 | -TTGACTCTCTTAAGCGAGTCA | |
| Endonuclease I-Ppoi | |||||
| 1ipp | A,B, | Intron-encoded | 2.2 | -TTGACTCTCTTAAGCGAGTCA | |
| Endonuclease I-Ppoi | |||||
| 1bp7 | A,B,C,D | Homing endlonuclease | 3.0 | GCAAAACGTCGTGAGACAGTTCG | |
| I-CreI | |||||
| ********* | |||||
| 1a31* | A | Topoisomerase I | 2.1 | AAAAAGACCCTGAAAAACCCCT | |
| 1a35 | A | Topoisomerase I | 2.5 | AAAAAGACCCTGAAAAACCCCT | |
| 1a36 | A | Topoisomerase I | 2.8 | AAAAAGACTTAGAAAAATTTTT |
Each entry is provided with the four-digit PDB code, the name, the source, resolution and the aligned sequences of DNA to which the protein is bound in the crystal structure. Protein DNA-binding domains are identified by a single letter identifying the protein chain in the PDB file. Where more than one DNA-binding domain is contained in a single subunit, the chain is split into the individual domains. For these, the domains are labeled with the chain ID and a number and the PDB file residue numbers of the domain are given in brackets. The PDB entries are classified into 'groups' and 'families' according to the structure of the protein contained in the complex. The representative for each family, defined as the structure with the highest resolution is marked with an asterisk (*). The CAP family is an exception; the highest-resolution entry, 2cgp, contains only part of the dimer and so PDB code 1ber is used as the representative. The full coordinate files for homodimeric PDB entries that contain only half the structure were obtained from the NDB and are marked by a dagger (†). Enzyme families whose structures also qualified for another protein group are listed in all that are applicable and are marked by a double dagger (‡).
Box 1Flow diagram showing the selection of the protein-DNA complexes from the PDB (04/01/00). The protein-DNA complexes were grouped into structurally related families using the secondary structure alignment program SSAP (see text).
Figure 1Group I, HTH proteins. The DNA-binding motif is red. The protein binds as a dimer; one monomer is colored blue and the other yellow. The DNA is shown as a space filling model. Family names and numbers are as listed in Table 2; PDB codes are bracketed.
Figure 2Group II, zinc-coordinating proteins. Colors, numbers and names are as in Figure 1.
Figure 3Group III, zipper-type proteins. Colors, numbers and names are as in Figure 1.
Figure 4Group IV, 'other α helix proteins'. Colors, numbers and names are as in Figure 1.
Figure 5Group V, β-sheet proteins. Colors, numbers and names are as in Figure 1.
Figure 6Group VI, the β-hairpin/ribbon proteins. Colors, numbers and names are as in Figure 1.
Figure 7Group VII, 'other DNA-binding proteins'. Colors, numbers and names are as in Figure 1.
Figure 8Group VIII, the enzymes. Colors, numbers and names are as in Figure 1.