| Literature DB >> 21737431 |
Abstract
Nucleotide bases are recognized by amino acid residues in a variety of DNA/RNA binding and nucleotide binding proteins. In this study, a total of 446 crystal structures of nucleotide-protein complexes are analyzed manually and pseudo pairs together with single and bifurcatedEntities:
Mesh:
Substances:
Year: 2011 PMID: 21737431 PMCID: PMC3201857 DOI: 10.1093/nar/gkr452
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The number of crystal structures of nucleotide–protein complexes in PDB
| Bases | Nucleotides (Code in PDB) | No. of structures in PDB | |
|---|---|---|---|
| ~ 3.5 Å | ~ 2.0 Å (%) | ||
| Adenine | Total | 739 | 281 (38.0) |
| ATP | 238 | 84 (35.3) | |
| ADP | 343 | 122 (35.6) | |
| AMP | 158 | 75 (47.5) | |
| Guanine | Total | 174 | 57 (32.8) |
| GTP | 80 | 16 (20.0) | |
| GDP | 87 | 38 (43.7) | |
| GMP | 7 | 3 (42.9) | |
| Uracil | Total | 96 | 46 (47.9) |
| UTP | 16 | 7 (43.8) | |
| UDP | 65 | 31 (47.7) | |
| UMP | 15 | 8 (53.3) | |
| Thymine | Total | 82 | 37 (45.1) |
| dTTP (TTP) | 45 | 11 (24.4) | |
| dTDP (TYD) | 16 | 11 (68.8) | |
| dTMP (TMP) | 21 | 15 (71.4) | |
| Cytosine | Total | 63 | 25 (39.7) |
| CTP | 27 | 8 (29.6) | |
| CDP | 14 | 7 (50.0) | |
| CMP (C5P) | 22 | 10 (45.5) | |
| Total | 1154 | 446 (38.6) | |
Our data set used in this study contains a total of 446 crystal structures with resolution better than 2.0 Å.
Frequency of each pseudo pair and single hydrogen bond contact observed in our data set of nucleotide–protein complexes
| Bases (No. of interactions) | Interaction edges (No. of interactions) | Amino acids | Pseudo pair | Hydrogen bond | ||
|---|---|---|---|---|---|---|
| Number | Frequency (%) | Number | Frequency (%) | |||
| Adenine (266) | Watson–Crick (181) | Asn | 10 (6 WM) | 3.8 (2.3) | 4 | 1.5 |
| Gln | 2 | 0.8 | 4 | 1.5 | ||
| Asp | 7 (6 WM, 1 Asp/PB) | 2.6 (2.3, 0.4) | 10 | 3.8 | ||
| Glu | 1 | 0.4 | 3 | 1.1 | ||
| Ser | 1 (1 Ser/PB) | 0.4 (0.4) | 6 | 2.3 | ||
| Thr | – | – | 4 | 1.5 | ||
| Thy | – | – | 4 | 1.5 | ||
| PB | 108 (6 WM) | 40.6 (2.3) | 17 | 6.4 | ||
| Hoogsteen (84) | Asn | 8 | 3.0 | 3 | 1.1 | |
| Gln | 8 | 3.0 | 5 | 1.9 | ||
| Asp | 1 (1 WM) | 0.4 (0.4) | – | – | ||
| Glu | 1 (1 WM) | 0.4 (0.4) | 19 | 7.1 | ||
| His | – | – | 2 | 0.8 | ||
| Ser | – | – | 1 | 0.4 | ||
| Thr | – | – | 5 | 1.9 | ||
| Tyr | – | – | 2 | 0.8 | ||
| PB | – | – | 29 | 10.9 | ||
| Sugar-edge (1) | Ser | – | – | 1 | 0.4 | |
| Guanine (82) | Watson–Crick (64) | Asn | – | – | 1 | 1.2 |
| Gln | – | – | 2 | 2.4 | ||
| Asp | 28 | 34.1 | 2 | 2.4 | ||
| Glu | 2 | 2.4 | 7 | 8.5 | ||
| Ser | – | – | 14 | 17.1 | ||
| PB | 6 | 7.3 | 2 | 2.4 | ||
| Hoogsteen (17) | Asn | – | – | 14 | 17.1 | |
| Arg | 2 | 2.4 | – | – | ||
| Thr | – | – | 1 | 1.2 | ||
| Sugar-edge (1) | PB | – | – | 1 | 1.2 | |
| Uracil (47) | Watson–Crick (38) | Asn | 2 | 4.3 | – | – |
| Gln | 1 | 2.1 | – | – | ||
| Asp | – | – | 7 | 14.9 | ||
| Ser | 1 (1 Ser/PB) | 2.1 (2.1) | 1 | 2.1 | ||
| Thr | 2 (2 Thr/PB) | 4.3 (4.3) | – | – | ||
| Tyr | – | – | 1 | 2.1 | ||
| PB | 22 | 46.8 | 1 | 2.1 | ||
| Hoogsteen (4) | Asn | – | – | 1 | 2.1 | |
| Arg | – | – | 2 | 4.3 | ||
| His | – | – | 1 | 2.1 | ||
| Sugar-edge (5) | Asn | 3 | 6.9 | – | – | |
| Arg | – | – | 1 | 2.1 | ||
| His | – | – | 1 | 2.1 | ||
| Thymine (24) | Watson–Crick (17) | Asn | 2 | 8.3 | 2 | 8.3 |
| Gln | 4 | 16.7 | 1 | 4.2 | ||
| Asp | – | – | 1 | 4.2 | ||
| Glu | – | – | 1 | 4.2 | ||
| Lys | – | – | 1 | 4.2 | ||
| PB | 4 | 16.7 | 1 | 4.2 | ||
| Hoogsteen (7) | Arg | – | – | 5 | 20.8 | |
| Trp | – | – | 1 | 4.2 | ||
| PB | – | – | 1 | 4.2 | ||
| Sugar-edge (0) | – | – | – | – | – | |
| Cytosine (24) | Watson–Crick (18) | Asn | 1 (1WM) | 4.2 (4.2) | – | – |
| Arg | 3 | 12.5 | – | – | ||
| Ser | – | – | 3 | 12.5 | ||
| Thr | 1 (1 Thr/PB) | 4.2 (4.2) | – | – | ||
| PB | 6 (1 WM) | 25.0 (4.2) | 4 | 16.7 | ||
| Hoogsteen (6) | Asp | – | – | 1 | 4.2 | |
| PB | – | – | 5 | 20.8 | ||
| Sugar-edge (0) | – | – | – | – | – | |
The water-mediated pseudo pair is designated as WM. The pseudo pair using both side- and main-chains is designated as Xxx/PB (Xxx = three-letter code of amino acid).
Figure 5.The adenine binding [left; PDB-ID = 1DAD (38)] and guanine binding [right; PDB-ID = 1G7S (40)] motifs composed of multiple interactions. Hydrogen bonds are shown in black dashed lines.
Figure 1.The Watson–Crick pseudo pairs between nucleotide bases and amino acid side-chains. Nucleotides and amino acids are colored in yellow and green, respectively. Water molecules are shown by red spheres. Pseudo pairs classified in this study are boxed by red lines, and those observed by Cheng et al. (26) but not in this study are boxed by blue lines. Hydrogen bonds are shown in black dashed lines. In the C-Glu pseudo pairs observed in DNA–protein complexes (PDB-ID = 1dct, 1mht, 4mht), Cytosine modified at position C5 is indicated by an asterisk.
Figure 3.The Hoogsteen pseudo pairs between nucleotide bases and amino acid side-chains. Pseudo pairs classified in this study are boxed with red lines.
Figure 4.The Sugar-edge pseudo pairs between nucleotide bases and amino acid side-chains. Pseudo pairs classified in this study are boxed with red lines and those observed by Cheng et al. (26) but not in this study are boxed with blue lines.
Figure 2.The Watson–Crick pseudo pairs between nucleotide bases and the peptide backbone.