| Literature DB >> 21293729 |
Kresimir Sikic1, Sanja Tomic, Oliviero Carugo.
Abstract
Nearly all the macromolecular three-dimensional structures deposited in Protein Data Bank were determined by either crystallographic (X-ray) or Nuclear Magnetic Resonance (NMR) spectroscopic methods. This paper reports a systematic comparison of the crystallographic and NMR results deposited in the files of the Protein Data Bank, in order to find out to which extent these information can be aggregated in bioinformatics. A non-redundant data set containing 109 NMR - X-ray structure pairs of nearly identical proteins was derived from the Protein Data Bank. A series of comparisons were performed by focusing the attention towards both global features and local details. It was observed that: (1) the RMDS values between NMR and crystal structures range from about 1.5 Å to about 2.5 Å; (2) the correlation between conformational deviations and residue type reveals that hydrophobic amino acids are more similar in crystal and NMR structures than hydrophilic amino acids; (3) the correlation between solvent accessibility of the residues and their conformational variability in solid state and in solution is relatively modest (correlation coefficient = 0.462); (4) beta strands on average match better between NMR and crystal structures than helices and loops; (5) conformational differences between loops are independent of crystal packing interactions in the solid state; (6) very seldom, side chains buried in the protein interior are observed to adopt different orientations in the solid state and in solution.Entities:
Keywords: Large scale structure comparison; NMR spectroscopy; Protein Data Bank; X-ray crystallography.; structure similarity
Year: 2010 PMID: 21293729 PMCID: PMC3032220 DOI: 10.2174/1874091X01004010083
Source DB: PubMed Journal: Open Biochem J ISSN: 1874-091X
A Non-Redundant Data Set of 109 NMR – X-Ray Structure Pairs of Nearly Identical Proteins Derived from the Protein Data Bank
| No. | NMR | X-ray | ||||
|---|---|---|---|---|---|---|
| PDB entry | Chain | Number of models | PDB entry | Chain | Resolution (Å) | |
| 1 | 2JQX | A | 1 | 1D8C | A | 2.00 |
| 2 | 1EZO | A | 10 | 1ANF | A | 1.67 |
| 3 | 1AH2 | A | 18 | 1IAV | A | 1.80 |
| 4 | 1TR4 | A | 20 | 1UOH | A | 2.00 |
| 5 | 1BXL | A | 1 | 1MAZ | A | 2.20 |
| 6 | 2GPQ | A | 10 | 1IPB | A | 2.00 |
| 7 | 1A23 | A | 1 | 1A2J | A | 2.00 |
| 8 | 1MM4 | A | 20 | 1THQ | A | 1.90 |
| 9 | 1GGR | A | 3 | 2F3G | B | 2.13 |
| 10 | 1CRP | A | 20 | 1GNP | A | 2.70 |
| 11 | 1OCA | A | 20 | 1W8L | A | 1.80 |
| 12 | 1NMV | A | 10 | 1PIN | A | 1.35 |
| 13 | 1AX3 | A | 16 | 1GPR | A | 1.90 |
| 14 | 1DF3 | A | 10 | 1JV4 | A | 1.75 |
| 15 | 1B6F | A | 23 | 1BV1 | A | 2.00 |
| 16 | 1EQ0 | A | 20 | 1DY3 | A | 2.00 |
| 17 | 1BVH | A | 15 | 1DG9 | A | 1.90 |
| 18 | 1RCH | A | 8 | 1RDD | A | 2.80 |
| 19 | 6I1B | A | 1 | 1I1B | A | 2.00 |
| 20 | 1BA9 | A | 36 | 1MFM | A | 1.02 |
| 21 | 1XPW | A | 20 | 1TVG | A | 1.60 |
| 22 | 1FXT | A | 1 | 1FZY | B | 1.90 |
| 23 | 1JOK | A | 1 | 1SNO | A | 1.70 |
| 24 | 1CFC | A | 25 | 1A29 | A | 2.74 |
| 25 | 1ORM | A | 20 | 1QJ8 | A | 1.90 |
| 26 | 1PFL | A | 20 | 1FIK | A | 2.30 |
| 27 | 1BLR | A | 22 | 1CBQ | A | 2.20 |
| 28 | 2P3M | A | 20 | 2VBS | A | 3.00 |
| 29 | 1JJJ | A | 20 | 1B56 | A | 2.05 |
| 30 | 1JBH | A | 20 | 1CRB | A | 2.10 |
| 31 | 1TBD | A | 1 | 2FUF | A | 1.45 |
| 32 | 1AEL | A | 20 | 1ICM | A | 1.50 |
| 33 | 1A63 | A | 10 | 1A62 | A | 1.55 |
| 34 | 1IY3 | A | 1 | 1IWT | A | 1.40 |
| 35 | 1CYL | A | 20 | 1HIK | A | 2.60 |
| 36 | 1CEY | A | 46 | 1CHN | A | 1.76 |
| 37 | 2JU3 | A | 10 | 1LFO | A | 2.30 |
| 38 | 1FR0 | A | 30 | 1A0B | A | 2.06 |
| 39 | 1GIO | A | 10 | 1AGI | A | 1.50 |
| 40 | 2PRF | A | 19 | 1PRQ | A | 2.50 |
| 41 | 1MVG | A | 10 | 1TVQ | A | 2.00 |
| 42 | 2AAS | A | 32 | 1AQP | A | 2.00 |
| 43 | 1FSP | A | 20 | 1NAT | A | 2.45 |
| 44 | 1PIR | A | 1 | 1P2P | A | 2.60 |
| 45 | 1IT4 | A | 1 | 1FAZ | A | 1.40 |
| 46 | 1KLV | A | 20 | 1GNU | A | 1.75 |
| 47 | 2JN8 | A | 20 | 2ES9 | A | 2.00 |
| 48 | 1SVQ | A | 20 | 1SVY | A | 1.75 |
| 49 | 1K19 | A | 20 | 1KX8 | A | 2.80 |
| 50 | 1BC4 | A | 15 | 1KM8 | A | 1.90 |
| 51 | 1BNR | A | 20 | 1YVS | A | 2.20 |
| 52 | 1A67 | A | 16 | 1CEW | I | 2.00 |
| 53 | 2NLN | A | 20 | 1OMD | A | 1.85 |
| 54 | 1F40 | A | 10 | 1FKB | A | 1.70 |
| 55 | 1MPH | A | 50 | 1BTN | A | 2.00 |
| 56 | 1N5H | A | 15 | 1PFP | A | 2.30 |
| 57 | 1IYY | A | 24 | 1BU4 | A | 1.90 |
| 58 | 2CZN | A | 38 | 2CWR | A | 1.70 |
| 59 | 1N7T | A | 20 | 2H3L | B | 1.00 |
| 60 | 2EVN | A | 20 | 2IL4 | A | 2.05 |
| 61 | 1GO0 | A | 10 | 1H7M | A | 1.96 |
| 62 | 1M42 | A | 1 | 2C9Q | A | 1.60 |
| 63 | 1IIY | A | 1 | 3EZM | A | 1.50 |
| 64 | 1JNJ | A | 20 | 1LDS | A | 1.80 |
| 65 | 1BEG | A | 18 | 1BEO | A | 2.20 |
| 66 | 1C15 | A | 16 | 1CY5 | A | 1.30 |
| 67 | 1BMW | A | 38 | 1WHO | A | 1.90 |
| 68 | 2JMV | A | 20 | 2QSK | A | 1.00 |
| 69 | 1AFH | A | 15 | 1FK0 | A | 1.80 |
| 70 | 3HSF | A | 30 | 2HTS | A | 1.83 |
| 71 | 1PQX | A | 10 | 2FFM | A | 2.51 |
| 72 | 1L8Y | A | 30 | 2HDZ | A | 2.00 |
| 73 | 2CKU | A | 15 | 2CG6 | A | 1.55 |
| 74 | 1A2S | A | 1 | 1CTJ | A | 1.10 |
| 75 | 1ACA | A | 20 | 1HB6 | A | 2.00 |
| 76 | 1GGR | B | 3 | 1POH | A | 2.00 |
| 77 | 1ACP | A | 2 | 1T8K | A | 1.10 |
| 78 | 1D3Z | A | 10 | 1AAR | B | 2.30 |
| 79 | 1RQS | A | 20 | 1CTF | A | 1.70 |
| 80 | 2AIT | A | 9 | 1HOE | A | 2.00 |
| 81 | 1FD8 | A | 1 | 1CC7 | A | 1.20 |
| 82 | 1K3G | A | 30 | 1C75 | A | 0.97 |
| 83 | 2ORC | A | 32 | 1ORC | A | 1.54 |
| 84 | 1HFG | A | 1 | 2FHT | A | 1.70 |
| 85 | 1ZUG | A | 20 | 2CRO | A | 2.35 |
| 86 | 1BQT | A | 6 | 1GZR | B | 2.0 |
| 87 | 3MEF | A | 16 | 1MJC | A | 2.00 |
| 88 | 2BTT | A | 20 | 1RUW | A | 1.80 |
| 89 | 1QQV | A | 1 | 1YU5 | X | 1.40 |
| 90 | 1LQH | A | 1 | 2ASC | A | 1.10 |
| 91 | 1IJC | A | 20 | 1F94 | A | 0.97 |
| 92 | 1AEY | A | 15 | 1SHG | A | 1.80 |
| 93 | 1ERA | A | 1 | 3EBX | A | 1.40 |
| 94 | 1ROF | A | 10 | 1VJW | A | 1.75 |
| 95 | 1JV8 | A | 23 | 1BPI | A | 1.09 |
| 96 | 1H0T | B | 40 | 1LP1 | A | 2.30 |
| 97 | 1KUN | A | 20 | 1KNT | A | 1.60 |
| 98 | 1GB1 | A | 60 | 1PGA | A | 2.07 |
| 99 | 1BFY | A | 20 | 1FHH | A | 1.50 |
| 100 | 1MBE | A | 1 | 1GUU | A | 1.60 |
| 101 | 1FDM | 20 | 2C0W | A | 3.20 | |
| 102 | 1CCM | A | 8 | 1CBN | A | 0.83 |
| 103 | 1PJF | A | 27 | 1QL1 | A | 3.10 |
| 104 | 1HEV | A | 6 | 1Q9B | A | 1.50 |
| 105 | 1ERC | A | 20 | 2ERL | A | 1.00 |
| 106 | 1KX6 | A | 20 | 1GCN | A | 3.00 |
| 107 | 1CIR | B | 20 | 1CIQ | B | 2.20 |
| 108 | 1V6R | A | 20 | 1EDN | A | 2.18 |
| 109 | 1XGA | A | 35 | 1NOT | A | 1.20 |
Mean RMDS and RMDS When the PDB entry corresponding to a NMR structure contains several models, values were averaged. The values reported in the table are thus averages of averages.
| CE mean(std)(Å) | DALI mean(std)(Å) | PROFIT mean(std)(Å) | |
|---|---|---|---|
| RMSD | 1.61(0.02) | 1.96(0.02) | 2.60(0.09) |
| RMSD100 | 2.40(0.11) | 2.01(0.02) | 3.75(0.19) |
Average Distance Values (Standard Deviations in Parentheses) Between the Equivalent Cα Atoms for Different Amino Acid Types, After Optimal Superposition of 109 Non-Redundant Pairs of Proteins. Superpositions were made with CE, DALI, and PROFIT. When the PDB entry corresponding to a NMR structure contains several models, values were averaged. The values reported in the table are thus averages of averages.
| Residue Type | CE mean(std) (Å) | DALI mean(std) (Å) | PROFIT mean(std) (Å) |
|---|---|---|---|
| Ala | 1.41(0.05) | 1.80(0.05) | 2.55(0.17) |
| Arg | 1.61(0.06) | 1.71(0.06) | 2.18(0.17) |
| Asn | 1.57(0.06) | 1.87(0.06) | 2.40(0.14) |
| Asp | 1.65(0.06) | 1.90(0.06) | 2.64(0.18) |
| Cys | 1.23(0.06) | 1.42(0.07) | 1.27(0.07) |
| Glu | 1.54(0.05) | 1.87(0.06) | 2.20(0.16) |
| Gln | 1.49(0.07) | 1.80(0.07) | 2.09(0.17) |
| Gly | 1.85(0.06) | 2.03(0.05) | 2.38(0.12) |
| His | 1.58(0.11) | 1.78(0.10) | 2.04(0.27) |
| Ile | 1.24(0.05) | 1.55(0.05) | 1.80(0.12) |
| Leu | 1.28(0.04) | 1.72(0.05) | 1.83(0.10) |
| Lys | 1.56(0.05) | 1.76(0.05) | 2.32(0.15) |
| Met | 1.55(0.10) | 1.87(0.11) | 2.14(0.25) |
| Phe | 1.23(0.06) | 1.56(0.06) | 1.79(0.16) |
| Pro | 1.70(0.09) | 2.00(0.08) | 2.50(0.20) |
| Ser | 1.66(0.06) | 1.80(0.05) | 2.37(0.14) |
| Thr | 1.48(0.06) | 1.74(0.06) | 1.96(0.11) |
| Trp | 1.26(0.11) | 1.76(0.11) | 2.78(0.42) |
| Tyr | 1.39(0.07) | 1.76(0.08) | 2.22(0.15) |
| Val | 1.18(0.04) | 1.54(0.05) | 1.77(0.11) |
| Overall | 1.48(0.07) | 1.76(0.07) | 2.16(0.17) |
Mean Distances (Standard Deviations in Parentheses) Between Equivalent Atoms in Different Secondary Structures in NMR and X-Ray Models After Cα Based Superposition. After superposition Cα-Cα distances for different types of secondary structure elements were calculated. Superpositions were made with CE, DALI, and PROFIT. When the PDB entry corresponding to a NMR structure contains several models, values were averaged. The values reported in the table are thus averages of averages.
| Secondary Structure | CE (Å) | DALI (Å) | PROFIT (Å) |
|---|---|---|---|
| Helix mean(std) distance | 1.50(0.02) | 1.80(0.02) | 2.55(0.08) |
| Sheet mean(std) distance | 1.08(0.02) | 1.40(0.02) | 1.39(0.03) |
| Loop mean(std) distance | 1.79(0.03) | 2.04(0.02) | 2.58(0.07) |
Mean RMSD of Loops of Different Length and Differently Involved in Crystal Packing Contacts in the Solid State. Standard deviations of the mean are shown in parentheses. The length of the loop is measured by the number of residues (nres). Superpositions were made with PROFIT. The measure for the crystal packing contacts is number of residues that are involved in crystal packing interactions (ncpc). Therefore ncpc ≤ nres. When the PDB entry corresponding to a NMR structure contains several models, all of them are considered separately.
| nres | ncpc | Number of Examples | RMSD_100 (std) (Å) | nres | ncpc | Number of Examples | RMSD_100 (std) (Å) |
|---|---|---|---|---|---|---|---|
| 3 | 0 | 46 | 1.18(0.07) | 6 | 3 | 10 | 0.62(0.04) |
| 3 | 1 | 106 | 0.91(0.02) | 6 | 4 | 54 | 1.11(0.08) |
| 3 | 2 | 155 | 0.94(0.02) | 6 | 5 | 4 | 1.33(0.09) |
| 3 | 3 | 149 | 0.99(0.04) | 6 | 6 | 50 | 1.09(0.03) |
| 4 | 0 | 26 | 0.60(0.05) | 7 | 0 | 10 | 1.13(0.00) |
| 4 | 1 | 24 | 0.73(0.05) | 7 | 4 | 3 | 0.50(0.03) |
| 4 | 2 | 73 | 1.39(0.02) | 7 | 6 | 23 | 1.14(0.08) |
| 4 | 3 | 117 | 1.06(0.02) | 7 | 7 | 10 | 1.09(0.01) |
| 4 | 4 | 174 | 1.12(0.01) | 8 | 0 | 10 | 1.31(0.01) |
| 5 | 2 | 84 | 1.21(0.04) | 8 | 2 | 20 | 0.94(0.01) |
| 5 | 3 | 108 | 0.92(0.02) | 8 | 5 | 1 | 1.47(0) |
| 5 | 4 | 85 | 0.80(0.03) | 8 | 6 | 1 | 1.19(0) |
| 5 | 5 | 91 | 1.10(0.03) | 9 | 7 | 23 | 1.02(0.02) |
| 6 | 1 | 10 | 1.50(0.00) | 9 | 8 | 40 | 1.71(0.09) |