| Literature DB >> 29471780 |
Oliviero Carugo1,2.
Abstract
BACKGROUND: Protein crystal structures are potentially over-interpreted since they are routinely refined without any restraint on the upper limit of atomic B-factors. Consequently, some of their atoms, undetected in the electron density maps, are allowed to reach extremely large B-factors, even above 100 square Angstroms, and their final positions are purely speculative and not based on any experimental evidence.Entities:
Keywords: Atomic displacement parameter; B-factor; Crystal structure; Matthews coefficient; Protein structure validation
Mesh:
Substances:
Year: 2018 PMID: 29471780 PMCID: PMC5824579 DOI: 10.1186/s12859-018-2083-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Number of protein crystal structures in the different datasets examined in the present paper
| Resolution range (Å) | Number of structures | With B > 100 Å2 | Without B > 100 Å2 | With B > 100 Å2 | Without B > 100 Å2 |
|---|---|---|---|---|---|
| 0.0–1.5 | 863 | 4% | 24% | 2% | 70% |
| 1.5–1.8 | 1853 | 5% | 18% | 1% | 76% |
| 1.8–2.1 | 2505 | 8% | 14% | 1% | 77% |
| 2.1–2.4 | 1950 | 18% | 11% | 2% | 69% |
| 2.4–2.7 | 1422 | 31% | 9% | 4% | 56% |
| 2.7–3.0 | 916 | 44% | 6% | 6% | 44% |
| 3.0–3.3 | 338 | 53% | 6% | 7% | 34% |
| 3.3–4.0 | 76 | 55% | 5% | 12% | 28% |
Fig. 1PDB entries with missing residues and large B-factors. Frequency with which different types of files have been deposited into the PDB during the last decades
Fig. 2B-factors and pcVol. Relationship between B-factors and pcVol (independently of crystallographic resolution)
Fig. 3B-factors, resolution and pcVol. Dependence of B-factors on resolution and dependence of resolution on pcVol indicate that the relationship between B-factors and pcVol (see Fig. 2) must be examined in datasets of structures that have similar resolution
Fig. 4B-factors-pcVol versus resolution. Relationship between B-factors and pcVol at various resolution ranges, indicated close to each curve
Values of the B-factors (Å2) expected when the percentage of solvent tends to 100% (B_max; confidence intervals in parentheses)
| Atom type | Resolution range (Å) | |||||||
|---|---|---|---|---|---|---|---|---|
| 0.0–1.5 | 1.5–1.8 | 1.8–2.1 | 2.1–2.4 | 2.4–2.7 | 2.7–3.0 | 3.0–3.3 | 3.3–4.0 | |
| Any | 25(2) | 31(2) | 43(2) | 55(3) | 61(3) | 70(5) | 83(9) | 80(19) |
| 1 | 23(2) | 30(2) | 41(2) | 53(3) | 59(3) | 68(5) | 80(9) | 77(19) |
| 2 | 23(2) | 30(2) | 42(2) | 54(3) | 60(3) | 68(5) | 80(9) | 77(19) |
| 3 | 23(2) | 30(2) | 41(2) | 52(3) | 57(3) | 64(5) | 74(9) | 71(19) |
| 4 | 26(2) | 33(2) | 44(2) | 56(3) | 62(3) | 70(5) | 83(9) | 84(20) |
| 5 | 24(2) | 31(2) | 42(2) | 54(3) | 61(3) | 70(5) | 84(9) | 84(20) |
| 6 | 33(3) | 39(2) | 51(2) | 64(3) | 70(4) | 80(5) | 98(11) | 102(22) |
| 7 | 24(2) | 31(2) | 41(2) | 51(3) | 56(3) | 63(5) | 76(9) | 72(19) |
| 8 | 23(2) | 30(2) | 40(2) | 51(3) | 56(3) | 64(5) | 79(9) | 72(19) |
| 9 | 25(3) | 32(2) | 43(2) | 55(3) | 62(4) | 70(5) | 84(9) | 81(21) |
| 10 | 26(4) | 32(3) | 44(3) | 56(4) | 61(4) | 72(6) | 92(11) | 90(23) |
| 11 | 29(3) | 35(2) | 48(2) | 59(3) | 67(4) | 79(5) | 93(10) | 94(21) |
| 12 | 32(3) | 39(3) | 52(3) | 65(3) | 73(4) | 86(6) | 102(11) | 106(22) |
| 13 | 22(4) | 25(3) | 36(3) | 45(4) | 51(5) | 64(7) | 74(12) | 57(25) |
| 14 | 27(5) | 30(4) | 37(4) | 44(5) | 57(6) | 65(8) | 83(13) | 76(22) |
| 15 | 23(2) | 30(2) | 41(2) | 53(3) | 59(3) | 68(5) | 80(9) | 76(19) |
| 16 | 20(3) | 29(3) | 37(3) | 43(4) | 47(5) | 56(7) | 68(12) | 63(22) |
| 17 | 27(4) | 32(3) | 44(3) | 56(4) | 62(4) | 72(6) | 91(11) | 90(24) |
| 18 | 31(3) | 36(2) | 48(2) | 59(3) | 66(4) | 79(5) | 95(10) | 94(21) |
| 19 | 35(4) | 38(3) | 51(3) | 62(3) | 70(4) | 80(6) | 97(10) | 100(24) |
| 20 | 40(4) | 43(3) | 57(3) | 69(3) | 74(4) | 80(5) | 93(11) | 98(22) |
| 21 | 24(2) | 30(2) | 41(2) | 54(3) | 60(3) | 69(5) | 81(9) | 77(19) |
| 22 | 31(3) | 37(2) | 50(2) | 61(3) | 68(4) | 80(5) | 95(10) | 95(21) |
| 23 | 35(3) | 41(3) | 54(3) | 67(3) | 74(4) | 87(6) | 103(11) | 107(22) |
| 24 | 26(2) | 32(2) | 44(2) | 56(3) | 62(4) | 71(5) | 84(9) | 85(19) |
Fig. 5B-max examples. B_max values (Å2), at various resolution ranges, for the side chain atoms of lysine, serine, and methionine
Percentage of protein crystal structures deposited after 2014 in the Protein Data Bank that have and average B-factor larger than B_max
| Resolution (Å) | Percentage (%) |
|---|---|
| 0.0–1.5 | 5 |
| 1.5–1.8 | 17 |
| 1.8–2.1 | 19 |
| 2.1–2.4 | 30 |
| 2.4–2.7 | 34 |
| 2.7–3.0 | 37 |
| 3.0–3.3 | 43 |
| 3.3–4.0 | 76 |