| Literature DB >> 31970272 |
Abstract
Water is a key component of cellular biochemistry and numerous water molecules are visible in crystallographic structures. Here we report a series of data sets of crystallographic water: a high resolution data set, a cytochrome c oxidase (subunit I) data set and a carbonic anhydrase data set. These data support the evidence that short distance water molecule pairs are present both at the surface and inside the cavities of proteins. These data are related to article entitled "Oxygen-oxygen distances in protein-bound crystallographic water suggest the presence of protonated clusters" (Palese, 2020) [1].Entities:
Keywords: Confined water; Protein surface; Proton; Water; Water cluster; X-ray crystallography
Year: 2020 PMID: 31970272 PMCID: PMC6965711 DOI: 10.1016/j.dib.2019.105076
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1The RDF of the protein-bound water oxygen atoms. Left panel reports the HR data set (see Table 1); right panel reports the HR data set not refined by SHELX (see Table 2). See Ref. [1] for details. Red crosses refer to the experimental RDF; colored curves indicated with dashed lines are the Gaussian curves used for the fitting, and their sum is reported as a thin red line.
The HR data set.
| 1A6M | 1B0Y | 1BXO | 1C75 | 1DY5 | 1EA7 | 1EB6 | 1EXR | 1F94 | 1FN8 | 1FY4 | 1FY5 | 1G4I |
| 1G6X | 1GA6 | 1GCI | 1GDN | 1GDQ | 1GQV | 1GVK | 1HJ8 | 1I1W | 1IQZ | 1IR0 | 1IUA | 1J0P |
| 1JFB | 1K2A | 1K6U | 1KWF | 1L9L | 1LNI | 1LUG | 1M1Q | 1M40 | 1 MC2 | 1MJ5 | 1MN8 | 1MUW |
| 1MXT | 1N1P | 1N4U | 1N4V | 1N4W | 1N9B | 1NQJ | 1OAI | 1OD3 | 1OEW | 1OK0 | 1PQ5 | 1PQ7 |
| 1PQ8 | 1PWM | 1R2M | 1R6J | 1RTQ | 1SSX | 1SY2 | 1SY3 | 1T2H | 1TG0 | 1TQG | 1TT8 | 1UG6 |
| 1UNQ | 1US0 | 1V0L | 1VB0 | 1VBW | 1VL9 | 1VYR | 1W0N | 1X8P | 1XMK | 1XVO | 1YK4 | 1YLJ |
| 1YWA | 1YWB | 1YWC | 1Z8A | 1ZLB | 1ZUU | 1ZZK | 2A6Z | 2ABB | 2AGT | 2AT3 | 2AT8 | 2AYW |
| 2B97 | 2BF6 | 2CE2 | 2CNQ | 2CWS | 2DDX | 2E4T | 2EWI | 2EWK | 2FDN | 2FLA | 2FMA | 2FOU |
| 2FVY | 2GBA | 2GEW | 2GG2 | 2GGC | 2GKG | 2H3L | 2H5C | 2H5D | 2I4A | 2IDQ | 2IDS | 2IDT |
| 2IDU | 2IXT | 2JFR | 2JJJ | 2NLS | 2NRL | 2NX0 | 2O7A | 2O9S | 2OFR | 2OV0 | 2P5K | 2P74 |
| 2PEV | 2PF8 | 2PFH | 2PND | 2PNE | 2PVB | 2PVE | 2PYA | 2PZN | 2QCP | 2QDV | 2QDW | 2QSK |
| 2RBK | 2V1M | 2V8B | 2VB1 | 2VHK | 2VHR | 2VI3 | 2VK5 | 2VU6 | 2WFI | 2WFJ | 2WUR | 2X46 |
| 2XFR | 2XJP | 2XOD | 2XU3 | 2Z6W | 2ZPM | 2ZQ7 | 2ZQA | 3A02 | 3A4R | 3AJ4 | 3AKQ | 3AKS |
| 3AKT | 3B3R | 3BOI | 3C78 | 3CNJ | 3D1P | 3D43 | 3DHA | 3E4G | 3E6Z | 3EA6 | 3FSA | 3FYM |
| 3G9X | 3GHR | 3GHS | 3GOE | 3GYI | 3GYJ | 3H31 | 3HGP | 3I2Y | 3I30 | 3I34 | 3I37 | 3IP0 |
| 3JUD | 3K34 | 3KFF | 3KLR | 3KS3 | 3LEP | 3LL1 | 3LL2 | 3LZ5 | 3M4H | 3MFJ | 3 MI4 | 3NE0 |
| 3NED | 3NIR | 3O5P | 3O5Q | 3ODV | 3P8J | 3PUC | 3QL9 | 3QM5 | 3QM6 | 3QM8 | 3QM9 | 3QMA |
| 3QPA | 3QPC | 3S6E | 3SOJ | 3TEU | 3TRV | 3U2C | 3U7C | 3UI4 | 3UI6 | 3V1A | 3VHG | 3VIF |
| 3VIG | 3VII | 3VJQ | 3VN3 | 3VOR | 3WCQ | 3WGE | 3WGX | 3WL2 | 3WOU | 3WVM | 3X2G | 3X2M |
| 3X32 | 3X33 | 3X34 | 3X35 | 3ZR8 | 3ZSJ | 3ZSK | 3ZUC | 3ZZP | 4A02 | 4ACJ | 4AQO | 4AWS |
| 4AWT | 4BCT | 4BJ0 | 4BM8 | 4BVM | 4DP9 | 4DPB | 4DRQ | 4E3Y | 4EIC | 4EKF | 4F18 | 4FPT |
| 4FRC | 4FU5 | 4G78 | 4G9S | 4GA2 | 4GCA | 4GNR | 4HGU | 4HVU | 4HVW | 4I8G | 4I8H | 4I8J |
| 4I8K | 4I8L | 4IAU | 4IGS | 4J5E | 4JP6 | 4K8Y | 4KQP | 4LAU | 4LAZ | 4LB3 | 4LB4 | 4LBR |
| 4LBS | 4LFS | 4M7G | 4MTY | 4MZC | 4N1I | 4NPD | 4O6Q | 4O6U | 4O8H | 4OO4 | 4PRT | 4PSS |
| 4PSY | 4PTH | 4Q4G | 4Q78 | 4Q9W | 4QB3 | 4QBX | 4QXI | 4R5R | 4REK | 4RTZ | 4TJZ | 4TKB |
| 4TKH | 4TKJ | 4UA6 | 4UA7 | 4UA9 | 4UAA | 4UYR | 4WJX | 4WKA | 4WPK | 4X5P | 4X6H | 4XDX |
| 4XOJ | 4XZH | 4Y5L | 4Y9W | 4YXI | 4Z8J | 4ZC9 | 4ZGF | 4ZM7 | 5A8C | 5AVD | 5AVG | 5B28 |
| 5B5H | 5CMT | 5CTM | 5D66 | 5DGJ | 5DJ7 | 5DK1 | 5DKM | 5DP2 | 5DZE | 5E1K | 5E1N | 5E9N |
| 5EMB | 5F82 | 5FLK | 5HB7 | 5I5B | 5IG6 | 5II8 | 5IQN | 5IVN | 5JDK | 5JDT | 5JIG | 5JUG |
| 5KWM | 5KXV | 5LJT | 5LP9 | 5LXW | 5MAE | 5MAJ | 5MB5 | 5MEH | 5MK9 | 5MN1 | 5MNC | 5MNG |
| 5MNK | 5MNN | 5MTU | 5NFK | 5NFM | 5NW3 | 5O2X | 5O99 | 5OAV | 5OBK | 5OGO | 5OME | 5OTN |
| 5OU0 | 5OUJ | 5OUK | 5SV5 | 5TDA | 5TIF | 5U3A | 5VLE | 5WS7 | 5X9L | 5X9M | 5XBU | 5XP6 |
| 5XQU | 5XQV | 5XR0 | 5Y2R | 5Y2S | 5YCE | 5ZGE | 5ZGI | 5ZGW | 5ZGX | 5ZGY | 5ZGZ | 5ZIO |
| 5ZJ1 | 5ZJ7 | 5ZJ8 | 5ZJC | 6B00 | 6CNW | 6EQE | 6ETK | 6ETM | 6EUW | 6F1O | 6F7R | 6F81 |
| 6FMC | 6FO5 | 6FVI | 6G1I | 6HMD | 6HMQ | 6I74 | 6J93 | 6JJ1 | 6JJQ | 6MU9 | 6NFR | 6Q2Y |
| 6Q49 | 6Q4G | 6Q4H | 6Q6T | 6RGP | 6RHH | 6RHU | 6RHX | 6RI0 | 6RI6 | 6RI8 | 6RII | 7A3H |
| 8A3H |
The no-SHELX HR data set.
| 1EB6 | 1OAI | 1OD3 | 1R2M | 1SY2 | 1SY3 | 1T2H | 1UG6 | 1V0L | 1W0N | 1ZUU | 2ABB | 2AT3 |
| 2AT8 | 2AYW | 2CE2 | 2CNQ | 2DDX | 2GG2 | 2GGC | 2H3L | 2I4A | 2IXT | 2NLS | 2NRL | 2NX0 |
| 2OFR | 2P5K | 2PND | 2PNE | 2QCP | 2V1M | 2VHK | 2VHR | 2VI3 | 2VU6 | 2XFR | 2XU3 | 2Z6W |
| 2ZQ7 | 2ZQA | 3A02 | 3A4R | 3AJ4 | 3AKQ | 3AKS | 3AKT | 3BOI | 3C78 | 3E4G | 3E6Z | 3FSA |
| 3I2Y | 3I30 | 3I34 | 3I37 | 3IP0 | 3JUD | 3LL1 | 3LL2 | 3NED | 3NIR | 3O5Q | 3P8J | 3PUC |
| 3QL9 | 3QM5 | 3QM6 | 3QM8 | 3QM9 | 3QMA | 3QPA | 3QPC | 3TEU | 3UI4 | 3UI6 | 3V1A | 3VIF |
| 3VIG | 3VII | 3VN3 | 3WCQ | 3WGE | 3WGX | 3WL2 | 3WVM | 3X2G | 3X2M | 3ZR8 | 3ZUC | 4ACJ |
| 4AQO | 4BCT | 4BJ0 | 4BVM | 4DP9 | 4DPB | 4DRQ | 4E3Y | 4F18 | 4G9S | 4GA2 | 4GCA | 4GNR |
| 4HVU | 4HVW | 4IGS | 4J5E | 4K8Y | 4KQP | 4LAU | 4LAZ | 4LB3 | 4LB4 | 4LBR | 4LBS | 4M7G |
| 4MTY | 4MZC | 4O6Q | 4O6U | 4O8H | 4OO4 | 4PRT | 4Q4G | 4Q78 | 4Q9W | 4QB3 | 4QBX | 4QXI |
| 4R5R | 4REK | 4RTZ | 4TJZ | 4TKB | 4TKH | 4TKJ | 4WJX | 4WKA | 4WPK | 4X5P | 4X6H | 4XDX |
| 4XOJ | 4XZH | 4Y5L | 4Y9W | 4YXI | 4Z8J | 4ZC9 | 4ZM7 | 5A8C | 5AVD | 5AVG | 5B28 | 5B5H |
| 5CMT | 5CTM | 5DGJ | 5DJ7 | 5DK1 | 5DKM | 5DP2 | 5DZE | 5E1N | 5E9N | 5EMB | 5F82 | 5FLK |
| 5HB7 | 5I5B | 5IG6 | 5II8 | 5IQN | 5IVN | 5JDK | 5JDT | 5JIG | 5JUG | 5KWM | 5KXV | 5LJT |
| 5LP9 | 5LXW | 5MAE | 5MAJ | 5MB5 | 5MEH | 5MN1 | 5MNC | 5MNG | 5MNK | 5MNN | 5NFK | 5NFM |
| 5NW3 | 5O2X | 5O99 | 5OAV | 5OBK | 5OGO | 5OME | 5OTN | 5OU0 | 5OUJ | 5OUK | 5SV5 | 5TDA |
| 5TIF | 5U3A | 5WS7 | 5XBU | 5XP6 | 5XQU | 5XQV | 5XR0 | 5Y2R | 5Y2S | 5YCE | 5ZGE | 5ZGI |
| 5ZGW | 5ZGX | 5ZGY | 5ZGZ | 5ZIO | 5ZJ1 | 5ZJ7 | 5ZJ8 | 5ZJC | 6B00 | 6CNW | 6EQE | 6EUW |
| 6F1O | 6FMC | 6FO5 | 6FVI | 6G1I | 6HMD | 6HMQ | 6I74 | 6J93 | 6JJ1 | 6JJQ | 6MU9 | 6NFR |
| 6Q2Y | 6Q49 | 6Q4G | 6Q4H | 6Q6T | 6RGP | 6RHH | 6RHU | 6RHX | 6RI0 | 6RI6 | 6RI8 | 6RII |
| 7A3H | 8A3H |
Fig. 2The RDF of the water oxygen atoms in the sodium free HR data set (see Table 3) described in Ref. [1].
The sodium free HR data set.
| 1B0Y | 1DY5 | 1EA7 | 1F94 | 1G4I | 1I1W | 1IQZ | 1IR0 | 1J0P | 1JFB | 1K6U | 1LNI | 1LUG |
| 1M1Q | 1 MC2 | 1MJ5 | 1MUW | 1N1P | 1N4W | 1N9B | 1PWM | 1R2M | 1R6J | 1SSX | 1SY2 | 1SY3 |
| 1TG0 | 1TT8 | 1UG6 | 1V0L | 1VB0 | 1VL9 | 1W0N | 1X8P | 1XMK | 1YWA | 1YWB | 1YWC | 1Z8A |
| 1ZZK | 2A6Z | 2AGT | 2AT3 | 2AT8 | 2AYW | 2BF6 | 2CNQ | 2EWI | 2EWK | 2FDN | 2FMA | 2FOU |
| 2FVY | 2GEW | 2GG2 | 2GGC | 2H3L | 2H5C | 2H5D | 2JFR | 2O9S | 2OFR | 2P74 | 2PEV | 2PF8 |
| 2PFH | 2PVE | 2PZN | 2WFI | 2WFJ | 2WUR | 2XFR | 2XU3 | 2Z6W | 2ZPM | 3A02 | 3A4R | 3AKQ |
| 3C78 | 3D1P | 3D43 | 3DHA | 3E4G | 3EA6 | 3FSA | 3FYM | 3GHR | 3GHS | 3GOE | 3I2Y | 3I30 |
| 3I34 | 3I37 | 3JUD | 3K34 | 3KFF | 3LEP | 3LZ5 | 3M4H | 3NED | 3NIR | 3O5P | 3O5Q | 3PUC |
| 3QL9 | 3QPA | 3QPC | 3SOJ | 3TRV | 3U2C | 3U7C | 3UI4 | 3UI6 | 3V1A | 3VHG | 3VIF | 3VIG |
| 3VII | 3VJQ | 3VOR | 3WCQ | 3WGE | 3WGX | 3WOU | 3WVM | 3X2G | 3X2M | 3ZR8 | 3ZUC | 3ZZP |
| 4ACJ | 4AQO | 4AWS | 4AWT | 4BCT | 4BJ0 | 4DRQ | 4E3Y | 4EIC | 4F18 | 4FU5 | 4G78 | 4G9S |
| 4GA2 | 4GCA | 4GNR | 4IAU | 4IGS | 4J5E | 4JP6 | 4LAU | 4LAZ | 4LB3 | 4LB4 | 4LBR | 4LBS |
| 4M7G | 4O6U | 4PRT | 4PSS | 4PSY | 4PTH | 4Q9W | 4QB3 | 4QBX | 4QXI | 4RTZ | 4TJZ | 4TKB |
| 4TKH | 4TKJ | 4UA6 | 4UA7 | 4UA9 | 4UAA | 4XZH | 4Y9W | 4YXI | 4Z8J | 4ZC9 | 4ZGF | 4ZM7 |
| 5A8C | 5B28 | 5B5H | 5DGJ | 5DJ7 | 5DK1 | 5DP2 | 5E9N | 5EMB | 5F82 | 5FLK | 5HB7 | 5IVN |
| 5JDK | 5JIG | 5KWM | 5MAE | 5MAJ | 5MEH | 5MN1 | 5MNC | 5MNG | 5MNK | 5MNN | 5NFK | 5NFM |
| 5O2X | 5O99 | 5OTN | 5OU0 | 5OUJ | 5OUK | 5TDA | 5TIF | 5VLE | 5XBU | 5XP6 | 5YCE | 5ZGE |
| 5ZGY | 5ZGZ | 5ZIO | 5ZJ1 | 5ZJ7 | 5ZJ8 | 5ZJC | 6B00 | 6F7R | 6F81 | 6FMC | 6G1I | 6MU9 |
| 6NFR | 6Q2Y |
Fig. 3The RDF of the water oxygen atoms in the sodium free, not refined by SHELX subset (see Table 4) described in Ref. [1].
The sodium free, no-SHELX HR data set.
| 1R2M | 1SY2 | 1SY3 | 1UG6 | 1V0L | 1W0N | 2AT3 | 2AT8 | 2AYW | 2CNQ | 2GG2 | 2GGC | 2H3L |
| 2OFR | 2XFR | 2XU3 | 2Z6W | 3A02 | 3A4R | 3AKQ | 3C78 | 3E4G | 3FSA | 3I2Y | 3I30 | 3I34 |
| 3I37 | 3JUD | 3NED | 3NIR | 3O5Q | 3PUC | 3QL9 | 3QPA | 3QPC | 3UI4 | 3UI6 | 3V1A | 3VIF |
| 3VIG | 3VII | 3WCQ | 3WGE | 3WGX | 3WVM | 3X2G | 3X2M | 3ZR8 | 3ZUC | 4ACJ | 4AQO | 4BCT |
| 4BJ0 | 4DRQ | 4E3Y | 4F18 | 4G9S | 4GA2 | 4GCA | 4GNR | 4IGS | 4J5E | 4LAU | 4LAZ | 4LB3 |
| 4LB4 | 4LBR | 4LBS | 4M7G | 4O6U | 4PRT | 4Q9W | 4QB3 | 4QBX | 4QXI | 4RTZ | 4TJZ | 4TKB |
| 4TKH | 4TKJ | 4XZH | 4Y9W | 4YXI | 4Z8J | 4ZC9 | 4ZM7 | 5A8C | 5B28 | 5B5H | 5DGJ | 5DJ7 |
| 5DK1 | 5DP2 | 5E9N | 5EMB | 5F82 | 5FLK | 5HB7 | 5IVN | 5JDK | 5JIG | 5KWM | 5MAE | 5MAJ |
| 5MEH | 5MN1 | 5MNC | 5MNG | 5MNK | 5MNN | 5NFK | 5NFM | 5O2X | 5O99 | 5OTN | 5OU0 | 5OUJ |
| 5OUK | 5TDA | 5TIF | 5XBU | 5XP6 | 5YCE | 5ZGE | 5ZGY | 5ZGZ | 5ZIO | 5ZJ1 | 5ZJ7 | 5ZJ8 |
| 5ZJC | 6B00 | 6FMC | 6G1I | 6MU9 | 6NFR | 6Q2Y |
Fig. 4Scatter plot of the RDFs relative to the no SHELX subset (horizontal axis) and the sodium free, no SHELX subset (vertical axis) described in Ref. [1]. Solid black line is the best fit.
Fig. 5Number of crystallographic water molecules in CcO subunit I structures as a function of the resolution of the relative PDB entry. Structures diffracted at 100 K and 50 K have been considered for this analysis.
Fig. 6Distribution of the Euclidean oxygen-oxygen (O–O) distances of water molecules in the CcO subunit I data set. Data are obtained from the 50 K structures. Only water pairs with O–O distances <5 Å are reported.
Fig. 7Distribution of the Euclidean oxygen-oxygen (O–O) distances of water molecules in the CcO subunit I data set. Data are obtained from the 100 K structures. Only water pairs with O–O distances <5 Å are reported.
The number of water molecules in each considered hCA II is reported (#H2O). The table reports also the number of calculated oxygen-oxygen Euclidean distances (# O–O) and the number of oxygen-oxygen pairs considered as putative, charged (protonated or deprotonated), water clusters as defined in Ref. [1] (# OHO).
| 4MTY | 4Q78 | 4YXI | 5LJT | 5OGO | 5Y2R | 5Y2S | 6B00 | |
|---|---|---|---|---|---|---|---|---|
| # H2O | 398 | 214 | 264 | 439 | 333 | 441 | 429 | 409 |
| # O–O | 79003 | 22791 | 34716 | 96141 | 55278 | 97020 | 91806 | 83436 |
| # OHO | 3 | 3 | 1 | 1 | 9 | 12 | 17 | 10 |
The number of water molecules in each considered CcO subunit I is reported (#H2O). The table reports also the number of calculated oxygen-oxygen Euclidean distances (# O–O) and the number of oxygen-oxygen pairs considered as putative, charged (protonated or deprotonated), water clusters as defined in Ref. [1] (# OHO).
| 5B1A_A | 5B1A_N | 5B1B_A | 5B1B_N | 5B3S_A | 5B3S_N | 5XDQ_A | 5XDQ_N | 5ZCP_A | 5ZCP_N | 5ZCQ_A | 5ZCQ_N | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| # H2O | 297 | 290 | 289 | 286 | 273 | 278 | 257 | 267 | 235 | 203 | 228 | 215 |
| # O–O | 43956 | 41905 | 41616 | 40755 | 37128 | 38503 | 32896 | 35511 | 27495 | 20503 | 25878 | 23005 |
| # OHO | 3 | 1 | 3 | 2 | 9 | 5 | 1 | 1 | 3 | 3 | 4 | 2 |
The RDF pseudo-code.
| set num_mol [molinfo num] |
Specifications Table
| Subject | Biochemistry |
| Specific subject area | Biochemistry, biophysics, structural biology, bioenergetics. |
| Type of data | Table |
| How data were acquired | Survey of the protein crystal structures obtained by X-ray diffraction deposited in the Protein Data Bank (PDB). Input data for analysis were obtained as pdb, cif or ccp4 files from public databases. |
| Data format | Raw: pdb files, ccp4 files, cif files and text files. |
| Parameters for data collection | Raw pdb files were checked for quality (resolution). Atoms in cif files were compared with the electron density maps in ccp4 files. |
| Description of data collection | Raw data were analyzed by different computational protocols. |
| Data source location | Department of Basic Medical Sciences, Neurosciences and Sense Organs (SMBNOS), Bari 70124, Italy. |
| Data accessibility | All the data produced in this work are available within the article. |
| Related research article | Luigi Leonardo Palese, Oxygen-oxygen distances in protein-bound crystallographic water suggest the presence of protonated clusters, Biochemical et Biophysical Acta - General Subjects, |
X-ray crystallography has shown that proteins contain numerous water molecules The role of these protein-bound water molecules is still not fully understood Water molecules in large data sets of high resolution structures are reported Positions of candidate protonated water clusters in some model proteins are reported |