| Literature DB >> 25114050 |
Amita Barik1, Ranjit Prasad Bahadur2.
Abstract
We investigate the role of water molecules in 89 protein-RNA complexes taken from the Protein Data Bank. Those with tRNA and single-stranded RNA are less hydrated than with duplex or ribosomal proteins. Protein-RNA interfaces are hydrated less than protein-DNA interfaces, but more than protein-protein interfaces. Majority of the waters at protein-RNA interfaces makes multiple H-bonds; however, a fraction do not make any. Those making H-bonds have preferences for the polar groups of RNA than its partner protein. The spatial distribution of waters makes interfaces with ribosomal proteins and single-stranded RNA relatively 'dry' than interfaces with tRNA and duplex RNA. In contrast to protein-DNA interfaces, mainly due to the presence of the 2'OH, the ribose in protein-RNA interfaces is hydrated more than the phosphate or the bases. The minor groove in protein-RNA interfaces is hydrated more than the major groove, while in protein-DNA interfaces it is reverse. The strands make the highest number of water-mediated H-bonds per unit interface area followed by the helices and the non-regular structures. The preserved waters at protein-RNA interfaces make higher number of H-bonds than the other waters. Preserved waters contribute toward the affinity in protein-RNA recognition and should be carefully treated while engineering protein-RNA interfaces.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25114050 PMCID: PMC4150782 DOI: 10.1093/nar/gku679
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Statistics of interface water molecules
| Interface statistics | Protein–RNA | Protein–DNAa | Protein–proteinb | ||||
|---|---|---|---|---|---|---|---|
| tRNA | ribosomal | duplex | single-stranded | All classes | |||
| Number of complexes | 12 | 5 | 27 | 45 | 89 | 115 | 115 |
| Interface area B (Å2) | 4185 ± 1226 | 1725 ± 434 | 2589 ± 1052 | 2004 ± 1219 | 2460 ± 1347 | 3137 ± 1350 | 1886 ± 704 |
| Non-polar area fraction | |||||||
| Protein | 56 ± 3 | 52 ± 11 | 54 ± 8 | 57 ± 8 | 56 ± 8 | 54 ± 7 | 60 ± 6 |
| RNA | 35 ± 2 | 29 ± 7 | 32 ± 5 | 34 ± 6 | 33 ± 6 | 37 ± 5 | - |
| Mean number of bound waterd | 301 ± 195 | 153 ± 59 | 208 ± 108 | 208 ± 122 | 217 ± 131 | 232 ± 174 | 218 ± 159 |
| Interface watere | |||||||
| Range | 8–116 | 12–49 | 9–70 | 2–51 | 2–116 | 2–174 | 1–58 |
| Mean number per interface | 39 ± 29 | 25 ± 15 | 32 ± 18 | 22 ± 12 | 27 ± 18 | 44 ± 32 | 20 ± 14 |
| Per 1000 Å2 of B | 9.3 | 14.5 | 12.4 | 11.0 | 11.0 | 14.0 | 10.6 |
| Water-mediated H-bondsf | |||||||
| Range | 16–198 | 23–78 | 14–134 | 5–130 | 5–198 | 2–244 | 0–101 |
| Mean number per interface | 76 ± 50 | 45 ± 22 | 60 ± 34 | 41 ± 26 | 52 ± 34 | 70 ± 47 | 31 ± 22 |
| Per 1000 Å2 of B | 18.2 | 26.1 | 23.2 | 20.5 | 21.1 | 22.3 | 16.4 |
| Bridging water moleculesg | |||||||
| Range | 3–34 | 4–12 | 2–23 | 0–26 | 0–34 | 0–60 | 0–15 |
| Mean number per interface | 15 ± 9 | 8 ± 3 | 12 ± 7 | 8 ± 5 | 10 ± 7 | 15 ± 11 | 5 ± 3 |
| Per 1000 Å2 of B | 3.6 | 4.6 | 4.6 | 4.0 | 4.1 | 4.8 | 2.7 |
| Preserved water moleculesh | |||||||
| Range | 1–12 | 4–6 | 3–14 | 0–17 | 0–17 | - | - |
| Mean number per interface | 4 | 5 | 7 | 6 | 6 | - | - |
| Per 1000 Å2 of B | 2.2 | 5.8 | 4.7 | 6.8 | 5.2 | - | - |
| Hydration pattern | |||||||
| Mean | 0.86 ± 0.09 | 1.17 ± 0.05 | 0.98 ± 0.12 | 1.07 ± 0.17 | 1.02 ± 0.16 | 1.01 ± 0.14 | 1.06 ± 0.16 |
aParameters are calculated on a protein–DNA dataset taken from (45) with resolution 2.6 Å or better, and having at least one interface water molecule.
bParameters are calculated on a dataset of protein–protein complexes taken from the docking benchmark 4.0 (46) with resolution 2.6 Å or better, and having at least one interface water molecule.
cPercentage of area contributed by the non-polar groups (all carbon-containing groups).
dAll the crystallographic water molecules bound to the complex.
eWater molecules <4.5 Å from atoms of both protein and RNA chains.
fH-bonds are calculated using the program HBPLUS (42).
gWater molecules making H-bonds with the polar groups of both protein and RNA chains.
hAt least one water-mediated H-bond with the same atom of either protein or RNA chain is found in both bound and unbound form. The mean number and the density are calculated over all the preserved water molecules in the dataset.
Figure 1.The number of interface water molecules is plotted against the interface area B in protein–RNA complexes.
Figure 2.Number of interface water molecules making zero to four H-bonds to the protein or the RNA component. The bars under ‘0’ H-bond represent the number of water molecules making no H-bond with one partner, but they make H-bonds with the other partner.
Frequency of interface water making multiple H-bonds
| Complexes | tRNA | ribosomal | duplex | single-stranded | All classes | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Protein | RNA | Protein | RNA | Protein | RNA | Protein | RNA | Protein | RNA | |
| H-bond | ||||||||||
| Frequencya | ||||||||||
| Zerob | 11.4 | 6.3 | 9.0 | 3.0 | 9.2 | 4.6 | 5.4 | 4.0 | 7.6 | 4.4 |
| One | 14.9 | 13.2 | 7.4 | 8.2 | 11.4 | 12.0 | 7.5 | 8.1 | 9.7 | 10.0 |
| Two | 5.3 | 8.6 | 2.8 | 5.0 | 4.0 | 5.1 | 3.4 | 4.2 | 3.8 | 5.1 |
| Three | 1.3 | 3.7 | 0.6 | 2.0 | 1.1 | 2.9 | 1.3 | 1.0 | 1.2 | 2.0 |
| Four | 0.1 | 1.2 | 0.0 | 1.6 | 0.3 | 1.3 | 0.2 | 0.6 | 0.2 | 0.9 |
aAverage number of interface water molecules making up to four H-bonds per interface. Total 1998 interface water molecules making at least one H-bond.
bFrequency of interface waters that do not make any H-bond with the protein, but they make H-bonds with the RNA and vice versa.
Chemical composition of H-bonds
| H-bonds | Protein–RNA | Protein–DNAa | ||
|---|---|---|---|---|
| Interface water-mediated | Direct | Interface water-mediated | Direct | |
| Total number | 4625 | 1740 | 8008 | 2585 |
| With protein chain | 1942 | - | 4087 | - |
| With nucleotide chain | 2683 | - | 3921 | - |
| Protein chemical group (%)b | ||||
| Main chain O | 24 | 12 | 21 | 3 |
| Main chain N | 10 | 14 | 10 | 23 |
| Side chain groups | ||||
| Charged | 31 | 43 | 32 | 37 |
| Neutral | 35 | 31 | 37 | 37 |
| Nucleic acid chemical group (%) | ||||
| Phosphatec | 30 | 36 | 56 | 70 |
| Sugard | 45 | 31 | 14 | 6 |
| Base | 25 | 33 | 30 | 25 |
| Guanine | 7 | 10 | 9 | 12 |
| Adenine | 6 | 7 | 9 | 3 |
| Cytosine | 4 | 6 | 6 | 6 |
| Uracil/Thymine | 8 | 10 | 6 | 4 |
aCalculated on a protein–DNA dataset taken from (45) with resolution 2.6 Å or better, and having at least one interface water molecule.
bCharged side chain: N of Arg, Lys and O of Asp, Glu. Neutral side chain: N of Asn, Gln, His, Trp; O of Asn, Gln; OH of Ser, Thr, Tyr and S of Cys, Met.
cPhosphate includes O1P, O2P and P atoms.
dOxygen atoms O2′, O3′, O4′ and O5′ are attributed to the sugar moiety.
Figure 3.Percentage contribution of (A) amino acid residues, and (B) nucleotide bases to the interface water-mediated H-bonds. One-letter codes of the amino acid residues along with their contribution as the donor (D) or as the acceptor (A) are given along the abscissa in (A). One-letter codes of the nucleotide bases and their contribution as the donor (D) or as the acceptor (A) are given along the abscissa in (B).
Figure 4.Number of interface water–RNA H-bonds in different structural classes. The phosphate includes O1P and O2P; the sugar moiety includes O2′, O3′, O4′ and O5′.
H-bonds with the 2′OH
| tRNA | ribosomal | duplex | single-stranded | Total | |
|---|---|---|---|---|---|
| Interface water-mediated H-bonds | |||||
| Overall | 560 | 153 | 983 | 987 | 2683 |
| No. with ribose sugar | 309 | 58 | 483 | 369 | 1219 |
| No. with 2′OH | 210 | 40 | 322 | 247 | 819 |
| % with respect to all | 38 | 26 | 33 | 25 | 31 |
| % within sugar moietya | 68 | 69 | 67 | 67 | 67 |
| Direct H-bonds | |||||
| Overall | 352 | 80 | 593 | 715 | 1740 |
| No. with ribose sugar | 114 | 27 | 203 | 190 | 534 |
| No. with 2′OH | 86 | 23 | 151 | 134 | 394 |
| % with respect to all | 24 | 29 | 25 | 19 | 23 |
| % within sugar moietya | 75 | 85 | 74 | 71 | 74 |
aContribution of the 2′OH with respect to all the oxygen atoms present in the ribose sugar. Here, O3′, O4′ and O5′ oxygen atoms are attributed to the sugar moiety.
Figure 5.Hydration pattern in protein–RNA interfaces. In each diagram, the protein chain is shown as molecular surface and the RNA chain is shown as ribbon. The interface region is colored blue, and the interface water molecules are represented by red sphere. (A) A ‘wet’ interface between arginyl-tRNA synthetase and its cognate tRNA (1F7U; dr = 0.84). (B) A ‘dry’ interface between ribosomal protein L1 and mRNA (2HW8; dr = 1.18). (C) A ‘wet’ interface between FAB and duplex RNA (2R8S; dr = 0.95). (D) A ‘dry’ interface between CspB and single-stranded RNA (3PF4; dr = 1.46).
Figure 6.Preserved interface water molecules in a protein–RNA recognition site. In the top left panel, the bound complex between ribosomal protein L1 and mRNA (2HW8) with interface waters (in red) and preserved waters (in magenta) are shown. In the top right panel, the unbound protein and the four preserved waters are shown. Protein and RNA chains are shown in gray and green color, respectively. The panels below show the conservation of four interface waters. These preserved waters making same H-bond in the bound and unbound forms of the protein.
Hydration of the protein secondary structural elements
| Interfaces | Water-mediated H-bonds | Direct H-bonds | ||||
|---|---|---|---|---|---|---|
| Helices | Strands | Other | Helices | Strands | Other | |
| tRNA | 16.8 | 9.1 | 13.7 | 12.4 | 20.2 | 15.6 |
| ribosomal | 25.6 | 21.1 | 9.7 | 31.4 | 15.3 | 16.4 |
| duplex | 22.4 | 21.4 | 16.0 | 19.3 | 20.7 | 15.2 |
| single-stranded | 19.2 | 25.2 | 20.6 | 15.2 | 19.1 | 18.5 |
| All classes | 19.6 | 21.2 | 16.9 | 16.0 | 19.4 | 16.6 |
Number of H-bonds per 1000 Å2 of B contributed by the helices, the strands or the other (non-regular secondary structures).
Hydration of the major and the minor groove
| H-bond percentage | Protein–RNAa | Protein–DNAb |
|---|---|---|
| Major groove atoms | 40 | 66 |
| Minor groove atoms | 60 | 33 |
Percentage of interface water-mediated H-bonds with the major and the minor groove atoms.
aCalculated on 27 duplex RNA (Class C).
bCalculated on a dataset of 115 protein–DNA complexes taken from (45).