| Literature DB >> 18276647 |
Abstract
An RNA-binding protein places a surface helix, beta-ribbon, or loop in an RNA helix groove and/or uses a cavity to accommodate unstacked bases. Hence, our strategy for predicting RNA-binding residues is based on detecting a surface patch and a disparate cleft. These were generated and scored according to the gas-phase electrostatic energy change upon mutating each residue to Asp(-)/Glu(-) and each residue's relative conservation. The method requires as input the protein structure and sufficient homologous sequences to define each residue's relative conservation. It yields as output a priority list of surface patch residues followed by a backup list of surface cleft residues distant from the patch residues for experimental testing of RNA binding. Among the 69 structurally non-homologous proteins tested, 81% possess a RNA-binding site with at least 70% of the maximum number of true positives in randomly generated patches of the same size as the predicted site; only two proteins did not contain any true RNA-binding residues in both predicted regions. Regardless of the protein conformational changes upon RNA-binding, the prediction accuracies based on the RNA-free/bound protein structures were found to be comparable and their binding sites overlapped as long as there are no disordered RNA-binding regions in the free structure that are ordered in the corresponding RNA-bound protein structure.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18276647 PMCID: PMC2275128 DOI: 10.1093/nar/gkn008
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
RNA-binding residue predictions based on the 3D structures of 69 structurally non-homologous protein–RNA complexes
aPDB entry of the RNA-bound protein structure followed by the protein chain.
bThe number of aa residues in the protein chain.
cThe number of true RNA-binding residues based on the protein/RNA complex structure.
dThe patch or cleft is generated and scored as described in the Materials and Methods section.
eThe number of predicted RNA-binding residues, which is equal to the number of solvent accessible residues in the patch/cleft.
fThe number of true positives or correctly predicted RNA-binding residues in the patch/cleft.
gThe maximum number of true-positive RNA-binding residues among all the randomly generated patches. Proteins with nTP/nmax < 0.7 in both predicted regions are highlighted by the gray background.
hThe random pick value (RPV), is the fraction of random patches with true-positive RNA-binding residues ≥nTP.
iThe top-ranked patch is merged with nearby top-ranking clefts.
jFor these small proteins, two disparate RNA-binding sites could not be found, and only a single RNA-binding site was predicted.
RNA-binding residue predictions based on the 3D structures of 18 structurally non-homologous RNA-free and RNA-bound proteinsa
aSee footnotes to Table 1, except that the gray background highlights predicted regions with nTP/nmax < 0.7.
bPDB entry of the RNA-free protein structure.
cPDB entry of the RNA-bound protein structure.
dThe root mean square deviation of the Cα atoms in the RNA-free protein structure relative to the respective RNA-bound protein structure.
eThe ratio of nTP to nmax in the patch or cleft predicted using the RNA-free protein structure.
fThe ratio of nTP to nmax in the patch or cleft predicted using the RNA-bound protein structure.
gThe overlapping fraction, foverlap, is computed according to Equation 5.
hThe top-ranked patch is merged with nearby top-ranking clefts.
iThe dash sign means that another disparate RNA-binding site could not be found.
Figure 1.Predicted RNA-binding residues in the top merged patch+cleft (in blue) derived using the RNA-free elongation factor tu structure (1tui-A). The ‘true’ RNA-binding residues derived from the respective RNA-bound structure (1b23-P) structure are in yellow, while those that are correctly predicted are in cyan.
Predicted RNA-binding residues in aminoacyl–tRNA–synthetases
| PDB-chain | Binding site | Predicted RNA-binding residues |
|---|---|---|
| 1h3f-A | Patch | I168−Y175, A178, Q179 |
| Cleft | L42, | |
| 1hc7-A | Patch | S15, L19, Y30–T36, S88, E90, L91, |
| Cleft | I37–V39, Y44, L70, F71, F87, P89, A92, V93, V108, N139, V141, W143, E155, L157 | |
| 1j09-A | Patch | L235, R237, N238, P239, D240, K241, T242, K243, I244, S245, K246, R247, K248, S249, H250 |
| Cleft | A7, |
aResidues in bold underlined are involved in binding ATP or tyrosinol based on the corresponding structure of tyrosyl-tRNA synthetase complexed with its cognate tRNATyr, ATP and tyrosinol (1h3e-A).
bResidues in bold underlined are involved in binding ATP or prolinol based on the corresponding structure of prolyl-tRNA synthetase complexed with its cognate tRNAPro, ATP and prolinol (1h4q-A).
cResidues in bold underlined are involved in binding the tRNA 3′-terminal CytCytAde based on the structure of glutamyl-tRNA synthetase complexed with its cognate tRNAGlu and glutamol-AMP (1n78-A).