| Literature DB >> 20944227 |
Abhinav Kumar1, Hsiu Ju Chiu, Herbert L Axelrod, Andrew Morse, Marc André Elsliger, Ian A Wilson, Ashley Deacon.
Abstract
Approximately 65% of PSI structures report some type of ligand(s) that is bound in the crystal structure. Here, a description is given of how such ligands are handled and analyzed at the JCSG and a survey of the types, variety and frequency of ligands that are observed in the PSI structures is also compiled and analyzed, including illustrations of how these bound ligands have provided functional clues for annotation of proteins with little or no previous experimental characterization. Furthermore, a web server was developed as a tool to mine and analyze the PSI structures for bound ligands and other identifying features.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20944227 PMCID: PMC2954221 DOI: 10.1107/S1744309110008092
Source DB: PubMed Journal: Acta Crystallogr Sect F Struct Biol Cryst Commun ISSN: 1744-3091
Figure 1The Ligand Search Server and an example of its use. (a) The server’s main page showing the search form and search example looking for PSI structures that contain either FMN or PLP bound to proteins from Thermotoga maritima. Tips on how to use the interface are displayed on the right and a partial list of structures is listed at the bottom. (b) A summary of all of the ligands bound to the PSI structures is displayed when the ‘Summary’ button is clicked.
Summary of ligands found in PSI and JCSG structures
| Type | % observed in PSI structures | % observed in JCSG structures | No. of unique compounds/entities |
|---|---|---|---|
| Ligands | 12.3 | 15.6 | 285 |
| Peptides | 1.2 | 0.6 | |
| Cofactors | 9.4 | 10.9 | 22 |
| Metals | 24.7 | 26.4 | 24 |
| Non-metals | 27.2 | 40.7 | 21 |
| Organics | 3.0 | 3.9 | 23 |
| Buffers | 10.7 | 19.3 | 14 |
| Precipitants | 5.2 | 14.2 | 14 |
| Cryoprotectants | 21.3 | 51.6 | 3 |
| Overall | 65.0 | 85.2 |
Figure 2Percentage of PSI structures that have any small-molecule ligand bound to them. The small molecules are categorized by their types.
Figure 3Distribution of various ligands by category and relative frequency. Only the most common of these small molecules are shown. The names of the ligands follow the IDs used in the PDB and their full names can be obtained from the Ligand Expo Server (http://ligand-depot.rcsb.org/ld-search.html). (a) The ‘Ligands’ category includes biological ligands, such as substrates/products or their analogs. (b) The ‘Cofactors’ category includes various cofactors of enzymes but excludes ions, which are shown in the ‘Metal ions’ (c) and ‘Non-metal ions’ (d) categories.
Unique ligands found in PSI structures
| PDB code | Ligand name | Ligand ID | PSI center |
|---|---|---|---|
| Didecyldimethylammonium | 10A | TBSGC | |
| Allantoate ion | 1AL | NYSGXRC | |
| 3-Hydroxypropanoic acid | 3OH | MCSG | |
| (2 | 3SL | JCSG | |
| 2-[(2 | 8PP | NYSGXRC | |
| β- | ABF | MCSG | |
| Azelaic acid | AZ1 | NYSGXRC | |
| Co-5-methoxybenzimidazolylcobamide | B1M | SECSG | |
| B33 | NYSGXRC | ||
| Coformycin 5′-phosphate | CF5 | CESG | |
| 3-[(4 | DI6 | NYSGXRC | |
| 6-Diazenyl-5-oxo- | DON | MCSG | |
| 6-Fluoro- | FT6 | NESG | |
| 5-{[4-(9 | GEQ | TBSGC | |
| 1 | I3A | JCSG | |
| M7P | MCSG | ||
| 1-Methyl-1,3-dihydro-2 | MMZ | NYSGXRC | |
| (4-Amino-2-methylpyrimidin-5-yl)methyl dihydrogen phosphate | MP5 | NESG | |
| NIG | NYSGXRC | ||
| 10-Oxohexadecanoic acid | OHA | JCSG | |
| Pantoyl adenylate | PAJ | TBSGC | |
| 5-Phosphoribosyl-1-(β-methylene) pyrophosphate | PPC | TBSGC | |
| 2-Amino-4 | STH | NESG | |
| THT | TBSGC | ||
| 3-Hydroxy-2-[(3-hydroxy-2-methyl-5-phosphonooxymethylpyridin-4-ylmethyl)-amino]butyric acid | TLP | NYSGXRC |
Figure 4Unknown ligands (UNL) in a few PSI structures. The UNL atoms are represented as red spheres enveloped by electron-density mesh (2F o − F c density contoured at 1σ level above the mean) and surrounded by the protein rendered in cartoon representation. In many cases, the ligand could have been assigned as one or a few potential compounds, but is still annotated as a UNL since we have no definitive proof of its identity. (a) A protein of unknown function, NP_823353.1 from Streptomyces avermitilis, at 1.45 Å resolution. (b) A protein of unknown function possessing a ferritin-like fold (YP_832262.1; PDB code 3ez0) from Arthrobacter sp. Fb24 at 2.33 Å resolution. (c) A protein of unknown function from Geobacter sulfurreducens possessing a GGDEF domain (NP_951600.1; PDB code 3ezu) at 1.95 Å resolution. (d) Phzb2 (NP_250591.1; PDB code 3ff0) with a cystatin-like fold and an unknown function in phenazine biosynthesis from Pseudomonas aeruginosa at 1.90 Å resolution.
Ligands bound to proteins of unknown function, excluding common crystallization reagents and cryoprotectants
| Ligand | Count | PDB codes |
|---|---|---|
| UNL | 31 | |
| ZN | 29 | |
| NA | 29 | |
| MG | 28 | |
| CA | 19 | |
| UNX | 11 | |
| NI | 10 | |
| FE | 6 | |
| K | 5 | |
| NO3 | 4 | |
| MN | 4 | |
| COA | 4 | |
| PT | 2 | |
| PLM | 2 | |
| HG | 2 | |
| FMN | 2 | |
| SNN | 1 | |
| SIN | 1 | |
| SE | 1 | |
| SAM | 1 | |
| SAH | 1 | |
| RIP | 1 | |
| NDP | 1 | |
| NBZ | 1 | |
| NAP | 1 | |
| NAD | 1 | |
| HXA | 1 | |
| GLC | 1 | |
| GDP | 1 | |
| CO3 | 1 | |
| CO | 1 | |
| BR | 1 | |
| BEZ | 1 | |
| AU | 1 | |
| ATP | 1 |
The names of the ligands follow the IDs used in the PDB; their full names can be obtained from the Ligand Expo server (http://ligand-depot.rcsb.org/ld-search.html).
Figure 5Common reagents bound in the active sites of proteins. The protein structures are shown in cartoon representation and colored green or gray. The bound ligands are drawn as sticks and are colored yellow (carbon), red (oxygen) and blue (nitrogen). The interacting residues are also drawn as sticks with their C atoms colored cyan. (a) An SO4 2− ion bound in the active site of protein YP_001181608.1 (PDB code 3gxg). (b) A citrate molecule bound to YP_001089791.1 (PDB code 3g68) helped to identify the potential active site and was supported by substrates (gray) bound to the same location in structurally similar proteins (gray; PDB codes 1mos, 2bpl, 2poc and 2v4m). (c) A Tris molecule bound in the active site of YP_001304206.1 (PDB code 3h3l).
Frequency of cryoprotectant reagents found as bound ligands in JCSG structures
Values in parentheses correspond to occurances in all of the other structures in the PDB. These numbers are obtained from structures that report the use of these compounds in the crystal-growth conditions in their headers.
| Cryo reagent | No. of times used in crystallization/cryoprotective conditions | No. of times observed in structures | % observed |
|---|---|---|---|
| EDO | 348 (888) | 284 (184) | 81.6 (20.7) |
| GOL | 302 (3079) | 167 (723) | 55.3 (23.5) |
| MPD | 106 (2558) | 47 (373) | 44.3 (14.6) |
| PEG 200 | 78 | 44 | 56.4 |
| PEG 400 | 70 | 21 | 30.0 |
Three-letter codes: EDO, ethylene glycol; GOL, glycerol; MPD, 2-methyl-2,4-pentanediol.