| Literature DB >> 19523227 |
Jianwen Fang1, Yakov M Koen, Robert P Hanzlik.
Abstract
BACKGROUND: Protein covalent binding by reactive metabolites of drugs, chemicals and natural products can lead to acute cytotoxicity. Recent rapid progress in reactive metabolite target protein identification has shown that adduction is surprisingly selective and inspired the hope that analysis of target proteins might reveal protein factors that differentiate target- vs. non-target proteins and illuminate mechanisms connecting covalent binding to cytotoxicity.Entities:
Year: 2009 PMID: 19523227 PMCID: PMC2711050 DOI: 10.1186/1472-6769-9-5
Source DB: PubMed Journal: BMC Chem Biol ISSN: 1472-6769
Sequence parameters
| Amino acid residue composition | 40 | In-house script | Numbers and percentages of 20 amino acid residues |
| Numbers and percentages of positive residues, negative residues, all charged residues, net charges | 8 | In-house script | |
| Sequence length | 1 | ||
| Predicted isoelectric point | 1 | ProtParam | [ |
| Number of C/H/N/O/S atoms | 5 | ProtParam | [ |
| Instability index | 1 | ProtParam | Predicted based on dipeptide composition [ |
| Instability class | 1 | ProtParam | Proteins with predicted in vivo half life ≥ 40 hrs considered as stable [ |
| Aliphatic index | 1 | ProtParam | [ |
| Gravy hydropathy index | 1 | ProtParam | [ |
| Predicted percentage of sheet, helix and coil | 3 | psipred | [ |
| Predicted percentage of buried/exposed residues | 2 | Accpro | [ |
| Predicted disordered regions including length of the longest coil, length of all coils, percentage of the longest coil, percentage of all coils; and corresponding features for rem465 and hotloop | 12 | disembl | [ |
| Tripeptide features | 125 | In-house script |
Representative Gene Ontology terms significantly enriched in target proteins.a
| Category Description | |||||
| 3824 | catalytic activity | 1.45E-19 | 115 | 5085 | 0.023 |
| 16491 | oxidoreductase activity | 2.76E-12 | 40 | 916 | 0.044 |
| 16209 | antioxidant activity | 2.93E-09 | 11 | 57 | 0.193 |
| 51920 | peroxiredoxin activity | 9.34E-08 | 5 | 6 | 0.833 |
| 16684 | oxidoreductase activity, acting on peroxide as acceptor | 5.71E-07 | 8 | 40 | 0.20 |
| 4364 | glutathione transferase activity | 3.30E-06 | 7 | 34 | 0.206 |
| 9031 | thioredoxin peroxidase activity | 3.30E-06 | 4 | 5 | 0.8 |
| 51082 | unfolded protein binding | 9.02E-05 | 9 | 115 | 0.078 |
| 5504 | fatty acid binding | 9.42E-04 | 5 | 37 | 0.135 |
| 19752 | carboxylic acid metabolic process | 1.10E-14 | 35 | 524 | 0.067 |
| 6519 | amino acid and derivative metabolic process | 1.02E-08 | 23 | 362 | 0.064 |
| 9308 | amine metabolic process | 1.53E-08 | 24 | 414 | 0.058 |
| 6950 | response to stress | 4.82E-08 | 36 | 978 | 0.0369 |
| 6805 | xenobiotic metabolic process | 2.31E-07 | 8 | 32 | 0.25 |
| 9410 | response to xenobiotic stimulus | 2.74E-07 | 8 | 33 | 0.242 |
| 6979 | response to oxidative stress | 4.38E-07 | 12 | 110 | 0.109 |
| 6457 | protein folding | 1.28E-06 | 15 | 207 | 0.072 |
| 5737 | cytoplasm | 4.59E-26 | 108 | 3936 | 0.027 |
| 44444 | cytoplasmic part | 3.71E-23 | 91 | 2969 | 0.031 |
| 5739 | mitochondrion | 2.31E-16 | 41 | 750 | 0.055 |
| 5788 | endoplasmic reticulum lumen | 4.95E-10 | 8 | 20 | 0.4 |
| 5829 | cytosol | 2.95E-09 | 25 | 494 | 0.051 |
| 5783 | endoplasmic reticulum | 9.68E-09 | 25 | 525 | 0.048 |
| 5793 | ER-Golgi intermediate compartment | 2.07E-06 | 6 | 23 | 0.261 |
| 5625 | soluble fraction | 2.56E-06 | 15 | 260 | 0.058 |
a GO analysis was performed using BinGO in Cytoscape. The rat proteome was used as the background for comparison. A complete listing of terms having a false discovery rate (FDR) < 1.0E-03 is presented in Table S3 (see Additional file 5). Also available in the Additional files are graphic representations of enriched GO categories and their hierarchical structures (Figures S1 and S2 in Additional files 1 and 2, respectively).
KEGG pathways significantly enriched in target proteins.a
| Pathway Name | ||||
| Glycolysis/Gluconeogenesis | 2.96E-08 | 12 | 45 | 0.267 |
| Glutathione metabolism | 1.25E-06 | 9 | 30 | 0.300 |
| Carbon fixation | 9.92E-06 | 7 | 19 | 0.368 |
| Pyruvate metabolism | 1.25E-05 | 8 | 29 | 0.276 |
| Urea cycle and metabolism of amino groups | 1.38E-05 | 7 | 20 | 0.350 |
| Arginine and proline metabolism | 5.63E-05 | 7 | 25 | 0.280 |
| Metabolism of xenobiotics by cytochrome P450 | 7.13E-05 | 9 | 50 | 0.180 |
| Limonene and pinene degradation | 2.70E-04 | 5 | 12 | 0.417 |
| Propanoate metabolism | 4.02E-04 | 6 | 23 | 0.261 |
| Glutamate metabolism | 4.97E-04 | 6 | 24 | 0.250 |
| Phenylalanine metabolism | 6.97E-04 | 5 | 15 | 0.333 |
| 3-Chloroacrylic acid degradation | 7.50E-04 | 4 | 7 | 0.571 |
| Fatty acid metabolism | 8.59E-04 | 7 | 40 | 0.175 |
| Nitrogen metabolism | 9.09E-04 | 5 | 16 | 0.313 |
| Cysteine metabolism | 9.09E-04 | 5 | 16 | 0.313 |
aPathway mapping was performed using DAVID, and 101 of 171 target proteins could be mapped to one or more KEGG pathways. The subset of rat proteins that are allocable to KEGG pathways includes 3396 proteins.
Summary of interacting partner analysis for human orthologs of 28 rat or mouse proteins commonly targeted by multiple different reactive metabolites a
| 213 | ALB | Albumin | 12 |
| 217 | ALDH2 | Aldehyde dehydrogenase 2 | 7 |
| 383 | ARG1 | Arginase | 2 |
| 761 | CA3 | Carbonic anhydrase III | 3 |
| 2023 | ENO1 | Enolase 1 | 6 |
| 2168 | FABP1 | Fatty acid binding protein 1 | 2 |
| 2923 | PDIA3 | Protein disulfide isomerase A3 | 11 |
| 2944 | GSTM1 | Glutathione S-transferase Mu-1 | 4 |
| 2946 | GSTM2 | Glutathione S-transferase Mu-2 | 4 |
| 3309 | HSPA5 | BIP | 31 |
| 3312 | HSPA8 | Heat shock 70 kDa protein 8 | 43 |
| 3417 | IDH1 | Isocitrate dehydrogenase 1 | 1 |
| 5034 | P4HB | Protein disulfide isomerase | 15 |
| 5037 | PEBP1 | Raf kinase inhibitor protein | 18 |
| 5230 | PGK1 | Phosphoglycerate kinase 1 | 4 |
| 7170 | TPM3 | Tropomyosin 3 | 13 |
| 7276 | TTR | Transthyretin | 14 |
| 7295 | TXN | Thioredoxin | 10 |
| 8991 | SELENBP1 | Selenium binding protein 1 | 4 |
| 10130 | PDIA6 | Protein disulfide isomerase P5 | 2 |
| 10961 | ERP29 | Endoplasmic reticulum protein 29 | 4 |
| 1109 | AKR1C4 | Aldo keto reductase family 1, member C4 | 0 |
| 1652 | DDT | D-dopachrome tautomerase | 0 |
| 2052 | EPHX1 | Epoxide hydrolase | 0 |
| 2184 | FAH | Fumarylacetoacetase | 0 |
| 2593 | GAMT | Guanidinoacetate N-methyltransferase | 0 |
| 10247 | HRSP12 | Translational inhibitor protein p14.5 | 0 |
| 51733 | UPB1 | Beta ureidopropionase | 0 |
a No interacting partners were found for 7 of the 28 proteins. See Additional file 5 for full listing of all the interacting partners found.
GO categories with an over-representation of target proteins or their interacting partners
| Category Description | |||||
| 51082 | unfolded protein binding | 3.26E-09 | 14 | 113 | 0.124 |
| 6457 | protein folding | 1.65E-17 | 28 | 256 | 0.109 |
| 6915 | apoptosis | 4.63E-09 | 30 | 676 | 0.044 |
| 6986 | response to unfolded protein | 4.63E-09 | 11 | 61 | 0.18 |
| 5783 | endoplasmic reticulum | 2.83E-06 | 25 | 669 | 0.037 |
KEGG pathways containing target proteins or their interacting partners
| MAP kinase signaling pathway | 4.26E-05 | 19 | 259 | 0.073 |
| Antigen processing and presentation | 6.66E-05 | 10 | 80 | 0.125 |
| Alzheimer's disease | 1.11E-04 | 6 | 28 | 0.214 |
| Long-term potentiation | 5.78E-04 | 8 | 65 | 0.123 |
| Neurodegenerative disorders | 0.0011 | 6 | 39 | 0.154 |
| Long-term depression | 0.0018 | 8 | 75 | 0.107 |
| Arginine and proline metabolism | 0.0018 | 7 | 34 | 0.201 |
| Adipocytokine signaling pathway | 0.0054 | 7 | 73 | 0.096 |
Figure 1ROC curve using the false-positive rate and true positive rate to evaluate the performance of the model to predict target proteins. The area under the curve is 0.857.
Figure 2Relative importance of the top 15 features as ranked by the random forest algorithm.
Figure 3Cumulative distribution of lysine (panel A) and cysteine (panel B) for 62 thiobenzamide target proteins (open circles), 45 bromobenzene target proteins (filled triangles) and 11482 non-target proteins (solid line).