Literature DB >> 35184559

Discovery and Visualization of Uncharacterized Drug-Protein Adducts Using Mass Spectrometry.

Michael Riffle¹, Michael R Hoopmann², Daniel Jaschob¹, Guo Zhong³, Robert L Moritz², Michael J MacCoss⁴, Trisha N Davis¹, Nina Isoherranen³, Alex Zelter¹.

Abstract

Drugs are often metabolized to reactive intermediates that form protein adducts. Adducts can inhibit protein activity, elicit immune responses, and cause life-threatening adverse drug reactions. The masses of reactive metabolites are frequently unknown, rendering traditional mass spectrometry-based proteomics approaches incapable of adduct identification. Here, we present Magnum, an open-mass search algorithm optimized for adduct identification, and Limelight, a web-based data processing package for analysis and visualization of data from all existing algorithms. Limelight incorporates tools for sample comparisons and xenobiotic-adduct discovery. We validate our tools with three drug/protein combinations and apply our label-free workflow to identify novel xenobiotic-protein adducts in CYP3A4. Our new methods and software enable accurate identification of xenobiotic-protein adducts with no prior knowledge of adduct masses or protein targets. Magnum outperforms existing label-free tools in xenobiotic-protein adduct discovery, while Limelight fulfills a major need in the rapidly developing field of open-mass searching, which until now lacked comprehensive data visualization tools.

Entities: Chemical

Mesh：

Substances：
DNA Adducts
Proteins

Year: 2022 PMID： 35184559 PMCID： PMC8892443 DOI： 10.1021/acs.analchem.1c04101

Source DB: PubMed Journal: Anal Chem ISSN： 0003-2700 Impact factor: 6.986

Humans are constantly exposed to chemicals from their environment. Protein adducts result from covalent modification by xenobiotics, or their metabolites, and can also cause unintended toxicities and adverse drug reactions (ADRs).[1−3] Adduct identification improves understanding of the mechanisms of ADRs and enables design of structural modifications to prevent them. Current identification methods, using radiolabeled compounds, trapping agents, immunological detection and mass spectrometry-based methods for detecting xenobiotic-protein adducts are labor-intensive and cannot sensitively measure the presence, abundance, and localization of protein adducts across a range of proteins without prior knowledge.[1,4,5] Recent proteomics methods allow identification of unknown protein adducts but require isotopic labeling of test compounds[6,7] or a complex combination of data-processing methods to reduce false discoveries and identify adducted peptides.[8] Traditional database search algorithms[9] search MS/MS spectra against protein sequences to identify the peptides and proteins that produced them. Known modifications can be identified if their masses are predefined. This is not possible for xenobiotic-protein adducts if their chemical composition is unknown. “Open-mass” search strategies[10−18] solve this issue by allowing observed peptide masses to differ from identified peptide masses, returning mass differences as modifications of the identified peptides. This has allowed peptide spectrum matches (PSMs) to be made from a large proportion of previously unassigned spectra in shotgun proteomics data.[10−18] However, few algorithms have been designed to identify xenobiotic-protein adducts that are typically low abundance and specific to the drug and treatment, and there are no graphical software tools to analyze and visualize data from open-mass searches. Here we present Magnum, a purpose-built xenobiotic-protein adduct discovery algorithm, and Limelight, a web-based open modification analysis platform that rapidly highlights protein adducts that result from a specific treatment in a background of unrelated modifications. Limelight can combine and compare data from different pipelines empowering users to find the best tool or combination of tools for their specific application. We validate our new tools using three drug/protein combinations and compare results from Magnum to other open-mass search tools. We apply our workflow to identify novel xenobiotic-protein adducts in the P450 enzyme CYP3A4 resulting from exposure to raloxifene. Our software and workflow enable rapid and accurate identification of novel xenobiotic–protein adducts with no prior knowledge of adduct masses or protein targets. These tools provide a highly accelerated and statistically rigorous label-free workflow for the discovery and characterization of xenobiotic-protein adducts and can be incorporated into drug discovery pipelines and environmental toxicology screening.

Materials and Methods

Reagents and Drug Incubations

The chemicals and reagents used in this study plus details of drug incubations are described in the Supplementary Methods section of the Supporting Information (SI).

Sample Preparation

β-lactam antibiotics and HSA: Aliquots (30 μL) of control or drug treated HSA (0.5 mg/mL) were reduced by adding 10 mM DTT, final concentration, and incubating at 37 °C for 30 min. Samples were alkylated with 16 mM iodoacetamide at room temp for 20 min in the dark. Tryptic digestion was done at 1:15 (enzyme:substrate) for 6 h at 37 °C in an Eppendorf Thermomixer with shaking (1000 rpm) prior to acidification with 250 mM HCl (final concentration). Samples were centrifuged at max speed in a benchtop microfuge for 10 min and supernatant transferred to autosampler vials and stored at −80 °C. Human plasma was prepared, treated, and digested as described in Supplementary Methods. Raloxifene and CYP3A4: Aliquots (100 μL) of control or drug treated CYP3A4 incubation reaction mixture (63 μg total protein) were prepared as described but digested for 4 h. A second set of digests were performed as described in Supplementary Methods and labeled “extra digest” in results. After digestion, solid phase extraction was done on all CYP3A4 samples using Oasis MCX cartridges (see Supplementary Methods for details).

Mass Spectrometry

Sample digests (2 μL ∼ 1 μg) were loaded onto a 150 μm Kasil fritted trap packed with 2 cm of ReprosilPur C18AQ (3 μm bead diameter, Dr. Maisch) at a flow rate of 2 μL per min. Separation used a self-packed 75 μm i.d. 30 cm column. Peptides were eluted at 0.25 μL/min using a standard or higher concentration (labeled “highB” in results) acetonitrile gradient. A QExactive HF or Exploris 480 (Thermo Fisher Scientific) was used to perform MS in data dependent mode.

Data Processing

Acquired spectra were converted into mzML (for input to all algorithms except MODa) or mzXML (for input to MODa) using ProteoWizard’s msConvert.[19] Proteins present in the samples were identified using Comet[20] by standard closed searching against the entire human or E. coli proteomes, and smaller databases were made for subsequent open searching consisting only of proteins identified in initial comet searches by at least three peptides with a Percolator[21] assigned q-value of ≤0.01. Decoy databases consisted of the corresponding set of reversed protein sequences and were provided to algorithms requiring pregenerated decoy sequences. All data are filtered at a false discovery rate (FDR) of 1% unless otherwise stated. Detailed procedures and parameters for each algorithm can be found in Supplementary Methods. Native search results from all pipelines were converted to Limelight XML prior to uploading to the Limelight web application. We have written converters for many pipelines including Comet,[20] Percolator,[21] the Trans-Proteomic Pipeline (TPP),[22] Crux,[23] MSFragger,[11] open-pFind,[12] Comet-PTM,[14] MetaMorpheus,[17] MODa,[13] TagGraph,[18] and Magnum (this paper). A current list of converters is available on our documentation Web site: https://limelight-ms.readthedocs.io/. Further details can be found in Supplementary Note 3.

Quantification of CYP3A4 and P450-Reductase Adducts

Peptides were quantified using Skyline as previously described.[24,25] Full details can be found in Supplementary Methods, processed data plus Limelight links can be found in Supplementary File 2, Sheet 3, and a complete interactive Skyline session is available on Panorama (https://panoramaweb.org/CYP3A4-raloxifene.url).

Software Availability

Magnum is written in C++ and is open source and freely available at http://magnum-ms.org. Full source code plus precompiled Magnum binaries are available for Windows and Linux. Docker images for the Magnum/Percolator/Limelight pipeline and associated documentation can be found at https://limelight-ms.readthedocs.io/en/latest/tutorials/magnum-pipeline.html. Magnum outputs results either as simple tab-delimited text or PepXML[26] format for potential use with existing software supporting this format. For further details of Magnum see Supplementary Note 1. Limelight is written in Java and TypeScript and is open source and freely available at https://limelight-ms.org/. Limelight source code plus preconfigured Docker containers are provided for running Limelight and Limelight XML converters. Extensive documentation is at https://limelight-ms.readthedocs.io/. Users not able to run their own Limelight installation may use a generally available installation at https://use.limelight-ms.org/. For full details of Limelight see Supplementary Note 3.

Data Availability

All raw and processed data discussed in this paper are available via Limelight at https://limelight.yeastrc.org/limelight/p/adduct-discovery. In addition, complete search algorithm configuration files, fasta search databases, raw search output, and raw MS data files were deposited to the ProteomeXchange Consortium via the PRIDE[27] partner repository with the data set identifier PXD025019. Full Skyline quantification of CYP3A4/raloxifene was deposited to the ProteomeXchange Consortium via Panorama Public[28] and is available at https://panoramaweb.org/CYP3A4-raloxifene.url with the data set identifier PXD024932.

Results

Development of Magnum

Magnum was developed to analyze MS/MS spectra to identify peptide sequences modified by adducts of unknown mass. The approach taken in Magnum is similar to cross-linked peptide identification in Kojak and makes use of the same spectral processing, analytical work-flow, and Xcorr scoring function.[29] However, the algorithms are mutually exclusive. In Kojak search results contain two peptide sequences that sum together to produce the observed mass, whereas Magnum returns only one peptide sequence plus a modification mass equal to the mass unexplained by the predicted peptide. A detailed description the Magnum algorithm is presented in Supplementary Note 1. Magnum incorporates several features that improve its ability to identify protein adducts. First, Magnum calculates and scores an adduct-modified MS/MS ion series. Some open search algorithms disregard adduct-modified fragment ions leading to a loss of sensitivity and preventing adduct localization.[10,15−17] Magnum handles both MS-labile and stable modifications by including all possible scenarios in its search space for every spectrum. Adduct identification with or without adduct localization is therefore possible (see Supplemental Note 1 for details). Second, Magnum allows the restriction of adduct localization to specific amino acids during searching. This is useful if the reactivity of a xenobiotic is known or hypothesized based on its chemistry or through detection of thiol conjugates with reactive intermediates.[1] Increased search sensitivity and statistical power can be gained by restricting the open modification search space to specific amino acids.[30] Third, Magnum can flag MS/MS spectra containing diagnostic reporter ions. Some xenobiotics and endogenous post-translational modifications (PTMs) fragment predictably in the mass spectrometer producing reporter ions,[31−33] the masses of which are constant regardless of the peptide to which the adduct was attached. If present, Magnum records this information in the PSM. Fourth, Magnum considers user defined peptide modifications and open modifications. This allows separation of masses due to defined versus open modifications and is important if both types of modification exist on a single peptide. The product of a Magnum analysis is a set of PSMs for as many input spectra as possible, plus a set of metrics that can be used as input in a variety of PSM-validation[21,34] algorithms.

Evaluation of Open Modification Search Tools for Xenobiotic-Protein Adduct Discovery

Open-mass searches have greater potential for false positive identifications than closed searches because discrepancies between theoretical and observed peptide masses are interpreted as open modifications. While enabling identification of undefined modifications, this can lead to incorrect identifications not otherwise possible. Additionally, large differences in unmodified and modified peptide search spaces and the small proportion of PSMs representing specific xenobiotic adducts of interest make accurate error estimation difficult[35] (Supplemental Note 1). We therefore created a gold standard (true positive) data set to evaluate the accuracy and sensitivity of Magnum in identifying xenobiotic-protein adducts. We compared Magnum, which was designed specifically for xenobiotic-protein adduct detection, to several previously published open search algorithms. For our gold standard data set, we acquired high resolution LC-MS/MS data of human serum albumin (HSA) exposed to the β-lactam antibiotics dicloxacillin and flucloxacillin. A significant portion of previous research identifying protein adducts from environmental exposures has focused on HSA[36] as it is the most abundant protein in plasma and forms adducts with numerous xenobiotics.[4] The β-lactam antibiotics dicloxacillin and flucloxacillin were chosen as there are published MS characterizations of their clinically significant adducts.[31,37] Our gold standard data set consisted of 2979 MS/MS spectra extracted from 307 652 scans acquired from two dicloxacillin- and two flucloxacillin-treated HSA samples. Each of the 2979 spectra was definitively determined to result from a peptide containing one 469 Da (dicloxacillin) or 453 Da (flucloxacillin) adduct using the methods described in Supplementary Note 2. The four full data sets (307 652 MS/MS scans) were searched with 7 algorithms. Open modification masses associated with each PSM were extracted for the 2979 gold standard spectra. A precision/recall analysis was performed where precision was defined as the fraction of all answers that are correct (i.e., number of correct answers/total number of answers) and recall as the fraction of total possible correct answers identified (i.e., number of correct answers/number scans tested [2979]). An answer within ±3 Da of the known modification mass was considered correct, allowing for incorrect monoisotopic mass assignments,[38] which in open searching are compensated for by changes in open modification masses. Both Magnum and MSFragger[11] had excellent precision: >0.9 at 1% FDR (false discovery rate) (Figure ). Recall of Magnum was better than all other algorithms. MSFragger, comet with a 500 Da wide precursor window, and MetaMorpheus[17] performed next best. Recall of open-pFind[12] was close to zero as it does not perform a truly unrestricted search, but rather a multinotch search,[17] allowing only delta masses present in UniMod[39] to be used as modification masses. As dicloxacillin and flucloxacillin adducts are not present in Unimod, open-pFind cannot identify them and returned results for only 9 of the 2979 scans (SI Table S6).

Figure 1

Evaluation of open modification search tools for xenobiotic-protein adduct discovery. Precision recall plot of adduct masses at 1% FDR by 7 open search algorithms. Closed comet searches using defined modifications were included as a positive control. Magnum was run allowing for open-masses on any amino acid (Magnum) or restricted to lysine only (Magnum K), the previously published residue modified by dicloxacillin and flucloxacillin adducts.[31,37] Except for open-pFind, algorithms having a low recall did so mainly due to returning incorrect modification masses rather than by not returning a result for that scan. All algorithms except open-pFind returned over 2200 total answers at 1% FDR. These were distributed between correct and incorrect masses (SI Figure S6). An analysis of all open-mass modifications returned by each algorithm within ±3 Da of the known correct modification masses shows Magnum had the most accurate (62–63%) monoisotopic mass assignment of the algorithms tested (SI Figure S7). MetaMorpheus and MSFragger were next best, and respectively assigned 53% and 49% of PSMs the correct monoisotopic mass. Like many pharmaceutical drugs, dicloxacillin and flucloxacillin contain chlorine, confounding monoisotopic mass assignment due to its unusual isotopic composition. To overcome these complexities, Magnum incorporates code that reduces peptide isotope distributions to a single monoisotopic mass (Supplementary Note 1). Further validation of Magnum was done by comparing open modification masses returned by Magnum and MSFragger with those returned by closed comet searching of previously published phosphopeptide enriched data.[40] Both Magnum and MSFragger results were almost exclusively (∼98%) a subset of closed search results indicating excellent accuracy by both algorithms (SI Figure S12). Magnum returned more PSMs (3101) than MSFragger (1760) at the same calculated FDR. Overall, our analyses showed Magnum accurately and sensitively identifies xenobiotic-protein adducts.

Development of Limelight

Mass spectrometry-based proteomics data is complex and open modification data even more so. Each search algorithm uses different scoring metrics and outputs results in different formats. PSM-validation tools[21,34] are often used to assign statistical confidence to results, and add another layer of metrics to the data. If adduct data are to be assessable for answering specific questions, a simple graphical interface is needed. Limelight is a web application built to analyze, visualize, and share bottom-up MS proteomics data (Figure ). It provides a generalized platform designed to support any MS database search pipeline. Data are displayed on a global level in protein, peptide, or modification centric views, which can be filtered on multiple criteria including any or all scores from the search algorithm and PSM-validation tools, and that provide access to all underlying raw data and associated metrics from every part of the analysis. Annotated spectra, including ions modified by open-masses where applicable, are viewable using a built in spectrum viewer.[41] Data are unified and can be queried, viewed, and compared across multiple experiments and disparate software pipelines allowing the strengths of different algorithms to be leveraged.

Figure 2

Limelight. A web-based application to interrogate, analyze, and visualize mass spectrometry proteomics results.

Limelight. A web-based application to interrogate, analyze, and visualize mass spectrometry proteomics results. Limelight incorporates novel features designed for adduct and PTM analysis. Mass modifications identified in peptides may be viewed, analyzed, and visualized independently from the peptides or proteins in which they were identified and regardless of whether those modifications were localized or not. If desired, this allows interesting adduct masses to be identified and examined before segregation into protein, peptide, or residue level groups. To highlight exposure related adducts, we developed a suite of tools within Limelight for visualizing and statistically comparing modification masses between different experiments. These functions are used below and described in depth in Supplementary Note 3.

Development and Validation of Adduct Discovery Pipeline

Open search results contain many modification masses not related to exposures or treatment conditions. Numerous open-mass modifications are identified even in untreated purified HSA (SI Figure S14, Supplementary Note 4). We therefore designed an experimental workflow to allow discovery of protein adducts resulting specifically from exposure to xenobiotics (Figure ). Our strategy was to produce, analyze, and compare both untreated and xenobiotic treated samples. This strategy is similar to that behind heavy/light isotope labeling: namely, that the difference between the two sets of samples allows adduct masses specific to the treatment of interest to be distinguished within a background of unrelated modification masses. We tested our label-free workflow using unexposed, dicloxacillin, and flucloxacillin exposed HSA.

Figure 3

Adduct identification and visualization workflow using dicloxacillin or flucloxacillin exposed versus untreated HSA as an example. (a) Raw MS data was searched using Magnum and (b) resulting PSMs were analyzed by Limelight. Data visualization includes: (c) a heatmap of Global z-score for all modification masses across all samples highlighting modification masses enriched in the treated samples (red arrows); (d) results of statistical test of proportions on all modification masses automatically pinpoints dicloxacillin (469 Da) and (e) flucloxacillin (453 Da) adducts (red arrows). See Supplementary Note 3 for details. All results shown have a Percolator calculated PSM level q ≤ 0.01. Exact modification masses were binned into 1 Da bins by Limelight prior to analysis. Live view for (c) can be found on Limelight here: https://limelight.yeastrc.org/limelight/go/x1h2QRXqrE Open-mass searching was performed using Magnum and PSMs were imported into Limelight for downstream analysis. Using Limelight, a two-tailed test of proportions comparing untreated with dicloxacillin or flucloxacillin treated HSA, easily picked out exposure specific modifications enriched in the treated samples (Figure d,e). Previously published studies determined that dicloxacillin and flucloxacillin produce 469 and 453 Da adduct modifications, respectively, on HSA lysine residues.[31,37] These studies relied on MS analysis of unadducted antibiotics, combined with trapping experiments using N-acetyl-cysteine and N-acetyl-lysine plus extensive targeted MS experiments. Using our untargeted method, comparisons of dicloxacillin or flucloxacillin treated versus untreated samples resulted in 469 or 453 Da, being the most significantly enriched mass for their respective treatment with no prior knowledge or experimentation. These data show a two-tailed test of proportions comparing treated versus untreated samples, is effective in highlighting exposure specific adducts from PSMs generated by Magnum. Limelight was designed to support output from any proteomics pipeline, and we performed the same analysis using PSMs from six additional open-mass search algorithms. In all cases Magnum identified the most treatment related PSMs as well as resulting in the largest z-score for the correct mass compared to other algorithms (SI Figure S17, Supplementary Note 4). Overall, Magnum identified 41 unique dicloxacillin- and 55 unique flucloxacillin-adducted HSA peptides of which 90% contained 1 or more reporter ions resulting from adduct fragmentation (SI Table S7). Manual validation using Limelight’s built-in spectrum viewer further confirmed Magnum’s automated results and example annotated spectra are shown in SI Figure S18 (dicloxacillin) and SI Figure S19 (flucloxacillin). These data demonstrate our software and workflow can correctly and automatically identify and highlight drug protein adducts and that Magnum is the most sensitive algorithm of those tested.

Identification of Penicillin Adducts in Human Plasma

To challenge our pipeline with a more complex sample we prepared unexposed and dicloxacillin exposed human plasma. Closed-mass comet searches identified 334 unique proteins in these samples. We performed open-mass searching using Magnum, and PSMs were imported into Limelight and analyzed as described above. As with purified HSA, Limelight’s two-tailed test of proportions, found 469 Da as the top scoring mass enriched in the treated sample (SI Figure S20a,b). For comparison of modification masses observed across samples Limelight rounds all modifications to the nearest Da. We analyzed the distribution of observed open modification masses in PSMs within Limelight’s 469 Da bin. Observed masses in the treated sample peak at 469.01 Da, the exact mass of previously characterized dicloxacillin adducts. Observed masses in the unexposed sample do not result in any open-mass identifications in this region (SI Figure S20c). Overall, Magnum identified 15 unique peptides modified by 469 Da of which 80% were in HSA. 91% of identified PSMs contained all three reporter ions resulting from dicloxacillin fragmentation (SI Table S8). In addition to identifying dicloxacillin adducts in HSA our analysis found exposure specific adducts in transthyretin (TTHY), and Apolipoprotein A-II (APOA2). These data show our method can be successfully applied to complex protein samples. It should be noted that reduced depth of coverage is expected with increasing sample complexity. This is a function of individual peptides being a smaller fraction of total peptides as the number of different peptides in the sample increases. For highly complex samples, fractionation and enrichment techniques will improve sensitivity and depth of coverage.[42]

Magnum and Limelight Identify Novel Raloxifene Adducts in CYP3A4 and P450-Reductase

Most xenobiotics that ultimately form adducts undergo metabolic activation by cytochrome P450 (CYP) enzymes to form short-lived reactive intermediates that are difficult to predict and identify. Raloxifene, a drug commonly used to treat osteoporosis in postmenopausal women, is a mechanism-based inhibitor of CYP3A4.[43] Raloxifene metabolism by CYP3A4 produces several electrophilic species (SI Figure S21), and 471.15 Da adducts on cysteine 239 and tyrosine 79 were previously identified.[44,45] Past studies identifying adducts in CYP3A4[44−47] required extensive experimentation involving multiple techniques including (1) adduct trapping with GSH, N-acetyl cysteine, N-acetyl lysine, and potassium cyanide to determine the masses of likely adducts; (2) radiolabeled drugs combined with LC-radiochromatography in-line with MS; (3) whole-protein MS combined with heavy and light labeled drugs; (4) targeted MS experiments using predicted masses of adducts and/or (5) closed MS/MS database searching using predefined adduct masses previously determined by methods 1–4. We incubated raloxifene in vitro with CYP3A4 plus P450-reductase and NADPH. LC-MS/MS data of exposed and untreated samples were acquired, searched using Magnum and analyzed using Limelight (Supplementary Note 6). A two-tailed test of proportions identified 471 Da as the most significantly (p = 0) enriched modification mass in raloxifene treated samples (SI Table S10). We searched these data using 6 additional open-mass search algorithms, however none proved as sensitive as Magnum (SI Figure S22). A fully unrestricted Magnum search (modifications allowed on any amino acid) found 146 PSMs containing a 471 Da modification mass in treated samples versus 14 in untreated samples (10:1 ratio) indicating that >90% of 471 Da identifications made by Magnum in raloxifene treated samples are exposure-specific adducts. Manual evaluation, using Limelight’s built-in spectrum viewer, of all 146 PSMs revealed that close to 90% of 471 Da adducts were on cysteine, tryptophan or tyrosine (Supplementary File 2, Sheet 2). The only two previously identified adducts were on a cysteine and a tyrosine, and raloxifene adducts have previously been thought to occur primarily on cysteine. Cysteine, tryptophan and tyrosine contain nucleophilic moieties (the thiol group of cysteine, the aromatic nitrogen of tryptophan and the phenolic group of tyrosine) which can react with electrophilic sites of the reactive raloxifene metabolite to form adducts. We further improved search sensitivity by using Magnum’s ability to restrict open-mass modifications to specific amino acids: searches were performed restricting open-mass modifications to cysteine only, or, to cysteine (C), tryptophan (W), and tyrosine (Y) collectively. Cysteine-restricted searches yielded fewer (67) PSMs containing 471 Da adduct masses confirming the presence of adducts on residues other than cysteine. CWY-restricted searches resulted in more 471 Da PSMs in treated samples (157), and less in untreated samples (3), than any other search (a 52:1 ratio, SI Figure S22). Extracted ion chromatograms (XICs) were produced and quantified in Skyline[24,25] and showed treatment specific signal for all 471 Da modified peptides identified by Magnum as exclusively in treated samples (SI Figure S23). We searched an additional 6 untreated and 6 raloxifene treated CYP3A4 samples using CWY-restricted Magnum resulting in a total of 443 PSMs containing a 471 Da modification in CYP3A4 and 91 PSMs in P450-reductase, with just 7 total PSMs showing this mass in the untreated samples (a 76:1 ratio, Supplementary File 2, Sheet 5). These PSMs mapped to 12 distinct amino acids in CYP3A4 and 11 in P450-reductase (Figure and SI Figure S30) and 22/23 represent novel raloxifene-protein adducts.

Figure 4

Identification of novel raloxifene adducts in CYP3A4. (a) Magnum identifies multiple 471 Da protein adducts in CYP3A4 after exposure to raloxifene. Adducted residues are mapped to defined regions of CYP3A4.[50] (b) Observed 471 Da modifications are shown on the structure of CYP3A4 as magenta spheres. Results were identified by ≥3 PSMs, 1% FDR. Limelight view: https://limelight.yeastrc.org/limelight/go/0UjwIJNz45 (c) Extracted ion chromatograms (XICs) of 2+, 3+, and 4+ precursor ions corresponding to CYP3A4 residue W126 elute as four distinct chromatographic peaks likely representing regioisomers resulting from the different positions[43,48] in the raloxifene metabolite, diquinone methide, that are subject to nucleophilic attack (inset structure, red arrows). Unexposed control XICs show no signal (inset box, left). We analyzed the distribution of observed open modification masses in PSMs within Limelight’s 471 Da bin. Observed masses (536 PSMs) in the treated sample peak at 471.15 Da, the exact mass of previously characterized raloxifene diquinone methide adduct. Only seven PSMs were observed in the 471 Da bin in unexposed samples (SI Figure S31). Interestingly, W126 was associated with four distinct chromatographically separated peaks (Figure c) that might represent regioisomers resulting from the different positions in the raloxifene metabolite subject to nucleophilic attack (Figure c, red arrows).[43,48] Several other 471 Da modified peptides showed multiple distinguishable chromatographic peaks (e.g., SI Figures S27 and S28) and represent the first evidence for regioisomers in native P450 enzymes. The identification of multiple novel raloxifene-protein adducts within both CYP3A4 and P450-reductase demonstrates our methodology and software provide greatly expanded power to better understand the structure/function and ligand interactions of catalytic membrane proteins such as cytochrome P450s. Past biochemical studies have typically identified single residues in CYP3A4 that can be modified by xenobiotics despite the expected exposure of multiple nucleophilic residues to reactive metabolites formed by CYP3A4.[43,46,47,49] Additionally, previous studies required extensive experimentation to define adducts while the methods and tools presented here required no prior knowledge or experimentation.

Conclusions

Open-mass search algorithms have become increasingly important in the past decade and can now identify over 50% of spectra invisible to traditional search methods.[10] Previous open-mass search analyses of proteomics data sets have found hundreds of yet-to-be-identified modifications in untreated protein samples,[10−13,17,18] but until now there were few pipelines dedicated to discovering exposure related xenobiotic-protein adducts in the background of modifications identified by open searching. The pipeline presented here allows the use of standard shotgun proteomics to identify xenobiotic-protein adducts in an automated manner with no prior knowledge of the adduct mass or the adducted peptides. Magnum is optimized to detect adduct modified peptides and outperforms existing label-free open-search based methods in xenobiotic-protein adduct discovery. Limelight provides specific tools to automatically highlight exposure specific modifications while also fulfilling a major need in the rapidly developing field of open-mass searching, which until now has had no comprehensive data visualization tools. Users are thus empowered to utilize the best algorithm, or combination of algorithms to fulfill their needs.

50 in total

1. SpecOMS: A Full Open Modification Search Method Performing All-to-All Spectra Comparisons within Minutes.

Authors: Matthieu David; Guillaume Fertin; Hélène Rogniaux; Dominique Tessier
Journal: J Proteome Res Date: 2017-07-11 Impact factor: 4.466

2. Comprehensive identification of peptides in tandem mass spectra using an efficient open search engine.

Authors: Hao Chi; Chao Liu; Hao Yang; Wen-Feng Zeng; Long Wu; Wen-Jing Zhou; Rui-Min Wang; Xiu-Nan Niu; Yue-He Ding; Yao Zhang; Zhao-Wei Wang; Zhen-Lin Chen; Rui-Xiang Sun; Tao Liu; Guang-Ming Tan; Meng-Qiu Dong; Ping Xu; Pei-Heng Zhang; Si-Min He
Journal: Nat Biotechnol Date: 2018-10-08 Impact factor: 54.908

3. Comprehensive Quantification of the Modified Proteome Reveals Oxidative Heart Damage in Mitochondrial Heteroplasmy.

Authors: Navratan Bagwan; Elena Bonzon-Kulichenko; Enrique Calvo; Ana Victoria Lechuga-Vieco; Spiros Michalakopoulos; Marco Trevisan-Herraz; Iakes Ezkurdia; José Manuel Rodríguez; Ricardo Magni; Ana Latorre-Pellicer; José Antonio Enríquez; Jesús Vázquez
Journal: Cell Rep Date: 2018-06-19 Impact factor: 9.423

4. ProteomeTools: Systematic Characterization of 21 Post-translational Protein Modifications by Liquid Chromatography Tandem Mass Spectrometry (LC-MS/MS) Using Synthetic Peptides.

Authors: Daniel Paul Zolg; Mathias Wilhelm; Tobias Schmidt; Guillaume Médard; Johannes Zerweck; Tobias Knaute; Holger Wenschuh; Ulf Reimer; Karsten Schnatbaum; Bernhard Kuster
Journal: Mol Cell Proteomics Date: 2018-05-29 Impact factor: 5.911

5. Cysteine 98 in CYP3A4 contributes to conformational integrity required for P450 interaction with CYP reductase.

Authors: Bo Wen; Jed N Lampe; Arthur G Roberts; William M Atkins; A David Rodrigues; Sidney D Nelson
Journal: Arch Biochem Biophys Date: 2006-08-22 Impact factor: 4.013

Review 6. Drug-protein adducts: an industry perspective on minimizing the potential for drug bioactivation in drug discovery and development.

Authors: David C Evans; Alan P Watt; Deborah A Nicoll-Griffith; Thomas A Baillie
Journal: Chem Res Toxicol Date: 2004-01 Impact factor: 3.739