| Literature DB >> 34094414 |
Jonathan Fine1, Judy Kuan-Yu Liu1, Armen Beck1, Kawthar Z Alzarieni1, Xin Ma1, Victoria M Boulos1, Hilkka I Kenttämaa1, Gaurav Chopra1,2.
Abstract
Diagnostic ion-molecule reactions employed in tandem mass spectrometry experiments can frequently be used to differentiate between isomeric compounds unlike the popular collision-activated dissociation methodology. Selected neutral reagents, such as 2-methoxypropene (MOP), are introduced into an ion trap mass spectrometer where they react with protonated analytes to yield product ions that are diagnostic for the functional groups present in the analytes. However, the understanding and interpretation of the mass spectra obtained can be challenging and time-consuming. Here, we introduce the first bootstrapped decision tree model trained on 36 known ion-molecule reactions with MOP. It uses the graph-based connectivity of analytes' functional groups as input to predict whether the protonated analyte will undergo a diagnostic reaction with MOP. A Cohen kappa statistic of 0.70 was achieved with a blind test set, suggesting substantial inter-model reliability on limited training data. Prospective diagnostic product predictions were experimentally tested for 13 previously unpublished analytes. We introduce chemical reactivity flowcharts to facilitate chemical interpretation of the decisions made by the machine learning method that will be useful to understand and interpret the mass spectra for chemical reactivity. This journal is © The Royal Society of Chemistry.Entities:
Year: 2020 PMID: 34094414 PMCID: PMC8162943 DOI: 10.1039/d0sc02530e
Source DB: PubMed Journal: Chem Sci ISSN: 2041-6520 Impact factor: 9.825
Fig. 1Schematic diagram of a linear quadrupole ion trap mass spectrometer equipped with an APCI source and an external reagent mixing manifold (bottom).[12,13] This instrument can be used to detect diagnostic ions formed between analytes protonated upon APCI and a neutral reagent (introduced using the reagent mixing manifold) in MS/MS experiments occurring in the ion trap.
Fig. 2The diagnostic utility of employing neutral reagents, such as MOP, to identify functional groups in protonated metabolites of a drug. After the metabolites were (a) protonated and isolated, (b) they were allowed to react with MOP and (c) the formation of a diagnostic addition product (DP) as opposed to proton transfer (PT) or no reaction was monitored. Only the protonated sulfoxide metabolites generated the diagnostic addition product ion (DP) with MOP.
Fig. 3(a) The distribution of diagnostic product branching ratios for the initial training set of 36 reactions. (b) Structures for representative analytes with diagnostic product branching ratios between 40 and 70%.
The probability for assignment of a correct reaction for all decision tree models
| # | Test compound | Formation of diagnostic product | 20% | 30% | 40% | 50% | 60% | 70% | Proton affinity (kcal mol−1) |
|---|---|---|---|---|---|---|---|---|---|
| 1 |
| Yes | 51% | 54% | 50% | 47% | 100% | 100% | 214.43 |
| 2 |
| No | 0% | 8% | 0% | 0% | 0% | 0% | 225.23 |
| 3 |
| No | 0% | 8% | 0% | 0% | 33% | 0% | 229.51 |
| 4 |
| No | 0% | 0% | 0% | 0% | 0% | 0% | 188.57 |
| 5 |
| No | 59% | 58% | 50% | 44% | 4% | 0% | 222.71 |
| 6 |
| No | 0% | 0% | 0% | 0% | 33% | 0% | 195.01 |
| 7 |
| Yes | 100% | 100% | 100% | 94% | 100% | 100% | 224.15 |
| 8 |
| No | 0% | 0% | 0% | 0% | 33% | 0% | 214.36 |
| 9 |
| No | 100% | 100% | 100% | 100% | 100% | 100% | 213.07 |
| 10 |
| Yes | 100% | 100% | 100% | 100% | 61% | 100% | 228.46 |
| 11 |
| No | 100% | 100% | 100% | 94% | 100% | 100% | 205.64 |
| 12 |
| Yes | 100% | 100% | 100% | 88% | 100% | 100% | 226.38 |
| 13 |
| Yes | 100% | 100% | 100% | 100% | 100% | 100% | 232.58 |
| Kappa value | — | 0.56 | 0.56 | 0.56 | 0.53 | 0.70 | 0.70 | 0.40 |
See Fig. S2–S14 for assignment of diagnostic production formation.
Value greater than the proton affinity of MOP (214.42 kcal mol−1) as calculated using density functional theory, see section Calculation of proton affinities in Methods for details.
Fig. 4Chemical reactivity flowchart. (a) Analytes that form the diagnostic product (DP) or undergo proton transfer or no reaction (PT). (b) Compounds identified as having a specific functional group feature (left), such as a sulfoxide with at least one aliphatic carbon atom bound to it (right). No structure is shown when the feature (sulfoxide) is absent in the molecule that does not form a DP. (c) Flowchart for decision making based on the presence or absence of the feature (sulfoxide). (d) The decision tree model trained on a diagnostic product branching ratio cutoff of 70%. The model classifies analytes as reactive or unreactive towards MOP based on their functional groups determined by the Morgan algorithm with a radius of 1 atom.
Fig. 5The decision tree model obtained by retraining the first model by using the 70% cutoff and all 49 reactions (original 36 and new 13 test reactions). This model is similar to the one obtained via a training set of 36 reactions but has an additional check for a nitro group which was not included in the original model. The lack of any major changes from the model shown in Fig. 4 indicates that the final model is robust and is able to incorporate new functional groups.