| Literature DB >> 34265168 |
Daniel Carby-Robinson1, Petur Weihe Dalsgaard1, Christian Brinch Mollerup1, Kristian Linnet1, Brian Schou Rasmussen1.
Abstract
Illicit drug profiling performed by forensic laboratories assists law enforcement agencies through providing information about chemical and/or physical characteristics of seized specimens. In this article, a model was developed for the comparison of seized cocaine based on retrospective analysis of data generated from ultrahigh performance liquid chromatography with time-of-flight mass spectrometry (UHPLC-TOF-MS) comprehensive drug screening. A nontargeted approach to discover target compounds was employed, which generated 53 potential markers using data from cocaine positive samples. Twelve marker compounds were selected for the development of the final profiling model. The selection included a mixture of commonly used cocaine profiling targets and other cocaine-related compounds. Combinations of pretreatments and comparison metrics were assessed using receiver operating characteristic curves to determine the combination with the best discrimination between linked and unlinked populations. Using data from 382 linked and 34,519 unlinked distances, a classification model was developed using a combination of the standardization and normalization transformations with Canberra distance, resulting in a linked cut-off with a 0.5% false positive rate. The present study demonstrates the applicability of retrospectively developing a cocaine profiling model using data generated from UHPLC-TOF-MS nontargeted drug screening without pre-existing information about cocaine impurities. The developed workflow was not specific to cocaine and thus could potentially be applied to any seized drug in which there are both sufficient data and impurities present.Entities:
Keywords: chemometrics; cocaine profiling; high-resolution mass spectrometry; retrospective analysis
Mesh:
Substances:
Year: 2021 PMID: 34265168 PMCID: PMC9291609 DOI: 10.1002/dta.3130
Source DB: PubMed Journal: Drug Test Anal ISSN: 1942-7603 Impact factor: 3.234
FIGURE 1Decision tree assessing the suitability of target compounds for inclusion during steps 3 and 4 in the development of the final profiling model
FIGURE 2Mass spectra highlighting the isotope pattern of a double‐charged truxilline (top) and a cinnmoylcocaine (bottom)
Overview of the final 12 compounds selected for development of the profiling model
| ID | Compound | RT (min) | Mean shifted | Chemical formula | Ion |
| References |
|---|---|---|---|---|---|---|---|
| M1 | Ecgonine methyl ester+
| 0.80 | 200.1280 | C10H17NO3 | [M + H]+ | 0.0001 | Previous studies |
| M2 | Benzoylecgonine+
| 2.94 | 290.1385 | C16H19NO4 | [M + H]+ | 0.0002 | Previous studies |
| M4 | Tropacocaine+
| 4.01 | 246.1487 | C15H19NO4 | [M + H]+ | 0.0002 | Previous studies |
| M6 |
| 4.71 | 394.1856 | C20H27NO7 | [M + H]+ | 0.0004 | Previous studies |
| M8 |
| 4.78 | 323.1622 | C37H44N2O8 | [M + 2H]+ | 0.0000 | |
| M21 |
| 5.31 | 645.3163 | C19H23NO4 | [M + H]+ | 0.0007 | |
| M23 | Cis‐cinnamoylcocaine+ | 5.72 | 330.1701 | C19H23NO4 | [M + H]+ | 0.0001 | Previous studies |
| M31 |
| 5.82 | 645.3163 | C37H44N2O8 | [M + H]+ | 0.0007 | |
| M36 |
| 6.18 | 323.1622 | C37H44N2O8 | [M + 2H]+ | 0.0000 | |
| M38 | Trans‐cinnamoylcocaine+
| 6.28 | 330.1700 | C19H23NO4 | [M + H]+ | 0.0000 | Previous studies |
| M46 |
| 6.47 | 323.1622 | C37H44N2O8 | [M + 2H]+ | 0.0000 | |
| M49 |
| 6.61 | 330.1701 | C38H46N2O8 | [M + 2H]+ | 0.0001 |
Identified via retention time and mass spectral comparison to in‐house libraries created from reference standards.
FIGURE 3Dendrogram from hierarchical clustering of the 53 potential markers from the nontargeted marker discovery. Markers are labelled with identification (if known), followed by retention time, mass, the mean peak area across all cocaine positive samples and count in cocaine positive samples (out of 483)
AUC values for every comparison metric and pretreatment combination
| Pretreatment | Pearson (PCC) | Cosine (SCF) | Euclidean (EUC) | Canberra (CAN) | Manhattan (MAN) |
|---|---|---|---|---|---|
|
| 0.930 | 0.932 | 0.841 | 0.953 | 0.856 |
|
| 0.930 | 0.932 | 0.928 |
| 0.949 |
|
| 0.973 | 0.970 | 0.910 | 0.953 | 0.901 |
|
| 0.967 | 0.970 | 0.970 | 0.959 | 0.960 |
|
| 0.944 | 0.954 | 0.957 | 0.959 | 0.955 |
|
| 0.977 | 0.977 | 0.969 |
| 0.976 |
|
| 0.956 | 0.961 | 0.969 |
| 0.958 |
|
| 0.944 | 0.954 | 0.957 |
|
|
|
| 0.966 | 0.972 | 0.976 |
|
|
|
| 0.951 | 0.962 | 0.976 |
|
|
|
| 0.973 | 0.970 | 0.970 |
|
|
|
| 0.965 | 0.899 | 0.970 | 0.835 | 0.960 |
|
| 0.971 | 0.978 | 0.978 |
|
|
|
| 0.973 | 0.974 | 0.970 |
|
|
Note: Combinations presenting an AUC exceeded 0.98 are highlighted in bold.
Abbreviations: AUC, area under the curve; EUC, Euclidean distance; PCC, Pearson's correlation coefficient; SCF, square cosine function.
FIGURE 4Receiver operating curve (ROC) plot with associated area under the curve (AUC) values, highlighting the difference in performance of two pretreatment and comparison metric combinations. The standardization and normalization treatment with Canberra distance (S + N and CAN) is an example of a better performing comparison metric, whereas the untreated Euclidean distance (EUC) is an example of a worse performing combination
FIGURE 5Boxplot (top) and histogram (bottom) of the standardized and normalized pretreated Canberra distances for the linked and unlinked pairwise groups
FIGURE 6Hierarchical clustered heatmap of 50 random samples from the unlinked population presenting a Canberra distance under the linked cut‐off. Rows are the 12 target compounds from the final profiling model grouped by cluster analysis using complete linkage clustering and Canberra distance. Columns are the individual injections clustered using complete linkage clustering and cosine similarity. The colour scale of the heat map is standardized, and the normalized peak area is scaled from 0 to 1, with a dark colour representing a higher peak area and a light colour representing a lower peak area. A full heatmap with all 483 cocaine‐positive samples can be found in Figure S1