| Literature DB >> 28286574 |
Martin Loos1,2, Heinz Singer1.
Abstract
BACKGROUND: A large proportion of polar anthropogenic compounds routinely released into the environment comprises homologue series, i.e., sets of chemicals differing in a repeating chemical unit. Using analytical techniques such as liquid chromatography coupled to high-resolution mass spectrometry (LC-HRMS), these compounds are readily measurable as signal sets with characteristic differences in mass and typically retention time. However, and despite such distinct characteristics, no computational approach for the direct, simultaneous and untargeted detection of all such signal sets comprising both LC and HRMS information has to date been presented. <Entities:
Keywords: Homologue series; Liquid chromatography; Mass spectrometry; Nontarget screening; Sewage effluent; Surfactants
Year: 2017 PMID: 28286574 PMCID: PMC5323340 DOI: 10.1186/s13321-017-0197-z
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Fig. 1Exemplary subspace query (black polygons) for the detection of 3-tuples, centered at one peak (blue point) with one detected 3-tuple exemplified in green. Intersections of blue lines with axes indicate the values of the four elements of a for this peak. Moreover, dashed lines mark the m/z interval defined in Eq. (6). In turn, dotted lines in the top panel mark the RT interval of Eq. (9), whereas dotted lines in the bottom panel indicate the upper and lower bound of intervals related to mass defect differences from Eqs. (7) and (8), respectively
Fig. 2SOM for series pairs from STP sample with ID = 1, positive ionization mode. Coloring of the top and bottom panels show and values at the SOM nodes, respectively. Sizes of the black dots in the top panel indicate frequencies of monoisotopic series pairs mapped onto the nodes. In turn, black squares in the bottom panel either indicate the three nodes with highest frequencies at low intersection angles (θ < 0.08π, nodes 1–3) or nodes with highest mapping frequencies containing 50% of all monoisotopic series pairs at larger intersection angles (θ ≥ 0.08π, nodes 4–11). The series mapped onto the latter nodes 4–11 are shown in Fig. 3. Moreover, crosses highlight the mapping nodes of the unknown superjacent series shown in Figure S6 (Additional file 15)
Fig. 3Meshed series pairs from nodes 4–11 of Fig. 2, colored by their series mass differences . Light gray points show all picked peaks whereas only those depicted in dark gray are part of any series. Bold rectangles indicate the zoom area for Figure S3 (Additional file 12) to depict the PEG-related series
Fig. 4Relative cumulative frequency of values for all blank-subtracted series detected in positive (a) and negative (c) ionization modes, stacked top-down for individual STP samples in order of their IDs 1–10 (cp. Table S6 in Additional file 9). Solid red lines indicate masses of three common homologue units at z = 1, red dashed ones at z = 2. Solid gray lines denote values of possible multiples, additions or subtractions thereof. Gray dashed lines indicate isotopologue shifts of some of these masses, equal to 12C vs. 13C mass transitions. Moreover, gray bars in b and d show STP counts from a moving window (±5 μ) over all stacked distributions for the positive and negative mode, respectively. In contrast, blue bars were derived after omission of non-monoisotopic series