| Literature DB >> 35800176 |
Anne Marie E Smith1, Kiril Lanevskij1, Andrius Sazonovas1, Jesse Harris1.
Abstract
Scientists' ability to detect drug-related metabolites at trace concentrations has improved over recent decades. High-resolution instruments enable collection of large amounts of raw experimental data. In fact, the quantity of data produced has become a challenge due to effort required to convert raw data into useful insights. Various cheminformatics tools have been developed to address these metabolite identification challenges. This article describes the current state of these tools. They can be split into two categories: Pre-experimental metabolite generation and post-experimental data analysis. The former can be subdivided into rule-based, machine learning-based, and docking-based approaches. Post-experimental tools help scientists automatically perform chromatographic deconvolution of LC/MS data and identify metabolites. They can use pre-experimental predictions to improve metabolite identification, but they are not limited to these predictions: unexpected metabolites can also be discovered through fractional mass filtering. In addition to a review of available software tools, we present a description of pre-experimental and post-experimental metabolite structure generation using MetaSense. These software tools improve upon manual techniques, increasing scientist productivity and enabling efficient handling of large datasets. However, the trend of increasingly large datasets and highly data-driven workflows requires a more sophisticated informatics transition in metabolite identification labs. Experimental work has traditionally been separated from the information technology tools that handle our data. We argue that these IT tools can help scientists draw connections via data visualizations and preserve and share results via searchable centralized databases. In addition, data marshalling and homogenization techniques enable future data mining and machine learning.Entities:
Keywords: analytical data management; computational chemistry; machine learning; metabolite identification; metabolite prediction
Year: 2022 PMID: 35800176 PMCID: PMC9253584 DOI: 10.3389/ftox.2022.932445
Source DB: PubMed Journal: Front Toxicol ISSN: 2673-3080
FIGURE 1The MetaSense metabolite generation and identification process: (A) Metabolites are generated using structural information and biotransformation rules. (B) Data acquired by analytical instruments is combined with the molecular structure and biotransformation predictions. (C) Analytical data is processed to identify metabolites. (D) Processed data is stored in SpectrusDB database. Data can be manually reviewed and processed, accessed via software tools, or be used to generate reports.
FIGURE 2A screen capture from MetaSense, showing the analysis of Terfenadine. The reactions present, absolute area, retention time and mass are notated in the Metabolite Summary Table (i.e., Parent + O) and resultant structures are visualized in the BTM. The Kinetic/Stability Plot allows users to assess formation/generation of metabolites across the entire study.