| Literature DB >> 33977389 |
Abstract
BACKGROUND: Precision medicine, space exploration, drug discovery to characterization of dark chemical space of habitats and organisms, metabolomics takes a centre stage in providing answers to diverse biological, biomedical, and environmental questions. With technological advances in mass-spectrometry and spectroscopy platforms that aid in generation of information rich datasets that are complex big-data, data analytics tend to co-evolve to match the pace of analytical instrumentation. Software tools, resources, databases, and solutions help in harnessing the concealed information in the generated data for eventual translational success. AIM OF THE REVIEW: In this review, ~ 85 metabolomics software resources, packages, tools, databases, and other utilities that appeared in 2020 are introduced to the research community. KEY SCIENTIFIC CONCEPTS OF REVIEW: In Table 1 the computational dependencies and downloadable links of the tools are provided, and the resources are categorized based on their utility. The review aims to keep the community of metabolomics researchers updated with all the resources developed in 2020 at a collated avenue, in line with efforts form 2015 onwards to help them find these at one place for further referencing and use.Entities:
Keywords: Annotation; Database; In silico; Metabolite; Metabolomics; Program; Recourse; Software; Tool
Mesh:
Year: 2021 PMID: 33977389 PMCID: PMC8112213 DOI: 10.1007/s11306-021-01796-1
Source DB: PubMed Journal: Metabolomics ISSN: 1573-3882 Impact factor: 4.290
The entire list of reviewed tools is organized by important analytical steps in metabolomics data analysis and includes details regarding their platform dependency, and implementation, e.g., programming language (R, Python, Java, C/C ++, etc.) or web browser based and their availability
| Name of the Software Tool | Category | Platform dependency | Implementation/ use dependency | Software availability | References |
|---|---|---|---|---|---|
| AlpsNMR | Platform | NMR | R | (Madrid-Gambin et al. | |
| SigMa | Platform | NMR | MATLAB, Standalone | (Khakimov et al. | |
| NMRfilter | Platform | NMR | NA | (Kuhn et al. | |
| MSHub + EI-GNPS | Platform | GC–MS | GNPS, Web | (Aksenov et al. | |
| RGCxGC toolbox | Platform | GCXGC-MS | R, TeX | (Quiroz-Moreno et al. | |
| CROP | Preprocessing | LC–MS/MS | R | (Kouřil et al. | |
| ncGTW | Preprocessing | LC–MS/MS | R, C ++ | (Wu et al. | |
| TidyMS | Preprocessing | LC–MS/MS | Python | (Riquelme et al. | |
| AutoTuner | Preprocessing | LC–MS/MS | R | (McLean & Kujawinski, | |
| hRUV | Preprocessing | LC–MS/MS | R | (Kim et al. | |
| MetumpX | Preprocessing | Any | R | (Wajid et al. | |
| MetaQuac | QC | Targeted LC–MS | R | (Kuhring et al. | |
| dbnorm | QC | Any | R | (Bararpour et al. | |
| MetaClean | QC | LC–MS/MS | R | (Chetnik et al. | |
| NeatMS | QC | LC–MS/MS | Python | (Gloaguen et al. 2020) | |
| MESSAR | Annotation | LC–MS/MS | Web | (Liu, Mrzic, et al., | |
| SMART 2.0 | Annotation | 2D NMR | Web | (Reher et al. 2020) | |
| MetFID | Annotation | MS/MS data | NA | NA | (Fan et al. |
| CPVA | Annotation | Any | Web | (Luan et al. | |
| NRPro | Annotation | LC–MS/MS | Java, Web | (Ricart et al. | |
| MetENP/MetENPWeb | Annotation | LC–MS/MS | R, Web | (Choudhary et al. | |
| CANOPUS | Annotation | LC–MS/MS | Standalone | (Dührkop et al. | |
| MolDiscovery | Annotation | LC–MS/MS | Python | (Cao et al. n.d.) | |
| MetIDfyR | Annotation | LC–MS/MS | R | (Delcourt et al. | |
| Qemistree | Annotation | LC–MS/MS | Python | (Tripathi et al. | |
| IIMN | Annotation | LC–MS/MS | GNPS, Web | (Robin Schmid, Daniel Petras, Louis-Félix Nothias, Mingxun Wang, Allegra T. Aron, Annika Jagels, Hiroshi Tsugawa, Johannes Rainer, Mar Garcia-Aloy, Kai Dührkop, Ansgar Korf, Tomáš Pluskal, Zdeněk Kameník, Alan K. Jarmusch, Andrés Mauricio Caraballo-Rodrígu 2020) | |
| FOBI | Annotation | Any | R, Web | (Castellano-Escuder et al. | |
| Biodendro | Annotation | LC–MS/MS | Python | (Rawlinson et al. | |
| AllCCS atlas | Annotation | IM-MS | Web | (Zhou et al. | |
| Binner | Annotation | LC–MS/MS | Java | (Kachman et al. | |
| MS-CleanR | Annotation | LC–MS/MS | R | (Fraisier-Vannier et al. | |
| Retip | Annotation | LC–MS/MS | R | (Bonini et al. | |
| QSRR Automator | Annotation | LC–MS/MS | Python | (Naylor et al. | |
| MFAssignR | Annotation | LC–MS/MS | R, HTML | (Schum et al. | |
| McSearch | Annotation | LC–MS/MS | R | (Xing et al. | |
| REDU | Annotation | LC–MS/MS | GNPS, Web | (Jarmusch et al. | |
| MASST | Annotation | LC–MS/MS | GNPS, Web | (Wang, Jarmusch, et al., | |
| NPClassifier | Annotation | Any | Web | (kim et al. | |
| patRoon | Annotation | HR MS/MS | R | (Helmus et al. | |
| LipidLynxX | Annotation | LC–MS/MS | Python, Standalone | (Ni & Fedorova, | |
| Skyline | Multifunctional | Any | Standalone | (Adams et al. | |
| NoTaMe | Multifunctional | LC–MS/MS | R, Web | (Klåvus et al. | |
| BALSAM | Multifunctional | IMS, GC–MS, LC–MS | Web, Python, HTML, Java | (Weber et al. | |
| MRMkit | Multifunctional | Targeted LC–MS | Python, R | (Teo et al. | |
| MetaboShiny | Multifunctional | Any | R | (Wolthuis et al. | |
| SmartPeak | Multifunctional | Many | C#, Python | (Kutuzova et al. | |
| MS-DIAL 4.0 | Multifunctional | LC–MS/MS, GC–MS, IMS | Standalone | (Tsugawa et al. | |
| IP4M | Multifunctional | LC–MS/MS | Java, Perl, R, Standalone | (Liang et al. | |
| DropMS | Multifunctional | HR MS | Web | (Rosa et al. | |
| Epimetal | Statistics, visualization | Any | JavaScript, Web | (Ekholm et al. | |
| Metabolite AutoPlotter | Statistics, visualization | Quantitative metabolomics data, any | R, Web | (Pietzke & Vazquez, | |
| Metabolite-Investigator | Statistics, visualization | LC–MS | R, Web | (Beuchel et al. | |
| VIIME | Statistics, visualization | Any | Web | (Choudhury et al. | |
| struct | Statistics, visualization | Any | R | (Lloyd et al. | |
| lipidr | Statistics, visualization | LC–MS/MS | R | (Mohamed et al. | |
| NOREVA | Statistics | Any | Web, R, Standalone | (Yang et al. | |
| %polynova_2way | Statistics | Processed data | SAS | (Manjarin et al. | |
| rawR | Visualization | LC–MS | R, C ++ | (Kockmann & Panse, | |
| Metaboverse | Visualization | Any | Java, HTML, Standalone | (Jordan A. Berg, Youjia Zhou, T. Cameron Waller, Yeyun Ouyang, Sara M. Nowinski, Tyler Van Ry, Ian George, James E. Cox, Bei Wang 2020) | |
| JS-MS 2.0 | Visualization | LC–MS/MS | Java, JavaScript, HTML | (Henning & Smith, | |
| COCONUT | Database | Any | Web | (Sorokina et al. n.d.) | |
| METLIN MS2 molecular standards database | Database | LC–MS/MS | Web | (Xue et al. | |
| CSMDB | Database | NMR | MATLAB | (Charris-Molina et al. | |
| EMBL-MCF | Database | LC–MS | NA | ||
| MIAMI | Isotopic | GC–MS | C ++ | (Dudek et al. | |
| isoSCAN | Isotopic | GC–MS | R | (Capellades et al. | |
| LiPydomics | Lipidomics | Ion Mobility, Lipidomics | Python, HTML | (Ross et al. | |
| LipidCreator | Lipidomics | LC–MS | C#, HTML, Skyline plugin | (Peng et al. | |
| Lipid Annotator | Lipidomics | LC–MS/MS | NA | NA | (Koelmel et al. |
| Raman2imzML | MSI | Raman | C ++, R | (Iakab et al. | |
| SUMMER | Multiomics | Any | R, Web | Huang et al. ( | |
| metpropagate | Analysis, visualization | Untargeted LC–MS/MS | R, Python | Graham Linck et al. ( |
The tools generally follow their order of appearance in the manuscript text
List of useful R/ Bioconductor packages that surfaced/ were improved in 2020
| CRAN package name | Title | Description | Link |
|---|---|---|---|
| lilikoi | Metabolomics personalized pathway analysis tool | Helps map metabolites data into pathways and calculates pathway deregulation scores, and enables perform exploratory analysis, classification and prognosis analysis on both metabolites and pathways | |
| omu | A metabolomics analysis tool for intuitive figures and convenient metadata collection | Helps generate intuitive figures for metabolomics data by using Kyoto Encyclopaedia of Genes and Genomes (KEGG) hierarchy data, and gathers functional orthology and gene data using the package 'KEGGREST' to access the 'KEGG' API | |
| eRah | Automated spectral deconvolution, alignment, and metabolite identification in GC/MS-based untargeted metabolomics | Updated to 2016 published tool eRah, that aids in automated compound deconvolution, alignment across samples, and identification of metabolites by spectral library matching in untargeted GC–MS metabolomics workflows | |
| MetaDBparse | Annotate mass over charge values with databases and formula prediction | Useful for parsing functionality for over 30 metabolomics databases, and calculates given adducts and isotope patterns and inserts into one big database which can be used to annotate unknown | |
| MetaClean | Detection of low-quality peaks in untargeted metabolomics data | Uses 11 peak quality metrics and eight diverse machine learning algorithms to build a classifier for the automatic assessment of peak integration quality of peaks from untargeted metabolomics analyses | |
| tmod | Feature set enrichment analysis for metabolomics and transcriptomics | Feature or gene set enrichment analysis in transcriptomics and metabolomics data and the allows enrichment based on ranked list of features, visualization and multivariate data analysis | |
| ccmn | CCMN and other normalization methods for metabolomics data | Allows implementation of Cross-contribution Compensating Multiple standard Normalization (CCMN) method | |
| LipidMS | Lipid annotation for LC–MS/MS DIA data | Aids in annotation of lipids in untargeted LC-DIA-MS lipidomics data based on fragmentation rules | |
| enviGCMS | GC/LC–ms data analysis for environmental science | For environmental mass spectrometry (GC/LC-MS) data analysis for molecular isotope ratio, matrix effects and short-chain chlorinated paraffins analysis etc | |
| nontarget | Detecting isotope, adduct and homologue relations in LC–MS data | Allows screening of HRMS data set for peaks related by (1) isotope patterns, (2) different adducts of the same molecule and/or (3) homologue series; thus yielding isotopic pattern and adduct groups called 'components' with homologue series information. Further plotting, filtering of MS data for mass defects etc. are facilitated | |
| Finding rhythmic and non-rhythmic trends in multi-omics data (MOSAIC) | MOSAIC (Multi-Omics Selection with Amplitude Independent Criteria) provides a function (mosaic_find()) designed to find rhythmic and non-rhythmic trends in multi-omics time course data using model selection and joint modelling | ||
| Integrative pathway enrichment analysis of multivariate omics data | A framework for analysing multiple omics datasets in the context of molecular pathways, biological processes and other types of gene sets. The tool uses p-value merging to combine gene- or protein-level signals, followed by ranked hypergeometric tests to determine enriched pathways and processes | ||
| Web-based interactive omics visualization | Provides modules for creating web-based applications that use plot-based strategies to visualize and analyse multi-omics data | ||
| Omics data integration using kernel methods | The package aims at providing methods to combine kernel for unsupervised exploratory analysis, that can help integration of heterogenous types of data |
Fig. 1Snapshots of a subset of tools and resources discussed in this review. a SMART 2.0 outputs for a demo metabolite swinholide A, b Outputs from MESSAR for corosollic acid, c Demo analysis on FOBI
Fig. 2Snapshots of a subset of tools and resources discussed in this review. a Outputs on demo data on NRPro, b REDU analysis results for a demo data, c MetENPWeb analysis results
Fig. 3Snapshots of a subset of tools and resources discussed in this review. Web interfaces and snapshot of outputs for demo data on a VIIME, b MetaboliteAutoPlotter, and c SUMMER