| Literature DB >> 32176690 |
Lisa M Bramer1, Amanda M White1, Kelly G Stratton1, Allison M Thompson2, Daniel Claborne1, Kirsten Hofmockel2, Lee Ann McCue2.
Abstract
The high-resolution and mass accuracy of Fourier transform mass spectrometry (FT-MS) has made it an increasingly popular technique for discerning the composition of soil, plant and aquatic samples containing complex mixtures of proteins, carbohydrates, lipids, lignins, hydrocarbons, phytochemicals and other compounds. Thus, there is a growing demand for informatics tools to analyze FT-MS data that will aid investigators seeking to understand the availability of carbon compounds to biotic and abiotic oxidation and to compare fundamental chemical properties of complex samples across groups. We present ftmsRanalysis, an R package which provides an extensive collection of data formatting and processing, filtering, visualization, and sample and group comparison functionalities. The package provides a suite of plotting methods and enables expedient, flexible and interactive visualization of complex datasets through functions which link to a powerful and interactive visualization user interface, Trelliscope. Example analysis using FT-MS data from a soil microbiology study demonstrates the core functionality of the package and highlights the capabilities for producing interactive visualizations.Entities:
Mesh:
Year: 2020 PMID: 32176690 PMCID: PMC7098629 DOI: 10.1371/journal.pcbi.1007654
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Example format of required input data.frames for ftmsRanalysis package showing the first few rows and columns of each.
| mass | sample 1 | sample 2 | … | mass | molecular formula | sample | Location | Vegetation |
|---|---|---|---|---|---|---|---|---|
| 1152257.5 | 1812663.0 | … | C8H14O7 | Michigan | Corn | |||
| 2158339.3 | 0 | … | C9H5O6N1 | Wisconsin | Switchgrass | |||
Chemical property calculations available within ftmsRanalysis.
| Name(s) | Function Name | Optional Arguments |
|---|---|---|
| Elemental ratios (O:C, H:C, N:C, P:C, N:P) | calc_element_ratios | – |
| Kendrick Mass and Defect | calc_kendrick | Compound Base: |
| Nominal oxidation state of carbon (NOSC) | calc_nosc | – |
| Cox Gibbs Free Energy | calc_gibbs | – |
| Aromaticity Index (AI) and modified AI | calc_aroma | – |
| Double bond equivalent (DBE) and DBE minus oxygen (DBE—O) | calc_dbe | Covalency of each element |
Fig 1(A) Log2-transformed peak intensity profiles for each sample colored by group. (B) Number of peaks observed in each sample colored by group. (C) Histogram of all observed masses, with masses colored in blue representing those that would be retained after application of a mass_filter(). (D) Barchart corresponding to the minimum number of samples for which a peak was observed, with the height of the bar giving the number of peaks for which this is true for 1 to 20 samples. The highlighted bar gives the number of peaks remaining if the molecule_filter() requiring a peak to be observed in two or more samples is applied.
Fig 2(A) Kendrick plot for EM0011_sample. (B) NOSC histogram and density curve for observed peaks for the EM0011_sample.
Fig 3(A) Van Krevelen plot for Michigan switchgrass samples with points colored by the number of samples in which the compound was observed colored by number of observations. (B) Individual sample and group-level modified AI density curves for Michigan switchgrass.
Fig 4(A) Van Krevelen plot of peaks uniquely observed in Michigan corn and Michigan switchgrass samples. (B) NOSC density curves for Michigan corn and switchgrass groups.
Fig 5Screenshot of a Trelliscope display showing a van Krevelen plot for each sample.
The left image shows the sort/filter capability, with which characteristics of each sample (e.g. number of observed formulae) can be used to order plots. The right image shows the first page of 2x2 plots of individual samples ordered by number of observed compounds and colored by number of S atoms.