Literature DB >> 35341032

Datasets from harmonised metabolic phenotyping of root, tuber and banana crop.

Margit Drapal1, Laura Perez-Fons1, Elliott J Price1, Delphine Amah2, Ranjana Bhattacharjee2, Bettina Heider3, Mathieu Rouard4, Rony Swennen5,6,7, Luis Augusto Becerra Lopez-Lavalle8, Paul D Fraser1.   

Abstract

Biochemical characterisation of germplasm collections and crop wild relatives (CWRs) facilitates the assessment of biological potential and the selection of breeding lines for crop improvement. Data from the biochemical characterisation of staple root, tuber and banana (RTB) crops, i.e. banana (Musa spp.), cassava (Manihot esculenta), potato (Solanum tuberosum), sweet potato (Ipomoea batatas) and yam (Dioscorea spp.), using a metabolomics approach is presented. The data support the previously published research article "Metabolite database for root, tuber, and banana crops to facilitate modern breeding in understudied crops" (Price et al., 2020) [1]. Diversity panels for each crop, which included a variety of species, accessions, landraces and CWRs, were characterised. The biochemical profile for potato was based on five elite lines under abiotic stress. Metabolites were extracted from the tissue of foliage and storage organs (tuber, root and banana pulp) via solvent partition. Extracts were analysed via a combination of liquid chromatography - mass spectrometry (LC-MS), gas chromatography (GC)-MS, high pressure liquid chromatography with photodiode array detector (HPLC-PDA) and ultra performance liquid chromatography (UPLC)-PDA. Metabolites were identified by mass spectral matching to in-house libraries comprised from authentic standards and comparison to databases or previously published literature.
© 2022 Published by Elsevier Inc.

Entities:  

Keywords:  Metabolomics; banana; cassava; potato; sweet potato; underutilised crops; yam

Year:  2022        PMID: 35341032      PMCID: PMC8943254          DOI: 10.1016/j.dib.2022.108041

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the Data

The database provides a valuable resource describing the biochemical composition of cassava, sweet potato, potato, yam and banana. The database can be used to compare chemotypes of varieties/species of root, tuber and banana crops. The database can facilitate the identification of agronomic and consumer traits with quantifiable biochemical markers. Specific biochemical signatures can be identified for breeding selection.

Data Description

Resources for genetic and phenotypic diversity in underutilised crops are an important aspect for successful breeding efforts. Analysis of the metabolite composition of respective tissues/crops enables the assessment of chemical diversity available, the identification of certain phenotypes (e.g. nutrients content) or the elucidation of underlying mechanisms for specific traits (e.g. whitefly resistance in cassava [9]). As part of the Roots, Tubers and Bananas (RTB) project, metabolomics was used to analyse diversity panels of 38 banana accessions [10], 23 cassava varieties [11], 25 sweet potato accessions [12] and five yam species (D. rotundata, D. cayenensis, D. alata, D. bulbifera and D. dumetorum) [13], [14], [15]. In addition, five potato varieties were analysed to identify metabolites associated with resistance to drought [16] and two cassava varieties were compared to characterise the natural variation in resistance to whitefly [9]. Analysis of these crops was performed on different plant tissues (e.g. leaf, tuber and root) and for banana and cassava, on plants under two different cultivation conditions: in vitro propagation and open field. The respective species/accessions were subjected to a standard methanol-water-chloroform extraction, followed by different metabolomics techniques. LC-MS and GC-MS analysis was performed for untargeted profiling of polar and non-polar extracts. Analysis of non-polar extracts by LC-PDA was performed for a more targeted screening of isoprenoids (e.g. carotenoids and chlorophylls). Compounds in the samples were compared to retention time and UV/Vis spectrum of authentic standards (Fig. 1).
Fig. 1

UV/Vis spectra of carotenoids and xanthophylls. The respective names of the compounds is displayed at the bottom right side of the spectrum. Retention times (RT) are listed as minutes underneath the compound name. Numbers in the spectra indicate the wavelength of the peaks characteristic for the respective compounds.

UV/Vis spectra of carotenoids and xanthophylls. The respective names of the compounds is displayed at the bottom right side of the spectrum. Retention times (RT) are listed as minutes underneath the compound name. Numbers in the spectra indicate the wavelength of the peaks characteristic for the respective compounds. Molecular features detected in the different analysis techniques, were compared to authentic standards and spectral features in databases (e.g. NIST) for metabolite identification. For GC-MS, Automated mass spectral deconvolution and identification system (AMDIS) was used and settings were modified specific to certain crops (Table 1). For LC-MS analysis, the R package metaMS was used and a script for molecular feature extraction and library comparison was modified for samples analysed with maXis Ultra-High Resolution QTOF (Bruker, Germany). The outputs from both analysis techniques are available, as unprocessed Excel tables listing the areas of individual molecular features/metabolites in the respective samples, in Mendeley Data repository [2], [3], [4], [5], [6], [7], [8]. The identified metabolites were quantified relative to internal standards. A database was compiled for the present RTB crops and includes the quantitative range of each metabolite present in the individual tissues of each genus [1].
Table 1

Settings for Automated mass spectral deconvolution and identification system (AMDIS) for data analysis of GC-MS files.

AMIDS SettingsYam (polar)Yam (non-polar)CassavaPotato, sweet potato and banana
Identification
Minimum match factor—————————————- 80 —————————————–
Multiple identification per compound—————————————- yes —————————————-
Show standards—————————————- no —————————————–
Only reverse search—————————————- no —————————————–
Type of analysis————————– Use retention index data ————————–
RI window9+0*0.019+0*0.011+0*0.0120+0*0.01
Match factor penalties levelaverageaveragestrongaverage
Max. penalty20201020
No RI in library10101010

Instrument
Low m/z———————————— auto (50) ————————————
High m/z———————————— auto (520) ———————————–
Use scan set—————————————- no —————————————–
Threshold————————————— high —————————————
Scan direction———————————— high to low ———————————-
Data file format———————————– Agilent files ———————————-
Instrument type———————————– Quadrupole ———————————-

Deconvolution
Component width12123232
Omitted—————————————- 28 —————————————–
Adjacent peak subtractiontwotwoonetwo
Resolutionlowlowlowmedium
Sensitivityvery lowlowvery lowlow
Shapehighmediummediumlow

QA/QC
Solvent tailing (m/z)—————————————- 84 —————————————–
Column bleed (m/z)—————————————- 207 —————————————–
Settings for Automated mass spectral deconvolution and identification system (AMDIS) for data analysis of GC-MS files.

Experimental Design, Materials and Methods

Metabolite extraction

Lyophilised tissue was ground and homogenised to a fine powder. Aliquots (10 mg) were weighed for each sample and extracted with a methanol/water/chloroform extraction method. Due to the size of the individual sample sets, sample batch of 22 sample were created. Each sample batch included an extraction blank and a quality control, which represented a pool of a samples in the respective sample set. Extraction methods were optimised for specific chemical classes and for each crop [10], [11], [12], [13], [14], [15], [16], [17], [18], e.g. carotenoid extraction for yam with 200 mg/sample. The yam dataset was created with GC-MS analysis of aqueous and organic phase and HPLC-DAD analysis of the organic phase. Datasets for all other crops (sweet potato, potato, banana and cassava) were created with GC-MS and LC-MS analysis of the aqueous phase and UPLC-DAD analysis of the organic phase. The dataset for sweet potato and cassava also included GC-MS analysis of the organic phase.

Liquid chromatography-mass spectrometry (LC-MS) analysis

Aqueous extracts were dried down and resuspended in methanol/water (1:1, 100 μL). Internal standard (homogentisic acid, 5 μg, or genistein, 10 μg) was added to each sample, the extraction blank and the quality control. Samples were filtered (nylon, 0.45 μm) and analysed with Dionex Ultimate 3000 UHPLC (Thermo Scientific) coupled to maXis Ultra-High Resolution QTOF (Bruker, Germany) in negative electrospray ionisation mode (Vi, 5 μl). Aliquots of samples (10 μL) were separated with Acquity BEH C18 column and a solvent gradient including 0.1% formic acid in water and acetonitrile [11]. Extraction of chemical features from raw data files and search chemical database was compiled with R package metaMS [19,20] (Fig. 2) including an in-house library with authentic standards. Identification was set to m/z difference 0.005 and retention time difference 0.3 min. The resulting data matrix containing integrated peak areas of both unidentified chemical features and annotated metabolites was exported as Microsoft Excel Open XML Spreadsheet (.xlsx) format.
Fig. 2

R script for R package metaMS to convert raw LC-MS files into an unprocessed Excel file.

R script for R package metaMS to convert raw LC-MS files into an unprocessed Excel file.

Gas chromatography-mass spectrometry (GC-MS) analysis

Dried extracts were derivatised with methoxyamine hydrochloride (MeOX) in pyridine, followed by N-methyl-N-(trimethylsilyl)trifluoroacetamine (MSTFA) at 40°C. The 7890A GC on-line with 5975C MSD (Agilent Technologies, US) was set with 1 μL injection volume in splitless mode and a temperature gradient from 70-325°C [13] using a DB-5MS column. Data was compiled with AMDIS (v2.71, NIST) and an in-house library specific to each crop. Deconvolution and identification settings were optimised for each crop (Table 1).

High pressure and ultra performance liquid chromatography (HPLC/UPLC) analysis

Both HPLC and UPLC analysis included a photodiode array detector (PDA). Extracts were dried down and resuspended in ethyl acetate, to concentrate the amount of carotenoids and chlorophylls present. For analysis of yam samples by Alliance HPLC-PDA (Waters, UK), an aliquot (20μL) was injected and separated on a reverse-phase (RP) column (4.6 × 250 mm, C30, 5μm particle size; YMC Inc., Kyoto, Japan) at 25°C with a 60min solvent gradient including three buffers [14]. For all other crops, Acquity UPLC-PDA was employed with lower injection volume (3-7μL), an Ethylene Bridged Hybrid (BEH C18) column (2.1 × 100mm, 1.7μm) with a BEH C18 VanGuard precolumn (2.1 × 50mm, 1.7μm) and a mobile phase consisting of two buffers [21]. The PDAs were scanning in a continuous manner from 250-600nm. Peaks were integrated using Empower 2 (Waters, UK) and identified through chromatographic and spectral characteristics to standards (Fig. 1) and literature references [22].

Data processing

Data output from the respective data analysis software was tabulated using IdAlign (Centre for Computational Systems Biology, University of Western Australia, http://www.softsea.com/review/IdAlign.html) and Microsoft Excel 2016. Metabolite features present in extraction blanks were subtracted or excluded from the data sets. The identified metabolites and molecular features were quantified relative to the respective internal standard and the datasets were normalised to the individual sample weights (μg/g dry weight). Some compounds were detected as multiple derivatives (e.g. glutamic acid and pyroglutamic acid) and their areas were merged before normalisation. In some cases, the data needed to be normalised with the quality controls to correct for batch effects (e.g. cassava [11]).

Ethics statements

This work included plant material and did not include work involved with human subjects, animal experiments or data collected from social media platforms.

CRediT Author Statement

Margit Drapal, Laura Perez and Elliott Price: Data generation and curation, Writing – original draft preparation; Delphine Amah, Ranjana Bhattacharjee, Bettina Heider, Mathieu Rouard, Rony Swennen and Luis Augusto Becerra Lopez-Lavalle: Provided germplasm; Paul D. Fraser: Conceptualization, Supervision and Writing – review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
SubjectOmics: Metabolomics
Specific subject areaMetabolite profiling of roots, tuber and bananas
Type of dataXLSX format
How the data were acquiredMass spectrometry data were obtained with two analytical platforms:- Dionex Ultimate 3000 UHPLC (Thermo Scientific) coupled to maxis Ultra-High Resolution QTOF (Bruker, Germany) with ESI in negative ionisation mode- 7890A GC on-line with 5975C MSD (Agilent Technologies, US)- Analysis for carotenoids and chlorophylls was performed with Acquity UPLC-PDA (Waters, UK) or Alliance HPLC-PDA (Waters, UK)
Data formatRaw (LC-MS data)Analysed (tabulated format of metabolites identified by GC-MS)Processed/Filtered (database of identified metabolites)
Description of data collectionLyophilised and ground plant tissue was extracted with a methanol/water or methanol/100mM Tris-HCl with 1M NaCl and chloroform method (1:1:2, vol.). Carotenoid/chlorophylls were analysed by HPLC-PDA or UPLHC-PDA analysis. Metabolite profiling of the aqueous and organic phase was performed with GC-MS and LC-MS. GC-MS raw data was processed with AMDIS and LC-MS data files were analysed with metaMS package in R. Data was normalised relative to the internal standard and to the sample weight (μg/g dry weight) and batch correction with QC was applied with large sample sets.
Data source locationRoyal Holloway University of London, Egham, United Kingdom
Data accessibilityDatabase is published in Price et al. (2020).Pre-processed data is available at Mendeley Data for sweet potato [2], potato [3], banana [4] and cassava [5], [6], [7]. GC-MS data files are available for yam [8].
Related research articleE.J. Price, M. Drapal, L. Perez-Fons, D. Amah, R. Bhattacharjee, B. Heider, M. Rouard, R. Swennen, L.A. Becerra Lopez-Lavalle, P.D. Fraser, Metabolite database for root, tuber, and banana crops to facilitate modern breeding in understudied crops, Plant J. 101 (2020) 1258–1268. https://doi.org/10.1111/tpj.14649.
  15 in total

1.  Fast parametric time warping of peak lists.

Authors:  Ron Wehrens; Tom G Bloemberg; Paul H C Eilers
Journal:  Bioinformatics       Date:  2015-05-13       Impact factor: 6.937

2.  Technical advance: application of high-performance liquid chromatography with photodiode array detection to the metabolic profiling of plant isoprenoids.

Authors:  P D Fraser; M E Pinto; D E Holloway; P M Bramley
Journal:  Plant J       Date:  2000-11       Impact factor: 6.417

3.  Cassava Metabolomics and Starch Quality.

Authors:  Laise Rosado-Souza; Laure C David; Margit Drapal; Paul D Fraser; Jörg Hofmann; Patrick A W Klemens; Frank Ludewig; H Ekkehard Neuhaus; Toshihiro Obata; Laura Perez-Fons; Armin Schlereth; Uwe Sonnewald; Mark Stitt; Samuel C Zeeman; Wolfgang Zierer; Alisdair R Fernie
Journal:  Curr Protoc Plant Biol       Date:  2019-12

4.  Subchromoplast sequestration of carotenoids affects regulatory mechanisms in tomato lines expressing different carotenoid gene combinations.

Authors:  Marilise Nogueira; Leticia Mora; Eugenia M A Enfissi; Peter M Bramley; Paul D Fraser
Journal:  Plant Cell       Date:  2013-11-18       Impact factor: 11.277

5.  A genome-wide metabolomic resource for tomato fruit from Solanum pennellii.

Authors:  Laura Perez-Fons; Tom Wells; Delia I Corol; Jane L Ward; Christopher Gerrish; Michael H Beale; Graham B Seymour; Peter M Bramley; Paul D Fraser
Journal:  Sci Rep       Date:  2014-01-24       Impact factor: 4.379

6.  Metabolite database for root, tuber, and banana crops to facilitate modern breeding in understudied crops.

Authors:  Elliott J Price; Margit Drapal; Laura Perez-Fons; Delphine Amah; Ranjana Bhattacharjee; Bettina Heider; Mathieu Rouard; Rony Swennen; Luis Augusto Becerra Lopez-Lavalle; Paul D Fraser
Journal:  Plant J       Date:  2020-01-22       Impact factor: 6.417

7.  Capturing Biochemical Diversity in Cassava ( Manihot esculenta Crantz) through the Application of Metabolite Profiling.

Authors:  Margit Drapal; Elisabete Barros de Carvalho; Tatiana M Ovalle Rivera; Luis Augusto Becerra Lopez-Lavalle; Paul D Fraser
Journal:  J Agric Food Chem       Date:  2019-01-09       Impact factor: 5.279

8.  A metabolomics characterisation of natural variation in the resistance of cassava to whitefly.

Authors:  Laura Perez-Fons; Adriana Bohorquez-Chaux; Maria L Irigoyen; Danielle C Garceau; Kris Morreel; Wout Boerjan; Linda L Walling; Luis Augusto Becerra Lopez-Lavalle; Paul D Fraser
Journal:  BMC Plant Biol       Date:  2019-11-27       Impact factor: 4.215

9.  Metabolic diversity in sweet potato (Ipomoea batatas, Lam.) leaves and storage roots.

Authors:  Margit Drapal; Genoveva Rossel; Bettina Heider; Paul D Fraser
Journal:  Hortic Res       Date:  2019-01-01       Impact factor: 6.793

10.  Metabolite profiling of yam (Dioscorea spp.) accessions for use in crop improvement programmes.

Authors:  Elliott J Price; Ranjana Bhattacharjee; Antonio Lopez-Montes; Paul D Fraser
Journal:  Metabolomics       Date:  2017-10-14       Impact factor: 4.290

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.