Literature DB >> 35729681

Switchgrass Metabolomics Reveals Striking Genotypic and Developmental Differences in Specialized Metabolic Phenotypes.

Xingxing Li^1,2, Saurav J Sarma^3,4, Lloyd W Sumner^5,3,4,6, A Daniel Jones^1,2, Robert L Last^1,2,7.

Abstract

Switchgrass (Panicum virgatum L.) is a bioenergy crop that grows productively on lands not suitable for food production and is an excellent target for low-pesticide input biomass production. We hypothesize that resistance to insect pests and microbial pathogens is influenced by low-molecular-weight compounds known as specialized metabolites. We employed untargeted liquid chromatography-mass spectrometry, quantitative gas chromatography-mass spectrometry (GC-MS), and nuclear magnetic resonance spectroscopy to identify differences in switchgrass ecotype metabolomes. This analysis revealed striking differences between upland and lowland switchgrass metabolomes as well as distinct developmental profiles. Terpenoid- and polyphenol-derived specialized metabolites were identified, including steroidal saponins, di- and sesqui-terpenoids, and flavonoids. The saponins are particularly abundant in switchgrass extracts and have diverse aglycone cores and sugar moieties. We report seven structurally distinct steroidal saponin classes with unique steroidal cores and glycosylated at one or two positions. Quantitative GC-MS revealed differences in total saponin concentrations in the leaf blade, leaf sheath, stem, rhizome, and root (2.3 ± 0.10, 0.5 ± 0.01, 2.5 ± 0.5, 3.0 ± 0.7, and 0.3 ± 0.01 μg/mg of dw, respectively). The quantitative data also demonstrated that saponin concentrations are higher in roots of lowland (ranging from 3.0 to 6.6 μg/mg of dw) than in upland (from 0.9 to 1.9 μg/mg of dw) ecotype plants, suggesting ecotypic-specific biosynthesis and/or biological functions. These results enable future testing of these specialized metabolites on biotic and abiotic stress tolerance and can provide information on the development of low-input bioenergy crops.

Entities: Chemical

Keywords: NMR; biofuels; biomass; diosgenin; ecotype; metabolomics; saponin; specialized metabolite; sustainability; switchgrass

Mesh：

Substances：
Saponins

Year: 2022 PMID： 35729681 PMCID： PMC9264348 DOI： 10.1021/acs.jafc.2c01306

Source DB: PubMed Journal: J Agric Food Chem ISSN： 0021-8561 Impact factor: 5.895

Introduction

Development of environmentally sustainable and economical production of transportation fuels and industrial feedstocks using plant biomass is an important goal for the bioeconomy. Dedicated energy crops that are productive with low or no chemical fertilizers and pesticides on land that is unsuitable for food and fiber crops have received much attention.[1] This requires development of plants with a suite of “ideal” traits,[2] including perennial life cycle, rapid growth under conditions of low soil fertility and water content, as well as being resilient to pests and pathogens. Plants are master biochemists, producing a wide variety of general and specialized metabolites adapted to their ecological niches.[3] The structurally diverse tissue- and clade-specific specialized metabolites play varied roles in how plants cope with biotic and abiotic stresses, both by reducing deleterious impacts and promoting beneficial interactions. For instance, glucosinolates produced by crucifers such as mustard, cabbage, and horseradish mediate interactions with insect herbivores,[4] and flavonoids that induce the rhizobial lipochitooligosaccharides (“nod factors”) initiate the rhizobium–legume nitrogen fixation symbiosis.[5] The root-accumulating avenacin triterpene saponins are well documented to protect oat plants (Avena spp.) from the fungal pathogen-induced “take-all” disease,[6−8] contributing to oat productivity. Modifying plant-specialized metabolism is an attractive target for bioengineering or trait breeding to create low-input bioenergy crops that can thrive on “marginal” lands unsuitable for food and fiber crops. Although hundreds of thousands of specialized metabolites are estimated to be produced by plants,[3] there are reasons why this number is almost certainly an underestimate. First, these metabolites are taxonomically restricted, often showing interspecies or even intraspecies variation;[3] thus, any sampled species, ecotype, or cultivar will have a small subset of the overall plant kingdom’s phenotypic diversity. Second, specialized metabolites tend to be produced in a subset of cell or tissue types in any plant species analyzed; thus, cataloging the metabolic potential of even a single species requires extraction of multiple tissues over the plant’s development. Third, accumulation of these metabolites can be impacted by growth conditions and induced by abiotic or biotic stress.[9] Finally, identification and structural characterization of newly discovered metabolites require specialized capabilities, typically a combination of mass spectrometry (MS) and nuclear magnetic resonance (NMR) spectroscopy analysis.[10] The North American native perennial switchgrass (Panicum virgatum L.) has the potential to be cultivated as a low-input bioenergy crop for growing on nonagricultural land.[11] The two principal ecotypes of switchgrass are phenotypically distinct, with variation in flowering time, plant size, physiology, and disease resistance. Upland ecotypes exhibit robust freezing tolerance but produce relatively low biomass yield in part due to early flowering.[12−15] Plants of lowland ecotype typically found in riparian areas produce large amounts of biomass and are more flooding- and heat-tolerant, pathogen-resistant, and nutrient-use-efficient than those of the upland ecotype.[12,16,17] However, these lowland ecotypes do not perform well in northern areas, largely due to cold intolerance. Although morphological and physiological properties associated with the adaptive divergence of upland and lowland switchgrass have been intensively studied, the specialized metabolite diversity and their ecotypic differences remain underexplored, partially due to the technical challenges mentioned above. Lee et al.[18] detected large amounts of diosgenin-derived steroidal saponins from aerial tissues of four different switchgrass cultivars. Diosgenin is synthesized from cholesterol and cyclized and oxidized through several spontaneous steps and enzymatic reactions catalyzed by cytochrome P450 enzymes (CYP450s).[19] It is the backbone of spirostanol-type steroidal saponins that are important defensive compounds with documented antimicrobial and antiherbivory activities.[20−22] These natural compounds also have pharmaceutical value. Diosgenin has been used as a major precursor for synthesizing steroidal drugs including hormonal contraceptives and corticosteroid anti-inflammatory agents.[20] Beyond steroidal saponins, quercetin-derived flavonoids[23] and biotic/abiotic stress-elicited C10–C20 terpenes[24,25] were also identified in switchgrass. A comprehensive metabolomics survey will be beneficial to understand the natural product diversity in this important bioenergy crop. In this study, we developed and deployed approaches to compare metabolomes of three upland and three lowland switchgrass cultivars by untargeted liquid chromatography–MS (LC–MS), targeted gas chromatography (GC)–MS, and NMR. The two ecotypes were documented to have distinct metabolomes, especially in the rhizome and root. We identified seven structurally distinct classes of diosgenin-derived steroidal saponins and cataloged a variety of flavonoid glycosides and di- and sesqui-terpenoids. Steroidal saponins were notably abundant, accounting for more than 30% total ion counts (averaging 5 μg/mg dry tissue weight) in lowland roots from the reproductive developmental stage. Furthermore, ecotype- and/or tissue-type-specific accumulations were observed for individual saponin classes as well as total saponins. Our study provides a comprehensive analysis of the specialized metabolites produced by different switchgrass cultivars and sets the stage for developing dedicated bioenergy crops with varied plant and microbiome traits.

Results

Metabolome Comparisons between Tissue Types, Developmental Stages, and Genotypes

LC–MS was used to develop an overview comparison of the metabolomes of upland and lowland ecotypes in different tissue types and developmental stages. Tissue extracts in 80% methanol were prepared from a sample panel (Figure S1B) containing three upland (Dacotah, Summer, and Cave-in-Rock) and three lowland (Alamo, Kanlow, and BoMaster) cultivars grown from seeds in a controlled environment (Materials and Methods). Shoot, rhizome, and root tissues were analyzed from plants at three developmental stages—vegetative, the transition between vegetative and reproductive, and early reproductive (Figure S1A).[26] In total, 4,668 distinct metabolite features were identified from the positive ionization mode data set; these were annotated as retention time/mass-to-charge ratio pairs and include multiple in-source fragments from single analytes. Downstream statistical analyses focused on the 2586 of 4,668 features whose maximum abundance among the biological samples was ≥500 (Table S1). We employed two complementary approaches to annotate the features by the metabolite class. First, relative mass defect (RMD) filtering[27] was used to guide assignments of all metabolite signals to putative chemical classes (Materials and Methods). As a result, 42 and 15% of the 2586 features in the data set were tentatively annotated as terpenoid glycosides and polyphenol-derived metabolites, respectively (Figure A and Table S1 columns E and F), with RMD scores distinguishing classes based on the fractional hydrogen content in each measured ion. We then searched the molecular ion mass-to-charge ratios (m/z) and associated fragment ion information in available online mass spectral databases (Materials and Methods) and found strong matches to 169 previously characterized metabolites (level-2 non-novel metabolite identifications,[28]Table S1 column KL–KT).

Figure 1

Untargeted metabolome profiling for switchgrass. (A) Histogram of RMD values for the total 2586 features detected in this study by LC–MS in the positive ion mode. The green and orange rectangles highlight regions corresponding to the ranges of the RMD values anticipated for terpenoid glycosides and polyphenols, respectively. (B) Metabolome of the six switchgrass cultivars, three tissue types, and three developmental stages shown by a heatmap with HCA. The row and column clusters symbolize the 2586 features and 48 sample groups (containing 139 individual samples), respectively. The values representing the metabolite abundances that were used to make the heatmap were scaled to a range from 0 (the lowest abundance) to 1 (the highest abundance). (C) PCA score plots for the switchgrass shoot, rhizome, and root metabolite profiles (n = 8 for “upland vegetative” and “lowland reproductive”; n = 9 for all the other groups). The percentage of explained variation is shown on the x- and y-axes. V, vegetative phase; T, transition phase; and R, reproductive phase. The untargeted metabolome data provide a comprehensive view of specialized metabolite variation among the samples, and broad patterns of variation were revealed using hierarchical clustering analysis (HCA, Figure B). The aerial (shoot) and subterranean tissue (root and rhizome) metabolites differed noticeably, consistent with the hypothesis that there are fundamental dissimilarities between the above- and below-ground tissue metabolomes. In contrast, the rhizome and root tissues were more similar to each other. Differences in metabolomes of each of the three tissue types across switchgrass cultivars (genotypes) were investigated using principal component analysis (PCA). The shoot metabolite profiles clustered into distinct groups in the PCA score plot, corresponding with the upland (blue) and lowland (red) switchgrass ecotypes (Figure C, top panel). Separation of the metabolite profiles was especially clear for different developmental stages (developmental stages are differentiated by the symbol shape and “V, T, and R”, standing for the “vegetative, transition, and reproductive phase”, respectively, in Figure C, top panel). The PCAs also showed clear-cut differences in metabolite profiles between the upland and lowland genotypes in both the rhizomes and roots (Figure C, middle and bottom panels, respectively). The developmental stage-associated variance in metabolite profiles of these two subterranean tissues was less apparent compared to that in the aerial tissue. Taken together, the metabolite PCAs revealed distinct patterns between upland and lowland switchgrass cultivars of the three tissues across the three developmental stages.

Upland and Lowland Ecotypes Have Strikingly Distinct Metabolomes

We next focused on each tissue type x developmental stage combination and identified the metabolite features that differentially accumulated in either upland or lowland ecotypes. Surprisingly, 25% (256 of 1035, Table ) of the features detected in extracts of the vegetative-stage tillers predominantly accumulated in one or the other switchgrass ecotype. Such features were termed ecotype “differentially accumulated features” (DAFs). Specifically, there are 126 upland-enriched and 130 lowland-enriched DAFs in extracts of the vegetative-stage tillers (Figure A and Table ). Analysis of vegetative-stage roots further revealed a total of 879 features with 35% (310, Table ) meeting the ecotype DAF statistical threshold. Of these 310 DAFs, similar numbers of features were found to be either upland- (149) or lowland- (161) enriched (Figure B and Table ). DAFs were also identified for the other developmental stage x tissue type combinations (Figure S2 and Table ).

Table 1

Numbers of Total Detected Features and DAFs in Each of the Eight Developmental Stage x Tissue Type Sample Classes

		no. of DAFsb
developmentalstage xtissue type	no. of total detected featuresa	upland-enriched	lowland-enriched	total
vegetative-stagex shoot	1035	126	130	256
vegetative-stagex root	879	149	161	310
transition-stagex shoot	1037	155	133	288
transition-stagex rhizome	1114	268	139	407
transition-stagex root	965	226	235	461
reproductive-stagex shoot	1612	100	200	300
reproductive-stagex rhizome	1378	324	229	553
reproductive-stagex root	1327	408	309	717

Sum of the numbers in this column is larger than the numbers of the total detected features (2586) as some features were counted multiple times in several sample classes.

Sum of the numbers in the column of total is larger than the total detected unique DAFs (1416) as some DAFs were counted multiple times. The detailed information regarding these DAFs can be found in Tables S2–S9. Criteria for the DAFs: false discovery rate (FDR) adjusted p ≤ 0.05; fold changes ≥2.

Figure 2

DAFs were identified between the upland and lowland ecotypes. Significance analysis (cutoff threshold: FDR adjusted p ≤ 0.05; fold changes ≥2) was performed to screen for the DAFs between the upland and lowland switchgrass ecotypes (n = 8 or 9) in various developmental stage x tissue type samples. Results of the analyses for (A) vegetative-stage shoots and (B) vegetative-stage roots are shown here using volcano plots. Putative terpenoid glycosides, polyphenols, and metabolites from the other categories are classified using RMD filtering and the results color-coded. (C) Upset plot of the 1416 unique (non-overlapping) ecotype DAFs across the 8 developmental stage x tissue type sample groups. The rows and columns in the Upset plot represent the sets (the eight sample groups in this case) and their intersections, respectively. For each set that is part of a given intersection, a black filled circle is placed in the corresponding matrix cell; otherwise, a light-gray circle is shown. The numbers of DAFs in the intersections are shown as a bar chart above the matrix. The horizontal bar chart to the left of the matrix shows the size of each set. The inserted bar plot shows that upland and lowland ecotypes accumulated similar numbers of the predominant DAFs likely terpenoid glycosides (green) and polyphenols (orange). The inserted pie chart summarizes percentages of the DAFs contributed by aerial (shoot) vs subterranean (root/rhizome) tissues. V, vegetative phase; T, transition phase; and R, reproductive phase. Sum of the numbers in this column is larger than the numbers of the total detected features (2586) as some features were counted multiple times in several sample classes. Sum of the numbers in the column of total is larger than the total detected unique DAFs (1416) as some DAFs were counted multiple times. The detailed information regarding these DAFs can be found in Tables S2–S9. Criteria for the DAFs: false discovery rate (FDR) adjusted p ≤ 0.05; fold changes ≥2. Altogether, 1416 unique ecotype DAFs were identified for the eight tissue type x developmental stage combinations included in this study (Figure C and Tables S2–S9), accounting for approximately half of the features in the full data set. Based on RMD filtering, 46 and 13% of the DAFs were predicted to be terpenoid glycosides and polyphenol-derived metabolites, respectively (green and orange dots in Figures A-,B and S2, respectively). The numbers of the DAFs preferentially accumulated in upland and lowland ecotypes are equivalent to each other (Figure C inset: barplot). Furthermore, 63% of the DAFs were found in subterranean tissues, while only 13% were unique to the aerial tissues. The remaining 24% were detected in both above- and below-ground tissues (Figure C inset: pie chart). These results reveal that switchgrass subterranean tissues are the major sources of the observed specialized metabolic ecotypic diversity.

Detailed Analysis of Metabolite Diversity Using MS/MS

The observation that terpenoid and polyphenol metabolite classes are highly represented in DAFs led us to use high-resolution LC–MS/MS to characterize these features in more detail. In total, we tentatively identified 72 saponins, 10 diterpenoids, 4 sesquiterpenoids, and 7 flavonoid glycosides (confidence level 2,[28]Figure A and Tables S11–S14) from the 6 switchgrass cultivars. The identified saponins (Table S14) vary in their precursor masses and retention times (RTs) due to differences in both aglycones and sugar moieties. Assignments of multiple fragment ions generated by collision-induced dissociation resulting from the losses of sugar units allowed for annotation of the conjugated monosaccharides as well as the aglycones. For 44 (out of the 72) saponins, the presence of a signal at m/z 415 was diagnostic of the diosgenin aglycone.[29] Fragmentations of 20 saponins yielded an aglycone fragment ion at m/z 431, while further loss of 18 Da (H2O) resulted in m/z 413; this suggests that this core has an additional oxygen atom compared to diosgenin. For three saponins, the presence of a signal at m/z 417 was indicative of a tigogenin aglycone.[30] Finally, larger aglycone fragments at m/z 457 and 473 were observed for six and nine saponins, respectively. Elemental composition suggested that these 2 aglycones both have 29 carbons and differed by 1 oxygen, as predicted for nortriterpenoids. Moreover, the LC–MS spectra for about two-thirds of the 72 saponins displayed an abundant [M + H–H2O]+ ion, characteristic of furostanol saponins, which contain a C-22 labile hydroxyl group due to the hemiketal structure. The furostanol saponins are often glycosylated at their sidechains (also shown by our NMR results below, Figure B–H). Therefore, based on the type of aglycone and sidechain glycosylation, we grouped the 72 identified saponins into 7 classes: D-415-SCG, D-415, D-431-SCG, D-431, D-417-SCG, D-457, and D-473 (“D” indicates a diosgenin-derived aglycone; the numerical number reflects the m/z value of the aglycone fragment ion detected by positive-mode MS; “SCG” indicates “sidechain glycosylation”).

Figure 3

Figure 4

Chemical structures of the saponins identified in switchgrass. (A) Seven purified saponins, representing the seven switchgrass saponin classes, have distinct RTs between 5 and 11 min. (B–H) Structures and numbering of the aglycones for the saponins SS1031, SS1032, SS1244, SS1064, SS1050, SS1254, and SS1089. Classifications of the saponins are shown in the parentheses. R and R′ indicate the position of the sugar moiety at C-3 and C-26 (on the side chain), respectively.

Upland and lowland ecotypes are distinct in specialized metabolite profiles. (A) TICs for upland Dacotah and lowland BoMaster shoot (green) and root (brown) extracts. Areas where the specialized metabolites were identified are labeled by the short names. FG, flavonoid glycoside; ESS, early-eluting steroidal saponin; LSS, late-eluting steroidal saponin; TS, triterpenoid saponin; Di, diterpenoid; and Se, sesquiterpenoid. (B) Heatmap showing relative abundances of the specialized metabolites (vertical axis) across the biological samples (horizontal axis). Classifications of the saponins are shown in the parentheses. Relative metabolite abundances were log10-scaled to a range between −3 (lowest) and 3 (highest). Clustering method/distance: Ward/Euclidean. (C) PCA score plot showing distinct separations of the specialized metabolite profiles among the upland shoot, upland root, lowland shoot, and lowland root. Chemical structures of the saponins identified in switchgrass. (A) Seven purified saponins, representing the seven switchgrass saponin classes, have distinct RTs between 5 and 11 min. (B–H) Structures and numbering of the aglycones for the saponins SS1031, SS1032, SS1244, SS1064, SS1050, SS1254, and SS1089. Classifications of the saponins are shown in the parentheses. R and R′ indicate the position of the sugar moiety at C-3 and C-26 (on the side chain), respectively. PCA (Figure C) using all the identified specialized metabolite features as the loadings (Figures B and S5) revealed that the ecotype plays an important role in the separation of the metabolite profiles even when multiple tissue types are included. Specifically, flavonoid glycosides predominately accumulated in shoots, while diterpenoids and sesquiterpenoids preferentially accumulated in roots (Figures B and S5A); together, these features separated the tissue types on the PCA (Figure C). Notably, all four sesquiterpenoids (Table S12) were exclusively detected in upland ecotype roots, >1000-fold higher than they were found in lowland roots (Table S10). In contrast, a diterpenoid glycoside, Di466 (Table S13), was abundant in lowland roots but nearly undetectable in upland root samples (also a >1000-fold accumulation difference, Table S10). Hence, such features also contribute to separation of the ecotypes on the PCA (Figure C). Accumulation patterns of the saponins (Figures B and S5B) were relatively complex compared with the C15 and C20 terpenes. Five general patterns were observed: (1) D-415 saponins preferentially accumulated in lowland ecotype roots; (2) D-431-SCG and D-431 saponins preferentially accumulated in root tissues, but accumulation was not ecotype-specific; (3) D-457 and D-473 saponins predominately accumulated in shoots with no apparent ecotype specificity; (4) D-417-SCG saponins preferentially were found in lowland shoots; and (5) D-415-SCG saponins showed neither strong tissue- nor ecotype-specific accumulation.

NMR Characterization of Saponins

To unequivocally determine the structures for the switchgrass saponins, we selected seven saponins that represent unique classes (Figure A) for high-performance (HP) LC purification: SS1031, SS1032, SS1244, SS1064, SS1050, SS1254, and SS1089 (“SS” abbreviation for steroidal saponins). Their molecular formulas were proposed based upon positive ionization mode UPLC-high-resolution MS/MS analyses (Figure S6–S12; see Materials and Methods for details). NMR spectra (1H, DEPTQ, HSQC, COSY, HMBC, and TOCSY) were generated for them (Tables S15–S21 and Figures S13–S61). The proton resonances determined from 1H, correlated spectroscopy (COSY), and total correlation spectroscopy (TOCSY) spectra in all samples fell into three distinct chemical shift regions: 0.8–2.6 ppm from aglycone backbone hydrogens; 3.2–4.2 ppm from sugar hydrogens; and sugar ring anomeric and aglycone olefinic hydrogens from 4.2–5.6 ppm. The heteronuclear single quantum coherence (HSQC), heteronuclear multiple bond correlation (HMBC), and distortionless enhancement by polarization transfer with retention of quaternary nuclei (DEPTQ) spectra were used to assign chemical shifts of the saponin core carbons, beginning with those downfield-shifted carbons due to direct connections with oxygens or double bonds. When combined with 1H–1H couplings established from COSY and TOCSY spectra, we could assign each aglycone position. Positions of sugar moiety substitutions were determined from HMBC spectra, based on 1H–13C correlations separated by two or three bonds. All seven saponins were glycosylated at the C-3 hydroxyl group, while three were also glycosylated at the C-26 position (side-chain glycosylation). We also observed an olefinic hydrogen from HSQC and 1H, indicating a double bond between the C-5 and C-6, for all seven saponins; thus, it rules out the possibility that SS1050 contained a tigogenin core. Collectively, these data identified SS1031 (Figure B) as the previously characterized switchgrass saponin, protodioscin, with sidechain glycosylation[18] and SS1032 (Figure C) as a steroidal saponin derived from a diosgenin with the characteristic core spiroketal moiety but no sidechain glycosylation. These two saponins are likely biosynthetically related as the final six-member heterocyclic ring closure forming the spiroketal structure in diosgenin is spontaneous in planta.[19] Similarly, SS1244 (Figure D) and SS1064 (Figure E) are related saponins that are derived from diosgenin: the former with side chain glycosylation and the latter without. Both share an additional tertiary hydroxyl group on C-17. The relatively low-abundance SS1050 (Figure F) has a cholesterol-like C27 aglycone with only four rings, side chain glycosylation, and a C-22 ketone group (characterized by a chemical shift of 215.6 ppm for the carbonyl carbon). A unique feature of the diosgenin-derived saponins SS1254 (Figure G) and SS1089 (Figure H) is that they appear to be acetylated—rather than glycosylated—at C-26. This is consistent with the observed C29 aglycone mass of 456 Da (42 Da larger than that of diosgenin, which is 414 Da) and 472 Da (42 Da larger than the 430 Da of oxydiosgenin), respectively. This structural feature was revealed by HMBC data where the C-28 carbonyl carbon at 171.0 ppm is only correlated with the C-26 methylene and C-29 methyl protons. SS1089 has an additional tertiary hydroxyl group on the C-17 position compared to SS1254.

Differential Saponin Accumulation between Upland and Lowland Revealed by LC– and GC–MS Analysis

Documenting the differential accumulations of the different saponin classes is expected to help with identification of their biosynthetic enzymes and regulatory genes. As an approach to quantifying differential accumulation of the saponins that is orthogonal to PCA loading plots (Figure S5B) and HCA (Figure B), we compared cumulative ion counts for each saponin class between the ecotypes across different tissue x developmental stage sample groups. The D-415-SCG and D-415 were the dominant saponin forms in terms of their accumulative ion counts and were approximately 1 order of magnitude higher than the other classes. The lowland ecotypes accumulated much higher levels of these saponins than did the upland in root tissues, and this was especially notable during the reproductive stage (Figure A,B). The two C-17 hydroxylated saponin classes, D-431-SCG and D-431, had higher accumulations in below-ground tissues, without apparent accumulation differences between the two ecotypes (Figure C,D). In contrast, the two acetylated sidechain saponin forms (D-457 and D-473) were found to have much higher accumulations in above-ground tissue than in below-ground tissues (Figure F,G). The only saponin form not oxidized at the C-16 position (D-417-SCG, Figure E) showed a tendency toward higher accumulations in lowland shoots at later developmental stages. For the total (summed) saponins, lowland roots accumulated more than upland roots at all three developmental stages. In contrast, shoots of the two ecotypes accumulated comparable amounts of total saponins across development stages (Figure H upper panel). The saponins represented about 20 to 35% versus 5 to 10% of the total ion counts in lowland and upland roots, respectively (Figure H lower panel), indicating the contribution of these specialized metabolites in defining distinct phenotypes between ecotypes.

Figure 5

Relative quantification of individual saponin classes and total saponins. (A–G) Accumulative ion counts for each individual saponin classes and (H, lower panel) the total saponins (all classes combined) from the eight tissue type x developmental stage sample classes measured by positive-mode LC–MS. (H, lower panel) Percentages of the saponins in the total ion pools. Normalized abundances (y-axis) are calculated as (ion intensity of the feature/ion intensity of the internal standard). For all panels, the horizontal bars represent median values (n = 8 or 9). Red, lowland ecotype; blue, upland ecotype. Although this LC–MS approach is excellent for documenting the diversity of saponin types, it is not ideal for quantification because of the uneven ionization efficiencies of the early- versus late-eluting analytes (caused by the changing ratio of the water and organic mobile phase during chromatography). To more accurately determine total saponin concentrations in different tissues and cultivars, a GC–MS-based quantification method was developed to quantify the sapogenins after hydrolytic removal of sugars through the comparison with an authentic diosgenin standard (Materials and Methods). As a result, we identified six diosgenin-derived steroidal sapogenin peaks and two peaks annotated as triterpenes (Figure A and Table ). The triterpene peaks were only present in leaf blade samples and might arise from acetylated steroids if the hydrolytic removal of acetates was incomplete.

Figure 6

Table 2

TMS-Derivatized Sapogenin Aglycones Identified in Switchgrass Extracts by CI GC–MS Analysisa

peak	retention index (RI)b	key ion annotations (m/z)c	abundant fragment ion m/zd	formula of the neutral molecule	mass of the neutral molecule (Da)	annotation of the neutral moleculee
D1	3114	397 [M + H]⁺	139, 282	C₂₇H₄₀O₂	396	anhydro-diosgenin
D2	3194	397 [M + H–TMS–OH]⁺, 471 [M–CH₃]⁺, 485 [M – H]⁺, 487 [M + H]⁺	139, 282	C₃₀H₅₀O₃Si	486	diosgenin (TMS-derivatized)
D3	3339	397 [M + H–TMS–OH]⁺, 471 [M–CH₃]⁺, 485 [M – H]⁺, 487 [M + H]⁺	139, 187, 282	C₃₀H₅₀O₃Si	486	diosgenin (TMS-derivatized)
D4	3361	397 [M + H–TMS–OH]⁺, 471 [M–CH₃]⁺, 485 [M – H]⁺, 487 [M + H]⁺	139, 187, 282	C₃₀H₅₀O₃Si	486	diosgenin (TMS-derivatized)
T1	3399	409 [M + H–TMS–OH]⁺, 483 [M–CH₃]⁺, 497 [M – H]⁺	218, 203, 190	C₃₃H₅₈OSi	498	β-amyrin (TMS-derivatized)
T2	3439	409 [M + H–TMS–OH]⁺, 483 [M–CH₃]⁺, 497 [M – H]⁺	218, 203, 190	C₃₃H₅₈OSi	498	β-amyrin (TMS-derivatized)
O1	3462	395 [M + H–TMS–OH–H₂O]⁺, 413[M + H–TMS–H₂O]⁺, 487 [M–CH₃]⁺, 501 [M – H]⁺, 503 [M + H]⁺	139, 187, 298	C₃₀H₅₀O₄Si	502	oxydiosgenin (TMS-derivatized)
O2	3620	395 [M + H–TMS–2H₂O]⁺, 413[M + H–TMS–OH]⁺, 487 [M–CH₃]⁺, 501 [M – H]⁺, 503 [M + H]⁺	139, 187, 298	C₃₀H₅₀O₄Si	502	oxydiosgenin (TMS-derivatized)

The TMS-derivatized molecules are considered as the molecular ions in each case here. TMS, trimethylsilyl group [Si(CH3)3].

The RIs were calculated using a homologous series of n-alkane standards with the same GC–MS method as employed for the samples (Figure S4A).

The m/z information for the key ion annotations was obtained by (CI) GC–MS (Figure S62).

The m/z information for the fragment ions was obtained by electron ionization (EI) GC–MS (Figure S4B).

The annotations were made based on the pseudomolecular ions and fragment ion information from (EI) MS (Figure S4B) by comparing with the diosgenin standard and/or searching against the National Institute of Standards and Technology (NIST) NIST17 GC–MS library (www.chemdata.nist.gov) for the best matching compounds.

Sapogenin aglycone peaks identified in switchgrass extracts are quantified by GC–MS. (A) +(CI) GC–MS TICs of the diosgenin standard (black), Dacotah leaf and root (blue), and Alamo leaf root (red). The identified sapogenin aglycone peaks in the switchgrass and standard samples are indicated and aligned by the dashed lines. Zoomed-in views for the peak O2 in the Dacotah root and Alamo root are indicated by the arrows. (B) Comparison of the total sapogenin concentrations among the five tissue types for each switchgrass cultivar (Kruskal–Wallis test: p = 0.012, 0.026, 0.009, 0.024, 0.017, and 0.011 for Dacotach, Summer, Cave-in-Rock, Alamo, Kanlow, and BoMaster, respectively). LB, leaf blade; LS, leaf sheath, St, stem; Rhi, rhizome; and Rt, root. (C) Comparison of the total sapogenin among the six switchgrass cultivars in the leaf blade (Kruskal–Wallis test: p = 0.766) and root (Kruskal–Wallis test, p = 0.016). Different lower-case letters on top of the boxes designate statistically different means (post-hoc test: Dunn’s test). (D) Ratio of the individual sapogenins in the leaf blade (upper) and root (lower) of upland and lowland ecotypes. Heights of the bars reflect the means of the nine replicates (three cultivars × three replicates) for each ecotype; error bars show the standard error of the mean; ** standards for 0.001 ≤ p ≤ 0.01 and *** standards for p < 0.001 (one-tailed t-test). The TMS-derivatized molecules are considered as the molecular ions in each case here. TMS, trimethylsilyl group [Si(CH3)3]. The RIs were calculated using a homologous series of n-alkane standards with the same GC–MS method as employed for the samples (Figure S4A). The m/z information for the key ion annotations was obtained by (CI) GC–MS (Figure S62). The m/z information for the fragment ions was obtained by electron ionization (EI) GC–MS (Figure S4B). The annotations were made based on the pseudomolecular ions and fragment ion information from (EI) MS (Figure S4B) by comparing with the diosgenin standard and/or searching against the National Institute of Standards and Technology (NIST) NIST17 GC–MS library (www.chemdata.nist.gov) for the best matching compounds. The total saponin (all eight peaks summed) concentrations were first compared across tissue types for each switchgrass cultivar (Figure B and Table S22). For every cultivar, saponins were detectable in all five analyzed tissue types and were higher in the leaf blade, rhizome, and root than they were in the leaf sheath and shaved stem. The two upland cultivars Dacotah and Summer have the highest saponin concentration in leaf blades, whereas all three lowland cultivars accumulate the highest saponin level in roots. The third upland cultivar, Cave-in-Rock, however, accumulates the highest total saponin in rhizomes. We then compared the total saponins in leaf blades and roots across the six cultivars. The total saponin concentrations in lowland roots are uniformly higher than those in roots of the three upland cultivars, with Kanlow and BoMaster showing statistical significance (p < 0.05, Kruskal–Wallis test). In comparison, leaf blade total saponins revealed no ecotype-related statistical difference (Figure C). Moreover, different sapogenin compositions were also found between upland and lowland roots but not leaf blades (Figure D). This was due to the differentiated accumulations of two diosgenin isomers, D2 and D3, and one oxydiosgenin isomer, O1, identified only in roots. Taken together, quantitative analysis of sugar-free sapogenins supports the results of LC–MS analysis showing strong genetic difference in root saponin accumulation. Considering the high abundances of the saponins, water solubilities of these molecules, and the documented bioactivities to the microbes,[31] the differential accumulation in roots might play a role in shaping the ecotype-specific rhizosphere microbiomes in switchgrass.

Discussion

Plants produce a plethora of structurally diverse specialized metabolites that serve roles ranging from attracting beneficial organisms to combating deleterious biotic and abiotic agents. However, domestication and improvement of crops typically lead to reduced amounts and types of these advantageous metabolites. While restoring these beneficial traits to existing crops is generally not feasible, development of new food, fuel, and fiber crops can be done in such a way to maintain or enhance existing metabolic variation, potentially limiting the need for toxic pesticides. Switchgrass is a compelling example of a low-fertilizer and low-pesticide cellulosic bioenergy crop in the USA.[11] Switchgrass populations (ecotypes) are found from the northern Midwest to Texas and along the East Coast; unlike the relatively narrow genetic and phenotypic variation of traditional crops, there is abundant genetic variation across these populations. Striking differences in numerous plant traits were documented between populations of the upland and lowland ecotypes,[32] including biomass production,[33] rust pathogen (Uromyces graminicola) resistance,[34] and tolerance to low-nitrogen, drought, and freezing conditions.[14,35,36] These documented phenotypic and physiological differences encouraged us to compare the upland and lowland switchgrass metabolomes and catalog the ecotype-specific specialized metabolites. In addition to being of interest for pathway discovery and as tools to understand the genetic architecture of switchgrass, such information should be valuable for breeding switchgrass varieties that are highly productive with no or low pesticide inputs. Analysis of six switchgrass accessions representing both upland and lowland ecotypes of eight specific tissue type x developmental stage sample classes identified a remarkable amount of metabolite variation. In fact, 1416 ecotype DAFs (Tables and S2–S9) account for half of the metabolite features detected in this study. Many of these differences were quite large: there were 157 DAFs showing >1000-fold accumulation difference between the two ecotypes in at least 1 of the 8 sample classes (Table S10). In contrast, published metabolomics analyses for maize showed that metabolite profiles of the six genetically defined genome-wide association study populations failed to separate in PCA, even when the metabolite data were independently analyzed within the same tissue type.[37] Based on RMD filtering,[27] 46 and 13% of the switchgrass ecotype-specific DAFs are proposed to be terpenoid glycosides and polyphenol-derived metabolites, respectively. Relatively few switchgrass terpenoid- and polyphenol-specialized metabolites were identified in the past, including leaf steroidal saponins,[18,38] diterpenoid-derived antimicrobial phytoalexins,[39] and quercetin-based flavonoids.[23] Our identification of more than 1000 unannotated DAFs suggests the possibility of many more switchgrass metabolites to be characterized and underlines the complexity of analysis of complex mixtures of structurally diverse small molecules. By generating MS/MS spectra beginning with the most abundant metabolites, we tentatively identified close to 100 specialized metabolites. In contrast to the flavonoid glycosides (Table S11), the diterpenoids (Table S12), sesquiterpenoids (Table S13), and most of the steroidal saponins (Table S14) did not match known compounds in MS/MS databases (Materials and Methods): this led us to subject representative saponins to NMR analysis. Saponins stand out in this study as highly abundant and differentially accumulating metabolites, exhibiting diversity in their cores and glycosylation positions and types. This structural diversity indicates tissue- and/or genotype-specific activities of as yet uncharacterized CYP450s, UDP-glycosyltransferases, and other tailoring enzymes (e.g., acyltransferases). This conclusion is supported by the seven distinct sapogenin aglycone structures elucidated based upon LC–MS and NMR data and their accumulation patterns in switchgrass. These include diosgenin cores with the characteristic 5,6-spiroketal moiety (Figure C) and a diosgenin core that presumably is derived from a metabolic intermediate,[19] in which the sidechain is stabilized by glycosylation and thus prevented from the spontaneous cyclization to form the final pyranosidic ring (Figure B); this metabolite was previously identified in switchgrass aerial tissues.[18] Besides glycosylation, the sidechain also can be stabilized by acetylation resulting in the C29 aglycones (Figure G,H). Predominant accumulation of the sidechain-acetylated saponins in shoots implies involvement of one or more tissue-specific acyltransferase activities in switchgrass saponin biosynthesis. Likewise, diosgenin C-17 hydroxylation (Figure D,E,H) was found for the saponins preferentially accumulated in root tissues, suggesting a root-specific CYP450 activity. We also characterized a saponin core that is not oxidized at the C-16 position (Figure F), which might derive from a precursor or side product of the diosgenin biosynthetic pathway.[19] Saponins with this core were seen across the cultivars analyzed but were more abundant in lowland shoots especially at a later developmental stage. This indicates that the switchgrass CYP450 responsible for the cholesterol C-16 oxidation is possibly regulated in an ecotype x tissue x development manner in switchgrass. The switchgrass saponins we characterized were either singly glycosylated at C-3 (Figure C,E,G,H) or glycosylated at both C-3 and on the C-26 sidechain (Figure B,D,F). Glycosylation diversity also comes from variation in the conjugating saccharide types, with one to six monosaccharides observed (Table S14). MS neutral mass losses corresponding to anhydrous glucose/galactose, rhamnose, and xylose were observed for these saponins, and some monosaccharides were also acetylated (Table S14). Published studies indicated that the conjugated sugar moieties impact saponin bioactivities; for example, the oat avenacoside steroidal saponins are activated by deglycosylation upon leaf damage or pathogen attack,[40,41] while antimicrobial activities of other saponins seem to rely on glycosylation.[31] The purified switchgrass saponins with distinct glycosylation patterns provide good opportunities to perform structure–function analyses using in vitro microbial bioassays. The total saponin levels in roots of three lowland switchgrass cultivars, Alamo, Kanlow, and BoMaster, are 3.5, 5.1, and 4.7 μg/mg dw, respectively, which are higher than those of the three upland cultivars, Dacotah, Summer, and Cave-in-Rock, at 1.2, 1.5, and 1.6 μg/mg dw, respectively (Figure C lower panel and Table S22). The root saponin contents in the lowland cultivars are close to those observed in legume roots: for example, the Medicago truncatula(42) and two Medicago sativa cultivars, Radius[43] and Kleszczewska,[44] contain 5.9, 5.0, and 9.3 μg/mg dw saponins in their roots, respectively. Information about accession and ecotypic differences in the saponin content could provide tools for improvement of switchgrass biomass. For example, avenacins have well documented protective roles protecting oat roots from the fungal “take-all” disease.[6−8] Saponin root differential accumulation among switchgrass ecotypes suggests that they might be attractive targets for breeding cultivars with an increased ability to improve yield by modulating the microbiome structure and function.[45−47] In contrast, breeding for low leaf saponins might produce varieties with biomass that is efficiently converted into fuel by avoiding accumulation of toxins that interfere with growth of processing microbes. Taken together, our results provide opportunities to identify targets for producing switchgrass varieties with improved plant/microbiome traits and increased biomass yield or biofuel conversion at lower economic and environmental costs.

Materials and Methods

Plant Material

The six switchgrass cultivars used in this study were the upland ecotypes Dacotah, Summer, and Cave-in-Rock and lowland ecotypes Alamo, Kanlow, and BoMaster. The seeds were ordered from Native Connections (http://nativeconnections.net, Three Rivers, MI). The plants were grown under controlled growth conditions: temperature was set at 27 °C with 16 h light (500 μE m–2s–1) per day and relative humidity set to 53%. Seeds were sown directly in a 1:1 mixture of sand and vermiculite, watered twice a week with deionized water, and fertilized once every 2 weeks using half-strength Hoagland’s solution.[48] For untargeted LC–MS analysis, plant tissues were harvested at 1, 2, and 3 months after imbibition, corresponding to the vegetative, transition, and early reproductive developmental stages, respectively.[26] Roots, rhizomes, and shoots (Figure S1A) were collected separately for all the switchgrass cultivars. For example, one sample represented a specific cultivar (genotype) x developmental stage x tissue type combination (Figure S1B). There were three biological replicates from three independent plants with two exceptions: two samples each from two independent plants for vegetative-phase Cave-in-Rock samples and early reproductive-phase Alamo samples (due to sample loss). For the GC–MS quantification of sapogenins, samples were only collected from the 3 month old (early reproductive phase) plants. All samples were immediately frozen in liquid nitrogen and stored at −80 °C until extraction.

Metabolite Extraction

All chemicals were obtained from Sigma-Aldrich (St. Louis, MO) unless otherwise specified. The samples were frozen in liquid nitrogen and powdered using 15mL polycarbonate grind vial sets (OPS Diagnostics, Lebanon, NJ) on a MiniG high-throughput homogenizer (SPEX SamplePrep, Metuchen, NJ). 500 mg of each sample was extracted at 4 °C overnight (14–16 h) in 5 mL of 80% methanol containing the 1 μM telmisartan internal standard. Extracts were centrifuged at 4000 g for 20 min at room temperature to remove solids. The supernatant from each sample was transferred to an HPLC vial and stored at −80 °C prior to LC–MS analysis. For GC–MS, 50 mg of the lyophilized sample was extracted in 1 mL of 80% methanol following the workflow described for LC–MS sample preparation above, as described by Tzin et al.[49]

UPLC–ESI-QToF-MS Analysis

Reversed-phase UPLC–positive mode electrospray ionization-quadrupole time-of-flight MS (UPLC–(+)ESI-QToF-MS) analyses were performed with a Waters ACQUITY UPLC system coupled to a Waters Xevo G2-XS QToF mass spectrometer (Waters, Milford, MA). The chromatographic separations were performed using a reversed-phase, UPLC BEH C18, 2.1 mm × 150 mm, 1.7 μm column (Waters) with a flow rate of 0.4 mL/min. The mobile phase consisted of solvent A (10 mM ammonium formate/water) and solvent B (100% acetonitrile). The column oven was maintained at 40 °C. Separations were achieved utilizing a 20 min method, injecting 10 μL of the extract and using the following method (%A/%B): 0–1.0 min hold (99/1), linear gradient to 15 min (1/99), hold (1/99) until 18 min, returning at 18.01 min (99/1), and holding until 20 min. The Xevo G2-XS QToF mass spectrometer was operated using the following static instrument parameters: desolvation temperature of 350 °C; desolvation gas flow rate at 600 L/h; capillary voltage of 3.0 kV; and cone voltage of 30 V. Mass spectra were acquired in the continuum mode over m/z 50 to 1500 using data-independent acquisition (DIA, MSE) or data-dependent MS/MS acquisition (DDA), with collision potential scanned between 20 and 80 V for the higher-energy function for DIA (and 20–60 V for DDA). The DDA mode automatically selected the three most abundant molecular ions to pass through the mass filter for fragmentation analysis at each scan. The MS system was calibrated using sodium formate, and leucine enkephalin was used as the lock mass compound, but automated mass correction was not applied during DIA data acquisition. QC and reference samples were analyzed every 20 injections to evaluate the stability of the LC–MS system.

Data Processing and Metabolite Mining for the Untargeted Metabolomic Analysis

Acquired raw MS data were processed using the Progenesis QI software package (v.3.0, Waters, Milford, MA) using RT alignment, lock mass correction, peak detection, adduct grouping, and deconvolution. The identified compounds were defined by the RT and m/z information, and we also refer to these as features. The parameters used with Progenesis processing were as follows: sensitivity for peak picking, default; minimum chromatographic peak width, 0.15 min; and RT range, 0.3–15.5 min. Intensities (ion abundances) of all the detected features were normalized to the internal standard, telmisartan, before downstream statistical analyses. Online databases—including KEGG,[50] MassBank,[51] PubChem,[52] and MetaboLights[53]—were used to provide annotations to the features based on a 10 ppm precursor tolerance, 95% isotope similarity, and 10 ppm theoretical fragmentation pattern matching with fragment tolerance. The complementary method RMD filtering[27] was used to assign chemical classes for the features. Briefly, an RMD value of each feature was calculated in ppm as (mass defect/measured monoisotopic mass) × 106. This value reflects the fractional hydrogen content of a feature and provides an estimate of the relative reduced states of carbons in the metabolite precursor of that feature. For example, in this study, we defined terpenoid glycosides (RMD of 350–550 ppm) or phenolics (RMD of 200–350 ppm) using this method. Features with RMD >1200 ppm are likely contaminants (e.g., inorganic salts) in the MS system. The DDA was carried out for a set of pooled samples to generate positive-mode MS/MS spectra for the abundant ions. Specialized metabolite discovery was performed by mining the DDA data and beginning with the most abundant metabolites. Characteristic precursor/fragment ions and RMD were used to assign metabolites to a particular chemical class (e.g., flavonoid glycoside, diterpenoid, sesquiterpenoid, and saponin).

Saponin Purification

The switchgrass (Kanlow) plants were grown in a growth chamber using the conditions described in the Plant Material section. About 150–200 g of fresh root or shoot tissues from fully matured plants (3 months post-germination) was harvested. The tissues were ground into powders with liquid nitrogen and placed in a 2 L beaker. 1.5–2 L of 80% methanol (in water) was added, and the mixture was incubated for 48 h at 4 °C. The mixture was centrifuged at 4000g for 15 min to remove insoluble debris. The supernatant volume was reduced under vacuum using a rotary evaporator, followed by evaporation to dryness using a SpeedVac vacuum concentrator. The residue was redissolved in 100 mL of water. The liquid–liquid phase partitioning was carried out against first hexane and then ethyl acetate, with a 1:1 ratio, to remove the non-polar interfering metabolites. The resultant water phase was then loaded onto a 35 cc C18 SPE cartridge (Waters). The cartridge was washed three times each using 0, 10, 20, and 50% methanol (in water) to remove the polar interfering metabolites. The cartridge was eluted three times each using 70, 80, and 90% methanol (in water) to obtain the saponin-enriched fractions (eluates). The solvent was evaporated to dryness under vacuum using SpeedVac, and the residue was redissolved in 8 mL of 80% methanol (in water). The insoluble residue was removed by centrifugation at 4000g for 5 min at 25 °C. Supernatants were transferred to autosampler vials. Purification was carried out using a Waters 2795 pump/autosampler connected with an LKB Superrac 2211 fraction collector and a Waters Symmetry C18 HPLC column (100 Å, 5 μm, 4.6 mm × 150 mm). The mobile phase consisted of 0.15% formic acid in water, pH 2.8 (solvent A) and acetonitrile (solvent B). The linear gradient elution used to purify saponins SS1244, SS1031, SS1064, and SS1032 from root tissues was 1% B at 0 min, 30% B at 1.01 min and linearly increased to 40% B at 7 min, 50% B at 7.01 min and linearly increased to 70% B at 15 min, and 99% B at 15.01 min and held at 99% B between 15.01 and 18 min. A slightly modified linear gradient elution was used to purify saponins SS1050, SS1098, and SS1254 from shoot tissues, 1% B at 0 min, 30% B at 1.01 min and linear-increased to 40% B at 8 min, 50% B at 8.01 min and linear-increased to 60% B at 15 min, and 99% B at 15.01 min and held at 99% B between 15.01 and 18 min. The solvent flow rate was 1.5 mL/min, and the column temperature was 40 °C. The eluate was collected every 10 s as one fraction for each injection, using an injection volume of 100 μL. The HPLC fractions containing the seven targeted saponins were estimated to be >75% pure based on LC–MS analysis results. Fractions of adequate purity for the same saponins were pooled.

NMR Spectroscopy

NMR spectra of purified saponin samples were acquired using a Bruker Ascend 600 MHz spectrometer (Bruker Biospin, Germany) operating at 600.13 MHz for proton and equipped with an inverse 1.7 mm TCI micro-cryoprobe and a SampleJet auto-sampler unit. For 1H NMR spectra, solvent suppression with a shaped pulse program (Wetdc) was used with a scan number of 16 at temperature 298 K with pulse width 10.75 μs and power 0.2 W. Acquisition time for each scan was 0.681 s with a delay time of 3 s for a spectral width of 20 ppm. For locking the magnetic field, CD3OD was used as the solvent, and 1H spectra were calibrated using the residual solvent peaks. Baseline and phase correction was performed manually using Bruker TopSpin 3.5.6 software. Subsequently, 2D NMR spectra were collected using the default Bruker pulse program cosygpmfppqf for COSY (ns = 16, sw = 13 for F1 and F2), hsqcedetgpsp.3 for HSQC (ns = 64, sw = 13 for F2 and 220 for F1), hmbcgpndqf for HMBC (ns = 128, sw = 13 for F1 and F2), and mlevphpr.2 for TOCSY (ns = 64, sw = 13 for F1 and F2) with pulse width 10.75 μs and power 0.2 W for 1H and pulse width 12 μs and power 68 W for 13 C. DEPT-Q spectra were collected using a Bruker Avance III 800 MHz spectrometer equipped with a 5 mm TCI cryoprobe. Data were obtained using the deptsp135 pulse program with 3072 scans for an sw 222 ppm and 0.734 s acquisition time for each scan with pulse width 14 μs and power 107.15 W for 13 C. Finally, the data were visualized using both Topspin 3.5.6 and MestReNova software for peak assignments.

Analysis of Switchgrass Sapogenins by Acid Hydrolysis, Derivatization, and GC–MS

To analyze sapogenins, acid hydrolysis was carried out according to a previously published protocol.[54] In brief, 300 μL of the switchgrass extract, 200 μL of distilled water, and 100 μL of 12 M hydrochloric acid (MilliporeSigma, Burlington, MA) were mixed in a polypropylene microcentrifuge tube and incubated at 85 °C for 2 h. The samples were cooled and evaporated to dryness under vacuum with the temperature ≤40 °C. The resultant pellet was dissolved in 500 μL of distilled water and extracted with 500 μL of ethyl acetate for phase partition. After this, 300 μL of the ethyl acetate layer was transferred to a new microcentrifuge tube and evaporated to dryness under vacuum at room temperature. The dry residue was dissolved in 100 μL of N-methyl-N-(trimethylsilyl)trifluoroacetamide, derivatized overnight at 60 °C and analyzed using a 30 m VF5 column (Agilent Technologies, Santa Clara, CA; 0.25 mm ID, 0.25 μm film thickness) coupled to an Agilent 5975 single quadrupole mass spectrometer (Agilent Technologies, Santa Clara, CA) operated using 70 eV electron ionization (EI). Then, the same set of samples was analyzed on an Agilent 7010B triple quadrupole mass spectrometer using the same column for chemical ionization (CI). The MS scanning range was m/z 80–800. Splitless sample injection was used, with helium as carrier gas at a constant flow of 1 mL/min and the inlet and transfer line held at 280 °C. The GC temperature program was as follows: held at 50 °C for 1 min; ramped at 30 °C/min to 200 °C; and ramped at 10 °C/min to 320 °C and held for 10 min. Absolute and relative quantification of the sapogenins was performed using commercially available diosgenin (∼95%, Sigma-Aldrich) as an external standard. Serially diluted standards (6–192 μg/mL dissolved in 80% ethanol) were pre-treated in the same way as the other samples; after hydrochloric acid hydrolysis, four target peaks were identified that are derived from the commercial diosgenin. They were termed diosgenin standard (DS) 1–4. DS4, eluting at 21.3 min, could be detected by EI GC–MS at high standard concentrations (Figure S3A). All target peaks were combined when plotted against the standard’s concentrations to generate a six-point response curve. Duplicate technical replicate analyses were done for each standard sample used to generate the standard response curve, which was linear (r2 > 0.97, Figure S3B) and was used to calculate relative concentrations for sapogenins detected in switchgrass extracts. Quantifications were based on the peak areas calculated from total ion chromatograms (TICs) for standards, diosgenin-type sapogenins, and the unknown sapogenins with chemical structures similar to that of diosgenin. Six individual plants were harvested for each switchgrass genome type and pooled into three groups of two individual plants. Pooling permitted collection of enough tissue to perform separate analysis of the leaf blade, leaf sheath, stem, rhizome, and root to overcome the issue of the limited amount of plant tissue.

Statistical Analysis

To visualize the metabolomic variation in tissue types, developmental stages, and ecotypes of switchgrass, HCA and PCA were performed using the R (v. 3.5.1) and MetaboAnalyst 5.0 online tool platform.[55] Signals were normalized to internal standard area and tissue mass, log-transformed, and scaled using Pareto scaling prior to these analyses. To assess the relationship among samples and among features, hierarchal clustering with Euclidean distance as the similarity measure and Ward.D2 as the clustering algorithm was used. The relationship results were visualized in the form of dendrograms on the heatmap. Significance analyses were carried out using the Progenesis QI software (Waters) to identify the DAFs between the upland and lowland ecotypes. The cutoff threshold of the significance analyses was FDR-adjusted Student’s t-test p ≤ 0.05 and fold change ≥ 2. Results of the analyses were visualized using volcano plots. To examine statistical differences in the sapogenin concentrations among samples, the Kruskal–Wallis test and post-hoc Dunn’s tests were performed in R. p ≤ 0.05 was considered statistically significant.

40 in total

1. Saponins in alfalfa (Medicago sativa L.) root and their structural elucidation.

Authors: Z Bialy; M Jurzysta; W Oleszek; S Piacente; C Pizza
Journal: J Agric Food Chem Date: 1999-08 Impact factor: 5.279

2. Metabolome-Scale Genome-Wide Association Studies Reveal Chemical Diversity and Genetic Control of Maize Specialized Metabolites.

Authors: Shaoqun Zhou; Karl A Kremling; Nonoy Bandillo; Annett Richter; Ying K Zhang; Kevin R Ahern; Alexander B Artyukhin; Joshua X Hui; Gordon C Younkin; Frank C Schroeder; Edward S Buckler; Georg Jander
Journal: Plant Cell Date: 2019-03-28 Impact factor: 11.277

3. Relative mass defect filtering of mass spectra: a path to discovery of plant specialized metabolites.

Authors: E A Prabodha Ekanayaka; Mary Dawn Celiz; A Daniel Jones
Journal: Plant Physiol Date: 2015-02-06 Impact factor: 8.340

4. Functional Diversity of Diterpene Synthases in the Biofuel Crop Switchgrass.

Authors: Kyle A Pelot; Ruibing Chen; David M Hagelthorn; Cari A Young; J Bennett Addison; Andrew Muchlinski; Dorothea Tholl; Philipp Zerbe
Journal: Plant Physiol Date: 2018-07-15 Impact factor: 8.340

5. [Interaction between structure of plant steroids and their effect on phytonematodes].

Authors: Zh V Udalova; S V Zinov'eva; I S Vasil'eva; V A Paseshnichenko
Journal: Prikl Biokhim Mikrobiol Date: 2004 Jan-Feb

6. Isolation, characterization, and quantification of steroidal saponins in switchgrass (Panicum virgatum L.).

Authors: Stephen T Lee; Robert B Mitchell; Zhirui Wang; Christian Heiss; Dale R Gardner; Parastoo Azadi
Journal: J Agric Food Chem Date: 2009-03-25 Impact factor: 5.279

7. The stromacentre inAvena plastids: an aggregation ofβ-glucosidase responsible for the activation of oat-leaf saponins.

Authors: A Nisius
Journal: Planta Date: 1988-12 Impact factor: 4.116

8. Proposed minimum reporting standards for chemical analysis Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI).

Authors: Lloyd W Sumner; Alexander Amberg; Dave Barrett; Michael H Beale; Richard Beger; Clare A Daykin; Teresa W-M Fan; Oliver Fiehn; Royston Goodacre; Julian L Griffin; Thomas Hankemeier; Nigel Hardy; James Harnly; Richard Higashi; Joachim Kopka; Andrew N Lane; John C Lindon; Philip Marriott; Andrew W Nicholls; Michael D Reily; John J Thaden; Mark R Viant
Journal: Metabolomics Date: 2007-09 Impact factor: 4.290