Literature DB >> 35082155

A comparative genomics examination of desiccation tolerance and sensitivity in two sister grass species.

Ricardo A Chávez Montes1,2, Anna Haber3,4, Jeremy Pardo3,4,5, Robyn F Powell6, Upendra K Divisetty7, Anderson T Silva8, Tania Hernández-Hernández1,9, Vanildo Silveira10, Haibao Tang11, Eric Lyons11, Luis Rafael Herrera Estrella12,2, Robert VanBuren13,4, Melvin J Oliver14,15.   

Abstract

Desiccation tolerance is an ancient and complex trait that spans all major lineages of life on earth. Although important in the evolution of land plants, the mechanisms that underlay this complex trait are poorly understood, especially for vegetative desiccation tolerance (VDT). The lack of suitable closely related plant models that offer a direct contrast between desiccation tolerance and sensitivity has hampered progress. We have assembled high-quality genomes for two closely related grasses, the desiccation-tolerant Sporobolus stapfianus and the desiccation-sensitive Sporobolus pyramidalis Both species are complex polyploids; S. stapfianus is primarily tetraploid, and S. pyramidalis is primarily hexaploid. S. pyramidalis undergoes a major transcriptome remodeling event during initial exposure to dehydration, while S. stapfianus has a muted early response, with peak remodeling during the transition between 1.5 and 1.0 grams of water (gH2O) g-1 dry weight (dw). Functionally, the dehydration transcriptome of S. stapfianus is unrelated to that for S. pyramidalis A comparative analysis of the transcriptomes of the hydrated controls for each species indicated that S. stapfianus is transcriptionally primed for desiccation. Cross-species comparative analyses indicated that VDT likely evolved from reprogramming of desiccation tolerance mechanisms that evolved in seeds and that the tolerance mechanism of S. stapfianus represents a recent evolution for VDT within the Chloridoideae. Orthogroup analyses of the significantly differentially abundant transcripts reconfirmed our present understanding of the response to dehydration, including the lack of an induction of senescence in resurrection angiosperms. The data also suggest that failure to maintain protein structure during dehydration is likely critical in rendering a plant desiccation sensitive.
Copyright © 2022 the Author(s). Published by PNAS.

Entities:  

Keywords:  C4 grasses; comparative genomics; desiccation tolerance; sister group contrast; transcriptome remodeling

Mesh:

Substances:

Year:  2022        PMID: 35082155      PMCID: PMC8812550          DOI: 10.1073/pnas.2118886119

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


Desiccation tolerance (DT) is a fundamental trait that is widespread and developed early in the evolution of the land plants (1, 2), and it is believed to have been critical in the colonization of the land by green algae (3). In tracheophytes, DT is generally limited to reproductive propagules, such as seeds and spores, while vegetative desiccation tolerance (VDT) occurs in only 0.086% of known vascular plant species (4). Our understanding of VDT (and its relationship to seed DT) has broadened with the recent expansion of whole-genome sequencing of resurrection plants, tracheophytes that can survive the desiccation of their vegetative tissues. Since the release of the Boea hygrometrica genome sequence (5), the genomes of four other resurrection angiosperms [Xerophyta schlecteri (6), Oropetium thomaeum (7, 8), Lindernia brevidens (9), and Eragrostis nindensis (10)], two lycophytes [Selaginella tamariscina (11) and Selaginella lepidophylla (12)], and the bryophyte Syntrichia caninervis (13) have been published. Apart from the obvious benefits of obtaining genomic resources for individual resurrection species, the establishment of a collection of resurrection plant genomes offered the possibility of the reconstruction of an ancestral genome of a desiccation-tolerant progenitor that would reveal a genomic signature (blueprint) that defines a common mechanism for DT. However, a genomic blueprint for DT has not emerged (4), which may be related to the small number of genomes available and limited phylogenetic sampling, that all tracheophytes possess desiccation-tolerant propagules (seeds or spores), which would obfuscate the comparative analyses, or that the origin of DT lies deep in the land plant phylogeny and is thus cryptic in the recent plant lineages. It may also be a combination of these possibilities or that there is no genomic blueprint for this fundamental trait. Although a genomic blueprint for DT has not been revealed, comparative studies have demonstrated that certain gene families, such as those for early light-inducible proteins (ELIPs) and late embryogenesis-abundant proteins, have expanded in species that exhibit VDT (6, 14, 15). A corollary to the ancestral reconstruction approach to understanding the evolution of VDT and the genomic aspect of its phenotypic expression is the comparison of the genomes of closely related species that contrast the two extremes: sensitivity and tolerance. Such closely related contrasting species pairings are rare in resurrection plants, but this approach has been applied, albeit with species pairs that are not as close as would be ideal. The genomes and dehydration–rehydration transcriptomes of two resurrection eudicots within the Linderniaceae family (16), the desiccation-tolerant L. brevidens and the desiccation-sensitive (DS) Lindernia subracemosa, were sequenced and compared (9). The comparison revealed that at least in the Lindernia lineage, VDT evolved via a combination of gene duplications in gene families that are functionally associated with the desiccation response and a network-level rewiring of gene expression in vegetative tissue commonly associated with seed desiccation. More recently, a comparative analysis of two contrasting grass genomes along with their respective desiccation-related transcriptomes, the desiccation-tolerant E. nindensis and the related DS cereal Eragrostis tef, reinforced the potential role of gene duplications in the evolution of DT (10). Although there is still a significant phylogenetic distance between these two Eragrostis species (17), the comparative analysis and its extension to include other C4 grasses, including the desiccation-tolerant O. thomaeum, revealed chromatin restructuring and methylation patterns associated with down-regulated genes and specific seed-related orthologs whose expression is associated with VDT. The comparative transcriptome analyses indicated that genes having important roles in seed development and DT are broadly expressed under dehydration in both sensitive and tolerant species, with just a few genes uniquely expressed in the tolerant plants. In this study, we have chosen two phylogenetically closely related C4 grasses, the homoiochlorophyllous desiccation-tolerant Sporobolus stapfianus and the DS Sporobolus pyramidalis, to develop detailed comparative genomic and transcriptomic analyses to further explore genomic inferences into the evolution of VDT. S. stapfianus and S. pyramidalis are members of the same clade, clade A, in the Sporobolus family of the Sporobolinae subtribe of the Chloridoid grasses (18). S. stapfianus has been the subject of many mechanistic studies of its DT phenotype (19, 20) and along with S. pyramidalis, the subject of a detailed comparative leaf metabolomics study that highlighted differences in the metabolic responses of the two species to dehydration (21). We constructed Hi-C–derived assemblies of the sequenced genomes for both species and conducted transcript profiling analyses for parallel reductions in water contents for both species as well as a full desiccation drying series for S. stapfianus. We performed a detailed comparative genomic analysis for the two species and extended the analysis to include other grass species, both desiccation tolerant and DS. Our results offer insights into the mechanism and evolution of VDT in the Chloridoid grasses.

Results

Generating Genomic Resources for Sporobolus Grasses.

One-step flow cytometric assays generated size estimates for each of the Sporobolus genomes. The haploid genome of S. stapfianus had an average of 1,385 pg of DNA per nucleus, which is approximately equal to a complete genome sequence of 1.354 Gbp, and the haploid genome of S. pyramidalis had an average of 1,867 pg of DNA per nucleus, which is ∼1.826 Gbp (Table 1). Draft genome assemblies were generated for each grass using Illumina whole-genome shotgun sequencing combined with Chicago and Hi-C proximity ligation (). The final assemblies consisted of 11,574 scaffolds with an N50 of 19.4 Mb for S. stapfianus and 2,518 scaffolds with an N50 of 21.6 Mb for S. pyramidalis, with the longest scaffolds for both species greater than 60 Mbp. Despite their high contiguity, the assembled genomes are smaller than the estimated genome size, at 1.080 and 1.055 Gbp for S. stapfianus and S. pyramidalis, respectively. These differences between the estimated and assembled genome sizes are likely caused by collapsed homologous regions in these complex polyploid species as described in detail below. Both genomes have similar levels of repetitive elements, 39.7 and 41.3% for S. stapfianus and S. pyramidalis, respectively (Table 2), with almost identical distributions of known repeat families (). Gypsy and Copia retrotransposons are the most predominant families of the known repeats at 36 and 10 to 12%, respectively, for the two genomes.
Table 1.

Estimation of the genome size (1C value) using flow cytometry

SpeciesStandardAverage: sampleAverage: standardGS (2C), pgGS (1C), pgSD sample
S. stapfianus P. crispum 305.2367.85 3.734 1.867 0.011533
S. pyramidalis P. crispum 206.57335.67 2.769 1.385 0.02623

Estimated genome size (GS), in bold, for the diploid (2C) and haploid (1C) states.

Table 2.

S. pyramidalis and S. stapfianus genome assemblies

S. stapfianus S. pyramidalis
Assembly size, bp1,079,709,4291,055,099,634
No. of contigs11,5742,518
Longest contig, bp74,381,80260,313,943
N5019,421,55121,593,735
N906,548,8627,502,351
Repetitive elements, bp (%)428,320,271 (39.7)435,795,325 (41.3)
Genome GC content, %44.043.1
No. of genes52,80851,207
No. of transcripts61,02162,960
No. of CDS61,02162,960
Median mRNA length, bases1,2391,694
Median CDS length, bases1,0711,221
CDS GC content, %52.651.9
Median exons per transcript45
Median exon length, bases148150
Median intron length, bases157154
No. of transcription factors2,7222,897
No. of genes with InterProScan entry (%)52,601 (99.6)51,023 (99.6)
No. of genes with GO biological process entry (%)34,874 (66.0)34,039 (66.5)

Genome assembly statistics for both S. stapfianus and S. pyramidalis. N50 and N90 represent the length of the shortest contig for which longer and equal length contigs cover at least 50 % and 90% of the assembly respectively. GC represents the guanine-cytosine contents and CDS represents the number of protein coding sequences for each genome.

Estimation of the genome size (1C value) using flow cytometry Estimated genome size (GS), in bold, for the diploid (2C) and haploid (1C) states. S. pyramidalis and S. stapfianus genome assemblies Genome assembly statistics for both S. stapfianus and S. pyramidalis. N50 and N90 represent the length of the shortest contig for which longer and equal length contigs cover at least 50 % and 90% of the assembly respectively. GC represents the guanine-cytosine contents and CDS represents the number of protein coding sequences for each genome. The Sporobolus genomes were annotated using MAKER with a combination of RNASeq and PacBio Iso-Seq full-length transcripts as expressed sequence tag (EST) evidence and protein homology from other high-quality plant genomes. After filtering, the final annotations contained 52,208 and 51,207 gene models for S. stapfianus and S. pyramidalis, respectively (Table 2). Annotation completeness was assessed using Benchmarking Universal Single-Copy Orthologs (BUSCO) with the poales_odb10.2019–11-20 database of 4,896 conserved genes. The genome annotations recovered 93.5 and 92.4% of complete BUSCOs for S. stapfianus and S. pyramidalis, respectively, indicating that both genomes were well annotated and contained the vast majority of the coding portion of these two genomes (Table 3). Gene models were functionally annotated using a simplified maize–GAMER pipeline; 96% of genes were annotated with InterProScan domain/family information, and 66% were annotated with Gene Ontology (GO) descriptions for both genomes.
Table 3.

Genome assemblies BUSCO v4 statistics vs. the grass (poales_odb10) dataset

S. pyramidalis S. stapfianus
Complete BUSCOs4,575 (93.5)4,521 (92.4)
Complete and single-copy BUSCOs2,162 (44.2)3,421 (69.9)
Complete and duplicated BUSCOs2,413 (49.3)1,100 (22.5)
Fragmented BUSCOs65 (1.3)76 (1.6)
Missing BUSCOs256 (5.2)299 (6.0)

Data are presented as the no. of BUSCOs (%).

Genome assemblies BUSCO v4 statistics vs. the grass (poales_odb10) dataset Data are presented as the no. of BUSCOs (%).

Comparative Genomics of Chloridoid Grasses.

Sporobolus belongs to the Chloridoideae subfamily of grasses, a large and diverse group of predominantly C4 species with remarkable drought, heat, and salinity tolerance. The orphan grain crops finger millet and teff are found within Chloridoideae, as are several model desiccation-tolerant plants in the genera Oropetium, Eragrostis, Tripogon, and Sporobolus among others. Most of the surveyed Chloridoideae species (∼90%) are polyploid, including species from many of the aforementioned taxa. The availability of several high-quality chloridoid genomes facilitates detailed comparative genomic comparisons within these grasses. Macrosynteny between S. stapfianus and S. pyramidalis shows a clear 2:3 pattern, consistent with the tetraploid and hexaploid nature of these grasses, respectively (Fig. 1 and ). Comparisons with the closely related diploid chloridoid grass O. thomaeum also revealed 1:2 and 1:3 patterns of synteny for S. stapfianus and S. pyramidalis, respectively, supporting their polyploidy (Fig. 1 and ). Although neither Sporobolus genome is scaffolded into complete chromosomes, large 20-Mb+-sized scaffolds are highly collinear with the Oropetium genome with few structural large-scale rearrangements (), which is consistent with the unusually high conservation of karyotype and collinearity observed among other chloridoid grass genomes (22).
Fig. 1.

Microsynteny within Chloridoideae grasses. A collinear region between O. thomaeum, S. stapfianus, and S. pyramidalis is highlighted, reflecting the ploidy of each species (diploid, tetraploid, and hexaploidy, respectively). Genes are shown in blue and green, and syntenic gene pairs are connected by gray lines.

Microsynteny within Chloridoideae grasses. A collinear region between O. thomaeum, S. stapfianus, and S. pyramidalis is highlighted, reflecting the ploidy of each species (diploid, tetraploid, and hexaploidy, respectively). Genes are shown in blue and green, and syntenic gene pairs are connected by gray lines. Macrosyntenic analysis between the Sporobolus species and O. thomaeum exposed an overall more complex polyploid structure than the more straightforward tetraploid and hexaploid compositions (). Roughly half the hexaploid S. pyramidalis genome has the expected 3:1 pattern of syntenic blocks compared with O. thomaeum, while 37% is only 2:1. The pattern is similar for tetraploid S. stapfianus, where ∼44% of syntenic blocks are 2:1 to O. thomaeum as expected and ∼42% of blocks are 1:1 (). Similar assembly issues were observed in the tetraploid chloridoid grass E. nindensis, where one to four regions were assembled for each syntenic region in O. thomaeum (10). These discrepancies, combined with differences between the estimated and assembled genome sizes, suggest the Sporobolus genomes were partially collapsed during assembly in homologous regions. S. pyramidalis and S. stapfianus may be segmental allopolyploids with varying degrees of homology between chromosomes from separate subgenomes. Partial collapse during assembly would result in divergent homologous regions assembling separately and highly similar regions collapsing, which is supported by the observation that the ratio of assembled syntenic blocks is maintained across large syntenic blocks and whole chromosomes in O. thomaeum. For instance, two homologous regions are assembled in S. pyramidalis for chromosomes 3 and 4 from O. thomaeum, while three regions in S. pyramidalis were identified for most of chromosome 2 in O. thomaeum. Similar patterns were observed between S. stapfianus and O. thomaeum. To account for these issues related to polyploidy, syntenic gene pairs and orthogroups were used for downstream comparative genomics and transcriptomics analyses between the Sporobolus genomes and other chloridoid grasses.

Transcriptional Landscape of Desiccation in S. pyramidalis and S. stapfianus.

We generated RNASeq data from RNA isolated from leaf tissues at different stages of dehydration for both species (). Differentially expressed genes were identified using edgeR (23), and the resulting gene lists were assigned to GO biological process categories enrichment using the Cytoscape (23) plugin Bingo (24). These analyses indicate that S. pyramidalis and S. stapfianus transcriptomes respond differently to dehydration and share few biological process adaptations during the drying process. When water content decreases from 3 to 2 grams of water (gH2O) g−1 dry weight (dw), S. pyramidalis exhibits a strong response with 11,978 statistically differentially abundant transcripts (SDATs), in contrast to the more moderate response of 1,776 SDATs in S. stapfianus (Fig. 2 ). A GO enrichment analysis of SDAT lists further demonstrates that during the 3 to 2 gH2O g−1 dw water content transition, few biological processes are shared between the two species (Fig. 2 and ). Some biological process categories, including response to heat and response to reactive oxygen species, are common to both species (). Moreover, while S. pyramidalis responds to the change in water content from 3 to 2 gH2O g−1 dw by modulating processes involving the ribosome and the cell wall, S. stapfianus initiates alterations in the abundance of transcripts that relate to the response to oxidative stress, response to water deficit, and protein refolding ().
Fig. 2.

S. pyramidalis and S. stapfianus transcriptional landscape during desiccation/rehydration. (A and B) Bar plots of the numbers of differentially expressed genes (FDR ≤ 0.01) for S. pyramidalis (A) and S. stapfianus (B) from edgeR contrasts of sequential conditions; 2g corresponds to the contrast 2 vs. 3 gH2O g−1 dw, 1.5g corresponds to 1.5 vs. 2 gH2O g−1 dw, 1g corresponds to 1 vs. 1.5 gH2O g−1 dw, and so on. The last S. stapfianus contrast is 24 h after recovery irrigation vs. 3 gH2O g−1 dw. The numbers of up- and down-regulated genes are indicated at the top and bottom of each bar, respectively. The skull and bones icon indicates that S. pyramidalis is severely affected when at 1 gH2O g−1 dw and enters into senescence. (C and D) Graphs of enriched GO biological process categories in the contrast 2 vs. 3 gH2O g−1 dw for S. pyramidalis (C) and S. stapfianus (D). Nodes represent categories and edges represent the parent–child relationships in the ontology. Node identities and positions are identical in both graphs. Color is proportional to the ratio of increased abundance vs. decreased abundance transcripts in the category, with a green color indicating a ratio of more than one (a majority of increased abundance transcripts) and a magenta color indicating a ratio of less than one (a majority of decreased abundance transcripts). Category identifications and names are listed in .

S. pyramidalis and S. stapfianus transcriptional landscape during desiccation/rehydration. (A and B) Bar plots of the numbers of differentially expressed genes (FDR ≤ 0.01) for S. pyramidalis (A) and S. stapfianus (B) from edgeR contrasts of sequential conditions; 2g corresponds to the contrast 2 vs. 3 gH2O g−1 dw, 1.5g corresponds to 1.5 vs. 2 gH2O g−1 dw, 1g corresponds to 1 vs. 1.5 gH2O g−1 dw, and so on. The last S. stapfianus contrast is 24 h after recovery irrigation vs. 3 gH2O g−1 dw. The numbers of up- and down-regulated genes are indicated at the top and bottom of each bar, respectively. The skull and bones icon indicates that S. pyramidalis is severely affected when at 1 gH2O g−1 dw and enters into senescence. (C and D) Graphs of enriched GO biological process categories in the contrast 2 vs. 3 gH2O g−1 dw for S. pyramidalis (C) and S. stapfianus (D). Nodes represent categories and edges represent the parent–child relationships in the ontology. Node identities and positions are identical in both graphs. Color is proportional to the ratio of increased abundance vs. decreased abundance transcripts in the category, with a green color indicating a ratio of more than one (a majority of increased abundance transcripts) and a magenta color indicating a ratio of less than one (a majority of decreased abundance transcripts). Category identifications and names are listed in . As dehydration advances from 2 to 1.5 gH2O g−1 dw in S. pyramidalis, the functional categories of SDATs remain relatively unchanged from that activated at the initial loss of water, and as it is undergoing senescence during the 1.5 to 1 gH2O g−1 dw transition, further acclimation appears unlikely. By contrast, S. stapfianus exhibits an increase to 3,730 SDATs during the 2 to 1.5 gH2O g−1 dw transition, but starting at the 1.5 to 1 gH2O g−1 dw transition, it initiates a major remodeling of its transcriptome (), as indicated by a significant increase to 14,557 and 16,047 SDATs during these two transitions in water content, respectively (Fig. 2). Global transcriptional remodeling continues during the 0.75 to 0.5 gH2O g−1 dw transition, albeit at a lower degree, with 8,146 SDATs (Fig. 2). When desiccated S. stapfianus plants are rehydrated, another strong transcriptome reprogramming, with 27,280 SDATs 12 h after rehydration, is evident and shifts to a transcriptome functional expression profile more similar to that of the fully hydrated control (). Although S. stapfianus appeared morphologically fully recovered after 24 h of rehydration, the transcriptional profile is not equivalent to that observed in leaves of plants with a water content of 3 gH2O g−1 dw (), with 24,659 SDATs between the two conditions (Fig. 2). Leaves from plants 24 h after rehydration have up-regulated SDATs classified in ribosome biogenesis GO categories and down-regulated SDATs in photosynthesis categories, as well as remnants of stress-responsive adaptations, including the response to water categories, and altered metabolism, suggested by the presence of glucose 6-phosphate, fructose 1,6-bisphosphate, and several other metabolism-related categories ().

Syntenic Ortholog Transcripts Comparison of the Hydrated State in S. pyramidalis and S. stapfianus.

To directly compare the transcriptomes for S. stapfianus and S. pyramidalis and identify differentially regulated transcripts that relate to the differences between the two species in the hydrated state prior to dehydration, we created a custom list of syntenic ortholog “genes” (). Differential expression was accomplished using a contrast S. stapfianus vs. S. pyramidalis in edgeR (23), and the resultant syntenic ortholog gene lists were probed with GO enrichment as described previously for the intraspecies dehydration transcriptome analyses. The analyses demonstrate that S. stapfianus and S. pyramidalis have very different transcriptional landscapes under hydrated conditions that reflect functionally different priorities for each species. The S. stapfianus transcriptome significantly favors nitrogen, starch, and photosynthetic metabolic processes, whereas the S. pyramidalis transcriptome significantly favors processes involved in growth, primarily the biogenesis of cell wall components (). These differences are also reflected at the cellular component and molecular levels (), with the majority of cellular functions related to the chloroplast and photosystems in S. stapfianus and the symplast, cytoskeleton, cell wall, and cell wall modification activities in S. pyramidalis.

Syntenic Ortholog Protein Comparisons of Desiccation Response in S. pyramidalis and S. stapfianus.

To further compare the response of S. pyramidalis and S. stapfianus to dehydration, we performed a proteomic analysis using young leaves at 3 and 1.5 gH2O g−1 dw and focused on proteins encoded by syntenic genes in a comparison of enriched GO biological process categories of accumulating and decreasing proteins in both water content conditions (). At 1.5 gH2O g−1 dw, S. pyramidalis had increased accumulation of proteins that are almost exclusively involved in stress responses; S. stapfianus had increased accumulation of stress response proteins but also, accumulated proteins involved in the response to misfolded proteins and protein catabolism (), and it decreased the abundance of proteins involved in energy production (). The protein data demonstrate that, as observed for the transcriptomic profiles, S. pyramidalis and S. stapfianus follow predominantly different approaches of protein accumulation in their response to dehydration.

Expression Dynamics of DS and Desiccation-Tolerant Grasses.

To explore the evolution of VDT in the Chloridoideae subfamily of grasses, we made use of several high-quality genomes with similar dehydration expression datasets that were available for this group of grasses: the desiccation tolerant (S. stapfianus, O. thomaeum, and E. nindensis) and the DS (E. tef and S. pyramidalis). To facilitate comparisons between species with different ploidy, we clustered genes into syntenic orthologs using MCScan (25) and orthologous groups (orthogroups) using OrthoFinder (26) and compared expression patterns between genes in the same orthogroups. We identified 49,418 orthogroups from OrthoFinder containing 806,075 genes across 23 diverse land plant genomes and focused the subsequent analyses on orthogroups, orthologs, or syntenic gene pairs present in the genome of all chloridoid grasses. We first surveyed the global expression profiles of the five Chloridoid grasses under well-watered, drought/desiccation, and rehydration conditions using transformed expression data of 19,267 shared syntenic orthologs across all species. We applied a dimensionality reduction on the resulting expression matrix through principal component analysis. The first two principal components collectively explain 62% of the variance and separate the expression datasets by species and stress (Fig. 3). Well-watered RNASeq samples are found in a single tight cluster of all five species, while desiccation and rehydration samples are found in dispersed but distinct clusters. Samples from dehydration and rehydration time courses in the DT species fall into two clusters, with E. nindensis and O. thomaeum samples intertwined in one cluster and S. stapfianus in the second. The dehydration samples from the two DS species (E. tef and S. pyramidalis) clustered together in a third distinct cluster. Samples of E. nindensis and O. thomaeum are separated by relative water content in principal component (PC)1 and by dehydration vs. rehydration in PC2, but interestingly, they are not delineated by species. Together, these results indicate that expression patterns are broadly conserved in leaf samples of all species but that dehydration and rehydration samples are distinct between the three lineages of DT species and their DS relatives.
Fig. 3.

Dimensional reduction of drought expression profiles across DS and DT Cloridoid grasses. Raw expression values for syntenic orthogroups were transformed by z score prior to principal component analysis. The first two principal components are plotted for the two DS Chloridoid grasses (E. tef and S. pyramidalis) and three tolerant grasses (E. nindensis, O. thomaeum, and S. stapfianus) with comparative expression datasets. Points are colored by species or hydration state as indicated in the key.

Dimensional reduction of drought expression profiles across DS and DT Cloridoid grasses. Raw expression values for syntenic orthogroups were transformed by z score prior to principal component analysis. The first two principal components are plotted for the two DS Chloridoid grasses (E. tef and S. pyramidalis) and three tolerant grasses (E. nindensis, O. thomaeum, and S. stapfianus) with comparative expression datasets. Points are colored by species or hydration state as indicated in the key. The same leaf RNASeq data were analyzed in a pairwise fashion to identify genes with significantly increased transcript abundance under dehydrating conditions in all five species. These SDATs were clustered based on orthogroup using OrthoFinder (as described above) and compared between species. Orthogroups were used in this set of analyses as they contained more genes than the synteny-based analyses, and orthogroups have better resolution of recently duplicated genes. Across the five sequenced chloridoid grasses, the largest number of up-regulated orthogroups under dehydrating conditions was observed between the two Sporobolus species (Fig. 4), as expected since they are sister taxa. The second largest number of up-regulated orthogroups was shared between the two Sporobolus species and O. thomaeum (Fig. 4), which is consistent with their phylogenetic placement within the Chloridoideae. Many other orthogroups are up-regulated similarly in all five species (Fig. 4). The orthogroups uniquely up-regulated in all VDT species are enriched in 214 biological process GO terms (). Highly enriched GO terms include ultraviolet UV light response, chlorophyll catabolism, reactive oxygen species (ROS) metabolism, seed dormancy maintenance by abscisic acid (ABA), and gene expression in response to heat stress, among others (), These GO terms are consistent with well-characterized processes related to DT. Other GO terms with a lower magnitude of enrichment include those related to lipids, osmoprotectant biosynthesis, high light response, energy metabolism, protein degradation, and ABA signaling (). Seventy-one biological process GO terms were uniquely up-regulated in only the DS species (). These included several terms related to salicylic acid as well as ethylene and ABA signaling, arabinose biosynthesis, cell wall biogenesis, and notably, leaf senescence, among others (). We then asked whether any of the GO terms uniquely up-regulated in DT species would overlap with those uniquely down-regulated in DS species and vice versa (). The GO term “protein folding” was uniquely up-regulated in DT and down-regulated in DS species. Across these five species, most seed-related orthogroups are up-regulated similarly (). There are no seed orthogroups that are up-regulated in all three DT species without also being up-regulated in one or more DS species.
Fig. 4.

Venn diagram of up-regulated orthogroups across the five surveyed chloridoid grasses. The number of overlapping orthogroups with up-regulated expression under drought is shown for each comparison.

Venn diagram of up-regulated orthogroups across the five surveyed chloridoid grasses. The number of overlapping orthogroups with up-regulated expression under drought is shown for each comparison.

Expansion of ELIPs.

ELIPs have a conserved role in photoprotection during desiccation, and they have undergone massive tandem gene duplication in all sequenced resurrection plant genomes surveyed to date (14). We observed a similar duplication of ELIPs in the Sporobolus genomes (Fig. 5). The S. stapfianus genome has 65 ELIPs in three tandem arrays, and the S. pyramidalis genome has 30 ELIPs in two tandem arrays (Fig. 5). The largest array in S. stapfianus has 49 ELIPs compared with 17 in its corresponding homologous region, suggesting the duplications occurred after the divergence of the two S. stapfianus subgenomes. Both O. thomaeum and S. stapfianus have large tandem arrays of ELIPs, but the duplication events originated from different syntenic orthologs. The total number of ELIPs in S. pyramidalis is higher than some other desiccation-tolerant species, but when gene counts are normalized for ploidy, the ELIPs are within the range of other sensitive grasses.
Fig. 5.

ELIPs tandem duplication in S. stapfianus and ELIP gene abundance in leaf tissues. (A) Microsynteny of two ELIP tandem arrays is shown in S. stapfianus. ELIPs are shown in red, other genes are shown in gray, and syntenic homeologs between the scaffolds are denoted by gray connections. (B) The number of ELIPs in sequenced Chloridoideae grasses (E. tef, S. stapfianus, S. pyramidalis, E. coracana, O. thomaeum, and Z. mays) is plotted. The two desiccation-tolerant grasses are denoted in red. (C) Log2-transformed gene abundance (TPM) of the 30 ELIPs in S. pyramidalis and 65 ELIPs in S. stapfianus across each replicate of the leaf desiccation time courses.

ELIPs tandem duplication in S. stapfianus and ELIP gene abundance in leaf tissues. (A) Microsynteny of two ELIP tandem arrays is shown in S. stapfianus. ELIPs are shown in red, other genes are shown in gray, and syntenic homeologs between the scaffolds are denoted by gray connections. (B) The number of ELIPs in sequenced Chloridoideae grasses (E. tef, S. stapfianus, S. pyramidalis, E. coracana, O. thomaeum, and Z. mays) is plotted. The two desiccation-tolerant grasses are denoted in red. (C) Log2-transformed gene abundance (TPM) of the 30 ELIPs in S. pyramidalis and 65 ELIPs in S. stapfianus across each replicate of the leaf desiccation time courses. ELIPs have little to no detectable expression in well-watered tissue, but they are highly induced in desiccating S. stapfianus leaf tissue after they reach 1.0 gH2O g−1 dw, and their expression continues 12 and 24 h postrehydration (Fig. 5). ELIPs are also up-regulated under drought in S. pyramidalis, and this occurs quickly in the dehydration process at 2.0 and 1.5 gH2O g−1 dw. However, their combined expression is less than S. stapfianus (Fig. 5), similar to what has been observed in other DS grasses (14).

Discussion

The genomic resources we developed for the sister species S. stapfianus and S. pyramidalis offer a robust contrast that facilitates a strong comparison between a VDT and a DS grass species. The addition of the genomic resources from other resurrection grasses, O. thomaeum (8) and E. nindensis (10), broadens the comparison further into the Chloridoideae subfamily of grasses. The two genome assemblies revealed the complex mixed ploidy of these two grasses, with S. stapfianus primarily tetraploid and S. pyramidalis primarily hexaploid. The structural complexity of the two genomes likely contributed to the inability to assemble the genomes into chromosome-level contigs or to record sequenced genome sizes equivalent to those determined cytologically. The increase in ploidy between the two species probably occurred immediately after the divergence of the S. pyramidalis clade from the common ancestor of the two species (18). The assemblies did not reveal any genomic structural characteristics, with the exception perhaps of tandem arrays of ELIP genes (14), that could be attributed to the difference in VDT between the two species, which is consistent with the general observation that there is not a genomic “blueprint” for VDT in resurrection species (4). However, the assemblies did allow for a thorough comparative analysis, both structural and functional, of the gene space for each genome, and coupled with the in-depth transcriptome data, we were able to explore a detailed genomic assessment of the dehydration/desiccation responses within the Sporobolus sister species contrast. The generation of transcriptomic and proteomic data for dehydrating young leaf tissue at specific water contents during a dry-down experiment such that the dehydration levels are survivable for both grasses provides a broad assessment of the stress response for each species. DS S. pyramidalis mounted a messenger RNA (mRNA)-level response to an initial drop in hydration as has been observed for the majority of dehydration-sensitive plants (27, 28). However, as dehydration to 1.5 gH2O g−1 dw was reached, the transcript abundance response declined dramatically (Fig. 2), perhaps as the leaf water content reached a critical level for S. pyramidalis. The leaves of S. pyramidalis are wilted at 1.5 gH2O g−1 dw (21) but otherwise, appear undamaged, so it is tempting to speculate that the decline in the transcript abundance response may be related to wilting and perhaps, loss of turgor during wilting in S. pyramidalis. Although S. pyramidalis responds quickly to a loss of water, the early increased transcript abundance response appears to be focused on protein translational processes and transcripts common to heat and cold stress (), and only later, as dehydration deepens, do transcripts associated with proline metabolism (osmoregulation) and redox proteins, common to water-deficit responses (27), accumulate. The early decline in transcripts involved in photosynthesis and cell wall homeostasis is also common to the dehydration response in most angiosperms (4). The later decline in transcripts that are associated with general biosynthetic processes is consistent with the general lack of a metabolic response to dehydration seen in metabolite profiling studies of S. pyramidalis at similar levels of water loss (21). Desiccation-tolerant S. stapfianus, in contrast, exhibited a significantly different qualitative transcriptional response to dehydration with a low-magnitude response in the early phase of dehydration. With the comparatively muted response and although there are some common transcript abundance responses between the two species, S. stapfianus clearly targets remodeling a completely different functional aspect of the transcriptome than does S. pyramidalis at similar water contents. Indeed, it appears that S. stapfianus targets the accumulation of transcripts that function more in stress-related activities unlike S. pyramidalis, which does not. The differences between the two transcriptional responses for the two species were unexpected as other studies have indicated that there was extensive overlap in functionality of the transcriptomes of both sensitive and tolerant grasses exposed to dehydration (10). Although there are a few common transcript abundance functional categories in the early response to dehydration in both species, it is clear that the overall transcriptome remodeling during dehydration is very different between them, as exemplified by the different dehydration thresholds for the accumulation ELIP transcripts. For S. stapfianus, the primary remodeling of the transcriptome during dehydration appears to occur as the plants reach the 1.0- to 0.75-gH2O g−1 dw part of the drying curve, which appears to be a critical period in the desiccation response of all resurrection angiosperms studied so far (29) and concurs with early microarray data (30). In S. stapfianus, the transition from 1.0 to 0.75 gH2O g−1 dw occurs during leaf curling (19) and is likely at water contents just prior to and during a change in membrane fluidity that occurs as leaf water potentials approach −12 MPa (4). The functional aspects of the transcriptome remodeling during desiccation of S. stapfianus leaves have been documented previously and are in accord with the observation that transcript abundance is concordant with changes in metabolism associated with cellular protection aspects of DT (30). There was a dramatic alteration of the transcriptome upon rehydration of S. stapfianus leaves, which likely reflects the complex nature of the dehydration event. The magnitude of the change in the transcriptome, reflecting a change in abundance of at least half of the known transcripts, and the functional processes they represent indicate not only the stress incurred from the inrush of water and mechanical aspects of cellular expansion but also, the need to repair damage (from both desiccation and rehydration), reactivate energy metabolism, and reinstate the physiological integrity of the cells and tissues (4). The observation that transcripts encoding proteins involved in ribosome biogenesis are accumulated and those encoding proteins involved in photosynthesis have not recovered control levels at 24 h following rehydration highlights the extent of the impact that desiccation and rehydration have on plant cells and tissues even in DT plants. S. stapfianus requires between 48 and 72 h to regain the structural and physiological integrity seen in well-watered plants (19, 31). The remodeling of the transcriptome in response to dehydration starts from two very different resting-state (fully hydrated) transcriptomes. Our functional analysis of the gene-level expression of the syntenic orthologs of the sister grasses, although somewhat confounded by the structural complexity of the two genomes, revealed that for S. stapfianus, the biosynthesis of starch and nitrogen compounds was perhaps a priority for young leaves under normal conditions, while for young leaves of S. pyramidalis, the priority appeared more focused on the construction of cell walls. Although somewhat speculative, the increase in nitrogen compounds, primarily amino acids from a combination of new synthesis and redistribution, was the focus of a recent study that demonstrated that these compounds are apparently used to fuel central metabolism or for other metabolic adjustments related to the acquisition of DT, such as osmoregulation (32). The differences in priorities are consistent with the changes in protein abundance from 3 to 1.5 gH2O g−1 dw. Although S. pyramidalis protein abundance changes did not reflect cell wall processes, perhaps due to the difficulty in extracting the majority of wall-related proteins (33), they show that S. pyramidalis was almost exclusively focused on the accumulation of stress response proteins. At the same desiccation stage, S. stapfianus had similarly accumulated stress response proteins but also, proteins involved in protein catabolism, and it had down-accumulated energy-related proteins, suggesting a scaling down, at the protein level, of the energy metabolism transcriptomic activity of the hydrated state and the continuation of N metabolism prioritization through protein salvage, possibly from misfolded proteins. Syntenic orthologs transcriptomic data are also consistent with information from the metabolomes of young leaves of these two grasses in that fully hydrated leaves of S. stapfianus were focused on the accumulation of a variety of amino acids and photosynthate derivatives, while for S. pyramidalis, the metabolome was focused on energy metabolism and growth (21). The conclusion from the metabolomics analyses was that leaves of S. stapfianus were prepared (primed) for a dehydration/desiccation event by accumulating osmolytes in times of water abundance and that S. pyramidalis needed to generate energy and components to support a faster growth rate, perhaps to deal with competition in its more mesic habitats. The hydrated transcriptome functional analysis fully supports this conclusion, and our transcriptomic and proteomic data, although somewhat speculative in nature, extend the hypothesis to include a focus on the maintenance of chloroplast function in S. stapfianus in the priming mechanism and cell wall biogenesis in S. pyramidalis as a target for the focus on energy metabolism and growth. Although transcriptomic analyses were useful in comparing the functional aspects of the response to dehydration of the contrasting sister Sporobolus species and the desiccation and rehydration response of S. stapfianus, the availability of a high-quality genome for each of these two species allowed for a direct comparison of the genetic components (and their functions) of the response and allowed us to extend the comparison with other desiccation-tolerant and DS grass species. The broad comparison of the expression patterns of orthogroups and syntenic gene sets common in all five of the chloridoid grasses included in the analysis confirmed the disparate nature of the dehydration response between S. stapfianus and S. pyramidalis. It also revealed that the overall dehydration expression pattern for S. stapfianus was distinctly different from those observed for the other two desiccation-tolerant grasses, E. nindensis and O. thomaeum. The most recent phylogenetic analyses of the Chloridoideae indicate that the common ancestor of the Eragrostideae, which contains E. nindensis and E. tef, gave rise to the Zoysieae and the Cynodonteae, within which O. thomaeum resides; the Zoysieae then diversified into the Zoysiinae and the Sporobolinae, within which the Sporobolus clade containing both S. stapfianus and S. pyramidalis is located (18, 34). The phylogeny indicates that O. thomaeum and S. stapfianus are closer to one another than either are to E. nindensis, which is consistent with results of our analysis of orthogroups representing SDATs that increase in abundance. However, the overall expression response to dehydration for O. thomaeum appears to be more similar to the distantly (ancestrally) evolved response of E. nindensis. This might also explain why there is less overlap between the dehydration transcriptome of S. stapfianus and the transcriptomes of both sensitive and tolerant grasses exposed to dehydration (10). Thus, although we have used only a three-way comparison, it does allow for the hypothesis that the desiccation response of S. stapfianus represents a more recent evolution of a mechanism for VDT within the Chloridoideae. The orthogroup analysis of the SDATs that increase in abundance in all of the VDT species underscored the importance of most of the well-characterized processes that deliver cellular DT (4). The orthogroup analysis of the SDATs that increase in abundance in all of the DS species also reconfirmed what we understand of the response of most plants to a water deficit stress and highlighted the induction of senescence, which is thought to be blocked in resurrection angiosperms during desiccation (reviewed in ref. 4). However, the observation that transcripts classified as involved in protein folding accumulate in the VDT species and decline in abundance in the DS species indicates not only that maintaining protein structure is important in VDT, as has been well documented, but that the lack of the necessary components to do so might be critical in rendering a plant DS. The observation that all seed-related orthogroups are up-regulated in all VDT species and in one or more of the DS species reinforces the hypothesis that VDT likely evolved from a reprogramming of DT mechanisms that evolved in seeds (10).

Materials and Methods

Plant Material.

S. stapfianus Gandoger (original provenance: Verena, Transvaal, South Africa) and S. pyramidalis Beauv. (also known as Sporobolus indicus var. pyramidalis) were grown and maintained as described in ref. 21. For genome sequencing, a single, healthy 3-mo-old fully hydrated plant from each species was selected, and young leaf tissue was collected, flash frozen in liquid N2, and stored at −80 °C. For RNASeq experiments, seeds were collected from selfed clonal plants derived from the individuals used for the genome sequencing and germinated and plants grown to the 3-mo-old stage under greenhouse conditions (16-h light and day/night temperatures of 28 °C/19 °C).

Sampling and Drying Treatments.

Plants were grown and maintained and seed stocks were increased (as described in ref. 35) in 1-gallon pots under greenhouse conditions. Three-month-old plants were subjected to a drying event by withholding water. S. stapfianus plants were dried until desiccated (after 3 wk), whereas S. pyramidalis plants were dried to a water content of ∼1.5 gH2O g−1 dw before rewatering. Drying rates were as described by Oliver et al. (21) to simulate field drying rates that occur over a period of ∼7 d to reach the 1.5-gH2O g−1 dw stage for both grasses and ∼14 d for full desiccation of S. stapfianus (plants were left dry for a further 7 d). Young leaf tissue was collected at daily intervals, between 9 and 10 AM, from individual plants, flash frozen in liquid N2, and stored at −80 °C. Dried plants were maintained dry for a week before rehydration. Duplicate samples were harvested for water content measurements at the time of sampling. The water content was calculated as fresh weight minus the dry weight (dried to equilibrium at 70 °C for 4 h). Triplicate samples were chosen for RNA extraction. Rehydration was achieved by placing the desiccated S. stapfianus plants under a continuous misting system in the greenhouse, and young leaves were sampled in triplicate at 12 and 24 h following the addition of water.

Estimation of the Genome Size (1C Value) Using Flow Cytometry.

The genome size was estimated using the one-step flow cytometry procedure described in ref. 36. Approximately 1 cm2 of leaf material from the Sporobolus species and leaf material of the calibration standard Petroselinum crispum (Mill.) Fuss (37) (haploid genome [1C] value = 2,201 Mbp) were diced in 1 mL of “general purpose buffer” (GPB) (38) supplemented with 3% polyvinylpyrrolidone of average molecular weight of 40,000. A further 1 mL of GPB was added, and the homogenate was filtered through a 30-µm nylon mesh (Celltrics 30-µM mesh; Sysmex); 100 μL propidium iodide (1 mg/mL) was added and incubated on ice for 10 min. The relative fluorescence of 5,000 particles was recorded using a Partec Cyflow SL3 flow cytometer (Partec GmbH) fitted with a 100-mW green solid-state laser (532 nm; Cobolt Samba). Three replicates of species were processed, and output histograms were analyzed using FlowMax software v.2.4 (Partec GmbH).

DNA Isolation, Library Preparations, and Sequencing.

High–molecular weight DNA was isolated from 5 g of flash-frozen young leaf tissue using the PacBio SampleNet–Shared Protocol (https://www.pacb.com/support/documentation/) as described. Random shotgun genomic libraries with various insert sizes, both paired end and mated pair libraries, were constructed for the Illumina HiSeq 2000 sequencing system (Illumina) according to the manufacturer’s protocols. Sequencing of was conducted using an Illumina HiSeq 2500 ultrahigh-throughput DNA sequencing platform (Illumina) at the DNACore facility at the University of Missouri, Columbia, MO (https://dnacore.missouri.edu/ngs.html). For Chicago sequencing, genomic DNA isolation, library preparation, sequencing, and assembly were conducted by Dovetail Genomics and are detailed in . Chicago genomic DNA libraries were prepared as described in ref. 39. Dovetail Hi-C libraries were prepared as described in ref. 40 after fixation of chromatin in place in the nucleus by incubation of leaf tissue for each species in 1% formaldehyde for 15 min under vacuum.

Assembly of the S. stapfianus and S. pyramidalis Genomes.

A de novo assembly was constructed using a combination of paired end (mean insert size ∼350 bp) libraries and mated pair libraries with inserts ranging from 7 to 12 kbp. De novo assembly was performed using Meraculous v2.2.2.5 (diploid mode 1) (41) with a k-mer size of 109. Reads were trimmed for quality, sequencing adapters, and mate pair adapters using Trimmomatic (42). The de novo assembly, shotgun reads, Chicago library reads, and Dovetail Hi-C library reads were used as input data for HiRise, a software pipeline designed specifically for using proximity ligation data to scaffold genome assemblies (39) and detailed in .

RNA Extraction and RNA Sequencing.

RNA was extracted from young leaf samples using the RNeasy (Qiagen) kit with RLC buffer following the manufacturer’s protocol. The RNA isolates were treated with deoxyribonuclease 1 and cleaned using the DNA-free RNA Kit (Zymo Technologies). RNA quality was assessed by use of a fragment analyzer (Advanced Analytical Technologies), and concentration was determined with a Nanodrop Spectrophotometer (ThermoFisher). RNA libraries were individually bar-coded from 2.7 µg of template total RNA utilizing the TruSeq RNA Sample Prep Kit (Illumina) as described in the manufacturer’s protocol. Libraries were pooled in groups of 12 and sequenced (12 samples per lane) on an Illumina HiSeq 2500 ultrahigh-throughput DNA sequencing platform (Illumina) at the DNACore facility at the University of Missouri.

Full-Length Sequencing (Iso-Seq) and De Novo Transcriptome Assembly.

High-quality RNA was extracted from whole-root tissues obtained from seedlings at the four-leaf stage when the first pair of leaves had matured, whole seedlings at the two-leaf stage, mature leaves, young leaves, floral inflorescences, and tissue samples identical to those used for the dehydration/desiccation/rehydration transcriptomes. The RNAs were pooled for each individual species for subsequent amplification. Bar-coded SMRT libraries were prepared and sequenced on the PacBio platform with X SMRT cells by Novogene Corporation Inc. Sequence reads were processed using Iso-Seq3 (https://github.com/PacificBiosciences/IsoSeq).

Genome Annotation.

Genome assemblies were annotated using three rounds of MAKER-P. Briefly, round 1 used full-length nonchimeric sequences from PacBio transcriptome sequencing as EST evidence; a collection of Arabidopsis thaliana [Araport11 (43)], Zea mays [downloaded from Gramene’s ftp server at https://www.gramene.org/ftp-download; AGPv4 release 59 (44, 45)], Sorghum bicolor [downloaded from Phytozome; https://phytozome-next.jgi.doe.gov/pz/portal.html, version 3.1.1 (46)], and O. thomaeum [downloaded from Phytozome, version 1.0 (7)] sequences as protein evidence; and a de novo repeats library obtained using LTR_Finder (47), LTRharvest (48), LTR retriever (49), and RepeatModeler (50) as inputs. Round 2 used the round 1 maker gff file and an SNAP (http://korflab.ucdavis.edu/software.html) hmm file obtained from the round 1 gff3 file. Round 3 used the round 2 maker gff3 file, the GeneMark-ES (51) HMM output file from a BRAKER (52) run from hisat (53) aligned RNASeq reads, and the corresponding Augustus (54) gene prediction models. As a further filter, we decided to only keep genes that had expression evidence in our RNASeq Illumina or Pacbio data and/or whose corresponding protein is homologous to a known plant protein. Evidence of expression was at least one of the following two criteria: 1) an expression value of at least one transcripts per million (TPM) in all replicates of at least one sample in the RNASeq data after bowtie2 (55) alignment and Salmon (56) quantification or 2) at least one TPM in the gtf file obtained after a minimap2 (57) alignment and StringTie (58) quantification of IsoSeq3 polished long reads. Sporobolus proteins were considered as homologous if they satisfied at least one of three criteria: 1) a blastp match with an e value of 1e-6 or lower vs. either Arabidopsis proteins [Araport11 annotation (43)]; 2) vs. a collection of Glycine max, Oryza sativa subsp. japonica, Populus trichocarpa, Solanum lycopersicum, S. bicolor, Vitis vinifera, Brachypodium distachyon, Physcomitrella patens subsp. patens, and Chlamydomonas reinhardtii UniProt Trembl proteins; or 3) proteins with a domain identified by InterProScan (59) with an e value of 1e-10 or lower. Final gene identifiers are in the format Sp2s00000_00000 for S. pyramidalis and Ss2s00000_00000 for S. stapfianus. Sp stands for S. pyramidalis, Ss stands for S. stapfianus, 2 indicates the genome version, s00000 indicates the scaffold number, and the last five digits are an arbitrary gene number.

Genome Functional Annotation.

GO annotation was done using a simplified version of the maize–GAMER pipeline (60). Transcript sequences were analyzed using BLAST vs. Arabidopsis Araport11 proteins and a collection of UniProt (61) TREMBL proteins from nine plant species (G. max, O. sativa subsp. japonica, P. trichocarpa, S. lycopersicum, S. bicolor, V. vinifera, B. distachyon, P. patens subsp. patens, C. reinhardtii), InterProScan with the -goterms option, and Pannzer2 (62). GO annotations of BLAST reciprocal best hits were retrieved from either the A. thaliana gaf file available at http://geneontology.org or the GOA file available at European Bioinformatics Institute. GO annotations from Blast, InterProScan, and Pannzer2 analyses were collated into a nonredundant gaf file and used for GO enrichment analyses.

Orthogroup Analysis.

Comparative genomics analyses were completed using MCScan (25). The O. thomaeum genome was used as a common anchor as it is diploid and has a chromosome scale assembly. A minimum cutoff of five genes was used to identify syntenic gene blocks. A set of syntenic orthogroups was created containing genes present in all grass species analyzed. We clustered proteins from 23 species into orthogroups using OrthoFinder (v2.3.8) (26). OrthoFinder using default parameters and the reciprocal DIAMOND search was used to identify similar proteins, which were clustered using the Markov Cluster Algorithm. The following species were included in OrthoFinder: Ananas comosus, A. thaliana, B. distachyon, E. nindensis, E. tef, L. brevidens, L. subracemosa, Marchantia polymorpha, Medicago truncatula, O. sativa, O. thomaeum, P. patens, S. bicolor, Setaria italica, Selginella. lepidophylla, Selaginella. moellendorffii, S. lycopersicum, S. pyramidalis, S. stapfianus, V. vinifera, Xerophyta viscosa, Zostera marina, and Z. mays. A set of orthogroups containing seed-related genes was previously identified based on seed and leaf expression datasets from Z. mays, S. bicolor, O. sativa, and E. tef (22). Syntenic orthologs of these seed-related genes were then identified in O. thomaeum, and these syntenic orthologs were used with OrthoFinder output to identify seed-related orthogroups.

Cross-Species Expression Analyses.

Differential expression (DE) analyses were conducted using DESeq2 (63) (E. nindensis, E. tef, and O. thomaeum) or edgeR (23) (S. stapfianus and S. pyramidalis), and resulting outputs were processed using Pandas 0.25.0 in Python 3.6.8. Up-regulated and down-regulated genes were extracted for each species (). OrthoFinder output was used to identify the orthogroup corresponding to each gene in the differential expression output. For seed orthogroups, the previously generated lists of seed-related orthogroups were used to extract differentially expressed seed orthogroups. The intersections and differences among the resulting sets of orthogroups were then extracted, and Venn diagrams were constructed using matplotlib_venn (version 3.1.1) (64) or Python package venn. Enrichment of GO terms was conducted using topGO (65) 2.38.1 in R 3.6.0 for various intersections and differences of DE orthogroups (). Differentially expressed genes in these orthogroups were extracted, and GO enrichment was conducted using Fisher’s exact test via the weight01 algorithm. Following enrichment, unique biological process GO terms were extracted using the Python library Pandas. Unique GO terms for DS as compared with DT were also extracted for further study.

GO Categories Enrichment Analysis for the S. stapfianus vs. S. pyramidalis Comparison.

A comparison of gene expression of S. stapfianus vs. S. pyramidalis leaves at 3 gH2O g−1 dw was achieved using tximport (66) and edgeR (23). We created a custom syntenic orthologs tx2gene file (https://bioconductor.org/packages/release/bioc/vignettes/tximport/inst/doc/tximport.html). GO annotation files for both species were merged, replacing each gene identifier with the custom gene identifier from our tx2gene file. In this way, each gene inherits the GO annotation of all its corresponding S. stapfianus and S. pyramidalis genes (). GO categories enrichment analysis was carried out for the list of up-regulated both_n genes and the list of down-regulated both_n genes using Bingo (24) in Cytoscape (67), with a false discovery rate (FDR)-adjusted P value cutoff of 0.05 and the list of genes in our tx2gene file as the universe.

Protein Extraction and Proteomics Analysis.

Proteins were extracted from triplicate samples of 1 g of frozen leaf tissue, separated on 16-cm sodium dodecyl sulfate polyacrylamide gel electrophoresis gels, and cut into 10 equal slices; each slice was digested with trypsin, and liquid chromatograph mass spectrometer (LCMS) data were acquired on the LTQ Orbitrap at the Charles W. Gehrke Proteomics Center, University of Missouri using standard protocols (http://proteomics.missouri.edu/protocols.php). Raw data were analyzed with MaxQuant software v. 2.0.1.0 (68). Tandem mass spectrometer spectra were searched against the S. pyramidalis and S. stapfianus proteins, and potential contaminants by the built-in Andromeda search engine (69). Label-free quantification (LFQ) of the identified proteins was performed using normalized LFQ (LFQ intensity) using the MaxLFQ algorithms (70). The resulting identified proteins were filtered, keeping only proteins with an LFQ intensity greater than zero in all biological replicates or absent in all biological replicates. Proteins with significant Student’s t test (two tailed; P < 0.05) results were considered up accumulated (log2 fold change > 0.5) or down accumulated (log2 fold change < 0.5). The lists of up-and down-accumulated protein identifiers were translated to their corresponding syntenic ortholog identifiers, and GO biological process categories enrichment was done using Bingo previously.
  61 in total

1.  Isolation of plant cell wall proteins.

Authors:  Elisabeth Jamet; Georges Boudart; Giséle Borderies; Stephane Charmont; Claude Lafitte; Michel Rossignol; Herve Canut; Rafael Pont-Lezica
Journal:  Methods Mol Biol       Date:  2008

2.  The B73 maize genome: complexity, diversity, and dynamics.

Authors:  Patrick S Schnable; Doreen Ware; Robert S Fulton; Joshua C Stein; Fusheng Wei; Shiran Pasternak; Chengzhi Liang; Jianwei Zhang; Lucinda Fulton; Tina A Graves; Patrick Minx; Amy Denise Reily; Laura Courtney; Scott S Kruchowski; Chad Tomlinson; Cindy Strong; Kim Delehaunty; Catrina Fronick; Bill Courtney; Susan M Rock; Eddie Belter; Feiyu Du; Kyung Kim; Rachel M Abbott; Marc Cotton; Andy Levy; Pamela Marchetto; Kerri Ochoa; Stephanie M Jackson; Barbara Gillam; Weizu Chen; Le Yan; Jamey Higginbotham; Marco Cardenas; Jason Waligorski; Elizabeth Applebaum; Lindsey Phelps; Jason Falcone; Krishna Kanchi; Thynn Thane; Adam Scimone; Nay Thane; Jessica Henke; Tom Wang; Jessica Ruppert; Neha Shah; Kelsi Rotter; Jennifer Hodges; Elizabeth Ingenthron; Matt Cordes; Sara Kohlberg; Jennifer Sgro; Brandon Delgado; Kelly Mead; Asif Chinwalla; Shawn Leonard; Kevin Crouse; Kristi Collura; Dave Kudrna; Jennifer Currie; Ruifeng He; Angelina Angelova; Shanmugam Rajasekar; Teri Mueller; Rene Lomeli; Gabriel Scara; Ara Ko; Krista Delaney; Marina Wissotski; Georgina Lopez; David Campos; Michele Braidotti; Elizabeth Ashley; Wolfgang Golser; HyeRan Kim; Seunghee Lee; Jinke Lin; Zeljko Dujmic; Woojin Kim; Jayson Talag; Andrea Zuccolo; Chuanzhu Fan; Aswathy Sebastian; Melissa Kramer; Lori Spiegel; Lidia Nascimento; Theresa Zutavern; Beth Miller; Claude Ambroise; Stephanie Muller; Will Spooner; Apurva Narechania; Liya Ren; Sharon Wei; Sunita Kumari; Ben Faga; Michael J Levy; Linda McMahan; Peter Van Buren; Matthew W Vaughn; Kai Ying; Cheng-Ting Yeh; Scott J Emrich; Yi Jia; Ananth Kalyanaraman; An-Ping Hsia; W Brad Barbazuk; Regina S Baucom; Thomas P Brutnell; Nicholas C Carpita; Cristian Chaparro; Jer-Ming Chia; Jean-Marc Deragon; James C Estill; Yan Fu; Jeffrey A Jeddeloh; Yujun Han; Hyeran Lee; Pinghua Li; Damon R Lisch; Sanzhen Liu; Zhijie Liu; Dawn Holligan Nagel; Maureen C McCann; Phillip SanMiguel; Alan M Myers; Dan Nettleton; John Nguyen; Bryan W Penning; Lalit Ponnala; Kevin L Schneider; David C Schwartz; Anupma Sharma; Carol Soderlund; Nathan M Springer; Qi Sun; Hao Wang; Michael Waterman; Richard Westerman; Thomas K Wolfgruber; Lixing Yang; Yeisoo Yu; Lifang Zhang; Shiguo Zhou; Qihui Zhu; Jeffrey L Bennetzen; R Kelly Dawe; Jiming Jiang; Ning Jiang; Gernot G Presting; Susan R Wessler; Srinivas Aluru; Robert A Martienssen; Sandra W Clifton; W Richard McCombie; Rod A Wing; Richard K Wilson
Journal:  Science       Date:  2009-11-20       Impact factor: 47.728

3.  The resurrection genome of Boea hygrometrica: A blueprint for survival of dehydration.

Authors:  Lihong Xiao; Ge Yang; Liechi Zhang; Xinhua Yang; Shuang Zhao; Zhongzhong Ji; Qing Zhou; Min Hu; Yu Wang; Ming Chen; Yu Xu; Haijing Jin; Xuan Xiao; Guipeng Hu; Fang Bao; Yong Hu; Ping Wan; Legong Li; Xin Deng; Tingyun Kuang; Chengbin Xiang; Jian-Kang Zhu; Melvin J Oliver; Yikun He
Journal:  Proc Natl Acad Sci U S A       Date:  2015-04-20       Impact factor: 11.205

4.  LTR_retriever: A Highly Accurate and Sensitive Program for Identification of Long Terminal Repeat Retrotransposons.

Authors:  Shujun Ou; Ning Jiang
Journal:  Plant Physiol       Date:  2017-12-12       Impact factor: 8.340

5.  Sporobolus stapfianus: Insights into desiccation tolerance in the resurrection grasses from linking transcriptomics to metabolomics.

Authors:  Abou Yobi; Karen A Schlauch; Richard L Tillett; Won C Yim; Catherine Espinoza; Bernard W M Wone; John C Cushman; Melvin J Oliver
Journal:  BMC Plant Biol       Date:  2017-03-28       Impact factor: 4.215

6.  Massive Tandem Proliferation of ELIPs Supports Convergent Evolution of Desiccation Tolerance across Land Plants.

Authors:  Robert VanBuren; Jeremy Pardo; Ching Man Wai; Sterling Evans; Dorothea Bartels
Journal:  Plant Physiol       Date:  2019-01-02       Impact factor: 8.340

7.  Comprehensive mapping of long-range interactions reveals folding principles of the human genome.

Authors:  Erez Lieberman-Aiden; Nynke L van Berkum; Louise Williams; Maxim Imakaev; Tobias Ragoczy; Agnes Telling; Ido Amit; Bryan R Lajoie; Peter J Sabo; Michael O Dorschner; Richard Sandstrom; Bradley Bernstein; M A Bender; Mark Groudine; Andreas Gnirke; John Stamatoyannopoulos; Leonid A Mirny; Eric S Lander; Job Dekker
Journal:  Science       Date:  2009-10-09       Impact factor: 47.728

8.  Cytoscape 2.8: new features for data integration and network visualization.

Authors:  Michael E Smoot; Keiichiro Ono; Johannes Ruscheinski; Peng-Liang Wang; Trey Ideker
Journal:  Bioinformatics       Date:  2010-12-12       Impact factor: 6.937

9.  Meraculous: de novo genome assembly with short paired-end reads.

Authors:  Jarrod A Chapman; Isaac Ho; Sirisha Sunkara; Shujun Luo; Gary P Schroth; Daniel S Rokhsar
Journal:  PLoS One       Date:  2011-08-18       Impact factor: 3.240

10.  Maize GO Annotation-Methods, Evaluation, and Review (maize-GAMER).

Authors:  Kokulapalan Wimalanathan; Iddo Friedberg; Carson M Andorf; Carolyn J Lawrence-Dill
Journal:  Plant Direct       Date:  2018-04-11
View more
  2 in total

1.  Coevolution of tandemly repeated hlips and RpaB-like transcriptional factor confers desiccation tolerance to subaerial Nostoc species.

Authors:  Hai-Feng Xu; Guo-Zheng Dai; Yang Bai; Jin-Long Shang; Bin Zheng; De-Min Ye; Huazhong Shi; Aaron Kaplan; Bao-Sheng Qiu
Journal:  Proc Natl Acad Sci U S A       Date:  2022-10-10       Impact factor: 12.779

Review 2.  Exploring the High Variability of Vegetative Desiccation Tolerance in Pteridophytes.

Authors:  Gerardo Alejo-Jacuinde; Luis Herrera-Estrella
Journal:  Plants (Basel)       Date:  2022-04-30
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.