Literature DB >> 26956888

Transcriptomes of Plant Gametophytes Have a Higher Proportion of Rapidly Evolving and Young Genes than Sporophytes.

Toni I Gossmann1, Dounia Saleh2, Marc W Schmid3, Michael A Spence4, Karl J Schmid5.   

Abstract

Reproductive traits in plants tend to evolve rapidly due to various causes that include plant-pollinator coevolution and pollen competition, but the genomic basis of reproductive trait evolution is still largely unknown. To characterize evolutionary patterns of genome wide gene expression in reproductive tissues in the gametophyte and to compare them to developmental stages of the sporophyte, we analyzed evolutionary conservation and genetic diversity of protein-coding genes using microarray-based transcriptome data from three plant species, Arabidopsis thaliana, rice (Oryza sativa), and soybean (Glycine max). In all three species a significant shift in gene expression occurs during gametogenesis in which genes of younger evolutionary age and higher genetic diversity contribute significantly more to the transcriptome than in other stages. We refer to this phenomenon as "evolutionary bulge" during plant reproductive development because it differentiates the gametophyte from the sporophyte. We show that multiple, not mutually exclusive, causes may explain the bulge pattern, most prominently reduced tissue complexity of the gametophyte, a varying extent of selection on reproductive traits during gametogenesis as well as differences between male and female tissues. This highlights the importance of plant reproduction for understanding evolutionary forces determining the relationship of genomic and phenotypic variation in plants.
© The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  gametophyte.; plant evolution; reproduction; selection; transcriptome

Mesh:

Substances:

Year:  2016        PMID: 26956888      PMCID: PMC4915351          DOI: 10.1093/molbev/msw044

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


Introduction

Reproductive traits in plants and animals tend to be highly diverse and rapidly evolving within and between closely related species (Barrett 2002; Swanson and Vacquier 2002; Parsch and Ellegren 2013). Their diversity may be influenced by the coevolution with pollinators or pathogens that infect reproductive tissues, the mating system (i.e., selection for the maintenance of self-incompatibility), the rapid evolutionary dynamics of sex chromosomes, genomic conflicts between parents and offspring, or from sexual selection (Baack et al. 2015). Some genes and proteins expressed in reproductive tissues exhibit high rates of evolution (Swanson and Vacquier 2002; Parsch and Ellegren 2013). In plants, they include genes encoding the self-incompatibility system (Nasrallah et al. 2002; Tang et al. 2007), pollen-coat proteins (Schein et al. 2004), and imprinted genes controlling resource allocation to offspring (Spillane et al. 2007). The rapid evolution of reproductive traits and their underlying genes is in contrast to other tissues and developmental stages that appear to be more conserved. In particular, the phylotypic stage in animals, in which a similar morphology at a certain stage of embryo development is observed within phyla, represents the archetype of morphological evolutionary conservation within a phylum (Duboule 1994). Although reproductive traits appear to evolve rapidly in animals, plants and other organisms with anisogamic sexual reproduction (Lipinska et al. 2015), there is a fundamental difference between these groups. In animals, a group of cells are set aside during early development, which forms the germ line. Plants do not have a germ line, but are characterized by alternating sporophytic and haploid gametophytic stages (Grossniklaus 2011; Schmidt et al. 2011). Since the two stages differ in their development and role in reproduction, the function and evolution of genes expressed in the sporophyte and gametophyte should also differ. Furthermore, the haploid stage immediately exposes recessive mutations to selection which causes different evolutionary dynamics of genes expressed in the gametophyte compared with genes only expressed in a diploid stage (Gossmann, Schmid, et al. 2014). Currently it is little understood which processes drive the rapid evolution of plant reproductive genes on a genome-wide scale. During plant gametogenesis, the transcription profile changes dramatically, and genes involved in reproduction are enriched in this phase (Schmid et al. 2005; Fujita et al. 2010; Xiao et al. 2011; O’Donoghue et al. 2013). However, a focus on genes whose expression is enriched in a specific tissue introduces a bias for genes with specific expression patterns that ignores the contribution of other genes to the total diversity of expression patterns (Arunkumar et al. 2013; Gossmann, Schmid, et al. 2014). To characterize the evolutionary dynamics of transcriptomic profiles, it is therefore necessary to combine the genome-wide expression intensity of all genes expressed in a given tissue and stage with evolutionary parameters quantifying the level of polymorphism, rate of molecular evolution or long-term evolutionary conservation (Slotte et al. 2011). For this purpose, evolutionary indices such as the transcriptome age index (TAI), which measures the long-term conservation of expressed genes weighted by the relative expression of the gene, or the transcriptomic divergence index (TDI), which compares the rate of nonsynonymous to synonymous substitutions in a protein-coding gene between closely related species (Domazet-Lošo and Tautz 2010; Kalinka et al. 2010; Quint et al. 2012) were developed to test whether the phylotypic stage as defined by Haeckel has a molecular equivalent. Studies in vertebrates (zebrafish) and insects (Drosophila melanogaster) confirmed this hypothesis because genes expressed during the phylotypic stage were more conserved and less rapidly evolving than genes expressed in other stages of development (Domazet-Lošo and Tautz 2010; Kalinka et al. 2010). Although plants do not have a clear morphologically defined phylotypic stage, a transcriptomic hourglass was also postulated for the model plant Arabidopsis thaliana because old and slowly evolving genes contribute disproportionally to the overall transcriptome during early stages of embryo development (Quint et al. 2012; Drost et al. 2015), but see Piasecka et al. (2013). Based on the above considerations, we reasoned that the morphologically and developmentally diverse reproductive stages of plants, in particular the gametophyte, should be characterized by a high proportion of expressed genes with a lower degree of long-term evolutionary conservation (Cui et al. 2015) and a higher rate of divergence between closely related species. We tested this hypothesis by comparing the transcriptome-based indices of evolution observed in reproductive stages like the gametogenenesis to other developmental stages such as the putative phylotypic stage. We based our analysis on three different evolutionary parameters and used gene expression and genome sequence data from three flowering plant species, A. thaliana, rice (Oryza sativa), soybean (Glycine max), and the moss Physcomitrella patens. The expression data include developmental stages preceding (e.g., flower development), during and following gametogenesis (e.g., embryogenesis). The A. thaliana data additionally included stages from both sexes, whereas for the other species we used data from the male sex only. Our results show that the rate of evolution of genes expressed in reproductive stages is much higher relative to the extent of conservation of the putative phylotypic or other sporophytic stages. For this reason, we name this observation “evolutionary bulge” to express the stronger contribution of rapidly evolving and young genes to the transcriptome in reproductive developmental stages compared with other stages and discuss several, not mutually exclusive, hypotheses that may explain this pattern.

Results and Discussion

To test whether developmental stages and tissues involved in reproduction show a higher proportion of expressed genes of a younger evolutionary age and a higher rate of divergence between closely related species, we analyzed global expression during gamete development and the developmental stages before and after gametogenesis (table 1) with three evolutionary parameters. For this we combined microarray expression levels with measures of evolutionary conservation and polymorphism into evolutionary transcriptome indices of developmental stages. The evolutionary transcriptome index is calculated as:
Table 1.

Summary of Microarray-Based Expression Data from Different Developmental Stages Used in This Study.

SpeciesDevelopmental StageReferences
A. thalianaPrereproductive stage: Shoot apex 7 days (SA7D), Shoot apex 14 days (SA14D), Shoot after bolting (SAB), Flower stage 9 (FS9), Flower stage 12 (FS12), Flower stage 15 (FS15)Schmid et al. (2005)
Reproductive stage: Megaspore mother cell (MMC), Egg cell (EC), Unicellular pollen (UCP), Bicellular pollen (BCP), Tricellular pollen (TCP), Pollen mature (MP), Sperm (S), Pollentube (PT)Honys and Twell (2004); Borges et al. (2008); Wang et al. (2008); Wuest et al. (2010); Schmidt et al. (2011); Schmid et al. (2012)
Postreproductive stage: Quadrant embryo (Q), Globular embryo (G), Heart embryo (H), Torpedo embryo (T), Mature embryo (M)Le et al. (2010); Zuber et al. (2010)
RicePrereproductive stage
Shoot 4 weeks (S4W)Fujita et al. (2010)
Reproductive stage: Unicellular pollen (UCP), Bicellular pollen (BCP), Tricellular pollen (TCP), Mature pollen (MP), Germinated pollen (GP)Wei et al. (2010)
Postreproductive stage: Fertilization (F), Zygote formation (Z), 0 Days After Pollination embryo (0DAP), 1 Days After Pollination embryo (1DAP), 2DAP embryo, 3DAP embryo, 4DAP embryo, 9DAP embryo, 12DAP embryoFujita et al. (2010); Gao and Xue (2012)
SoybeanPrereproductive stage: Sporophyte (S)Haerizadeh et al. (2009)
Reproductive stage: Mature pollen (MP)Haerizadeh et al. (2009)
Postreproductive stage: Globular embryo (G), Heart embryo (H), Cotyledon (C), Seed parenchyma (SP), Seed meristem (SSM)Le et al. (2007)

Note.—Further details about the individual data sets are provided in the supplementary file S1, Supplementary Material online.

Summary of Microarray-Based Expression Data from Different Developmental Stages Used in This Study. Note.—Further details about the individual data sets are provided in the supplementary file S1, Supplementary Material online. , where E is the evolutionary parameter, s the developmental stage, E the value of the evolutionary parameter for gene i, n the total number of genes, and e the expression level of gene i in developmental stage s. In this study, we used gene age to calculate the TAI (Domazet-Lošo and Tautz 2010; Kalinka et al. 2010), sequence divergence () for the TDI and sequence diversity () for new transcriptome polymorphism index (TPI), which is a measure of current evolutionary constraint. The evolutionary transcriptome index is related to Pearson’s correlation coefficient but also incorporates variation in expression mean and variation (supplementary text S1, Supplementary Material online). This statistic is different from previous approaches addressing similar questions of evolutionary patterns during reproduction. Instead of focusing on significantly enriched genes which are biased towards specifically and/or strongly expressed genes, we considered the composition of the whole transcriptome. This enabled us to differentiate whether any evolutionary signals during development are caused by a few genes with strong effects or many genes with weak effects. It also allows to directly compare signal intensities with the previously described evolutionary hourglass during embryo development in A. thaliana. In all three species we observed the highest values of the three indices during reproductive stages (fig. 1), and they differ significantly from the values of the sporophytic developmental stages. To exclude that high point estimates of evolutionary parameters, which may be caused by low quality alignments, inflate diversity and polymorphism indices, we calculated TDI and TPI values from the weighted median (see Materials and Methods). Both indices are robust to the impact of low quality alignments of few genes (supplementary fig. S1, Supplementary Material online). Large absolute differences in the expression level of genes with a high and low expression level may allow a few genes to dominate the overall transcriptome index. We conducted our analyses with log2 transformed data, but additionally verified the bulge pattern with raw and log10-transformed expression data and found that the transcriptome indices are little influenced by genes with very high expression levels (supplementary fig. S2, Supplementary Material online). In A. thaliana, pollen tubes have the highest TAI value and therefore the highest proportion of young genes (t-test; for all pairwise comparisons with sporophytic stages). The highest TDI and TPI values occur in sperm cells (). In rice, the highest TAI, TDI, and TPI indices are observed in the mature and germinated pollen stages ( for all pairwise comparisons), and in soybean in the germinated pollen stage (). The A. thaliana and rice expression data cover consecutive reproductive stages in which the evolutionary indices increase during the maturation of the male gametes and peak at a final reproductive stage. Female gametophytic tissues show a similar trend in A. thaliana. Overall, there is a strong difference between gametophytic and sporophytic phases, suggesting a distinct evolutionary dynamic of reproductive compared with sporophytic stages. The comparison of evolutionary indices between pre- and postgametic developmental stages reveal that the lowest values of these indices are not consistently the lowest during embryogenesis, as suggested by the hourglass hypothesis. Except for A. thaliana, there is no particular stage during embryogenesis that has the lowest TAI, TDI, and TPI values (fig. 1).
F

Evolutionary transcriptome indices for Arabidopsis thaliana, rice, and soybean. Plot of TAI, TDI, and TPI for available data from A. thaliana, rice, and soybean for different developmental stages and tissues. Black lines indicate the transcriptome index and the colored dots are the indices calculated from random samples (with replacement) of genes to obtain a confidence interval of the index. Blue dots indicate nonreproductive tissues, green and red dots indicate male and female reproductive tissues, respectively.

Evolutionary transcriptome indices for Arabidopsis thaliana, rice, and soybean. Plot of TAI, TDI, and TPI for available data from A. thaliana, rice, and soybean for different developmental stages and tissues. Black lines indicate the transcriptome index and the colored dots are the indices calculated from random samples (with replacement) of genes to obtain a confidence interval of the index. Blue dots indicate nonreproductive tissues, green and red dots indicate male and female reproductive tissues, respectively. All transcriptome data for a given species were generated with the same Affymetrix array, but hybridizations were conducted in independent experiments. To test for confounding effects from the experimental conditions, we also calculated the transcriptome indices by preprocessing data sets independently (supplementary fig. S3, Supplementary Material online). This led to a relative shift of transcriptome indices between pre- and postgametophytic developmental stages, but the evolutionary bulge remained as a robust pattern. Using P values associated with gene expression from a larger data set for A. thaliana (supplementary table S1, Supplementary Material online) we calculated modified transcriptome indices (see Materials and Methods) by including only genes that are significantly expressed in a given stage with an False Discovery Rate (FDR) < 0.1 (supplementary fig. S4, Supplementary Material online). With few exceptions, reproductive tissues have higher evolutionary indices, and the number of significantly expressed genes differs between the reproductive and vegetative phase (Pina et al. 2005) (, U-test of the median number of genes significantly expressed in reproductive vs. sporophytic tissues). Since the three evolutionary indices may not be independent of each other, we analyzed their correlation with expression and accounted for potentially covarying factors (Gossmann, Santure, et al. 2014). By assuming that expression variation between samples is similar and the same genes are analyzed across stages, the evolutionary index is proportional to the correlation coefficient, r (For a derivation, see supplementary text S1, Supplementary Material online). The analysis of correlation supports the evolutionary bulge pattern because the highest value of r is observed for the gametophytic stages (table 2; subset of sporophytic and gametophytic stages). The only exception was the polymorphism index (TPI) of the two domesticated species (rice and soybean) which was influenced in the reproductive stage by differences in expression variance between reproductive and sporophytic stages (supplementary fig. S5, Supplementary Material online). Results of partial correlations, taking the other two evolutionary parameters, as well as gene length and d (a proxy for mutation rate) as covariates, are qualitatively very similar to the pairwise correlations (table 2). Patterns of polymorphism in domesticated species are affected by past domestication bottlenecks (Gossmann et al. 2010) and the global expression pattern of domesticated species may be substantially altered (e.g., Rapp et al. 2010; Yoo and Wendel 2014). Because the evolutionary bulge pattern is influenced by different processes in the three species (fig. 2 and supplementary fig. S6, Supplementary Material online), domestication may explain some differences of TPI values between the wild and the two crop plant species.
Table 2.

Correlation of Gene Expression with Three Evolutionary Indices.

Correlation of Gene Expression Intensity withGene Age
dN/dS
pN/pS
rr (partial)rr (partial)rr (partial)

A. thaliana
Flower stage 9−0.24***−0.11***−0.34***−0.22***−0.26***−0.13***
Egg cell−0.18***−0.11***−0.20***−0.11***−0.15***−0.07***
Sperm−0.14***−0.08***−0.13***−0.07***−0.09***−0.04***
Pollen tube−0.07***0.01n.s.−0.19***−0.16***−0.12***−0.04***
Heart−0.21***−0.09***−0.26***−0.16***−0.21***−0.11***
Rice
Shoot 4 weeks−0.15***0.01n.s.−0.25***−0.04***−0.06***0.01n.s.
Mature pollen−0.05*** −0.01n.s.−0.08***−0.01n.s.−0.06***−0.03***
Zygote formation−0.17***−0.02*−0.25***−0.04***−0.04***0.03**
Soybean
Sporophyte−0.10***−0.06***−0.22***−0.18***−0.10***−0.04***
Mature pollen−0.01n.s.0.00n.s.−0.11***−0.09***−0.06***−0.03**
Heart−0.07***−0.03***−0.16***−0.14***−0.07***−0.03**

NOTE.—The analysis was based on Pearson’s correlation and partial correlation for selected development stages. For the partial correlations, the other two evolutionary parameters as well as gene length and d were used as covariates.

*** P < 0.001.

F

Difference in expression level between young and old genes and between developmental stages. (A–D) Arabidopsis thaliana, (E–H) rice, and (I–L) soybean. (A, E, and I) Heatmaps of differences in linear regression slopes between pairs of developmental stages included in the analysis. (B, F, and J) Heatmaps of differences in expression level inferred from linear regressions between pairs of developmental stages for the first phylostratum (). (c, g, and k) Heatmaps of differences in expression level inferred from linear regressions between pair of developmental stages for the youngest phylotratum ( in A. thaliana; in rice; and in soybean). (d, h, and l) Mean, confidence interval and linear regression of expression level for several phylostrata at two stages: Flower stage 9 and mature pollen in A. thaliana, 2DAP and mature pollen in rice, sporophyte and mature pollen in soybean. Red crosses represent the expression level inferred from the linear regressions for PS = 1 and PS = 13/9/15, respectively. For abbreviations of developmental stages, see supplementary table S1, Supplementary Material online.

Difference in expression level between young and old genes and between developmental stages. (A–D) Arabidopsis thaliana, (E–H) rice, and (I–L) soybean. (A, E, and I) Heatmaps of differences in linear regression slopes between pairs of developmental stages included in the analysis. (B, F, and J) Heatmaps of differences in expression level inferred from linear regressions between pairs of developmental stages for the first phylostratum (). (c, g, and k) Heatmaps of differences in expression level inferred from linear regressions between pair of developmental stages for the youngest phylotratum ( in A. thaliana; in rice; and in soybean). (d, h, and l) Mean, confidence interval and linear regression of expression level for several phylostrata at two stages: Flower stage 9 and mature pollen in A. thaliana, 2DAP and mature pollen in rice, sporophyte and mature pollen in soybean. Red crosses represent the expression level inferred from the linear regressions for PS = 1 and PS = 13/9/15, respectively. For abbreviations of developmental stages, see supplementary table S1, Supplementary Material online. Correlation of Gene Expression with Three Evolutionary Indices. NOTE.—The analysis was based on Pearson’s correlation and partial correlation for selected development stages. For the partial correlations, the other two evolutionary parameters as well as gene length and d were used as covariates. *** P < 0.001. Different expression patterns during gamete development may result from upregulation of young or downregulation of old genes and may cause the bulge pattern. We performed linear regression of mean log2 normalized expression intensities over the gene age of each stage (fig. 2) to infer how strongly the correlation varied between stages. To illustrate changes in expression for different gene ages, we selected a pairwise comparison between mature pollen and a sporophytic stage for each species as an example (fig. 2). In all three species, the relative expression of both old and young genes differed between developmental stages, but the extent of change varied between stages and species. In A. thaliana, the differences were mainly caused by a change in the expression level of young genes (fig. 2B and ) and in rice by a higher expression of young and a lower expression of older genes (fig. 2F and ). In soybean, the change in expression was mainly caused by the lower expression level of old genes (fig. 2J and ). We also compared the expression levels between stages by grouping genes by their average values of and (supplementary fig. S6, Supplementary Material online) to test whether expression levels differ between slow and rapidly evolving genes. In A. thaliana, conserved genes (low and ) showed a lower expression level and divergent genes (high and ) a higher expression level in reproductive stages, especially in pollen and pollen tubes. In rice, genes with low and values showed strongly decreased mean expression levels in reproductive stages, whereas in soybean, mean expression levels decreased independently from and during reproduction. During reproductive development the tissue complexity of the gametophyte in higher plants is reduced to single cells or a few cells suggesting a reduced interaction between cells and cell types compared with other stages. Highly connected genes tend to evolve slower as a consequence of their functional importance (Alvarez-Ponce and Fares 2012). Such genes, however, may be less expressed in the gametophytic stage and therefore contribute less to the bulge pattern. This hypothesis is supported by a reduced expression level of old genes in all three species (fig. 2B, F, and J). Using data from the Arabidopsis interactome database (see Materials and Methods), we found that in the late stages of male gametophytes the level of interactions is reduced and shows the lowest value in the pollen tube (fig. 3, P  <  0.03). In the female gametophyte, which is a tissue of higher complexity, such a reduction in protein interactions is not observed. This difference suggests that factors contributing to the evolutionary bulge pattern may vary between male and female tissues.
F

Transcriptome indices for d, neutrality index and gene interactions for Arabidopsis thaliana. Upper panel: Median per gene d (synonymous per site substitution rate, a proxy for the neutral mutation rate) weighted by gene expression. Middle panel: Median per gene neutrality index (NI, a measurement of the departure from neutrality, with indicating neutrality) weighted by gene expression. Lower panel: Average number of gene interaction partners weighted by gene expression.

Transcriptome indices for d, neutrality index and gene interactions for Arabidopsis thaliana. Upper panel: Median per gene d (synonymous per site substitution rate, a proxy for the neutral mutation rate) weighted by gene expression. Middle panel: Median per gene neutrality index (NI, a measurement of the departure from neutrality, with indicating neutrality) weighted by gene expression. Lower panel: Average number of gene interaction partners weighted by gene expression. An evolutionary bulge pattern might be relatively less pronounced in self-fertilizing species, like the three species analyzed here, as they lack genetic diversity (Wright et al. 2013) and deleterious recessive mutations are rapidly removed in diploid tissues (Szövényi et al. 2014). On the other hand, an evolutionary bulge pattern should be independent from the mating system if low but sufficient levels of outcrossing occur in selfers (Bomblies et al. 2010), if most mutations are dominant and therefore exposed to selection in outcrossers, or if the reproductive success of the gametophyte is dominated by de novo mutations during gametogenesis. The silent sequence divergence between species, d, is a proxy for mutation rate and is increased for genes predominantly expressed in sperm and pollen tube stages in A. thaliana (fig. 3; ) which supports the latter explanation. Mosses have an extended generation of multicellular haploid gametophytes that differentiate into early vegetative and later reproductive stages and allow to investigate the effects of haploidy on transcriptome indices. In the expression data available for gametophytic and sporophytic stages of the moss P. patens (O’Donoghue et al. 2013), young genes contribute to the gene age of the gametophytic transcriptome as indicated by an increase of the TAI during the haploid stage (fig. 4; ). This is consistent with the evolutionary bulge and suggests that it may be a general pattern of plant reproductive evolution, although a broader taxonomic sampling will be necessary to verify this hypothesis.
F

Estimates of the TAI for three different developmental stages in the moss Physcomitrella patens.

Estimates of the TAI for three different developmental stages in the moss Physcomitrella patens. The pollen tube of A. thaliana showed lower TDI and TPI, but higher TAI values than the sperm cell (fig. 1; see also Cui et al. 2015), which indicates that tissue- or cell-specific effects within the gametophyte additionally influence the evolutionary bulge pattern. The expression weighted neutrality index (NI; NI < 1 indicates an increased role of positive selection while NI > 1 indicates purifying selection) differs between sperm and late pollen stages in A. thaliana (fig. 3, ) which shows a shift in the relative contribution of positive and negative selection and supports tissue-specific effects. A possible explanation is an enrichment of slightly deleterious mutations that are more effectively removed in pollen due to purifying selection, but it is difficult to disentangle the extent of the different selective forces on a gene-by-gene basis. As noted before, a focus on tissue-specific enriched genes represents a bias because these genes tend to show a narrow expression pattern and a high expression level. In plants, both factors correlate with the rate of molecular evolution, but in opposite directions (Slotte et al. 2011).

Conclusion

When compared with the transcriptomic hourglass of embryogenesis, the evolutionary bulge seems to be a more robust pattern of plant development. We reproduced the hourglass in A. thaliana, but found little support for it in rice or soybean which may result from an incomplete sampling of embryonic stages in the latter two species. This suggests that the hourglass pattern is restricted to a very short time span of plant embryo development. Therefore, further research is required to verify the transcriptomic hourglass as a general pattern of plant development because the transcriptome indices are not consistently lower during embryogenesis than in other developmental stages. In contrast, the evolutionary bulge of reproduction is seen in four plant species illustrating that the evolutionary forces acting during plant reproductive development leave a strong imprint on the genomic composition of protein-coding genes. This is consistent with the phenotypic diversity of reproductive traits but additionally highlights the importance of plant reproduction for understanding evolutionary forces determining the relationship of genomic and phenotypic variation in plants. We have shown that multiple, not mutually exclusive, causes may explain the bulge pattern, most prominently reduced tissue complexity of the gametophyte and a varying extent of selection on reproductive traits during gametogenesis as well as between male and female tissue. To further test whether the evolutionary bulge is a general pattern of plant evolution and to disentangle the different factors that are influencing it, the investigation of plant species with strong differences in their mode of reproduction in comparison to our study species will be useful. Examples are diecious plants, wind-pollinated outcrossing trees, insect-pollinated flowering plants and species with increased complexity of the gametophyte generation.

Materials and Methods

Sequence Data and Software

We obtained the genome sequences of A. thaliana (), rice (O. sativa, International Rice Genome Sequencing Project 2005), and soybean (G. max, Schmutz et al. 2010) from the plant genome database (Duvick et al. 2008) and the plant duplication database (Lee et al. 2013) along with their outgroups A. lyrata (Hu et al. 2011), Sorghum bicolor (Paterson et al. 2009), and Phaseolus vulgaris (Schmutz et al. 2014), respectively. Polymorphism data were obtained from 80 A. thaliana accessions (Cao et al. 2011). To identify coding single nucleotide polymorphism (SNP) information for rice we used the Rice Haplotype Map Project Database (second Generation, http://www.ncgr.ac.cn/RiceHap2/index.html) and soybean we used SNP information deposited in SNPdb (Sherry et al. 2001) and extracted coding SNPs from the soybean genome annotation. We used R and Python scripts to conduct statistical analyses.

Gene Expression Data

Gene expression data were obtained for the three plants species from the PlexDB (Dash et al. 2012) and GEO databases (Barrett et al. 2013). In particular, we focused on development stages preceding gametogenesis, during gametogenesis and embryogenic developments (table 1 and supplementary file S1, Supplementary Material online). For each species, Robust multiarray analysis (RMA; Irizarry et al. 2003) and invariant set (IS) methods were performed with the affy Bioconductor package to normalize all data sets simultaneously. Scatterplots of expression between replicates showed better results for RMA normalization (data not shown). Therefore, unless stated otherwise, expression data shown in this study are based on a normalization across experiments using RMA with log2 transformation. Since different laboratory conditions can affect expression patterns (Massonnet et al. 2010), we controlled for these effects in the A. thaliana data (Schmid et al. 2005) by removing data sets that were obtained from plants with different growth conditions before RNA extraction (supplementary file S1, Supplementary Material online). To check whether the differences in expression between experimental conditions were negligible compared to the differences between stages, we generated scatterplots for the mature pollen stage (supplementary fig. S7, Supplementary Material online) that was common to different experiments (Honys and Twell 2004; Schmid et al. 2005; Borges et al. 2008; Wang et al. 2008). Scatterplots showed an expression profile that was similar between experiments with RMA normalization over all experiments and when normalized independently (supplementary fig. S7b and c, Supplementary Material online) and also showed more variation between expression levels when compared with nonnormalized and IS normalized expression (supplementary fig. S7a, d, and e, Supplementary Material online). Scatterplots between nonnormalized experiments and between IS normalized experiments showed less variation in expression levels, but in general, the correlations between expression levels from different experiments were highly independent from the normalization method. For rice and soybean, all experiments were kept for normalization. Gene expression data for P. patens for mature gametophyte, early- and midsporophyte (O’Donoghue et al. 2013) were downloaded from GEO (GSE32928) and the array and genome annotation (V1.6) was obtained from www.cosmoss.org/physcome_project/wiki/Downloads. In this data set, two samples per chip are hybridized, each with a different fluorescent dye (green Cy3 and red Cy5). Expression values were averaged across samples.

Evolutionary Parameters

We obtained estimates for TAI, TDI, and TPI for each developmental stage. A transcriptome index is the average of an evolutionary parameter like gene age (TAI), divergence (TDI), and diversity (TPI) that is weighted by the expression level of each gene. Confidence intervals were obtained by bootstrapping, using 100 sets of genes for each experimental stage. For estimates of gene age, we followed the procedure of Quint et al. (2012) which is based on the construction of a phylostratigraphic map. We used one-way BLAST (default parameters) hits against a set of genomes that are assigned to certain phylostrata and the BLAST hit to the most distant phylostratum defines the gene age (Albà and Castresana 2007). The oldest genes have a gene age value of 1 and the highest gene age value was assigned to genes that are specific to a given species (youngest genes). For A. thaliana we classified 13 phylostrata, 9 for rice, 15 for soybean, and 5 for P. patens. Altogether we used 40 plant genomes, details about the hierarchical order, the genomes assigned to each phylostratum and number of genes with assigned gene age can be found in supplementary figure S8, Supplementary Material online. For each species the largest age category was gene age of value 1. To calculate a per gene estimate of divergence, we calculated using pairwise alignments of homologous genes identified by INPARANOID from the whole-genome comparison with its respective outgroup (Remm et al. 2001; Ostlund et al. 2010). We obtained per gene estimates of () estimates for genes specific to species pairs with the KaKs_calculator (Zhang et al. 2006). We also introduce a new test statistic, the TPI. where s is the developmental stage, n the number of genes, e the expression intensity of gene i in developmental stage s, P and P the numbers of nonsynonymous and synonymous polymorphisms, respectively, and N and S are the numbers of nonsynonymous and synonymous sites, respectively. We used the ratio of nonsynonymous per site polymorphisms to synonymous per site polymorphism to estimate the distribution of fitness effects. Higher values of reflect an excess of slightly deleterious mutations (Keightley and Eyre-Walker 2007). For technical reasons we used rather than P as suggested by Stoletzki and Eyre-Walker (2011) because some genes have no synonymous polymorphisms and therefore would need to be excluded from the analysis which is biased (Stoletzki and Eyre-Walker 2011). For compactness we refer to the term as throughout the manuscript. We tested whether transcriptome indices are different between stages by bootstrapping 100 samples of each index per stage and then performing a two-sample t-test to test for the differences in the means of bootstrapped values. If not noted otherwise, only the highest P value in the comparison of stages is reported.

Modified Variants of the Transcriptome Index

We calculated the weighted median transcriptome index of an evolutionary parameter x and assumed that . The weighted median of the evolutionary index is then x with f such that The standardized transcriptome index that does not consider genes with a nonsignificant expression (supplementary fig. S4, Supplementary Material online) was calculated as follows: where is the arithmetic mean of and n the number of significantly expressed genes. We further obtained per gene neutrality index (NI) for A. thaliana as follows: where . The number of protein interactions for A. thaliana were obtained from the Arabidopsis interactome database (ftp://ftp.arabidopsis.org/home/tair/Proteins/Protein_interaction_data/Interactome2.0/).

Supplementary Material

Supplementary material is available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
  65 in total

1.  Positive darwinian selection at the imprinted MEDEA locus in plants.

Authors:  Charles Spillane; Karl J Schmid; Sylvia Laoueillé-Duprat; Stéphane Pien; Juan-Miguel Escobar-Restrepo; Célia Baroux; Valeria Gagliardini; Damian R Page; Kenneth H Wolfe; Ueli Grossniklaus
Journal:  Nature       Date:  2007-07-19       Impact factor: 49.962

2.  Arabidopsis female gametophyte gene expression map reveals similarities between plant and animal gametes.

Authors:  Samuel E Wuest; Kitty Vijverberg; Anja Schmidt; Manuel Weiss; Jacqueline Gheyselinck; Miriam Lohr; Frank Wellmer; Jörg Rahnenführer; Christian von Mering; Ueli Grossniklaus
Journal:  Curr Biol       Date:  2010-03-11       Impact factor: 10.834

Review 3.  The origins of reproductive isolation in plants.

Authors:  Eric Baack; Maria Clara Melo; Loren H Rieseberg; Daniel Ortiz-Barrientos
Journal:  New Phytol       Date:  2015-05-05       Impact factor: 10.151

4.  A transcriptomic hourglass in plant embryogenesis.

Authors:  Marcel Quint; Hajk-Georg Drost; Alexander Gabel; Kristian Karsten Ullrich; Markus Bönn; Ivo Grosse
Journal:  Nature       Date:  2012-09-05       Impact factor: 49.962

Review 5.  Evolutionary consequences of self-fertilization in plants.

Authors:  Stephen I Wright; Susan Kalisz; Tanja Slotte
Journal:  Proc Biol Sci       Date:  2013-06-07       Impact factor: 5.349

6.  Global analysis of expression profiles of rice receptor-like kinase genes.

Authors:  Lin-Lin Gao; Hong-Wei Xue
Journal:  Mol Plant       Date:  2011-07-16       Impact factor: 13.164

7.  Joint inference of the distribution of fitness effects of deleterious mutations and population demography based on nucleotide polymorphism frequencies.

Authors:  Peter D Keightley; Adam Eyre-Walker
Journal:  Genetics       Date:  2007-12       Impact factor: 4.562

8.  Transcriptome analyses show changes in gene expression to accompany pollen germination and tube growth in Arabidopsis.

Authors:  Yi Wang; Wen-Zheng Zhang; Lian-Fen Song; Jun-Jie Zou; Zhen Su; Wei-Hua Wu
Journal:  Plant Physiol       Date:  2008-09-05       Impact factor: 8.340

9.  Genome-wide transcriptome analysis of gametophyte development in Physcomitrella patens.

Authors:  Lihong Xiao; Hui Wang; Ping Wan; Tingyun Kuang; Yikun He
Journal:  BMC Plant Biol       Date:  2011-12-15       Impact factor: 4.215

10.  InParanoid 7: new algorithms and tools for eukaryotic orthology analysis.

Authors:  Gabriel Ostlund; Thomas Schmitt; Kristoffer Forslund; Tina Köstler; David N Messina; Sanjit Roopra; Oliver Frings; Erik L L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2009-11-05       Impact factor: 16.971

View more
  14 in total

1.  Comparative transcriptomic analysis reveals conserved programmes underpinning organogenesis and reproduction in land plants.

Authors:  Irene Julca; Camilla Ferrari; María Flores-Tornero; Sebastian Proost; Ann-Cathrin Lindner; Dieter Hackenberg; Lenka Steinbachová; Christos Michaelidis; Sónia Gomes Pereira; Chandra Shekhar Misra; Tomokazu Kawashima; Michael Borg; Frédéric Berger; Jacob Goldberg; Mark Johnson; David Honys; David Twell; Stefanie Sprunck; Thomas Dresselhaus; Jörg D Becker; Marek Mutwil
Journal:  Nat Plants       Date:  2021-07-12       Impact factor: 15.793

2.  Old and young duplicate genes reveal different responses to environmental changes in Arachis duranensis.

Authors:  Hui Song; Juan Sun; Guofeng Yang
Journal:  Mol Genet Genomics       Date:  2019-05-10       Impact factor: 3.291

3.  Plant organ evolution revealed by phylotranscriptomics in Arabidopsis thaliana.

Authors:  Li Lei; Joshua G Steffen; Edward J Osborne; Christopher Toomajian
Journal:  Sci Rep       Date:  2017-08-08       Impact factor: 4.379

4.  Differential evolutionary patterns and expression levels between sex-specific and somatic tissue-specific genes in peanut.

Authors:  Hui Song; Qingping Zhang; Pei Tian; Zhibiao Nan
Journal:  Sci Rep       Date:  2017-08-21       Impact factor: 4.379

5.  Slow evolution of sex-biased genes in the reproductive tissue of the dioecious plant Salix viminalis.

Authors:  Iulia Darolti; Alison E Wright; Pascal Pucholt; Sofia Berlin; Judith E Mank
Journal:  Mol Ecol       Date:  2018-01-22       Impact factor: 6.185

6.  Mechanisms of Transmission Ratio Distortion at Hybrid Sterility Loci Within and Between Mimulus Species.

Authors:  Rachel E Kerwin; Andrea L Sweigart
Journal:  G3 (Bethesda)       Date:  2017-11-06       Impact factor: 3.154

7.  Lineage-Specific Expression Divergence in Grasses Is Associated with Male Reproduction, Host-Pathogen Defense, and Domestication.

Authors:  Raquel Assis
Journal:  Genome Biol Evol       Date:  2019-01-01       Impact factor: 3.416

8.  Deleterious Mutation Accumulation in Arabidopsis thaliana Pollen Genes: A Role for a Recent Relaxation of Selection.

Authors:  Mark C Harrison; Eamonn B Mallon; Dave Twell; Robert L Hammond
Journal:  Genome Biol Evol       Date:  2019-07-01       Impact factor: 3.416

9.  Do plants have a segregated germline?

Authors:  Robert Lanfear
Journal:  PLoS Biol       Date:  2018-05-16       Impact factor: 8.029

Review 10.  Evolutionary Genomics of Plant Gametophytic Selection.

Authors:  Felix E G Beaudry; Joanna L Rifkin; Spencer C H Barrett; Stephen I Wright
Journal:  Plant Commun       Date:  2020-10-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.