Literature DB >> 34152039

Reciprocal allopolyploid grasses (Festuca × Lolium) display stable patterns of genome dominance.

Marek Glombik1,2, Dario Copetti3,4, Jan Bartos1, Stepan Stoces1, Zbigniew Zwierzykowski5, Tom Ruttink6, Jonathan F Wendel7, Martin Duchoslav8, Jaroslav Dolezel1, Bruno Studer3, David Kopecky1.   

Abstract

Allopolyploidization entailing the merger of two distinct genomes in a single hybrid organism, is an important process in plant evolution and a valuable tool in breeding programs. Newly established hybrids often experience massive genomic perturbations, including karyotype reshuffling and gene expression modifications. These phenomena may be asymmetric with respect to the two progenitors, with one of the parental genomes being "dominant." Such "genome dominance" can manifest in several ways, including biased homoeolog gene expression and expression level dominance. Here we employed a k-mer-based approach to study gene expression in reciprocal Festuca pratensis Huds. × Lolium multiflorum Lam. allopolyploid grasses. Our study revealed significantly more genes where expression mimicked that of the Lolium parent compared with the Festuca parent. This genome dominance was heritable to successive generation and its direction was only slightly modified by environmental conditions and plant age. Our results suggest that Lolium genome dominance was at least partially caused by its more efficient trans-acting gene expression regulatory factors. Unraveling the mechanisms responsible for propagation of parent-specific traits in hybrid crops contributes to our understanding of allopolyploid genome evolution and opens a way to targeted breeding strategies.
© 2021 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.

Entities:  

Keywords:  Gene expression; allopolyploidy; cis/trans regulation; genome dominance; homoeolog; interspecific hybrids

Mesh:

Year:  2021        PMID: 34152039      PMCID: PMC8518873          DOI: 10.1111/tpj.15375

Source DB:  PubMed          Journal:  Plant J        ISSN: 0960-7412            Impact factor:   6.417


INTRODUCTION

Interspecific hybridization and polyploidization are fundamental processes underlying the evolution of plant species. All angiosperms have experienced at least one round of whole genome doubling (WGD) during their evolutionary history, with multiple WGD episodes being commonplace during angiosperm diversification (Escudero and Wendel, 2020; Jiao et al., 2011; Landis et al., 2018; Soltis and Soltis, 2016; Soltis et al., 2016; Van de Peer et al., 2017; Wendel, 2015). The evolutionary success of polyploids is presumed to reflect the evolutionary genetic novelty acquired during polyploidization (Soltis and Soltis, 1993), although insights into the genomic basis of adaptive traits is only beginning to emerge (Nieto‐Feliner et al., 2020). Two types of polyploids are widely recognized, auto‐ and allopolyploids, although these terms have several definitions and can represent endpoints of several continua (Doyle and Sherman‐Broyles, 2017; Mason and Wendel, 2020; Wendel and Doyle, 2005). Under the taxonomic definition, autopolyploids arise from WGD within a species, whereas allopolyploids arise from WGD accompanying the merger of two distinct species. Allopolyploidy often leads to heterosis, the manifestation of competitive advantage of the F1 over diploid progenitors (Comai, 2005), masks deleterious recessive alleles leading to increased mutational robustness (Madlung, 2013), and conveys increased stress tolerance through several mechanisms such as delayed reproduction, longer life span, and increased defense against herbivores and pathogens (Lohaus and Van de Peer, 2016). Besides the key roles that interspecific hybridization and polyploidization play in plant evolution and diversification, these processes lay at the origin of many crops, including wheat, oilseed rape, banana, and cotton (Renny‐Byfield and Wendel, 2014; Salman‐Minkov et al., 2016). Moreover, interspecific hybridization is frequently used in breeding programs to incorporate novel alleles into crops, often for beneficial characteristics such as tolerance or resistance to abiotic or biotic stresses. This approach has been used to develop hybrid grasses within the Festuca–Lolium complex. Festulolium (Festuca × Lolium) hybrids combine good palatability and digestibility, rapid establishment from seed, and competitiveness from ryegrasses (Lolium spp.) with abiotic and biotic stress tolerance from fescues (Festuca spp.) (Ghesquiere et al., 2010). Despite the potential benefits of hybridization and polyploidization, many newly formed allopolyploids face survival challenges imposed by a diversity of genic incompatibilities and genomic instabilities resulting from the merger of differentiated genomes. These include numerous forms of chromosomal rearrangements as well as mitotic and meiotic abnormalities (Glombik et al., 2020). In addition to structural genomic changes, allopolyploidy is often accompanied by genome‐wide gene expression alterations (Yoo et al., 2014), likely mediated by epigenetic responses to the novel genomic environment (Ding and Chen, 2018; Song and Chen, 2015). For instance, Yoo et al. (2013) monitored gene expression of over 50 000 genes in natural allopolyploid cotton, as well as in its putative progenitor species, their F1 interspecific hybrids, synthetic allopolyploids, and natural and domesticated cultivars. It was reported that (i) many duplicated genes collectively display expression level dominance (ELD), where the overall level of expression (for both homoeologs) is equivalent to the expression level of one of the two parents, with a bias favoring one parent, even though on a gene‐by‐gene basis this dominance could be reversed, and (ii) genome‐wide homoeolog expression bias (HEB), where the homoeolog from a dominant parental genome contributes to the expression of the particular gene significantly more than the homoeolog from a submissive parental genome. Indeed, polyploidy appears to be universally characterized by varying degrees of genic up‐ and downregulation, homoeolog bias, and alterations in the aggregate expression level of homoeolog pairs (Grover et al., 2012; Yoo et al., 2014). In general, there seem to be only a few, if any, homoeologous gene pairs in hybrids that contribute equally to the transcript pool in all tissues (Alger and Edger, 2020). Gene expression is transcriptionally regulated by cis‐acting regulatory elements and trans‐acting regulatory factors (Williams et al., 2007). Cis‐acting elements have an allele‐specific effect on the expression of the downstream gene, whereas trans‐acting factors affect expression of other genes in the genome. In the case of allopolyploids, trans‐acting factors may influence all homoeologous alleles of downstream‐regulated genes in a hybrid nucleus, although not necessarily equivalently (Bottani et al., 2018). Cis‐acting regulatory elements are believed to account predominantly for changes in gene expression in hybrids of genetically more divergent progenitors (e.g., interspecific hybrids), while trans‐acting factors predominantly account for gene expression changes in hybrids of genetically less divergent progenitors (e.g. intraspecific hybrids) (Bell et al., 2013; Lemos et al., 2008; Wittkopp et al., 2008). One of the intriguing questions regarding cis‐ and trans‐acting regulation in polyploids concerns its mechanistic relationship to the expression‐level phenomena of ELD and HEB, and how this relates to the phenotype. Bao et al. (2019) described a high level of trans‐acting regulation accompanying the domestication of cotton, and proposed that enhanced trans‐acting regulatory evolution may be both a general property of allopolyploidy and help to explain the evolutionary novelty accompanying allopolyploidization (Bottani et al., 2018; Hu and Wendel, 2019). Crosses between the two forage grasses Festuca pratensis Huds. and Lolium multiflorum Lam. produce hybrids of considerable agricultural interest. These Festulolium hybrids display Lolium genome dominance in the form of gradual replacement of Festuca chromosomes by those of Lolium in subsequent generations (Kopecky et al., 2006; Zwierzykowski et al., 2006). Here, we investigate genome‐wide transcriptome changes in the first two generations of reciprocal L. multiflorum × F. pratensis hybrids using RNA‐sequencing (RNA‐seq). k‐mer–based RNA‐seq analysis was used to avoid any potential bias of the conventional implementation of mapping reads from hybrids to the reference sequence from one parental genome. In addition to exploring the consequences of genomic merger on patterns of gene expression in two successive generations (F1 and F2), we tested if the growth conditions and plant age may alter genome dominance at both HEB and ELD levels. We further aim to provide insights into the mechanisms of ELD and HEB potentially applicable to other newly developed hybrid organisms.

RESULTS

Development of reference‐guided transcriptome assemblies and k‐mer databases for Festuca and Lolium

Mapping parental RNA‐seq data to the reference genome sequence assembly of the respective species and creating reference‐guided transcriptome assemblies yielded 102 672 and 101 268 transcripts for F. pratensis and L. multiflorum, respectively. Filtering the transcriptomes for redundancy and longest isoform reduced this number to 39 764 transcripts for F. pratensis and 44 603 transcripts for L. multiflorum. Translated sequences of the same transcripts were then compared by reciprocal BLAST between the two species‐specific reference‐guided transcriptome assemblies, leading to the identification of 10 381 putative orthologous gene pairs between the two parental species. To examine possible reciprocal reference‐read mapping bias, RNA‐seq reads from both parental species were mapped to their own reference transcriptome (“self reference”) or to the reference transcriptome of the other parent (“non‐self reference”). We observed a statistically significant difference in the number of genes displaying differential expression from two parental homoeologs (hypergeometric test, P << 0.001; Figure S1a,b). Using the Festuca reference transcriptome, 508 genes were significantly overexpressed in Festuca compared with Lolium, while 313 genes displayed significant overexpression in Lolium compared with Festuca. In contrast, using the Lolium reference transcriptome, 783 genes were overexpressed in Lolium and 419 genes were overexpressed in Festuca, indicating strong bias towards the “self reference” transcriptome (Figure S1a,b). Moreover, in both comparisons, less than half of the genes showing differential gene expression towards one of the parents (245 and 230 for F > L and L > F, respectively) were shared when mapped to the transcriptome reference of the other species (Table S1). These results indicate bias towards the “self reference” transcriptome, which would subsequently lead to incorrect differential gene expression assessment of genome dominance on expression level in hybrid samples. To reduce the bias in assessing differences in the sequences of Festuca and Lolium and the subsequent differential gene expression of their hybrid, a k‐mer–based approach was implemented. k‐mers of length 31 bp were used because they have been previously been proven to be suitable for identification of different allelic variants (Rahman et al., 2018; Voichek and Weigel, 2020), a large number of these is unique for each species in our RNA‐seq dataset, and it is the longest k‐mer length that can effectively be represented on 64‐bit machines. From RNA‐seq reads from both parents, over 83 million k‐mers of 31 bp length were identified. Such strings constituted two species‐specific k‐mer databases with over 20 million k‐mers for each parental species. After the filtering steps (see Experimental procedures), almost 165 000 pairs of k‐mers between parents were identified including 50 362 unique and species‐specific k‐mers mapped to 10 381 orthologous gene pairs with unique gene assignment and corresponding to identified orthologous gene pairs (Table S2). Additional filtering steps were applied to address the fact that more than one k‐mer of 31 bp length can originate from a single 100 bp read and thus, to remove redundancy. Despite the high reduction in the number of k‐mers used for gene expression analysis, this condition was chosen to achieve accurate and unbiased results. This resulted in 26 494 k‐mers (and their associated read counts) usable for determination of gene expression between 7958 of Festuca and Lolium genes, and 5790 of these genes were sufficiently expressed in all samples to be used for comparisons between parents and/or hybrids (Table S3). To reveal the efficacy of the k‐mer–based approach, Spearman’s correlations were calculated between three read count datasets (7958 genes) for each species: (i) generated by the k‐mer–based approach; (ii) generated by mapping to the reference transcriptome of the corresponding parent “self reference”; and (iii) generated by mapping to the reference transcriptome of the other parent, “non‐self reference.” For Festuca, the average Spearman’s correlations between the k‐mer counts and the “self reference” were 0.95, between k‐mer counts and the “non‐self reference” were 0.94, and between the “self reference” and the “non‐self reference” were 0.92. Similarly, for Lolium the average Spearman’s correlations between the k‐mer counts and the “self reference” were 0.96, between k‐mer counts and the “non‐self reference” were 0.92, and between the “self reference” and the “non‐self reference” were 0.95 (Figure S1c,d). These results indicate that our k‐mer approach is a suitable strategy for hybrid gene expression analysis and is not biased by differential mapping efficacy to a reference sequence of either parent.

HEB

The analysis of HEB reflects the relative contribution of two parental homoeologs to expression of a particular gene, either on a gene‐by‐gene basis or on a genome‐wide scale. In total, we were able to monitor homoeolog‐specific gene expression of 5790 genes described above in F1 and F2 generations of Festuca × Lolium reciprocal hybrids (Figure 1). Of those, 5224 (90.2%) and 4876 (82.5%) showed equivalent expression of both parental homoeologs in the three plants of Festuca × Lolium and three plants of Lolium × Festuca hybrids, respectively. A slightly higher number of genes (287 and 530 of 5790 genes) displayed higher expression of the Festuca homoeolog, compared with the genes showing higher expression of the Lolium homoeolog (279 and 384 of 5790) in F1 Festuca × Lolium and Lolium × Festuca hybrids, respectively (Figure 2).
Figure 1

Experimental design.

The initial cross involved tetraploid Lolium multiflorum and tetraploid Festuca pratensis. Three and three independent reciprocal F1 hybrids were obtained and used in the study. One of these plants (Festuca × Lolium) was self‐fertile and produced three plants of F2 generation without cross‐pollination with other hybrids. All nine hybrid plants together with parents were used for the analysis of gene expression. Each plant was represented by three biological replicates for RNA‐seq (tissues sampled from different ramets of the same plant). All plants were cytogenetically verified (representative karyotypes are given in the figure). WGD, whole genome doubling.

Figure 2

Homoeologous expression bias (HEB) in reciprocal Festuca × Lolium hybrids.

HEB in the F1 generation of Festuca × Lolium (F1 F × L) and Lolium × Festuca (F1 L × F) hybrids, the F2 generation of Festuca × Lolium (F2 F × L) and F1 and F2 generations of Festuca × Lolium hybrids after 4 years of cultivation (F1 F × L aging and F2 F × L aging, respectively). HEB was computed for 5790 genes. Three categories of relative homoeolog expression were defined: L = F (the expression of both homoeologs is not significantly different), L > F (Lolium homoeolog is significantly more expressed than Festuca homoeolog) and F > L (Festuca homoeolog is significantly more expressed than Lolium homoeolog).

Experimental design. The initial cross involved tetraploid Lolium multiflorum and tetraploid Festuca pratensis. Three and three independent reciprocal F1 hybrids were obtained and used in the study. One of these plants (Festuca × Lolium) was self‐fertile and produced three plants of F2 generation without cross‐pollination with other hybrids. All nine hybrid plants together with parents were used for the analysis of gene expression. Each plant was represented by three biological replicates for RNA‐seq (tissues sampled from different ramets of the same plant). All plants were cytogenetically verified (representative karyotypes are given in the figure). WGD, whole genome doubling. Homoeologous expression bias (HEB) in reciprocal Festuca × Lolium hybrids. HEB in the F1 generation of Festuca × Lolium (F1 F × L) and Lolium × Festuca (F1 L × F) hybrids, the F2 generation of Festuca × Lolium (F2 F × L) and F1 and F2 generations of Festuca × Lolium hybrids after 4 years of cultivation (F1 F × L aging and F2 F × L aging, respectively). HEB was computed for 5790 genes. Three categories of relative homoeolog expression were defined: L = F (the expression of both homoeologs is not significantly different), L > F (Lolium homoeolog is significantly more expressed than Festuca homoeolog) and F > L (Festuca homoeolog is significantly more expressed than Lolium homoeolog). The comparison of the relative contribution of parental homoeologs in hybrids with the expression levels in parents revealed that the transmission of parental expression levels accounted for 4443 (76.7%) and 4239 (73.0%) genes in Festuca × Lolium and Lolium × Festuca hybrids, respectively (Table 1). In addition, 985 (17.0%) and 907 (15.7%) homoeolog pairs with significant differential expression level between the parents displayed non‐biased expression in Festuca × Lolium and Lolium × Festuca hybrids, respectively. Another 349 (6.0%) and 619 (10.7%) genes had the same expression in parents, but displayed preferential expression from one parental homoeolog in Festuca × Lolium and Lolium × Festuca hybrids, respectively. There was a slightly higher number of genes for which the Festuca homoeolog was preferentially expressed than those for which the Lolium homoeolog was preferentially expressed in reciprocal hybrids. A low number of genes (13 and 39) reverted the expression bias observed in the parents (Table 1).
Table 1

Homoeolog expression bias in Festulolium hybrids

Expression in parentsExpression in progenyPhenomenaF1 L × FF1 F × LF2 F × LF1 F × L agingF2 F × L aging
L = FL = FParental legacy3969 (68.55%)4239 (73.21%)3927 (67.82%)3908 (67.5%)3961 (68.41%)
L > FL > FParental legacy126 (2.18%)110 (1.9%)139 (2.4%)150 (2.59%)145 (2.5%)
L < FL < FParental legacy130 (2.25%)94 (1.62%)187 (3.23%)121 (2.09%)177 (3.06%)
L < FL = FBias lost in hybrids416 (7.18%)461 (7.96%)360 (6.22%)428 (7.39%)366 (6.32%)
L > FL = FBias lost in hybrids491 (8.48%)524 (9.05%)487 (8.41%)475 (8.2%)479 (8.27%)
L = FL > FNovel bias in hybrids242 (4.18%)162 (2.8%)281 (4.85%)318 (5.49%)276 (4.77%)
L = FL < FNovel bias in hybrids377 (6.51%)187 (3.23%)380 (6.56%)362 (6.25%)351 (6.06%)
L < FL > FBias reverted in hybrids16 (0.28%)7 (0.12%)15 (0.26%)13 (0.22%)19 (0.33%)
L > FL < FBias reverted in hybrids23 (0.4%)6 (0.1%)14 (0.24%)15 (0.26%)16 (0.28%)
Total number of genes5790 (100%)5790 (100%)5790 (100%)5790 (100%)5790 (100%)

Homoeolog expression bias analyzed in various F1 (F1 L × F, F1 F × L, F2 F × L, and F1 F × L [F1 F × L aging] and F2 [F2 F × L aging] hybrids after 4 years of cultivation) compared with the ratio of gene expression levels in the parents. L = F denotes equal expression; L > F and L < F denote Lolium‐biased and Festuca‐biased expression, respectively.

Homoeolog expression bias in Festulolium hybrids Homoeolog expression bias analyzed in various F1 (F1 L × F, F1 F × L, F2 F × L, and F1 F × L [F1 F × L aging] and F2 [F2 F × L aging] hybrids after 4 years of cultivation) compared with the ratio of gene expression levels in the parents. L = F denotes equal expression; L > F and L < F denote Lolium‐biased and Festuca‐biased expression, respectively. Hybrids were also searched for homoeolog‐specific silencing. Homoeologs were considered silenced when the average count per million (CPM) value from all samples was <1. Overall, only a few genes of the 5790 genes exhibited silencing of one homoeolog, but the pattern was the same in reciprocal hybrids: a higher number of genes was observed with a silenced Lolium homoeolog (seven and 40) than with a silenced Festuca homoeolog (one and four) (Figure 3).
Figure 3

Number of genes with homoeolog‐specific silencing.

Silencing of homoeologs observed in the F1 generation of Festuca × Lolium (F1 F × L) and Lolium × Festuca (F1 L × F) hybrids, the F2 generation of Festuca × Lolium (F2 F × L) and the F1 and F2 generations of Festuca × Lolium hybrids after 4 years of cultivation (F1 F × L aging and F2 F × L aging, respectively). Homoeologs were considered silenced when the average count per million value in all samples (three plants, each with three biological replicates) was <1. F, Festuca pratensis homoeolog; L, Lolium multiflorum homoeolog.

Number of genes with homoeolog‐specific silencing. Silencing of homoeologs observed in the F1 generation of Festuca × Lolium (F1 F × L) and Lolium × Festuca (F1 L × F) hybrids, the F2 generation of Festuca × Lolium (F2 F × L) and the F1 and F2 generations of Festuca × Lolium hybrids after 4 years of cultivation (F1 F × L aging and F2 F × L aging, respectively). Homoeologs were considered silenced when the average count per million value in all samples (three plants, each with three biological replicates) was <1. F, Festuca pratensis homoeolog; L, Lolium multiflorum homoeolog. Overall, HEB analysis revealed that both homoeologs contributed equally to gene expression in hybrids for the majority of genes (>84% of 5790 genes). The number of genes displaying preferential expression of the homoeolog of one parent was almost the same for overexpression of both Lolium and Festuca homoeologs, with only slightly more genes preferentially expressing the Festuca than the Lolium homoeolog. Parental legacy (one parent has a higher expression of a particular gene than the other parent, with the hybrid mirroring this difference) was observed for ≥70% of the genes studied. Moreover, it seems that the direction of the cross plays only a minimal role in determining HEB.

ELD

A comparison of the total expression level of homoeologous gene pairs in hybrids with the expression level of their two parents enabled assignment of 4851–4905 genes to one of the 13 ELD categories previously described by Rapp et al. (2009). For the remaining 885–939 of genes in the reciprocal hybrids, it was not possible to assign unambiguously a particular gene to a category; such genes were classified as “unassigned.” From the assigned genes, 2811 (57.7%) and 2629 (54.2%) showed no difference in expression between both parents and between parents and hybrids (category 0) in three plants of Festuca × Lolium and three plants of Lolium × Festuca hybrids, respectively. Additivity, a category where parents differ in gene expression level and the hybrid displays a mid‐parent level of expression, accounted for 272 (5.6%) and 274 (5.7%) genes. Transgressive expression was observed for 981 (20.2%) and 1173 genes (24.1%) in the hybrid progeny with downregulation being more frequent than upregulation. ELD, where parents differ in the level of expression and the hybrid displays the expression level of one parent, was observed in 805 (16.6%) and 775 (15.9%) genes in Festuca × Lolium and Lolium × Festuca hybrids, respectively. Significantly more genes were assigned to Lolium expression dominance categories, where the hybrid displays the same expression level as the Lolium parent (category II and XI) as compared with the number of genes assigned to Festuca expression dominance (categories IV and IX) in Festuca × Lolium and Lolium × Festuca hybrids (489 versus 316 and 434 versus 341, respectively; Table 2, Table S4) using a paired sample sign test (P < 0.004). The results were highly consistent between reciprocal hybrids with 82.3% overlap of genes assigned to particular categories, suggesting that Lolium expression dominance is established immediately after hybridization with no effect of the cross direction.
Table 2

Expression level dominance (ELD) in Festulolium hybrids

CategoryIIXIIVIXTotal assigned genesUnassignedTotal analyzed genes
F1 L × F234 (4.8%)200 (4.1%)155 (3.2%)186 (3.8%)4869 (100%)9215790
434 (8.9%)341 (7%)
F1 F × L256 (5.3%)233 (4.8%)146 (3%)170 (3.5%)4895 (100%)8955790
489 (10.1%)316 (6.5%)
F2 F × L255 (5.2%)239 (4.9%)135 (2.8%)148 (3%)4851 (100%)9395790
494 (10.1%)283 (5.8%)
F1 F × L aging237 (4.9%)238 (4.9%)141 (2.9%)158 (3.2%)4862 (100%)9285790
475 (9.8%)299 (6.1%)
F2 F × L aging289 (5.9%)207 (4.2%)165 (3.4%)123 (2.5%)4905 (100%)8855790
496 (10.1%)288 (5.9%)
Overlap F1, F2 F × L140 (2.9%)136 (2.8%)65 (1.3%)70 (1.4%)2944 (60.3%)3285790
276 (5.7%)135 (2.7%)
Overlap F1 F × L, F2 L × F193 (4%)167 (3.4%)114 (2.4%)140 (2.9%)3999 (82.3%)5275790
360 (7.4%)254 (5.3%)
Overlap F1, F2 F × L aging191 (3.9%)144 (3%)90 (1.9%)86 (1.8%)3298 (67.5%)2975790
335 (6.9%)176 (3.7%)

ELD scored in F1 generation of Festuca × Lolium (F1 F × L) and Lolium × Festuca (F1 L × F) hybrids, F2 generation of Festuca × Lolium (F2 F × L), F1 and F2 generations of Festuca × Lolium hybrids after 4 years of cultivation (F1 F × L aging and F2 F × L aging, respectively), and the overlaps of gene distribution in the ELD categories. Roman numerals mark the ELD categories as in Rapp et al. (2009), with schematic expression level graphs describing the differential gene expression profiles for Festuca (F), hybrid (H), and Lolium (L).

Expression level dominance (ELD) in Festulolium hybrids ELD scored in F1 generation of Festuca × Lolium (F1 F × L) and Lolium × Festuca (F1 L × F) hybrids, F2 generation of Festuca × Lolium (F2 F × L), F1 and F2 generations of Festuca × Lolium hybrids after 4 years of cultivation (F1 F × L aging and F2 F × L aging, respectively), and the overlaps of gene distribution in the ELD categories. Roman numerals mark the ELD categories as in Rapp et al. (2009), with schematic expression level graphs describing the differential gene expression profiles for Festuca (F), hybrid (H), and Lolium (L).

Transmission of the gene expression patterns to the F2 generation

The investigation whether the gene expression changes associated with hybridization are heritable revealed that there were more genes significantly differentially expressed from one or the other homoeolog (HEB) in three plants of F2 generation of Festuca × Lolium (1016 compared with 566 genes in F1 hybrids). This homoeolog expression was slightly more biased towards the Festuca homoeologs, because 581 (10.0%) genes were preferentially expressed from the Festuca homoeolog compared with 435 (7.5%) genes expressed preferentially from the Lolium homoeolog (Figure 2). Considering expression differences in parents, the number of genes displaying parental legacy decreased only slightly, from 4443 (76.7%) to 4253 (73.5%). The most marked change was the increase in the number of genes expressed at the same level in parents, but preferentially from one homoeolog in hybrids (from 349 in F1 hybrids to 661 in F2 generation). This novel bias was more towards Festuca homoeologs as compared with Lolium homoeologs (187 versus 162 in F1 hybrids as compared with 380 versus 281 in F2; Table 1). At the overall expression level (ELD), there was the same tendency towards a higher number of genes with Lolium expression dominance as compared with Festuca expression dominance in both the F1 and F2 generations (489 versus 316 and 494 versus 283, respectively) with a statistically significant difference using a paired sample sign test (P < 0.004). The overlap of genes in different categories between F1 F × L and F2 F × L generations was 60.3%. Interestingly, there were almost the same frequencies of the genes in the ELD categories, with a slight increase (from 542 to 741 genes) in transgressively downregulated genes and a decrease in the number of transgressively upregulated genes (from 439 to 341 genes). The number of genes displaying Lolium expression dominance was almost identical in both the F1 and F2 generations (489 and 494), while the number of genes with Festuca expression dominance slightly decreased from 316 (6.5%) to 283 (5.8%; Table 2, Table S4). These results indicate that the established Lolium genome dominance at ELD level was transmitted to the successive F2 generation.

Gene expression changes caused by different plant age and growth conditions

To test whether the Lolium genome dominance could be altered in hybrids by different growth conditions and plant age, the F1 and F2 hybrids were re‐analyzed after 4 years of cultivation and after cold treatment (4°C for 3 weeks). After 4 years of cultivation HEB and ELD were only slightly altered. A higher number of genes with equal contribution of both homoeologs to gene expression was found early after the establishment of F1 generation than after 4 years of cultivation (5224 versus 4811 genes; P < 0.001 based on the hypergeometric test). However, these differences were not statistically significant in F2 (4774 versus 4806 genes; Figure 2). Consequently, the number of genes with equal parental expression that became biased in the hybrids increased in F1 from 349 to 680 genes (P < 0.001), but decreased slightly in F2 from 661 to 627 genes (not statistically significant). In terms of ELD, the most significant change after 4 years of cultivation was an increase in the number of genes with transgressive upregulation in both F1 and F2 (Table S4). The number of genes displaying Lolium expression dominance remained almost the same (489 versus 475 genes in F1 and 494 versus 496 genes in F2; not statistically significant in both cases), similar to those displaying Festuca expression dominance (316 versus 299 genes in F1 and 283 versus 288 genes in F2; not statistically significant in both cases; Table 2). Our analysis further revealed that the 4 years of cultivation slightly stabilized gene expression; the overlap of genes in categories between F1 and F2 increased from 60.3% to 67.5%. However, the overlaps for the Lolium expression dominance and Festuca expression dominance categories between the samples were always significant based on the hypergeometric test (P < 0.001) (Figure S2). The cold stress treatment was only performed after 4 years of cultivation, meaning that gene expression changes might reflect either or both the length of cultivation and cold treatment. Thus, the expression data were compared not only with that of the parents, but also with hybrids after 4 years of cultivation without cold treatment and only differentially expressed genes identified between plants after 4 years of cultivation and their corresponding clones after cold treatment were considered. We identified 1176 (518 upregulated and 658 downregulated) and 754 (263 upregulated and 491 downregulated) differentially expressed genes in F1 and F2 generation, respectively, with 468 being identified in both generations. Of these 468 genes, 464 (99.1%) shared the same gene expression pattern, with 161 (34.4%) being upregulated and 307 (64.7%) downregulated after cold stress treatment compared with control conditions. HEB analysis of F1 and F2 hybrids after cold stress treatment showed a trend similar to other samples (F1 and the F2 generation early after establishment and after 4 years of cultivation). Similar to the hybrids after 4 years of cultivation, most genes had equal contribution from both homoeologs (4581 and 4662 genes in F1 and F2 generation, respectively). For genes with unequal contribution of homoeologs, there were only slightly more genes with preferential expression from the Festuca homoeolog (633 and 613 genes in F1 and F2, respectively) compared with those with preferential expression from the Lolium homoeolog (576 and 515 genes in F1 and F2, respectively), but in both cases these differences were small. Slightly higher frequencies were revealed in the subset of 468 genes differentially expressed after cold stress in both generations (145 genes in F1 and 139 genes in F2; Figure 4), with some asymmetry favoring the expression of Festuca homoeologs.
Figure 4

Homoeologous expression bias in reciprocal Festuca × Lolium hybrids after cold stress.

Significant difference in homoeolog expression was computed for all 5790 genes and for the 468 genes differentially expressed after cold stress in both F1 and F2 generations. Three categories were defined: L = F (not significantly differentially expressed), L > F (Lolium homoeolog is significantly more expressed than Festuca homoeolog) and F > L (Festuca homoeolog is significantly more expressed than Lolium homoeolog). F, Festuca pratensis homoeolog; L, Lolium multiflorum homoeolog.

Homoeologous expression bias in reciprocal Festuca × Lolium hybrids after cold stress. Significant difference in homoeolog expression was computed for all 5790 genes and for the 468 genes differentially expressed after cold stress in both F1 and F2 generations. Three categories were defined: L = F (not significantly differentially expressed), L > F (Lolium homoeolog is significantly more expressed than Festuca homoeolog) and F > L (Festuca homoeolog is significantly more expressed than Lolium homoeolog). F, Festuca pratensis homoeolog; L, Lolium multiflorum homoeolog. ELD analysis showed that the most significant change in gene expression after cold stress was the increase in the number of genes displaying transgressive downregulation and transgressive upregulation. The frequency of genes displaying ELD remained almost the same as compared with plants after 4 years of cultivation without cold stress, but there was a slight increase in the frequency of genes with Festuca expression dominance (from 299 to 328 genes and from 288 to 317 genes in F1 and F2 generations, respectively), and a slight decrease in the frequency of genes with Lolium expression dominance (from 475 to 446 genes and from 496 to 455 genes in F1 and F2 generations, respectively). Considering only genes with significant differential expression between plants exposed to cold stress and from controlled conditions, the pattern was reverted: there were slightly more genes exhibiting Festuca expression dominance as compared with those with Lolium expression dominance (86 versus 79 genes and 61 versus 57 genes in F1 and F2 generation, respectively; Table 3, Table S5). Thus, even though the pattern of Lolium genome dominance remained almost unchanged after cold treatment, the expression of Festuca homoeologs is predominantly being modified in genes exhibiting the highest expression level changes. Moreover, we revealed that cold stress stabilized gene expression: the overlap of genes in ELD categories between F1 and F2 generations increased from 60.3% in young hybrids to 67.5% in hybrids after 4 years of cultivation to 83.1% in hybrids after cold stress and the overlaps of Lolium expression dominance and Festuca expression dominance categorized genes showed to be significant (hypergeometric test, P < 0.001).
Table 3

Expression level dominance (ELD) after cold stress

CategoryIIXIIVIXTotal assigned genesUnassignedTotal analyzed genes
F1 F × L stress249 (5.1%)197 (4%)167 (3.4%)161 (3.3%)4890 (100%)9005790
446 (9.1%)328 (7.7%)
F2 F × L stress256 (5.2%)199 (4%)161 (3.3%)156 (3.2%)4914 (100%)8765790
455 (9.2%)317 (6.5%)
F1 F × L stress aging44 (4.4%)35 (3.5%)44 (4.4%)42 (4.2%)996 (100%)1801176
79 (7.9%)86 (8.6%)
F1 F × L aging22 (3.4%)35 (5.4%)29 (4.4%)32 (4.9%)653 (100%)101754
57 (8.8%)61 (9.3%)
F2 F × L aging206 (4.2%)163 (3.3%)131 (2.7%)132 (2.7%)4172 (85.1%)5335790
369 (7.5%)263 (5.4%)
Overlap F1, F2 F × L14 (3.5%)18 (4.4%)17 (4.2%)16 (4%)405 (100%)63468
32 (7.9%)33 (8.2%)
Overlap F1 F × L, F2 L × F13 (3.2%)22 (5.3%)18 (4.4%)17 (4.1%)411 (100%)57468
35 (8.5%)35 (8.5%)
Overlap F1, F2 F × L aging9 (2.2%)17 (4.2%)14 (3.5%)12 (3%)339 (83.1%)29368
26 (6.4%)26 (6.5%)

ELD observed in F1 (F1 F × L) and F2 (F2 F × L) generations of Festuca × Lolium hybrids after cold stress, and the overlaps of gene distribution in ELD categories describing possible gene expression patterns in hybrids as compared with parents. Roman numerals mark the ELD categories as in Rapp et al. (2009), with schematic expression level graphs describing the differential gene expression profiles for Festuca (F), hybrid (H), and Lolium (L).

Expression level dominance (ELD) after cold stress ELD observed in F1 (F1 F × L) and F2 (F2 F × L) generations of Festuca × Lolium hybrids after cold stress, and the overlaps of gene distribution in ELD categories describing possible gene expression patterns in hybrids as compared with parents. Roman numerals mark the ELD categories as in Rapp et al. (2009), with schematic expression level graphs describing the differential gene expression profiles for Festuca (F), hybrid (H), and Lolium (L).

Cis and trans regulatory divergence

To estimate the effects of cis and trans regulatory divergence on changes in gene expression in hybrids, the approach of McManus et al. (2010) was used, in which all 5790 genes were assigned to one of the seven categories: cis‐only, trans‐only, cis + trans, cis × trans, compensatory, conserved and ambiguous (see Experimental procedures). In F1 Festuca × Lolium and Lolium × Festuca hybrids, the majority of genes could be classified as conserved (2487 and 1595 genes) and ambiguous (2928 and 3496 genes), with slightly more genes regulated by cis‐only (159 and 167 genes) versus trans‐only (113 and 155 genes, respectively). When both cis and trans regulatory changes occurred, they favored the expression of the same homoeolog (cis + trans) rather than the opposite homoeolog (cis × trans) (Table S6). In F2 generation of Festuca × Lolium hybrids, there was a reduction of genes classified as conserved and an increase in all other categories, in particular cis × trans, cis + trans and compensatory. Similarly, the most obvious change in the hybrids after 4 years of cultivation was an increased number of genes in compensatory, cis + trans and cis × trans categories (Table S6). On the other hand, cold stress treatment reduced the number of genes in all categories except the category conserved. The analysis of 468 genes up‐ or downregulated after cold stress in both F1 and F2 generations revealed a higher percentage of genes in the categories with trans regulatory divergence: trans‐only, cis + trans, cis × trans and compensatory.

Relationship between HEB and ELD

To determine how HEB gives rise to ELD, the expression of both homoeologs in hybrids was separately compared with the expression level in their parents. Strikingly, for both Lolium ELD (category II) and Festuca ELD (category IV), upregulation of the homoeolog of the submissive parent was observed in the majority of the genes (Table 4). However, the number of upregulated homoeologs of the submissive parent differed between Lolium and Festuca expression dominance. In category II (Lolium expression dominance), the Festuca homoeolog was upregulated in 136 (91.0%), 160 (87.9%), and 155 (83.8%) genes, both homoeologs were upregulated in 11 (7.3%), 12 (6.6%), and 21 (11.4%) genes, and the Lolium homoeolog was upregulated in three (2.0%), six (3.3%), and five (2.7%) genes in Festuca × Lolium, Lolium × Festuca and F2 generation of Festuca × Lolium hybrids, respectively. On the other hand, in category IV (Festuca expression dominance), the Lolium homoeolog was upregulated in 46 (70.8%), 52 (76.9%), and 52 (76.5%) genes, both homoeologs were upregulated in three (4.6%), four (5.1%), and one (1.5%) genes, and the Festuca homoeolog was upregulated in 16 (24.6%), 10 (12.8%), and 9 (13.2%) genes in the Festuca × Lolium, Lolium × Festuca, and F2 generation of Festuca × Lolium hybrids, respectively (Table 4). The same trend was observed in F1 and F2 generations after 4 years of cultivation, independent on whether the hybrids were exposed to cold stress or not (Table S7). Reciprocally, that is, in the case of ELD with the expression level in hybrids being equal to the parent with lower expression (categories XI and IX), the Festuca homoeolog was more efficiently downregulated than the Lolium homoeolog: 12 (9.2%) versus 8 (7.5%), 27 (20.3%) versus 24 (17.2%), and 28 (19.8%) versus 23 (15.1%) genes in the F1 Festuca × Lolium, F1 Lolium × Festuca, and F2 Festuca × Lolium hybrids, respectively (Table 4). Taken together, these data show that Festuca homoeologs are more prone to up‐ and downregulation than Lolium homoeologs in all comparisons.
Table 4

Relationship between expression level dominance and homoeolog expression bias

Homeolog regulationIIIVIXXI
PopulationF1 F × LF2 F × LF1 L × FF1 F × LF2 F × LF1 L × FF1 F × LF2 F × LF1 L × FF1 F × LF2 F × LF1 L × F
Both upregulated11 (7.3%)23 (11.4%)12 (6.6%)3 (4.6%)1 (1.5%)4 (5.1%)000000
F upregulated136 (91%)155 (83.8%)160 (87.9%)16 (24.6%)9 (13.2%)10 (12.8%)000000
L upregulated3 (2%)5 (2.7%)6 (3.3%)46 (70.8%)52 (76.5%)60 (76.9%)00001 (0.5%)0
F up‐ and L downregulated03 (1.6%)3 (1.6%)002 (2.6%)03 (2.6%)6 (5.1%)000
Both downregulated0000004 (4.6%)13 (11.2%)13 (11%)7 (4.4%)18 (9.7%)15 (9.6%)
F downregulated001 (0.5%)0004 (4.6%)10 (8.6%)11 (9.3%)147 (92.5%)154 (83.2%)130 (82.8%)
L downregulated00006 (8.8%)1 (1.3%)80 (90.9%)90 (77.6%)88 (74.6%)5 (3.1%)10 (5.4%)12 (7.6%)
F down‐ and L upregulated01 (0.5%)0001 (1.3%)00002 (1%)0
Not significant1067052816777823268745443
Total number of genes256255234146135155170148186233239200

Relationships observed in F1 of Festuca × Lolium (F1 F × L) and Lolium × Festuca (F1 L × F) and F2 generation of Festuca × Lolium (F2 F × L) hybrids. Graphs at the top represent categories of expression level dominance for Lolium expression dominance (II and XI) and Festuca expression dominance (IV and IX), as in Table 2.

Relationship between expression level dominance and homoeolog expression bias Relationships observed in F1 of Festuca × Lolium (F1 F × L) and Lolium × Festuca (F1 L × F) and F2 generation of Festuca × Lolium (F2 F × L) hybrids. Graphs at the top represent categories of expression level dominance for Lolium expression dominance (II and XI) and Festuca expression dominance (IV and IX), as in Table 2.

Biological processes associated with gene sets displaying ELD

To explore potential functional consequences of ELD we focused on genes displaying Festuca expression dominance and Lolium expression dominance. In total, 678 and 886 genes displaying Festuca expression dominance and Lolium expression dominance were identified, respectively, in both reciprocal hybrids, in F1 and F2 hybrids early after establishment and in F1 and F2 hybrids after 4 years of cultivation (F1 F × L, F2 F × L, F1 L × F, F1 F × L aging and F2 F × L aging). Gene Ontology (GO) terms were transferred from the functional annotation of Lolium perenne orthologs to our L. multiflorum reference transcriptome, and significantly enriched GO terms (P < 0.01) were identified per ELD group (Table S8). We paid special attention to up‐ or downregulated genes after cold stress shared between F1 and F2 generations of hybrids. Downregulated gene sets displayed enrichment of GO terms concerning: (i) cell wall organization or biogenesis (GO:0071554) with several transcripts encoding cellulose synthase A catalytic subunit 3, pectinesterase 31, protein cellulose synthase interactive 1, and expansin‐A2; (ii) secondary metabolic process (GO:0019748) containing several transcripts encoding isoflavone reductase, cinnamyl alcohol dehydrogenase, hydroxycinnamoyltransferase 2, and cinnamoyl CoA reductase; (iii) cuticle development (GO:0042335) with two transcripts encoding 3‐ketoacyl‐CoA synthase; and (iv) water transport (GO:0006833) comprised of three transcripts, all encoding PIP‐type aquaporins. The set of upregulated genes displayed GO term enrichment of the categories: response to cold (GO:009409) with transcripts encoding cysteine proteinase inhibitor, beta‐amylase 3, CBL‐interacting protein kinase 19 and 9, arginine decarboxylase 1, phytochrome B, diacylglycerol kinase 2, WRKY27 transcription factor, HVA22‐like protein e, and GIGANTEA; and polyamine metabolic process (GO:0006595) containing three transcripts encoding N‐carbamoylputrescine amidase, arginine decarboxylase 1, organelle RRM domain‐containing protein 2, and mitochondrial.

DISCUSSION

Novel k‐mer approach for ELD and HEB analysis in hybrids using RNA‐seq

Currently available bioinformatic approaches to study ELD and HEB in hybrids are based on RNA‐seq and on mapping sequence reads to a reference genome/transcriptome (usually from one of the parents) followed by a variant calling to identify loci suitable for discrimination of homoeologous reads. However, such approaches may introduce bias due to a different efficiency of mapping reads from two parental genomes to a single reference genome/transcriptome (Degner et al., 2009; Liu et al., 2014; Satya et al., 2012). Here we found that using the reference transcriptome sequence from one parental species significantly affected the results and introduced a bias in genome dominance assessment (Figure S1). Several tools such as Sailfish and Kallisto have been developed to minimize this bias by focusing more on the k‐mer composition of reads and references, which is subsequently used to assign, accurately and quickly, the specific reads to specific genes/transcripts (Bray et al., 2016; Patro et al., 2014). However, these tools only report the transcript abundancy, or in the case of Kallisto pseudomapping also bam‐like files, which then may be processed for variant calling. Moreover, as shown in our results, Kallisto may still be influenced by the reference transcriptome in more distant species (Figure S1; Table S1). Only recently, genome‐wide association study analysis of Escherichia coli and Arabidopsis thaliana presented a k‐mer–based approach for inferring sequence‐phenotype associations directly from sequenced reads (Rahman et al., 2018; Voichek and Weigel, 2020). These studies show that using k‐mers directly derived from the sequencing data largely agrees with the results obtained using a standard approach in genome‐wide association studies of the single species. Our results indicate that the k‐mer–based approach for inferring differences between two species avoids any bias caused by mapping reads from hybrids to a reference genome/transcriptome from one of the parental species of the hybrid (Figure S1). Thus, using k‐mers may be a useful approach for quantifying homoeolog‐specific gene expression in allopolyploids.

Gene expression alterations induced by genome merger

Immediately after merging two different genomes via interspecific hybridization, multiple genome and epigenome modifications are initiated, often collectively referred to as “genomic shock” (McClintock, 1984). One of the consequences is a genome dominance, whereby one of the two co‐resident genomes becomes “dominant.” Several distinct phenomena are encompassed by this concept, including preferential gene expression favoring the dominant genome. We note that genome dominance does not mean “complete” dominance, as the absolute silencing of the submissive genome has not been observed in plant hybrids or allopolyploids. Often some genes in the submissive genome are more highly expressed than their homoeologous counterparts in the dominant genome (Edger et al., 2017). Although differential homoeolog expression appears to be ubiquitous at the onset of polyploidy, it likely takes some time for a homoeolog to be completely silenced. Studies in cotton identified 180 and 191 genes (0.71% and 0.75% of all genes screened), where one homoeolog was silenced in natural allopolyploids and only 14 such genes were silenced in synthetically developed allopolyploids (Yoo et al., 2013). Similarly, we found that only eight and 44 genes showed homoeolog silencing in Festuca × Lolium and Lolium × Festuca hybrids, respectively (Figure 3). The general lack of genes commonly displaying silencing in various hybrids may indicate that this process is stochastic and prone to variation between progenies from the same cross (Boatwright et al., 2018; Buggs et al., 2009; Soltis et al., 2012; our results). The low frequency of rapid silencing suggests a retention of the majority of homoeologs in allopolyploids, which then potentially serve as a reservoir of genetic variation that may become valuable during subsequent stages of polyploid evolution and diversification (Adams et al., 2003; Nieto‐Feliner et al., 2020). Two aspects of genome dominance have been commonly defined and distinguished, that is, HEB and ELD (Grover et al., 2012; Yoo et al., 2013). In some hybrids, the genome dominating in the homoeolog‐specific expression (HEB) is not the same as the genome dominating at the ELD level. In fish hybrids Megalobrama amblycephala × Culter alburnus, homoeologs of the M. amblycephala genome are more frequently expressed than their counterparts (HEB), while the overall expression in hybrids more frequently mimic the expression level of the C. alburnus genome (ELD; Ren et al., 2019). Similarly, we also found more genes with a higher expression of the Festuca homoeolog than genes with higher expression of Lolium homoeologs, while overall gene expression levels in hybrids reflect more frequently the expression of the Lolium parent in all analyzed samples of hybrids, including those after 4 years of cultivation and after cold stress treatment (Figures 2 and 4, Tables 2 and 3). In a genome dominance context, we note that the unequal contribution of both homoeologs to the overall expression of the particular gene (HEB) is not necessarily evolutionarily meaningful. To the extent that homoeologs are functionally equivalent and that total expression is important to establish a given phenotype (at any level), ELD is a key feature of polyploidy and genome dominance (Nieto‐Feliner et al., 2020).

Predetermination, heritability, and evolution of genome dominance

Genome dominance at both the HEB and ELD levels has been observed in many ancient or relatively recent allopolyploids, including maize, Arabidopsis, cotton, bread wheat and Tragopogon (reviewed in Bird et al., 2018), while there are exceptions displaying little dominance (at the HEB level), such as Capsella bursa‐pastoris and the Ethiopian cereal teff (Douglas et al., 2015; VanBuren et al., 2020). It seems likely that the genomic conditions for dominance are mechanistically established immediately after merging of two distinct genomes in a single nucleus, and in the absence of selection (Adams et al., 2003; Flagel et al., 2008). Despite the scarcity of studies analyzing genomic responses during the first few generations after genome merging, our observations suggest that it appears to be mechanistically deterministic, and imply that genome dominance (both HEB and ELD) is heritable. We are aware of only a few studies in which this idea has been formally tested. In fish hybrids, M. amblycephala × C. alburnus, the number of genes in the expression dominance categories, together with additivity, decreased between the F1 and F3 generations (Ren et al., 2019), suggesting a gradual weakening of parental effects in hybrids. Here, we observed no significant reduction of the number of the genes displaying either Lolium expression dominance or Festuca expression dominance in the F2 generation (489 versus 494 and 316 versus 283 genes, respectively), demonstrating that these alterations of gene expression level are heritable. To the extent that genome dominance in F1 hybrids is mechanistically specified, one may assume that genome dominance is predetermined. We found that Festuca‐genome HEB and Lolium‐genome ELD were displayed in all analyzed hybrid plants, independent of the direction of the cross (Table 2). Alger and Edger (2020) hypothesized that environmental conditions might influence which genome becomes dominant in certain hybrids and allopolyploids. However, we observed that environmental or physiological conditions did not revert the established genome dominance. The number of genes displaying Lolium expression dominance and Festuca expression dominance (ELD) did not change significantly after the hybrids were cultivated for 4 years and similar results were found after cold stress treatment (Table 3). Taken together, our data suggest that genome dominance is predetermined and independent from the direction of the cross, parental genotypes and cold stress conditions. Over the longer term it is expected that homoeologous gene expression, including ELD, will be subjected to evolutionary forces, including those imposed by selection and drift and in the context of genome fractionation. Yoo et al. (2014) showed that older allopolyploids exhibit more ELD relative to neo‐allopolyploids. In contrast, Wu et al. (2018) identified about one‐third of the genes displaying ELD in newly resynthesized Brassica napus, but only one‐fifth of the genes retained ELD in natural allotetraploid B. napus after approximately 7500 years of independent evolution and domestication (Li et al., 2020). This suggests that at least in some cases the consequences of the genomic shock experienced by the newly established hybrids might be alleviated during subsequent evolutionary processes and that gene expression may gradually become balanced and stabilized. Similarly, the comparison of F1 hybrids, synthetically developed allopolyploids and naturally occurring allopolyploids and domesticated cultivars revealed that the HEB can influence increasingly more genes in successive generations (Edger et al., 2017; Yoo et al., 2013). Generally, older allopolyploids such as cotton and allopolyploid Brachypodium seem to display a slightly lower frequency of genes with parental legacy (about 63–65%) compared with young (approximately 100 years old Tragopogon; 69%) or newly established allopolyploids (Festuca × Lolium hybrids; 73–77%) (Shan et al., 2020; Takahagi et al., 2018; Yoo et al., 2013; Table 1). Such balancing and stabilizing gene expression patterns might also be revealed with the ontogeny of the hybrid plant. We found that the overlap among the genes in different gene expression categories between F1 and F2 hybrids is higher after 4 years of cultivation compared with newly established hybrids (60.3% versus 67.5%, Table S4). Collectively, these results reveal the complex temporal and evolutionary contexts that can generate diverse homoeologous expression patterns.

Mechanisms underlying genome dominance

Our knowledge regarding the mechanisms responsible for HEB and ELD remains vague. Transposable elements have been proposed to play an important role, due to frequent repression of adjacent genes (Edger et al., 2017, 2018; Freeling et al., 2012; Hollister and Gaut, 2009). Similarly, gene expression can be affected by small RNAs, including microRNAs and small interfering RNAs (Ng et al., 2012). In addition to the possible role of transposable elements and small RNAs in an uneven contribution of homoeologs to gene expression in hybrids, mismatches between effectors (transcription factors) and their target genes may generate expression alterations (Bottani et al., 2018; Hu and Wendel, 2019). The mismatches may arise from the divergence of the relevant cis‐ and/or trans‐acting regulatory machinery. Cis changes are allele‐specific, affecting expression of the allele linked to the changes and refer mainly to changes in the promoter region of a gene. In contrast, trans changes are diffusible, potentially affecting expression of both parental alleles, due to changes that affect the timing, level or activity of the transcription factors or other regulators controlling the expression of transcription factors or their targets (McManus et al., 2010). We investigated if and how the changes in cis‐acting and/or trans‐acting regulation can modify gene expression and contribute to genome dominance. In theory, if there is a higher expression of a gene in parent A than in parent B, and the hybrid displays the same gene expression level as the parent A, then, there can be upregulation of the A homoeolog, upregulation of B homoeolog, or upregulation of both. We found that upregulation of homoeolog from a submissive parent is the most likely scenario for both Festuca and Lolium expression dominance in Festuca × Lolium hybrids. However, it seems that the Lolium transcriptome machinery can modulate (activate or repress) the expression of Festuca homoeologs more effectively than vice versa. The higher efficiency of Lolium regulators compared with those of Festuca was found for both up‐ and downregulations of the counterpart homoeolog (Table 4). The results are highly consistent, as the same pattern was observed in the F2 generation, both F1 and F2 generations after 4 years of cultivation and in reciprocal crosses. Similar results were observed in a homoploid F1 hybrid and synthetic and two natural allopolyploids of cotton and B. napus, where upregulation of homoeologs from the submissive genome were more frequent than the opposite situation (Li et al., 2020; Yoo et al., 2013). Much of this behavior may reflect a difference in the parental spectrum of transcription factors, their concentrations and/or their affinities for targeted gene promoters (Bottani et al., 2018). From a practical standpoint, an increased understanding of the relative efficiency of transcription factors in parents may enable future harnessing of this information to manipulate genome dominance in hybrids for agronomic purposes.

Conclusions

Here we used a k‐mer–based RNA‐seq approach to provide insights into molecular mechanisms underlying genome dominance in Festuca–Lolium allopolyploids. We showed that the expression alterations at the onset of allopolyploidy are heritable and largely independent of plant age or environmental conditions. We also demonstrated that gene ELD mainly reflects up‐ or downregulation of homoeologs of the submissive genome. We speculate that besides other factors, this might be at least partially caused by the divergence of trans‐acting regulatory factors between the progenitors, and their cascading effects on downstream gene expression. Future insights are likely to derive from studies on the dynamics of transcription factor concentrations, affinities, binding kinetics, and changes in chromatin architecture resulting from hybridization and WGD. Understanding the molecular basis of genome dominance will have practical implications with respect to plant breeding. Enhanced understanding of the molecular genetic underpinnings of genome dominance will increase predictive power with respect to parental combinations that might yield desired phenotypes.

Experimental procedures

Plant material

Synthetically derived autotetraploid F. pratensis cultivar “Westa” and synthetically derived autotetraploid L. multiflorum cultivar “Mitos” were crossed to produce several F1 hybrids. Of these, three independent F1 hybrid Festuca ♀ × Lolium ♂ (F1 F × L) plants and three independent reciprocal Lolium ♀ × Festuca ♂ (F1 L × F) plants were randomly selected. One plant of the F1 F × L hybrid used for the analysis was found to be self compatible and three siblings of an F2 generation were selected (F2 F × L; Figure 1). Nine plants (three plants of F × L, three of F1 L × F and three of the F2 generation of F × L) were used in this study. The genomic composition of all hybrid plants used in this study was verified by genomic in situ hybridization. All plants contained an equal proportion of parental genomes with 14 chromosomes each from F. pratensis and L. multiflorum (Figure 1). All plant material was grown in a cultivation chamber (Weiss‐Gallenkamp, Loughborough, UK). Plants were cut to approximately 5 cm height and grown for three more weeks in the cultivation chamber (day: 14 h, 24°C, 50% relative humidity, 20 000 lux; night: 10 h, 20°C, 50% relative humidity). From each plant, the basal parts of a shoot and part of the root system (about 1 cm of each organ) were sampled for transcriptome sequencing. After sampling tissue for RNA extraction (tissue frozen in liquid nitrogen), the remaining parts of each plant were separated into two clones and grown in a greenhouse. The sampling of F1 F × L and F2 F × L was repeated after 4 years. One clone of each hybrid plant was grown under the same conditions as the experiment above, and the second clone was grown under cold stress conditions (day: 8 h, 4°C, 85% relative humidity, 20 000 lux; night: 16 h, 4°C, 85% relative humidity), both for 3 weeks in the cultivation chambers. Three biological replicates (the tissues from different ramets of the same plant) from each plant were sampled.

RNA extraction and sequencing

RNA was extracted from 100 mg of tissue using the RNeasy Plant Mini Kit (Qiagen, Inc., Valencia, CA, USA) according to the manufacturer’s instructions. The quality of total RNA was checked using an RNA Pico 6000 chip on a Bioanalyzer 2100 (Agilent Technologies, Inc., Santa Clara, CA, USA) and only samples with RNA integrity number (RIN) >7 were sequenced. RNA‐seq was performed at Istituto di Genomica Applicata (IGA Technology Services, Udine, Italy). Libraries were developed with the TruSeq RNA Sample Prep kit v2 according to the manufacturer’s instructions (Illumina Inc., San Diego, CA, USA). Indexed adapters were ligated to the complementary DNA and 200 ± 25‐bp fragments were gel purified and PCR‐amplified. Libraries were quantified using a Bioanalyzer 2100 (Agilent Technologies), pooled in equimolar amounts for sequencing using Illumina HiSeq2000 or HiSeq2500 instruments to produce 100‐ or 125‐bp paired‐end reads.

Analysis of RNA‐seq data

Reads of the two parents were first quality‐trimmed (LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:50) using trimmomatic ver. 0.36 (Bolger et al., 2014). Reference‐guided transcriptome assembly for both parental species and identification of orthologous gene pairs between the parental species was performed. Quality‐trimmed reads were mapped to the scaffolds of reference genome assemblies of the respective parental species with star ver. 2.4.1 (Dobin et al., 2013). The assemblies of F. pratensis cv. “Columbus” and L. multiflorum cv. “Rabiosa” were kindly provided by Dr. Christian S. Jensen (DLF Seeds A/S, Roskilde, Denmark) and Professor Bruno Studer (ETH, Zurich, Switzerland), respectively. Reference transcriptomes for Festuca and Lolium were then produced using stringtie ver. 2.1.0 (Pertea et al., 2015). To reduce redundancy within the stringtie transcriptomes, cd‐hit ver. 4.6.1 (Fu et al., 2012) was used for the clustering of transcripts, and only the longest isoform of each transcript was retained. Putative orthologous gene pairs used for subsequent analysis were identified by best reciprocal BLAST hits (BLAST+ ver. 2.10.1; e‐value <1e‐4) (Altschul et al., 1990) between protein sequences translated from Festuca and Lolium transcriptomes (e‐value <1e‐4). For conventional analysis (using mapping reads to reference transcriptome from one parent), parental RNA‐seq reads were pseudo‐aligned separately to the reference transcriptomes of both parental species with kallisto ver. 0.44.0 (Bray et al., 2016). Abundances for uniquely assigned reads were loaded to the R environment ver. 3.4.4 (R Core Team, 2018). Differential gene expression analysis was assessed from generated read count data using the edger package (Robinson et al., 2010). The transcript abundance of each gene was estimated based on CPM reads mapped value. At first, only genes with abundance >1 CPM in each biological replicate were considered for the analysis. Then, genes with a log2 fold‐change value >1 and false discovery rate (FDR) <0.05 were defined as differentially expressed. The results indicated that there was a statistically significant difference in the ratio of genes with HEB between two reference transcriptomes used for read mapping (hypergeometric test, P << 0.001; Figure S1a,b). Therefore, a k‐mer–based analysis approach was implemented to reduce the potential bias introduced by conventional analysis using read mapping to a respective reference transcriptome selected for the analysis. Quality trimmed reads were used to create species‐specific k‐mer (k = 31) databases with kmc (k‐mer counter) ver. 3.0.0 (Kokot et al., 2017) and differing in one nucleotide with vmatch ver. 2.3.1 (Kurtz, 2003). Using bash, awk, and vmatch, the database entries were filtered to: (i) only retain k‐mers with an abundance of >20 counts (sum of all three replicates) to filter out potential sequencing errors; (ii) only retain k‐mers with a polymorphism in the central position of the 31‐bp sequence (nucleotide 16) that discriminates between parents to minimize possible identification of false positive variants in highly polymorphic regions; (iii) exclude k‐mers having more than one polymorphism; and (iv) exclude heterozygosity for the discriminating nucleotide in one or both parents (by retaining only within‐species homozygous k‐mers). Subsequently, k‐mer databases were aligned with BLASTN (word size = 15) against the reference transcriptomes to assign each k‐mer to its respective transcript, also recording the location on the transcript sequence. To avoid measuring expression values between paralogous genes, k‐mers with a match to more than one transcript were removed. Because the k‐mer data (31 bp) was generated from the reads of 100 bp, a single read can yield multiple k‐mers. Therefore, additional filtering steps were applied (see below). First, the central position of each k‐mer in a transcript was treated as a single nucleotide polymorphism (SNP) that discriminated between parents. Using the biostar214299 utility of jvarkit (Lindenbaum, 2015), RNA‐seq reads of both parents that mapped to their corresponding reference transcriptome and were spanning the k‐mer derived SNP positions were extracted and parsed into four groups: (i) REF, where reads spanning one or more SNPs contain the same variant as the reference on the SNP position; (ii) ALT, where reads spanning one or more SNPs contain the variant corresponding to the other parent; (iii) AMBIGUOUS, where reads spanning two or more SNPs contain both the variant as the reference and the variant corresponding to the other parent; and (iv) UNKNOWN, where reads spanning one or more SNPs contain variant(s) not corresponding to either parents. Only k‐mers originating from reads in the REF group were retained. Counts of full‐length k‐mers were produced using kat (k‐mer analysis toolkit) ver. 2.3.4 (Mapleson et al., 2017). As k‐mers were directly counted from RNA‐seq data, reads containing only part of a k‐mer sequence were not used. Thus, only the full‐length k‐mer sequence present in a read sequence and reference target were used in the subsequent analysis. The counts were aggregated and the mean count was calculated for each SNP cluster in case more SNPs discriminating the parents were present in a single read. Then, the mean k‐mer counts per read per transcript were summed to obtain a count representing the expression level per gene. To validate the usefulness of our strategy, the k‐mer–based method we developed was compared with the conventional method of mapping reads to the reference transcriptome of one or the other parental species. Only SNPs from k‐mers passing all filtering steps were used for this comparison. Once mapped to both transcriptomes, the parental RNA‐seq reads spanning these SNPs were used to generate read count data for the conventional approach. The counts obtained from the two approaches were normalized to CPM and used to create a heatmap of Spearman’s correlation among parental gene expression counts generated by different approaches.

ELD and HEB analysis

For the ELD analysis, total expression of a homoeologous gene pair in the hybrid was compared with the expression of both parents. Genes were then classified into 13 categories based on their expression level according to Rapp et al. (2009) and Yoo et al. (2013). To determine HEB, the expression levels of both homoeologs in hybrids were compared. All homoeologs showing average CPM <1 in hybrids were classified as silenced.

Cis and trans regulatory divergence analysis

The approach of McManus et al. (2010) was used to quantify cis and trans regulatory divergence. k‐mer count‐based expression levels of parents and hybrids were analyzed for evidence of differential expression using the binomial exact test followed by FDR analysis (FDR <0.05). To moderate the significance of differential expression, fold‐change was not used. Three types of differential expression were evaluated for each gene: significant differential expression between homoeologs in parents (Lolium/Festuca; P), significant differential expression between the homoeologs in hybrids (Lolium/Festuca; H), and significant differential expression of each particular homoeolog between the parental (tested separately for each parent) and hybrid data (T). All genes were then assigned to one of seven regulatory categories according to McManus et al. (2010): •cis‐only: Significant differential expression in P and H. Not significant T. •trans‐only: Significant differential expression in P, but not H. Significant T. •cis + trans: Significant differential expression in P and H. Significant T. The log2 fold‐change ratios of these genes have the same sign (P > 0 and H > 0, or P < 0 and H < 0), the regulatory divergence is in favor of expression of the same homoeolog. •cis × trans: Significant differential expression in P and H. Significant T. The log2 fold‐change ratios of these genes have the opposite sign (P > 0 and H < 0, or P < 0 and H > 0), the regulatory divergence favors the expression of opposite homoeologs. •Compensatory: Significant differential expression in H, but not P. Significant T. Regulatory divergences compensate each other, resulting in no expression difference between the two parental genomes. •Conserved: No significant differential expression in P or H. Not significant T. Regulation of these genes is conserved. •Ambiguous: All the other patterns of significance tests, which have no clear biological interpretation.

GO enrichment analysis

Genes were annotated using the GO for L. perenne, downloaded from Monocots PLAZA 4.5 (https://bioinformatics.psb.ugent.be/plaza/versions/plaza_v4_5_monocots/; based on reciprocal BLAST comparison with L. multiflorum. (Blanco‐Pastor et al., 2021). The topgo bioconductor package in R was used for GO analysis. The Fisher’s test implemented in topgo was used to identify enriched GO terms per comparison and all enriched GO terms with P < 0.01 were selected. Table S1. Differential gene expression statistics. Click here for additional data file. Table S2. Statistics of species specific k‐mer databases. Click here for additional data file. Table S3. ELD, HEB and cis/trans regulatory profiles of genes expressed in all analyzed hybrids. Click here for additional data file. Table S4. Expression level dominance (ELD) in reciprocal Festuca × Lolium hybrids. ELD scored in F1 generation of Festuca × Lolium (F1 F × L) and Lolium × Festuca (F1 L × F) hybrids, F2 generation of Festuca × Lolium (F2 F × L), F1 and F2 generations of Festuca × Lolium hybrids after 4 years of cultivation (F1 F × L aging and F2 F × L aging, respectively), and the overlaps of gene distribution in 13 categories. Roman numerals mark the categories as in Rapp et al. (2009), with schematic expression level graphs describing the differential gene expression profiles for Festuca (F), hybrid (H) and Lolium (L). LED, Lolium expression dominance; FED, Festuca expression dominance. Table S5. Expression level dominance (ELD) after cold stress in Festuca × Lolium reciprocal hybrids. ELD observed in F1 (F1 F × L) and F2 (F2 F × L) generations of Festuca × Lolium hybrids after cold stress, and the overlaps of gene distribution in ELD categories describing possible gene expression patterns in hybrids as compared with parents. Roman numerals mark 13 categories as in Rapp et al. (2009), with schematic expression level graphs describing the differential gene expression profiles for Festuca (F), hybrid (H) and Lolium (L). LED, Lolium expression dominance; FED, Festuca expression dominance. Table S6. Numbers of genes in different cis/trans regulatory categories. Table S7. Relationship between ELD and HEB in F1 and F2 generations of Festuca × Lolium hybrids after 4 years of cultivation (F1 F × L aging and F2 F × L aging, respectively), and these hybrids after cold stress treatment. The graphs at the top represent categories of ELD for Lolium expression dominance (II and XI) and Festuca expression dominance (IV and IX), as in Table 2. Table S8. Gene ontology (GO) analysis. GO terms are provided for the genes with expression level dominance – Lolium expression dominance (categories II and XI) and Festuca expression dominance (categories IV and IX). Figure S1. Reference bias and correlation between conventional and k‐mer–based quantification of gene expression. Differentially expressed genes (DEG) between parents when parental RNA‐seq reads were mapped to the reference transcriptome of Festuca (a) and Lolium (b). Three categories of relative homoeolog expression were defined: L = F (not significantly differentially expressed), L > F (Lolium homoeolog is significantly more expressed than Festuca homoeolog) and F > L (Festuca homoeolog is significantly more expressed than Lolium homoeolog). (c,d) Correlation matrix of read count data of parents generated by different approaches for each replicate: “k‐mer”: DEG data generated by k‐mer approach, “self reference”: DEG data generated by mapping to the reference transcriptome of the corresponding parent, “non‐self reference”: DEG data generated by mapping to the reference transcriptome of the other parent. (c) correlation for three F. pratensis replicates; (d) correlation for three L. multiflorum replicates. Figure S2. Overlaps in gene sets showing expression level dominance. Overlaps produced between the F1 generation of Festuca × Lolium (F1 F × L) and Lolium × Festuca (F1 L × F) hybrids, the F2 generation of Festuca × Lolium (F2 F × L), F1 and F2 generations of Festuca × Lolium hybrids after 4 years of cultivation (F1 F × L aging and F2 F × L aging, respectively) for Festuca (categories IV and IX) and Lolium (categories II and XI) expression dominance. FED, Festuca expression dominance; LED, Lolium expression dominance. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  66 in total

Review 1.  Fractionation mutagenesis and similar consequences of mechanisms removing dispensable or less-expressed DNA in plants.

Authors:  Michael Freeling; Margaret R Woodhouse; Shabarinath Subramaniam; Gina Turco; Damon Lisch; James C Schnable
Journal:  Curr Opin Plant Biol       Date:  2012-02-14       Impact factor: 7.834

2.  The wondrous cycles of polyploidy in plants.

Authors:  Jonathan F Wendel
Journal:  Am J Bot       Date:  2015-10-08       Impact factor: 3.844

3.  The significance of responses of the genome to challenge.

Authors:  B McClintock
Journal:  Science       Date:  1984-11-16       Impact factor: 47.728

4.  Homoeolog expression bias and expression level dominance in allopolyploid cotton.

Authors:  M-J Yoo; E Szadkowski; J F Wendel
Journal:  Heredity (Edinb)       Date:  2012-11-21       Impact factor: 3.821

5.  STAR: ultrafast universal RNA-seq aligner.

Authors:  Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal:  Bioinformatics       Date:  2012-10-25       Impact factor: 6.937

6.  Subgenome Dominance in an Interspecific Hybrid, Synthetic Allopolyploid, and a 140-Year-Old Naturally Established Neo-Allopolyploid Monkeyflower.

Authors:  Patrick P Edger; Ronald Smith; Michael R McKain; Arielle M Cooley; Mario Vallejo-Marin; Yaowu Yuan; Adam J Bewick; Lexiang Ji; Adrian E Platts; Megan J Bowman; Kevin L Childs; Jacob D Washburn; Robert J Schmitz; Gregory D Smith; J Chris Pires; Joshua R Puzey
Journal:  Plant Cell       Date:  2017-08-16       Impact factor: 11.277

7.  Gene loss and silencing in Tragopogon miscellus (Asteraceae): comparison of natural and synthetic allotetraploids.

Authors:  R J A Buggs; A N Doust; J A Tate; J Koh; K Soltis; F A Feltus; A H Paterson; P S Soltis; D E Soltis
Journal:  Heredity (Edinb)       Date:  2009-03-11       Impact factor: 3.821

8.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

9.  Identifying genetic variants underlying phenotypic variation in plants without complete genomes.

Authors:  Yoav Voichek; Detlef Weigel
Journal:  Nat Genet       Date:  2020-04-13       Impact factor: 38.330

View more
  1 in total

1.  Genome Dominance in Allium Hybrids (A. cepa × A. roylei).

Authors:  David Kopecký; Olga Scholten; Joanna Majka; Karin Burger-Meijer; Martin Duchoslav; Jan Bartoš
Journal:  Front Plant Sci       Date:  2022-03-10       Impact factor: 5.753

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.