Literature DB >> 30715317

Origin of wheat B-genome chromosomes inferred from RNA sequencing analysis of leaf transcripts from section Sitopsis species of Aegilops.

Yuka Miki1, Kentaro Yoshida1, Nobuyuki Mizuno2, Shuhei Nasuda2, Kazuhiro Sato3, Shigeo Takumi1.   

Abstract

Dramatic changes occasionally occur in intergenic regions leading to genomic alterations during speciation and will consequently obscure the ancestral species that have contributed to the formation of allopolyploid organisms. The S genome of five species of section Sitopsis of genus Aegilops is considered to be an origin of B-genome in cultivated tetraploid and hexaploid wheat species, although its actual donor is still unclear. Here, we attempted to elucidate phylogenetic relationship among Sitopsis species by performing RNA sequencing of the coding regions of each chromosome. Thus, genome-wide polymorphisms were extensively analyzed in 19 accessions of the Sitopsis species in reference to the tetraploid and hexaploid wheat B genome sequences and consequently were efficiently anchored to the B-genome chromosomes. The results of our genome-wide exon sequencing and resultant phylogenetic analysis indicate that Ae. speltoides is likely to be the direct donor of all chromosomes of the wheat B genome. Our results also indicate that the genome differentiation during wheat allopolyploidization from S to B proceeds at different speeds over the chromosomes rather than at constant rate and recombination could be a factor determining the speed. This observation is potentially generalized to genome differentiation during plant allopolyploid evolution.
© The Author(s) 2019. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

Entities:  

Keywords:  RNA sequencing; chromosomal synteny; genome differentiation; genome-wide polymorphisms; wheat

Mesh:

Year:  2019        PMID: 30715317      PMCID: PMC6476730          DOI: 10.1093/dnares/dsy047

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


1. Introduction

Common wheat (Triticum aestivum L., genome constitution AABBDD), a major food crop, is an allohexaploid species derived via allopolyploid speciation through interspecific crossing between cultivated tetraploid wheat Triticum turgidum L. (AABB) and its diploid relative, Aegilops tauschii Coss. (DD). The cultivated tetraploid form was domesticated from the wild tetraploid wheat T. turgidum subspecies dicoccoides (AABB), which was thought to be derived through interspecific hybridization between wild diploid progenitors of the A and B genomes. The A genome donor was the wild diploid wheat T. urartu,, and the B genome could have been contributed by Ae. speltoides Tausch (SS). However, the origin of the B genome remains unclear, despite extensive research over the past few decades. The cytoplasmic genomes of allopolyploid wheat species were almost certainly transmitted from Ae. speltoides,, indicating that at least Ae. speltoides contributed to establishment of the nuclear genome of allopolyploid wheat. The indefinite origin of the wheat B genome is due to failure of homoeologous chromosome pairing between the B genome of allopolyploid wheat and the S genome of Ae. speltoides during meiosis in the respective interspecific hybrids., In addition, the section Sitopsis of Aegilops includes four wild diploid species, Ae. bicornis Jaub. et Spach. (SbSb), Ae. searsii Feldman et Kislev ex Hammer (SsSs), Ae. sharonensis Eig (SlSl), and Ae. longissima Schweinf. & Muschl. (SlSl), except Ae. speltoides. Of the five Sitopsis species that share the S genome, only Ae. speltoides of the subsection Truncata is cross-pollinating, whereas the other four subsection Emarginata species are self-pollinating. Two subspecies of Ae. speltoides (ligustica and speltoides; syn. Ae. aucheri Boiss.) have been defined to date,, and they can be distinguished at least in part by a single locus, Lig, on chromosome 3S, which controls spike morphology. The two Emarginata species, Ae. longissima and Ae. sharonensis, are quite closely related and recognized as forming one complex. The F1 hybrid plants among Sitopsis species show incomplete homoeologous pairing during meiosis, suggesting that differentiation to the modified S genome occurred during diversification of the Sitopsis species. Some chromosomal rearrangements in the S genome, including translocations, have been reported in Sitopsis species., In addition, the B and S genomes can exhibit differences in the pattern of transposable element insertion., Structural differences in intergenic regions have therefore increased the phylogenetic distance between the B genome of polyploid wheat and the S genome of Ae. speltoides. Constitutive heterochromatic regions detected by chromosome staining are much more abundant in B-genome chromosomes than those of the A and D genomes,, and the staining patterns of B-genome chromosomes appear to differ from those of S-genome chromosomes of Ae. speltoides. The difference in heterochromatin bands results in reduced pairing between B and S homoeologous chromosomes., These structural modifications and distinct heterochromatin distribution have made it difficult to elucidate the origin of the B genome and assess the relationship between the Sitopsis genomes and B genome. Molecular phylogenetic studies based on nuclear DNA polymorphisms (e.g. restriction fragment length polymorphisms [RFLPs] and amplified fragment length polymorphisms [AFLPs]) have revealed that the two subsections of Sitopsis are extensively differentiated,, and that the wheat B genome is much more closely related to the S genome of Ae. speltoides than to the other modified S genomes of subsection Emarginata species., Analyses of nucleotide sequence polymorphisms in single-copy genes also supported the hypothesis that Ae. speltoides is the donor of the B genome in allopolyploid wheat., In contrast, a few reports have suggested a polyphyletic origin of the wheat B genome via the introgression of several parental Sitopsis species. For example, a low copy number, non-coding sequence located in the region comprising 19% of the distal portion of the long arm of chromosome 3B exists only in Ae. searsii among all Sitopsis species. Moreover, nucleotide sequence analyses have revealed increased divergence in the B genome of modern common wheat compared with Ae. speltoides, and this divergence is thought to be a result of polyploidization events affecting B-genome evolution. Thus, the phylogenetic relationship between the B and S genomes of section Sitopsis should be reconsidered based on the polymorphisms of each limited chromosomal region as well as those covering the entire chromosomal regions of the B and S genomes. RNA sequencing is an effective approach for surveying large number of genome-wide polymorphisms derived only from the exon sequences in Aegilops species. In RNA sequencing of Aegilops species, polymorphisms identified without any reference genome information can be efficiently anchored to the homoeologous chromosomes of related species, such as common wheat and barley, based on conserved chromosomal synteny. Here, we conducted RNA sequencing analyses of leaf transcripts from section Sitopsis species to avoid the intergenic and repetitive sequences of wheat chromosomes. The objectives of the present study were to (i) identify genome-wide polymorphisms in the Sitopsis genomes, (ii) elucidate the phylogenetic relationship among Sitopsis species, and (iii) determine the wheat B-genome origin based on genome-wide polymorphisms anchored putatively to each chromosome of the B genome.

2. Materials and methods

Plant materials

Three accessions of Ae. speltoides ssp. ligustica (SS genome), four accessions of Ae. speltoides ssp. speltoides (SS genome), two accessions of Ae. bicornis (SbSb genome), three accessions of Ae. longissima (SlSl genome), three accessions of Ae. sharonensis (SlSl genome), and four accessions of Ae. searsii (SsSs genome) were chosen as representatives of each species from the collection of the section Sitopsis at the National Bio Resource Project–Wheat, Japan (Table 1). These accessions of Sitopsis species were originally collected in the Middle East (Supplementary Fig. S1). A tetraploid wheat (T. turgidum) cultivar Langdon (AABB genome) was also used in this study. Triticum urartu KU-199-5 (AA genome), Ae. umbellulata KU-4017 (UU genome), and Ae. tauschii KU-2075 (DD genome) were used as outgroup species.
Table 1

List of the 19 accessions in the section Sitopsis used in RNA-seq analyses

SpeciesAccession numberOriginsMating systems
Aegilops speltoides ssp. speltoidesKU-2208ATurkeyOutcrossing
KU-14601IsraelOutcrossing
KU-14605IsraelOutcrossing
KU-12963aSyriaOutcrossing
Ae. speltoides ssp. ligusticaKU-2236TurkeyOutcrossing
KU-7716IraqOutcrossing
KU-7848IraqOutcrossing
Ae. bicornis KU-5784EgyptSelf-pollinating
KU-14613IsraelSelf-pollinating
Ae. longissima KU-5752JordanSelf-pollinating
KU-14624IsraelSelf-pollinating
KU-14635IsraelSelf-pollinating
Ae. searsii KU-5755SyriaSelf-pollinating
KU-6142JordanSelf-pollinating
KU-6143JordanSelf-pollinating
KU-14651IsraelSelf-pollinating
Ae. sharonensis KU-14661IsraelSelf-pollinating
KU-14663EgyptSelf-pollinating
KU-14668IsraelSelf-pollinating
List of the 19 accessions in the section Sitopsis used in RNA-seq analyses

RNA sequencing

Total RNA was extracted using Sepasol-RNA I Super G (Nacalai Tesque, Kyoto, Japan) from leaves of 2- to 3-month-old plants grown in a glass house. The extracted RNA was treated with DNase I at 37 °C for 20 min, after which paired-end libraries for RNA sequencing were constructed from 6 to 10 µg of total RNA using a TruSeq RNA Library Preparation kit v2 (Illumina, San Diego, CA, USA) according to a previously reported protocol and then sequenced with 300-bp paired-end reads on an Illumina MiSeq sequencer. The obtained reads were deposited in the DDBJ Sequence Read Archive under accession number DRA007097. RNA sequencing data for the outgroup species (300-bp paired-end reads) were obtained from the DDBJ Sequence Read Archives: BioProject PRJDB4683 for Ae. tauschii KU-2075 and DRA006404 for Ae. umbellulata KU-4017.

Quality control, alignment of paired-end reads, de novo transcriptome assembly, and single-nucleotide polymorphism (SNP)/insertion-deletion (indel) calling

FASTQC software (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (6 July 2017, date last accessed)) was used to evaluate sequencing quality of the reads from each of the sequenced samples. Adapter sequences, low-quality bases with an average quality score per 4 bp of <30, and reads of less than 100 bp were removed using Trimmomatic software, version 0.33, and only filtered paired reads were retained for subsequent analyses. The filtered reads were aligned to the reference B genome sequences of T. aestivum cv. Chinese Spring using HISAT2 software, version 2.1.0. To select uniquely mapped reads for SNP and indel calling, reads with a mapping quality of <40 were filtered out using SAMtools. SNPs and indels were called using Coval under the same criterion reported by Nishijima et al.; the depth of read coverage was ≥10, and >95% of the mapped reads designated different nucleotide sequences from the reference sequences. To obtain non-redundant SNPs, we selected the positions of SNPs at which the read depth was ≥10 and there were no ambiguous nucleotides in any of the samples. We prepared two sets of non-redundant SNPs for construction of phylogenetic trees for intra- and interspecific comparisons of nucleotide variations. One was estimated in all of the samples, including A, D, U genome species, and the other was in the section of Sitopsis species and the wheat B genome. The distribution of SNPs/indels was visualized on the physical map of the B genome using CIRCOS and R statistical software. De novo transcriptome assembly of Ae. speltoides ssp. ligustica KU-7716 was performed using Bridger. Unmapped reads of Ae. speltoides ssp. ligustica KU-7716 were aligned to its assembled transcripts using Bowtie. If the breadth coverage of the unmapped reads over a transcript is 100%, its corresponding annotated transcript was searched in the published transcripts of T. aestivum cv. Chinese Spring (iwgsc_refseqv1.0) using BLASTN.

Construction of phylogenetic trees

Neighbour-joining (NJ) and maximum-likelihood (ML) phylogenetic trees were constructed using Molecular Evolutionary Genetics Analysis (MEGA) software, version 5.05. A Kimura 2-parameter model was used as the substitution model for tree construction. To assess node reliability in the trees, bootstrap probability was calculated from 1,000 bootstrap replicates. To construct phylogenetic trees for each chromosomal segment, the chromosomal regions of the B genome were divided into ranges of 60 Mbp each, generating a total of 86 segments that included 124–931 non-redundant SNPs and 79–442 informative polymorphic sites (Supplementary Table S1). NJ trees for each subset of non-redundant SNPs were constructed using MEGA software. Phylogenetic network trees were constructed using SplitsTree4.

Nucleotide divergence between Sitopsis species/subspecies

To estimate nucleotide divergence between Sitopsis species/subspecies, two distance parameters were calculated: fixed substitutions between species and average number of nucleotide differences between species. To clarify positional changes in nucleotide divergence between Sitopsis species and the B genome of T. aestivum, the chromosomal regions of the B genome were divided into ranges of 20 Mbp each. A total of 262 segments of subsets of non-redundant SNPs in Sitopsis species and the B genome of T. aestivum cv. Chinese Spring were obtained. The fixed nucleotide differences and average number of nucleotide differences between Sitopsis species and the B genome were estimated for each subset of non-redundant SNPs. Non-synonymous fixed nucleotide differences were estimated using the variant annotation and effect prediction tool SnpEff. The number of genes per 20 Mbp was counted based on the Chinese Spring annotation.

3. Results and discussion

RNA sequencing detected numerous SNPs in Sitopsis species and the common wheat B genome

To identify genome-wide SNPs and indels in Sitopsis species and the wheat B genome, RNA sequencing of 19 representative accessions of the five Sitopsis species was performed, generating 2–3 million filtered paired reads for each species (Table 2). Of these short reads, 67–81% were uniquely aligned to the B genome sequences of Chinese Spring, and 32,836–130,687 SNPs and 323–1,890 indels were identified. The fewest SNPs and indels were found in Ae. bicornis, whereas the other four species had similar numbers (Fig. 1). High within-species variance in the number of SNPs and indels was detected but considered a potential artifact because the number of filtered reads differed among the tested accessions (Table 2). To examine this possibility, the correlation between the number of filtered reads and SNPs/indels was determined. However, no correlation was observed between the number of SNPs and indels and the number of filtered reads, suggesting that the high variance is a genetic characteristic of section Sitopsis (Supplementary Fig. S2).
Table 2

Summary of RNA sequencing data for the 19 accessions in the section Sitopsis

SpeciesAccession numberRead pairsFiltered read pairsAlignment rate (%)a
Aegilops speltoides ssp. speltoidesKU-2208A4,317,2012,721,855 (63.05%)74.77
KU-146015,279,4303,088,916 (58.51%)78.29
KU-146054,005,9192,480,301 (61.92%)77.17
KU-12963a4,779,1683,041,573 (63.64%)70.94
Ae. speltoides ssp. ligusticaKU-22364,378,4452,313,014 (52.83%)74.40
KU-77164,036,6602,553,962 (63.27%)67.92
KU-78484,962,4222,843,436 (57.30%)77.83
Ae. bicornis KU-57844,470,5202, 713,658 (60.70%)80.71
KU-146134,567,6982,881,072 (63.07%)81.04
Ae. longissima KU-57524,371,2872,490,621 (56.98%)78.48
KU-146245,012,0642,865,373 (57.17%)73.05
KU-146354,623,1662,660,230 (57.54%)75.52
Ae. searsii KU-57555,240,4423,150,732 (60.12%)81.23
KU-61425,024,7253,108,286 (61.86%)77.75
KU-61434,852,7412,776,399 (57.21%)78.16
KU-146515,345,7593,178,031 (59.45%)77.87
Ae. sharonensis KU-146616,500,2703,241,926 (49.87%)75.55
KU-146638,544,4604,157,728 (48.66%)78.27
KU-1466810,079,6954,891,456 (48.53%)75.25

The aliment rate against the B genome of T. aestivum cv. Chinese Spring was calculated with HISAT2 version 2.1.0.

Figure 1

Bar charts illustrating the number of SNPs (A) and indels (B) between the B genome of T. aestivum cv. Chinese Spring and each of the 19 accessions of section Sitopsis species.

Bar charts illustrating the number of SNPs (A) and indels (B) between the B genome of T. aestivum cv. Chinese Spring and each of the 19 accessions of section Sitopsis species. Summary of RNA sequencing data for the 19 accessions in the section Sitopsis The aliment rate against the B genome of T. aestivum cv. Chinese Spring was calculated with HISAT2 version 2.1.0. To confirm that RNA sequencing could identify genome-wide SNPs and indels, the chromosomal distribution of SNPs and indels was examined (Fig. 2). SNPs and indels identified in all of the tested accessions of Sitopsis species and the B genome entirely covered all of the chromosomes, with no clear difference in the distribution of SNPs and indels among Sitopsis species. Regions with scant or abundant SNPs on the chromosomes were quite consistent between species. For each chromosome, the number of SNPs ranged from 4,194 to 21,175, and the number of indels ranged from 34 to 317, with the high variance reflecting differences in SNP and indels numbers within species but not between species (Supplementary Fig. S3).
Figure 2

Distribution of SNPs (A) and indels (B) between the T. aestivum cv. Chinese Spring (CS) B genome and each of the 19 accessions of section Sitopsis species on the physical map of the CS B genome. From outer to inner circle, CS B genome (scale in Mb) and chromosome number (black), Ae. speltoides ssp. ligustica KU-2236, KU-7716, and KU-7848 (red); Ae. spletoides ssp. speltoides KU-2208A, KU-14601, KU-14605, and KU-12963a (orange); Ae. longissima KU-5752, KU-14624, and KU-14635 (green); Ae. searsii KU-5755, KU-6142, KU-6143, and KU-14651 (blue); Ae. sharonensis KU-14661, KU-14663, and KU-14668 (purple); and Ae. bicornis KU-5784 and KU-14613 (grey).

Distribution of SNPs (A) and indels (B) between the T. aestivum cv. Chinese Spring (CS) B genome and each of the 19 accessions of section Sitopsis species on the physical map of the CS B genome. From outer to inner circle, CS B genome (scale in Mb) and chromosome number (black), Ae. speltoides ssp. ligustica KU-2236, KU-7716, and KU-7848 (red); Ae. spletoides ssp. speltoides KU-2208A, KU-14601, KU-14605, and KU-12963a (orange); Ae. longissima KU-5752, KU-14624, and KU-14635 (green); Ae. searsii KU-5755, KU-6142, KU-6143, and KU-14651 (blue); Ae. sharonensis KU-14661, KU-14663, and KU-14668 (purple); and Ae. bicornis KU-5784 and KU-14613 (grey).

Phylogenetic relationship between Sitopsis species and B genome of bread wheat

To clarify the phylogenetic relationship and nucleotide divergence between species, we estimated sets of non-redundant SNPs anchored to each chromosome of the B genome in the 19 tested accessions of Sitopsis species and the B genome of Chinese Spring, with/without three outgroup species: T. urartu, Ae. umbellulata, and Ae. tauschii. Without the outgroup species, 30,589 non-redundant SNPs were obtained. When the outgroup species were included, 39,148 non-redundant SNPs were identified. These sets of non-redundant SNPs covered all of the chromosome of the B genome (Supplementary Fig. S4), allowing evolutionary analyses of section Sitopsis based on genome-wide polymorphisms. NJ and ML phylogenetic trees and a phylogenetic network tree were constructed based on the set of non-redundant SNPs with the outgroup species (Fig. 3 and Supplementary Fig. S5). The Sitopsis species were clearly divided into two clades. One clade included Ae. speltoides ssp. speltoides and Ae. speltoides ssp. ligustica, and the other clade included Ae. longissima, Ae. sharonensis, Ae. bicornis, and Ae. searsii; the two clades corresponded to subsections Truncata and Emarginata, respectively. This result was consistent with the results of previous studies based on RFLPs and AFLPs.,, The Emarginata clade was more closely related to Ae. tauschii and Ae. umbellulata than the Truncata clade. The B genomes of T. aestivum and T. turgidum were closely related to Ae. speltoides ssp. speltoides and Ae. speltoides ssp. ligustica in the Truncata clade. The average number of nucleotide differences and fixed nucleotide differences between Ae. speltoides ssp. and the B genome were the lowest in pairwise comparisons between species of section Sitopsis and the B genome (Table 3). These results supported the previous hypothesis that the B genome originated from the S genome of Ae. speltoides. Considering that the wheat B genome was not nested within the Truncata clade, the most recent common ancestor of Ae. speltoides ssp. speltoides and Ae. speltoides ssp. ligustica is likely the direct donor of the wheat B genome.
Figure 3

Phylogenetic relationship among the 19 accessions of section Sitopsis species (S genome), the B genomes of T. aestivum cv. Chinese Spring and T. turgidum ssp. durum cv. Langdon, T. urartu (A genome), Ae. tauschii (D genome), and Ae. umbellulata (U genome). NJ tree (A) and phylogenetic network (B) are shown. Bootstrap probabilities are shown on the branches (number of bootstrap replications = 1000). The scale bar is shown below each phylogenetic tree.

Table 3

Summary of pairwise comparisons of fixed divergence (upper) and average number of nucleotide differences (lower) between Sitopsis species

Ae. speltoides ssp. ligustica Ae. speltoides ssp. ligustica Ae. speltoides ssp. speltoides Ae. bicornis Ae. longissima Ae. searsii Ae. sharonensis B-genomeSubspecies
lig/spel lon/sha
Ae. speltoidesssp. ligustica999,0328,1538,4317,8546,27007,693
Ae. speltoidesssp. speltoides5,411.48,5587,7047,9817,4005,77907,251
Ae. bicornis 12,785.512,6661,8843,6531,77113,0937,9351,557
Ae. longissima 12,412.112,313.53,239.72,53012112,1757,1100
Ae. searsii 12,052.811,971.94,318.83,782.81,97612,4987,3561,841
Ae. sharonensis 12,07011,965.53,048.51,720.93,244.911,8116,8210
B-genome9,6549,507.813,489.513,18712,774.812,803.35,17011,577
Sub-species lig/spel4,587.44,544.312,717.212,355.812,006.512,010.39,570.46,690
lon/sha12,241.112,139.53,144.11,358.73,513.81,331.212,995.212,183

lig/spel: comparisons between Ae. speltoides ssp. ligustica and Ae. speltoides ssp. speltoides.

lon/sha: comparisons between Ae. longissima and Ae. sharonensis.

Phylogenetic relationship among the 19 accessions of section Sitopsis species (S genome), the B genomes of T. aestivum cv. Chinese Spring and T. turgidum ssp. durum cv. Langdon, T. urartu (A genome), Ae. tauschii (D genome), and Ae. umbellulata (U genome). NJ tree (A) and phylogenetic network (B) are shown. Bootstrap probabilities are shown on the branches (number of bootstrap replications = 1000). The scale bar is shown below each phylogenetic tree. Summary of pairwise comparisons of fixed divergence (upper) and average number of nucleotide differences (lower) between Sitopsis species lig/spel: comparisons between Ae. speltoides ssp. ligustica and Ae. speltoides ssp. speltoides. lon/sha: comparisons between Ae. longissima and Ae. sharonensis. The two subspecies of Ae. speltoides were not clearly divided in the Truncata clade (Fig. 3 and Supplementary Fig. S5). The Truncata clade had longer external branches than the Emarginata clade. This observation could be explained by differences in the mating systems of the two clades: species of the Truncata clade are outcrossing, whereas species of the Emarginata clade are self-pollinating. The mating system of Ae. speltoides is highly outcrossing., RFLP analyses indicated that Ae. speltoides contained a higher proportion of heterozygous loci compared with other self-pollinating species of Sitopsis. Natural populations of Ae. speltoides would harbour nucleotide variations as heterozygous states. The tested accessions of Ae. speltoides had been maintained by self-pollinating for several decades in the Japanese gene bank, which could have led to fixation of one of the alleles in heterozygous sites, increasing the number of singletons within the Ae. speltoides accessions. This resulted in detection of a relatively large number of SNPs in the Ae. speltoides accessions (Fig. 1). In the Emarginata clade, Ae. searsii was monophyletic and separated from the other three species, whereas Ae. bicornis nested within Ae. sharonensis and Ae. longissima. Aegilops longissima diverged from Ae. sharonensis. The close relationship between Ae. sharonensis and Ae. longissima was consistent with previous reports, and not inconsistent with a recent proposal that Ae. sharonensis is a subspecies of Ae. longissima.

Identification of SNPs distinguishing subspecies ligustica/speltoides and Ae. sharonensis/Ae. longissima

The phylogenetic tree did not discriminate well between subspecies of Ae. speltoides ssp. speltoides and Ae. speltoides ssp. ligustica, whereas the subspecies of Ae. longissima and Ae. sharonensis were distinguished. These subspecies are morphologically classified. Notably, differences in spike morphology between Ae. speltoides ssp. speltoides and Ae. speltoides ssp. ligustica can be explained by a single locus, Lig, located on chromosome 3S. If there are SNPs that would distinguish these subspecies (fixed nucleotide differences between subspecies), they could be related to the morphologic differences between the subspecies. Between Ae. speltoides ssp. speltoides and Ae. speltoides ssp. ligustica, 99 fixed nucleotide differences were detected (Table 3). The positions of these fixed nucleotide differences were scattered over the chromosomes (Fig. 4A). Ten of the fixed nucleotide differences caused amino acid substitutions between the two subspecies (Supplementary Table S2).
Figure 4

Distributions of the positions of fixed nucleotide differences between subspecies of Aegilops speltoides ssp. ligustica and Ae. spletoides ssp. speltoides (A) and between Ae. sharonensis and Ae. longissima (B) on the physical map of the B genome of T. aestivum cv. Chinese Spring from chromosomes 1B to 7B.

Distributions of the positions of fixed nucleotide differences between subspecies of Aegilops speltoides ssp. ligustica and Ae. spletoides ssp. speltoides (A) and between Ae. sharonensis and Ae. longissima (B) on the physical map of the B genome of T. aestivum cv. Chinese Spring from chromosomes 1B to 7B. Between Ae. longissima and Ae. sharonensis, 121 fixed nucleotide differences were found (Table 3). All of the chromosomes had fixed nucleotide differences (Fig. 4B), and they were most densely located on the end of the long arm of chromosome 4B. Of all total fixed nucleotide differences, 19 caused amino acid substitutions between the two species. Genes with these fixed nucleotide differences encoded proteins involved in a variety of biological functions (Supplementary Table S3). Lateral awn elongation is a key character distinguishing Ae. sharonensis from Ae. longissima, as Ae. longissima lacks a lateral awn. The heading time and growth habitats of these two closely related species are also distinct. The distal region of chromosome 4Sl, in which many fixed nucleotide differences accumulated, might control the morphologic and physiologic differences between Ae. sharonensis and Ae. longissima. Identification of the causal genes is a focus for future research.

Contrasting pattern of nucleotide divergence in the distal and proximal regions of the chromosomes

Some previous studies suggested the possibility of introgression from several parental Sitopsis species, supporting a polyphyletic origin of the wheat B genome. If introgression contributed to the origin of the wheat B genome, the phylogenetic relationship between species could possibly be verified based on chromosomal positions. To test this hypothesis, NJ trees were constructed for 60-Mbp regions on each chromosome (Fig. 5). Of a total of 86 phylogenetic trees, 83 exhibited a similar topology to that of trees based on entire chromosomes, in which the B genome of Chinese Spring was closely related to the Truncata clade including Ae. speltoides (Fig. 3). Two trees indicated that the B genome was closely related to the Emarginata clade (Fig. 5). These irregular trees were detected at the end of the long arm of chromosomes 1B and 3B. In the other tree, the B genome at the end of the short arm of chromosome 3B was located outside the Sitopsis species.
Figure 5

Irregular topologies of the phylogenetic trees in the distal chromosomal regions. The chromosomal regions were divided into 86 segments of 60 Mbp each. NJ trees were constructed based on non-redundant SNPs located in each segment. Squares on the chromosomes in panel (A) correspond to the 86 segments. Phylogenetic trees of the white squares showed that the B genome of T. aestivum cv. Chinese Spring was the most closely related to Ae. speltoides in the section Sitopsis. This observation was consistent with those for trees constructed based on all non-redudnant SNPs (Fig. 5). Phylogenetic trees of the black squares showed that the B genome was closely related to Emarginata clades. A phylogenetic tree of the grey square showed that the B genome was located outside of Sitopsis species. Trees with irregular topologies at the end of the long arm of chromosome 1B (B) and the short arm (C) and long arm (D) of chromosome 3B are shown. Bootstrap probabilities with over 50% and scale bars are shown for each tree.

Irregular topologies of the phylogenetic trees in the distal chromosomal regions. The chromosomal regions were divided into 86 segments of 60 Mbp each. NJ trees were constructed based on non-redundant SNPs located in each segment. Squares on the chromosomes in panel (A) correspond to the 86 segments. Phylogenetic trees of the white squares showed that the B genome of T. aestivum cv. Chinese Spring was the most closely related to Ae. speltoides in the section Sitopsis. This observation was consistent with those for trees constructed based on all non-redudnant SNPs (Fig. 5). Phylogenetic trees of the black squares showed that the B genome was closely related to Emarginata clades. A phylogenetic tree of the grey square showed that the B genome was located outside of Sitopsis species. Trees with irregular topologies at the end of the long arm of chromosome 1B (B) and the short arm (C) and long arm (D) of chromosome 3B are shown. Bootstrap probabilities with over 50% and scale bars are shown for each tree. We estimated the average number of nucleotide differences between the B genome of Chinese Spring and each of the Truncata and Emarginata clades as a parameter of genetic divergence between the B genome and each clade (Fig. 6). In the distal regions of the chromosomes, genetic divergence between the B genome and Emarginata clade was slightly less than that between the B genome and Truncata clade, whereas in the proximal regions of the chromosomes, genetic divergence between the B genome and Emarginata clade was greater than that between the B genome and Truncata clade. The proximal chromosomal regions tended to exhibit conspicuous disparity in terms of genetic divergence ([DE-B − DT-B]/DT-B × 100 > 0 in Fig. 6), but the range of this disparity differed among the chromosomes. Almost the entire region (∼660 Mbp) of chromosome 3B exhibited clear disparity. In contrast, on chromosome 5B, the region exhibiting disparity was limited to within about 360 Mbp of the long arm. Except for chromosome 7B, the disparity appeared to be negatively correlated with the total number of non-redundant SNPs in Sitopsis species and the B genome. In chromosomal regions with larger number of non-redundant SNPs, genetic divergence between the B genome and Emarginata clade increased to the same extent as that between the B genome and Truncata clade. In addition, the disparity also reflected the number of genes; regions with fewer genes were found to exhibit greater disparity. Interestingly, this contrasting pattern of genetic divergence between the proximal and distal chromosomal regions corresponded to the gradient recombination rate along the chromosomes.
Figure 6

Contrasting pattern of nucleotide divergence in the distal and proximal regions of B-genome chromosomes. Average number of nucleotide differences per 20 Mbp between species in Truncata (=Ae. speltoides spp.) and the B genome of T. aestivum cv. Chinese Spring (DT-B) and between Emarginata species and the B genome (DE-B) are plotted on each chromosome. The distributions of these differences are shown by line graphs (orange and green) in the top panels. Area charts in grey colour in the top panels denote the distribution of the total number of non-redundant SNPs per 20 Mbp in Sitopsis species and the B genome. Middle panels show the distribution of the ratio expressing disparity between the two genetic divergences ([DE-B − DT-B] × 100/DT-B) along each chromosome using bar charts. Area charts in blue colour indicate the distribution of the number of genes along each chromosome.

Contrasting pattern of nucleotide divergence in the distal and proximal regions of B-genome chromosomes. Average number of nucleotide differences per 20 Mbp between species in Truncata (=Ae. speltoides spp.) and the B genome of T. aestivum cv. Chinese Spring (DT-B) and between Emarginata species and the B genome (DE-B) are plotted on each chromosome. The distributions of these differences are shown by line graphs (orange and green) in the top panels. Area charts in grey colour in the top panels denote the distribution of the total number of non-redundant SNPs per 20 Mbp in Sitopsis species and the B genome. Middle panels show the distribution of the ratio expressing disparity between the two genetic divergences ([DE-B − DT-B] × 100/DT-B) along each chromosome using bar charts. Area charts in blue colour indicate the distribution of the number of genes along each chromosome. In common wheat, the recombination rate and chromosomal gene density increase as the centromeric region recedes., Multiple genes created by gene duplication are more frequently located in the distal regions of the chromosomes, where they potentially drive increases in gene density and the recombination rate. This observation coincides with the observed positive correlation between the disparity of genetic divergence and number of genes (Fig. 6). Incomplete lineage sorting (ILS) is known to generate gene trees in which the topology is discordant with that of species trees and tends to more frequently occur in rapid successions of speciation events., If speciation events in section Sitopsis had occurred over a relatively short time, ILS in the ancestral population of Sitopsis represents a potential factor blurring the phylogenetic relationship between Sitopsis species and the B genome. ILS is positively correlated with recombination., Indeed, distal chromosomal regions with an irregular phylogenetic topology (Fig. 5) exhibited more complex reticulate structures in the network tree compared with whole-chromosomal regions (Fig. 3B and Supplementary Fig. S6). In the interspecies comparisons, the internal reticulate structures represent data conflicts caused by phenomena such as ILS. Therefore, ILS could explain the irregular topology of phylogenetic trees in Sitopsis species (Fig. 5) and the unclear disparity of genetic divergence (Fig. 6) that were prominent in the distal chromosomal regions with a higher recombination rate. In the present study, a large number of SNPs and indels were discovered in the wheat B genome and five Sitopsis species based on RNA sequencing of leaf-derived transcripts. The polymorphic data would be useful for developing genome-wide markers on the S-genome chromosomes as performed for the wild diploid relatives, Ae. tauschii and Ae. umbellulata., In conclusion, the present phylogenetic analyses based on genome-wide polymorphisms suggest that the B genome of common wheat was derived from the S genome of Ae. speltoides. A few chromosomal regions demonstrated the clearly exceptional relationship between the B genome and Sitopsis species, whereas the irregular topology observed could be explained by higher recombination rates in the distal regions of wheat chromosomes. Therefore, based on genome-wide polymorphisms identified from the RNA sequencing data, all of the chromosomal regions of the wheat B genome could have originated from the S genome of Ae. speltoides. Moreover, the failure of pairing between homoeologous chromosomes between the B and S genomes during meiosis could be due to factors associated with highly evolved regions or intergenic regions. The alignment rate to B genome of Ae. speltoides was not as high as those of Ae. bicornis and Ae. searsii (Table 2). Of the transcripts that were entirely covered with unaligned RNA sequencing reads, 18% encoded F-box proteins and disease resistance proteins such as NBS-LRR (Supplementary data S1). Positive selection is known to act on these protein genes and increase nucleotide substitutions between species. After separation of the B genome from the S genome, the different selective pressure between B and S genomes may contribute to enhancing their genetic differentiation. In addition, distinct patterns of accumulation of repetitive sequences could have led to the differential distribution of heterochromatic regions between the B and S genomes. To elucidate the molecular nature of the differentiation of the B and S genomes, future studies should compare the repetitive sequences over all chromosomal regions.

Data availability

All read sequences were deposited into the DDBJ/GenBank/MMBL database with accession number DRA007097. RNA sequencing data for the outgroup species were used in the DDBJ/GenBank/MMBL database: BioProject PRJDB4683 for Ae. tauschii KU-2075 and DRA006404 for Ae. umbellulata KU-4017. Click here for additional data file.
  46 in total

1.  A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.

Authors:  Pablo Cingolani; Adrian Platts; Le Lily Wang; Melissa Coon; Tung Nguyen; Luan Wang; Susan J Land; Xiangyi Lu; Douglas M Ruden
Journal:  Fly (Austin)       Date:  2012 Apr-Jun       Impact factor: 2.160

Review 2.  Gene tree discordance, phylogenetic inference and the multispecies coalescent.

Authors:  James H Degnan; Noah A Rosenberg
Journal:  Trends Ecol Evol       Date:  2009-03-21       Impact factor: 17.712

3.  Phylogenetic relationships of Triticum and Aegilops and evidence for the origin of the A, B, and D genomes of common wheat (Triticum aestivum).

Authors:  Gitte Petersen; Ole Seberg; Merete Yde; Kasper Berthelsen
Journal:  Mol Phylogenet Evol       Date:  2006-02-28       Impact factor: 4.286

4.  Reconciling the evolutionary origin of bread wheat (Triticum aestivum).

Authors:  Moaine El Baidouri; Florent Murat; Maeva Veyssiere; Mélanie Molinier; Raphael Flores; Laura Burlot; Michael Alaux; Hadi Quesneville; Caroline Pont; Jérôme Salse
Journal:  New Phytol       Date:  2016-08-23       Impact factor: 10.151

5.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

6.  Standard karyotype of Triticum longissimum and its cytogenetic relationship with T. aestivum.

Authors:  B Friebe; N Tuleen; J Jiang; B S Gill
Journal:  Genome       Date:  1993-08       Impact factor: 2.166

7.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

8.  Genome-wide identification of novel genetic markers from RNA sequencing assembly of diverse Aegilops tauschii accessions.

Authors:  Ryo Nishijima; Kentaro Yoshida; Yuka Motoi; Kazuhiro Sato; Shigeo Takumi
Journal:  Mol Genet Genomics       Date:  2016-05-03       Impact factor: 3.291

9.  RNA-seq analysis reveals considerable genetic diversity and provides genetic markers saturating all chromosomes in the diploid wild wheat relative Aegilops umbellulata.

Authors:  Moeko Okada; Kentaro Yoshida; Ryo Nishijima; Asami Michikawa; Yuka Motoi; Kazuhiro Sato; Shigeo Takumi
Journal:  BMC Plant Biol       Date:  2018-11-08       Impact factor: 4.215

10.  The Dynamics of Incomplete Lineage Sorting across the Ancient Adaptive Radiation of Neoavian Birds.

Authors:  Alexander Suh; Linnéa Smeds; Hans Ellegren
Journal:  PLoS Biol       Date:  2015-08-18       Impact factor: 8.029

View more
  16 in total

1.  Genome-wide polymorphisms from RNA sequencing assembly of leaf transcripts facilitate phylogenetic analysis and molecular marker development in wild einkorn wheat.

Authors:  Asami Michikawa; Kentaro Yoshida; Moeko Okada; Kazuhiro Sato; Shigeo Takumi
Journal:  Mol Genet Genomics       Date:  2019-06-11       Impact factor: 3.291

Review 2.  RNA-Seq-based DNA marker analysis of the genetics and molecular evolution of Triticeae species.

Authors:  Kazuhiro Sato; Kentaro Yoshida; Shigeo Takumi
Journal:  Funct Integr Genomics       Date:  2021-08-18       Impact factor: 3.410

3.  Identification and characterization of large-scale genomic rearrangements during wheat evolution.

Authors:  Inbar Bariah; Danielle Keidar-Friedman; Khalil Kashkush
Journal:  PLoS One       Date:  2020-04-14       Impact factor: 3.240

4.  The unique disarticulation layer formed in the rachis of Aegilops longissima probably results from the spatial co-expression of Btr1 and Btr2.

Authors:  Xiaoxue Zeng; Gang Chen; Lei Wang; Akemi Tagiri; Shinji Kikuchi; Hidenori Sassa; Takao Komatsuda
Journal:  Ann Bot       Date:  2021-02-09       Impact factor: 4.357

5.  High-resolution mapping of the Mov-1 locus in wheat by combining radiation hybrid (RH) and recombination-based mapping approaches.

Authors:  Alexander Mahlandt; Nidhi Rawat; Jeff Leonard; Prakash Venglat; Raju Datla; Nathan Meier; Bikram S Gill; Oscar Riera-Lizarazu; Gary Coleman; Angus S Murphy; Vijay K Tiwari
Journal:  Theor Appl Genet       Date:  2021-04-08       Impact factor: 5.574

6.  Transcriptome Analysis Reveals Important Candidate Genes Related to Nutrient Reservoir, Carbohydrate Metabolism, and Defence Proteins during Grain Development of Hexaploid Bread Wheat and Its Diploid Progenitors.

Authors:  Megha Kaushik; Shubham Rai; Sureshkumar Venkadesan; Subodh Kumar Sinha; Sumedha Mohan; Pranab Kumar Mandal
Journal:  Genes (Basel)       Date:  2020-05-05       Impact factor: 4.096

7.  The Mitochondrial Genome of Eleusine indica and Characterization of Gene Content within Poaceae.

Authors:  Nathan D Hall; Hui Zhang; Jeffrey P Mower; Joseph Scott McElroy; Leslie R Goertzen
Journal:  Genome Biol Evol       Date:  2020-01-01       Impact factor: 3.416

8.  Phenotypic effects of the U-genome variation in nascent synthetic hexaploids derived from interspecific crosses between durum wheat and its diploid relative Aegilops umbellulata.

Authors:  Moeko Okada; Asami Michikawa; Kentaro Yoshida; Kiyotaka Nagaki; Tatsuya M Ikeda; Shigeo Takumi
Journal:  PLoS One       Date:  2020-04-02       Impact factor: 3.240

9.  Diploid genome differentiation conferred by RNA sequencing-based survey of genome-wide polymorphisms throughout homoeologous loci in Triticum and Aegilops.

Authors:  Sayaka Tanaka; Kentaro Yoshida; Kazuhiro Sato; Shigeo Takumi
Journal:  BMC Genomics       Date:  2020-03-20       Impact factor: 3.969

10.  A roadmap for gene functional characterisation in crops with large genomes: Lessons from polyploid wheat.

Authors:  Nikolai M Adamski; Philippa Borrill; Jemima Brinton; Sophie A Harrington; Clémence Marchal; Alison R Bentley; William D Bovill; Luigi Cattivelli; James Cockram; Bruno Contreras-Moreira; Brett Ford; Sreya Ghosh; Wendy Harwood; Keywan Hassani-Pak; Sadiye Hayta; Lee T Hickey; Kostya Kanyuka; Julie King; Marco Maccaferrri; Guy Naamati; Curtis J Pozniak; Ricardo H Ramirez-Gonzalez; Carolina Sansaloni; Ben Trevaskis; Luzie U Wingen; Brande Bh Wulff; Cristobal Uauy
Journal:  Elife       Date:  2020-03-24       Impact factor: 8.140

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.