| Literature DB >> 35269924 |
Sajjad Asaf1, Rahmatullah Jan2, Abdul Latif Khan3, Waqar Ahmad1, Saleem Asif2, Ahmed Al-Harrasi1, Kyung-Min Kim2, In-Jung Lee2.
Abstract
The mechanism and course of Triticum plastome evolution is currently unknown; thus, it remains unclear how Triticum plastomes evolved during recent polyploidization. Here, we report the complete plastomes of two polyploid wheat species, Triticum sphaerococcum (AABBDD) and Triticum turgidum subsp. durum (AABB), and compare them with 19 available and complete Triticum plastomes to create the first map of genomic structural variation. Both T. sphaerococcum and T. turgidum subsp. durum plastomes were found to have a quadripartite structure, with plastome lengths of 134,531 bp and 134,015 bp, respectively. Furthermore, diploid (AA), tetraploid (AB, AG) and hexaploid (ABD, AGAm) Triticum species plastomes displayed a conserved gene content and commonly harbored an identical set of annotated unique genes. Overall, there was a positive correlation between the number of repeats and plastome size. In all plastomes, the number of tandem repeats was higher than the number of palindromic and forward repeats. We constructed a Triticum phylogeny based on the complete plastomes and 42 shared genes from 71 plastomes. We estimated the divergence of Hordeum vulgare from wheat around 11.04-11.9 million years ago (mya) using a well-resolved plastome tree. Similarly, Sitopsis species diverged 2.8-2.9 mya before Triticum urartu (AA) and Triticum monococcum (AA). Aegilops speltoides was shown to be the maternal donor of polyploid wheat genomes and diverged ~0.2-0.9 mya. The phylogeny and divergence time estimates presented here can act as a reference framework for future studies of Triticum evolution.Entities:
Keywords: Triticum sphaerococcum; Triticum turgidum subsp. durum; chloroplast genome; inverted repeats; polyploidy; wheat
Mesh:
Year: 2022 PMID: 35269924 PMCID: PMC8911259 DOI: 10.3390/ijms23052783
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Genome map of the T. sphaerococcum and T. turgidum subsp. durum plastomes. The extent of the IR regions is represented by dark colors, which divide the cp genome into large (LSC) and small (SSC) single copy regions. Genes drawn inside the circle are transcribed clockwise, whereas those outside of the circle are transcribed counter-clockwise. Genes belonging to different functional groups are color coded. The light green in the inner circle corresponds to the GC content, whereas the dark green corresponds to the AT content.
Summary of the genome features of complete Triticum plastomes.
| Size (bp) | GC (%) | LSC (bp) | SSC (bp) | IR (bp) | PCD (bp) | tRNA (bp) | rRNA (bp) | Gene (n) | PCD (n) | tRNA (n) | Accesion No. | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 134,531 | 38.3 | 80,342 | 12,791 | 20,699 | 59,538 | 3004 | 9192 | 131 | 84 | 39 | MZ230675 |
| 134,015 | 38.3 | 79,817 | 12,788 | 20,701 | 59,538 | 3004 | 9192 | 131 | 84 | 39 | MZ230674 | |
| 134,545 | 38.3 | 80,348 | 12,790 | 20,703 | 59,946 | 3264 | 9191 | 133 | 83 | 42 | NC002762 | |
| 135,919 | 38.3 | 56,296 | 13,144 | 21,541 | 58,701 | 2469 | 9192 | 123 | 82 | 33 | KJ614403 | |
| 133,873 | 38.3 | 56,228 | 12,791 | 20,573 | 59,664 | 2943 | 9192 | 131 | 84 | 39 | KJ592713 | |
| 135,835 | 38.3 | 56,254 | 12,792 | 21,541 | 58,683 | 2469 | 9192 | 123 | 82 | 33 | KJ614396 | |
|
| 135,899 | 38.3 | 56,298 | 12,791 | 21,522 | 54,723 | 2480 | 9190 | 112 | 72 | 32 | NC_025955 |
| 136,886 | 38.3 | 56,492 | 12,806 | 21,547 | 54,723 | 2480 | 9192 | 112 | 72 | 32 | LC005977 | |
| 136,157 | 38.3 | 56,550 | 12,795 | 21,553 | 58,686 | 2469 | 9192 | 123 | 82 | 33 | NC024764 | |
| 136,074 | 38.3 | 56,474 | 12,789 | 21,553 | 58,686 | 2469 | 9192 | 123 | 82 | 33 | KJ614407 | |
| 136,124 | 38.3 | 56,523 | 12,789 | 21,553 | 58,686 | 2469 | 9192 | 123 | 82 | 33 | KJ614409 | |
| 136,119 | 38.3 | 56,518 | 12,789 | 21,553 | 58,686 | 2469 | 9192 | 123 | 82 | 33 | KJ614408 | |
| 135,926 | 38.3 | 56,284 | 12,792 | 21,541 | 58,701 | 2469 | 9192 | 123 | 82 | 33 | KJ614401 | |
| 135,865 | 38.3 | 56,266 | 12,792 | 21,541 | 58,683 | 2469 | 9192 | 123 | 82 | 33 | KJ614400 | |
| 135,889 | 38.3 | 56,287 | 12,793 | 21,541 | 58,701 | 2469 | 9192 | 123 | 82 | 33 | KJ614402 | |
| 135,835 | 38.3 | 56,254 | 12,792 | 21,541 | 85,683 | 2469 | 9192 | 123 | 82 | 33 | KJ614399 | |
| 135,835 | 38.3 | 56,254 | 12,792 | 21,541 | 58,683 | 2469 | 9192 | 123 | 82 | 33 | NC_024814 | |
| 135,836 | 38.3 | 56,255 | 12,792 | 21,541 | 58,683 | 2469 | 9192 | 123 | 82 | 33 | KJ614398 | |
| 135,948 | 38.3 | 56,293 | 15,992 | 17,066 | 61,746 | 2913 | 9192 | 136 | 89 | 39 | KM352501 | |
| 136,865 | 38.3 | 56,454 | 12,824 | 21,547 | 58,683 | 2469 | 9192 | 123 | 82 | 33 | KJ614411 | |
| 136,028 | 38.3 | 80,257 | 12,790 | 21,495 | 59,208 | 2871 | 9061 | 128 | 82 | 38 | NC_046698 |
Gene composition in T. sphaerococcum and T. turgidum subsp. durum plastomes.
| Category of Genes | Group of Genes | Name of Genes |
|---|---|---|
| Genes for photosynthesis | Subunits of ATP synthase | |
| Subunits of photosystem II | ||
| Subunits of NADH-dehydrogenase | ||
| Subunits of cytochrome b/f complex | ||
| Subunits of photosystem I | ||
| Subunit of rubisco |
| |
| Self-replication | Large subunit of ribosome | |
| DNA dependent RNA polymerase | ||
| Small subunit of ribosome | ||
| Other genes | c-type cytochrom synthesis gene | ccsA |
| Envelop membrane protein |
| |
| Protease |
| |
| Translational initiation factor |
| |
| Maturase |
| |
| Unknown | Conserved open reading frames |
|
* represent duplicated genes.
Figure 2Summary of genes lost across Triticum plastomes. Blue color shows the missing genes, whereas the red color shows the genes duplicated in plastomes.
The genes with introns in the T. sphaerococcum and T. turgidum subsp. durum plastomes and the length of exons and introns.
| Gene | Strand | ExonI | IntronI | ExonII | IntronII | ExonIII | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| T.S | T.T | T.S | T.T | T.S | T.T | T.S | T.T | T.S | T.T | ||
| − | 38 | 38 | 2486 | 2490 | 36 | 36 | |||||
|
| − | 40 | 40 | 843 | 845 | 218 | 218 | ||||
| − | 31 | 31 | 658 | 658 | 62 | 62 | |||||
|
| + | 145 | 145 | 825 | 825 | 407 | 407 | ||||
|
| − | 124 | 124 | 756 | 756 | 230 | 230 | 731 | 731 | 159 | 159 |
|
| + | 114 | 114 | 223 | 223 | 540 | 540 | 32 | 32 | ||
| + | 36 | 36 | 588 | 588 | 51 | 51 | |||||
| − | 38 | 38 | 581 | 581 | 53 | 53 | |||||
|
| + | 6 | 6 | 748 | 748 | 642 | 642 | ||||
|
| + | 8 | 8 | 749 | 749 | 475 | 475 | ||||
|
| − | 9 | 9 | 1043 | 1043 | 402 | 402 | ||||
|
| − | 385 | 388 | 663 | 663 | 431 | 431 | ||||
|
| − | 775 | 775 | 712 | 712 | 758 | 758 | ||||
| + | 32 | 32 | 787 | 787 | 59 | 59 | |||||
| + | 37 | 37 | 805 | 805 | 36 | 36 | |||||
|
| − | 550 | 550 | 1036 | 1036 | 539 | 539 | ||||
| − | 37 | 37 | 805 | 805 | 36 | 36 | |||||
| − | 32 | 32 | 787 | 787 | 59 | 59 | |||||
* Represent duplicated genes. T. sphaerococcum = (T.S), T. turgidum subsp. durum = (T.T).
Figure 3Analysis of repeated sequences in the 21 Triticum plastomes. (A) Total number of three-repeat types in all plastomes. (B) Number of palindromic repeats by length. (C) Number of forward repeats by length. (D) Number of tandem repeats by length. (E) Pair plot showing the distribution of repeats in Triticum species. (F) Correlation among different characteristics of Triticum plastomes.
Figure 4Analysis of simple sequence repeats (SSRs) in the 21 Triticum plastomes. (A) SSR numbers detected in the 25 species whole plastomes. (B) Frequency of identified SSRs in Triticum the small single copy (SSC) regions. (C) Frequency of identified SSRs in inverted repeat (IR) region. (D) Frequency of identified SSRs in the long single copy (LSC) region. (E) For T. turgidum subsp. turgidum, pairwise sequence distance of 42 protein-coding genes shared with the plastomes of related species.
Figure 5Maximum credible molecular chronogram (time tree) of Triticum from BEAST based on whole plastomes, with branch lengths proportional to time and lognormal fossil-based calibrations. The GTR + G substitution model was used with four rate categories and a Yule tree speciation model was applied with a lognormal relaxed clock model in BEAST. Different color branches represent Triticum species based on their genomes. The diploid, tetraploid, and hexaploid genomes are shown at the right side of the tree. The 95% highest posterior density credibility intervals are shown for the node ages in black circles (mya). Numbers indicate date estimates for different nodes. A geological time scale is shown at the bottom of the figure.
Figure 6(A) Maximum likelihood tree for Triticum plastome sequences using 1000 bootstrap repetitions with four outgroup Hordeum species: Hordeum vulgare subsp. vulgare, Hordeum vulgare subsp. spontaneum, Hordeum jubatum, and Hordeum bogdani. Different colors represent Triticum species based on their genome types. (B) Divergence time estimates of Triticeae species based on 42 protein-coding shared genes from Triticeae. The GTR + G substitution model was used with four rate categories and a Yule tree speciation model was applied with a lognormal relaxed clock model in BEAST. Different color branches represent Triticum species based on their genomes. The diploid, tetraploid, and hexaploid genomes are shown at the right side of the tree. The 95% highest posterior density credibility intervals are shown for the node ages in black circles (mya). Numbers indicate date estimates for different nodes. A geological time scale is shown at the bottom of the figure.