| Literature DB >> 32060277 |
Robert VanBuren1,2, Ching Man Wai3,4, Xuewen Wang5, Jeremy Pardo3,4,6, Alan E Yocca3,6, Hao Wang5, Srinivasa R Chaluvadi5, Guomin Han5, Douglas Bryant7, Patrick P Edger3, Joachim Messing8, Mark E Sorrells9, Todd C Mockler7, Jeffrey L Bennetzen5, Todd P Michael10.
Abstract
Teff (Eragrostis tef) is a cornerstone of food security in the Horn of Africa, where it is prized for stress resilience, grain nutrition, and market value. Here, we report a chromosome-scale assembly of allotetraploid teff (variety Dabbi) and patterns of subgenome dynamics. The teff genome contains two complete sets of homoeologous chromosomes, with most genes maintaining as syntenic gene pairs. TE analysis allows us to estimate that the teff polyploidy event occurred ~1.1 million years ago (mya) and that the two subgenomes diverged ~5.0 mya. Despite this divergence, we detect no large-scale structural rearrangements, homoeologous exchanges, or biased gene loss, in contrast to many other allopolyploids. The two teff subgenomes have partitioned their ancestral functions based on divergent expression across a diverse expression atlas. Together, these genomic resources will be useful for accelerating breeding of this underutilized grain crop and for fundamental insights into polyploid genome evolution.Entities:
Mesh:
Year: 2020 PMID: 32060277 PMCID: PMC7021729 DOI: 10.1038/s41467-020-14724-z
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Hi-C-based clustering of the teff genome.
Heat map showing the density of Hi-C interactions between contigs, with red indicating high density of interactions. Distinct chromosomes are highlighted by blue boxes and homoeologous chromosome pairs are numbered.
Summary statistics of the teff genome.
| Chromosome | Size (bp) | Number of contigs | Number of genes | Number of tandem duplicates | Repetive element content (%) |
|---|---|---|---|---|---|
| 1A | 40,621,098 | 35 | 5135 | 465 | 27.5 |
| 1B | 35,710,944 | 32 | 4829 | 469 | 22.3 |
| 2A | 35,425,885 | 45 | 4398 | 441 | 26.1 |
| 2B | 30,633,641 | 23 | 4112 | 382 | 20.3 |
| 3A | 34,643,735 | 47 | 4415 | 404 | 25.2 |
| 3B | 32,575,812 | 43 | 4370 | 417 | 22.4 |
| 4A | 32,664,196 | 39 | 4224 | 318 | 29.9 |
| 4B | 29,936,223 | 32 | 4127 | 294 | 26.1 |
| 5A | 26,945,638 | 29 | 2899 | 403 | 31.7 |
| 5B | 24,206,550 | 36 | 2785 | 385 | 34.5 |
| 6A | 27,140,163 | 46 | 2409 | 365 | 40.2 |
| 6B | 19,415,607 | 31 | 1992 | 225 | 26.3 |
| 7A | 26,459,500 | 44 | 3006 | 315 | 33.6 |
| 7B | 23,383,462 | 34 | 2843 | 307 | 30.4 |
| 8A | 24,151,120 | 26 | 2464 | 270 | 32.2 |
| 8B | 21,147,804 | 28 | 2373 | 239 | 25.9 |
| 9A | 24,589,398 | 38 | 2736 | 292 | 31.1 |
| 9B | 21,940,566 | 23 | 2673 | 270 | 28.3 |
| 10A | 23,813,772 | 24 | 2346 | 268 | 20.3 |
| 10B | 20,101,091 | 32 | 2151 | 227 | 17.1 |
| Unanchored | 22,232,506 | 657 | 1968 | 130 | 18.2 |
| Total | 577,738,711 | 1344 | 68,255 | 6886 | 26.5 |
Fig. 2Collinearity of tef pseudomolecules with the high-density genetic map.
Two example chromosomes demonstrate a pseudomolecule spanning three linkage groups (top) and a pseudomolecule spanning a single linkage group (bottom). Lines connect the genetic makers with their physical location on the pseudomolecules. p Values within the scatterplots indicate the Pearson’s correlation coefficient of marker distance (cM) and physical distance (bp). Source data are provided as a Source Data file.
Summary of the repeat sequence distribution in the teff genome.
| Class | Subclass | Superfamily | Number of families | Loci | Size (Mb) | Genome % |
|---|---|---|---|---|---|---|
| SSR | SSR | NA | 1 | 116,936 | 5.2 | 0.9 |
| Class I | LTR | Gypsy | 944 | 54,384 | 71.8 | 12.4 |
| LTR | Unknown | 946 | 55,889 | 32.5 | 5.6 | |
| LTR | Copia | 330 | 13,571 | 11.6 | 2 | |
| LINE | L1 | 37 | 2784 | 1.6 | 0.3 | |
| LINE | I | 5 | 17 | 0 | ~0 | |
| SINE | Unknown | 109 | 14,909 | 2.4 | 0.4 | |
| Class II | TIR | Tc1 | 793 | 81,715 | 14.9 | 2.6 |
| TIR | CACTA | 266 | 25,197 | 4.4 | 0.8 | |
| TIR | hAT | 77 | 7084 | 1.4 | 0.2 | |
| TIR | PIF | 48 | 5746 | 1.1 | 0.2 | |
| TIR | Mutator | 26 | 3238 | 0.6 | 0.1 | |
| TIR | Unknown | 1 | 247 | 0 | ~0 | |
| Helitron | Helitron | 105 | 21,977 | 5.6 | 1 | |
| Total | 153.1 | 26.5 |
Fig. 3Insertion dynamics of 65 LTR-RT families in teff.
Box plots of insertion time for the 65 LTR-RT families having ≥5 intact LTR elements are plotted. Families 1–5 have ≥100 intact LTRs, 6–33 have ≥10 LTRs, and 34–65 have ≥5 LTRs. The exact number of LTR-RTs in each family is available in the TE annotation gff file. The six subgenome-specific families are highlighted in blue and the estimated range for the teff polyploidy event is shown in brown. A substitution rate of 1.3e-8 per site per year was used to infer the element insertion times. Box boundaries indicate the 25th and 75th percentiles of the insertion time and whiskers extend to 1.5 times the interquartile range.
Fig. 4Comparative genomics of the teff genome.
a Ratio of syntenic depth between Oropetium and teff. Syntenic blocks of Oropetium per teff gene (left) and syntenic blocks of teff per Oropetium gene (right) are shown indicating a clear 1:2 pattern of Oropetium to teff. b Microsynteny of the teff and Oropetium genomes. A region of the Oropetium chromosome 1 and the corresponding syntenic regions in homoeologous teff chromosomes 1A and 1B are shown. Genes are shown in red and blue (for forward and reverse orientation, respectively) and syntenic gene pairs are connected by gray lines. c Macrosynteny of the teff and Oropetium genomes. Syntenic gene pairs are denoted by gray points. d Collineariy of the teff subgenomes. The ten chromosomes belonging to the teff A and B subgenomes are shown in yellow and purple, respectively. Syntenic blocks between homoeologous regions are shown in grey. Source data underlying Fig. 4c are provided as a Source Data file.
Fig. 5Homoeolog expression bias between the A and B subgenomes of teff.
a The distribution of homoeolog expression bias (HEB) between all gene pairs in all tissues. An HEB >0 indicates bias toward the A subgenome and a HEB <0 indicates bias toward the B subgenome. b HEB across the ten tissues in the teff expression atlas. Gene pairs were classified as biased toward the A (blue) or B (red) subgenomes or balanced with no statistically significant differential expression (gray). c HEB in each of the ten pairs of chromosomes across all ten tissue types. Source data underlying Fig. 5a are provided as a Source Data file.