| Literature DB >> 31245697 |
Robert VanBuren1,2, Ching Man Wai1, Jens Keilwagen3, Jeremy Pardo4.
Abstract
Oropetium thomaeum is an emerging model for desiccation tolerance and genome size evolution in grasses. A draft genome of Oropetium was recently sequenced, but the lack of a chromosome-scale assembly has hindered comparative analyses and downstream functional genomics. Here, we reassembled Oropetium, and anchored the genome into 10 chromosomes using high-throughput chromatin conformation capture (Hi-C) based chromatin interactions. A combination of high-resolution RNAseq data and homology-based gene prediction identified thousands of new, conserved gene models that were absent from the V1 assembly. This includes thousands of new genes with high expression across a desiccation timecourse. Comparison between the Sorghum and Oropetium genomes revealed a surprising degree of chromosome-level collinearity, and several chromosome pairs have near perfect synteny. Other chromosomes are collinear in the gene rich chromosome arms but have experienced pericentric translocations. Together, these resources will be useful for the grass-comparative genomic community and further establish Oropetium as a model resurrection plant.Entities:
Keywords: Hi‐C; chromosome‐scale; comparative genomics; desiccation tolerance; grasses
Year: 2018 PMID: 31245697 PMCID: PMC6508818 DOI: 10.1002/pld3.96
Source DB: PubMed Journal: Plant Direct ISSN: 2475-4455
Comparison of the Oropetium V1 and V2 assembly and annotation statistics
| Statistics | V1 | V2 |
|---|---|---|
| # of contigs | 625 | 436 |
| Contig N50 | 2.38 Mb | 2.02 Mb |
| Scaffold N50 | NA | 20.5 Mb |
| Total assembly size | 243 Mb | 236 Mb |
| Gene models | 28,446 | 28,835 |
| BUSCO | 72.1% | 98.9% |
Figure 1Hi‐C based contig anchoring. Post‐clustering heat map showing density of Hi‐C interactions between contigs from the Juicer and 3d‐DNA pipeline. The 10 Oropetium chromosomes are highlighted by blue squares
Figure 2Characterization of the updated V2 Oropetium annotation. (a) Tandem gene array size comparison of the V1 and V2 annotation. Tandem genes identified in V1 are shown in blue and tandem genes newly annotated in V2 are shown in gold. (b) Comparison of expression patterns from the V1 and V2 annotation. The total number of genes with detectable expression and differential expression (DE) in the Oropetium desiccation/rehydration timecourse are plotted
Figure 3Landscape of the Oropetium genome. Gypsy and Copia long terminal repeat retrotransposons (LTR‐RT) and CDS density are plotted for the 10 Oropetium chromosomes. Features are plotted in sliding windows of 50 kb with 25 kb step size. The location of centromere specific tandem arrays is highlighted by red bars. The heat maps below each landscape show relative density with red indicating high density and blue indicating low density for each feature
Centromeric repeat array composition
| Chromosome | Start cent. array (bp) | End cent. array (bp) | Number of cent. repeats | Cent. size (bp) |
|---|---|---|---|---|
| Chr_1 | 18,899,082 | 19,114,162 | 154 | 215,080 |
| Chr_2 | 18,277,215 | 18,463,229 | 786 | 186,014 |
| Chr_3 | 18,882,303 | 18,993,598 | 308 | 111,295 |
| Chr_4 | 11,739,636 | 13,338,554 | 176 | 1,598,918 |
| Chr_5 | 10,361,368 | 10,828,355 | 800 | 466,987 |
| Chr_6 | 3,649,010 | 3,746,417 | 513 | 97,407 |
| Chr_7 | 12,434,273 | 12,559,564 | 272 | 125,291 |
| Chr_8 | 8,288,262 | 9,010,114 | 306 | 721,852 |
| Chr_9 | 6,142,739 | 7,433,209 | 1,044 | 1,290,470 |
| Chr_10 | 3,147,692 | 3,209,432 | 155 | 61,740 |
| Unanchored | 4,258 | 982,774 |
Figure 4Comparative genomics between Oropetium and Sorghum. (a) Macrosyntenic dotplot of the Oropetium and Sorghum chromosomes based on 18,889 gene pairs. Each black dot represents a syntenic region between the two genomes. (b) Microsynteny of a typical genic region of Sorghum and Oropetium (top) and the pericentromeric region of Chromosome 6 of Oropetium and Sorghum (bottom). LTR‐RTs are shown in yellow and genes are shown in blue. Syntenic orthologs are connected by gray lines. The centromeric repeat array in Oropetium is shown in red