| Literature DB >> 29980662 |
Olivier Garsmeur1,2, Gaetan Droc1,2, Rudie Antonise3, Jane Grimwood4, Bernard Potier5, Karen Aitken6, Jerry Jenkins4, Guillaume Martin1,2, Carine Charron1,2, Catherine Hervouet1,2, Laurent Costet7, Nabila Yahiaoui1,2, Adam Healey4, David Sims4, Yesesri Cherukuri4, Avinash Sreedasyam4, Andrzej Kilian8, Agnes Chan9, Marie-Anne Van Sluys10, Kankshita Swaminathan4, Christopher Town9, Hélène Bergès11, Blake Simmons12, Jean Christophe Glaszmann1,2, Edwin van der Vossen3, Robert Henry13, Jeremy Schmutz4,14, Angélique D'Hont15,16.
Abstract
Sugarcane (Saccharum spp.) is a major crop for sugar and bioenergy production. Its highly polyploid, aneuploid, heterozygous, and interspecific genome poses major challenges for producing a reference sequence. We exploited colinearity with sorghum to produce a BAC-based monoploid genome sequence of sugarcane. A minimum tiling path of 4660 sugarcane BAC that best covers the gene-rich part of the sorghum genome was selected based on whole-genome profiling, sequenced, and assembled in a 382-Mb single tiling path of a high-quality sequence. A total of 25,316 protein-coding gene models are predicted, 17% of which display no colinearity with their sorghum orthologs. We show that the two species, S. officinarum and S. spontaneum, involved in modern cultivars differ by their transposable elements and by a few large chromosomal rearrangements, explaining their distinct genome size and distinct basic chromosome numbers while also suggesting that polyploidization arose in both lineages after their divergence.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29980662 PMCID: PMC6035169 DOI: 10.1038/s41467-018-05051-5
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Schematic representation of the genome of a typical modern sugarcane cultivar. Each bar represents a chromosome, in orange or red when originating from S. officinarum or S. spontaneum, respectively. Chromosomes aligned on the same row are hom(oe)ologous chromosomes (HG). The key characteristics of this genome are the high polyploidy, aneuploidy, bispecific origin of the chromosomes, the existence of structural differences between chromosomes of the two origins, and the presence of interspecific chromosome recombinants
Fig. 2Sequencing strategy targeting the sugarcane monoploid genome based on the overall synteny and colinearity conservation within sugarcane hom(oe)ologs and with sorghum. a WGP sequence tags were produced from R570 sugarcane BACs. b WGP sequence tags were aligned onto the sorghum sequence, thus allowing the location of sugarcane BACs on sorghum. c A minimum tiling path of a BAC (MTP) corresponding to a monoploid sugarcane genome was defined and sequenced. d Overlapping BAC sequences were trimmed to construct the single tiling path (STP). e The STP sequence contains BAC contigs that belong to distinct hom(oe)ologous chromosomes. S. officinarum and S. spontaneum chromosome segments are represented in orange and red, respectively
Selection and sequencing of BACs targeting the gene-rich part of the sugarcane monoploid genome
| Selection and sequencing of a sugarcane minimum tiling path (MTP) of BACs | Sugarcane single tiling path (STP) | |||||
|---|---|---|---|---|---|---|
| Sorghum chromosome (Mb) | Nb of sugarcane BACs anchored | Nb of BACs sequenced (Mb) | Mosaic super scaffolds (Mb) | Genes | TE | |
| Nb | % | % | ||||
| Sb01 (81) | 1924 | 778 (94) | Sh01 (67) | 4614 | 15 | 44 |
| Sb02 (78) | 1598 | 594 (68) | Sh02 (49) | 3270 | 13 | 43 |
| Sb03 (74) | 1624 | 634 (74) | Sh03 (51) | 3540 | 14 | 43 |
| Sb04 (69) | 1261 | 496 (56) | Sh04 (42) | 2881 | 14 | 44 |
| Sb05 (72) | 827 | 289 (33) | Sh05 (22) | 1337 | 11 | 41 |
| Sb06 (61) | 1060 | 404 (48) | Sh06 (33) | 2189 | 13 | 45 |
| Sb07 (66) | 871 | 305 (36) | Sh07 (28) | 1903 | 13 | 42 |
| Sb08 (63) | 623 | 265 (31) | Sh08 (23) | 1381 | 12 | 43 |
| Sb09 (59) | 891 | 391 (46) | Sh09 (34) | 2143 | 13 | 44 |
| Sb10 (61) | 1053 | 379 (45) | Sh10 (32) | 2058 | 16 | 43 |
| 683 Mb | 11,732 | 4535 (531 Mb) | 382 Mb | 25,316 | 13 | 43 |
Fig. 3Distribution of the 11,732 sugarcane BACs aligned onto the sorghum genome through WGP. Sugarcane BAC clones are represented by orange bars. Sorghum gene and transposable element densities are represented in green and gray, respectively
Fig. 4Coverage of the sorghum genome by the sequenced sugarcane BAC MTP. Dot plot with alignment of BAC sequences from the sugarcane MTP (y-axis) on the sorghum genomic sequence (x-axis). The sorghum genome covered by the sugarcane MTP sequence is highlighted in orange. Gray segments correspond to duplicated regions resulting from ancestral whole-genome duplications in Poales
Transposable element (TE) contents in the single tiling path (STP) and in whole-genome sequencing data from the R570 cultivar, one S. officinarum (S. OFF) and one S. spontaneum (S. SPONT)
| R570 STP annotation | Repeat explorer clusters assembled from WGS data | ||||||
|---|---|---|---|---|---|---|---|
| STP BAC | S. OFF BAC | S. SPONT BAC | STP reads | R570 WGS | S. OFF WGS | S. SPONT WGS | |
| TE (all) | 43.1% | 47.3% | 38.3% | 41.2% | 50.9% | 53.6% | 46.5% |
| DNA transposons | 4.6% | 4.9% | 4.3% | 1.8% | 1.8% | 1.9% | 1.9% |
| TIR | |||||||
| | 2.79% | 2.96% | 2.68% | 0.24% | 0.24% | 0.23% | 0.28% |
| | 1.15% | 1.23% | 1.02% | 0.71% | 0.82% | 0.86% | 0.81% |
| | 0.51% | 0.53% | 0.51% | 0.65% | 0.43% | 0.43% | 0.53% |
| | 0.18% | 0.21% | 0.12% | 0.25% | 0.34% | 0.37% | 0.28% |
| LTR retrotransposons | 36.8% | 40.9% | 32.3% | 29.8% | 38.1% | 40.1% | 34.6% |
| Copia | 16.09% | 17.53% | 14.44% | 13.34% | 14.01% | 15.25% | 12.09% |
| | 2.51% | 2.42% | 3.01% | 1.43% | 0.66% | 0.57% | 0.83% |
| | 1.09% | 1.19% | 1.00% | 1.01% | 0.95% | 0.99% | 0.80% |
| | 0.02% | 0.02% | 0.03% | 0.15% | 0.13% | 0.13% | 0.15% |
| | 0.16% | 0.16% | 0.14% | 0.37% | 0.22% | 0.25% | 0.15% |
| | 9.53% | 10.74% | 7.81% | 10.02% | 11.69% | 12.92% | 9.82% |
| | 0.11% | 0.11% | 0.14% | 0.25% | 0.24% | 0.25% | 0.31% |
| | 0.04% | 0.06% | 0.00% | 0.11% | 0.11% | 0.13% | 0.03% |
| | 2.63% | 2.83% | 2.31% | 0.00% | 0.00% | 0.00% | 0.00% |
| Gypsy | 20.39% | 23.06% | 17.51% | 16.46% | 24.10% | 24.80% | 22.54% |
| | 1.42% | 1.32% | 1.99% | 1.71% | 3.47% | 3.27% | 5.23% |
| | 11.00% | 13.29% | 7.38% | 10.47% | 16.65% | 17.62% | 11.32% |
| | 4.84% | 4.85% | 5.65% | 4.27% | 3.98% | 3.91% | 5.98% |
| | 0.84% | 0.95% | 0.69% | 0.00% | 0.00% | 0.00% | 0.00% |
| | 2.29% | 2.64% | 1.80% | 0.00% | 0.00% | 0.00% | 0.00% |
| RLX | 0.35% | 0.35% | 0.39% | 0.00% | 0.00% | 0.00% | 0.00% |
| Unclassified | 1.6% | 1.4% | 1.7% | 9.6% | 10.9% | 11.6% | 10.0% |
Fig. 5SNP-based sugarcane genetic map with putative origin of cosegregation groups and comparison with sorghum chromosomes. The 132 CGs of cultivar R570 are represented with SNP markers assigned to S. officinarum or S. spontaneum indicated by green and red bars, respectively. Circos represents orthologous relationships between sugarcane CGs and sorghum chromosomes (Sb1–Sb10) based on the alignment, for each CG, of a majority of the markers on one (a, b) or two (c) sorghum chromosomes (color links) (see Supplementary Fig. 2 for representation of all links). Based on these orthologous relationships, CGs were assembled in hom(oe)ology groups (HGs): a Four HGs (HG1–HG4) including CGs from S. officinarum, S. spontaneum, and interspecific recombinants orthologous to one sorghum chromosome. b Six HGs (HG6–HG10) including S. officinarum CGs, and a few S. spontaneum and interspecific recombinant CGs orthologous to one sorghum chromosome. Arrows point to two interspecific recombinations (see text). c Two pairs of HGs (sHG5 and sHG6, and sHG7 and sHG8), each including S. spontaneum or interspecific recombinant CGs orthologous to two sorghum chromosomes. d Schematic comparison of the deduced basic genome organization in S. spontaneum (x = 8), S. officinarum (x = 10), and sorghum (x = 10). Asterisk represents CGs from the MQ76-53 SNP map (see text and Supplementary Fig. 3)