| Literature DB >> 27047520 |
Vagner Katsumi Okura1, Rafael S C de Souza2, Susely F de Siqueira Tada3, Paulo Arruda4.
Abstract
Sequencing plant genomes are often challenging because of their complex architecture and high content of repetitive sequences. Sugarcane has one of the most complex genomes. It is highly polyploid, preserves intact homeologous chromosomes from its parental species and contains >55% repetitive sequences. Although bacterial artificial chromosome (BAC) libraries have emerged as an alternative for accessing the sugarcane genome, sequencing individual clones is laborious and expensive. Here, we present a strategy for sequencing and assembly reads produced from the DNA of pooled BAC clones. A set of 178 BAC clones, randomly sampled from the SP80-3280 sugarcane BAC library, was pooled and sequenced using the Illumina HiSeq2000 and PacBio platforms. A hybrid assembly strategy was used to generate 2,451 scaffolds comprising 19.2 MB of assembled genome sequence. Scaffolds of ≥20 Kb corresponded to 80% of the assembled sequences, and the full sequences of forty BACs were recovered in one or two contigs. Alignment of the BAC scaffolds with the chromosome sequences of sorghum showed a high degree of collinearity and gene order. The alignment of the BAC scaffolds to the 10 sorghum chromosomes suggests that the genome of the SP80-3280 sugarcane variety is ∼19% contracted in relation to the sorghum genome. In conclusion, our data show that sequencing pools composed of high numbers of BAC clones may help to construct a reference scaffold map of the sugarcane genome.Entities:
Keywords: BAC pool sequencing; sorghum; sugarcane; sugarcane genome; synteny
Year: 2016 PMID: 27047520 PMCID: PMC4804495 DOI: 10.3389/fpls.2016.00342
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Size distribution of scaffolds assembled using A hybrid assembly (AHA) strategy.
| Scaffold length | N° scaffolds | Total bases | Bases (% total) |
|---|---|---|---|
| <2,000 | 1,758 | 743,310 | 3.88 |
| 2,000–10,000 | 321 | 1,700,748 | 8.88 |
| 10,000–20,000 | 104 | 1,480,189 | 7.73 |
| 20,000–40,000 | 110 | 3,309,119 | 17.27 |
| 40,000–60,000 | 63 | 3,087,796 | 16.12 |
| 60,000–80,000 | 36 | 2,464,222 | 12.86 |
| 80,000–100,000 | 31 | 2,715,881 | 14.18 |
| 100,000–120,000 | 7 | 797,899 | 4.17 |
| 120,000–140,000 | 16 | 2,062,825 | 10.77 |
| 140,000–160,000 | 4 | 590,630 | 3.08 |
| >160,000 | 1 | 203,132 | 1.06 |
| Total | 2,451 | 19,155,751 | |
| Estimated total bases | 21,717,887 | ||
Summary of repetitive sequences among the sugarcane bacterial artificial chromosome (BACs).
| Repeat element | Number of elements | Length (bp) | % of Total bases |
|---|---|---|---|
| Transposable element | 1,279 | 10,209,529 | 53.30 |
| DNA transposon | 407 | 1,479,344 | 7.72 |
| EnSpm/CACTA | 102 | 545,515 | 2.85 |
| Harbinger | 77 | 282,933 | 1.48 |
| Helitron | 28 | 74,200 | 0.39 |
| Mariner/Tc1 | 11 | 7,401 | 0.04 |
| MuDR | 44 | 218,853 | 1.14 |
| hAT | 87 | 162,922 | 0.85 |
| Other | 58 | 187,520 | 0.98 |
| LTR retrotransposon | 732 | 8,297,946 | 43.32 |
| Copia | 291 | 2,473,755 | 12.91 |
| Gypsy | 426 | 5,795,891 | 30.26 |
| Other | 15 | 28,300 | 0.15 |
| Non-LTR retrotransposon | 138 | 431,477 | 2.25 |
| Other | 2 | 762 | 0.004 |
| Simple repeat | 9 | 206,466 | 1.08 |
| Satellite | 9 | 206,466 | 1.08 |
| Integrated virus | 3 | 43,599 | 0.23 |
| Caulimoviridae | 3 | 43,599 | 0.23 |
| Unclassified | 5 | 3,965 | 0.02 |
| Total of repeat elements | 1,296 | 10,463,559 | 54.62 |
| Total of assembled bases | 19,155,751 | 100.00 | |
Scaffolds longer than 2,000 bp mapped to sorghum chromosomes.
| Chromosome | Total bases | Bases (% Total) | N° Scaffolds | Scaffold size range |
|---|---|---|---|---|
| 1 | 1,821,039 | 9,89 | 42 | 3,008–203,132 |
| 2 | 1,607,121 | 8,73 | 36 | 3,106–152,524 |
| 3 | 1,589,753 | 8,63 | 44 | 2,882–138,830 |
| 4 | 1,716,141 | 9,32 | 37 | 3,734–141,339 |
| 5 | 1,121,959 | 6,09 | 40 | 2,086–125,065 |
| 6 | 689,413 | 3,74 | 15 | 6,387–97,701 |
| 7 | 1,268,000 | 6,89 | 22 | 7,190–151,964 |
| 8 | 722,184 | 3,92 | 14 | 10,282–123,118 |
| 9 | 1,270,664 | 6,90 | 24 | 4,562–137,971 |
| 10 | 675,823 | 3,67 | 18 | 2,688–135,690 |
| No Mapped | 5,930,344 | 32,21 | 401 | 2,003–129,783 |
| Total | 18,412,441 | 693 | ||
Expanded and contracted regions between sorghum chromosomes and sugarcane scaffolds.
| Sorghum chromosome | Number of scaffolds | Scaffold mapped size | Chromosome mapped size | Syntenic sorghum/sugarcane rate | Sum of sugarcane expanded regions (bp) | Sum of sugarcane contracted regions (bp) | Sum of sorghum expanded regions (bp) | Sum of sorghum contracted regions (bp) |
|---|---|---|---|---|---|---|---|---|
| 1 | 42 | 866,716 | 905,270 | 1.04 | 556,830 | 309,886 | 608,863 | 296,407 |
| 2 | 36 | 857,217 | 1,124,624 | 1.31 | 547,673 | 309,544 | 830,799 | 293,825 |
| 3 | 44 | 977,446 | 1,036,227 | 1.06 | 627,837 | 349,609 | 601,346 | 434,881 |
| 4 | 37 | 994,271 | 1,115,670 | 1.12 | 708,032 | 286,239 | 653,976 | 461,694 |
| 5 | 40 | 555,615 | 721,043 | 1.30 | 327,292 | 228,323 | 568,417 | 152,626 |
| 6 | 15 | 328,515 | 455,977 | 1.39 | 163,097 | 165,418 | 354,613 | 101,364 |
| 7 | 22 | 705,244 | 856,835 | 1.21 | 418,892 | 286,352 | 635,808 | 221,027 |
| 8 | 14 | 296,242 | 417,336 | 1.41 | 140,887 | 155,355 | 320,335 | 97,001 |
| 9 | 24 | 590,229 | 699,434 | 1.19 | 384,530 | 205,699 | 449,331 | 250,103 |
| 10 | 18 | 379,187 | 476,686 | 1.26 | 281,509 | 97,678 | 264,014 | 212,672 |
| Total | 292 | 6,550,682 | 7,809,102 | 1.19 | 4,156,579 | 2,394,103 | 5,287,502 | 2,521,600 |