| Literature DB >> 27287040 |
Jisen Zhang1,2,3, Anupma Sharma4, Qingyi Yu1,4, Jianping Wang1,5, Leiting Li3,6, Lin Zhu3,7, Xingtan Zhang1, Youqiang Chen2, Ray Ming8,9.
Abstract
BACKGROUND: Sugarcane is a major sugar and biofuel crop, but genomic research and molecular breeding have lagged behind other major crops due to the complexity of auto-allopolyploid genomes. Sugarcane cultivars are frequently aneuploid with chromosome number ranging from 100 to 130, consisting of 70-80 % S. officinarum, 10-20 % S. spontaneum, and 10 % recombinants between these two species. Analysis of a genomic region in the progenitor autoploid genomes of sugarcane hybrid cultivars will reveal the nature and divergence of homologous chromosomes.Entities:
Keywords: Genome evolution; Haplotypes; Homologous genes; Polyploidy; Saccharum
Mesh:
Substances:
Year: 2016 PMID: 27287040 PMCID: PMC4902974 DOI: 10.1186/s12864-016-2817-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of the sequence length, GC content, transposable element content, and gene number
| Species | NO | BAC ID | Length (Kb) | GC content | Transposable elements | Gene number | ||
|---|---|---|---|---|---|---|---|---|
| LTR | Non-LTR | Transposons | ||||||
|
| 1 | 146H19 | 77.5 | 44.7 % | 16.91 % | 1.92 % | 29.47 % | 8 |
| 2 | 99P01 | 74.4 | 44.2 % | 35.94 % | 2.46 % | 28.97 % | 7 | |
| 3 | 96B11 | 101.3 | 46.8 % | 35.12 % | 0.05 % | 24.55 % | 8 | |
| 4 | 57E04 | 95.3 | 45.0 % | 34.90 % | 3.57 % | 16.42 % | 8 | |
|
| 5 | 75D04 | 72.0 | 44.7 % | 11.13 % | 6.57 % | 36.23 % | 10 |
| 6 | 23 K06 | 127.7 | 45.3 % | 26.66 % | 4.70 % | 25.31 % | 11 | |
| Average | 91.5 | 45.2 % | 27.48 % | 3.20 % | 26.16 % | 8.7 | ||
| Average (LA) | 31.19 % | 1.94 % | 24.36 % | |||||
| Average (SES) | 21.02 % | 6.14 % | 29.27 % | |||||
| Total | 549 | - | 52 | |||||
Repeat content in the haplotype sequences of LA Purple (S.officinarum), AP85-441 (S. spontaneum), and the hybrid cultivar, R570
Summary of gapless sequence comparison the haplotypes BACs among S. officinarum(LA Purple) and S. spontaneum(AP85-441)
| BAC name | BAC name | Ss-75D04 | So-57E04 | So-96B11 | So-99P01 | So-146H19 |
|---|---|---|---|---|---|---|
| (Length bp) | Length (bp) | 71,995 | 95,342 | 101,291 | 74,354 | 77,460 |
| Ss-23 K06 | range of aligned sequence | (287–88918)/(302–72582) | (39884–95090)/(17837–88219) | (168–81137)/(8608–80258) | (37335–85913)/(26–66382) | (6628–78055)/(5672–77310) |
| 127,658 | Span of aligned BAC | 68512(77.30 %)/68716(95.07 %) | 33316(60.35 %)/33467(47.55 %) | 51174(63.20 %)/51331(71.64 %) | 29192(60.09 %)/29187(43.99 %) | 49413(69.18 %)/49502(69.10 %) |
| Aligned sequence | 67783 | 32441 | 50240 | 28799 | 48249 | |
| Average identity (%) | 98.37 | 95.92 | 96.08 | 96.72 | 94.53 | |
| SNP (%) | 881(1.30 %) | 620(1.91 %) | 1035(2.06 %) | 506(1.76 %) | 1153(2.39 %) | |
| MNP(%) | 0.43 | 2.17 | 1.86 | 1.52 | 3.08 | |
| Ss-75D04 | range of aligned sequence | (35029–72582)/(17837–83938) | (302–65299)/(8743–80258) | (32480–72726)/(26–74352) | (6303–62979)/(5672–77310) | |
| 71,995 | Span of aligned BAC | 29553(78.70 %)/29618(44.81 %) | 49688(76.45 %)/49719(69.52 %) | 29384(73.01 %)/29378(39.53 %) | 46112(81.36 %)/46037(64.26 %) | |
| Aligned sequence | 28814 | 48973 | 28938 | 45233 | ||
| Average identity (%) | 95.80 | 96.48 | 97.00 | 96.36 | ||
| SNP (%) | 479(1.66 %) | 984(2.01 %) | 489(1.69 %) | 1123(2.48 %) | ||
| 2.54 | 1.51 | 1.31 | 1.16 | |||
| So-57E04 | range of aligned sequence | (9475–36844)/(39893–80175) | (17837–95341)/(6441–67799) | (10894–39851)/(28927–77310) | ||
| 95,342 | Span of aligned BAC | 21851(79.84 %)/21895(54.35 %) | 57223(73.83 %)/57303(93.39 %) | 24491(84.58 %)/24304(50.23 %) | ||
| Aligned sequence | 21601 | 56769 | 23815 | |||
| Average identity (%) | 95.99 | 97.28 | 95.44 | |||
| SNP (%) | 377(1.75 %) | 993(1.75 %) | 473(1.99 %) | |||
| MNP(%) | 2.26 | 0.97 | 2.57 | |||
| So-96B11 | range of aligned sequence | (48130–80175)/(38–27603) | (14403–80175)/(5672–71165) | |||
| 101,291 | Span of aligned BAC | 23235(72.51 %)/23221(84.24 %) | 47909(72.84 %)/47707(72.84 %) | |||
| Aligned sequence | 23083 | 46867 | ||||
| Average identity (%) | 98.39 | 96.57 | ||||
| SNP (%) | 364(1.58 %) | 853(1.82 %) | ||||
| MNP(%) | 0.03 | 1.61 | ||||
| So-99P01 | range of aligned sequence | (24–30055)/(34688–77310) | ||||
| 74,354 | Span of aligned BAC | 24568(81.81 %)/24554(57.61 %) | ||||
| Aligned sequence | 24266 | |||||
| Average identity (%) | 96.98 | |||||
| SNP (%) | 441(1.82 %) | |||||
| MNP(%) | 1.20 |
Notes: A summary of gapless sequence comparison the haplotypes BACs among S.officinarum(LA Purple), S.spontaneum(AP85-441) and Saccarhum cultivar R570 was presented in table S. 4. SNP: single nucleotide polymorphism; MNP: Multiple nucleotide Polymorphisms
Fig. 1Comparison of the genome structures between 14 haplotypes (15 BAC sequences) from LA Purple (S.officinarum), AP85-441 (S. spontaneum), the hybrid cultivar R570, and sorghum. Genes are presented by color pentagon boxes. Psudogenes are marked with star, and TEs are indicated by rectangle and showed by color. Genes are numbered according to Additional file 2
Pairwise SNPs distributions in Saccharum species
| Aligned sequence(bp) | SNP number | SNP % | 100 nt | 250 nt | 500 nt | 1000 nt | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| SD | 0 SNPs (%) | SD | 0 SNPs (%) | SD | 0 SNPs(%) | SD | 0 SNPs(%) | ||||
| So | 196401 | 3501 | 1.78 | 1.73 | 31.35 | 3.57 | 11.26 | 6.52 | 2.77 | 11.77 | 0.40 |
| Ss | 67783 | 881 | 1.30 | 1.34 | 50.22 | 2.94 | 27.52 | 5.51 | 22.30 | 9.78 | 17.39 |
| Ss/So | 311687 | 6389 | 2.05 | 1.76 | 18.66 | 3.54 | 3.31 | 6.27 | 0.49 | 11.32 | 0.00 |
Notes
The SNPs discovery was based on pairwise gapless sequence comparisons
0 SNPs (%): The percentage of fragments for specific sizes (100 nt, 250 nt, 500 nt and 1000 nt) that contained no SNP
SD: The standard variation of SNP numbers among the specific sizes of fragments
So: S. officinarum, Ss: S. spontaneum
The feature of syntenic genes on Saccharum and sorghum bacterial artificial chromosome (BAC) clones
|
|
| Saccharum hybrid- |
|
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ss-75D04 | Ss-23 K06 | So-99P01 | So-57E04 | So-96B11 | So-146H19 | 15 N23 | 197G04 | 142 J21 | 135P16 | 253G12 | 53A11 | CIR9O20/12E03 | 24P17 | ||
| Sorghum | |||||||||||||||
| Gene1 | DNA | 810 | 813 | - | - | 813 | - | - | - | - | - | - | 996 | - | 561 |
| Exons | 1 | 1 | - | - | 1 | - | - | - | - | - | - | 2 | - | 1 | |
| cDNA | 810 | 813 | - | - | 813 | - | - | - | - | - | - | 741 | - | 561 | |
| Amino acids | 269 | 270 | - | - | 270 | - | - | - | - | - | - | 246 | - | 186 | |
| Gene2 | DNA | 1180 | 1195 | - | - | 1195 | 1195 | 1192 | - | - | - | - | 1195 | - | 1192 |
| Exons | 2 | 2 | - | - | 2 | 2 | 2 | - | - | - | - | 2 | - | 2 | |
| cDNA | 1086 | 1101 | - | - | 1101 | 1101 | 1098 | - | - | - | - | 1101 | - | 1098 | |
| Amino acids | 361 | 366 | - | - | 366 | 366 | 365 | - | - | - | - | 366 | - | 365 | |
| Gene3 | DNA | 3216 | 3214 | - | - | 3223a | 3356 | 3449 | - | - | - | - | 3203 | - | 3193 |
| Exons | 7 | 7 | - | - | 7 | 7 | 7 | - | - | - | - | 7 | - | 7 | |
| cDNA | 948 | 948 | - | - | 953a | 948 | 975 | - | - | - | - | 948 | - | 975 | |
| Amino acids | 315 | 315 | - | - | -a | 315 | 324 | - | - | - | - | 315 | - | 324 | |
| Gene4 | DNA | 3894 | 8712 | - | - | 3910 | 3516 | 3888 | - | - | - | - | 3515 | - | 3330 |
| Exons | 10 | 10 | - | - | 10 | 10 | 10 | - | - | - | - | 10 | - | 10 | |
| cDNA | 1002 | 1002 | - | - | 1002 | 993 | 960 | - | - | - | - | 1002 | - | 1032 | |
| Amino acids | 333 | 333 | - | - | 333 | 330 | 319 | - | - | - | - | 333 | - | 343 | |
| Gene5 | DNA | 6972a | 6954a | - | - | 12770 | 7312 | 7912 | - | - | - | 7328 | 7300 | - | 6964 |
| Exons | 14a | 14a | - | - | 14 | 14 | 14 | - | - | - | 14 | 14 | - | 14 | |
| cDNA | 2706a | 2671a | - | - | 2667 | 2676 | 2676 | - | - | - | 2586 | 2670 | - | 2529 | |
| Amino acids | -a | -a | - | - | 888 | 891 | 891 | - | - | - | 861 | 889 | - | 842 | |
| Gene6 | DNA | 3986 | 4542 | 3996 | 3987 | 3999 | 4007 | 3958 | - | - | - | 3990 | 3988 | 4294 | 4144 |
| Exons | 8 | 8 | 8 | 8 | 8 | 8 | 8 | - | - | - | 8 | 8 | 8 | 7 | |
| cDNA | 876 | 873 | 873 | 873 | 873 | 873 | 882 | - | - | - | 882 | 882 | 882 | 891 | |
| Amino acids | 291 | 290 | 290 | 290 | 290 | 290 | 293 | - | - | - | 293 | 293 | 293 | 296 | |
| Gene7 | DNA | 3829 | 3779 | 3782 | 3805 | 3781 | 3775 | 3977 | 3298# | 1876# | - | 3814 | 3812 | 3962 | 3717 |
| Exons | 8 | 8 | 8 | 8 | 8 | 8 | 8 | 7# | 6# | - | 8 | 8 | 8 | 8 | |
| cDNA | 1092 | 1092 | 1092 | 1092 | 1092 | 1092 | 1092 | 1023# | 966# | - | 1092 | 1092 | 1092 | 1086 | |
| Amino acids | 363 | 363 | 363 | 363 | 363 | 363 | 363 | 340# | 321# | - | 363 | 363 | 363 | 361 | |
| Gene8 | DNA | 2988 | 2982 | 2974 | 2991 | 2993a | 2991 | 2976 | 2982 | 2993 | - | 2991 | 2991 | 3003 | 3003 |
| Exons | 5 | 5 | 5 | 5 | 5a | 5 | 5 | 5 | 6 | - | 5 | 5 | 5 | 5 | |
| cDNA | 2412 | 2421 | 2376 | 2415 | 2417a | 2415 | 2406 | 2421 | 2367 | - | 2415 | 2415 | 2427 | 2421 | |
| Amino acids | 803 | 806 | 791 | 804 | -a | 804 | 801 | 806 | 788 | - | 804 | 804 | 808 | 806 | |
| Gene9 | DNA | 1632 | 1638 | 1632 | 1626 | - | - | 1632 | 1413 | 1632 | - | 1629 | - | 1575 | X |
| Exons | 1 | 1 | 1 | 1 | - | - | 1 | 1 | 1 | - | 1 | - | 1 | X | |
| cDNA | 1632 | 1638 | 1632 | 1626 | - | - | 1632 | 1413 | 1632 | - | 1629 | - | 1575 | X | |
| Amino acids | 543 | 545 | 543 | 541 | - | - | 543 | 470 | 543 | - | 542 | - | 524 | X | |
| Gene10 | DNA | X | X | 17423a | 22357a | - | - | X | X | 620a | 744a | 1057a | - | 1066 | X |
| Exons | X | X | 6a | 6a | - | - | X | X | 3a | 5a | 4a | - | 6 | X | |
| cDNA | X | X | 490a | 436a | - | - | X | X | 427a | 393a | 501a | - | 534 | X | |
| Amino acids | X | X | N/A a | N/A a | - | - | X | X | N/Aa | N/A a | 166a | - | 177 | X | |
| Gene11a | DNA | X | X | 4273 | 4059 | - | - | X | X | 4043 | 4060 | 4062 | - | 1566#/33480 | X |
| Exons | X | X | 7 | 6 | - | - | X | X | 6 | 6 | 6 | - | 4#/6 | X | |
| cDNA | X | X | 1029 | 1029 | - | - | X | X | 1029 | 1029 | 1029 | - | 645#/993 | X | |
| Amino acids | X | X | 342 | 342 | - | - | X | X | 342 | 342 | 342 | - | 214#/330 | X | |
| Gene11b | DNA | 3293 | 3832 | 3671 | 3678 | - | - | 9988 | 3271 | 5684 | 9146 | 3674 | - | 6477 | 4616 |
| Exons | 6 | 6 | 6 | 6 | - | - | 6 | 6 | 7 | 6 | 6 | - | 7 | 6 | |
| cDNA | 948 | 948 | 948 | 948 | - | - | 948 | 948 | 927 | 948 | 948 | - | 954 | 957 | |
| Amino acids | 315 | 315 | 315 | 315 | - | - | 315 | 315 | 308 | 315 | 315 | - | 317 | 318 | |
| Gene12 | DNA | - | 1836 | - | 1841 | - | - | 2053 | 1840 | 1877 | 1877/1881/18811 | 1841 | - | - | 1846 |
| Exons | - | 3 | - | 3 | - | - | 3 | 3 | 3 | 3/3/3 | 3 | - | - | 3 | |
| cDNA | - | 1536 | - | 1539 | - | - | 1146 | 1536 | 1539 | 1671/1611/1611 | 1539 | - | - | 1545 | |
| Amino acids | - | 511 | - | 512 | - | - | 381 | 511 | 512 | 556/536/536 | 512 | - | - | 514 | |
Notes: a pseudo gene # BAC border X deletion - out of BACs
1. Three genes of 12 were annotated in the BAC 135P16 of R570
The average sequence identities between the homologous haplotypes from two progenitors and Saccharum hybrids R570
| BAC ID | 15 N23 | 197G04 | 142 J21 | 135P16 | 253G12 | 53A11 | CIR9020/12E03 |
| Haplotype ID | I | III | V | VI | II | IV | VII |
| Genbank ID | FN431663.1 | FN431667.1 | FN431666.1 | FN431665.1 | FN431668.1 | FN431664.1 | FN431669.1/FN431661 |
| Length (bp) | 137851 | 141630 | 126547 | 142236 | 158483 | 81164 | 87631 + 84926 |
|
| 96.07 | 97.24 | 95.88 | 94.56 | 95.28 | 95.87 | 95.67 |
|
| 96.61 | 93.82 | 98.38 | 97.48 | 96.65 | 96.62 | 96.41 |
Fig. 2Schematic representation of verifying the homologous haplotype origin in Saccharum hybrid R570 based on homo(oe)ologous gene allele sequence comparison with two progenitor Saccharum species as references. Notes: Similar analyses as Garsmeur et al. (2011) [18] were performed for the schematic. Each gene allele is represented by a square. For each locus, the most divergent allele is marked in black and its theoretical divergence time (highest estimate observed in Myr) is indicated in italics. All alleles that fall into groups (of at least three) with all values lower than one-third of this maximum divergence time are marked by white squares. When the phylogenetic trees were not degenerate, the alleles (loci 6, 7, 8, and 11b) of the same branch (relating to the same internal node) were placed in vertical dotted boxes. But for five alleles of loci 7, the alleles from the same phylogenetic tree branch were placed in the transparent blue boxes due to they distribute separately in the figure. The white triangle in the black square for locus 11a indicates an insertion. The ‘x’ mark indicates absence of the gene
Fig. 3Ka/Ks for each gene from homolougs haplotype from S.officinarum (LA Purple) and S.spotaneum(AP85-441). Pairwise comparisons with Duncan’s test gave P values of *, P ≤ 0.05; **,P ≤ 0.01; ***, P ≤ 0.001
Fig. 5Tajima’D test for bru1 genomic regions from S. spontaneum (a) and S. officinarum (b). Notes: Homologous haplotype sequences in S. spontaneum include Sh15N23, Sh197G04, Ss-75O04 and Ss-23 K06. Homologous haplotype sequences in S. officinarum contain So-96B11, So-146H19, So-57E04, So-99P01, Sh53A11, Sh135P16, Sh253G12 and Sh142J21
Fig. 4Distribution of transposable elements (TE) in the Bru1 surrounding regions of LA Purple (S.officinarum), AP85-441 (S. spontaneum), and the hybrid cultivar, R570. Notes: The TE names listed in the bottom row consist of four parts separated by an underscore” –“. The first part indicates the location, e.g., 01–02 indicates between gene 1 and gene 2 and 05–05 indicates within gene 5. The second part indicates full length (TE-f) or partial (TE-p). The third part indicates orientation (AS = Antisense, S = Sense). Colum 1 contains the clone identifiers: the species name, BAC name, and accession (if any) separated by underscore and the remaining columns score the presence (green) or absence (red) of each TEs listed at the bottom. A superscript “T” indicates a likely full length TE that was truncated due to its presence at the end of BAC and a superscript “S” indicates a solo LTR. The presence and absence of gene 10 (dark blue highlight in bottom row) and gene 11b (dark blue highlight) is also given for reference because these two genes are present in S. officinarum BAC sequences but absent in S. spontaneum BAC sequences. The blank cells in white indicate no data available