| Literature DB >> 24280451 |
In-Seon Jeong1, Ung-Han Yoon, Gang-Seob Lee, Hyeon-So Ji, Hyun-Ju Lee, Chang-Deok Han, Jang-Ho Hahn, Gynheung An, Tae-Ho Kim.
Abstract
BACKGROUND: Anther culture has advantage to obtain a homozygous progeny by induced doubling of haploid chromosomes and to improve selection efficiency for invaluable agronomical traits. Therefore, anther culturing is widely utilized to breed new varieties and to induce genetic variations in several crops including rice. Genome sequencing technologies allow the detection of a massive number of DNA polymorphism such as SNPs and Indels between closely related cultivars. These DNA polymorphisms permit the rapid identification of genetic diversity among cultivars and genomic locations of heritable traits. To estimate sequence diversity derived from anther culturing, we performed whole-genome resequencing of five Korean rice accessions, including three anther culture lines (BLB, HY-04 and HY-08), their progenitor cultivar (Hwayeong), and an additional japonica cultivar (Dongjin).Entities:
Year: 2013 PMID: 24280451 PMCID: PMC4883692 DOI: 10.1186/1939-8433-6-6
Source DB: PubMed Journal: Rice (N Y) ISSN: 1939-8425 Impact factor: 4.783
Reference assembly of each accession onto genome
| # of readsa | Mapped reads | Depth | All mapped nucleotide (bp) | Coverage (%) | 3+ mapped nucleotideb(bp) | Coverage (%) | ||
|---|---|---|---|---|---|---|---|---|
| (#) | (%) | |||||||
| Dongjin | 220,275,464 | 197,217,131 | 89.53 | 53.19 | 365,559,219 | 95.5 | 363,351,121 | 94.92 |
| Sub total | 220,275,464 | 197,217,131 | 89.53 | 53.19 | 365,559,219 | 95.5 | 363,351,121 | 94.92 |
| Hwayeong | 233,203,421 | 206,036,766 | 88.35 | 54.36 | 364,116,034 | 95.12 | 361,752,130 | 94.5 |
| BLB | 236,387,205 | 208,936,303 | 88.39 | 55.13 | 364,684,871 | 95.27 | 362,481,239 | 94.7 |
| HY-04 | 237,106,362 | 207,858,683 | 87.66 | 54.85 | 363,086,064 | 94.85 | 360,729,934 | 94.24 |
| HY-08 | 238,445,614 | 210,468,066 | 88.27 | 55.53 | 363,324,672 | 94.92 | 360,872,314 | 94.27 |
| Sub total | 945,142,602 | 833,299,818 | 352.67 | 219.87 | 1,455,211,641 | 380.16 | 1,445,835,617 | 377.71 |
| Sub average | 236,285,651 | 208,324,955 | 88.17 | 54.97 | 363,802,910 | 95.04 | 361,458,904 | 94.43 |
| Total | 1,165,418,066 | 1,030,516,949 | 442.20 | 273.06 | 1,820,770,860 | 475.66 | 1,809,186,738 | 472.63 |
| Average | 233,083,613 | 206,103,390 | 88.44 | 54.61 | 364,154,172 | 95.13 | 361,837,348 | 94.53 |
a: the number of reads which were generated by high-throughput sequencer.
b: base pair of nucleotide which mapped over 3 reads on one site.
Figure 1Distribution of SNP types. (a) Homozygous and heterozygous SNPs from each accession. Homozygous SNPs accounted for approximately 87% of the total potential SNPs. (b) The ratio of SNP types. Substitutions, insertions and deletions were 88.7%, 4.5%, and 6.6%, respectively, among DNA polymorphisms.
The number of SNPs on individual chromosomes detected between each accession and
| Dongjin | Hwayeong | HY-04 | HY-08 | BLB | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| # of SNPs | SNP/100 kb | # of SNPs | SNP/100 kb | # of SNPs | SNP/100 kb | # of SNPs | SNP/100 kb | # of SNPs | SNP/100 kb | |
| Chromosome 1 | 4,126 | (9.16) | 5,318 | (11.81) | 30,700 | (68.16) | 23,902 | (53.07) | 10,242 | (22.74) |
| Chromosome 2 | 4,805 | (13.06) | 7,143 | (19.41) | 9,399 | (25.55) | 7,875 | (21.40) | 9,369 | (25.46) |
| Chromosome 3 | 1,990 | (5.33) | 4,318 | (11.57) | 12,532 | (33.59) | 12,688 | (34.00) | 8,152 | (21.85) |
| Chromosome 4 | 14,197 | (39.37) | 14,637 | (40.59) | 14,110 | (39.13) | 14,505 | (40.22) | 11,252 | (31.20) |
| Chromosome 5 | 9,615 | (31.97) | 4,943 | (16.44) | 3,782 | (12.58) | 3,782 | (12.58) | 5,041 | (16.76) |
| Chromosome 6 | 7,441 | (23.16) | 4,263 | (13.27) | 7,881 | (24.53) | 3,880 | (12.08) | 7,158 | (22.28) |
| Chromosome 7 | 18,667 | (61.49) | 18,025 | (59.38) | 20,982 | (69.12) | 18,548 | (61.10) | 12,143 | (40.00) |
| Chromosome 8 | 9,663 | (33.87) | 70,295 | (246.39) | 69,329 | (243.00) | 71,301 | (249.92) | 66,592 | (233.41) |
| Chromosome 9 | 3,093 | (12.94) | 5,864 | (24.54) | 14,166 | (59.28) | 14,583 | (61.03) | 5,743 | (24.03) |
| Chromosome 10 | 9,799 | (41.34) | 8,752 | (36.92) | 22,221 | (93.75) | 8,704 | (36.72) | 12,149 | (51.25) |
| Chromosome 11 | 39,285 | (125.83) | 63,387 | (203.04) | 62,336 | (199.67) | 63,652 | (203.88) | 59,631 | (191.00) |
| Chromosome 12 | 48,280 | (174.43) | 15,917 | (57.51) | 16,092 | (58.14) | 16,572 | (59.87) | 9,246 | (33.40) |
| total | 170,961 | (47.66)a | 222,862 | (61.74) | 283,530 | (77.21) | 259,992 | (70.49) | 216,718 | (59.45) |
a: average DNA polymorphism per 100 kb.
Figure 2Classification of SNPs based on locations. Based on the annotation of IRGSP, SNPs and InDels, SNPs were classified as genic or intergenic. Depending on intra-genic locations, ‘genic’ was further separated into CDS, intron, and UTRs. The number and ratio of SNPs in each class is shown. Unknown SNPs were located in the coding region but were not annotated in IRGSP. (a) Dongjin is an elite cultivar in Korea. (b) Hwayeong is a mother cultivar of HY-04, HY-08, and BLB. (c), (d), and (e) were derived from Hwayeong via anther culturing.
Distribution of SNPs within genic regions
| Cultivar | Gene | Total SNP in gene | Non-synonymous | Synonymous | Non-synonymous SNP/synonymous SNP | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| (/Total gene #) | Count (/total SNP) | SNP/gene | Count | NS/gene | Count | SY/gene | ||||
| Dongjin | 4,713 | 11.2% | 31,166 | 18.2% | 6.61a | 4,356 | 0.92b | 3,802 | 0.81c | 1.14 |
| Sub total | 4,713 | 11.2% | 31,166 | 18.2% | 6.61 | 4,356 | 0.92 | 3,802 | 0.81 | 1.14 |
| Sub average | 4,713 | 11.2% | 31,166 | 18.2% | 6.61 | 4,356 | 0.92 | 3,802 | 0.81 | 1.14 |
| Hwayeong | 5,471 | 13.0% | 39,029 | 17.5% | 7.13 | 5,222 | 0.95 | 4,382 | 0.80 | 1.19 |
| HY-04 | 7,507 | 17.8% | 55,668 | 19.6% | 7.42 | 7,638 | 1.02 | 6,368 | 0.85 | 1.19 |
| HY-08 | 6,558 | 15.6% | 47,722 | 18.4% | 7.28 | 6,448 | 0.98 | 5,323 | 0.81 | 1.12 |
| BLB | 5,764 | 13.7% | 41,214 | 19.0% | 7.15 | 5,605 | 0.97 | 4,773 | 0.83 | 1.17 |
| Sub total | 25,300 | 60.1% | 183,633 | 18.7% | 7.26 | 24,913 | 0.98 | 20,846 | 0.82 | 4.67 |
| Sub average | 6,325 | 15.0% | 45,908 | 18.7% | 7.26 | 6,228 | 0.98 | 5,212 | 0.82 | 1.16 |
| Total | 30,013 | 71.3% | 214,799 | 18.6% | 7.16 | 29,269 | 0.98 | 24,648 | 0.82 | 5.81 |
| Average | 6,003 | 14.3% | 42,960 | 18.6% | 7.16 | 5,854 | 0.98 | 4,930 | 0.82 | 1.16 |
Total gene count (build 5): 42,088.
a: SNP/Gene : Total SNP frequency in each gene (including CDS, intron, UTR region), 31,166/4,713=6.61.
b: NS/Gene : non-synonymous SNP frequency in each gene, 4,356/4,713=0.92.
c: SY/Gene : Synonymous SNP frequency in each gene, 3,802/4,713=0.81.
Figure 3Comparisons between novel SNPs and dbSNP. By comparing with NCBI’s dbSNP, SNPs were classified into novel and common. Novel SNPs were more abundant than common ones. In the graph, the x-coordinate and y-coordinate represent each chromosome and the number of SNPs, respectively. (a) – (e) are the same as described in Figure 2.
Line-specific SNPs not reported in dbSNP
| Region/type | Hwayeong | BLB | HY-04 | HY-08 | |||||
|---|---|---|---|---|---|---|---|---|---|
| (#) | (%) | (#) | (%) | (#) | (%) | (#) | (%) | ||
| Location | Total lsSNP | 2,599 | 2,160 | 9,602 | 5,685 | ||||
| Intergenic | 2,300 | (88.5) | 1,821 | (84.3) | 7,972 | (83.0) | 4,990 | (87.8) | |
| Intron | 175 | (6.7) | 172 | (8.0) | 887 | (9.2) | 433 | (7.6) | |
| 5′ UTR | 26 | (1.0) | 31 | (1.4) | 139 | (1.4) | 68 | (1.2) | |
| CDS | 48 | (1.8) | 73 | (3.4) | 346 | (3.6) | 96 | (1.7) | |
| Non-synonymous | 26 | (1.0) | 44 | (2.0) | 214 | (2.2) | 63 | (1.1) | |
| Synonymous | 22 | (0.8) | 29 | (1.3) | 132 | (1.4) | 33 | (0.6) | |
| 3′ UTR | 43 | (1.7) | 49 | (2.3) | 229 | (2.4) | 85 | (1.5) | |
| Exon | 7 | (0.3) | 14 | (0.6) | 29 | (0.3) | 13 | (0.2) | |
| Type | Deletion | 209 | (8.0) | 267 | (12.4) | 736 | (7.7) | 590 | (10.4) |
| Insertion | 288 | (11.1) | 189 | (8.8) | 1,111 | (11.5) | 840 | (14.8) | |
| Substitution | 2,102 | (80.9) | 1,704 | (78.8) | 7,754 | (80.8) | 4,255 | (74.8) | |
Figure 4Line-specific SNPs and common SNPs. SNPs were classified as specific to Hwayeong and each of its progeny lines and SNPs common to them all. Line-specific SNPs and common SNPs were further classified into two groups. One group consisted of novel SNPs, which were not reported in the dbSNP (no dbSNP), and the others were listed in the dbSNP. In the graph, the x-coordinate and y-coordinate represent each chromosome and the number of SNPs, respectively.
Figure 5Functional analysis of genes carrying non-synonymous SNPs. Genes that contained one or more non-synonymous SNPs were separated into functional categories to obtain relationships between the gene’s function and potential SNPs by Gene Ontology.
Statistics of dbSNP
| chr | Number of rsfSNP | Build4.0 genome sequence | Build5.0 genome sequence | ||
|---|---|---|---|---|---|
| Number of mapping rsfSNP | Rate of mapping rsfSNP | Number of mapping rsfSNP | Rate of mapping rsfSNP | ||
| Total | 4,521,605 | 3,985,629 | 0.881463 | 3,985,423 | 0.881418 |
| (5,418,373) | (4,481,743) | (0.827138) | (4,481,537) | (0.827100) | |
| 1 | 547,062 (12%) | 481,321 (12%) | 0.879829 | 481,344 (12%) | 0.879871 |
| 2 | 463,395 (10%) | 408,263 (10%) | 0.881026 | 408,250 (10%) | 0.880998 |
| 3 | 449,034 (10%) | 402,287 (10%) | 0.895894 | 402,289 (10%) | 0.895899 |
| 4 | 367,590 (8%) | 323,843 (8%) | 0.880990 | 323,783 (8%) | 0.880826 |
| 5 | 354,080 (8%) | 314,864 (8%) | 0.889245 | 314,857 (8%) | 0.889226 |
| 6 | 379,978 (8%) | 336,607 (8%) | 0.885859 | 336,617 (8%) | 0.885885 |
| 7 | 361,174 (8%) | 315,166 (8%) | 0.872615 | 315,185 (8%) | 0.872668 |
| 8 | 355,927 (8%) | 314,842 (8%) | 0.884569 | 314,850 (8%) | 0.884592 |
| 9 | 302,749 (7%) | 268,656 (7%) | 0.887389 | 268,632 (7%) | 0.887309 |
| 10 | 286,270 (6%) | 250,494 (6%) | 0.875027 | 250,453 (6%) | 0.874884 |
| 11 | 332,510 (7%) | 288,612 (7%) | 0.867980 | 288,529 (7%) | 0.867730 |
| 12 | 321,836 (7%) | 280,674 (7%) | 0.872103 | 280,634 (7%) | 0.871978 |
| Multia | 697,672 | 472,089 | 0.676663 | 472,089 | 0.676663 |
| NotOnb | 199,096 | 24,025 | 0.12067 | 24,025 | 0.12067 |
a: reads which mapped in multiple regions.
b: reads which not mapped.
The number of rsSNP according to genome version
| Genome version | Number of mapping rsfSNP | Rate of mapping rsfSNP | Number of rsfSNP |
|---|---|---|---|
| Build 4.0 | 3,985,629 | 88.14% | 4,521,605 |
| Build 5.0 | 3,985,423 | 88.14% |