| Literature DB >> 24732455 |
Bo Wu1, Guang-yan Zhong2, Jian-qiang Yue3, Run-ting Yang4, Chong Li4, Yue-jia Li5, Yun Zhong5, Xuan Wang4, Bo Jiang5, Ji-wu Zeng5, Li Zhang5, Shu-tang Yan4, Xue-jun Bei4, Dong-guo Zhou3.
Abstract
Pummelo cultivars are usually difficult to identify morphologically, especially when fruits are unavailable. The problem was addressed in this study with the use of two methods: high resolution melting analysis of SNPs and sequencing of DNA segments. In the first method, a set of 25 SNPs with high polymorphic information content were selected from SNPs predicted by analyzing ESTs and sequenced DNA segments. High resolution melting analysis was then used to genotype 260 accessions including 55 from Myanmar, and 178 different genotypes were thus identified. A total of 99 cultivars were assigned to 86 different genotypes since the known somatic mutants were identical to their original genotypes at the analyzed SNP loci. The Myanmar samples were genotypically different from each other and from all other samples, indicating they were derived from sexual propagation. Statistical analysis showed that the set of SNPs was powerful enough for identifying at least 1000 pummelo genotypes, though the discrimination power varied in different pummelo groups and populations. In the second method, 12 genomic DNA segments of 24 representative pummelo accessions were sequenced. Analysis of the sequences revealed the existence of a high haplotype polymorphism in pummelo, and statistical analysis showed that the segments could be used as genetic barcodes that should be informative enough to allow reliable identification of 1200 pummelo cultivars. The high level of haplotype diversity and an apparent population structure shown by DNA segments and by SNP genotypes, respectively, were discussed in relation to the origin and domestication of the pummelo species.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24732455 PMCID: PMC3986212 DOI: 10.1371/journal.pone.0094506
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
HRMA Primer sequences and statistics on the 25 Set1 SNPs.
| ID | SNP name | Accession No. | Forward primer | Reverse primer | Amplicon length (bp) | Ho | He | PIC | f |
| 1 | chr1_7539019A/G | GO241840 |
|
| 40 | 0.44 | 0.43 | 0.34 | 0.01 |
| 2 | chr1_20043485A/G | EY724955 |
|
| 50 | 0.27 | 0.29 | 0.25 | 0.07 |
| 3 | chr1_25991020A/T | EY722700 |
|
| 36 | 0.31 | 0.32 | 0.27 | 0.01 |
| 4 | chr2_31491T/C | GO241741 |
|
| 42 | 0.36 | 0.45 | 0.35 | 0.20 |
| 5 | chr2_16534248A/G | GO242464 |
|
| 42 | 0.43 | 0.46 | 0.35 | 0.08 |
| 6 | chr2_30594899T/C | EY661549 |
|
| 46 | 0.30 | 0.34 | 0.28 | 0.10 |
| 7 | chr2_30595627T/C | EY656665 |
|
| 61 | 0.25 | 0.30 | 0.25 | 0.16 |
| 8 | chr3_10462341T/C | EY699140 |
|
| 35 | 0.19 | 0.37 | 0.30 | 0.49 |
| 9 | chr3_25525170A/G | GO241356 |
|
| 45 | 0.13 | 0.16 | 0.15 | 0.18 |
| 10 | chr4_4533744A/G | GE213353 |
|
| 41 | 0.40 | 0.49 | 0.37 | 0.18 |
| 11 | chr4_14833122T/C | FC922642 |
|
| 44 | 0.18 | 0.17 | 0.16 | 0.04 |
| 12 | chr5_2097500T/G | FC922009 |
|
| 37 | 0.43 | 0.49 | 0.37 | 0.14 |
| 13 | chr5_12963514T/C | EY667938 |
|
| 51 | 0.38 | 0.36 | 0.29 | 0.05 |
| 14 | chr5_13684450T/C | EY732058 |
|
| 43 | 0.17 | 0.19 | 0.17 | 0.07 |
| 15 | chr5_15275826A/G | HS085998 |
|
| 47 | 0.42 | 0.45 | 0.35 | 0.07 |
| 16 | chr6_16520557A/G | HS086273 |
|
| 42 | 0.41 | 0.49 | 0.37 | 0.16 |
| 17 | chr7_7194644A/G | JZ120109 |
|
| 43 | 0.17 | 0.19 | 0.17 | 0.07 |
| 18 | chr7_7195033T/G | GO241763 |
|
| 39 | 0.17 | 0.22 | 0.20 | 0.21 |
| 19 | chr7_22205388T/C | HS088559 |
|
| 40 | 0.24 | 0.27 | 0.23 | 0.10 |
| 20 | chr7_31599886T/G | GO242250 |
|
| 42 | 0.10 | 0.11 | 0.10 | 0.14 |
| 21 | chr9_2379630T/G | HS088445 |
|
| 40 | 0.30 | 0.50 | 0.37 | 0.39 |
| 22 | chr9_13716551T/C | EY688408 |
|
| 43 | 0.10 | 0.12 | 0.11 | 0.13 |
| 23 | chr9_15596085A/G | JK694395 |
|
| 41 | 0.42 | 0.39 | 0.32 | 0.05 |
| 24 | chrUn_5023005A/G | HS086380 |
|
| 44 | 0.47 | 0.48 | 0.37 | 0.03 |
| 25 | chrUn_19904498A/G | GO241683 |
|
| 44 | 0.43 | 0.38 | 0.31 | 0.11 |
| Mean | - | - | - | - | - | 0.30 | 0.34 | 0.27 | 0.11 |
*The SNP name chr1_7539019A/G: an A/G SNP at the 7539019th bp on the pseudo-chromosome 1 of the sweet orange reference genome, and et cetera. Ho, observed heterozygosity; He, expected heterozygosity; PIC, polymorphic information content; f, fixation coefficient.
Figure 1Representative high resolution melting curves for pummelo SNPs.
The left and the right panels were melting curves and derivative melting curves respectively. Red and green curves were homozygous amplicons with low and high melting temperatures, respectively. Gray and blue curves represent heterozygotes. Top panels: melting curves of an A/G SNP (chr1_20043485A/G). Middle panels: melting curves of an A/T SNP (chr1_25991020A/T). Bottom panels: melting curves of an A/G SNP (chr4_18094735A/G); note the two different heterozygotes with different melting curves, the normal heterozygote (referred as He1) as shown in gray and the abnormal heterozygote (referred to as He2 that probably contained a primer-template mismatch) in blue.
Figure 2Distribution of the minimum number of SNPs differed between a genotype and other genotypes.
Figure 3The relationship between observed genotypes and numbers of used SNPs.
ALL, all the 178 genotypes; MYANMAR, the 55 Myanmar genotypes; CUL, the 86 cultivar genotypes. The 25 SNPs were ranked, from large to small, by their PIC values in ALL, MYANMAR and CUL, respectively, and then added one by one into analysis.
Figure 4Increases in –lg(PI) and –lg(PIsib) with the use of increasing number of SNPs in different populations or groups.
SNPs were added one by one in the order from 1 to 25 (listed in Table 1) in the construction of the plot. (chr2_30594899T/C, chr2_30595627T/C), (chr5_12963514T/C, chr5_13684450T/C, chr5_15275826A/G), and (chrUn_5023005A/G, chrUn_19904498A/G) were treated as three super loci (the 6th, 12th and 21st dots (large)). The solid and dash lines connect PIs and PIsibs, respectively.
Genotypes of the 24 sequenced accessions.
| Group | Gene ID | Cs1g16760 | Cs1g23450 | Cs2g19680 | Cs2g31250 | Cs4g07130 | Cs4g15590 | Cs5g15460 | Cs7g10980 | Cs7g31800 | Cs9g04300 | Cs9g14320 | Cs9g16170 |
| Genomic locations of sequenced segments | chr1:20042999–20043567 | chr1:25990710–25991189 | chr2:16533735:16534365 | chr2:30595507–30595981 | chr4:4533481–4533998 | chr4:14833018–14833473 | chr5:13684338–13684914 | chr7:7194703–7195183 | chr7:31599722–31600139 | chr9:2379414–2379892 | chr9:13716462–13717019 | chr9:15595778–15596191 | |
| 1 | Dayongjuhuaxin | (1,2)* | (2,5) | (1,1) | (5,5) | (3,5) | (2,2) | (4,7) | (5,10) | (1,1) | (4,5) | (1,2) | (1,2) |
| 1 | Fujianwendan | (1,2) | (5,5) | (2,4) | (2,4) | (3,3) | (1,2) | (4,6) | (7,9) | (1,1) | (1,2) | (1,6) | (3,4) |
| 1 | Guanximiyou | (2,2) | (5,5) | (1,1) | (4,5) | (1,2) | (1,2) | (4,6) | (6,6) | (1,3) | (1,2) | (1,1) | (4,4) |
| 1 | Liangpingyou_cq | (1,2) | (2,2) | (1,2) | (5,5) | (3,3) | (1,2) | (4,4) | (6,6) | (1,3) | (1,1) | (1,1) | (4,4) |
| 2 | Dongguaquan | (2,3) | (2,5) | (2,2) | (5,5) | (3,4) | (2,2) | (4,7) | (6,8) | (1,3) | (3,4) | (6,7) | (3,6) |
| 2 | Hejiangyou | (2,3) | (5,5) | (2,2) | (5,5) | (3,4) | (1,2) | (4,4) | (6,8) | (1,1) | (4,7) | (6,7) | (3,6) |
| 2 | Hongxinshatianyou | (2,2) | (2,5) | (2,4) | (5,5) | (3,4) | (1,1) | (7,7) | (8,10) | (1,1) | (4,7) | (5,5) | (3,4) |
| 2 | Lingnanshatianyou | (2,3) | (5,5) | (2,2) | (5,5) | (3,4) | (1,2) | (4,4) | (6,8) | (1,1) | (4,7) | (6,7) | (3,6) |
| 2 | Meiweishatianyou | (2,2) | (5,5) | (2,2) | (5,5) | (4,4) | (1,2) | (4,7) | (6,8) | (1,1) | (7,7) | (5,7) | (2,3) |
| 2 | Zhaipoyou | (2,3) | (5,5) | (1,2) | (5,5) | (2,3) | (2,2) | (4,7) | (6,8) | (1,1) | (1,7) | (1,6) | - |
| 3 | Zuoshiyou | (1,2) | (2,5) | (1,1) | (4,5) | (3,4) | (1,1) | (4,4) | (2,10) | (1,1) | (4,4) | (1,6) | (4,6) |
| 3 | Unknown | (1,2) | (2,5) | (1,2) | (4,5) | (1,2) | (1,1) | (4,5) | (6,10) | (1,1) | (1,2) | (1,9) | (2,4) |
| 3 | Dianjianghongxinyou | (1,2) | (2,2) | (1,2) | (5,5) | (1,4) | (1,1) | (7,7) | (4,10) | - | (4,4) | (1,6) | (6,6) |
| 3 | Duanshiyou | (2,3) | (5,5) | (1,2) | (5,5) | (1,4) | (1,1) | (4,4) | (6,6) | (1,1) | (4,4) | (4,9) | (3,4) |
| 3 | HB | (3,4) | (5,5) | (2,2) | (5,5) | (1,3) | (1,2) | (4,4) | (2,4) | (1,1) | (4,4) | - | - |
| 3 | Humiyou | (2,2) | (2,4) | (2,2) | (3,5) | (3,3) | (1,2) | (4,7) | (3,10) | (1,1) | (1,4) | (1,3) | (4,4) |
| 3 | Qiyou | (3,3) | (5,5) | (1,2) | (5,5) | (4,4) | (1,1) | (4,7) | (6,8) | (1,1) | (4,4) | (4,4) | (2,3) |
| 3 | Shuhuayou | (2,2) | (2,5) | (1,2) | (5,5) | (3,4) | (1,2) | (4,7) | (6,10) | (1,1) | (7,7) | (5,7) | (3,4) |
| 3 | Taiguomiyou | (2,3) | (2,3) | (1,1) | (4,5) | (3,4) | (2,2) | (4,4) | (2,10) | (1,1) | (4,4) | (1,7) | (2,3) |
| 3 | Xianluodisuanyou | (1,2) | (5,5) | (1,1) | (5,5) | (3,4) | (1,1) | (4,4) | (10,10) | (1,3) | (4,7) | (1,1) | (4,4) |
| 4 | Boluoxiangyou | (1,1) | (1,2) | (2,3) | (1,5) | (1,4) | (1,3) | (1,4) | (1,10) | (1,3) | (1,1) | (1,3) | (4,5) |
| 4 | Jiaodaoyou | (1,2) | (5,5) | (2,3) | (6,7) | (1,6) | (2,4) | (4,5) | (10,10) | (1,1) | (6,8) | (1,6) | - |
| 4 | Tangyouzi | (2,4) | (1,5) | (2,3) | (5,9) | (1,6) | (1,3) | (2,4) | (1,5) | (2,4) | (6,6) | (3,8) | - |
| 4 | Iwaikan | (1,3) | (5,5) | (2,3) | (5,8) | (3,6) | (1,3) | (2,3) | (1,6) | (2,4) | (4,4) | (3,3) | (5,5) |
Numbers in parenthesis represent the two haplotypes of a gene segment in a cultivar. Different numbers represent different haplotypes; -, data unavailable. Group 1, Wendan; group 2, Shatianyou; group 3, hybrids; group 4, unassigned accessions. Gene ID: Cs1g16760, 40S ribosomal protein SA; Cs1g23450, Oxygen-evolving enhancer protein 1; Cs2g19680, Chlorophyll a–b binding protein of LHCII type I; Cs2g31250, Serine–glyoxylate aminotransferase; Cs4g07130, Protein TIFY 10A; Cs4g15590, Putative uncharacterized protein Sb09g005910; Cs5g15460, Protein RecA; Cs7g10980, Agamous-like MADS-box protein AGL9 homolog; Cs7g31800, Ribulose bisphosphate carboxylase/oxygenase; Cs9g04300, Translationally controlled tumor protein like protein; Cs9g14320, DC 1.2-like protein; Cs9g16170, Putative uncharacterized protein At5g6220.
Nucleotide diversity and haplotype diversity in the 20 sequenced true-to-type pummelos.
| Gene Accession Number | Cs1g16760 | Cs1g23450 | Cs2g19680 | Cs2g31250 | Cs4g07130 | Cs4g15590 | Cs5g15460 | Cs7g10980 | Cs7g31800 | Cs9g04300 | Cs9g14320 | Cs9g16170 | Mean |
| Available sequences | 40 | 40 | 40 | 40 | 40 | 40 | 40 | 40 | 38 | 40 | 38 | 36 | 39.33 |
| Segment length (bp) | 569 | 499 | 631 | 475 | 518 | 456 | 577 | 473 | 454 | 479 | 558 | 410 | 508.25 |
| Number of SNPs | 2 | 3 | 2 | 5 | 5 | 3 | 3 | 6 | 1 | 6 | 10 | 8 | 4.5 |
| Number of Haplotypes | 4 | 4 | 3 | 4 | 5 | 2 | 4 | 9 | 2 | 6 | 8 | 5 | 4.7 |
| Haplotype diversity | 0.57 | 0.50 | 0.56 | 0.31 | 0.69 | 0.50 | 0.53 | 0.77 | 0.19 | 0.69 | 0.80 | 0.75 | 0.57 |
| Nucleotide diversity | 0.0012 | 0.0012 | 0.0010 | 0.0028 | 0.0032 | 0.0033 | 0.0010 | 0.0049 | 0.0004 | 0.0039 | 0.0061 | 0.0060 | 0.0029 |