| Literature DB >> 30809240 |
Wei Xia1, Tingting Luo2, Wei Zhang2, Annaliese S Mason3, Dongyi Huang1, Xiaolong Huang1, Wenqi Tang1, Yajing Dou1, Chunyu Zhang2, Yong Xiao4.
Abstract
High-density single nucleotide polymorphisms (SNPs) are used as highly favored makers to analyze genetic diversity and population structure, to construct high-density genetic maps and provide genotypes for genome-wide association analysis. In order to develop genome-wide SNP markers in oil palm (Elaeis guineensis), single locus amplified fragment sequencing (SLAF-seq) technology was performed in a diversity panel of 200 oil palm individuals and 1,261,501 SNPs were identified with minor allele frequency > 0.05 and integrity > 1. Among them, only 17.81% can be mapped within the genic region and the remaining was located into the intergenic region. A positive correlation was detected between the distribution of SNP markers and retrotransposons [transposable elements (TEs)]. Population structure analysis showed that the 200 individuals of oil palm can be divided into five subgroups based on cross-validation errors. However, the subpopulations divided for the 200 oil palm individuals based on the SNP markers were not accurately related to their geographical origins and 80 oil palm individuals from Malaysia showed highest genetic diversity. In addition, the physical distance of linkage disequilibrium (LD) decay in the analyzed oil palm population was 14.516 kb when r2 = 0.1. The LD decay distances for different chromosomes varied from 3.324 (chromosome 15) to 19.983 kb (chromosome 7). Our research provides genome-wide SNPs for future targeted breeding in palm oil.Entities:
Keywords: Elaeis guineensis; SNP; genetic diversity; linkage disequilibrium; population structure
Year: 2019 PMID: 30809240 PMCID: PMC6380268 DOI: 10.3389/fpls.2019.00130
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
The distribution of SLAF (specific locus amplified fragment sequencing) tags produced across the oil palm (E. guineensis) genome.
| Chromosome | Chromosome length | SLAF number | Average SLAF distance | The number of polymorphic SLAF |
|---|---|---|---|---|
| Chr1 | 68,435,666 | 17,521 | 3906 | 11,048 |
| Chr2 | 65,558,141 | 17,080 | 3838 | 11,184 |
| Chr3 | 60,060,032 | 14979 | 4010 | 9785 |
| Chr4 | 57,251,647 | 13,691 | 4182 | 9055 |
| Chr5 | 51,955,539 | 13,228 | 3928 | 8574 |
| Chr6 | 44,357,766 | 10,302 | 4306 | 7252 |
| Chr7 | 43,454,766 | 10,697 | 4062 | 7218 |
| Chr8 | 40,195,399 | 9914 | 4054 | 6377 |
| Chr9 | 38,056,896 | 8693 | 4378 | 5598 |
| Chr10 | 31,890,735 | 8380 | 3806 | 5627 |
| Chr11 | 30,068,910 | 7361 | 4085 | 4870 |
| Chr12 | 28,800,575 | 7548 | 3816 | 5118 |
| Chr13 | 27,817,470 | 6989 | 3980 | 4650 |
| Chr14 | 24,379,743 | 6610 | 3688 | 4409 |
| Chr15 | 24,314,465 | 6105 | 3983 | 4249 |
| Chr16 | 21,371,083 | 5673 | 3767 | 3806 |
| other | 878,010,073 | 192,607 | 4559 | 140,637 |
FIGURE 1Distribution of SLAF (specific locus amplified fragment) tags across the chromosomes of the oil palm reference genome.
FIGURE 2Genomic distributions of genes, microsatellites, TEs, and SNPs across the 16 assembled chromosomes of oil palm, including chromosome 1–4 (A), chromosome 5–8 (B), chromosome 9–12 (C), and chromosome 13–16 (D).
FIGURE 3The number and distribution of SNP markers in the 1 to 10-kb flanking region of TEs in chromosomes 1–6 (A), chromosomes 7–12 (B), chromosomes 13–16 (C), and averaged across all chromosomes (D). Horizontal coordinates represent the flanking regions of TEs. Vertical coordinates represent the number of SNP markers.
FIGURE 4Population structure of 200 oil palm individuals from four different geographical locations. Cross-validation (CV) errors suggest that the 200 oil palm individuals were divided into four or five true genetic populations.
FIGURE 5Phylogenetic tree and population structures of 200 oil palm individuals. The maximum likelihood method was applied to assign every oil palm individuals to a cluster, and the cut-off probability was set to 0.6. Colored lines represent geographical origin. Pie chart represents genetic admixture individuals and their genetic composition.
Genetic diversity analysis of different populations of 200 oil palm individuals from different geographic origins, including showing observed number of allele numbers, expected number of alleles number, observed number of heterozygous number loci, expected number of heterozygous number loci, and polymorphism information content.
| Southern China | Malaysia | Costa Rica | Africa | |||||
|---|---|---|---|---|---|---|---|---|
| Index | Total | Average | Total | Average | Total | Average | Total | Average |
| Observed_allele_number | 2,470,168 | 1.96 | 2,518,412 | 2 | 2,518,560 | 2 | 2242580 | 1.79 |
| Expected_allele_number | 1,956,294 | 1.56 | 1,961,905 | 1.56 | 1,973,880 | 1.56 | 1874468 | 1.49 |
| Observed_heterozygous_number | 278,424 | 0.22 | 322,220 | 0.26 | 302,847 | 0.24 | 276989 | 0.22 |
| Expected_heterozygous_number | 409,813 | 0.32 | 414,824 | 0.33 | 421,856 | 0.33 | 359851 | 0.29 |
| Polymorphysm_information_content | 329,197 | 0.26 | 333,885 | 0.26 | 339,491 | 0.269 | 286991 | 0.23 |
FIGURE 6Linkage disequilibrium (LD) decay of the oil palm genome and across different chromosomes.
Distribution of haplotype blocks in the genome of Elaeis guineensis.
| Chromosome | Chromosome length (Mp) | Number of blocks | Total length of all blocks (Mp) | Frequency |
|---|---|---|---|---|
| Chr1 | 68.4 | 986 | 23.6 | 34.60% |
| Chr2 | 65.5 | 1048 | 23.3 | 35.60% |
| Chr3 | 60.1 | 872 | 21.2 | 35.40% |
| Chr4 | 57.2 | 755 | 19.2 | 33.60% |
| Chr5 | 51.9 | 688 | 18.5 | 35.70% |
| Chr6 | 44.3 | 577 | 15 | 33.90% |
| Chr7 | 43.4 | 655 | 14.7 | 34.00% |
| Chr8 | 40.1 | 543 | 13 | 32.40% |
| Chr9 | 38 | 433 | 12.6 | 33.20% |
| Chr10 | 31.8 | 522 | 9.9 | 31.10% |
| Chr11 | 30 | 413 | 9.4 | 31.30% |
| Chr12 | 28.8 | 413 | 9.9 | 34.50% |
| Chr13 | 27.8 | 402 | 8.9 | 32.30% |
| Chr14 | 24.3 | 359 | 7.3 | 30.00% |
| Chr15 | 24.3 | 340 | 8.9 | 36.60% |
| Chr16 | 21.3 | 280 | 7.7 | 36.50% |