| Literature DB >> 31337341 |
Suzhen Niu1,2,3, Qinfei Song1, Hisashi Koiwa2, Dahe Qiao3, Degang Zhao4,5, Zhengwu Chen6, Xia Liu1, Xiaopeng Wen7,8.
Abstract
BACKGROUND: To efficiently protect and exploit germplasm resources for marker development and breeding purposes, we must accurately depict the features of the tea populations. This study focuses on the Camellia sinensis (C. sinensis) population and aims to (i) identify single nucleotide polymorphisms (SNPs) on the genome level, (ii) investigate the genetic diversity and population structure, and (iii) characterize the linkage disequilibrium (LD) pattern to facilitate next genome-wide association mapping and marker-assisted selection.Entities:
Keywords: Genetic diversity; Genotyping-by-sequencing; Guizhou plateau; Linkage disequilibrium; Origin center; Population structure; Tea plant
Mesh:
Year: 2019 PMID: 31337341 PMCID: PMC6652003 DOI: 10.1186/s12870-019-1917-5
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Percentage of transition and transversion SNPs identified using genotyping-by-sequencing
| Transitions | Transversions | |||||
|---|---|---|---|---|---|---|
| AG | CT | AT | AC | CG | GT | |
| Numbers of allelic sites | 31472 | 31490 | 4656 | 3960 | 3010 | 4024 |
| Percentage of allelic sites | 39.83% | 39.85% | 5.89% | 5.01% | 3.81% | 5.09% |
| Total (Percentage) | 62962(79.68%) | 15650(19.81%) | ||||
Genetic diversity parameters of 415 tea accessions in Guizhou Plateau
| Group | Number of tested tea accessions | PPL | GD | Ho | PIC | |||
|---|---|---|---|---|---|---|---|---|
| Cultivation Type | Wild Type | Total | ||||||
| Region | Ia | 106 | 62 | 168 | 0.302(0.007) | 0.262(0.003) | 0.273(0.004) | 0.218(0.002) |
| Ib | 42 | 9 | 51 | 0.286(0.009) | 0.249(0.004) | 0.235(0.004) | 0.207(0.002) | |
| Ic | 19 | 38 | 57 | 0.258(0.009) | 0.239(0.003) | 0.231(0.004) | 0.199(0.003) | |
| II | 57 | 26 | 83 | 0.276(0.007) | 0.250(0.003) | 0.237(0.004) | 0.208(0.002) | |
| III | 19 | 22 | 41 | 0.267(0.010) | 0.230(0.004) | 0.209(0.004) | 0.192(0.002) | |
| IV | 7 | 3 | 10 | 0.291(0.013) | 0.230(0.004) | 0.222(0.004) | 0.188(0.003) | |
| Growth habits | Cultivation Type | – | – | 255 | 0.298(0.004) | 0.253(0.003) | 0.259(0.004) | 0.210(0.002) |
| Wild Type | – | – | 160 | 0.264(0.006) | 0.225(0.003) | 0.229(0.004) | 0.190(0.002) | |
| Total | – | – | 415 | 2.000(0.000) | 0.257(0.003) | 0.247(0.004) | 0.214(0.002) | |
PPL The percentage of polymorphic loci, GD Genetic diversity, Ho Observed heterozygosity, PIC Polymorphism information content, Pure Wild Type, Admixed Wild Type and Pure Cultivation Type were groups based on STRUCTURE at K = 2 using 415 tea accessions
Ia Area with a good suitable climate for tea plant growth in North, Guizhou, Ib Area with a good suitable climate for tea plant growth in East, Guizhou, Ic Area with a good suitable climate for tea plant growth in South, Guizhou; II, Area with a suitable climate for tea plant growth in center, Guizhou; III, Area with a minor suitable climate for tea plant growth in West, Guizhou; IV, Area with an unsuitable climate for tea plant growth in West, Guizhou
Genetic differentiation of inferred populations of tea plants in Guizhou Plateau
| Group | S | PLL | GD | Ho | PIC | Fis |
|---|---|---|---|---|---|---|
| GP01 | 52 | 0.213b | 0.129c | 0.128c | 0.106c | 0.017( |
| GP02 | 100 | 0.286a | 0.248a | 0.276a | 0.208a | −0.107( |
| GP03 | 263 | 0.298a | 0.254a | 0.259b | 0.210a | −0.446( |
| GP03–1 | 198 | 0.298a | 0.253a | 0.260b | 0.209a | −0.026( |
| GP03–2 | 65 | 0.297a | 0.236b | 0.256b | 0.194b | −0.078( |
S Sample size, PLL The percentage of polymorphic loci, GD Genetic diversity, Ho Observed heterozygosity, PIC Polymorphism information content, Fis Inbreeding coefficient, p-value, the statistical significance of the Fis is compared to zero. The different letters indicate a significant difference in a column at p = 0.05 levels by T-test
Fig. 1The genetic clusters inferred using STRUCTURE. a Graphical method allowing the detection of the number of groups K using ∆K and LnP(K). ∆K and LnP(K) are shown in blue and red, respectively. b Inferred population structure of the collection using STRUCTURE software. Bar plot of individual ancestry proportions for the genetic clusters inferred using STRUCTURE (K = 2). Individual ancestry proportions (q values) are sorted within each cluster. Admixture model, independent frequencies, 30,000 burn-in iterations, and 100,000 Markov Chain Monte Carlo iterations were used for this analysis. Cultivation type and wild type ancestral populations are shown in red and blue, respectively
Fig. 2Principal component analysis (PCA) of 415 tea accessions. PCA using 1135 selected SNPs with no linkage disequilibrium in the set of 415 tea accessions. GP03 identified in STRUCTURE is shown in green, GP01 in red and GP02 in blue. First and second components (a) and first and third components (b) of the PCA analyses are shown
Fig. 3Cluster analysis based on genetic distance using an UPGMA tree. a UPGMA cluster tree compared with both growth habits, wild type (red) and cultivation type (green). b UPGMA cluster tree compared with STRUCTUER results (k = 2), Pure Wild Type (red), Pure Cultivation Type (green) and Admixed Wild Type (yellow). c UPGMA cluster tree compared with growthway, modern cultivation (red), ancient cultivation (green) and wild (yellow). d UPGMA cluster tree compared with classification results, C.tachangensis (red), C.sinensis (green), C.remotiserrata (yellow) and uncertain species (blue). e UPGMA cluster tree include 4 inferred groups, GP01 (red), GP02 (yellow), GP03–1 (green) and GP03–2 (purple)
Fig. 4Principal component analysis (PCA) of 415 tea accessions. PCA using 1135 selected SNPs with no linkage disequilibrium in the set of 415 tea accessions. The GP01 cluster identified in STRUCTURE is shown in red, The GP02 cluster in blue, GP03–1 in purple and GP03–2 in green. First and second components (a) and first and third components (b) of the PCA analyses are shown
Fig. 5Linkage disequilibrium decay for all scaffolds longer than 500 kb. a Scatter plot of LD decay (r2) against the genetic distance for pairs of linked SNP across all scaffolds longer than 500 kb. b Zoom-in scatter plot of LD decay (r2) against the genetic distance
Fst and pairwise genetic distance among four inferred populations of tea plant in Guizhou Plateau
| Group | GP01 | GP02 | GP03–1 | GP03–2 |
|---|---|---|---|---|
| GP01 | – | 0.408 | 0.576 | 0.599 |
| GP02 | 0.077c | – | 0.253 | 0.299 |
| GP03–1 | 0.155b | 0.064d | – | 0.173 |
| GP03–2 | 0.178a | 0.077c | 0.054e | – |
The bottom left is the value of Fst; The upper right is the value of pairwise genetic distance; The different letters indicate a significant difference in p = 0.05 levels by the T-test