| Literature DB >> 31092188 |
Mor Rubinstein1, Ravit Eshed1, Ada Rozen1, Tali Zviran1, David N Kuhn2, Vered Irihimovitch1, Amir Sherman1, Ron Ophir3.
Abstract
BACKGROUND: Discovering a genome-wide set of avocado (Persea americana Mill.) single nucleotide polymorphisms and characterizing the diversity of germplasm collection is a powerful tool for breeding. However, discovery is a costly process, due to loss of loci that are proven to be non-informative when genotyping the germplasm.Entities:
Keywords: Avocado; FST; Germplasm collection; Kinship; SNP
Mesh:
Substances:
Year: 2019 PMID: 31092188 PMCID: PMC6521498 DOI: 10.1186/s12864-019-5672-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Distribution of polymorphism in pools
| Number of polymorphic pools | Counts | Percentage of total |
|---|---|---|
| Three | 345,617 | 68.0% |
| Two | 117,692 | 23.2% |
| One | 44,552 | 8.8% |
| None | 56 | 0.0001% |
| Total | 507,917 | 100.0% |
Comparison of polymorphic status of loci in the sequenced pools to actual polymorphism in the collection
| Pool | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Guatemalan | Mexican | West Indian | ||||||||||
| Accessions | AA | AB | BB |
| AA | AB | BB |
| AA | AB | BB |
|
| AA | 1 | 13 | 1 | 1 | 2 | 14 | 0 | 12 | 1 | 9 | 0 | 1 |
| AB | 1 | 151 | 0 | 9 | 7 | 130 | 6 | 9 | 15 | 126 | 18 | 4 |
| BB | 0 | 1 | 0 | 3 | 0 | 1 | 6 | 3 | 0 | 3 | 1 | 3 |
|
| 0 | 11 | 0 | 0 | 0 | 11 | 0 | 1 | 1 | 9 | 1 | 0 |
| True positive percentage | 50% | 86% | 0% | 0% | 22% | 83% | 50% | 4% | 6% | 86% | 5% | 0% |
NA – missing data
Fig. 1Genetic relationship dendrogram and genetic structure of germplasm collection. Accession names on the dendrogram node-leaves are colored according to morphological characteristics distinct to the three races. Clustering of genetic groups was based on Ward’s method and the distance method of 1-proportion of shared alleles (PSA). Only bootstrap values > 650 are presented with a confidence of the bootstrap-value out of 1000. Genomic admixture was estimated by STRUCTURE program with three subpopulations (K = 3). The average result over 20 simulations was calculated using CLUMPAK, and plotted as a color coded bar plot
Classification of races by different methods
| Classifying method | # Mexican | # Guatemalan | # West Indian | Median of FST |
|---|---|---|---|---|
| A – Botanical characterization | 10 | 17 | 23 | 0.23 |
| B – PSA dendrogram | 14 | 16 | 20 | 0.36 |
| C - STRUCTURE (Q > 0.85) | 9 | 11 | 16 | 0.65 |
| A & B | 9 | 12 | 17 | 0.56 |
| A & C | 7 | 9 | 14 | 0.41 |
| B & C | 9 | 11 | 16 | 0.65 |
| A & B & C | 7 | 9 | 14 | 0.73 |
FST – fixation index
Fig. 2Principal component analysis (PCA) and the second-degree relatedness in avocado germplasm collection. PCA based on identity by state (IBS) of the core sets was performed using the SNPRelate R-package. The rest of the accessions were projected on the PCA space. Additionally, identity by descent (IBD) analysis were performed to detect the level of relatedness. Accessions related to each other with a kinship coefficient ≥ 0.25 are connected by edges