| Literature DB >> 28125645 |
Arthur T O Melo1, Robert S Guthrie2, Iago Hale1.
Abstract
Plant germplasm collections can be invaluable resources to plant breeders, provided they are well-characterized. After 140 years of acquisition and curation efforts by a wide and largely non-coordinated array of private and institutional actors, the current US collection of cold-hardy kiwifruit (Actinidia spp.) is rife with misclassifications, misnomers, and mix-ups. To facilitate the systematic improvement and resource-efficient curation of these species of long-recognized horticultural potential, we used genotyping-by-sequencing (GBS) data to deconvolute this historic collection. Evaluation of a total of 138 accessions (103 A. arguta, 28 A. kolomikta, and 7 A. polygama) with an interspecific set of 1,040 high-quality SNPs resulted in clear resolution of the three species. Intraspecific analysis (2,964 SNPs) within A. arguta revealed a significant level of redundancy (41.7%; only 60 unique genotypes out of 103 analyzed) and a sub-population structure reflecting likely geographic provenance, phenotypic classes, and hybrid pedigree. For A. kolomikta (3,425 SNPs), the level of accession redundancy was even higher (53.6%; 13 unique genotypes out of 28 analyzed); but no sub-structure was detected. Numerous instances were discovered of distinct genotypes sharing a common name, different names assigned to the same genotype, mistaken species assignments, and incorrect gender records, all critical information for both breeders and curators. In terms of method, this study demonstrates the practical and cost-effective use of GBS data to characterize plant genetic resources, despite ploidy differences and the lack of reference genomes. With the recent prohibition on further imports of Actinidia plant material into the country and with the active eradication of historic vines looming, this analysis of the US cold-hardy kiwifruit germplasm collection provides a timely assessment of the genetic resource base of an emerging, high-value specialty crop.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28125645 PMCID: PMC5268759 DOI: 10.1371/journal.pone.0170580
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary data characterizing the sets of SNPs used for all three species together (interspecific analysis) and for each species separately (intraspecific analyses).
| Species | N | PE reads | SNPs | D | D > 20 | Hetero | Homo | NA |
|---|---|---|---|---|---|---|---|---|
| 138 | 626,806,096 | 1,040 | 108 | 100 | 17.07 | 68.92 | 14.01 | |
| 103 | 476,836,814 | 2,964 | 44.04 | 75.12 | 33.86 | 58.00 | 8.12 | |
| 28 | 130,774,076 | 3,425 | 33.22 | 57.66 | 30.14 | 64.33 | 5.51 | |
| 7 | 19,195,206 | 2,037 | 47.57 | 74.57 | 37.53 | 56.59 | 5.87 | |
| Average | — | 208,935,365 | 2,808 | 41.61 | 69.11 | 33.84 | 59.64 | 6.50 |
a Number of genotypes (i.e. accessions) sampled
b Number of high quality, paired-end (PE) reads used to call SNPs
c Number of SNPs called after imposing all genotyping criteria and subsequent filters
d Average read depth (i.e. average number of independent supporting GBS fragments) for each called SNP
e Percentage of called SNPs with an average read depth of at least 20
f Percentage of heterozygous genotype calls
g Percentage of homozygous genotype calls
h Percentage of missing cells (i.e. no genotype assigned for a given SNP-accession combination)
The mean Gower dissimilarity coefficients (GD's) and 99% confidence thresholds (GD99%, in parentheses) generated via three different strategies to assess and declare genotypic redundancy.
| Species | Strategy | ||
|---|---|---|---|
| Biological | Technical | Read | |
| 0.0004 (0.0024) | 0.0029 (0.0046) | 0.0007 (0.0017) | |
| 0.0009 (0.0036) | 0.0060 (0.0112) | 0.0017 (0.0029) | |
| -- | -- | 0.0029 (0.0045) | |
Fig 1Interspecific UPGMA cladogram showing the genetic relationships, based upon a modified Gower’s dissimilarity coefficient (GD), among the three Actinidia species evaluated.
Population parameters characterizing the genetic diversity among and within the three collections of Actinidia species in this study.
| Species | MAF | NE | HO | HE | GD | FIS |
|---|---|---|---|---|---|---|
| 90.14 | 1.001 | 0.109 | 0.079 | 0.185 | -0.283 | |
| 88.87 | 1.460 | 0.369 | 0.293 | 0.169 | -0.193 | |
| 87.34 | 1.487 | 0.320 | 0.307 | 0.210 | -0.042 | |
| 100.00 | 1.680 | 0.400 | 0.424 | 0.295 | -0.031 | |
| Average | 92.07 | 1.542 | 0.363 | 0.341 | 0.224 | -0.089 |
a Percentage of loci with a minor allele frequency greater than 10%
b Average number of effective alleles per locus
c Observed heterozygosity
d Unbiased expected heterozygosity
e Mean modified Gower dissimilarity coefficient
f Inbreeding coefficient
Fig 2UPGMA cladogram for A. arguta sub-group G1.
The red dashed line indicates the 99% confidence Gower dissimilarity threshold (GD = 0.0046) used to declare redundant accessions within this species. Within this sub-group alone, 12 redundant groups of genotypes were identified. Cladogram labels consist of an accession name followed by either its six-digit USDA plant introduction (PI) number, if part of the NPGS, or the initials of its non-USDA source (see S1 Table). Accessions in bold are the most read abundant genotypes within their respective redundant groups and are used to represent their groups on the full A. arguta intraspecific cladogram (S8 Fig). All accessions are tetraploid A. arguta, unless otherwise noted: hyb = putative unspecified interspecific hybrid with A. arguta; ap = putative A. arguta var. purpurea.
Fig 3UPGMA cladogram for A. arguta sub-groups G2 and G3.
The red dashed line indicates the 99% confidence Gower dissimilarity threshold (GD = 0.0046) used to declare redundant accessions within this species. No accession redundancy was found in G2, but 5 redundant groups of genotypes were identified within sub-group G3. Cladogram labels consist of an accession name followed by either its six-digit USDA plant introduction (PI) number, if part of the NPGS, or the initials of its non-USDA source (see S1 Table). Accessions in bold are the most read abundant genotypes within their respective redundant groups and are used to represent their groups on the full A. arguta intraspecific cladogram (S8 Fig). All accessions are tetraploid A. arguta, unless otherwise noted: [Ch] = Chinese provenance; [6x] = hexaploid; hyb = putative unspecified interspecific hybrid with A. arguta; ac = putative A. arguta var. cordifolia; ap = putative A. arguta var. purpurea; a×m = putative A. arguta × A. melanandra hybrid.
Results of the AMOVA-based partitioning of the variance in allele frequencies within the collection of 60 non-redundant A. arguta genotypes.
| Source of variation | df | SS | MS | p-value | Var | Var % |
|---|---|---|---|---|---|---|
| Among Groups | 2 | 1,890.4 | 945.2 | 0.041 | 8.2 | 1.4 |
| Among Genotypes | 57 | 36,564.2 | 641.5 | 0.001 | 69.8 | 12.0 |
| Within Genotypes | 60 | 68,568.1 | 501.9 | 0.001 | 501.9 | 86.5 |
Fig 4The A. kolomikta intraspecific UPGMA cladogram.
The red dashed line indicates the 99% confidence Gower dissimilarity threshold (GD = 0.0112) used to declare redundant accessions within this collection. Nine groups of redundant accessions were found. Cladogram labels consist of an accession name followed by either its six-digit USDA plant introduction (PI) number, if part of the NPGS, or the initials of its non-USDA source (see S1 Table). Accessions in bold are the most read abundant genotypes within their respective redundant groups and are used to represent their groups on the bi-plot PCA analysis (S9 Fig).