| Literature DB >> 26880119 |
Ernesto Borrayo1,2, Ryoko Machida-Hirano3, Masaru Takeya4, Makoto Kawase5, Kazuo Watanabe6.
Abstract
BACKGROUND: Core collections are important tools in genetic resources research and administration. At present, most core collection selection criteria are based on one of the following item characteristics: passport data, genetic markers, or morphological traits, which may lead to inadequate representations of variability in the complete collection. The development of a comprehensive methodology that includes as much element data as possible has been explored poorly. Using a collection of (Setaria italica sbsp. italica (L.) P. Beauv.) as a model, we developed a method for core collection construction based on genotype data and numerical representations of agromorphological traits, thereby improving the selection process.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26880119 PMCID: PMC4754896 DOI: 10.1186/s12863-016-0343-z
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Core Collection evaluation scores for different K selected elements
| Group A | Group B | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 6 | 12 | 24 | 48 | 6 | 12 | 24 | 48 | |||
| ANE |
| 0.7924 | 0.7451 | 0.6851 | 0.6159 | N/A | N/A | N/A | N/A | ||
|
| 0.7167 | 0.6478 | 0.574 | 0.4294 | 0.5283 | 0.4047 | 0.3218 | 0.2279 | |||
|
| 0.5212 | 0.3944 | 0.3262 | 0.2007 | 0.7145 | 0.6496 | 0.5692 | 0.4367 | |||
|
| 0.7338 | 0.6683 | 0.5725 | 0.4322 | 0.4978 | 0.4164 | 0.3126 | 0.2199 | |||
| ENE |
| 0.1911 | 0.2646 | 0.2574 | 0.2735 | N/A | N/A | N/A | N/A | ||
|
| 0.2463 | 0.2886 | 0.2961 | 0.3584 | 0.4925 | 0.5548 | 0.6139 | 0.7087 | |||
|
| 0.4204 | 0.5183 | 0.574 | 0.6379 | 0.2703 | 0.289 | 0.3065 | 0.3519 | |||
|
| 0.1355 | 0.2516 | 0.3109 | 0.3145 | 0.4761 | 0.5329 | 0.6265 | 0.6776 | |||
| E |
| 0.9113 | 0.8894 | 0.9059 | 0.9069 | N/A | N/A | N/A | N/A | ||
|
| 0.8851 | 0.888 | 0.8917 | 0.8879 | 0.7604 | 0.7767 | 0.7576 | 0.74 | |||
|
| 0.7415 | 0.7671 | 0.7593 | 0.7587 | 0.8905 | 0.893 | 0.8815 | 0.8818 | |||
|
| 0.9272 | 0.8957 | 0.894 | 0.8915 | 0.7603 | 0.7357 | 0.7395 | 0.7501 | |||
| MD |
| 16.5192 | 4.7198 | 2.6549 | 1.7699 | N/A | N/A | N/A | N/A | ||
|
| 18.3746 | 9.894 | 2.1201 | 0.3534 | 0 | 0 | 0 | 0 | |||
|
| 22.2615 | 13.7809 | 6.0071 | 1.4134 | 22.2615 | 13.0742 | 6.0071 | 1.4134 | |||
|
| 24.7588 | 12.8617 | 1.9293 | 1.2862 | 7.1429 | 0 | 0 | 0 | |||
| VD |
| 27.4336 | 36.8732 | 41.0029 | 46.3127 | N/A | N/A | N/A | N/A | ||
|
| 33.9223 | 45.2297 | 51.2367 | 53.3569 | 67.8571 | 67.8571 | 57.1429 | 50 | |||
|
| 31.8021 | 38.8693 | 45.9364 | 56.1837 | 30.742 | 37.4558 | 44.1696 | 55.477 | |||
|
| 35.6913 | 42.4437 | 53.6977 | 54.0193 | 50 | 53.5714 | 67.8571 | 67.8571 | |||
| CR |
| 29.7935 | 46.0177 | 57.8171 | 69.9115 | N/A | N/A | N/A | N/A | ||
|
| 37.1025 | 55.1237 | 68.9046 | 81.9788 | 71.4286 | 85.7143 | 89.2857 | 100 | |||
|
| 36.7491 | 47.7032 | 62.1908 | 77.7385 | 34.2756 | 45.9364 | 60.0707 | 77.0318 | |||
|
| 41.4791 | 54.0193 | 73.6334 | 81.672 | 71.4286 | 85.7143 | 96.4286 | 96.4286 | |||
| VR |
| 27.6275 | 41.3319 | 54.425 | 66.2917 | N/A | N/A | N/A | N/A | ||
|
| 32.6321 | 48.6972 | 63.4782 | 80.4787 | 76.7938 | 86.7248 | 91.9404 | 102.2757 | |||
|
| 34.9972 | 46.2934 | 58.241 | 75.3076 | 30.9728 | 43.7712 | 55.211 | 74.2049 | |||
|
| 38.7036 | 51.9165 | 70.2887 | 77.0397 | 78.3303 | 93.7485 | 96.6503 | 94.5884 | |||
| CA |
| 64.8968 | 73.0088 | 78.9086 | 84.9558 | N/A | N/A | N/A | N/A | ||
|
| 68.5512 | 77.5618 | 84.4523 | 90.9894 | 85.7143 | 92.8571 | 94.6429 | 100 | |||
|
| 68.3746 | 73.8516 | 81.0954 | 88.8693 | 67.1378 | 72.9682 | 80.0353 | 88.5159 | |||
|
| 70.7395 | 77.0096 | 86.8167 | 90.836 | 85.7143 | 92.8571 | 98.2143 | 98.2143 | |||
ANE, average distance between each original collection (MC) and nearest core collection (CC) sample; ENE, average distance between each CC sample and nearest CC sample; E, average distance between CC samples; MD, homogeneity test for means; VD, homogeneity test for variance; CR, coincidence rate; VR, variable rate; CA, coverage of allele. N/A, not possible to perform diferent-set comparison. With the exception of ANE and MD, higher values suggest better representation. Detailed description of the scoring system is provided in the text. Group A core collections where compared with their original collection dataset; contrarily, when possible, core collections in group B where compared to another equivalent original collection dataset
Fig. 1Principal component distributions for data I (blue), data II (black), and data III (red) in the first three (left) and two (right) principal components, respectively
Fig. 2Principal component distributions of data I (left), data II (center), and data III (right) in data I for the first two principal components
Fig. 3Principal component distributions of data I (left), data II (center), and data III (right) in data II for the first two principal components
Fig. 4Principal component distributions of data I (left), data II (center), and data III (right) in data III for the first two principal components
Fig. 5Distribution of the selected CCs (k = 12) from data I (solid circles-left), data II (solid triangles-center), and data III (solid squares-right) based on the dendrogram obtained using 141 foxtail millet individuals. The dashed lines represent groups of clusters
Fig. 6Geographical distribution of k = 12 CCs from data I (top), data II (center), and data III (bottom). The colored dots represent the geographical origin of each CC member and the crosses represent the geographical origin of each accession included in the analysis. Maps were generated with Diva-GIS 7.5 http://www.diva-gis.org/ based on GADM v.1.0 http://www.gadm.org/