| Literature DB >> 29025420 |
Berhanu Tadesse Ertiro1, Kassa Semagn2, Biswanath Das3, Michael Olsen3, Maryke Labuschagne4, Mosisa Worku3, Dagne Wegary5, Girum Azmach6, Veronica Ogugo3, Tolera Keno6, Beyene Abebe6, Temesgen Chibsa6, Abebe Menkir7.
Abstract
BACKGROUND: Molecular characterization is important for efficient utilization of germplasm and development of improved varieties. In the present study, we investigated the genetic purity, relatedness and population structure of 265 maize inbred lines from the Ethiopian Institute of Agricultural Research (EIAR), the International Maize and Wheat Improvement Centre (CIMMYT) and the International Institute of Tropical Agriculture (IITA) using 220,878 single nucleotide polymorphic (SNP) markers obtained using genotyping by sequencing (GBS).Entities:
Keywords: Distance; GBS; Genetic purity; Heterogeneity; Heterotic grouping; Kinship; Population structure
Mesh:
Substances:
Year: 2017 PMID: 29025420 PMCID: PMC5639748 DOI: 10.1186/s12864-017-4173-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
The chromosomal distribution and proportion of polymorphic markers used for computing heterogeneity, genetic distance, relative kinship, and principal coordinate analyses (dataset 1), and population structure and DAPC (dataset 2)
| Chromosome | Dataset 1 | Dataset 2 | ||
|---|---|---|---|---|
| No. of markers | Proportion | No. of markers | Proportion | |
| 1 | 35,002 | 16% | 3391 | 16% |
| 2 | 26,547 | 12% | 2539 | 12% |
| 3 | 25,311 | 11% | 2347 | 11% |
| 4 | 20,748 | 9% | 2353 | 11% |
| 5 | 26,051 | 12% | 2358 | 11% |
| 6 | 17,822 | 8% | 1696 | 8% |
| 7 | 18,630 | 8% | 1823 | 8% |
| 8 | 18,763 | 8% | 1941 | 9% |
| 9 | 16,680 | 8% | 1658 | 8% |
| 10 | 15,324 | 7% | 1539 | 7% |
| Total | 220,878 | 100% | 21,645 | 100 |
Fig. 1Summary of the heterogeneity of 265 inbred lines based on 220,878 polymorphic SNPs. The number of inbred lines are shown in brackets in the legend
Fig. 2Summary of (a) genetic distances and (b) kinship coefficients between pair of inbred lines from different sources based on 220,878 polymorphic SNPs. The number of inbred lines are shown in brackets in the legend
Fig. 3Plot of PC1 (7.1%) and PC2 (4.5%) from principal coordinate (PC) analysis of 265 inbred lines based on genetic distance matrix calculated from 220,878 SNPs. The plot was made using predicted group membership from STRUCTURE (group 1 = blue; group 2 = green; group 3 = red; mixed group = M, black). See Additional file 1 for details on membership of each group
Fig. 4Population structure of 265 inbred lines based on 21,645 SNPs using Discriminant Analysis of Principal Components (DAPC) and the model-based STRUCTURE: a Bayesian Information Criterion (BIC) values showing number of possible clusters ranging from 1 to 12; b Plot of PC1 and PC2 from DAPC showing the three groups; (c) groups predicted based on STRUCTURE at K = 3 (group 1 = G1; group 2 = G2, and group 3 = G3). In STRUCTURE, each inbred line is represented by a thin vertical bar, which is partitioned into three colored segments (blue = G1, green = G2 and red = G3) on the x-axis, with lengths proportional to the estimated probability membership on the y-axis
Fig. 5Plot of PC1 (7.1%) and PC2 (4.5%) from principal coordinate (PC) analysis based on genetic distance matrix calculated from 220,878 SNPs. Only inbred lines that belongs to heterotic group A and B are shown in pink and green font, respectively. Note that inbred lines in heterotic group B are primarily divided in two subgroups, while those in heterotic group A belong to a single group (see Additional file 1 for details)