| Literature DB >> 28125967 |
Hugo E Cuevas1, Giseiry Rosa-Valentin2, Chad M Hayes3, William L Rooney4, Leo Hoffmann4.
Abstract
BACKGROUND: The USDA Agriculture Research Service National Plant Germplasm System (NPGS) preserves the largest sorghum germplasm collection in the world, which includes 7,217 accessions from the center of diversity in Ethiopia. The characterization of this exotic germplasm at a genome-wide scale will improve conservation efforts and its utilization in research and breeding programs. Therefore, we phenotyped a representative core set of 374 Ethiopian accessions at two locations for agronomic traits and characterized the genomes.Entities:
Keywords: Exotic germplasm; Genome-wide association study; Genotyping-by-sequencing; Population structure; Sorghum bicolor
Mesh:
Year: 2017 PMID: 28125967 PMCID: PMC5270221 DOI: 10.1186/s12864-016-3475-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Distribution of sorghum races present in the Ethiopian collection from NPGS
| NPGSa (7,237 accessions) | Representative sample (374 accessions) | |||
|---|---|---|---|---|
| Race | Num. of accessions | % | Num. of accessionsb | % |
| Bicolor | 90 | 1.24 | 6 | 1.60 |
| Caudatum | 145 | 2.00 | 8 | 2.14 |
| Durra-Caudatum | 206 | 2.85 | 10 | 2.67 |
| Caudatum-bicolor | 78 | 1.08 | 5 | 1.34 |
| Guinea-Caudatum | 65 | 0.90 | 4 | 1.07 |
| Kaffir-Caudatum | 34 | 0.47 | 2 | 0.53 |
| Mixed | 389 | 5.38 | 24 | 6.42 |
| Kaffir-durra | 368 | 5.09 | 19 | 5.08 |
| Kaffir-bicolor | 24 | 0.33 | 2 | 0.53 |
| Kaffir | 47 | 0.65 | 2 | 0.53 |
| Guinea-Kaffir | 62 | 0.86 | 5 | 1.34 |
| Guinea-durra | 194 | 2.68 | 13 | 3.48 |
| Guinea-bicolor | 42 | 0.58 | 2 | 0.53 |
| Guinea | 18 | 0.25 | 1 | 0.27 |
| Durra-bicolor | 1336 | 18.47 | 80 | 21.39 |
| Durra | 2086 | 28.84 | 102 | 27.27 |
| Unknown | 2053 | 28.34 | 89 | 23.80 |
aNational Plant Germplasm System from United States
bNumber of randomly selected accession from each race group present in the NPGS Ethiopia collection
Fig. 1Distribution of identity by state (IBS) genetic distances amongst 374 NPGS Ethiopian accessions determined by the analysis of 27,306 unlinked (r 2 < 0.5) single nucleotides polymorphism
Fig. 2Population structure analysis of 374 NPGS Ethiopian sorghum accessions. a The optimal number of subpopulations was 11 based on the ADMIXTURE analysis with the cross validation for K values from 1 to 20 using 27,306 unlinked (r 2 < 0.5) SNPs distributed across the genome. b Hierarchical organization of genetic relatedness of the 374 Ethiopian accessions, each individual bar represents an accession
Pairwise estimates of Fst among the 11 NPGS Ethiopian populations based on the analysis of 27,306 single nucleotide polymorphism loci among 198 accessions
| Pop 1 | Pop 2 | Pop 3 | Pop 4 | Pop 5 | Pop 6 | Pop 7 | Pop 8 | Pop 9 | Pop 10 | |
|---|---|---|---|---|---|---|---|---|---|---|
| Pop 2 | 0.21 | |||||||||
| Pop 3 | 0.30 | 0.33 | ||||||||
| Pop 4 | 0.20 | 0.21 | 0.17 | |||||||
| Pop 5 | 0.33 | 0.34 | 0.39 | 0.32 | ||||||
| Pop 6 | 0.30 | 0.32 | 0.18 | 0.16 | 0.40 | |||||
| Pop 7 | 0.33 | 0.36 | 0.11 | 0.20 | 0.44 | 0.22 | ||||
| Pop 8 | 0.33 | 0.37 | 0.35 | 0.31 | 0.47 | 0.37 | 0.38 | |||
| Pop 9 | 0.33 | 0.34 | 0.35 | 0.30 | 0.20 | 0.36 | 0.39 | 0.42 | ||
| Pop 10 | 0.29 | 0.31 | 0.14 | 0.16 | 0.38 | 0.15 | 0.19 | 0.37 | 0.33 | |
| Pop 11 | 0.44 | 0.44 | 0.45 | 0.44 | 0.39 | 0.46 | 0.50 | 0.55 | 0.26 | 0.47 |
Fig. 3Neighbor joining tree of the 374 Ethiopian accessions where admixture accessions are not colored
Fig. 4Allele frequency distribution of 148,475 single nucleotide polymorphisms among 374 NPGS Ethiopian accessions
Fig. 5Neighbor-joining tree of the 198 representative NPGS Ethiopian accessions and 356 lines from the U.S. sorghum association panel (SAP) based on 26,026 unlinked (r2 < 0.5) SNPs. Colored labels represent the 11 populations present in NPGS Ethiopian germplasm
Fig. 6BAYESCAN result for the analysis of 27,306 unlinked SNPs distributed among 11 Ethiopian sorghum populations for outlier prediction. a The distribution of SNPs across the sorghum genome. Red dots represents Fst outliers. b BayeScan analysis for outlier prediction with a false discovery rate (FDR) of p < 0.001 (broken line)
Phenotypic traits for the 11 NPGS Ethiopian subpopulations
| Panicle | ||||||||
|---|---|---|---|---|---|---|---|---|
| References | Flowering1 | Plant height2 | Length3 | Diameter4 | Ratio5 | Seed Size6 | Dhurrin7 | G158 |
| RTx 2911 | 72 ± 17 | 105 ± 32 | 17.3 ± 5.8 | 2.9 ± 1.4 | 6.8 ± 3.1 | 4.0 ± 0.7 | 21.2 ± 8.0 |
|
| SC719 | 64 ± 08 | 111 ± 19 | 18.4 ± 4.4 | 2.7 ± 1.2 | 7.7 ± 2.8 | 5.2 ± 1.6 | 14.4 ± 0.9 |
|
| PI 610727 |
|
|
|
|
|
|
| 53.0 ± 6.0 |
| RTx436 |
|
|
|
|
|
|
| 00.0 ± 0.0 |
|
| ||||||||
| Population | *** | *** | *** | *** | *** | *** | *** |
|
| Location | *** | *** | ** |
|
| ** |
|
|
| Pop. x Loc. |
|
|
|
|
|
|
|
|
| R2 | 0.64 | 0.51 | 0.45 | 0.23 | 0.40 | 0.40 | 0.21 | 0.11 |
| C.V. | 10.01 | 22.13 | 29.23 | 41.41 | 37.42 | 22.08 | 82.51 | 92.01 |
|
| ||||||||
| Population 1 | 73 ± 09 d | 138 ± 25 e | 19.6 ± 2.4 cde | 4.2 ± 1.0 cd | 5.3 ± 1.7 bc | 3.8 ± 1.1 bc | 19.8 ± 30.8 bcd | 18.2 ± 35.9 |
| Population 2 | 65 ± 07 e | 159 ± 35 de | 18.2 ± 4.8 def | 4.4 ± 1.5 cd | 4.2 ± 1.6 cd | 4.4 ± 0.6 ab | 37.1 ± 23.0 a | 40.9 ± 30.0 |
| Population 3 | 79 ± 04 b | 203 ± 28 b | 20.9 ± 6.1 cd | 4.7 ± 1.1 cd | 4.6 ± 1.3 bc | 4.6 ± 0.9 ab | 11.4 ± 18.3 d | 27.7 ± 39.5 |
| Population 4 | 65 ± 06 e | 145 ± 17 de | 11.3 ± 2.7 g | 4.1 ± 1.2 d | 3.0 ± 0.8 def | 4.2 ± 0.7 ab | 17.3 ± 05.1 bcd | 50.6 ± 37.3 |
| Population 5 | 77 ± 01 cd | 265 ± 37 a | 35.4 ± 5.9 a | 7.5 ± 2.0 a | 5.0 ± 1.1 bc | 2.8 ± 0.2 d | 14.9 ± 10.1 cd | 36.6 ± 25.7 |
| Population 6 | 74 ± 06 cd | 172 ± 29 cd | 23.4 ± 4.5 cd | 4.4 ± 1.7 cd | 6.0 ± 1.9 ab | 4.6 ± 0.9 ab | 14.5 ± 12.7 d | 33.0 ± 32.3 |
| Population 7 | 78 ± 08 cd | 192 ± 35 bc | 15.5 ± 6.5 efg | 5.5 ± 1.9 bcd | 2.9 ± 0.5 ef | 4.5 ± 0.6 ab | 28.7 ± 21.5 abc | 12.4 ± 16.5 |
| Population 8 | 85 ± 08 a | 199 ± 36 bc | 13.4 ± 1.9 fg | 6.5 ± 2.0 ab | 1.9 ± 0.3 f | 3.9 ± 0.6 abc | 30.7 ± 09.4 ab | 36.3 ± 22.4 |
| Population 9 | 75 ± 04 cd | 192 ± 25 bc | 24.2 ± 4.7 bc | 6.9 ± 2.7 ab | 4.2 ± 1.3 cde | 2.9 ± 0.4 d | 18.2 ± 09.9 bcd | 42.9 ± 21.4 |
| Population 10 | 76 ± 07 cd | 203 ± 26 b | 29.3 ± 7.6 b | 4.5 ± 1.3 cd | 6.9 ± 1.6 a | 4.6 ± 0.7 a | 14.7 ± 08.1 d | 24.6 ± 27.1 |
| Population 11 | 84 ± 04 ab | 200 ± 19 bc | 15.5 ± 3.4 efg | 5.9 ± 1.4 abc | 2.7 ± 0.2 f | 3.3 ± 0.3 cd | 13.8 ± 06.7 d | 21.1 ± 15.0 |
Phenotype evaluations were completed at Isabela and Mayaguez, Puerto Rico in 2013 and 2014, respectively. Means followed by a common letter are not significantly different according to Fisher’s least significant different test
1Flowering time refers to days to 50% flowering of the plot
2Plant height (cm) refer to the distance from the base of the main stalk to the top of the panicle without sorghum converted accessions
3Panicle length (cm) refers to the distance from the base to the top of the panicle
4Panicle diameter (cm) refers to the wider region of the panicle
5Ratio refers to the length/diameter
6Seed size refers to the volume of 100 seeds
7Dhurrin (μg/cm2) content in leaf measure at post-flowering stage using HPLC [53]
8G15 refers to the percent of 30 seeds that germinated after 72 h at 15 °C in growth chamber
n.s., ** and *** refers to no significant, and significant effects at p ≤ 0.01 and .005, respectively
C.V. refer to coefficient of variation
Seed composition traits for the 11 NPGS Ethiopian subpopulations
| References | Protein1 | Fat1 | Fiber1 | Ash1 | Starch1 | Phenols2 | Tannins3 | 3-DOAs4 |
|---|---|---|---|---|---|---|---|---|
| RTx 2911 | 11.4 ± 1.0 | 1.6 ± 0.2 | 1.4 ± 0.1 | 1.3 ± 0.00 | 66.6 ± 1.2 | 05.1 ± 1.7 | 00.0 ± 00.0 | 36.8 ± 07.8 |
| SC719 | 10.0 ± 1.3 | 2.9 ± 0.5 | 1.9 ± 0.2 | 1.3 ± 0.10 | 66.4 ± 1.0 | 06.3 ± 1.7 | 11.4 ± 02.8 | 22.0 ± 03.9 |
|
| ||||||||
| Population | *** | ** | *** | *** | n.s. | *** | *** | *** |
| R2 | 0.35 | 0.22 | 0.35 | 0.54 | 0.11 | 0.52 | 0.46 | 0.34 |
| C.V. | 16.77 | 18.50 | 11.29 | 6.15 | 2.56 | 66.00 | 94.57 | 64.80 |
|
| ||||||||
| Population 1 | 10.7 ± 1.6 c | 2.7 ± 0.6 c | 1.6 ± 0.1 bc | 1.25 ± 0.04 abc | 66.0 ± 1.4 | 09.6 ± 5.3 b | 16.9 ± 14.1 b | 35.1 ± 21.9 cd |
| Population 2 | 13.0 ± 3.5 bc | 2.8 ± 0.9 bc | 1.6 ± 0.2 abc | 1.33 ± 0.08 a | 64.9 ± 2.5 | 08.4 ± 4.0 bc | 13.5 ± 09.0 bc | 29.5 ± 22.0 cd |
| Population 3 | 13.1 ± 2.1 bc | 2.8 ± 0.3 bc | 1.6 ± 0.2 abc | 1.27 ± 0.07 abc | 65.1 ± 1.5 | 05.2 ± 5.8 bcd | 07.4 ± 11.8 bcd | 39.1 ± 29.3 bcd |
| Population 4 | 13.4 ± 2.9 b | 2.8 ± 0.4 c | 1.6 ± 0.0 bc | 1.23 ± 0.12 bc | 62.9 ± 2.0 | 06.2 ± 2.4 bcd | 02.3 ± 03.3 cd | 46.7 ± 14.3 bcd |
| Population 5 | 10.8 ± 0.6 c | 3.3 ± 0.9 abc | 1.5 ± 0.1 cd | 1.31 ± 0.08 ab | 65.5 ± 0.6 | 10.0 ± 1.7 b | 18.9 ± 05.1 b | 15.4 ± 22.6 d |
| Population 6 | 12.8 ± 2.2 bc | 2.7 ± 0.5 c | 1.7 ± 0.1 ab | 1.31 ± 0.05 ab | 64.8 ± 1.5 | 05.1 ± 3.3 bcd | 07.3 ± 06.8 bcd | 31.9 ± 14.4 cd |
| Population 7 | 14.0 ± 2.3 b | 3.1 ± 0.5 abc | 1.5 ± 0.2 bc | 1.29 ± 0.06 ab | 64.0 ± 1.7 | 03.2 ± 3.8 cd | 03.1 ± 06.7 cd | 54.1 ± 32.5 abc |
| Population 8 | 15.1 ± 1.3 ab | 3.4 ± 0.2 ab | 1.6 ± 0.1 bc | 1.20 ± 0.11 c | 64.0 ± 1.0 | 09.5 ± 9.7 b | 15.9 ± 20.3 b | 69.1 ± 18.3 ab |
| Population 9 | 16.9 ± 2.1 a | 3.5 ± 0.5 a | 1.3 ± 0.2 d | 1.07 ± 0.13 d | 64.3 ± 1.4 | 18.1 ± 9.7 a | 33.5 ± 15.5 a | 82.1 ± 44.0 a |
| Population 10 | 13.8 ± 2.9 b | 3.0 ± 0.9 abc | 1.6 ± 0.2 abc | 1.29 ± 0.06 ab | 64.3 ± 2.2 | 06.2 ± 6.7 bcd | 10.0 ± 13.8 bcd | 23.2 ± 21.9 cd |
| Population 11 | 13.1 ± 0.8 bc | 3.5 ± 0.2 a | 1.8 ± 0.1 a | 1.30 ± 0.02 ab | 64.8 ± 0.8 | 00.9 ± 0.8 d | 00.0 ± 00.1 d | 29.1 ± 17.2 cd |
Seed composition traits were determined based on near-infrared (NIR) spectroscopy calibration curve (Dykes et al. [45]) from fresh seed obtained from Isabela, Puerto Rico in 2013. Means followed by a common letter are not significantly different according to Fisher’s least significant different test
1Values are based on percent
2Total phenols based on milligram of gallic acid equivalent (GAE)/gram
3Condensed tannins based on milligram of catechin equivalents (CE)/gram
43-Deoxyanthocyanidins content based on absorbance (abs)/milliliter/gram
n.s., ** and *** refers to no significant, and significant effects at p ≤ 0.01 and .005, respectively
C.V. refer to coefficient of variation
Principal component analysis of 16 phenotype traits evaluated in 374 accessions from NPGS Ethiopian germplasm collection
| Trait | PC1 | PC2 | PC3 | PC4 |
|---|---|---|---|---|
| Phenols1 | 0.37* | −0.08 | −0.31 | 0.09 |
| Protein2 | 0.35* | −0.05 | 0.30* | 0.25 |
| Tannins3 | 0.36* | −0.05 | −0.33 | 0.08 |
| 3-DOAs4 | 0.31* | −0.21 | 0.20 | 0.03 |
| Plant height5 | 0.08 | 0.57* | 0.12 | −0.10 |
| Length6 | 0.09 | 0.57* | −0.21 | 0.28 |
| Diameter7 | 0.29 | 0.34* | −0.02 | −0.48 |
| Dhurrin 8 | 0.10 | −0.03 | 0.38* | 0.04 |
| Ratio9 | −0.13 | 0.18 | −0.18 | 0.70* |
| Flowering10 | 0.05 | 0.28* | 0.23 | −0.10 |
| Seed Size11 | −0.31 | 0.00 | 0.20 | 0.07 |
| Fat2 | 0.25 | 0.15 | 0.18 | 0.04 |
| Fiber2 | −0.29 | 0.13 | −0.05 | −0.06 |
| Ash2 | −0.33 | 0.13 | 0.26 | 0.03 |
| Starch2 | −0.20 | 0.03 | −0.47 | −0.29 |
| G15 | 0.01 | −0.14 | −0.08 | −0.02 |
| % Variance | 24.4 | 13.6 | 12.6 | 9.90 |
| % Cumulative | 24.4 | 38.0 | 50.6 | 60.5 |
Phenotype evaluations were completed at Isabela and Mayaguez, Puerto Rico in 2013 and 2014
1Total phenols based on milligram of gallic acid equivalent (GAE)/gram
2Values are based on percent
3Condensed tannins based on milligram of catechin equivalents (CE)/gram
43-Deoxyanthocyanidins content based on absorbance (abs)/milliliter/gram
5Plant height refer to the distance from the base of the main stalk to the top of the panicle without sorghum converted accessions
6Panicle length refers to the distance from the base to the top of the panicle
7Panicle diameter refers to the wider region
8Dhurrin content in leaf measure at post-flowering stage using HPLC [53]
9Ratio refers to the length/diameter
10Flowering time refers to days to 50% flowering of the plot
11Seed size refers to the volume of 100 seeds
* most important variable in the component
Fig. 7GWAS for plant height using a subset of 352 Ethiopian accessions from NPGS germplasm collection
Fig. 8GWAS for seed protein and fat content using a subset of 374 Ethiopian sorghum accessions from the NPGS germplasm collection. a GWAS for protein content, b GWAS for fat content, c Chromosome 2 region associated with fat and protein