| Literature DB >> 33681979 |
J Lucas Boatwright1,2, Zachary W Brenton1,3, Richard E Boyles1,2, Sirjan Sapkota1, Matthew T Myers1, Kathleen E Jordan1, Savanah M Dale1, Nadia Shakoor4, Elizabeth A Cooper5, Geoffrey P Morris6, Stephen Kresovich1,2.
Abstract
Sorghum bicolor, a photosynthetically efficient C4 grass, represents an important source of grain, forage, fermentable sugars, and cellulosic fibers that can be utilized in myriad applications ranging from bioenergy to bioindustrial feedstocks. Sorghum's efficient fixation of carbon per unit time per unit area per unit input has led to its classification as a preferred biomass crop highlighted by its designation as an advanced biofuel by the U.S. Department of Energy. Due to its extensive genetic diversity and worldwide colonization, sorghum has considerable diversity for a range of phenotypes influencing productivity, composition, and sink/source dynamics. To dissect the genetic basis of these key traits, we present a sorghum carbon-partitioning nested association mapping (NAM) population generated by crossing 11 diverse founder lines with Grassl as the single recurrent female. By exploiting existing variation among cellulosic, forage, sweet, and grain sorghum carbon partitioning regimes, the sorghum carbon-partitioning NAM population will allow the identification of important biomass-associated traits, elucidate the genetic architecture underlying carbon partitioning and improve our understanding of the genetic determinants affecting unique phenotypes within Poaceae. We contrast this NAM population with an existing grain population generated using Tx430 as the recurrent female. Genotypic data are assessed for quality by examining variant density, nucleotide diversity, linkage decay, and are validated using pericarp and testa phenotypes to map known genes affecting these phenotypes. We release the 11-family NAM population along with corresponding genomic data for use in genetic, genomic, and agronomic studies with a focus on carbon-partitioning regimes.Entities:
Keywords: MPP; carbon-partitioning; genome-wide association study; genotype-by-sequencing; multiparental populations; nested association mapping; pericarp color
Mesh:
Substances:
Year: 2021 PMID: 33681979 PMCID: PMC8759819 DOI: 10.1093/g3journal/jkab060
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
CP-NAM parent agronomic and physiological traits
| Accession | Days to Harvest | Stalk weight (kg) | Leaf weight (kg) | Panicle weight (kg) | Brix | WSC (%DM) |
|---|---|---|---|---|---|---|
| PI154844 | 163.25 | 2.57 | 0.39 | 0.14 | 12.95 | − |
| PI22913 | 122.67 | 1.31 | 0.17 | 0.14 | 13.9 | 27.4 |
| PI229841 | 136 | 1.05 | 0.19 | 0.24 | 10.35 | 16.48 |
| PI297130 | 158.5 | 2.59 | 0.43 | − | 11.8 | 24.92 |
| PI297155 | 114 | 0.56 | 0.16 | 0.14 | 6.33 | − |
| PI506069 | 159 | 2.13 | 0.44 | − | 7.9 | 10.03 |
| PI508366 | 156 | 2.38 | 0.4 | − | 6.35 | 12.1 |
| PI510757 | 157 | 2.56 | 0.55 | − | 7.15 | 25.91 |
| PI563295 | 138.25 | 1.87 | 0.24 | 0.16 | 14.38 | 31.84 |
| PI586454 | 114.25 | 1.18 | 0.2 | 0.12 | 11.73 | − |
| PI655972 | 110.5 | 0.49 | 0.14 | 0.13 | 8.78 | 9.5 |
Cells contain the mean the standard deviation for each trait where replicate data were available. Single-replicate data do not contain standard deviations, and missing data are represent by “−.”
PI329311 did not have adequate germplasm for inclusion.
Photoperiod sensitive lines did not produce panicles.
NAM parent characteristics
| Common name | Accession | Race | Origin | Sorghum type | Pericarp | Testa |
|---|---|---|---|---|---|---|
| Grassl | PI154844 | Caudatum | Uganda | Cellulosic | r | y |
| Chinese Amber | PI22913 | Bicolor | China | Sweet | b | y |
| IS 2382 | PI229841 | Kafir | South Africa | Grain | r | y |
| IS 13613 | PI297130 | Caudatum | Uganda | Cellulosic | w | y |
| IS 13633 | PI297155 | Kafir | Uganda | Grain | r | y |
| IS 11069 | PI329311 | Durra | Ethiopia | Cellulosic | y | n |
| Mbonou | PI506069 | Guinea-bicolor | Togo | Cellulosic | y | n |
| MA 38 | PI508366 | Guinea | Mali | Cellulosic | w | n |
| AP79-714 | PI510757 | Durra | Cameroon | Cellulosic | w | n |
| Rio | PI563295 | Durra-caudatum | Maryland, USA | Sweet | w | y |
| Leoti | PI586454 | Kafir-bicolor | Hungary | Sweet | r | y |
| Pink Kafir | PI655972 | Kafir | Kansas, USA | Forage | w | n |
The characteristics of each NAM parent include the common name, USDA plant introduction numbers, botanical race, original source of germplasm, type as defined in the introduction, pericarp color, and presence or absence of pigmentation within the testa layer.
b, brown; r, red; y, yellow, and w, white.
y, pigmented and n, not pigmented.
RIL family statistics
| Male parent | No. indiv. | No. markers | Avg. inbreeding coefficient |
|---|---|---|---|
| PI22913 | 203 | 9,258 | 0.62 |
| PI229841 | 209 | 7,422 | 0.60 |
| PI297130 | 243 | 8,770 | 0.60 |
| PI297155 | 216 | 7,470 | 0.61 |
| PI329311 | 240 | 9,460 | 0.58 |
| PI506069 | 204 | 7,756 | 0.59 |
| PI508366 | 232 | 7,097 | 0.63 |
| PI510757 | 274 | 7,193 | 0.58 |
| PI563295 | 245 | 6,656 | 0.56 |
| PI586454 | 200 | 8,038 | 0.57 |
| PI655972 | 223 | 7,430 | 0.63 |
The columns from left to right are the male parent plant introduction number (column 1), the total number of individuals (column 2), markers in each family (column 3), and the average inbreeding coefficient (column 4).
Figure 1Linkage decay (Pearson’s correlation coefficient squared) plotted against the distance in kilobases across the genome.
Figure 2Principal component analysis plot using the whole CP-NAM population. Parents are labeled with the common name. The individual samples of the recurrent parent, Grassl, are additionally labeled with “x.” Each RIL family is represented by the male parent PI.
Figure 3Genome-wide, population admixture of the CP-NAM. Individuals (x-axis) are shown as vertical bars colored in proportion to their estimated ancestry within each cluster (y-axis) based upon 15 ancestral populations (K = 15) where each genetically distinct ancestral population is given a unique color.
Figure 4CP-NAM PCA with admixture coloration. Individuals were classified as Q1–Q15 as determined by the proportion of ancestral admixture. Cells 1–11 represent individual RIL families represented by the paternal identifier while the 12th cell contains the entire CP-NAM population.
Figure 5Univariate GWAS for pericarp pigmentation. The −log10 P-values (y-axis) are plotted against the position on each chromosome (x-axis). Each circle represents a SNP, and the red dashed line represents the Bonferroni-corrected threshold.
Figure 6Univariate GWAS for binary encoding of yellow pericarp pigmentation. The −log10 P-values (y-axis) are plotted against the position on each chromosome (x-axis). Each circle represents a SNP, and the red dashed line represents the Bonferroni-corrected threshold.
Top associations from the NAM GWAS
| Model | Chromosome | Position | Wald | Genes |
|---|---|---|---|---|
| Univariate Pericarp | 1 | − | − | <block too large> |
| Univariate Pericarp | 2 | 6,940,113 | 1.23E-10 | tan2 |
| Univariate Pericarp | 2 | 57,797,411 | 2.26E-06 | Classical Z locus; Sobic.002G190000 |
| Univariate Pericarp | 4 | 62,215,490 | 3.76E-31 | tan1 |
| Univariate Pericarp | 7 | 9,097,206 | 4.86E-15 | Unknown |
| Univariate Pericarp | 7 | 44,198,228 | 2.66E-08 | Unknown |
| Univariate Yellow | 1 | 71,320,809 | 1.39E-08 | y1 |
| Univariate Yellow | 2 | 57,797,411 | 2.50E-09 | Classical Z locus; Sobic.002G190000 |
| Univariate Yellow | 4 | 62,215,490 | 7.70E-17 | tan1 |
| Univariate Yellow | 7 | 8,111,484 | 2.00E-10 | Unknown |
| Univariate Yellow | 10 | 56,346,032 | 6.96E-08 | Putative carotenoid regulator |
| Multivariate Pericarp | 1 | − | − | <block too large> |
| Multivariate Pericarp | 2 | 6,940,113 | 5.00E-16 | tan2 |
| Multivariate Pericarp | 4 | 62,215,490 | 3.82E-37 | tan1 |
| Multivariate Pericarp | 4 | 62,463,940 | 1.55E-76 | tan1 |
| Multivariate Pericarp | 6 | 55,070,387 | 6.89E-06 | Putatively |
| Multivariate Pericarp | 7 | 8,111,484 | 5.54E-25 | Unknown |
| Multivariate Pericarp | 7 | 39,531,969 | 5.51E-16 | Unknown |
| Multivariate Pericarp | 7 | 44,198,228 | 6.02E-16 | Unknown |
| Multivariate Pericarp | 10 | 1,948,816 | 4.30E-52 |
|
For each significant association, the model, chromosome containing the SNP, SNP position, wald t-test P-value, and putative gene in LD with the significant SNP are identified.
Figure 7Multivariate GWAS for pericarp color and testa pigmentation. The −log10 P-values (y-axis) are plotted against the position on each chromosome (x-axis). Each circle represents a SNP, and the red dashed line represents a Bonferroni-corrected threshold.