| Literature DB >> 35769295 |
Georgia C Eizenga1, HyunJung Kim2, Janelle K H Jung2, Anthony J Greenberg3, Jeremy D Edwards1, Maria Elizabeth B Naredo4, Maria Celeste N Banaticla-Hilario4, Sandra E Harrington2, Yuxin Shi2, Jennifer A Kimball2, Lisa A Harper2, Kenneth L McNally4, Susan R McCouch2.
Abstract
Crop wild relatives represent valuable reservoirs of variation for breeding, but their populations are threatened in natural habitats, are sparsely represented in genebanks, and most are poorly characterized. The focus of this study is the Oryza rufipogon species complex (ORSC), wild progenitor of Asian rice (Oryza sativa L.). The ORSC comprises perennial, annual and intermediate forms which were historically designated as O. rufipogon, O. nivara, and O. sativa f. spontanea (or Oryza spp., an annual form of mixed O. rufipogon/O. nivara and O. sativa ancestry), respectively, based on non-standardized morphological, geographical, and/or ecologically-based species definitions and boundaries. Here, a collection of 240 diverse ORSC accessions, characterized by genotyping-by-sequencing (113,739 SNPs), was phenotyped for 44 traits associated with plant, panicle, and seed morphology in the screenhouse at the International Rice Research Institute, Philippines. These traits included heritable phenotypes often recorded as characterization data by genebanks. Over 100 of these ORSC accessions were also phenotyped in the greenhouse for 18 traits in Stuttgart, Arkansas, and 16 traits in Ithaca, New York, United States. We implemented a Bayesian Gaussian mixture model to infer accession groups from a subset of these phenotypic data and ascertained three phenotype-based group assignments. We used concordance between the genotypic subpopulations and these phenotype-based groups to identify a suite of phenotypic traits that could reliably differentiate the ORSC populations, whether measured in tropical or temperate regions. The traits provide insight into plant morphology, life history (perenniality versus annuality) and mating habit (self- versus cross-pollinated), and are largely consistent with genebank species designations. One phenotypic group contains predominantly O. rufipogon accessions characterized as perennial and largely out-crossing and one contains predominantly O. nivara accessions characterized as annual and largely inbreeding. From these groups, 42 "core" O. rufipogon and 25 "core" O. nivara accessions were identified for domestication studies. The third group, comprising 20% of our collection, has the most accessions identified as Oryza spp. (51.2%) and levels of O. sativa admixture accounting for more than 50% of the genome. This third group is potentially useful as a "pre-breeding" pool for breeders attempting to incorporate novel variation into elite breeding lines.Entities:
Keywords: Bayesian Gaussian mixture models; Oryza nivara; Oryza rufipogon; Oryza rufipogon species complex; Oryza sativa; crop wild relatives; genebank accessions; rice
Year: 2022 PMID: 35769295 PMCID: PMC9235872 DOI: 10.3389/fpls.2022.787703
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
FIGURE 1Phenotypic group composition. (A) Bar plot depicting posterior probabilities that a given accession (along the x-axis) belongs to each of the four phenotypic groups. The bars are stacked, with the total probability summing to 1. The bars below the x-axis indicate the O. sativa genome fraction, with the darker bars having a higher O. sativa fraction. (The actual values are in Supplementary Table 2) (B) Sankey plots comparing compositions of phenotypic groups inferred from data collected at Dale Bumpers and Cornell assuming N = 3 compared to species designations and the four groups inferred from data collected at IRRI. Each accession is a single “ribbon” running from left to right and colored according to the grouping to the left.
FIGURE 2Phenotypic identifiers of group divergence based on the IRRI data. (A) Traits sorted by the strength of the association of their values to N = 4 group assignments based on Hotelling T2 values (B) Among-trait correlation estimates within each phenotypic group. Squares mark trait pairs, with the three bars within each square colored according to correlation magnitude in the three phenotypic groups. Dots represent probabilities of observing the correlation differences by chance (lower probabilities reflect high confidence of correlation difference). The inset shows a segment of the plot in more detail. (Trait acronyms and actual values are in Supplementary Table 7) (C) Sankey plot comparing accession membership across groups inferred from trait subsets listed in Table 1.
Phenotypic traits identified by trait value (9 traits), correlation switching (11 traits) and the union of both sets of traits (16 traits) based on the analysis of the phenotypic trait data collected at IRRI.
| Descriptive trait name | Trait acronym (IRRI) | Top traits by value (9) | Top traits by correlation (11) | Union of value & correlation (16) |
| Anther length | ANTLT | X | X | X |
| Awn length | AWNLT | X | X | |
| Awn width | AWNWD | X | X | |
| Culm diameter | CUDI | X | X | X |
| Culm length | CULT | X | X | X |
| Days to 50% heading | DTHD | X | X | |
| Distance of nearest spikelet to the panicle base | DIST | X | X | |
| Flag leaf lamina width | FLFWD | X | X | |
| Ligule length | LIGLT | X | X | |
| No. empty spikelets per panicle | UNFILLED | X | X | X |
| Panicle fertility | FERT | X | X | |
| Panicle length | PNLG | X | X | |
| Penultimate (2nd) leaf length | 2LLT | X | X | |
| Spikelet length | SPKLT | X | X | |
| Spikelet width | SPKWD | X | X | |
| Sterile lemma width | STLWD | X | X |
These were used to construct the groups shown in the Sankey plot (
FIGURE 3Genotypic composition, local ancestry and phenotypic groups of ORSC accessions. (A) Sankey plot depicting correspondence between traditional species designations, our phenotypic groups, and Kim et al. (2016) genetic subpopulations using 32 phenotypic traits. (ADM/OSAT are accessions with a high proportion of admixture with O. sativa) (B) Graphical genotypes display the local ancestry of each ORSC individual as determined by RFMix trained on six ORSC subpopulations (W1-W6) and five O. sativa subpopulations: indica (IND), aus, (AUS), temperate japonica (TEJ), tropical japonica (TRJ), and aromatic (ARO). The accessions are arranged according to global genotypic ancestry (left) and phenotypic group assignment (right). Each individual’s genotype is represented by two adjoining rows corresponding to phased haplotypes with colors indicating RFMix population assignments of chromosome segments (C) Proportion of the ORSC subpopulations (left) and O. sativa subpopulations (right) within each of the three phenotypic classes.
FIGURE 4Comparison of ORSC genotypic subpopulations reported by Kim et al. (2016) based on 286 ORSC accessions to those reported by Wang et al. (2017) and Huang et al. (2012) based on 446 ORSC accessions. (A) Population structure assignments in a merged SNP dataset consisting of accessions from two ORSC collections. On the left side are the accessions from Kim et al. (2016), on the right are accessions from the analysis by Wang et al. (2017). In the center is a merged set with group assignments at K = 7. Ribbons connect the group assignments of the merged accessions with their group assignments in the respective ORSC collections (B) Comparison of the three subpopulations originally identified by Huang et al. (2012) to the six subpopulations and admixed group identified by Wang et al. (2017) when reanalyzing the same genotypic data. The inclusion of aus-like (Or-E) and indica-like (Or-F) accessions in Or-I and japonica-like (Or-B) accessions in Or-III as reported by Huang et al. (2012) is confirmed.