Literature DB >> 31145789

Dissecting the genome-wide genetic variants of milling and appearance quality traits in rice.

Gopal Misra1, Roslen Anacleto1, Saurabh Badoni1, Vito Butardo1, Lilia Molina1, Andreas Graner2, Matty Demont1, Matthew K Morell1, Nese Sreenivasulu1.   

Abstract

Higher head rice yield (HRY), which represents the proportion of intact grains that survive milling, and lower grain chalkiness (opacity) are key quality traits. We investigated the genetic basis of HRY and chalkiness in 320 diverse resequenced accessions of indica rice with integrated single- and multi-locus genome-wide association studies using 2.26 million single-nucleotide polymorphisms. We identified novel haplotypes that underly higher HRY on chromosomes 3, 6, 8, and 11, and that lower grain chalkiness in a fine-mapped region on chromosome 5. Whole-genome sequencing of 92 IRRI breeding lines was performed to identify the genetic variants of HRY and chalkiness. Rare and novel haplotypes were found for lowering chalkiness, but missing alleles hindered progress towards enhancing HRY in breeding material. The novel haplotypes that we identified have potential use in breeding programs aimed at improving these important traits in the rice crop.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Experimental Biology.

Entities:  

Keywords:  Genome-wide association studies; grain chalkiness; head rice yield; market survey; multi-locus GWAS; whole-genome resequencing

Year:  2019        PMID: 31145789      PMCID: PMC6793453          DOI: 10.1093/jxb/erz256

Source DB:  PubMed          Journal:  J Exp Bot        ISSN: 0022-0957            Impact factor:   6.992


Introduction

The annual global production of rice (Oryza sativa) grown in paddy systems amounts to 730 mt, resulting in 484 mt of polished grains after milling to remove the husk and the bran layers. As a staple food, rice provides at least 20% of the daily caloric requirement for ~3.5 billion people worldwide. To meet the future demands of a growing global population, it is estimated that annual milled rice production needs to increase by additional 105 mt by 2035. Despite the development of newer, high-yielding varieties, many farmers continue to plant the popular ‘mega-varieties’ produced between 1966–1985 (Peng ). The lack of sustained adoption of more recently released modern varieties suggests that we not only need to overcome yield stagnation in new varieties (Ray ), but that we also need to match the quality benchmarks of the mega-varieties in terms of premium grain quality traits. These include a high head rice yield and a low degree of chalkiness, which affects texture (Laborte ). Head rice yield (HRY) is the proportion of ‘intact’ grains that remain after milling, and comprises grains that are at least 75% of the original length (Cnossen ). HRY is an important quality trait that can fetch a higher price for farmers due to the net increase in milled whole-grain (unbroken) yield, especially when this is combined with a lower proportion of chalky grains. Dehulling and milling are important steps in the post-harvest processing of paddy rice, during which the grains are susceptible to fissuring and breakage (Buggenhout ). The mechanical stress imposed can result in up to 50% loss in HRY. Lower intrinsic grain strength leads to higher breakage during grain processing and this is measured as the fracture strength, which is defined as the force that can be tolerated by a grain before it cracks (Buggenhout ). There are many factors that affect HRY, including (i) post-harvest practices related to the process of grain drying, (ii) the grain moisture content at harvest, which affects the transition from a more diffusible, rubbery state to a less-diffusible, glassy starch that leads to grain fissuring (Cnossen ; Siebenmorgen ), (iii) the negative effect of high night temperatures during seed filling, and (iv) genetic factors (Cooper ; Sreenivasulu ). Although several milling quality traits that exhibit low heritability have been mapped in a subpopulation of japonica species (Nelson , 2012; Pinson ), no major quantitative trait loci (QTLs) and genes that influence HRY have been cloned in indica germplasm. The dissection of the genetic basis of HRY is important for understanding how kernel susceptibility to breakage can be reduced, and milling quality improved, through marker-assisted breeding in the long, slender grains of indica rice. Another important grain quality trait is chalkiness, which is generally characterized as an opaque white discoloration in the translucent endosperm caused by the creation of air spaces between irregularly shaped starch granules (Butardo and Sreenivasulu, 2019). An increased proportion of chalky grains lowers the appearance quality of rice, and is also believed to lower the milling quality by increasing the incidence of grain breakage (Del Rosario ; Lisle ; Wang ; Cooper ). Thus, chalkiness has negative impacts on the market value of rice. Grain chalkiness is influenced by genetic and environmental factors (Lisle ; Yamakawa ; Lanning ; Siebenmorgen ; Zhao and Fitzgerald, 2013). Chalk is a complex and polygenic quantitative trait and several fine-mapped target genes have been shown to influence it, including pyruvate orthophosphate dikinase (Kang ), starch synthase IIIa (Fujita ), cell wall invertase (Wang ), UDP-glucose pyrophosphorylase (Woo ), H+-translocating pyrophosphatase (chalk5) (Li ), and a transcriptional activator GS9 (Zhao ). Genome-wide association studies (GWAS) in combination with targeted-gene association studies (TGAS) have emerged as a two-pronged strategy that can assist in narrowing down the significant trait-specific gene(s) underlying QTLs and in revealing allelic variants that regulate complex grain-quality traits (Butardo ; Misra ). Here, we studied the genetic basis of HRY and chalkiness using combined GWAS (single- and multi-locus) and gene-set analysis by employing 2.26 million single-nucleotide polymorphisms (SNPs) from resequencing data of 320 indica rice accessions. This approach helped to identify key candidate genes that influence the complex genetic architecture of HRY under controlled drying as well as under moisture-stressed conditions. We identified novel haplotypes/allelic variants for enhanced HRY and reduced grain chalkiness. Our findings will assist the future development of high-yielding elite germplasm with superior grain quality and thus contribute to improving the livelihoods of farmers in many developing countries.

Materials and methods

Plant material and HRY analysis from controlled and moisture-stressed drying conditions

A diversity panel of 320 Oryza sativa subsp. indica accessions (262 landraces and 58 improved lines) were selected from 3000 resequenced rice genomes (The 3000 Rice Genomes Project, 2014; Wang ) (Supplementary Table S1 at JXB online). The accessions were grown in field plots of 12.5×50 m in three replicated randomized blocks during the 2014 wet season and during the 2015 dry season at the Experimental Station of the International Rice Research Institute (IRRI), Laguna, Philippines (14°N, 121°E). Standard crop management procedures were applied. Upon reaching seed maturity, the individual plots were harvested manually and hand-threshed at an optimum seed moisture content (MC) of 22–24%. For the controlled drying treatment, grains were then subjected to flat-bed drying to achieve 12–14% MC and subsequently placed in a seed storage room maintained at 18 °C before being milled. To further test the effect of moisture stress on HRY, dried harvested grains (~200 g samples) with 12–14% MC were soaked in reverse osmosis water for 60 min at ambient temperature until the MC of duplicate samples reached the optimum level of 22–24%. The samples were then placed inside a pre-heated oven at 45 °C until they returned to 12–14% MC. The samples were then placed at room temperature inside an air-tight plastic ziplock bag for 1 h, after which HRY was determined. Subsamples of grains (125 g, three replicates) from the control and moisture stress treatments were dehulled using a Satake dehuller (model THU-35A, Japan) with the settings optimized for the different grain shapes. Each sample was then milled using Grainman mill (model 60-200-60-DT, Miami) for 45 s. Head rice was separated from broken kernels using a post-harvest drum type-sizing machine (Satake rice machine type TRG with three different layers) and HRY (%) was calculated as the mass proportion of milled rice retained as head rice after complete milling. The stability value was calculated by dividing the HRY value under stress by the HRY value under controlled condition. The HRY evaluation was conducted according to the ISO 17025 guidelines. An additional 300 indica lines from the diversity panel with SNP-based genotyping data of 700K (McCouch ) were planted at the IRRI experimental station in 1-m2 plots in a six rows by six hills per row configuration with 20 cm between hills and rows. The plots were arranged in three replicated randomized blocks, and plants were grown during the dry seasons in 2013 and 2014. To mimic the usual farming practice of sun-drying, samples of 2 kg of intact straw and panicles were packed in large net bags and subjected to sun-drying followed by drying under shade until they attained 12–14% MC. The samples were then evaluated for HRY and other milling quality parameters as described above.

Phenotyping for grain chalkiness

Dried samples of grain at 12–14% MC (50 g, three replicates) of each cultivar were used to determine percentage chalkiness. Chalkiness was estimated using a SeedCount SC5000 Image Analyser (Next Instruments, NSW, Australia) as percentage grain chalkiness (PGC) using optimized methods. Grains were scattered randomly onto the sample tray of the instrument, and after scanning PGC was estimated using the in-built software after calibration according to the manufacturer’s instructions.

Performance of historical breeding lines and their whole-genome sequencing

We evaluated a total of 106 breeding lines developed by IRRI between 1966–2015 for HRY and chalkiness. Briefly, the lines were grown in field plots during wet (2015) and dry (2016) seasons according to standard agronomic practice (Laborte ). Grain was harvested at 22–24% MC and subjected to standard drying conditions as described above. The dried samples were evaluated for HRY and chalkiness as described above. Genomic DNA extracted from leaf samples of 92 breeding lines were subjected to whole-genome sequencing using the HiSeq X10 platform (Illumina) and 150-bp pair-end reads were generated at ~30× coverage. After completion of the sequencing process, the raw reads were filtered to remove adaptor sequences and low-quality reads.

Survey of farmers for desirable traits in the rice crop

IRRI conducted in-depth market surveys during 2015–2016 involving 886 rice farmers from Bangladesh, the eastern part of India, Cambodia, and the Philippines (Ynion , 2016; Maligalig, 2018). An interactive tablet application was utilized to assess the priorities of traits for their improvement that farmers would consider before they would replace their most popular current varieties with new ones. The application enabled farmers to design an ideal future variety profile by allocating a fixed investment fund across a portfolio of 11 traits, divided into market-related traits (slenderness, aroma, stickiness, and HRY) and environment-related traits (lodging tolerance, insect resistance, disease resistance, abiotic stress tolerance, reduction of shattering, earliness, and straw digestibility).

Genome-wide association studies and gene-set analysis

A resequenced diversity panel consisting of 324 indica accessions was used for calling of genome-wide SNPs against reference genomes of japonica (cv. Nipponbare) and indica (MH63 and ZS97) (Zhang ). An efficient mixed-model association expedited (EMMAX) approach was used for GWAS (Kang ). HRY and PGC data for the different seasons were transformed using the Python software WarpedLMM (https://pypi.org/project/WarpedLMM/) to adjust the phenotype distribution. Genotype files were filtered prior to running the association analysis. Individual genotypes and SNPs were retained with a missing rate of not more than 5% and further filtered for a minor allele frequency of at least 5%. This plink2-based filtering step (https://www.cog-genomics.org/plink/2.0/) resulted in a final set of 320 indica lines that accounted for 2 260 030 SNPs against the Nipponbare genome, 802 117 SNPs against the MH63 genome, and 450 604 SNPs against the ZS97 genome. For a diversity panel of 300 lines, we used the previously published 700K SNP data from McCouch . The statistical model used in EMMAX is a variance component model detailed in Kang . Four principal components (PCs), explaining ~14% of total variation, were used as covariates in GWAS after plotting 20 PCs in a scree plot. To determine the PC, a set of 2.26 million SNPs detected against the Nipponbare reference was utilized and the analysis was performed using the SNPRelate package in the R platform (http://dx.doi.org/10.18129/B9.bioc.SNPRelate). The data from the different seasons were used to check the reproducibility of the results. After running the GWAS on different seasons, we further analysed the results from indica resequenced lines. SNPs were tagged based on linkage disequilibrium (LD) from the set of SNPs with –log10P>5 for detailed analysis of HRY and PGC. The complete parameters used for clumping (in plink2) were adapted from Misra , except that we used the --clump-p1 of 1e−7.6. Haploview (Barrett ) was used to construct LD-blocks, and plots were then generated using standard procedures (Misra ). All genomic positions and gene annotations were based on the Nipponbare (MSUv7), MH63, and ZS97 reference genomes. The multi-locus GWAS was run to verify the peaks identified by single-locus GWAS. Different multi-locus (ML) GWAS methods, namely FASTmrEMMA (Wen ), mrMLM (Wang ), and ISIS EM-BLASSO (Tamba ), were followed as detailed in Misra . We performed a gene-set analysis based on the MSU7 annotation using the publicly available MAGMA software (de Leeuw ) for those genes implicated by the SNPs that met the Bonferroni threshold.

Validating the haplotype blocks

Haplotype blocks with distinguished phenotype ranges were validated using the SNP-seek database (Alexandrov ) where SNP data for 3000 rice genomes are archived.

Collinearity overlays with HRY and chalkiness genetic regions

The translated protein sequences obtained from reference genomes of Nipponbare, MH63, and ZS97 were used for deriving synteny. All-to-all blastP was performed for each pair of reference protein sequences using the NCBI-BLAST-2.2.28+ tool (Altschul ). A stringent filtering criterion of e−20 was used in the BLAST alignment of protein sequences between japonica (Nipponbare) and indica (ZS97). The longest proteins were selected in cases where splice variants were present. Collinearity was then established using MCScanX (Wang ) with the threshold of 10 genes constructing each collinear block between indica and japonica, whilst 12 genes were used between the two indica reference genomes. On the basis of these outcomes, colinearity was identified and represented in the form of blocks using Circos (Krzywinski ) that overlaid with the genomic region regulating HRY and PGC that had been identified through GWAS.

Results

Survey of farmers for importance of HRY

Given a fixed investment fund to be divided between 11 market- and environment-related traits, the share that farmers invested in HRY was in the range 0–14%, depending on the geographic location, the season, and the rice variety that was currently being grown (Fig. 1a). Slender grain varieties such as Phka Rumdoul in Cambodia (14%), SL-8 in the Philippines (12%), BRRI Dhan-50 in Bangladesh (11%), and Miniket in East India (7%) attracted the highest investment in HRY, and some of these varieties are typically grown in the dry season. Data for 106 breeding lines developed at IRRI during 1966–2015 analyzed by Laborte indicate that over several generations of introduced varieties, HRY has declined (Fig. 1b, bar plot, orange bars), while chalkiness has dramatically increased (blue bars). This is consistent with farmers’ prioritization of investment in HRY for high-yielding varieties such as NSIC Rc222 and hybrids such as SL-8. These trends argue for more investment in research focused on unravelling the genetics of HRY.
Fig. 1.

Survey of farmers for importance of head rice yield (HRY) in mega-varieties in the Philippines, Bangladesh, Cambodia and Eastern India, and evolution of grain chalkiness and HRY in breeding lines released by IRRI during the period 1966–2015. (a) Investment share assigned by farmers in HRY relative to other traits as elicited through an investment exercise (see Methods). (b) Trends in the genetic gain for % chalkiness and % HRY observed in dry seasons (DS) and wet seasons (WS) for the 106 breeding lines developed at IRRI between 1966–2015. The decrease in the HRY was observed to be more prominent during dry seasons, whilst chalkiness showed a similar trend in DS and WS. In the bar plots, orange bars represent HRY and blue bars represent % chalkiness , as adapted from Laborte , and show a similar trend in the Philippines and corroborates our results.

Survey of farmers for importance of head rice yield (HRY) in mega-varieties in the Philippines, Bangladesh, Cambodia and Eastern India, and evolution of grain chalkiness and HRY in breeding lines released by IRRI during the period 1966–2015. (a) Investment share assigned by farmers in HRY relative to other traits as elicited through an investment exercise (see Methods). (b) Trends in the genetic gain for % chalkiness and % HRY observed in dry seasons (DS) and wet seasons (WS) for the 106 breeding lines developed at IRRI between 1966–2015. The decrease in the HRY was observed to be more prominent during dry seasons, whilst chalkiness showed a similar trend in DS and WS. In the bar plots, orange bars represent HRY and blue bars represent % chalkiness , as adapted from Laborte , and show a similar trend in the Philippines and corroborates our results.

Performance of HRY and chalkiness in historical breeding lines

We evaluated a total of 106 breeding lines developed by IRRI between 1966–2015 for HRY and chalkiness by growing them together in dry and wet seasons and subjecting the grain to conditions of controlled drying. While chalkiness was found to be high in varieties released between 1966–1975, it was substantially lower in modern varieties released between 2006–2015 (Fig. 1b, trend line). This reduction was due to successful selection made against chalkiness through efficient image-based phenotyping. In contrast, HRY of varieties released during the early post-Green Revolution was significantly higher (~60% HRY for 1966–1975 shown in bar plot) compared to modern varieties released between 2006–2015 (~45% HRY). For varieties released between 2006–2015, HRY was found to be significantly lower in the dry season that in the wet season (Fig. 1b, trend line).

Evaluation of a diversity panel for genetic variation in HRY and chalkiness

As we did not observe much genetic variation for HRY in existing breeding lines, we examined a diversity panel of 320 resequenced indica lines. We phenotyped a wide range of milling and appearance qualities traits, namely percent HRY, weight of head rice (WHR), percent brown rice (PBR), weight of brown rice (WBR), percent milled rice (PMR), weight of milled rice (WMR), and PGC. The diversity panel showed a high degree of variation for HRY (6.22–63.8%) (Fig. 2a) with a broad-sense heritability of H2=0.88 for replicates grown in the 2015 dry season. Nevertheless, when the wet and dry season data were combined, the broad-sense heritability for HRY dropped to 0.56. PGC ranged from 0.07–100 with H2=0.90 for the dry season; when data from both seasons were combined, heritability remained high at H2=0.86. The other milling quality traits showed less variability (Supplementary Fig. S1 at JXB online). While HRY was moderately correlated with PMR and WMR (r2=0.54–0.55) across the 320 lines, a high level of correlation was observed between HRY and WHR (r2≥0.99). HRY was found to be poorly correlated with PBR and WBR (r2=0.09–0.10) (Fig. 2b), and also with traits related to grain size and shape (length, –0.13; width, –0.11; shape, –0.01). Moreover, PGC was negatively correlated with HRY (r2=–0.25), although with low significance. Thus, strategies employed towards lowering chalkiness alone might not contribute significantly to increase HRY, indicating that it is important to independently examine the underlying genetic variations that contribute to increasing HRY in the pre-breeding pool.
Fig. 2.

Phenotypic variation for milling quality traits in the diversity panel of 320 indica lines and their correlations with grain chalkiness. (a) Diversity for milling potential showing a large degree of variation for HRY (y-axis). The labels at the top indicate the stages during the milling process. Milling fractions include the hull (outer seed coat), the removal of which during the dehulling process give rise to brown rice. Removal of bran fraction results in milled rice (broken + intact grains). The intact grains represent the % HRY. (b, c) Correlations among different milling quality traits, grain chalkiness, and grain dimension attributes in (b) control and (c) moisture stress conditions. Blue and red indicate positive and negative correlations, respectively, and the size of the circles and intensity of the colour represent the degree of correlation. PGC, % grain chalkiness; GL, grain length; GW, grain weight; PBR, % brown rice; WBR, weight brown rice; HRY, head rice yield (%); WHR, weight head rice; PMR, % milled rice; WMR, weight milled rice. In both control and stress conditions, PGC showed negative correlations with the all of milling quality traits including the HRY and WHR, whereas showed positive correlation with GW.

Phenotypic variation for milling quality traits in the diversity panel of 320 indica lines and their correlations with grain chalkiness. (a) Diversity for milling potential showing a large degree of variation for HRY (y-axis). The labels at the top indicate the stages during the milling process. Milling fractions include the hull (outer seed coat), the removal of which during the dehulling process give rise to brown rice. Removal of bran fraction results in milled rice (broken + intact grains). The intact grains represent the % HRY. (b, c) Correlations among different milling quality traits, grain chalkiness, and grain dimension attributes in (b) control and (c) moisture stress conditions. Blue and red indicate positive and negative correlations, respectively, and the size of the circles and intensity of the colour represent the degree of correlation. PGC, % grain chalkiness; GL, grain length; GW, grain weight; PBR, % brown rice; WBR, weight brown rice; HRY, head rice yield (%); WHR, weight head rice; PMR, % milled rice; WMR, weight milled rice. In both control and stress conditions, PGC showed negative correlations with the all of milling quality traits including the HRY and WHR, whereas showed positive correlation with GW. Susceptibility to grain breakage due to induced moisture stress was examined in the 320 lines. Notably, we observed a relatively lower mean HRY value of 34±0.71%, compared with 44 ±0.64% for controlled drying. The broad-sense heritability of the replicates for HRY under moisture stress was H2=0.84. Grain length showed negative correlations with HRY and WHR in the moisture stress treatment (r2=–0.30 for both), with a stronger relationship compared to control conditions (r2=–0.13) (Fig. 2c). These results suggested that variable moisture content in the grain makes long-grain types more susceptible to fracture during milling. PGC consistently showed a modest positive correlation with grain width under both control and stress conditions (Fig. 2b, c), suggesting a relatively close link between these traits.

Identification of genomic regions regulating HRY through single- and multi-locus GWAS

An approach based on a linear mixed model for GWAS was adapted using EMMAX, with both inferred population structure and kinship being used as covariates (Supplementary Fig. S2). Based on the genotyping data of 2 260 030 SNPs obtained from mapping against the japonica reference genome, the GWAS was run for all milling quality traits phenotyped in the 320 indica lines under controlled drying and moisture stressed conditions from 2015 dry season. We identified association peaks exceeding the threshold (–log10P>5) for HRY and WHR on chromosome 3 (30.09 Mb), chromosome 6 (27.57–27.62 Mb), chromosome 8 (25.05–25.20 Mb), and chromosome 11 (25.78–25.79) under controlled drying conditions, with a narrow-sense heritability h2=0.76 (Fig. 3, Supplementary Figs S3, S4). To verify the loci with moderate significance (–log10P>5), we utilized a multi-locus GWAS analysis and further confirmed that the significant loci for HRY traits on chromosomes 3, 6, 8, and 11 co-located with the single-locus GWAS results (marked as red arrows in Fig 3a and Supplementary S3). Using the genotyping data from the two indica reference genomes, we confirmed that the peaks detected in chromosomes 6 and 8 associated with HRY.
Fig. 3.

Single-locus GWAS for head rice yield (HRY) in control conditions and for moisture stress stability. (a) Manhattan plots for HRY under control conditions using the reference genomes of japonica Nipponbare, and indica Zhenshan 97 (ZS97) and Minghui 63 (MH63). Significant genomic loci (threshold –log10P>5, blue line) for HRY were identified on chromosomes 6, 8, and 11, whilst loci with lower significance were identified on chromosome 3. The loci on chromosomes 6, 8, and 11 (only using the Nipponbare reference) were further confirmed by multi-locus GWAS and are indicated by red arrows. The horizontal red lines represent the genome-wide significance threshold of –log10P = 7.6. (b) Linkage disequilibrium (LD) plot of the 17 tagged SNPs on chromosome 8. Scaled log10(P) values are shown and the red bars indicate a relative negative effect. (c) Haplotypes constructed from LD-block 1 are depicted in boxplots with their phenotypic values for HRY. (d) Haplotypes constructed from all significant non-synonymous SNPs (P≤0.01) present in the candidate LOC_Os08g039780 showed variations for HRY values. (e) Manhattan plot for moisture stress stability showed a significantly associated genomic region on chromosome 8 (5.7Mb away from the HRY peak) using the Nipponbare reference genome, which was confirmed further in the reference genomes of ZS97 and MH63. (f) LD-plot consisting of two LD-blocks showing 11 tagged SNPs in the 30 Kb region; block-2 showed a high-effect SNP from the upstream region of the gene LOC_Os08g31340, constructing haplotypes possessing significant phenotypic variation for stability, depicted as a box plot (g). (h) Circos diagram representing the structural variation and collinearity in chromosome 8 (physical size shown in Mb), between japonica cultivar Nipponbare and indica cultivar ZS97. Significantly associated genomic regions for HRY identified in our study (highlighted as red blocks) and in previous studies (pink block) along with the stability-associated genomic region (grey blocks) are indicated; green stripes indicate conserved regions, while white indicates collinearity breaks in the region.

Single-locus GWAS for head rice yield (HRY) in control conditions and for moisture stress stability. (a) Manhattan plots for HRY under control conditions using the reference genomes of japonica Nipponbare, and indica Zhenshan 97 (ZS97) and Minghui 63 (MH63). Significant genomic loci (threshold –log10P>5, blue line) for HRY were identified on chromosomes 6, 8, and 11, whilst loci with lower significance were identified on chromosome 3. The loci on chromosomes 6, 8, and 11 (only using the Nipponbare reference) were further confirmed by multi-locus GWAS and are indicated by red arrows. The horizontal red lines represent the genome-wide significance threshold of –log10P = 7.6. (b) Linkage disequilibrium (LD) plot of the 17 tagged SNPs on chromosome 8. Scaled log10(P) values are shown and the red bars indicate a relative negative effect. (c) Haplotypes constructed from LD-block 1 are depicted in boxplots with their phenotypic values for HRY. (d) Haplotypes constructed from all significant non-synonymous SNPs (P≤0.01) present in the candidate LOC_Os08g039780 showed variations for HRY values. (e) Manhattan plot for moisture stress stability showed a significantly associated genomic region on chromosome 8 (5.7Mb away from the HRY peak) using the Nipponbare reference genome, which was confirmed further in the reference genomes of ZS97 and MH63. (f) LD-plot consisting of two LD-blocks showing 11 tagged SNPs in the 30 Kb region; block-2 showed a high-effect SNP from the upstream region of the gene LOC_Os08g31340, constructing haplotypes possessing significant phenotypic variation for stability, depicted as a box plot (g). (h) Circos diagram representing the structural variation and collinearity in chromosome 8 (physical size shown in Mb), between japonica cultivar Nipponbare and indica cultivar ZS97. Significantly associated genomic regions for HRY identified in our study (highlighted as red blocks) and in previous studies (pink block) along with the stability-associated genomic region (grey blocks) are indicated; green stripes indicate conserved regions, while white indicates collinearity breaks in the region. Chromosome 6 (27.57–27.62 Mb), including the candidate gene LOC_Os06g45580 that encodes the ubiquitin E3 RING protein with protein degradation activity, possessed a high number of significant SNPs that showed association with HRY (Supplementary Fig. S4A). The targeted association of LOC_Os06g45580 identified several significant indels in the promoter region as well in the 3´-UTR region that explained the higher HRY (50–60 %). The fine-mapped genomic region at 25.05–25.20 Mb on chromosome 8 (HR8.1) significantly associated with HRY and WHR, which spanned 21 candidate genes encompassing 17 tagged SNPs (Table 1) at the interval locus between LOC_Os08g39590 and LOC_Os08g39800 (Fig. 3b), which was further narrowed down by the gene-set analysis to 15 candidate genes (Table 1). The major phenotype variation for HRY was represented by haplotypes from the block 1, consisting of 14 SNPs (Fig. 3b, c) By mining this genetic region, we identified highly significant non-synonymous SNPs (ACCCGT) in LOC_Os08g39780 that exhibited a high mean value of HRY of ~53.7 (Fig. 3d). An additional fine-mapped 9.6-kb region (25.78–25.79 Mb) matching the flanking region of LOC_Os11g42800 was identified on chromosome 11 (HR11.1) and showed significant association with HRY and WHR (Supplementary Fig. S4b). This region was comprised of a single LD-block with a four SNP-based haplotype TTGA that contributed to high mean values of HRY of ~50.71, and was found to be a rare haplotype present in indica cultivars. In addition to HRY, the genomic regions that contributed to other milling quality traits under controlled drying conditions were related to PMR and WMR, which were mapped on chromosome 8 (6.10–6.18 Mb) through single-locus GWAS with a relatively high level of significance (–log10P>7.0) (Supplementary Fig. S3).
Table 1.

Gene-set analysis using GWAS of candidates identified within the hotspot genomic regions on chromosomes 3, 6, 8, and 11 associated with head rice yield.

Gene ID†No. of SNPsSNP positionsPercent head riceWeight head riceAnnotation
StartEnd Z-Score P-valueCorrected P-value Z-Score P-valueCorrected P-value
Chromosome 3
LOC_Os03g539903230954587309592012.790.00260.07842.820.00240.0726Development unspecified
Chromosome 6
LOC_Os06g455501627572045275734524.102.09E−50.00064.102.0E−50.0006Unclassified
LOC_Os06g455609127583949275902453.600.00020.00473.600.00020.0046Unclassified
LOC_Os06g45570227595301275955293.350.00040.01163.350.00040.0117Unclassified
LOC_Os06g45580427598795275992393.570.00020.00513.580.00020.0049Protein degradation ubiquitin E3 RING
LOC_Os06g45590527601517276037403.840.00010.00183.860.00010.0017Plastid branch 3-phosphoglycerate kinase
LOC_Os06g45600427608612276091813.870.00010.00163.890.00010.0015Unclassified
LOC_Os06g45610427614080276147653.650.00010.00393.660.00010.0036Unclassified
Chromosome 8
LOC_Os08g39590625060904250628982.080.01890.54842.060.01990.5777Signalling receptor kinases
LOC_Os08g396001425064684250666611.820.03420.99291.810.03541.0000RNA processing
LOC_Os08g39610525068294250684372.400.00810.23562.380.00860.2501Unclassified
LOC_Os08g396205725070901250764231.680.04661.00001.670.04771.0000Transcription factor
LOC_Os08g396405325102192251055992.330.00990.28672.320.01020.2953Cytochrome P450
LOC_Os08g396504825107131251093443.460.00030.00783.450.00030.0081Unclassified
LOC_Os08g396601625110579251113072.560.00520.15132.550.00530.1542Cytochrome P450
LOC_Os08g396949425121356251341821.800.03571.00001.790.03641.0000Cytochrome P450
LOC_Os08g397102225138857251421572.940.00160.04772.930.00170.0499Signalling receptor kinases
LOC_Os08g397302025155263251568442.010.02200.63692.030.02130.6170Cytochrome P450
LOC_Os08g397601525174628251799273.510.00020.00653.470.00030.0075Signalling receptor kinases
LOC_Os08g397703125185266251861952.600.00470.13622.570.00510.1478Unclassified
LOC_Os08g397803625187198251885632.350.00940.27342.330.00990.2882Unclassified
LOC_Os08g39790925189370251895531.920.02730.79111.900.02890.8373Unclassified
LOC_Os08g398001025193197251937991.720.04281.00001.680.04631.0000Unclassified
Chromosome 11
LOC_Os11g428006825775818257784491.910.02780.80591.920.02720.7874Cell organization

†Genes showing P≤0.05 were included.

‡ P-values subjected to Bonferroni correction.

Gene-set analysis using GWAS of candidates identified within the hotspot genomic regions on chromosomes 3, 6, 8, and 11 associated with head rice yield. †Genes showing P≤0.05 were included. ‡ P-values subjected to Bonferroni correction. Single-locus GWAS and gene-set analysis were performed to identify genomic regions regulating the stability (HRY ratio of moisture stress and control) along with covariate of HRY values obtained from controlled drying conditions. The HRY stable regions were mapped to a 30.4-kb significant genomic region on chromosome 8 (5.7Mb from the HRY control peak) (Fig. 3e, f, Table 2, Supplementary Table S2). The significant peak was further confirmed by multi-locus GWAS using the two indica reference genomes (Fig. 3e). The hotspot genomic region represented by the second LD-block was found to be negatively regulating the stability trait (Fig. 3f). Using TGAS together with gene-set analysis revealed highly significant SNPs located in the upstream region of LOC_Os08g31340 coding for a protein containing heavy metal-associated domain (Fig. 3g, Table 2). The haplotype TCC constructed from these SNPs contributed to higher stability of HRY (~1), meaning no difference between moisture stress versus control. The alternative haplotype CTC was responsible in decreasing HRY under moisture stress relative to the control. Furthermore, the haplotypes identified in the key candidates for influencing HRY on chromosomes 6, 8, and 11 showed consistent behavior across dry and wet seasons (Supplementary Fig. S5).
Table 2.

Gene-set analysis using GWAS of identified candidates within the hotspot genomic region on chromosomes 8 that regulate stability under moisture stress conditions

Gene IDSNPs startSNPs endNo. of SNPs Z-Score P-valueCorrected P-valueAnnotations
LOC_Os08g313401938295219387137102.750.00300.0150Metal handling binding chelation and storage
LOC_Os08g313201937161219373491173.807.34E−50.0004Unknown
LOC_Os08g31310193693471937028393.190.00070.0036Unknown
LOC_Os08g31300193646421936494692.220.01320.0661Unknown

†Genes showing P≤0.05 were included.

‡ P-values subjected to Bonferroni correction.

Gene-set analysis using GWAS of identified candidates within the hotspot genomic region on chromosomes 8 that regulate stability under moisture stress conditions †Genes showing P≤0.05 were included. ‡ P-values subjected to Bonferroni correction. Under stress conditions, GWAS identified a key region on chromosome 1 (37.02–37.73 Mb) that was highly associated with PBR (Supplementary Fig. S3). Another association peak was identified on an 11.14-kb fine-mapped region of chromosome 5 associated with WBR and WMR, located in the upstream region of the GW5/qSW5 gene (LOC_Os05g09520, encoding a calmodulin-binding motif family protein) that is known to regulate grain width and weight in rice (Weng ; Supplementary Fig. S3). We assessed the level of structural variation that existed in the target regions on chromosomes 3, 6, 8, and 11 between japonica (Nipponbare) and indica (ZS97) reference genomes, and identified collinearity and break-points in the respective genomes (Fig. 3h, Supplementary Fig. S6). The chromosome 8 hotspot region was located in the conserved regions. Notably, we detected several break-points between ZS97 and Nipponbare, especially within the target hotspot regions for HRY traits mapped on chromosomes 3, 6, and 11 (Supplementary Fig. S6). This suggested the existence of structural variations in the genetic regions that regulate HRY, which reinforced the importance of using subspecies-based reference genome(s) for genotype calling to conduct targeted associations.

Fine-mapping of the hotspot region on chromosome 5 associated with PGC using GWAS and TGAS

We conducted GWAS using SNPs acquired based on the alignment of whole-genome sequencing information for 320 diverse indica lines with a japonica reference genome, and this identified a highly significant (–log10P=15) fine-mapped genomic region on chromosome 5 mapped between 5.12–5.43 Mb, which further confirmed with the multi-locus GWAS peaks (Fig. 4a–c, Supplementary Fig. S7, red arrows). Within the fine-mapped region, four LD-blocks were defined with 18 tagged SNPs that showed significant association with PGC (Supplementary Table S3). Out of these, the LD-block- 1 region positively influenced PGC phenotype, defined by the CGG haplotype that was derived from three SNPs. The LD- block- 2 and LD- block- 3 regions showed negative association with PGC, with significant beta values, and the LD- block- 4 region showed a weak correlation with PGC. The LD- block- 2 region, which was represented by the haplotype ACGATC (n=16), showed higher beta values that explained the lower median PGC value of 6.6 (Fig. 4b, Supplementary Table S4). Using TGAS, the genomic region identified in LD- block- 2 that associated significantly with PGC was narrowed down to three candidate genes. These were LOC_Os05g09520 and LOC_Os05g09530, and a putative novel candidate gene that encodes for a ~122-amino acid protein with unknown function, which we named as chalk5.1 (Fig. 4d–f). Gene-set analysis identified a highly significant Z-score for chalk5.1, which was in contrast to the non-significant value (P>0.05) for LOC_Os05g09520 (Table 3).TGAS of chalk5.1 gene identified the haplotype GCTCGCTGGGGGCCG (n=10) that covered significant tagged SNPs in the regulatory, exonic, and intronic regions (Fig. 4d). This region (chalk5.1) explained the significant phenotypic variance (PV) of 21%. The lines possessing this haplotype had a low-chalkiness phenotype, with a median PGC of 7% (Fig. 4d). These lines were also found to have short grain length due to the presence of the GW5 gene (LOC_Os05g09520) in the same LD (Fig. 4d). When we mapped the chalkiness hotspot region that showed colinearity on chromosome 5, we observed several break-points between high chalkiness (ZS97) and low chalkiness (Nipponbare) reference genomes within the target hotspot region for PGC on chromosome 5 (Fig. 4c). This indicated the role of existing structural variations underlying the hotspot region in regulating grain chalkiness. Notably, the previously cloned V-PPase candidate gene of chalkiness (chalk5; Li ) was observed at some distance from the target region we identified in the present study (marked red in Fig. 4c).
Fig. 4.

Single-locus GWAS for grain chalkiness and prominent loci identified on chromosome 5 in a diversity panel of 320 indica rice accessions using japonica reference genomes. (a) Manhattan plots of the GWAS on percentage grain chalkiness (PGC) shows its association with a chromosome 5 hotspot region and chromosome 12 genetic region (red arrow), which was further confirmed by a multi-locus GWAS. The horizontal red and blue lines represent the genome-wide significant threshold –log10(P) values of 7.6 and 5, respectively. (b) Linkage disequilibrium (LD) plot of the 18 tagged SNPs significantly associated with PGC. A scaled and highly dense LD-based plot within the hotspot genomic region on chromosome 5 is represented. Scaled log10(P) values are shown, and black bars indicate a relative positive effect whilst red bars indicate a relative negative effect. Genes also detected by TGAS are highlighted in red. (c) Circos diagram showing the collinearity in chromosome 5 between Nipponbare and ZS97, particualry within the hotspot region (marked with blue blocks) where the collinearity break is evident, whereas the previously reported gene chalk5 that regulates chalkiness was observed more distantly (small red blocks). (d–f) TGAS of the candidates with their gene structures and haplotypes. (d) The newly detected hypothetical gene (named as chalk5.1); (e) LOC_Os05g09520; and (f) LOC_Os05g09530. SNPs identified in genic region (marked with blue triangles) with their –log10(P) values shown above the gene model. The boxplots show constructed haplotypes and related phenotype distribution for PGC; five selected lines from each of two haplotypes that show extreme phenotype are represented in each case.

Table 3.

Gene-set analysis using GWAS of candidates identified within the hotspot genomic regions on chromosomes 5 associated with percent grain chalkiness.

Gene IDSNP startSNP endNo. of SNPs Z-Score P-valueCorrectedAnnotations
Zhenshan 97 reference
ZS05g009370041322694132855102.490.00630.2852Unclassified
ZS05g009380041370484139938242.390.00850.3821Reverse transcriptase
ZS05g00947004211463422296733.280.00050.0236Phosphoinositide 3-kinase
Newly predicted gene (Chalk 5.1)4599233460054284.875.68E−72.56E−5Unclassified
ZS05g01101004624872462789712.680.00360.1639Aspartic peptidase
ZS05g01102004629326463033613.840.00010.0028Unclassified
ZS05g01103004632404463628212.920.00170.0785Der1-like protein
ZS05g011070046572544678577402.160.01550.6953Armadillo-type fold
ZS05g01109004682839468504622.450.00720.3244Ubiquinone biosynthesis protein
ZS05g011100046881294693152113.050.00120.0521Acid phosphatase
ZS05g01111004698217469904642.920.00170.0781Acid phosphatase
ZS05g01112004705604470774452.120.01720.7729Acid phosphatase
ZS05g01123004793937479581482.650.00400.1814Acid phosphatase
ZS05g01125004847621484894321.710.04391.0000FAR1-DNA binding domain
ZS05g01129004878450487931112.170.01500.6739Ulp1-protease-like protein
ZS05g01151005039853504029211.820.03441.0000Unclassified
ZS05g01152005057235505855223.010.00130.0593SANT/Myb domain
Minghui 63 reference
MH05g00893005011370501233514.671.53E−66.14E−6Unclassified
MH05g00910005069302507009321.910.02800.1120DUF protein
Nipponbare reference
LOC_Os05g092005140748514328513.973.57E−50.0008Unclassified
LOC_Os05g092205152077515482422.530.00570.1262Unclassified
LOC_Os05g092305157489516084742.190.01420.3129Unclassified
Newly predicted gene (Chalk 5.1)5361454536276186.417.23E−111.59E−9Unclassified
LOC_Os05g095305388454539045843.807.38E−50.0016Protein degradation aspartate protease
LOC_Os05g095405391521539219554.376.10E−60.0001Unclassified
LOC_Os05g095505394336539814183.550.00020.0042Protein degradation
LOC_Os05g0959054171505417753153.720.00010.0022Unclassified
LOC_Os05g096005419267542074414.151.67E−50.0004Unclassified
LOC_Os05g0962054243065433090263.510.00020.0049Unclassified

†Genes showing the significance of P≤0.05 were included.

‡ P-values are corrected as per Bonferroni corrections.

Gene-set analysis using GWAS of candidates identified within the hotspot genomic regions on chromosomes 5 associated with percent grain chalkiness. †Genes showing the significance of P≤0.05 were included. ‡ P-values are corrected as per Bonferroni corrections. Single-locus GWAS for grain chalkiness and prominent loci identified on chromosome 5 in a diversity panel of 320 indica rice accessions using japonica reference genomes. (a) Manhattan plots of the GWAS on percentage grain chalkiness (PGC) shows its association with a chromosome 5 hotspot region and chromosome 12 genetic region (red arrow), which was further confirmed by a multi-locus GWAS. The horizontal red and blue lines represent the genome-wide significant threshold –log10(P) values of 7.6 and 5, respectively. (b) Linkage disequilibrium (LD) plot of the 18 tagged SNPs significantly associated with PGC. A scaled and highly dense LD-based plot within the hotspot genomic region on chromosome 5 is represented. Scaled log10(P) values are shown, and black bars indicate a relative positive effect whilst red bars indicate a relative negative effect. Genes also detected by TGAS are highlighted in red. (c) Circos diagram showing the collinearity in chromosome 5 between Nipponbare and ZS97, particualry within the hotspot region (marked with blue blocks) where the collinearity break is evident, whereas the previously reported gene chalk5 that regulates chalkiness was observed more distantly (small red blocks). (d–f) TGAS of the candidates with their gene structures and haplotypes. (d) The newly detected hypothetical gene (named as chalk5.1); (e) LOC_Os05g09520; and (f) LOC_Os05g09530. SNPs identified in genic region (marked with blue triangles) with their –log10(P) values shown above the gene model. The boxplots show constructed haplotypes and related phenotype distribution for PGC; five selected lines from each of two haplotypes that show extreme phenotype are represented in each case. Using genotyping data derived from the ZS97 reference genome (Zhang ), GWAS also identified significant association on chromosome 5 with a region of ~0.9 Mb that harboured four major LD-blocks (Supplementary Fig. S7, S8, Supplementary Table S5). The SNPs with strong effects in conferring the low-chalkiness phenotype (Supplementary Fig. S8a, b) found within the candidate chalk5.1 gene (unknown function) were present in a separate LD-block (Supplementary Fig. S8a) while GW5/qSW5 (LOC_Os05g09520) that regulates grain width is in another LD-block (Shomura ; Weng ). The reference allele of the topmost SNP (snp_05_5361276) detected in the 5´-UTR of chalk5.1 matched with the binding site for tri-helix protein (Supplementary Fig S9). Similarly, other strongly associated SNPs identified in the upstream region of chalk5.1 were mapped within the binding site of trans-regulatory elements including AP2, C2H2, MYB, TIFY-transcription factor, and TBP (TATA-binding protein) (Supplementary Fig. S9). Using MH63 reference genome, a highly significant region associated with PGC was observed at a narrow interval of ~290 kb (Fig. 5a, Supplementary Table S6). The region was divided into two LD-blocks of 181 kb and 33 kb, separated by a region of 76 kb. The LD-block 1 containing three SNPs (CGC) correlated positively with PGC, whereas the SNPs located in LD-block 2 (matching the GW5 region) exerted a negative effect on chalkiness with a lower beta value. This might be attributable to the presence of few SNPs in the target region (Supplementary Fig. S10). TGAS analysis resulted in the identification of three candidate genes (MH05g0089300, MH05g0091600, MH05g0092200), which were confirmed through gene-set analysis (Fig. 5, Table 3). The significant SNP (snp_05_5107094, G/A) identified in LD-block 1 matched the candidate gene MH05g0091600 (Supplementary Table S6). These results suggested that possible genomic variations (insertion/deletion) exist in the identified targeted region.
Fig. 5.

Single-locus GWAS for percentage grain chalkiness (PGC) identified a highly associated chromosome 5 hotspot region using Minghui63 (indica) as a reference genome. (a) Linkage disequilibrium (LD) plot of the six tagged SNPs significantly associated with PGC. A scaled and highly dense LD-based plot within the hotspot genomic region is represented and the position of newly predicted gene is highlighted. Scaled log10(P) values are shown for the six SNPs, and black bars indicate a relative positive effect whilst red bars indicate a relative negative effect. (b) Haplotypes constructed within two LD blocks and their respective phenotypic values for PGC are represented as boxplots. (c–e) TGAS of selected genes: (c) MH05g0089300, (d) MH05g0091600, (e) MH05g0092200. These genes are highlighted in red in (a) based on their high effects with respective to PGC values. The boxplots show constructed haplotypes and related phenotype distribution for PGC.

Single-locus GWAS for percentage grain chalkiness (PGC) identified a highly associated chromosome 5 hotspot region using Minghui63 (indica) as a reference genome. (a) Linkage disequilibrium (LD) plot of the six tagged SNPs significantly associated with PGC. A scaled and highly dense LD-based plot within the hotspot genomic region is represented and the position of newly predicted gene is highlighted. Scaled log10(P) values are shown for the six SNPs, and black bars indicate a relative positive effect whilst red bars indicate a relative negative effect. (b) Haplotypes constructed within two LD blocks and their respective phenotypic values for PGC are represented as boxplots. (c–e) TGAS of selected genes: (c) MH05g0089300, (d) MH05g0091600, (e) MH05g0092200. These genes are highlighted in red in (a) based on their high effects with respective to PGC values. The boxplots show constructed haplotypes and related phenotype distribution for PGC. We additionally evaluated the different indica germplasm panels for chalkiness in field trials conducted across multiple seasons and years and consistently identified the same peak region as being highly associated with grain chalkiness (data not shown). Moreover, this region was found to be unique as it did not match any of the cloned chalk genes/mutants that have previously been reported (Supplementary Table S7).

HRY and chalkiness genetic regions for improving milling and appearance qualities

Out of 106 breeding lines, we resequenced 92 lines at a depth of ~30× coverage to identify the distribution of beneficial alleles for HRY and chalkiness. These lines were developed at IRRI and released between 1966–2015. To validate the significance of the allelic diversity that existed in the hotspot region of chromosome 8 (25.05–25.20 Mb) that resulted in high HRY in the selected indica panel, we examined existing variations within the IRRI breeding lines for these beneficial haplotypes. For HRY four main cluster groups were identified (Fig. 6a, b). Interestingly, accessions with the high HRY haplotype (group 1) identified in the rare indica accessions with a median HRY of 53.7% were clustered separately from the rest of the lines. The adjacent cluster group 2 encompassed the IRRI mega-varieties IR64, IRRI109, IRRI135, IRRI156, and IRRI174 that possessed alternative allelic combinations that resulted in a median HRY value of 45.37%. IR64 is a highly adapted mega-variety that harbours superior alleles for HRY, and it was released in the 1980s and accounts for the highest acreage in South and Southeast Asia (Mackill and Khush, 2018). Group 3 clustered around a newly developed IRRI breeding line with a poor median HRY value of 37.84%, and group 4 was represented by a range of breeding lines with a variable range of HRY values (Fig. 6a).
Fig. 6.

Haplotype distribution for head rice yield (HRY) and chalkiness in IRRI breeding lines together with selected diversity lines. An unrooted dendrogram (a) and a boxplot (b) depict the allelic variations for key haplotypes of the chromosome 8 hotspot region that governs HRY traits. The distribution indicated four different groups. Group-1 (red) represents high HRY cultivars (median 53.7%) as identified in the core collection with the presence of high-HRY haplotypes. Group-2 (yellow/green) had the second highest HRY values (median 45.4%) and included the broadly adapted mega-variety IR64 and other advanced breeding lines. Group-3 (green) mainly included recently released breeding lines with relatively lower and variable HRY values (median 37.84). Group 4 (black) included a wide range of HRY values (median 40.41). (c) Unrooted dendrogram and (d) boxplot based on allelic variations within the novel putative candidate gene chalk5.1, where group 5 (blue) represents the lowest chalkiness value (median 2.8) with lowest variance in three IRRI breeding lines and four accessions, while group 1 (red) showed a low chalk value (median 3.2) and represented only IRRI breeding lines. (e) Unrooted dendrogram and (f) boxplot showing allelic variation in the previously characterized chalk5 gene.

Haplotype distribution for head rice yield (HRY) and chalkiness in IRRI breeding lines together with selected diversity lines. An unrooted dendrogram (a) and a boxplot (b) depict the allelic variations for key haplotypes of the chromosome 8 hotspot region that governs HRY traits. The distribution indicated four different groups. Group-1 (red) represents high HRY cultivars (median 53.7%) as identified in the core collection with the presence of high-HRY haplotypes. Group-2 (yellow/green) had the second highest HRY values (median 45.4%) and included the broadly adapted mega-variety IR64 and other advanced breeding lines. Group-3 (green) mainly included recently released breeding lines with relatively lower and variable HRY values (median 37.84). Group 4 (black) included a wide range of HRY values (median 40.41). (c) Unrooted dendrogram and (d) boxplot based on allelic variations within the novel putative candidate gene chalk5.1, where group 5 (blue) represents the lowest chalkiness value (median 2.8) with lowest variance in three IRRI breeding lines and four accessions, while group 1 (red) showed a low chalk value (median 3.2) and represented only IRRI breeding lines. (e) Unrooted dendrogram and (f) boxplot showing allelic variation in the previously characterized chalk5 gene. A dendrogram constructed by considering the entire hotspot region (using 18 tagged SNPs) of chromosome 5 (5.12–5.43 Mb) that influenced PGC classified the breeding lines into four groups (Supplementary Fig. S11). A phylogentic grouping of IRRI breeding lines based on 15 significant SNPs identified in the new candidate gene chalk5.1 also identified five groups (Fig. 6c, d). Notably, allelic variation in group 1 potentially demonstrated a narrower range of the low-chalkiness phenotype (median 3.2%) and represented many modern IRRI breeding lines, followed by group 5 (median 2.8%) that represented diversity lines originating from low-chalkiness material (Fig. 6c, d). Beneficial alleles present in groups 1 and 5 in the IRRI-breeding lines conferred to reduced chalkiness. Many breeding lines represented in groups 3 and 4 showed a relatively broad range of chalkiness (Fig. 6c, d). Group 2 included the high-chalkiness IR5, IR8, and IR40 lines that were released during an early phase of IRRI’s breeding program, and they possess the haplotype that explains the high median PGC value of 13.9%. This high-chalkiness haplotype was also found in few breeding lines that were released more recently such as IRRI111, IRRI113, and IRRI147. The PV explained by chalk5.1 for the chalkiness phenotype in the breeding material was 11.42%, in contrast to 16.72% for chalk5. This suggests a relative lower representation of the chalk5.1 allele than chalk5 across the breeding lines (Fig. 6e, f). Intriguingly, the combined PV of both genic regions was assessed as 26.12%, which suggests the synergistic contribution of both candidate genes.

Discussion

High head rice yield (HRY) and low chalkiness are key traits that determine the milling and appearance qualities of rice, and hence they can significantly affect economic returns to farmer and millers. Current trends of limited genetic gains in yield per se combined with lack of progress in other important economic traits related to HRY have made it difficult to replace the widely adopted rice mega-varieties (Laborte ; Sreenivasulu ). Several trends assessed from market surveys and a lack of genetic diversity for HRY in high-yielding breeding lines collectively suggest that diverse genetic resources need to be exploited in order to improve milling quality. Rice industries around the world have observed that climate change has contributed to increasing temperatures during the dry season, which is linked to higher incidence of breakage of rice grains during milling (Custodio ; Zohoun ). In some countries premium slender (and fragrant) rice is typically grown in the dry season due to higher yields and lower production risks associated with diseases, and in accordance with this our market survey data indicated a higher importance of HRY in the dry season in countries such as the Philippines, Bangladesh, and India (Fig. 1a). In Eastern India, parboiling has been widely used to maximize head rice recovery (Custodio ). However, rising fuel costs for cooking have decreased this practice by shifting preferences from parboiled to non-parboiled rice. In addition, increasing demand for rice with slender grains in Asian (Custodio ) and African regions (Demont ) will introduce trade-offs between breeding for slenderness and head rice recovery, as these are generally considered as antagonistic traits (i.e. slender rice breaks more easily during milling), although some key mega-varieties such as IR64 typically feature both slender grains and high HRY (IRRI, 1986; Mackill and Khush, 2018) due to superior HRY alleles. In regular breeding programs, HRY has not been subjected to intensive selection (Zhao and Fitzgerald, 2013; Zhou ), due to the availability of only a limited amount of seed material during initial filial generations and due to lack of a robust high-throughput phenotyping method. Conversely, using chalkiness as a proxy trait for improving milling quality has resulted in improving the degree of chalkiness in IRRI breeding programs (mean of 4%) (Fig. 1), but has resulted in only limited progress for improving HRY, the mean for which remains at 45% among the most recently released breeding varieties. Using chalkiness as a proxy measure for HRY has been a common assumption that has pervaded rice breeding programs for decades, clearly it is not necessarily effective. This emphasises the need for identifying the genetics of HRY. Since HRY is known to be a low-heritable trait that is influenced primarily by moisture stress in the grain and thus by post-harvest practices (Siebenmorgen , 2013; Nelson ), we used controlled drying conditions to improve HRY (Cnossen ). HRY data obtained from using controlled drying conditions for 320 resequenced diverse indica lines combined with ultra-dense SNP genotyping allowed us to identify fine-mapped genetic regions from chromosome 6 and 8 with relatively high heritability (Fig. 3, Supplementary Fig. S4). In contrast, using traditional bi-parental populations has resulted in limited success in identifying QTLs for HRY in japonica rice due to low heritability and restricted recombination events (Nelson ; Pinson ; Ren ; Li ). In our present study, the ultra-high SNP density was up to 3-fold higher than that of recently published reports of GWAS-based studies in rice (Crowell ; McCouch ; Wang ). This enabled high-resolution mining of trait-specific allelic variants for higher HRY on chromosome 8 (25.05–25.20 Mb) under controlled drying (Fig. 3). An additional genomic region on chromosome 8 for stability was identified more distant from the controlled HRY region, which was not detected in a previous study (Qiu ) and might have been due to limited marker numbers and smaller population size. Fine-mapping these genomic regions is more appropriate in indica, where rapid LD decays occur compared to tropical and temperate japonica (Huang , 2013). Resequencing of popular IRRI varieties and studying target regions of HRY allowed us to inspect the allelic variations in the hotspot region for HRY on chromosome 8. Our study established that beneficial haplotypes responsible for high HRY that were present in only a few of the ancestral accessions were typically clustered together in group 1 (Fig. 6). It is highly likely that these underlying beneficial alleles have not been selected during breeding selection cycles. Many breeding lines in group 3 and 4 were characterized by inferior alleles resulting in poor HRY. As a result, no significant genetic improvements were made for HRY in the post-Green Revolution breeding programs. IR64 was found in group 2 with moderate HRY, together with the other modern varieties IRRI109, IRRI135, IRRI156, and IRRI174. IR64 is a popular mega-variety that is widely planted in India, Indonesia, Pakistan, the Philippines, and Vietnam, possibly due to its reasonable HRY coupled with a softer texture, which results in a wide preference by farmers and millers (Khush and Virk, 2005; Mackill and Khush, 2018). Farmers generally sell their harvest to millers, who prefer rice grains without chalkiness as it is perceived to lessen the predisposition to grain breakage during dehulling and polishing (Del Rosario ; Lisle ; Champagne, 2004; Wang ; Cooper ). Although previous studies have indicated out the inverse relationship between PGC and HRY (Buggenhout ; Zhao and Fitzgerald, 2013), in our present study using most of the IRRI rice varieties and a diversity rice panel we found lower correlations (Fig. 2b, c). Grain chalkiness with high heritability is judged as a crucial determinant for grading the export quality of rice (Wan ; Zhao ). Using GWAS, we detected a novel and highly significant association signal from a 0.31-Mb hotspot region of chromosome 5 that was associated with PGC (Fig. 4a, b) and observed it consistently across multiple seasons and replications in field trials in indica lines. This ruled out a major influence of environmental factors on the hotspot region. The region was located at a significant distance (1.78 Mb) from the previously cloned gene chalk5 V-PPase that encodes for inorganic pyrophosphate (PPi) H+-translocation activity (Li ). Moreover, the hotspot region that regulates chalkiness detected in our present study represent the novel chalk 5.1 gene, was found distinct from previously reported regions (Peng ; Zhao , 2016; Chen ; Gao ; Yun ; Zhu ). TGAS in combination with GWAS allowed us to narrow down the hotspot region to a locus encompassing three candidate genes (Fig. 4e), which included a known gene for grain width, GW5. In this region, deletion of 1212 bp from the ~5-kb upstream of the GW5 (LOC_Os05g9520) has been reported to contribute to higher grain width in japonica cultivars (Shomura ; Liu ). In addition, GW5 was also been considered as a putative candidate for reducing PGC (Qiu ; Yun ). Furthermore, by using the reference genome of the high-chalkiness indica line ZS97 (Zhang ) for calling SNPs, our GWAS results identified a new putative candidate gene chalk5.1 for regulating PGC situated ~3 kb upstream of GW5 (Fig. 4d). The lowest PGC value, confirmed by the haplotype of the chalk5.1 gene (Fig. 4d), suggested its importance in lowering chalkiness. We found lower PV values in breeding lines for chalk5.1 (11.4%) compared to chalk5 V-PPase (16.7%), which can be attributed to the lower frequency of representation of chalk5.1 alleles in the breeding material. Together, the two chalkiness genes identified on chromosome 5 explain 26.12% of the PV, indicating their likely potential benefits in future breeding initiatives to reduce chalkiness. In summary, the fine-mapped haplotypes of chromosome 5 identified for reducing chalk have been successfully explored in the present IRRI breeding programs. In contrast, the novel haplotypes identified for enhanced HRY from chromosome 8 in the diversity panel were not found in the 92 IRRI breeding lines. To ensure future economic benefits for farmers, superior alleles contributing for high HRY from chromosome 8 need to be recombined with the chalk 5.1 gene in the rice breeding programs to improve milling and appearance quality attributes for long-slender indica varieties.

Supplementary data

Supplementary data are available at JXB online. Table S1. List of accessions used in the study together with the haplotypes for HRY that were identified in the hotspot regions of chromosomes 8 and 11. Table S2. List of haplotypes identified for HRY stability within the candidate region on chromosome 8 in the accessions used in this study. Table S3. Single-locus GWAS for SNPs detected against the Nipponbare reference genome with corresponding orthologous IDs in ZS97 and MH63. Table S4. List of haplotypes identified for lowering chalkiness in hotspot region on chromosome 5 in the accessions used in this study. Table S5. Single-locus GWAS for SNPs detected against the ZS97 reference genome with corresponding orthologous IDs in MH63 and Nipponbare. Table S6. Single-locus GWAS for SNPs detected against the MH63 reference genome with corresponding orthologous IDs in ZS97 and Nipponbare. Table S7. Genes/QTLs previously characterized in mutants as influencing grain chalkiness. Fig. S1. Phenotypic variation for milling quality traits and chalkiness in 320 resequenced indica lines. Fig. S2. Principal component analysis for the 320 resequenced indica lines. Fig. S3. Manhattan plots generated by GWAS for milling quality traits within the indica germplasm. Fig. S4. Single-locus GWAS for HRY showing the association of the genomic regions of chromosomes 6 and 11. Fig. S5. Variation in the phenotypic values of HRY and chalkiness observed for different haplotypes across dry and wet seasons. Fig. S6. Comprehensive representation of structural variation and collinearity between the japonica cultivar Nipponbare and the indica cultivar ZS97. Fig. S7. Single-locus GWAS for grain chalkiness showing the main candidate loci on chromosomes 5 using the indica reference genomes ZS97 and MH63. Fig. S8. Single-locus GWAS for PGC showing the significant association in the chromosome 5 hotspot region using ZS97 as the reference genome. Fig. S9. Nucleotide sequence alignment in the main chromosome 5 hotspot region within cultivars with extreme chalkiness phenotypes. Fig. S10. SNPs in the genomic sequence of the chromosome 5 hotspot region that regulate PGC in the three reference genomes, Nipponbare, ZS97, and MH63. Fig. S11. Allelic variation in the chromosome 5 hotspot region and allele distribution in IRRI breeding lines and selected core-collection cultivars. Click here for additional data file. Click here for additional data file.

Author contributions

GM and RA conducted the GWAS, TGAS, and haplotype analyses; RA supervised the planting, harvesting, and drying of the rice diversity panel and IRRI varieties; MD conducted and interpreted the farmer survey; VB and LM optimized the methods for chalkiness and HRY assays, and SB analysed the data; NS, AG, and MKM conceptualized the work and NS supervised the conduct of all experiments and bioinformatics analyses; NS wrote the manuscript with contributions from VB and SB; all authors read and contributed to the revision of manuscript.
  51 in total

Review 1.  Designing climate-resilient rice with ideal grain quality suited for high-temperature stress.

Authors:  Nese Sreenivasulu; Vito M Butardo; Gopal Misra; Rosa Paula Cuevas; Roslen Anacleto; Polavarpu B Kavi Kishor
Journal:  J Exp Bot       Date:  2015-02-05       Impact factor: 6.992

2.  GW5 acts in the brassinosteroid signalling pathway to regulate grain width and weight in rice.

Authors:  Jiafan Liu; Jun Chen; Xiaoming Zheng; Fuqing Wu; Qibing Lin; Yueqin Heng; Peng Tian; ZhiJun Cheng; Xiaowen Yu; Kunneng Zhou; Xin Zhang; Xiuping Guo; Jiulin Wang; Haiyang Wang; Jianmin Wan
Journal:  Nat Plants       Date:  2017-04-10       Impact factor: 15.793

3.  Extensive sequence divergence between the reference genomes of two elite indica rice varieties Zhenshan 97 and Minghui 63.

Authors:  Jianwei Zhang; Ling-Ling Chen; Feng Xing; David A Kudrna; Wen Yao; Dario Copetti; Ting Mu; Weiming Li; Jia-Ming Song; Weibo Xie; Seunghee Lee; Jayson Talag; Lin Shao; Yue An; Chun-Liu Zhang; Yidan Ouyang; Shuai Sun; Wen-Biao Jiao; Fang Lv; Bogu Du; Meizhong Luo; Carlos Ernesto Maldonado; Jose Luis Goicoechea; Lizhong Xiong; Changyin Wu; Yongzhong Xing; Dao-Xiu Zhou; Sibin Yu; Yu Zhao; Gongwei Wang; Yeisoo Yu; Yijie Luo; Zhi-Wei Zhou; Beatriz Elena Padilla Hurtado; Ann Danowitz; Rod A Wing; Qifa Zhang
Journal:  Proc Natl Acad Sci U S A       Date:  2016-08-17       Impact factor: 11.205

4.  MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity.

Authors:  Yupeng Wang; Haibao Tang; Jeremy D Debarry; Xu Tan; Jingping Li; Xiyin Wang; Tae-ho Lee; Huizhe Jin; Barry Marler; Hui Guo; Jessica C Kissinger; Andrew H Paterson
Journal:  Nucleic Acids Res       Date:  2012-01-04       Impact factor: 16.971

5.  MAGMA: generalized gene-set analysis of GWAS data.

Authors:  Christiaan A de Leeuw; Joris M Mooij; Tom Heskes; Danielle Posthuma
Journal:  PLoS Comput Biol       Date:  2015-04-17       Impact factor: 4.475

6.  Genome-Wide Association Study of Grain Appearance and Milling Quality in a Worldwide Collection of Indica Rice Germplasm.

Authors:  Xianjin Qiu; Yunlong Pang; Zhihua Yuan; Danying Xing; Jianlong Xu; Michael Dingkuhn; Zhikang Li; Guoyou Ye
Journal:  PLoS One       Date:  2015-12-29       Impact factor: 3.240

Review 7.  IR64: a high-quality and high-yielding mega variety.

Authors:  David J Mackill; Gurdev S Khush
Journal:  Rice (N Y)       Date:  2018-04-09       Impact factor: 4.783

8.  QTL analysis for chalkiness of rice and fine mapping of a candidate gene for qACE9.

Authors:  Yang Gao; Chaolei Liu; Yuanyuan Li; Anpeng Zhang; Guojun Dong; Lihong Xie; Bin Zhang; Banpu Ruan; Kai Hong; Dawei Xue; Dali Zeng; Longbiao Guo; Qian Qian; Zhenyu Gao
Journal:  Rice (N Y)       Date:  2016-08-22       Impact factor: 4.783

9.  High-resolution QTL mapping for grain appearance traits and co-localization of chalkiness-associated differentially expressed candidate genes in rice.

Authors:  Likai Chen; Weiwei Gao; Siping Chen; Liping Wang; Jiyong Zou; Yongzhu Liu; Hui Wang; Zhiqiang Chen; Tao Guo
Journal:  Rice (N Y)       Date:  2016-09-22       Impact factor: 4.783

10.  Genome-wide association and high-resolution phenotyping link Oryza sativa panicle traits to numerous trait-specific QTL clusters.

Authors:  Samuel Crowell; Pavel Korniliev; Alexandre Falcão; Abdelbagi Ismail; Glenn Gregorio; Jason Mezey; Susan McCouch
Journal:  Nat Commun       Date:  2016-02-04       Impact factor: 14.919

View more
  7 in total

1.  Genome-Wide Association Mapping Reveals Novel Putative Gene Candidates Governing Reproductive Stage Heat Stress Tolerance in Rice.

Authors:  K T Ravikiran; S Gopala Krishnan; K P Abhijith; H Bollinedi; M Nagarajan; K K Vinod; P K Bhowmick; Madan Pal; R K Ellur; A K Singh
Journal:  Front Genet       Date:  2022-05-10       Impact factor: 4.772

2.  Superior haplotypes towards development of low glycemic index rice with preferred grain and cooking quality.

Authors:  Ramchander Selvaraj; Arun Kumar Singh; Vikas Kumar Singh; Ragavendran Abbai; Sonali Vijay Habde; Uma Maheshwar Singh; Arvind Kumar
Journal:  Sci Rep       Date:  2021-05-12       Impact factor: 4.379

3.  Fine mapping of two grain chalkiness QTLs sensitive to high temperature in rice.

Authors:  Weifeng Yang; Jiayan Liang; Qingwen Hao; Xin Luan; Quanya Tan; Shiwan Lin; Haitao Zhu; Guifu Liu; Zupei Liu; Suhong Bu; Shaokui Wang; Guiquan Zhang
Journal:  Rice (N Y)       Date:  2021-04-01       Impact factor: 4.783

4.  NH787 EMS mutant of rice variety Nagina22 exhibits higher phosphate use efficiency.

Authors:  Yugandhar Poli; Veronica Nallamothu; Ai Hao; Muddapuram Deeksha Goud; Xiaowen Wang; Subrahmanyam Desiraju; Satendra K Mangrauthia; Ajay Jain
Journal:  Sci Rep       Date:  2021-04-28       Impact factor: 4.379

5.  Fine Mapping of Two Major Quantitative Trait Loci for Rice Chalkiness With High Temperature-Enhanced Additive Effects.

Authors:  Weifeng Yang; Qingwen Hao; Jiayan Liang; Quanya Tan; Xin Luan; Shaojun Lin; Haitao Zhu; Suhong Bu; Zupei Liu; Guifu Liu; Shaokui Wang; Guiquan Zhang
Journal:  Front Plant Sci       Date:  2022-06-30       Impact factor: 6.627

6.  Low Light Stress Increases Chalkiness by Disturbing Starch Synthesis and Grain Filling of Rice.

Authors:  Qiuping Li; Fei Deng; Yuling Zeng; Bo Li; Chenyan He; Youyun Zhu; Xing Zhou; Zinuo Zhang; Li Wang; Youfeng Tao; Yu Zhang; Wei Zhou; Hong Cheng; Yong Chen; Xiaolong Lei; Wanjun Ren
Journal:  Int J Mol Sci       Date:  2022-08-15       Impact factor: 6.208

7.  Genome-wide association coupled gene to gene interaction studies unveil novel epistatic targets among major effect loci impacting rice grain chalkiness.

Authors:  Gopal Misra; Saurabh Badoni; Sabiha Parween; Rakesh Kumar Singh; Hei Leung; OluFunmilayo Ladejobi; Richard Mott; Nese Sreenivasulu
Journal:  Plant Biotechnol J       Date:  2020-12-09       Impact factor: 9.803

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.