Literature DB >> 36072319

De novo assembly of two chromosome-level rice genomes and bin-based QTL mapping reveal genetic diversity of grain weight trait in rice.

Weilong Kong1, Xiaoxiao Deng1, Zhenyang Liao2, Yibin Wang2, Mingao Zhou1, Zhaohai Wang3, Yangsheng Li1.   

Abstract

Following the "green revolution," indica and japonica hybrid breeding has been recognized as a new breakthrough in further improving rice yields. However, heterosis-related grain weight QTLs and the basis of yield advantage among subspecies has not been well elucidated. We herein de novo assembled the chromosome level genomes of an indica/xian rice (Luohui 9) and a japonica/geng rice (RPY geng) and found that gene number differences and structural variations between these two genomes contribute to the differences in agronomic traits and also provide two different favorable allele pools to produce better derived recombinant inbred lines (RILs). In addition, we generated a high-generation (> F15) population of 272 RILs from the cross between Luohui 9 and RPY geng and two testcross hybrid populations derived from the crosses of RILs and two cytoplasmic male sterile lines (YTA, indica and Z7A, japonica). Based on three derived populations, we totally identified eight 1,000-grain weight (KGW) QTLs and eight KGW heterosis loci. Of QTLs, qKGW-6.1 and qKGW-8.1 were accepted as novel KGW QTLs that have not been reported previously. Interestingly, allele genotyping results revealed that heading date related gene (Ghd8) in qKGW-8.1 and qLH-KGW-8.1, can affect grain weight in RILs and rice core accessions and may also play an important role in grain weight heterosis. Our results provided two high-quality genomes and novel gene editing targets for grain weight for future rice yield improvement project.
Copyright © 2022 Kong, Deng, Liao, Wang, Zhou, Wang and Li.

Entities:  

Keywords:  1,000-grain weight; genome sequencing; heterosis loci; rice; yield improvement

Year:  2022        PMID: 36072319      PMCID: PMC9443666          DOI: 10.3389/fpls.2022.995634

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   6.627


Introduction

Rice is one of the most important food crops in the world, providing food for more than half of the world’s population (Qin et al., 2021). Over the past two decades, multiple high-quality genomes of indica and japonica subspecies have been assembled, such as Nipponbare (Goff et al., 2002), 9311 (Yu et al., 2002), ZS97 (Zhang et al., 2016a,b), MH63 (Zhang et al., 2016a,b), R498 (Du et al., 2017), IR64 (Tanaka et al., 2020), TN1 (Panibe et al., 2021), Huazhan (Zhang H. et al., 2022), Tianfeng (Zhang H. et al., 2022), etc. Recently, several gap-free reference genomes were completed, namely, ZS97, MH63, PR106, LIMA, LARHAMUGAD, KETANNANGKA, NATELBORO, XL628S, LK638S, J4155S, and HZ (Song et al., 2021; Zhang F. et al., 2022; Zhang Y. et al., 2022). Advances in third-generation sequencing and assembly algorithms have continuously updated the accuracy of the rice pan-genome, revealing some important functionally related structural variations (SVs) and gene copy number variations (gCNVs) (Zhao et al., 2018; Qin et al., 2021; Zhang F. et al., 2022). However, there are dramatically different genetic backgrounds among thousands of rice cultivars, especially between subspecies, including cultivar-specific genes, different alleles of one gene, or gene family expansions (Li et al., 2021). Differences in rice agronomic traits are closely related to these genome variations (Stein et al., 2018; Zhao et al., 2018; Qin et al., 2021). Therefore, the discovery of new genes/alleles related to agronomic traits is inseparable from the comparative analysis of the genomes of elite varieties and the fine mapping in derived populations. For instance, we previously found that the hybrid progeny of the Luohui 9 (xian/indica) and RPY geng (geng/japonica) cross had significant heterosis in yield and resistance traits, and multiple recombinant inbred lines (RILs) derived from Luohui 9 X RPY geng aggregated the advantages of both parents (Kong et al., 2022a). Based on the high-density genetic map, we also obtained some QTLs related to plant height, salt stress tolerance, submerged germination, and grain shape (Kong et al., 2021b, 2022a,b; Deng et al., 2022). But the gene number differences and large structural variations between Luohui 9 and RPY geng, and the effects of these variations/differences on traits and heterosis, remain unclear. Owing to the impacts of human population growth and limited arable land, breeders and scientists faced the challenge of breeding higher yield potential crops. Rice yield is a complex agronomic trait composed of four main factors including effective panicle number, grain number per panicle, seed setting rate and 1,000-grain weight (KGW) (Zuo and Li, 2014). In addition, heterosis refers to the phenomenon that the phenotype of the hybrid progeny surpasses their parents in biomass, yield, growth vigor, resistance, etc., (Birchler et al., 2010). Yield heterosis between indica and japonica subspecies has been widely used to improve rice yield, causing a worldwide yield revolution (Li et al., 2018). As statistics, hybrid rice shows a 20–30% increase in yield than inbred rice and has effectively solved world food crisis (Xu et al., 2016). Therefore, analyzing the mechanism of rice grain weight (GW) and mining GW-related QTLs and GW-related heterosis loci are important foundations for improving rice yield. Based on different populations and QTL mapping methods, more than 600 QTLs related to grain weight and grain shape have been identified on all 12 chromosomes in rice to date[1] (Chan et al., 2021), and more than 20 QTLs have been cloned, including GW2 (Yan S. et al., 2011), GS3 (Liu et al., 2018), TGW6 (Ishimaru et al., 2013), GW6a (Song et al., 2015), WTG1 (Huang et al., 2017), GL7 (Wang et al., 2015), and gw5 (Wan et al., 2005). In fact, multiple reported QTLs may belong to one QTL, but there are differences in the size of the interval. Meta-analysis of QTLs was used to merge multiple QTLs from different rice genetic populations and to identify consensus and stable QTLs (Arcade et al., 2004; Kong et al., 2020), which narrowed down the confidence intervals of QTLs (Martinez et al., 2016; Zhang et al., 2017). Recently, 339 published GW QTLs merged into 34 Meta-QTLs (MQTLs) in rice (Khahani et al., 2020). The new GW QTLs/genes must be urgently explored in indica X japonica derived populations to further improve rice yields. In this study, we performed the chromosome-level de novo assembly of the Luohui 9 and RPY geng genomes and characterized their genomic differences in genome-wide scale. Additionally, the KGW traits of derived RIL populations from Luohui 9 X RPY geng in four environments were used for QTL mapping. Two testcross populations derived from the crosses of RILs and Z7A (japonica) or YTA (indica) were used to explore the heterosis loci of KGW. These results provided a new insight into the diversity mechanism of grain weight in rice.

Materials and methods

Materials and sequencing

The highly homozygous O. sativa ssp. indica/xian (Luohui 9, 2n = 2 × = 24) and O. sativa ssp. japonica/geng (RPY geng, 2n = 2 × = 24) were planted in the field in Wuhan, China, in 2016, These two subspecies have many significant differences in important agronomic traits, including plant height, number of tillers, and heading date, for example, Luohui 9 has excellent agronomic traits, and RPY geng has an ideal plant architecture (Figure 1A).
FIGURE 1

Whole plant phenotype (A), genomic characteristics (B), and large structural variations (C) between RPY geng and Luohui 9. The tracks from outside to inside are the chromosome, gene density, repeat sequence density, and GC content and the different color links represent orthologous gene pairs among chromosomes in (B).

Whole plant phenotype (A), genomic characteristics (B), and large structural variations (C) between RPY geng and Luohui 9. The tracks from outside to inside are the chromosome, gene density, repeat sequence density, and GC content and the different color links represent orthologous gene pairs among chromosomes in (B). The genomic DNA of these two subspecies was extracted from young leaves using a modified CTAB method and tested using Qubit Quantitation Starter Kit (Invitrogen, United States) and a 1% agarose gel electrophoresis, respectively. Libraries for Illumina short-read and single-molecule real-time (SMRT) sequencing (Pacific Biosciences, United States) were prepared according to the respective manufacturer’s instructions. The short-read DNA libraries were sequenced by paired-end (2 × 150 bp) method on Illumina HiSeqTM 2,500 (Illumina, United States) and the SMRT sequencing were sequenced on the PacBio RS II platform. In addition, Hi-C reads of DNA of young leaves from F1 progeny of Luohui 9 and RPY geng were sequenced on Illumina HiSeqTM 2,500 paired-end (2 × 150 bp) sequencing according to standard protocol. The total RNA of mixed tissues (root, stem, leaf, and young panicle) was extracted for RNA-seq libraries following the manufacturer’s standard protocol. Then, RNA-seq libraries were sequenced on an Illumina HiSeqTM 2,500 paired-end (2 × 150 bp) and raw reads were filtered using Trimmomatic software as described previously (Kong et al., 2020).

Genome assembly and annotation

PacBio RSII sub-reads were filtered by the PacBio SMRT-Analysis package including removing the adapters, low quality, and short length reads (parameters: readScore, 0.75; minSubReadLength, 500) and sub-reads after filtering were corrected by Illumina reads using an error correction module embedded in Canu v1.5 (Koren et al., 2017). The high-quality PacBio sub-reads were used for genome contigs assembly by using Canu v1.5 with default parameters. The contig-level genome was polished by Plion with these parameters: –mindepth 10; –changes; –threads 4; –fix bases (Walker et al., 2014). Hi-C data were used to assist in constructing chromosome-level genome assemblies. The Hi-C data were mapped to the contig-level genomes using BWA aligner software (Li and Durbin, 2009). A total of 46.73 Gb clean Hi-C data was mapped to the genome, with a coverage of 116.83 X. These uniquely mapped Hi-C reads were retained for chromosome-level genome assemblies using LACHESIS software with these parameters: CLUSTER MIN RE SITES = 22; CLUSTER MAX LINK DENSITY = 2; CLUSTER NON-INFORMATIVE RATIO = 2; ORDER MIN N RES IN TRUNK = 10; ORDER MIN N RES IN SHREDS = 10 (Burton et al., 2013). Finally, the chromosome-level genome was further improved and corrected by the high-density binmaps genetic map constructed in this study using all maps software according to the method described (Tang H. B. et al., 2015). Repeat sequence annotation was performed by EDTA with default parameters (Ou et al., 2019). Coding genes were predicted by de novo, homolog-based, and transcriptome-based strategies. Augustus v2.4 (Stanke and Waack, 2003), Genscan (Burge and Karlin, 1997), GeneID v1.4 (Alioto et al., 2018), GlimmerHMM v3.0.4 (Majoros et al., 2004), and SNAP (version 2006-07-28) (Korf, 2004) were used for de novo prediction. GeMoMa v1.3.1 (Keilwagen et al., 2016) was used for homolog-based prediction. In the transcriptome-based prediction, we used Hisat v2.0.4 and Stringtie v1.2.3 for sequence assembly based on a reference genome (Pertea et al., 2016), and applied TransDecoder v2.0[2] and GeneMarkS-T v5.1 (Tang S. Y. Y. et al., 2015) for gene prediction; On the other hand, PASA v2.0.2 software (Campbell et al., 2006) was used to perform unigene sequence prediction without reference assembly based on transcriptome sequencing data. Finally, we used the EVM v1.1.1 software (Haas et al., 2008) to integrate all gene prediction results from these three analysis methods. The predicted coding genes were annotated according to alignments against (E value 1e–5) databases including GO, KEGG, KOG, TrEMBL, and Nr databases using BLAST v2.2.31 (Altschul, 2012).

Orthologous clusters analysis and structural variant identification

We extracted the longest-protein sequences from Luohui 9 and RPY geng genomes for orthologous clusters identification using OrthoVenn2, E-value was set 1e–10, and other parameters with default (Xu et al., 2019). Genomic structural variants between Luohui 9 and RPY geng genomes were identified by SyRI (Goel et al., 2019). Luohui 9 genome was used as the reference genome, “nucmer – mum” for sequence alignment, with parameter, -c 100 -l 50 -g 1,000. Then, “delta-filter” was used to filter the comparison results, with parameter, -1 -q -r -i 89 -l 50. Finally, “show-coords,” “syri -c,” and “syri plotsr” steps were done with default parameters.

QTL mapping of KGW

A 272 RILs and their parents were planted in the Ezhou (30°N, 114°E) Experimental Base of Wuhan University, Wuhan City, Hubei Province in April 2017 (2017EZ) and in April 2018 (2018EZ), the Hybrid Rice Experimental Base of Wuhan University in Lingshui City (18°N, 110°E), Hainan Province in November 2019 (2019LS), and Breeding Experimental Base of Wuhan University Tianyuan Co., Ltd in Hannan District (30°N, 114°E), Wuhan City, Hubei Province in May 2019 (2019HN). Two testcross populations (Z7A-TCF1 and YTA-TCF1) were developed by crossing RILs (F14) with Z7A (japonica) and YTA (indica) in 2019LS. KGWs of Z7A-TCF1, YTA-TCF1, and RILs were investigated in 2019HN. All plants were planted under standard agricultural management practice (Kong et al., 2022a). KGW was surveyed in the above four environments. Each inbred line counted five individual plants, the average KGW value of five individual plants was considered as the KGW value of each inbred line. The genetic linkage map of 272 RILs including 4,578 bin blocks with the total bin-map distance 2,356.41 cM was previously constructed in our lab (Kong et al., 2022a). The QTL mapping of KGW was analyzed by R/qtl (Arends et al., 2010), the CIM interval mapping method was adopted and the LOD threshold was set by 3.0. The confidence interval was calculated with the function “lodint” (Dupuis and Siegmund, 1999) and the drop value was set to 1.5.

QTL mapping of KGW heterosis loci

Heterosis related indexes for the KGW trait were calculated by the formulas: where MPH is middle-parent heterosis, BPH is better-parent heterosis, LPH is lower-parent heterosis, P1 is the high parent, and P2 is the low parent. The QTL mapping of KGW heterosis related indexes was analyzed by R/qtl (Arends et al., 2010), the CIM interval mapping method was adopted and the LOD threshold was set by 2.5. The confidence interval was calculated with the function “lodint” (Dupuis and Siegmund, 1999) and the drop value was set to 1.5.

Results and discussion

De novo assembly and annotation of RPY geng and Luohui 9 genomes

A total of 25.1 / 36.5 Gb Illumina short reads with 62 / 91 X coverage of the genome and 15.6 / 19.5 Gb PacBio RSII long reads with 39 / 48 X coverage of the genome of RPY geng / Luohui 9 was obtained (Supplementary Table 1). The long reads were polished by the Illumina paired read and the polished long reads were assembled into contigs by Canu V1.5. After three rounds of contig polish by Pilon v1.22 and Hi-C data correction, we obtained a 383.45 Mb RPY geng genome and a 394.43 Luohui 9 genome, with the contig N50 of 2.81 and 2.84 Mb, respectively (Table 1). Luohui 9 was ∼10.98 Mb larger than the genome of RPY geng. Finally, 96.35 and 96.99 % of contigs were anchored onto 12 pseudo-chromosomes of RPY geng and Luohui 9 based on Hi-C interactions and linkage map from RPY geng x Luohui 9 derived population (Kong et al., 2022a), respectively (Table 1 and Supplementary Figure 1).
TABLE 1

Summary of genome assembly and annotation of RPY geng and Luohui 9.

RPY gengLuohui 9
Genome assembly
Assembly size (Mb)383.45394.43
Number of contigs684569
N50 size of contigs (Mb)2.812.84
Anchored contigs (Mb)369.49382.54
Anchored contigs (%)96.3596.99
Complete assessment of 456 core genes in CEGMA v2.5 (%)99.56100
Complete assessment of 238 core genes in CEGMA v2.5 (%)95.9797.18
Complete assessment of 1,614 core genes in BUSCO v3.0.2 (%)98.999.1
The long terminal repeat (LTR)-assembly index (LAI)19.4419.37
Genome annotation
Percentage of repeat sequences (%)44.2946.96
Number of predicted genes39,25539,440
Summary of genome assembly and annotation of RPY geng and Luohui 9. The percentages of repeat sequences in the genomes of RPY geng and Luohui 9 were 44.29 and 46.96% based on EDTA with default parameters (Supplementary Table 2). A combination of prediction strategies (de novo, homologous based and RNA-seq based) totally identified 39,255 and 39,440 gene models among RPY geng and Luohui 9 genomes, of which 96.81 and 94.75% had at least one annotation result in GO, KEGG, KOG, TrEMBL, or Nr database (Supplementary Tables 3, 4). The results of CEGMA and BUSCO showed that the assembly of RPY geng and Luohui 9 was complete, with more than 95.0% of the core genes. The long terminal repeat (LTR)-assembly index (LAI) of RPY geng and Luohui 9 was 19.44 and 19.37, which is close to the gold genome level (LAI ≥ 20) (Table 1). All the above-mentioned genome indices indicated that the newly assembled genomes of RPY geng and Luohui 9 was of high quality.

Global genome differences between RPY geng and Luohui 9

RPY geng and Luohui 9 showed obvious differences in yield, grain shape (Deng et al., 2022), plant height (Kong et al., 2022a), and abiotic stress resistances (Kong et al., 2020, 2021a,b,2022b), and the hybrid progeny of RPY geng X Luohui 9 had the excellent heterosis (Figure 1A). These essential agronomic differences are inseparable from the number and structural variation of genes between the two subspecies genomes (Zhao et al., 2018; Qin et al., 2021). Benefiting from the completion of the genomes of RPY geng and Luohui 9, we compared their gene numbers and large structural variations at the genome-wide level and highlighted some genes that have potential impact on agronomic traits. A total of 32,720 orthologous clusters including 32,509 orthologous gene pairs were identified (Supplementary Figure 2 and Figure 1B). Luohui 9 unique orthologous clusters were enriched with multiple essential life GO terms, while RPY geng unique orthologous clusters were enriched with multiple stress-related GO terms (Supplementary Table 5), namely, defense response, cellular response to amino acid stimulus, positive regulation of hydrogen peroxide, as well as response to osmotic stress, suggesting that RPY geng has more tolerance-related genes to abiotic stress than Luohui 9. These results are consistent with our previous findings that RPY geng has stronger resistance to salt stress and cold stress and carries important stress tolerance genes (Kong et al., 2020, 2021a,b,2022b). We further found 190 inversions, 6,852 translocations, 1,279 (Luohui 9)/1,212 (RPY geng) duplications between RPY geng and Luohui 9 involving 2,234 SV-related genes in Luohui 9 and 1,544 SV-related genes in RPY geng (Supplementary Table 6). Notably, at the position of 12.8–18.6 Mb on Luohui 9 chromosome 6 showed a sequence inversion with a length of about 5.7 Mb compared with the RPY geng genome (Figure 1C) and this inversion has also been reported in previous comparative genomic studies between subspecies (Du et al., 2017; Li et al., 2021; Xie et al., 2021), suggesting that this may be an important structural difference between subspecies. To study the potential roles of these SV-related genes in important agronomic traits, we collected 283 important known genes with functional function verifications (Supplementary Table 7) as query sequences to find homologous genes against SV-related genes by BlastP (value E-10). Totally, 337 SV-related genes were identified as homologous genes of 138 known functional genes belonging to cold tolerance, heat tolerance, salt tolerance, insect resistance, disease resistance, drought tolerance, fertility, grain quality, grain shape, heading date, panicle architecture, nutrient utilization, and panicle architecture (Supplementary Table 8). The above results suggested that the genome-wide number and structural differences play essential roles in the trait differences of RPY geng and Luohui 9, which were consistent with their differential trait characteristics.

QTLs of KGW

RPY geng and Luohui 9 belonged to japonica/geng and indica/xian subspecies, respectively, yield traits of their F1 and many RILs showed obvious over-parent dominance. To resolve yield-related genes, we here conducted trait surveys and linkage analysis of KGW based on the previously constructed high-density genetic map (Kong et al., 2022a). KGW of RILs were investigated in Lingshui, Hannan, or Ezhou among 2017 – 2019. There was extensive variance of KGW in RIL population while there was a minor difference of KGW between the parents (Figure 2 and Supplementary Table 9). KGW of the RIL population showed a normal distribution with high Pearson coefficients in four different environments and transgressive segregations were observed in the RIL population (Figure 2), which indicated that KGW were controlled by multiple genes and indica x japonica hybrid breeding strategy can breed high-yielding rice materials.
FIGURE 2

Thousand-grain weight in 2019HN, 2019LS, 2018EZ and 2017EZ.

Thousand-grain weight in 2019HN, 2019LS, 2018EZ and 2017EZ. We totally identified eight KGW QTLs on Chr 2, Chr 3, Chr5, Chr6, Chr8, and Chr10 (Table 2). Of QTLs, qKGW-8.1 was repeatedly detected in 2017EZ, 2018EZ, and 2019LS. qKGW-3.1 and qKGW-3.2 had almost the same interval and qKGW-5.2 was fully contained by qKGW-5.1. These results suggested that these three QTLs had relatively stable effects on KGW in multiple different environments. The remaining QTLs (qKGW-2.1, qKGW-6.1, and qKGW-10.1) were only detected in one specific ecological environment, and were possibly environment-specific KGW QTLs.
TABLE 2

Details of thousand-grain weight QTLs.

QTLEnvironmentChrQTL positionLODSize of QTL (Mb)Phenotypic variation (%)
qKGW-2.1 2019LSChr 222532020–299790733.777.456.18
qKGW-3.1 2019HNChr 38166100–107641267.742.6012.28
qKGW-3.2 2018EZ; 2019LSChr 38166100–110541156.49; 8.452.8910.41; 13.33
qKGW-5.1 2019LSChr 52493875–50850483.12.595.11
qKGW-5.2 2018EZChr 53542240–44158594.330.877.07
qKGW-6.1 2019_HNChr 62215678–25779244.830.367.85
qKGW-8.1 2017EZ; 2018EZ; 2019LSChr 84158052–48339703.42; 6.2; 4.410.685.63; 9.96; 6.20
qKGW-10.1 2017EZChr 1013311698–194727993.786.166.2
Details of thousand-grain weight QTLs.

Identification and function confirmation of two novel KGW QTLs

To distinguish the new TWG QTLs first discovered in this study, 34 Meta-QTLs from 339 original GW QTLs (Supplementary Table 10) and 126 known GW genes (Supplementary Table 11) were collected (Khahani et al., 2020). Except for qKGW-6.1 and qKGW-8.1, the remaining QTLs all had complete or partial overlap with Meta-QTLs, or contained the known KGW genes in these intervals (Figure 3). For example, qKGW-3.1 and qKGW-3.2 fully covered MQTL-GW18 and included pls2 and SRL2. Sequence alignment result showed that SRL2 had sequence differences in the coding sequence regions (Supplementary Table 12). Similarly, qKGW-2.1 containing OsVPE3, OsMADS6, and OsGRF4, overlapped MQTL-GW7 and MQTL-GW8; qKGW-5.1 and qKGW-5.2 carrying OsPPKL2, SRS3, and GS5 overlapped MQTL-GW18; qKGW-10.1 including FLO7 overlapped MQTL-GW29.
FIGURE 3

The positions of QTLs, MetaQTLs, and known KGW-related genes.

The positions of QTLs, MetaQTLs, and known KGW-related genes. Therefore, qKGW-6.1 and qKGW-8.1 were accepted as novel KGW QTLs. To further confirm the KGW regulation function of qKGW-6.1 and qKGW-8.1, all RILs were divided into different allelic combination based on peak maker genotyping results in genetic map. RPY geng allele (AA) RILs of qKGW-6.1 showed greater KGW than Luohui 9 (BB) RILs (Figure 4A). Interestingly, qKGW-8.1 showed the opposite result relative to qKGW-6.1 (Figure 4B). This suggested that the favorable alleles of these two QTLs are derived from RPY geng and Luohui 9, respectively, and the favorable allele aggregation may enhance KGW of some RILs. As expected, RILs that aggregated favorable alleles (qKGW-6.1 AA + qKGW-8.1 BB) had the largest KGW in all tested environments (Figures 4C–F). These results demonstrated that qKGW-6.1 and qKGW-8.1 are two new KGW loci that can be used to improve rice yield.
FIGURE 4

1000-grain weight of different allele combinations of qKGW-6.1 and qKGW-8.1. In (A), (C–F): qKGW-6.1 AA + qKGW-8.1 AA, (B): qKGW-6.1 AA + qKGW-8.1 BB; (C): qKGW-6.1 BB + qKGW-8.1 AA; (D): qKGW-6.1 BB + qKGW-8.1 AA.

1000-grain weight of different allele combinations of qKGW-6.1 and qKGW-8.1. In (A), (C–F): qKGW-6.1 AA + qKGW-8.1 AA, (B): qKGW-6.1 AA + qKGW-8.1 BB; (C): qKGW-6.1 BB + qKGW-8.1 AA; (D): qKGW-6.1 BB + qKGW-8.1 AA.

Ghd8 has a potential function in regulating rice grain weight

qKGW-8.1 could be detected repeatedly in three environments with phenotypic interpretation rates of 5.11–9.96. However, no known genes directly related to grain weight were found in qKGW-8.1. We therefore traversed the 92 gene annotation results in qKGW-8.1 and tried to correspond them to the phenotypic differences between the parents. We found that Ghd8 is located within the qKGW-8.1 interval, a gene reported to be closely associated with heading date and yield (Yan W. H. et al., 2011; Dai et al., 2012), which is consistent with parental heading date differences. Whether in Hainan or Hubei, both parents maintained the heading date difference of more than 10 days. We extracted the Ghd8 protein sequences from our newly assembled genome and sequence alignment revealed multiple sequence variations, including seven amino acid substitutions, one amino acid deletion, and a complex C-terminal amino acid variation (Figure 5A).
FIGURE 5

The candidate gene prediction of qKGW-8.1. (A). Protein sequence alignment results of Nipponbare, RPYgeng (FaGhd8), Luohui 9 (MoGhd8). (B) The 1,000-grain weight of RPY geng (AA) and Luohui 9 (BB) allele recombinant inbred lines (RILs). (C) The 1,000-grain weight of different allele rice core accessions.

The candidate gene prediction of qKGW-8.1. (A). Protein sequence alignment results of Nipponbare, RPYgeng (FaGhd8), Luohui 9 (MoGhd8). (B) The 1,000-grain weight of RPY geng (AA) and Luohui 9 (BB) allele recombinant inbred lines (RILs). (C) The 1,000-grain weight of different allele rice core accessions. To test whether Ghd8 has a potential effect on grain weight as a pleiotropic gene, we observed grain weight at different alleles in our RILs and in 532 rice core accessions from RiceVarMap v2.0[3] (Zhao et al., 2021). In RILs, the allele types of Ghd8 was determined based on a bin maker (Block174811) because Ghd8 is the only gene within block174811. In 532 rice core accessions, a functional snp (vg0804334484) was found in Ghd8 gene, containing A, C, and N alleles, and the N allele was eliminated in further phenotypic comparisons due to uncertainty about its base type. We found that different allele RILs or core accessions displayed significantly different KGW, suggesting that Ghd8 may play a role in KGW regulation (Figures 5B,C).

KGW heterosis loci

We totally identified two QTLs for KGW BPH, two QTLs for KGW MPH, and four QTLs for KGW LPH (Figure 6 and Supplementary Table 13). Three of the eight heterosis-related QTLs overlapped KGW QTLs: qLH-KGW-5.1 and qLH-KGW-5.2 were covered by qKGW-5.1 and qLH-KGW-8.1 overlapped with qKGW-8.1. Interestingly, qLH-KGW-8.1 coincided with a reported yield heterosis locus, RH8 (rice heterosis 8) (Li et al., 2016). Ghd8, as a major gene in RH8, was also located in the qLH-KGW-8.1 interval. This suggested that Ghd8 plays an important role in rice yield heterosis. In addition, two GW-related genes were in heterosis-related QTLs, namely, SRS3 and GS5 in qLH-KGW-5.1, GS5 in qLH-KGW-5.2. Whether these GW-related genes play a role in heterosis remains to be further explored.
FIGURE 6

Grain weight heterosis loci in two testcross populations. Black represents QTL loci in recombinant inbred lines (RILs). Red and blue loci represent heterosis-related QTLs of YTA and Z7A testcross population, respectively.

Grain weight heterosis loci in two testcross populations. Black represents QTL loci in recombinant inbred lines (RILs). Red and blue loci represent heterosis-related QTLs of YTA and Z7A testcross population, respectively.

Conclusion

In the present study, we de novo assembled genomes of an indica rice (Luohui 9) and a japonica rice (RPY geng) at the chromosome level and analyzed the KGW trait of their derived RIL populations. We concluded that the substantial genetic diversity of KGW in RILs were closely related to genome variations and allele aggregation difference of KGW QTLs. Importantly, we identified two novel KGW-related QTLs (qKGW-6.1 and qKGW-8.1) and several KGW heterosis loci in three derived population. Based on the genotyping results in RILs and 532 rice core accessions, Ghd8 in qKGW-8.1 was presumed to play an important role in GW regulation.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: ngdc, PRJCA010706.

Author contributions

YL and WK conceived and designed the experiments. WK performed genome assembly, analyzed the data, prepared the figures and tables, and wrote the manuscript. XD conducted a field survey of agronomic traits, QTL mapping, and allele genotyping in RILs and rice core accessions. ZW participated in the construction of the recombinant inbred lines and revision of the manuscript. YW provided help in genome annotation. ZL performed repetitive sequence annotation of RPY geng and Luohui 9 genomes. MZ collected Meta-QTLs and known genes performed parts of figures and tables. All authors read and approved the final version of the manuscript.
  61 in total

1.  Using geneid to Identify Genes.

Authors:  Tyler Alioto; Enrique Blanco; Genís Parra; Roderic Guigó
Journal:  Curr Protoc Bioinformatics       Date:  2018-10-17

Review 2.  Molecular genetic dissection of quantitative trait loci regulating rice grain size.

Authors:  Jianru Zuo; Jiayang Li
Journal:  Annu Rev Genet       Date:  2014-08-18       Impact factor: 16.830

3.  Integrated analysis of phenome, genome, and transcriptome of hybrid rice uncovered multiple heterosis-related loci for yield increase.

Authors:  Dayong Li; Zhiyuan Huang; Shuhui Song; Yeyun Xin; Donghai Mao; Qiming Lv; Ming Zhou; Dongmei Tian; Mingfeng Tang; Qi Wu; Xue Liu; Tingting Chen; Xianwei Song; Xiqin Fu; Bingran Zhao; Chengzhi Liang; Aihong Li; Guozhen Liu; Shigui Li; Songnian Hu; Xiaofeng Cao; Jun Yu; Longping Yuan; Caiyan Chen; Lihuang Zhu
Journal:  Proc Natl Acad Sci U S A       Date:  2016-09-23       Impact factor: 11.205

4.  A major QTL, Ghd8, plays pleiotropic roles in regulating grain productivity, plant height, and heading date in rice.

Authors:  Wen-Hao Yan; Peng Wang; Hua-Xia Chen; Hong-Ju Zhou; Qiu-Ping Li; Chong-Rong Wang; Ze-Hong Ding; Yu-Shan Zhang; Si-Bin Yu; Yong-Zhong Xing; Qi-Fa Zhang
Journal:  Mol Plant       Date:  2010-12-10       Impact factor: 13.164

5.  Bin mapping-based QTL analyses using three genetic populations derived from indica-japonica crosses uncover multiple grain shape heterosis-related loci in rice.

Authors:  Xiaoxiao Deng; Weilong Kong; Tong Sun; Chenhao Zhang; Hua Zhong; Gangqing Zhao; Xuhui Liu; Yalin Qiang; Yangsheng Li
Journal:  Plant Genome       Date:  2021-11-22       Impact factor: 4.089

6.  Long-read sequencing of 111 rice genomes reveals significantly larger pan-genomes.

Authors:  Fan Zhang; Hongzhang Xue; Xiaorui Dong; Min Li; Xiaoming Zheng; Zhikang Li; Jianlong Xu; Wensheng Wang; Chaochun Wei
Journal:  Genome Res       Date:  2022-04-08       Impact factor: 9.438

7.  Gene prediction with a hidden Markov model and a new intron submodel.

Authors:  Mario Stanke; Stephan Waack
Journal:  Bioinformatics       Date:  2003-10       Impact factor: 6.937

8.  Metabolomic prediction of yield in hybrid rice.

Authors:  Shizhong Xu; Yang Xu; Liang Gong; Qifa Zhang
Journal:  Plant J       Date:  2016-08-29       Impact factor: 6.417

9.  Genome wide screening and comparative genome analysis for Meta-QTLs, ortho-MQTLs and candidate genes controlling yield and yield-related traits in rice.

Authors:  Bahman Khahani; Elahe Tavakol; Vahid Shariati; Fabio Fornara
Journal:  BMC Genomics       Date:  2020-04-10       Impact factor: 3.969

10.  De novo Genome Assembly of the indica Rice Variety IR64 Using Linked-Read Sequencing and Nanopore Sequencing.

Authors:  Tsuyoshi Tanaka; Ryo Nishijima; Shota Teramoto; Yuka Kitomi; Takeshi Hayashi; Yusaku Uga; Taiji Kawakatsu
Journal:  G3 (Bethesda)       Date:  2020-05-04       Impact factor: 3.154

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.