| Literature DB >> 35899250 |
Janet Higgins1, Bruno Santos2, Tran Dang Khanh3,4, Khuat Huu Trung3, Tran Duy Duong3, Nguyen Thi Phuong Doai3, Anthony Hall1, Sarah Dyer2, Le Huy Ham3, Mario Caccamo2, Jose De Vega1.
Abstract
Vietnam harnesses a rich diversity of rice landraces adapted to a range of conditions, which constitute a largely untapped source of diversity for the continuous improvement of cultivars. We previously identified a strong population structure in Vietnamese rice, which is captured in five Indica and four Japonica subpopulations, including an outlying Indica-5 group. Here, we leveraged that strong differentiation and 672 native rice genomes to identify genomic regions and genes putatively selected during the breeding of rice in Vietnam. We identified significant distorted patterns in allele frequency (XP-CLR) and population differentiation scores (F ST) resulting from differential selective pressures between native subpopulations, and later annotated them with QTLs previously identified by GWAS in the same panel. We particularly focussed on the outlying Indica-5 subpopulation because of its likely novelty and differential evolution, where we annotated 52 selected regions, which represented 8.1% of the rice genome. We annotated the 4576 genes in these regions and selected 65 candidate genes as promising breeding targets, several of which harboured alleles with nonsynonymous substitutions. Our results highlight genomic differences between traditional Vietnamese landraces, which are likely the product of adaption to multiple environmental conditions and regional culinary preferences in a very diverse country. We also verified the applicability of this genome scanning approach to identify potential regions harbouring novel loci and alleles to breed a new generation of sustainable and resilient rice.Entities:
Keywords: allele mining; genome scan; landraces; rice; selection
Year: 2022 PMID: 35899250 PMCID: PMC9309459 DOI: 10.1111/eva.13433
Source DB: PubMed Journal: Evol Appl ISSN: 1752-4571 Impact factor: 4.929
Number of accessions in each subpopulation by region of collection and basic description of each subpopulation
| Subtype | Indica | Japonica | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Subpop. | I1 | I2 | I3 | I4 | I5 | Im | J1 | J2 | J3 | J4 | Jm |
| Total | 145 | 91 | 37 | 62 | 43 | 48 | 115 | 50 | 17 | 21 | 8 |
|
| 0.0144 | 0.00127 | 0.0012 | 0.0012 | 0.001 | ‐ | 0.0006 | 0.0005 | 0.0007 | 0.0005 | ‐ |
| Region of collection (administrative regions of Vietnam) | |||||||||||
| Northeast | 5 | 1 | 7 | 1 | 2 | 5 | 22 | 13 | 0 | 1 | 1 |
| Northwest | 4 | 1 | 14 | 5 | 0 | 7 | 55 | 11 | 1 | 0 | 0 |
| Red River Delta | 6 | 1 | 0 | 32 | 12 | 5 | 0 | 6 | 0 | 8 | 0 |
| North Central Coast | 5 | 0 | 6 | 9 | 13 | 2 | 34 | 4 | 1 | 3 | 2 |
| South Central Coast | 3 | 1 | 8 | 2 | 4 | 13 | 0 | 1 | 12 | 0 | 0 |
| Central Highlands | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| Southeast | 1 | 3 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| Mekong Delta | 15 | 44 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Unknown | 105 | 40 | 1 | 13 | 12 | 12 | 4 | 14 | 2 | 9 | 3 |
| Dataset | |||||||||||
| New | 135 | 77 | 36 | 52 | 38 | 41 | 113 | 47 | 16 | 20 | 6 |
| 3KRGB | 10 | 14 | 1 | 10 | 5 | 7 | 2 | 3 | 1 | 1 | 2 |
New: Accession newly sequenced by us in Higgins et al. (2021). 3KRGP: Accessions sequenced in Zhou et al. (2020) by the 3000 Rice Genome Project. (π) Mean nucleotide diversity of each subpopulation. Regions sorted from North to South.
Descriptors from Higgins et al. (2021): Short‐growth: growth‐duration (less than 120 days from sowing to harvest). Long‐growth: long growth‐duration (over 140 days for sowing to harvest).
Subpopulation descriptions summary, based on Higgins et al. (2021)
| Subtype | Subpopulation | Agromorphology | 3K‐RGP overlap |
|---|---|---|---|
| Indica | I1 | Elite cultivars, Short season (<120 days), irrigated, lowland, longer grains, earlier heading date, higher culm strength, shorter leaf length, shorter culm length | XI‐1B1 (modern varieties), a few admixed (XI‐adm) |
| I2 | Landraces, Long season (<120 days), tall, rainfed, Mekong Delta | XI‐3B1 | |
| I3 | Landraces, Upland, deep roots | XI‐3B1, XI‐3B2 | |
| I4 | Landraces, Rainfed lowland, Red River Delta | XI‐3B2 | |
| I5 | Landraces, Northern and Red River Delta, lowland, thin roots, low genetic diversity, small non‐glutinous grains | XI‐adm | |
| Japonica | J1 | Tropical, Upland, North Vietnam, rainfed | GJ‐sbtrp |
| J2 | Temperate, Lowland, short grains, broad range, irrigated, lower grain/width length | GJ‐tmp | |
| J3 | Subtropical, Upland, large grains, South Central Coast | GJ‐sbtrp, GJ‐trp1, GJ‐adm | |
| J4 | Temperate, Lowland, short grains, Red River Delta, irrigated | GJ‐tmp |
Classification of accessions shared between Higgins et al. (2021) and the 3000 Rice Genome Project, which allowed to compare both population structures.
Whole‐genome XP‐CLR selection scores
| SCORE | J1 | J2 | J3 | J4 | |
|---|---|---|---|---|---|
| Selected | J1 | 17.8 | 7.6 | 6.1 | |
| J2 | 19.5 | 21.6 | 6.6 | ||
| J3 | 24.4 | 17.9 | 5.9 | ||
| J4 | 46.1 | 17.5 | 17.9 |
Note: Mean XP‐CLR score across the whole genome for each comparison between the four Japonica subpopulations and the five Indica subpopulations. Reciprocal comparisons shown in Table S1.
XP‐CLR scores and summary on the regions under selection in each subpopulation
| Mean XP‐CLR score | Cut‐off | Regions over 80 kbp | Mean length | Total length | % genome | Genes | |
|---|---|---|---|---|---|---|---|
| J1 | 10.5 | 136 | 28 | 576,707 | 16,147,785 | 4.3 | 2427 |
| J2 | 25.9 | 256 | 23 | 726,689 | 16,713,841 | 4.5 | 2439 |
| J3 | 16.1 | 228 | 24 | 577,089 | 13,850,139 | 3.7 | 2007 |
| J4 | 27.1 | 297 | 25 | 731,341 | 18,283,522 | 4.9 | 2643 |
| I1 | 7.6 | 161 | 44 | 453,570 | 19,957,065 | 5.3 | 3077 |
| I2 | 17.2 | 275 | 41 | 550,836 | 22,584,270 | 6.1 | 3346 |
| I3 | 27.9 | 401 | 42 | 474,009 | 19,908,387 | 5.3 | 2993 |
| I4 | 20.4 | 306 | 38 | 619,404 | 23,537,343 | 6.3 | 3465 |
| I5 | 41.4 | 440 | 52 | 583,706 | 30,352,734 | 8.1 | 4576 |
Note: Individual comparisons are shown in Table S2.
Cut‐off: 99 percentile.
Rice reference genome of 373,245,519 bp.
FIGURE 1XP‐CLR scores and regions under selection. (a) Selected regions for the five Indica subpopulations covering 5.4%, 6.1%, 5.3%, 6.3% and 8.1% of the genome for I1, I2, I3, I4 and I5 respectively. Centromeric regions are shown as 100 kb regions in dark grey. (b) Selected region for the four Japonica subpopulations covering 4.3%, 4.5%, 3.7% and 4.9% of the genome for J1, J2, J3 and J4 respectively. (c) PCA showing the relationship of the five Indica subpopulations, taken from Figure 2. Higgins et al. (2021). (d) PCA showing the relationship of the four Japonica subpopulations, taken from Figure 2. Higgins et al. (2021)
FIGURE 2Gene Ontology overrepresentation
Putative traits selected in each subpopulation based on the overlaps between QTLs and regions, which are further detailed in Tables S4 and S5
| TRAIT | INDICA | JAPONICA | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Trait ID | Description | I1 | I2 | I3 | I4 | I5 | J1 | J2 | J3 | J4 |
| GL | Grain length | 6,6 | 6 | 2 | 6 | 6 | 2 | 2,4 | 2 | 2,7 |
| GS | Grain size | 3 | ||||||||
| HD | Heading Date | 9 | 4 | |||||||
| FP | Floret Pubescence | 9 | 8 | |||||||
| PBintL | Primary branch internode length | 7 | 1 | 1 | 8 | |||||
| PBL | Primary branch length | 8 | 8 | 8 | ||||||
| PBN | Primary branch number | 8,10 | 8 | 8 | 1 | |||||
| SBintL | Secondary branch internode length | 12 | ||||||||
| SBN | Secondary branch number | 2 | 2 | 2 | 2 | |||||
| TIL | Number of tillers | 1,7 | 3 | 7,11 | 11 | 11 | ||||
| PL | Panicle length | 5,6 | ||||||||
| RL | Rachis length | 4 | 11 | 4,11 | 9 | 9 | ||||
| SHL | Shoot length | 1,12 | 1 | 1,8,11 | 8,11 | 8 | ||||
| SHW | Shoot weight | 1,12 | 12 | |||||||
| SpN | Spikelet number | 1 | 1 | 1 | 2 | 2 | 2 | 1,2 | ||
| TTW | Total weight | 1 | 1,9,12 | 3,9 | ||||||
| RCGR | Relative crop growth rate | 6 | ||||||||
| R‐S | Root to shoot ratio | 6 | ||||||||
| DEPTH | Deepest point reached by roots | 1 | 1 | 8 | 7 | 11 | 8,11 | |||
| DRP | Deep root proportion (<40 cm) | 6 | 1,1 | 1 | 1 | 1 | 1 | 1 | ||
| DRW | Deep root mass (<40 cm) weight | 6 | 1 | 1 | 1 | 1 | ||||
| DW2040 | Root mass 20–40 cm | 6 | ||||||||
| DW4060 | Root mass 40–60 cm | 6,12 | ||||||||
| DWB60 | root mass below 60 cm | 1 | 1 | 1 | 1 | |||||
| MRL | Maximum root length | 5 | 6 | |||||||
| NCR | Number of crown roots | 12 | 1 | 3 | 6,8,11 | 11 | 11 | 11 | ||
| RDW | Root dry weight | 6 | ||||||||
| RTL | Root length | 2 | 2 | |||||||
| RTW | Root weight | 11 | 11 | 10,11 | ||||||
| SRP | Shallow root proportion (0–20 cm) | 6 | 4 | 4 | ||||||
| THK | Root thickness | 2 | 2 | 3 | 11 | 11 | 11 | |||
| FW | Leaf fresh weight | 1 | 12,12 | 1,10 | 1 | 6 | ||||
| LLGHT | Longest leaf length | 6 | 6 | 6 | 6 | |||||
| TW | Leaf turgid weight | 1 | 12,12 | 1,10 | 1 | 6 | ||||
| RWC_1W | RWC after 1w drought | 11 | ||||||||
| RWC_2W | RWC after 2w drought | 7,11 | 11 | 11 | ||||||
| RWC_3W | RWC after 3w drought | 7,11 | 8,11 | 7,11 | ||||||
| RECO_1W | Recovery ability after 1w drought | 7,11 | 7 | |||||||
| RECO_3W | Recovery ability after 3w drought | 11 | 11 | 11 | ||||||
| RECO_4W | Recovery ability after 4w drought | 11 | 11 | 5 | 5 | |||||
| RPPUE | Relative physiological phosphate use efficiency | 5 | ||||||||
| RPUpE | Relative phosphate uptake efficienc | 1 | 3 | 1 | ||||||
Note: Numbers indicate the chromosomes where the selected region(s) associated with the trait are selected.
Genes within selected regions in indica‐5 further detailed in Tables S7 to S13. RWC: relative water content. Traits description extracted from the overlapping QTL descriptions. Overlaps showed in Figure 4. QTLs from eight published studies (Higgins et al., 2021; Hoang, Gantet, et al., 2019; Hoang, van Dinh, et al., 2019; Mai et al., 2020; Phung et al., 2016; Ta et al., 2018, 2019; To et al., 2020).
FIGURE 4Vietnamese QTLs and their overlap with selected regions in the I5 subpopulation. QTLs from eight published studies (Higgins et al., 2021; Hoang, Gantet, et al., 2019; Hoang, van Dinh, et al., 2019; Mai et al., 2020; Phung et al., 2016; Ta et al., 2018; To et al., 2019, 2020) are plotted along each chromosome together with the 52 regions selected in the I5 subpopulation. The fourteen selected regions which overlap with at least one QTL are highlighted, the letters refer to the details shown in Table 2
FIGURE 3Selection sweeps in the Indica I5 subpopulation compared to the other Vietnamese subpopulations. XP‐CLR scores in 100,000 bp sliding windows are plotted along the 12 chromosomes, showing selection in the I5 subpopulation compared to (a) I2, (b) I2, (c) I3, (d) I4. The horizontal dashed line indicates the threshold XP‐CLR score of 440 for determining selected regions. (e) F ST in 100,000 bp sliding windows for the 43 samples in the I5 subpopulation compared to the 190 samples in the I2, I3 and I4 subpopulations. The F ST peaks (selection signatures) ranged from 0.5 and 0.8, while the average F ST (associated with subpopulation differentiation) was 0.18 for this comparison. (f) Whole‐genome genetic diversity (Π) in 100,000 bp sliding windows for the 43 samples in the I5 subpopulation. The vertical lines show the position of the 52 selected regions
Fourteen of the 52 regions under selection in the Indica I5 subpopulation, and their overlap with QTLs
| Region | Chr. | Position (bp) |
| Genes | Overlaps: Subpopulations | Overlaps: Regions and genes | Overlaps: QTLs |
|---|---|---|---|---|---|---|---|
| I5_1 | 1 | 5,563,164–6,569,946 | 0.28 | 138 | I2, I4, J1, J3, J4 | 1 (39) | Root mass (Phung et al., |
| I5_5 | 1 | 37,850,965–38,378,420 | 0.64 | 84 | I1 | Leaf mass (Hoang, Gantet, et al., | |
| I5_16 | 2 | 28,191,142–29,329,745 | 0.24 | 168 | I3 | Jasmonate RTL (To et al., | |
| I5_30 | 5 | 386,347–1,563,159 | 0.28 | 190 | 3 (2) | 9_PL (d) | |
| I5_31 | 6 | 6,640,258–7,189,250 | 0.17 | 80 | I1, I2, I4 | 1 (7), 3 (39) | 12_GL (e) |
| I5_32 | 6 | 7,860,166–8,418,475 | 0.38 | 70 | I3, I4, J3 | 1 (3), 3 (34) | Leaf length (Phung et al., |
| I5_33 | 6 | 19,470,641–20,499,968 | 0.58 | 165 | I1 | Panicle length (Ta et al., | |
| I5_34 | 7 | 19,443,608–19,825,988 | 0.19 | 54 | I1, J4 | Water content after drought (Hoang, van Dinh, et al., | |
| I5_35 | 7 | 29,030,233–29,677,525 | 0.76 | 97 | I3 | Root depth (Phung et al., | |
| I5_36 | 8 | 3,484,045–3,758,632 | 0.35 | 39 | I3, I4 | Jasmonate SHL (To et al., | |
| I5_37 | 8 | 5,052,017–5,809,093 | 0.38 | 127 | I3, I4 | Panicle branches (Ta et al., | |
| I5_39 | 8 | 24,300,313–24,859,863 | 0.23 | 92 | Response of crown roots to phosphate (Mai et al., | ||
| I5_48 | 11 | 2,510,079–3,239,747 | 0.38 | 109 | I1, I4 | 1 (56) | Water content after drought (Hoang, van Dinh, et al., |
| I5_49 | 11 | 4,590,276–5,937,318 | 0.35 | 200 | J1 | 1 (3), 2 (14) | Root number (Phung et al., |
Note: Detailing the overlap of selected regions with published QTLs for Vietnamese rice populations, selected regions in Indica and Japonica subpopulations, and published selected regions (Cui et al., 2020; Lyu et al., 2014; Xie et al., 2015).
F ST per region between the 43 samples in subpopulation I5 and the 190 samples in subpopulations I2, I3 and I4. Further details per region are available in Table S7.
Overlaps with regions selected in other subpopulations.
Number of genes in brackets. Numbers naming subpopulations from: 1, tall (Ind1) [Xie 2015]; 2, semi‐dwarf (IndII) [Xie 2015]; 3, Cui et al. (2020).
Letters naming QTLs plotted in Figure 4.
Functional annotation of the 65 candidate genes under selection in the Indica I5 subpopulation and overlap with genes selected in previous studies
| Region | Gene ID (MSU) |
| Gene name | Selected in | SNP impact | References | Gene function |
|---|---|---|---|---|---|---|---|
| I5_1 | LOC_Os01g11860 | 0.300 | 2 | DJ‐1 family protein, putative, expressed | |||
| I5_5 | LOC_Os01g65670 | 0.909 | OsAAP6|qPC1 | Abbai et al. ( | Amino acid transporter, putative, expressed | ||
| I5_5 | LOC_Os01g65770 | 0.936 | Start lost | Expressed protein—rice specfic | |||
| I5_5 | LOC_Os01g65904 | 0.788 | Stop gained | Expressed protein—rice specfic | |||
| I5_5 | LOC_Os01g66030 | 0.651 | OsMADS2 | Lombardo et al. ( | OsMADS2—MADS‐box family gene with MIKCc type‐box, expressed | ||
| I5_5 | LOC_Os01g66070 | 0.445 | To et al. ( | PHD‐finger domain containing protein, putative | |||
| I5_16 | LOC_Os02g47310 | 0.564 | VTE4 | To et al. ( | Cyclopropane‐fatty‐acyl‐phospholipid synthase, putative, expressed | ||
| I5_16 | LOC_Os02g47350 | 0.666 | To et al. ( | Oxidoreductase, short‐chain dehydrogenase/reductase family, putative, expressed | |||
| I5_16 | LOC_Os02g47400 | 0.501 | To et al. ( | Pectinacetylesterase domain containing protein, expressed | |||
| I5_16 | LOC_Os02g47410 | 0.522 | To et al. ( | Protein kinase, putative, expressed | |||
| I5_16 | LOC_Os02g47420 | 0.572 | OSROPGEF | To et al. ( | ATROPGEF7/ROPGEF7, putative, expressed | ||
| I5_16 | LOC_Os02g47440 | 0.536 | To et al. ( | Syntaxin, putative, expressed | |||
| I5_16 | LOC_Os02g47590 | 0.637 | To et al. ( | Ornithine carbamoyltransferase, putative, expressed | |||
| I5_16 | LOC_Os02g47660 | 0.372 | OsBLR1 | Wang et al. ( | Basic helix–loop–helix, putative, expressed | ||
| I5_17 | LOC_Os03g12840 | 0.477 | DSM3|OsITPK2 | Stop gained | Du et al. ( | Inositol 1, 3, 4‐trisphosphate 5/6‐kinase, putative, expressed | |
| I5_17 | LOC_Os03g13010 | 0.837 | TUD1|DSG1|ELF1 | Sakamoto et al. ( | U‐box domain containing protein, expressed | ||
| I5_17 | LOC_Os03g13140 | 0.879 | Hb1 | Lira‐Ruan et al. ( | Non‐symbiotic haemoglobin 2, putative, expressed | ||
| I5_17 | LOC_Os03g14669 | 0.918 | OsHAP5C | Kim et al. ( | Core histone H2A/H2B/H3/H4, putative, expressed | ||
| I5_23 | LOC_Os03g49500 | 0.719 | Os‐ERS1 | Yu et al. ( | Ethylene receptor, putative, expressed | ||
| I5_23 | LOC_Os03g51050 | 0.660 | PTR8 | 1,3 | Ouyang et al. ( | Peptide transporter PTR2, putative, expressed | |
| I5_25 | LOC_Os03g58600 | 0.844 | MEL1 | Yi et al. ( | PAZ domain containing protein, putative, expressed | ||
| I5_25 | LOC_Os03g58630 | 0.886 | OsTrxh4 | Ying et al. ( | Thioredoxin, putative, expressed | ||
| I5_29 | LOC_Os04g58740 | 0.818 | 2 | Start lost | Expressed protein—rice specfic | ||
| I5_29 | LOC_Os04g58750 | 0.815 | OsBSK3 | 2 | Zhang et al. ( | Protein kinase family protein, putative, expressed | |
| I5_29 | LOC_Os04g58780 | 0.806 | WSL5|OsPPR4 | 2 | Liu et al. ( | Pentatricopeptide repeat protein, putative, expressed | |
| I5_29 | LOC_Os04g58870 | 0.813 | Splice acceptor or intron variant | Tu et al. ( | exo70 exocyst complex subunit, putative, expressed | ||
| I5_29 | LOC_Os04g58880 | 0.826 | RLS2|OsEXO70A1 | Tu et al. ( | exo70 exocyst complex subunit, putative, expressed | ||
| I5_30 | LOC_Os05g02260 | 0.617 | bip130 | Stop gained | Zhou et al. ( | Interacts with OsMPK1 | |
| I5_31 | LOC_Os06g12450 | 0.360 | ALK|SSIIa | 4 | Zhang et al. ( | Soluble starch synthase 2–3, chloroplast precursor, putative, expressed | |
| I5_32 | LOC_Os06g14620 | 0.471 |
| 4 | Qin et al. ( | Ribonucleoside‐diphosphate reductase small chain, putative, expressed | |
| I5_33 | LOC_Os06g34360 | 0.959 | Zang et al. ( | Zinc finger, C3HC4 type domain containing protein, expressed | |||
| I5_33 | LOC_Os06g34650 | 0.948 | Zang et al. ( | Zinc finger, C3HC4 type domain containing protein, expressed | |||
| I5_33 | LOC_Os06g33520 | 0.509 | OsABP | Macovei et al. ( | DEAD/DEAH box helicase, putative, expressed | ||
| I5_35 | LOC_Os07g48560 | 0.927 | WOX11 | Zhang et al. ( | Homeobox domain containing protein, expressed | ||
| I5_35 | LOC_Os07g48640 | 0.953 | OsSDR | Kim et al. ( | Short‐chain dehydrogenase/reductase, putative, expressed | ||
| I5_35 | LOC_Os07g48680 | 0.955 | Zang et al. ( | Zinc finger, C3HC4 type domain containing protein, expressed | |||
| I5_35 | LOC_Os07g48750 | 0.920 | OsARAF1 | Sumiyoshi et al. ( | Alpha‐N‐arabinofuranosidase, putative, expressed | ||
| I5_35 | LOC_Os07g48780 | 0.907 | OsCam1‐2|OsCam1 | Saeng‐ngam et al. ( | OsCam1‐2—Calmodulin, expressed | ||
| I5_35 | LOC_Os07g48820 | 0.901 | OsbZIP63|OsNIF1 | Delteil et al. ( | Transcription factor, putative, expressed | ||
| I5_35 | LOC_Os07g48830 | 0.931 | OsGolS2|wsi76 | Mukherjee et al. ( | Glycosyl transferase 8 domain containing protein, putative, expressed | ||
| I5_35 | LOC_Os07g48920 | 0.916 | OsALDH22 | Yang et al. ( | Aldehyde dehydrogenase, putative, expressed | ||
| I5_36 | LOC_Os08g06370 | 0.014 | To et al. ( | MYB family transcription factor, putative, expressed | |||
| I5_37 | LOC_Os08g09110 | 0.904 | Stop gained | NB‐ARC domain containing protein, expressed | |||
| I5_37 | LOC_Os08g09190 | 0.286 | OsPILS2 | Ta et al. ( | Auxin efflux carrier component, putative, expressed | ||
| I5_39 | LOC_Os08g39100 | 0.239 | OsPP2C66 | Mai et al. ( | Protein phosphatase 2C, putative, expressed | ||
| I5_39 | LOC_Os08g38990 | 0.202 | OsWRKY30 | Mai et al. ( | WRKY30, expressed | ||
| I5_41 | LOC_Os09g28280 | 0.654 | 4 | Gibberellin receptor GID1L2, putative, expressed | |||
| I5_41 | LOC_Os09g28840 | 0.654 | OsSCP43—Putative Serine Carboxypeptidase homologue, expressed | ||||
| I5_42 | LOC_Os09g30340 | 0.971 | PSAG | Park et al. ( | Photosystem I reaction centre subunit, chloroplast precursor, putative, expressed | ||
| I5_42 | LOC_Os09g30360 | 0.973 | Caffeoyl‐CoA O‐methyltransferase, putative, expressed | ||||
| I5_42 | LOC_Os09g30380 | 0.966 | AP005392‐AK108636—NBS/LRR genes that are S‐rich,divergent TIR, divergent NBS, expressed | ||||
| I5_42 | LOC_Os09g30400 | 0.954 | OsWRKY80 | Peng et al. ( | WRKY90, expressed | ||
| I5_42 | LOC_Os09g30410 | 0.961 | expressed protein | ||||
| I5_42 | LOC_Os09g31019 | 0.942 | Chen et al. ( | Ubiquitin fusion protein, putative, expressed | |||
| I5_47 | LOC_Os10g35260 | 0.703 | 3 | Rf1, mitochondrial precursor, putative, expressed | |||
| I5_47 | LOC_Os10g35540 | 0.783 | 3 | Hydrolase, alpha/beta fold family domain containing protein, expressed | |||
| I5_47 | LOC_Os10g35560 | 0.692 | OsSFR6 | 3 | de Freitas et al. ( | Expressed protein | |
| I5_47 | LOC_Os10g35604 | 0.661 | 3 | Stop gained | Expressed protein | ||
| I5_47 | LOC_Os10g35640 | 0.700 | Rf1b | 3 | Rf1, mitochondrial precursor, putative, expressed | ||
| I5_48 | LOC_Os11g05640 | 0.367 | OsZIP‐2a|OsbZIP80 | 2 | Nijhawan et al. ( | bZIP transcription factor domain containing protein, expressed | |
| I5_48 | LOC_Os11g06390 | 0.746 | OsACTIN2 | 2 | Actin, putative, expressed | ||
| I5_48 | LOC_Os11g06410 | 0.841 | SAB18 | 2 | Homeodomain, putative, expressed | ||
| I5_48 | LOC_Os11g06490 | 0.715 | Ribosome inactivating protein, putative, expressed | ||||
| I5_49 | LOC_Os11g09360 | 0.919 | OsFBX398 | Splice acceptor or intron variant | Jain et al. ( | OsFBX398—F‐box domain containing protein, expressed | |
| I5_49 | LOC_Os11g10070 | 0.721 | OsSEU2 | 3 | Splice acceptor or intron variant | Tanaka et al. ( | Transcriptional corepressor SEUSS, putative, expressed |
F ST per region between the 43 samples in subpopulation I5 and the 190 samples in subpopulations I2, I3 and I4. Further details are available in Table S12.
1, Ecotype differentiated genes (Lyu et al., 2014). 2, tall (Ind1) (Xie et al., 2015). 3, semi‐dwarf (IndII) (Xie et al., 2015). 4, domestication (Cui et al., 2020).
As measured by SNP effect.
FIGURE 5Allele Plots for “High impact” SNPs within eight candidate genes. Bar plots showing the base count for each subpopulation. A = adenine, T = thymine G = guanine, C = cytosine. Heterozygous calls are shown using IUPAC ambiguity codes