| Literature DB >> 35819660 |
Palvi Malik1,2, Mao Huang2, Kumari Neelam3, Dharminder Bhatia4, Ramanjeet Kaur1, Bharat Yadav1,5, Jasdeep Singh1, Clay Sneller2, Kuldeep Singh1,6.
Abstract
Being one of the most important staple dietary constituents globally, genetic enhancement of cultivated rice for yield, agronomically important traits is of substantial importance. Even though the climatic factors and crop management practices impact complex traits like yield immensely, the contribution of variation by underlying genetic factors surpasses them all. Previous studies have highlighted the importance of utilizing exotic germplasm, landraces in enhancing the diversity of gene pool, leading to better selections and thus superior cultivars. Thus, to fully exploit the potential of progenitor of Asian cultivated rice for productivity related traits, genome wide association study (GWAS) for seven agronomically important traits was conducted on a panel of 346 O. rufipogon accessions using a set of 15,083 high-quality single nucleotide polymorphic markers. The phenotypic data analysis indicated large continuous variation for all the traits under study, with a significant negative correlation observed between grain parameters and agronomic parameters like plant height, culm thickness. The presence of 74.28% admixtures in the panel as revealed by investigating population structure indicated the panel to be very poorly genetically differentiated, with rapid LD decay. The genome-wide association analyses revealed a total of 47 strong MTAs with 19 SNPs located in/close to previously reported QTL/genic regions providing a positive analytic proof for our studies. The allelic differences of significant MTAs were found to be statistically significant at 34 genomic regions. A total of 51 O. rufipogon accessions harboured combination of superior alleles and thus serve as potential candidates for accelerating rice breeding programs. The present study identified 27 novel SNPs to be significantly associated with different traits. Allelic differences between cultivated and wild rice at significant MTAs determined superior alleles to be absent at 12 positions implying substantial scope of improvement by their targeted introgression into cultivars. Introgression of novel significant genomic regions into breeder's pool would broaden the genetic base of cultivated rice, thus making the crop more resilient.Entities:
Keywords: Gene annotation; Genome-wide association study; LD decay; Minimum Bayes Factor; Oryza rufipogon; Population structure; Productivity related traits; SNP tagging
Year: 2022 PMID: 35819660 PMCID: PMC9276952 DOI: 10.1186/s12284-022-00582-4
Source DB: PubMed Journal: Rice (N Y) ISSN: 1939-8425 Impact factor: 5.638
Fig. 1Correlogram showing the distribution of phenotypic data for various traits and pairwise correlations between various traits. PH, PL and PB showed a negative correlation with GL, GW and HGW
Basic statistical summary of phenotypic data
| Trait | Range | Mean | Heritability |
|---|---|---|---|
| PH | 78.5–205.2 cm | 147 | 0.59 |
| CT | 2.64–7.72 mm | 4.83 | 0.57 |
| PL | 19.00–34.62 cm | 28.54 | 0.38 |
| PB | 7.00–21 | 11.06 | 0.57 |
| GL | 6.9–10.4 mm | 8.43 | 0.45 |
| GW | 1.52–3.19 mm | 2.17 | 0.4 |
| HGW | 0.73–2.50 g | 1.49 | 0.8 |
Fig. 2PCA plot generated from marker data of 346 accessions of O. rufipogon population from 16 different countries
Fig. 3Classification of O. rufipogon population into six sub-populations represented by different colors: red, blue, green, yellow, pink, cyan
AMOVA summary partitioning marker variance into within and between six sub-populations in complete set of O. rufipogon population
| Degrees of freedom | Sum of squares | Mean sum of squares | Sigma | Percent | |
|---|---|---|---|---|---|
| Variation between sub-populations | 5 | 37,691.01 | 7538.2 | 125.62 | 10.74 |
| Variations within populations | 340 | 354,885.5 | 1043.78 | 1043.78 | 89.26 |
| Total variations | 345 | 392,576.5 | 1137.9 | 1169.4 | 100 |
MTAs identified through GWAS for seven productivity related traits in AM panel of 346 O. rufipogon accessions
| SNP | Primary trait ( | Secondary trait ( | Previously reported QTLs/genes |
|---|---|---|---|
| S1_1931325 | CT(9.20E−05, 3.5%), HGW (2.24E−05, 3.42%) | ||
| S1_35199996 | CT (4.86E−05, 6.17%) | PH (0.011, 6.80%) | |
| S2_21858276 | CT (5.95E−06, 4.12%) | PB ( 4.98E−02, 3.19%) | |
| S3_15,462,193 | CT (1.70E−06, 6.75%) | PH (0.026303753, 5.56%), PB (0.021535861, 6.26%) | |
| S6_16,241,331 | CT (7.80E−05, 6.44%) | ||
| S7_1,592,572 | CT (8.89 E−05, 3.73%) | PB ( 2.20792E−4, 5.48%) | |
| S7_18,874,511 | CT (2.79E−05, 7.83%) | ||
| S7_24,536,919 | CT (2.99E−05, 7.27%) | PH (8.47 E−03, 13.23%) | |
| S9_21,545,679 | CT (8.02E−06, 3.98%) | ||
| S11_21,499,010 | CT (9.40E−05, 3.56%) | PB (1.58 E−04, 5.46%), PH (1.88E−02, 3.18%) | |
| S2_30,762,305 | PL (5.91E−05, 2.13%) | PH (0.021162648, 3.20%) | |
| S3_32,359,666 | PL (1.17E−05, 4.39%) | ||
| S4_34,899,389 | PL (5.89E−05, 4.06) | ||
| S6_6,597,022 | PL (1.37E−07, 4.70%) | PH(4.67879E−04, 7.92%), PB(0.005270489, 7%) | |
| S6_11,174,827 | PL(3.76E−06, 3.4%) | CT( 0.020179035, 3.17%), GW(0.026049429, 1.61%) | |
| S8_11,774,122 | PL(4.83E−07, 8.34%) | PB(0.035139693, 10.31%) | |
| S9_4,933,781 | PL(4.62E−05, 2.08%) | ||
| S10_14,320,467 | PL(3.59E−07, 3.1%) | PH(0.002110037, 4.37%) | |
| S10_16,385,834 | PL(2.00E−06, 5.40%) | PH(0.030803675, 6.90%), PB(0.00164866, 10.75%) | |
| S11_7,438,223 | PL(3.00E−05, 2.08%) | ||
| S4_30,721,851 | PB(7.49E−05, 5.56%) | ||
| S7_24,282,724 | PB(5.57E−05, 7.60%) | PH(0.042017316, 3.63%) | |
| S6_24,807,445 | GL(6.01E−05, 1.89%) | ||
| S8_5,775,398 | GL(9.86E−05, 1.84%) | ||
| S1_40,142,074 | GW(7.92E−05, 2.67%) | ||
| S2_7,048,091 | GW(6.96E−05, 2.25%) | ||
| S2_22,216,515 | GW(1.62E−05, 2.75%) | ||
| S3_7,917,671 | GW(5.35E−05, 4.64%) | ||
| S4_12,374,542 | GW(3.99E−06, 4.88%) | ||
| S4_34,598,600 | GW(8.93E−05, 3.35%) | ||
| S5_19,130,617 | GW(7.87E−06, 3.87%) | ||
| S5_23,720,696 | GW(4.31E−05, 2.90%) | ||
| S5_28,157,471 | GW(5.68E−06, 2.44%) | PH(0.002417007, 3.94%), HGW(0.002092217, 2.68%) | |
| S8_20,423,775 | GW(9.81E−06, 3.00%) | ||
| S8_24,621,885 | GW(3.61E−06, 3.92%) | PB(0.022511555, 5.8%), PH(0.020274467, 5.36%) | |
| S10_111,061 | GW(5.03E−06, 4.29%) | ||
| S10_19,109,511 | GW(1.47E−05, 2.4%) | ||
| S10_19,238,621 | GW(2.16E−05, 3.91%) | ||
| S1_38,370,584 | HGW(6.88E−07, 7.38%) | ||
| S2_2,875,772 | HGW(6.46E−05, 3.49%) | GL(0.035508538, 0.98%), PH(0.001764352, 3.4%) | |
| S2_3,873,759 | HGW(1.55E−05, 6.38%) | GL(0.005263071, 2.61%) | |
| S4_4,499,266 | HGW(1.34E−10, 9.46%) | ||
| S4_26,914,103 | HGW(9.69E−07, 6.17%) | ||
| S4_31,316,844 | HGW(5.89E−06, 5.43%) | ||
| S4_35,115,087 | HGW(3.76E−05, 6.58%) | CT(0.002310095, 6.00%) | |
| S5_24,316,574 | HGW(7.99E−08, 8.85%) | ||
| S11_19,062,952 | HGW(5.30E−07, 8.05%) |
Functional annotation of loci harboring significant SNPs/ present in LD-region of significant SNPs
| SNP | Loci harboring significant SNP/ in LD-region | Functions |
|---|---|---|
| S1_1931325 | LOC_Os01g04330, LOC_Os01g04340 | OsCML16—Calmodulin-related calcium sensor protein, expressed; hsp20/alpha crystallin family protein, putative, expressed |
| S1_35199996 | LOC_Os01g60870 | Expressed protein |
| S2_21858276 | LOC_Os02g36220 | Terpene synthase, putative, expressed |
| S3_15,462,193 | LOC_Os03g27003, LOC_Os02g36220 | Expressed protein |
| S6_16,241,331 | LOC_Os06g28550 | nmrA-like family domain containing protein |
| S7_1,592,572 | LOC_Os07g03810 | Lectin-like receptor kinase 7 |
| S7_18,874,511 | LOC_Os07g31770 | Chalcone synthase gene |
| S7_24,536,919 | LOC_Os07g41014 | Glycosyl hydrolases family 17 protein |
| S9_21,545,679 | LOC_Os09g37280 | Peroxisomal multifunctional enzyme type 2 protein |
| S11_21,499,010 | LOC_Os11g36460, LOC_Os11g36470 | SMC-related protein MSS2; ubiquitin carboxyl-terminal hydrolase 21 |
| S2_30,762,305 | LOC_Os02g50370 | Helicase domain-containing protein |
| S3_32,359,666 | LOC_Os03g56784 | Expressed protein |
| S4_34,899,389 | LOC_Os04g58690, LOC_Os04g58700 | tRNA-specific adenosine deaminase 1; expressed protein |
| S6_6,597,022 | LOC_Os06g12260, LOC_Os06g12250, LOC_Os06g12280 | N-rich protein; sphingolipid C4-hydroxylase SUR2; glycosyl transferase 8 domain containing protein |
| S6_11,174,827 | LOC_Os06g19590, LOC_Os06g19600, LOC_Os06g19610 | Estradiol 17-beta-dehydrogenase 12, expressed protein; oxidoreductase, short chain dehydrogenase/reductase family |
| S8_11,774,122 | LOC_Os08g19650, LOC_Os08g19670, LOC_Os08g19680 | Homeobox protein knotted-1; expressed protein; expressed protein |
| S9_4,933,781 | LOC_Os09g09220, LOC_Os09g09210 | Protein kinase domain containing protein, expressed; expressed protein |
| S10_14,320,467 | LOC_Os10g27170, LOC_Os10g27174, LOC_Os10g27180, LOC_Os10g27160 | Calmodulin-binding protein; 40S ribosomal protein S28; expressed protein; hypothetical protein |
| S10_16,385,834 | LOC_Os10g31240, LOC_Os10g31250 | Plant protein of unknown function domain containing protein; expressed protein |
| S11_7,438,223 | LOC_Os11g13570, LOC_Os11g13540, LOC_Os11g13560, LOC_Os11g13580 | Gibberellin receptor GID1L2; serpin domain containing protein, putative, expressed; serpin domain containing protein, putative, expressed; expressed protein |
| S4_30,721,851 | LOC_Os04g51809, LOC_Os04g51820 | Expressed protein; OsHKT1;1—Na + transporter, expressed |
| S7_24,282,724 | LOC_Os07g40520 | Formin-like protein 20, putative, expressed |
| S6_24,807,445 | LOC_Os06g41380, LOC_Os06g41384, LOC_Os06g41390 | Expressed protein; zinc finger C- × 8-C- × 5-C- × 3-H type family protein, expressed; N-terminal asparagine amidohydrolase |
| S8_5,775,398 | LOC_Os08g09990, LOC_Os08g10000 | Expressed protein; expressed protein |
| S1_40,142,074 | LOC_Os01g69070, LOC_Os01g69050, LOC_Os01g69060, LOC_Os01g69080 | Auxin efflux carrier component; lysine ketoglutarate reductase trans-splicing related 1; hydrolase, alpha/beta fold family domain containing protein; acyl-desaturase chloroplast precursor |
| S2_7,048,091 | LOC_Os02g13220 | F-box family protein |
| S2_22,216,515 | LOC_Os02g36830, LOC_Os02g36820 | Cytokinin-O-glucosyltransferase2; expressed protein |
| S3_7,917,671 | LOC_Os03g14590, LOC_Os03g14570, LOC_Os03g14580 | Calcium-binding EF hand family protein; expressed protein (conserved gene of known function); expressed protein (in ortholog, conserved gene of known function) |
| S4_12,374,542 | LOC_Os04g21820, LOC_Os04g21830, LOC_Os04g21840, LOC_Os04g21850 | OsWAK33—OsWAK receptor-like protein OsWAK-RLP, expressed; hypothetical protein; expressed protein; expressed protein |
| S4_34,598,600 | LOC_Os04g58080, LOC_Os04g58090, LOC_Os04g58100, LOC_Os04g58110 | Polygalacturonase inhibitor 3 precursor; harpin-induced protein 1 domain containing protein, Arabidopsis-LEA (LEA) hydroxyproline-rich glycoprotein family/ other ortho NHL25; Expressed Protein; pyruvate kinase |
| S5_19,130,617 | LOC_Os05g32680, LOC_Os05g32660, LOC_Os05g32670, LOC_Os05g32690 | Pale Cress Protein (PCP); leucine-rich repeat family protein,; pentatricopeptide repeat-containing protein (ortho-60S ribosomal protein L34); expressed protein |
| S5_23,720,696 | LOC_Os05g40370, LOC_Os05g40384 | Expressed protein; cytochrome P450 |
| S5_28,157,471 | LOC_Os05g49100 | WRKY49 |
| S8_20,423,775 | LOC_Os08g32930, LOC_Os08g32920, LOC_Os08g32940 | Expressed protein (n ortho-chaperonin like RbcX protein); dynamin-2B; endoglucanase |
| S8_24,621,885 | LOC_Os08g38960, LOC_Os08g38950 | Expressed protein (conserved gene of unknown function); EP |
| S10_111,061 | LOC_Os10g01110, LOC_Os10g01134 | OsSCP44—Putative Serine Carboxypeptidase homologue; OsSCP46—Putative Serine Carboxypeptidase homologue |
| S10_19,109,511 | LOC_Os10g35750, LOC_Os10g35720, LOC_Os10g35760 | Pentatricopeptide; OsGrx_S17—glutaredoxin subgroup II (Pistil); pentatricopeptide |
| S10_19,238,621 | LOC_Os10g35980, LOC_Os10g35990, LOC_Os10g36000 | Tetraspanin family protein; DEAD-box ATP-dependent RNA helicase; remorin C-terminal domain containing protein |
| S1_38,370,584 | – | – |
| S2_2,875,772 | LOC_Os02g05820, LOC_Os02g05830, LOC_Os02g05840 | Protein kinase domain containing protein; ribulose bisphosphate carboxylase small chain, chloroplast precursor; VIL2 protein |
| S2_3,873,759 | LOC_Os02g07480, LOC_Os02g07490, LOC_Os02g07495 | Transglycosylase SLT domain containing protein, expressed; glyceraldehyde-3-phosphate dehydrogenase; expressed protein (pistil) |
| S4_4,499,266 | LOC_Os04g08390, LOC_Os04g08410 | Leucine Rich Repeat family protein, expressed (NBS LRR orthologs in brachypodium, Sorghum); ELM2 domain containing protein, putative, expressed |
| S4_26,914,103 | LOC_Os04g45480, LOC_Os04g45460, LOC_Os04g45470, LOC_Os04g45490, LOC_Os04g45500 | Heat shock protein STI, putative; cysteine-rich repeat secretory protein precursor; vacuolar-processing enzyme precursor; elongation factor, putative; expressed protein |
| S4_31,316,844 | LOC_Os04g52630, LOC_Os04g52640, LOC_Os04g52614 | Leucine-rich repeat-containing protein kinase family protein; SHR5-receptor-like kinase, putative; SHR5-receptor-like kinase |
| S4_35,115,087 | LOC_Os04g59010, LOC_Os04g59020, LOC_Os04g59040, LOC_Os04g59030 | F-box domain containing protein; integral membrane protein; soluble inorganic pyrophosphatase, putative; expressed protein |
| S5_24,316,574 | LOC_Os05g41530, LOC_Os05g41510, LOC_Os05g41520, LOC_Os05g41540, LOC_Os05g41550 | ZOS5-11—C2H2 zinc finger protein, expressed; SH2 motif, putative, expressed; zinc finger, C3HC4 type domain containing protein, expressed; bZIP transcription factor domain containing protein, expressed; expressed protein |
| S11_19,062,952 | LOC_Os11g32260, LOC_Os11g32250, LOC_Os11g32270 | Lysosomal alpha-mannosidase precursor, putative, expressed; expressed protein; N-rich protein, putative, expressed |
Fig. 4Number of SNPs within 1 Mb window for all the 12 chromosomes
Fig. 5Circular Manhattan plot depicting significant marker trait associations for various traits moving from center towards the periphery plant height, culm thickness, panicle length, number of primary branches per panicle, grain length, grain width and hundred grain weight where red and blue lines represent p-value ≤ 1e−6 (Bonferroni threshold) and 1e−4 (threshold in current study), respectively
List of MTAs with significant differences between allelic effects on the basis of Kruskal–Wallis test
| Trait | SNP | Chi-square | |
|---|---|---|---|
| CT | S1_1931325 | 19.100 | 1.24E−05 |
| S2_21858276 | 21.327 | 3.87E−06 | |
| S3_15462193 | 17.774 | 2.49E−05 | |
| S7_1592572 | 31.947 | 1.24E−05 | |
| S7_18874511 | 5.212 | 0.02243 | |
| S7_24536919 | 10.542 | 0.001167 | |
| S9_21545679 | 22.647 | 1.95E−06 | |
| GL | S6_24807445 | 19.825 | 8.49E−06 |
| S8_5775398 | 15.401 | 8.69E−05 | |
| GW1 | S1_40142074 | 9.623 | 0.001922 |
| S2_22216515 | 4.190 | 0.04066 | |
| S3_7917671 | 19.100 | 1.24E−05 | |
| S10_19238621 | 6.454 | 0.01107 | |
| S10_19109511 | 20.360 | 6.42E−06 | |
| S5_23720696 | 4.833 | 0.02792 | |
| S8_24621885 | 16.743 | 4.28E−05 | |
| S5_28157471 | 50.133 | 1.44E−12 | |
| HGW | S1_1931325 | 36.661 | 1.41E−09 |
| S5_24316574 | 6.032 | 0.01405 | |
| S4_31,316,844 | 4.042 | 0.04438 | |
| S4_4499266 | 17.879 | 2.36E−05 | |
| S2_3873759 | 16.272 | 5.49E−05 | |
| S2_2875772 | 50.715 | 1.07E−12 | |
| PB | S4_30721851 | 14.919 | 0.0001122 |
| S7_24282724 | 14.063 | 0.0001768 | |
| PL | S2_30762305 | 12.208 | 0.0004759 |
| S3_32359666 | 11.277 | 0.0007845 | |
| S6_6597022 | 15.410 | 8.65E−05 | |
| S6_11174827 | 7.097 | 0.00772 | |
| S8_11774122 | 5.157 | 0.02316 | |
| S9_4933781 | 3.946 | 0.04699 | |
| S10_14320467 | 8.116 | 0.004388 | |
| S10_16385834 | 7.232 | 0.007163 | |
| S11_7438223 | 6.696 | 0.009661 |
Fig. 6Box and whisker plots depicting significant allelic effects of significant MTAs for all the traits a CT b PB c PL d GL e GW f HGW
The list of O. rufipogon accessions possessing combination of superior alleles at significant SNP positions associated with GW where SNPs marked with * represent alleles absent in indica cultivars
| S1_40142074* | S2_22216515 | S3_7917671 | S5_23720696 | S5_28157471 | S8_24621885* | S10_19238621* | |
|---|---|---|---|---|---|---|---|
| Superior | AA | GG | GG | CC | CC | TT | GG |
| IR103308 | AA | GG | GG | CC | CC | TT | GG |
| PR114 | GG | GG | NN | CC | CC | CC | CC |
The list of O. rufipogon accessions possessing combination of superior alleles at significant SNP positions associated with CT where SNPs marked with * represent alleles absent in indica cultivars
| S1_1931325* | S2_21858276 | S3_15462193 | S7_1592572 | S7_18874511 | S7_24536919 | S9_21545679 | |
|---|---|---|---|---|---|---|---|
| Superior | GG | GG | AA | CC | GG | GG | CC |
| IR82989 | GG | GG | AA | CC | GG | GG | CC |
| IR83814 | GG | GG | AA | CC | GG | GG | CC |
| IR88788 | GG | GG | AA | CC | GG | GG | CC |
| CR100018A | GG | GG | AA | CC | GG | GG | CC |
| CR100018 | GG | GG | AA | CC | GG | GG | CC |
| PR114 | CC | GG | AA | CC | GG | GG | CC |
The list of O. rufipogon accessions possessing combination of superior alleles at significant SNP positions associated with HGW where SNPs marked with * represent alleles absent in indica cultivars
| S1_1931325 | S2_2875772 | S2_3873759 | S4_35115087 | S4_4499266* | S5_24316574 | |
|---|---|---|---|---|---|---|
| Superior | CC | TT | TT | CC | AA | TT |
| CR100443 | CC | TT | TT | TT | AA | TT |
| IR104777 | CC | TT | TT | TT | AA | TT |
| IR104783 | CC | CT | TT | TT | AA | TT |
| PR114 | CC | TT | TT | TT | GG | TT |
The list of O. rufipogon accessions possessing combination of superior alleles at significant SNP positions associated with PL where SNPs marked with * represent alleles absent in indica cultivars
| S2_30762305* | S3_32359666 | S6_6597022 | S6_11174827* | S8_11774122 | S9_4933781* | S10_14320467* | S10_16385834 | S11_7438223* | |
|---|---|---|---|---|---|---|---|---|---|
| Superior | GG | GG | GG | CC | CC | GG | GG | CC | AA |
| CR100459 | GG | GG | GG | CC | CC | GG | GG | NN | AA |
| PR114 | TT | GG | GG | GG | CC | CC | AA | CC | GG |
The list of O. rufipogon accessions possessing combination of superior alleles at significant SNP positions associated with PB where SNPs marked with * represent alleles absent in indica cultivars
| S4_30721851 | S7_24282724* | |
|---|---|---|
| Superior | TT | CC |
| CR100375 | TT | CC |
| CR100449 | TT | CC |
| IR82988 | TT | CC |
| IR81885 | TT | CC |
| IR83810 | TT | CC |
| IR103404 | TT | CC |
| IR89224 | TT | CC |
| IR104404 | TT | CC |
| IR104404A | TT | CC |
| IR93217 | TT | CC |
| IR104404B | TT | CC |
| IR105948 | TT | CC |
| CR100459 | TT | CC |
| PR114 | TT | TT |
The list of O. rufipogon accessions possessing combination of superior alleles at significant SNP positions associated with GL where SNPs marked with * represent alleles absent in indica cultivars
| S6_24807445 | S8_5775398* | |
|---|---|---|
| Superior | AA | TT |
| CR100013A | AA | TT |
| CR100443 | AA | TT |
| CR100455 | AA | TT |
| IR100923 | AA | TT |
| IR81801 | AA | TT |
| IR83813 | AA | TT |
| IR103404 | AA | TT |
| IR88788 | AA | TT |
| IR93120 | AA | TT |
| IR104404C | AA | TT |
| IR100597 | AA | TT |
| IR100678 | AA | TT |
| IR103850 | AA | TT |
| IR104425 | AA | TT |
| IR104641 | AA | TT |
| IR104746 | AA | TT |
| IR104766 | AA | TT |
| IR104777 | AA | TT |
| IR104783 | AA | TT |
| IR104796 | AA | TT |
| IR104821 | AA | TT |
| IR104821A | AA | TT |
| IR104873 | AA | TT |
| IR105240 | AA | TT |
| IR105726 | AA | TT |
| IR105491 | AA | TT |
| CR100001 | AA | TT |
| IR113661 | AA | TT |
| CR100216 | AA | TT |
| IR80562 | AA | TT |
| CR100368 | AA | TT |
| CR100465B | AA | TT |
| IR80660 | AA | TT |
| CR100490 | AA | TT |
| PR114 | AA | CC |