| Literature DB >> 35406707 |
Nitika Rana1,2, Surbhi Kumawat1,2, Virender Kumar1, Ruchi Bansal1,2, Rushil Mandlik1,2, Pallavi Dhiman1, Gunvant B Patil3, Rupesh Deshmukh1, Tilak Raj Sharma1,4, Humira Sonah1.
Abstract
Nutritional quality improvement of rice is the key to ensure global food security. Consequently, enormous efforts have been made to develop genomics and transcriptomics resources for rice. The available omics resources along with the molecular understanding of trait development can be utilized for efficient exploration of genetic resources for breeding programs. In the present study, 80 genes known to regulate the nutritional and cooking quality of rice were extensively studied to understand the haplotypic variability and gene expression dynamics. The haplotypic variability of selected genes were defined using whole-genome re-sequencing data of ~4700 diverse genotypes. The analytical workflow identified 133 deleterious single-nucleotide polymorphisms, which are predicted to affect the gene function. Furthermore, 788 haplotype groups were defined for 80 genes, and the distribution and evolution of these haplotype groups in rice were described. The nucleotide diversity for the selected genes was significantly reduced in cultivated rice as compared with that in wild rice. The utility of the approach was successfully demonstrated by revealing the haplotypic association of chalk5 gene with the varying degree of grain chalkiness. The gene expression atlas was developed for these genes by analyzing RNA-Seq transcriptome profiling data from 102 independent sequence libraries. Subsequently, weighted gene co-expression meta-analyses of 11,726 publicly available RNAseq libraries identified 19 genes as the hub of interactions. The comprehensive analyses of genetic polymorphisms, allelic distribution, and gene expression profiling of key quality traits will help in exploring the most desired haplotype for grain quality improvement. Similarly, the information provided here will be helpful to understand the molecular mechanism involved in the development of nutritional and cooking quality traits in rice.Entities:
Keywords: allelic effects; gene expression dynamics; genetic variation; haplotypic network; molecular evolution
Mesh:
Substances:
Year: 2022 PMID: 35406707 PMCID: PMC8998046 DOI: 10.3390/cells11071144
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Details of single nucleotide polymorphisms (SNP) predicted to have a deleterious impact on the functionality of genes known to regulate the nutritional and cooking quality related traits in rice.
| Gene Name | RAP ID | Feature | Total SNPs | Number of Haplotypes | Number of Missense Mutations | Functional Impact of Missense Mutations | Number of InDels |
|---|---|---|---|---|---|---|---|
|
|
| Grain weight | 82 | 12 | 3 | V58G, A70S, H235R*, V58A | 70 |
|
|
| Galactolipid biosynthesis | 72 | 12 | 6 | D63V, F335I*, S321C*, G305V*, D288N*, F273V* | 186 |
|
|
| Glutelin content | 29 | 8 | 8 | Y5H* | 7 |
|
|
| Grain protein content | 66 | 11 | 9 | A82N*, A82N*, E100G, H101D*, H101D*, C193S*, P240L*, L266F*, Q340R, D349H | 105 |
|
|
| Copper accumulation | 47 | 12 | 8 | I55M, F303L, T316M, A553V, S660A, | 64 |
|
|
| Endosperm granule formation | 65 | 9 | 17 | V25A, C46R, R102K, M107I, S115A, | 70 |
|
|
| Grain amylose content | 107 | 21 | 7 | S96F*, Y237F, R189K | 116 |
|
|
| Grain starch structure | 67 | 8 | 8 | I143T*, E259Q*, A265S*, I268V*, Q284P, | 54 |
|
|
| Seed phytic acid | 41 | 10 | 6 | N6T, A19T, R32L, A350T, N645K*, L1469F* | 28 |
|
|
| Selenite uptake | 20 | 9 | 5 | H398R, N335D, P269S, S258C*, V185I | 6 |
|
|
| Grain tryptophan content | 78 | 17 | 6 | E585D, G527R*, P446S, R303P, E79K, | 85 |
|
|
| Grain cadmium content | 61 | 7 | 5 | H90N, D292N, F412V*, V448L*, L532V | 12 |
|
|
| Grain iron and zinc content | 243 | 5 | 20 | V369I, R307K, R304K, R281K, I227M, | 217 |
|
|
| Grain yield | 90 | 8 | 5 | Q89H*, Q106K*, Q71K*, T20A*, R10P* | 140 |
|
|
| Starch structure in endosperm | 33 | 9 | 11 | T268N*, V165I, E153K, R550H*, R501C*, | 37 |
|
|
| Iron translocation | 72 | 10 | 5 | A170T*, V136A*, Q105K | 70 |
|
|
| Seed glutelin | 33 | 5 | 3 | E384G*, Q90R, Y86C* | 27 |
|
|
| Iron distribution | 48 | 8 | 4 | S511N, T368R, L256F, R90L* | 22 |
|
|
| Grain pericarp colour | 1140 | 14 | 6 | E308D, D173N, L140V, D101H, P84L, A29V* | 448 |
|
|
| Arsenic accumulation | 245 | 23 | 40 | S1468T, R1398G*, R1300Q, E1231V*, R990Q, L933F, K892R, Q879L, V814L, R712H*, P642H*, A524S, R518C*, A449V, L285I, N283S, R276Q, T216S, P206L*, L176V, A156S, I150S, I134M, A107V, A92V, R90Q, T61A, G47S, T33S, N23Y*, V21L, S1468N, F708L*, A409T*, R383H*, S266I*, F252T*, A233V*, C228F*, C83F*, T29A* | 135 |
|
|
| Grain size | 117 | 9 | 30 | A274T, L399P, I466T, R579K, P599L, G804D, S1203L, N1319D, A1608T, F195L*, H200N*, P204T*, S306P*, N323D*, M348R*, M348I*, L369F*, A378S*, G452S*, P515T*, W589C*, R725K*, A748V*, A789S*, N829Y*, R892Q*, G926C*, R987H*, | 217 |
|
|
| Chalkiness | 212 | 11 | 19 | R525L, I497V, V412I, K401M, A379T, | 143 |
|
|
| Grain copper accumulation | 79 | 8 | 7 | A132V, A200T, P309S, I576V, S751T, | 114 |
|
|
| Grain characteristics | 137 | 14 | 4 | D166G*, Y224S*, P415S, D528Y*, D528N | 205 |
|
|
| Phosphorus accumulation | 281 | 7 | 6 | Q385L*, I251V, L247F, V71G*, A47V, | 308 |
|
|
| Grain starch content | 197 | 10 | 12 | S596L, K438E*, H420Y*, S319G, D214N*, | 138 |
|
|
| Grain starch quality | 98 | 16 | 9 | P56A, T117P, A148S, D161E, E208D, D283E, S604G, M737V, L781F* | 65 |
|
|
| Cadmium in grains | 236 | 4 | 31 | E15D, D26A, E35Q, P43L, P48S, A54S, I60T, L67H, L70Q, A71D, G73D, A77T, A80S, N84K, E87K, V95I, L101F, T147S, R152S, V183A, V211L, L215F, K223N, M241V, Q246H, E258L, M310V, L380F, T480S, V494M, L495F* | 218 |
|
|
| Grain zinc and cadmium content | 104 | 8 | 3 | C19R*, R7W | 261 |
|
|
| Grain zinc content | 142 | 10 | 56 | G990A, E975D, C960G, T953I, G930R, | 50 |
|
|
| Grain quality and yield | 60 | 10 | 10 | I915M, S647A, S620G, N605K, P518S, A462S, R361H, R361C, L259F* | 75 |
|
|
| Grain starch content | 45 | 12 | 7 | A26T, E541K*, L551S, S559N, N634D, E637K, S15G* | 47 |
|
|
| Grain iron and zinc content | 75 | 10 | 4 | A32V, C37R, L147V*, R159C | 5 |
|
|
| Seed storage protein | 26 | 7 | 9 | R255H, A219S, S179P, M174I, L169V, A111P, A102V, R100H, E47K*, A111T | 12 |
|
|
| Endosperm appearance | 186 | 14 | 35 | A33T, M38K, T43N, A62S, R109H, K116N, E142D, A184T, A195V, A217T, G226E, E231K, A268V, S350L, F401S, D427G, V480L, T486I, A503T, R576K, E641V, S681N, G686E, R702Q, R748H, E790V, G817D, V843E, L957M, Y964C*, K1006N*, R1118K, R1240H, A1528S, T1755I | 136 |
|
|
| Grain aroma | 156 | 21 | 5 | A190V, K244I*, A316E, P458S*, G468V* | 123 |
|
|
| Grain quality and shape | 69 | 10 | 8 | P79L, A110V, D172N*, T274N, Q285K, G315S, M364I, A397T | 222 |
|
|
| Seed quality | 59 | 11 | 14 | H89L*, A115V*, P129S*, N149D, S155A, T181A, F205L, I355M, G712S, R811S, V874L, R1063Q, F1101V, I1320L | 135 |
|
|
| Grain yield and quality | 28 | 5 | 4 | L394F*, T181P, V35L | 39 |
Gene names are italicized. * indicates deleterious mutations.
Figure 1Haplotypic distribution of chalk5 gene across the ~3024 rice accessions depicted based on the sequence variation data retrieved from SNPseek database (https://snp-seek.irri.org/) (accessed on 15 September 2021). The topmost box represents the gene structure and sequence variations for gene chalk5. Bigger blue sections denote exons and the bars represent intronic regions. Red lines show 11 non-synonymous SNPs.
Figure 2Representative illustration showing genetic variations observed in chalk5 gene defining haplotypic diversity, evolution, and expression profile in different tissues. (A) Intron–exon organization of chalk5 gene showing the non-synonymous single nucleotide polymorphism sites; (B) haplotypic grouping based on the SNPs present in the chalk5 gene; (C) haplotypic network showing relatedness and allelic evolution of chalk5 gene; (D) box plot showing the frequency distribution of haplotypes and their association with translucent and chalky phenotype, *** significance level at p-value < 0.001; (E) field emission scanning electron microscopy for chalky and translucent grains; and (F) expression profiling of chalk5 gene in different tissues.
Figure 3Nucleotide diversity and phylogenetic analysis for chalk5 and OsIRT1 genes. (A) Nucleotide diversity for the chalk5 (LOC_Os05g06480) gene and (B) OsIRT1 gene in cultivated (red), indica (green), japonica (blue), and wild (purple) rice varieties. The lower panel shows the ratio of nucleotide diversity between wild and cultivated rice accessions. Here, the x-axis represents the genomic positions and the y-axis denotes the nucleotide diversity values. (C,D) A neighbor-joining phylogenetic tree was implemented within the ECOGEMS resource for chalk5 and OsRT1 genes, respectively. Each edge of the circular tree represents a rice accession. The inner track represents the cultivated rice varieties whereas the outer track represents wild accessions. The Oryza rufipogon (Or) wild accessions are represented with blue (Or-I), red (Or-II), and black (Or-III) colors, whereas the cultivated Indica and Japonica accessions in purple and yellow colors, respectively.
Figure 4Heatmaps showing gene expression dynamics of 80 nutritional and cooking quality-related genes across different tissues and conditions. (A) Gene expression across seven rice varieties viz Heugjinju, Heuseol, Heugnam, Josengheug, Boseogheug, Sinnongheug, and Dongjin at different stages of seed development; (B) expression across different tissues such as endosperm, embryo, anther, pistil, seed, inflorescence, and leaves at different stages; (C) expression in wild type and loss of function mutants rice accessions for GSK5 and ARF4; and (D) expression in grains, pericarp, and endosperm of white, black, and red rice. Red color corresponds to high expression whereas green for low gene expressions.
Figure 5(A) Chromosomal distribution and cluster of genes as functional modules predicted by WGCNA. The outermost track number 1 shows the chromosomal position of 80 cooking and nutritional quality-related genes. Genes highlighted in red color belong to Module 1: Starch, Chalkiness, Zinc bioavailability, Sucrose; Module 2 (blue): Grain size, Starch, Seed storage protein, Glutelin; Module 3 (green): Cd, Zn, Glumes, Phosphorus, Arsenic, Heavy metals; Module 4 (black): Grain width, Size, Weight, Cadmium, Copper; Module 5 (orange): Iron, Phytate, Phosphorus; Module 6 (grey): Glutelin, Prolamine, Selenite. Track number 2 depicts the number of SNPs in various genes responsible for grain cooking and nutritional quality traits whereas the innermost track 3 represents the missense SNPs for these genes. (B) Co-expression network predicted among the 80 cooking and nutritional quality-related genes. The network was developed with Cytoscape v3.7.2.20 using RPKM values. Genes with the highest degree of co-expression have been displayed in bigger and brighter nodes and vice versa. Positive correlations have been depicted in orange, whereas negative correlations in blue edges.
Details of hub genes identified through weighted correlation network analysis (WGCNA) performed with transcriptome profiling of genes known to regulate nutritional and cooking quality related traits in rice.
| Gene Name | Gene ID | Neighborhood Connectivity | Clustering Coefficient | Number of Directed Edges | WGCNA Module | WGCNA Module Description | Tissue Expression | Normalized |
|---|---|---|---|---|---|---|---|---|
|
|
| 9.80 | 0.24 | 20 | 5 | Iron, Phytate, Phosphorus | Seedling | 249.5 |
|
|
| 11.84 | 0.39 | 19 | 2 | Grain size, Starch, Seed storage protein, Glutelin | Endsoperm | 397.9 |
|
|
| 10.16 | 0.33 | 19 | 2 | Grain size, Starch, Seed storage protein, Glutelin | Developing seed | 491.1 |
|
|
| 10.67 | 0.39 | 18 | 5 | Iron, Phytate, Phosphorus | Seedling | 149.3 |
|
|
| 8.88 | 0.13 | 17 | 0 | NA | Seedling and shoot | 672.4 |
|
|
| 12.63 | 0.48 | 16 | 0 | NA | Seedling | 57.5 |
|
|
| 12.64 | 0.48 | 14 | 0 | NA | Endosperm | 86.7 |
|
|
| 10.31 | 0.54 | 13 | 0 | NA | Endosperm | 363.9 |
|
|
| 13.92 | 0.67 | 13 | 4 | Grain width, size, weight, Cadmium, Copper, | Caryopsis | 136.4 |
|
|
| 12.54 | 0.56 | 13 | 2 | Grain size, Starch, Seed storage protein, Glutelin | Seed | 492.1 |
|
|
| 10.67 | 0.68 | 12 | 6 | Glutelin, Prolamine, Selenite | Seed | 264.4 |
|
|
| 10.67 | 0.68 | 12 | 1 | Starch, Chalkiness, Zinc bioavailability, Sucrose | Endosperm | 665.1 |
|
|
| 14.42 | 0.74 | 12 | 3 | Cd, Zn, Glumes, Phosphorus, Arsenic, Heavy metals, | Seedling | 47.9 |
|
|
| 10.67 | 0.68 | 12 | 1 | Starch, Chalkiness, Zinc bioavailability, Sucrose | Seed | 444.2 |
|
|
| 11.92 | 0.64 | 12 | 1 | Starch, Chalkiness, Zinc bioavailability, Sucrose | Grain | 1237.6 |
|
|
| 14.25 | 0.61 | 12 | 4 | Grain width, size, weight, Cadmium, Copper, | Seed | 127.9 |
|
|
| 11.45 | 0.64 | 11 | 2 | Grain size, Starch, Seed Storage protein, Glutelin | Seed | 370.6 |
|
|
| 12.80 | 0.58 | 10 | 0 | NA | Seed | 246.5 |
|
|
| 11.60 | 0.80 | 10 | 1 | Starch, Chalkiness, Zinc bioavailability, Sucrose | Developing seed | 1138.6 |
Details of transcription factors predicted to have interaction with cooking and nutritional quality-related genes. Plant Transcriptional Regulatory Map (PlantRegMap) server [40] was used to predict the interaction.
| TF | Common Name | TF Family | Query_All # | Query_Bind $ | q-Value | |
|---|---|---|---|---|---|---|
|
|
| Dof | 80 | 33 | 3.18 × 10−6 | 3.75 × 10−4 |
|
|
| Dof | 80 | 37 | 3.56 × 10−6 | 3.75 × 10−4 |
|
|
| HSF | 80 | 6 | 5.58 × 10−5 | 3.93 × 10−3 |
|
|
| MYB | 80 | 11 | 2.28 × 10−4 | 8.59 × 10−3 |
|
|
| MYB | 80 | 11 | 2.28 × 10−4 | 8.59 × 10−3 |
|
|
| C2H2 | 80 | 6 | 2.75 × 10−4 | 8.59 × 10−3 |
|
|
| Dof | 80 | 26 | 2.85 × 10−4 | 8.59 × 10−3 |
|
|
| WRKY | 80 | 6 | 4.24 × 10−4 | 1.12 × 10−2 |
|
|
| WRKY | 80 | 8 | 4.79 × 10−4 | 1.12 × 10−2 |
|
|
| MYB | 80 | 9 | 7.53 × 10−4 | 1.26 × 10−2 |
# ‘Query_all’ stand for the number of gene promoters that were examined for the existence of transcription factor binding sites; $ ‘Query_bind’ represents the number of genes with a binding site for a specific transcription factor in their promoter; ¥ p-value cutoffs of ≤0.05 was used to claim significant interaction.