| Literature DB >> 36011264 |
Samuel Tareke Woldegiorgis1, Ti Wu1, Linghui Gao1, Yunxia Huang1, Yingjie Zheng1, Fuxiang Qiu1, Shichang Xu1, Huan Tao1, Andrew Harrison2, Wei Liu1, Huaqin He1.
Abstract
The availability of large-scale genomic data resources makes it very convenient to mine and analyze genes that are related to important agricultural traits in rice. Pan-genomes have been constructed to provide insight into the genome diversity and functionality of different plants, which can be used in genome-assisted crop improvement. Thus, a pan-genome comprising all genetic elements is crucial for comprehensive variation study among the heat-resistant and -susceptible rice varieties. In this study, a rice pan-genome was firstly constructed by using 45 heat-tolerant and 15 heat-sensitive rice varieties. A total of 38,998 pan-genome genes were identified, including 37,859 genes in the reference and 1141 in the non-reference contigs. Genomic variation analysis demonstrated that a total of 76,435 SNPs were detected and identified as the heat-tolerance-related SNPs, which were specifically present in the highly heat-resistant rice cultivars and located in the genic regions or within 2 kbp upstream and downstream of the genes. Meanwhile, 3214 upregulated and 2212 downregulated genes with heat stress tolerance-related SNPs were detected in one or multiple RNA-seq datasets of rice under heat stress, among which 24 were located in the non-reference contigs of the rice pan-genome. We then mapped the DEGs with heat stress tolerance-related SNPs to the heat stress-resistant QTL regions. A total of 1677 DEGs, including 990 upregulated and 687 downregulated genes, were mapped to the 46 heat stress-resistant QTL regions, in which 2 upregulated genes with heat stress tolerance-related SNPs were identified in the non-reference sequences. This pan-genome resource is an important step towards the effective and efficient genetic improvement of heat stress resistance in rice to help meet the rapidly growing needs for improved rice productivity under different environmental stresses. These findings provide further insight into the functional validation of a number of non-reference genes and, especially, the two genes identified in the heat stress-resistant QTLs in rice.Entities:
Keywords: heat stress; pan-genome; presence/absence variation (PAV); rice (Oryza sativa L.); single nucleotide polymorphisms (SNPs)
Mesh:
Year: 2022 PMID: 36011264 PMCID: PMC9407402 DOI: 10.3390/genes13081353
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1Gene distribution in the rice pan-genome. (A) The growth model of pan-genome genes and core genes. (B) Distribution of exon counts among the core and variable genes. (C) Distribution of the length of genes among the core and variable genes.
Figure 2PAV-based pairwise relationships among the tested rice accessions *. * The left column indicates the heat stress response of the different rice cultivars. Green and red represent the heat-tolerant and -susceptible cultivars, respectively.
Figure 3Functional annotation of the predicted genes in the non-reference contigs of the rice pan-genome. GO terms enriched at padj < 0.05 significance.
Figure 4SNP-based phylogenetic tree of the tested rice varieties. The inner strip represents the heat stress response (R for resistant and S for susceptible) while the outer one represents the population of each rice cultivar based on their metadata. The outer bar chart represents the number of SNPs in each rice accession.
Classification of the SNPs detected in the rice pan-genome.
| Variant Type | Pan-Genome | Reference | Non-Reference | Cluster 2 | Cluster 1 |
|---|---|---|---|---|---|
| Bi-allele SNP | 5,059,798 | 4,868,611 | 191,187 | 4,763,997 | 2,373,085 |
| Splicing | 15,986 | 15,881 | 105 | 15,210 | 7773 |
| Exonic | 284,016 | 281,185 | 2831 | 269,738 | 140,517 |
| Intronic | 578,887 | 576,534 | 2353 | 548,909 | 277,418 |
| UTR | 224,599 | 224,127 | 472 | 213,375 | 105,933 |
| Upstream | 1,229,370 | 1,224,403 | 4967 | 2,790,074 | 1,351,465 |
| Downstream | 1,100,851 | 1,095,937 | 4914 | 2,595,947 | 1,262,776 |
| Missense | 143,819 | 142,071 | 1748 | 136,135 | 70,833 |
| Stop gained | 1936 | 1900 | 36 | 1816 | 840 |
Distribution of the heat stress-tolerant SNPs in the rice pan-genome.
| Annotation | SNPs in Resistant Cultivars | SNPs in Highly Resistant Cultivars |
|---|---|---|
| Downstream | 29,082 | 14,759 |
| Exon | 358 | 185 |
| Intron | 18,846 | 9541 |
| Non_Synonymous | 10,194 | 5046 |
| Splice site acceptor | 28 | 16 |
| Splice Site donor | 38 | 19 |
| Start gained | 769 | 366 |
| Start lost | 25 | 11 |
| Stop gained | 225 | 116 |
| Stop lost | 26 | 14 |
| Synonymous | 7827 | 4090 |
| Upstream | 66,718 | 35,970 |
| UTR_3 | 8820 | 4422 |
| UTR_5 | 3817 | 1880 |
|
|
|
|
Numbers of DEGs in the 4 RNA-seq datasets of rice under heat stress.
| ProjectID * | Test Cultivars | Comparisons | Pan-Genome Upregulated Genes | Reference Upregulated Genes | Non-Reference Upregulated Genes | Pan-Genome Downregulated Genes | Reference Upregulated Genes | Non-Reference Downregulated Genes |
|---|---|---|---|---|---|---|---|---|
| PRJNA604026 | 9311 | 9311HS_9311CTRL | 8248 | 8202 | 46 | 9691 | 9616 | 75 |
| Nipponbare | NIPHS_NIPCTRL | 4914 | 4909 | 5 | 6504 | 6495 | 9 | |
| PRJNA508820 | Huanghuazhan | HHZ40_HHZ32 | 2091 | 2064 | 27 | 1819 | 1800 | 19 |
| IR36 | IR3640_IR3632 | 1395 | 1364 | 31 | 1503 | 1486 | 17 | |
| PRJNA610667 | HSR1 | HSR1_LSR1 | 2143 | 2058 | 85 | 1650 | 1605 | 45 |
| HSR2 | HSR1_LSR2 | 1704 | 1645 | 59 | 1048 | 1020 | 28 | |
| LSR1 | HSR2_LSR1 | 1617 | 1560 | 57 | 2232 | 2180 | 52 | |
| LSR2 | HSR2_LSR2 | 1596 | 1532 | 64 | 2117 | 2060 | 57 | |
| PRJNA633211 | MH101 | MH36_MH28 | 1825 | 1809 | 16 | 1361 | 1354 | 7 |
| MH38_MH28 | 5181 | 5145 | 36 | 2898 | 2881 | 17 | ||
| SDW005 | SD36_SD28 | 1380 | 1375 | 5 | 110 | 103 | 7 | |
| SD38_SD28 | 4618 | 4599 | 19 | 923 | 916 | 7 |
* NCBI project accession ID.
Figure 5A Venn diagram plot of the genes with heat tolerance-related SNPs (GENES_WITH_HS_SNPs) compared to the upregulated and downregulated genes in the 4 RNA-seq datasets.
Figure 6Differential expression profile of the genes with heat stress tolerance-related SNPs in different transcriptomic datasets of rice under heat stress.
Number of genes in the non-reference contigs mapped to the heat-tolerant QTLs in each reference genome.
| BioSample ID * | Number of Genes |
|---|---|
| SAMN08217222 | 37 |
| SAMN10564385 | 60 |
| SAMN12715984 | 49 |
| SAMN12721963 | 46 |
| SAMN12672924 | 55 |
| SAMN12718029 | 49 |
| SAMN12748569 | 38 |
| SAMN12748589 | 42 |
| SAMN12748590 | 55 |
| SAMN12748600 | 41 |
| SAMN12748601 | 39 |
| SAMN13021815 | 51 |
* Biosample ID of the rice accession.