| Literature DB >> 31872320 |
Samuel Tareke Woldegiorgis1, Shaobo Wang1, Yiruo He2, Zhenhua Xu1, Lijuan Chen1, Huan Tao1, Yu Zhang1, Yang Zou1, Andrew Harrison3, Lina Zhang1, Yufang Ai1, Wei Liu4, Huaqin He5.
Abstract
BACKGROUND: Rice (Oryza sativa L.) yield is limited inherently by environmental stresses, including biotic and abiotic stresses. Thus, it is of great importance to perform in-depth explorations on the genes that are closely associated with the stress-resistant traits in rice. The existing rice SNP databases have made considerable contributions to rice genomic variation information but none of them have a particular focus on integrating stress-resistant variation and related phenotype data into one web resource.Entities:
Keywords: Abiotic stress; Biotic stress; Database; SNP
Year: 2019 PMID: 31872320 PMCID: PMC6928182 DOI: 10.1186/s12284-019-0356-0
Source DB: PubMed Journal: Rice (N Y) ISSN: 1939-8425 Impact factor: 4.783
Fig. 1System view of RSRS database (a) Data collection of rice stress resistant/susceptible varieties and their re-sequencing data (b) Variant detection of the rice stress resistant/susceptible varieties and identification of the stress resistant specific SNPs (c) The RSRS database and web interface design layout
Fig. 2Entity-Relationship Diagram for the RSRS database. The Pheno_SNP table is the table, which connects the phenotype details with the genotype details. In this table the pheno_ID field is used to identify the associated stress with the varieties and the snp_ID is used to identify the associated SNPs with the varieties. The Pheno_SNP table have a 1:N relationship with the Phenotypes and SNP_genotypes tables using pheno_ID and snp_ID keys. The SNP_genotypes table describes the detail of each SNP, which is characterized by its own unique ID, snp_ID, chrom, position, reference allele and genotypes of each accession. A SNP may consist of multiple alleles in which the annotation of each allele is stored in the SNP annotation table. The SNP_annotation table is connected in a 1:N relationship with the Gene_info table, which stores the detail of each gene. The Gene_Info table is connected to the GO_annotation table to characterize the associated genes with each SNP. The variety and phenotype details are stored in Variety_Info, Phenotype_data and Phenotypes table, var_ID and pheno_ID are used to connect these tables. The GWAS_info table stores the associated GWAS record of each SNP with its associated stress. The ER diagram was created by using draw.io at https://www.draw.io/
Number of stress-resistant SNPs located in different regions of rice genome
| Stress types | 3’UTRb | 5’UTRb | Downstreamb | Intergenicb | Intronb | CDSa | Upstreamb |
|---|---|---|---|---|---|---|---|
| Heat Stress | 93,767 | 55,759 | 653,058 | 635,265 | 63,162 | 205,078 | 2,780,114 |
| Alkali Stress | 66,920 | 37,644 | 388,645 | 331,550 | 43,425 | 147,100 | 1,755,014 |
| Salt Stress | 79,546 | 44,882 | 470,148 | 409,469 | 51,300 | 175,835 | 2,081,462 |
| Flood Stress | 67,419 | 37,926 | 391,338 | 334,562 | 43,764 | 142,223 | 1,787,105 |
| Cold Stress | 58,798 | 33,903 | 370,763 | 332,310 | 40,007 | 126,918 | 1,612,926 |
| Zinc Stress | 74,843 | 41,853 | 411,990 | 330,483 | 48,027 | 161,716 | 1,908,842 |
| Blast Fungus | 81,491 | 47,765 | 522,328 | 480,785 | 53,365 | 183,547 | 2,256,982 |
| Bacteria Leaf Blight | 63,649 | 35,647 | 368,539 | 319,163 | 41,456 | 138,399 | 1,633,392 |
| Bacteria Sheath Blight | 51,663 | 29,237 | 331,287 | 272,583 | 34,716 | 112,617 | 1,356,611 |
| Bacterial Rice Planthopper | 72,686 | 41,112 | 417,411 | 347,624 | 45,912 | 155,346 | 1,890,893 |
| Bacteria Stripe leaf blight | 49,221 | 27,107 | 269,081 | 203,861 | 31,815 | 103,938 | 1,274,279 |
| Brown Planthopper Pest | 79,293 | 44,994 | 475,951 | 410,522 | 52,299 | 173,293 | 2,109,723 |
| Gall Midge Pest | 50,495 | 29,177 | 291,121 | 230,150 | 32,704 | 107,667 | 1,367,183 |
| Small Brown Planthoppers Pest | 86,755 | 47,381 | 465,586 | 363,512 | 55,723 | 182,484 | 2,205,066 |
| Whitebacked Planthopper Pest | 94,541 | 53,730 | 552,635 | 462,375 | 61,037 | 203,404 | 2,508,144 |
| Rice Leafroller Pest | 79,704 | 44,658 | 448,760 | 358,452 | 50,581 | 172,603 | 2,072,717 |
aCDS includes the SNPs in exonic region including nsSNPs (non-synonymous SNPs) and sSNPs (synonymous SNPs)
bIntergenic, 5’UTR, Intron 3’UTR, Upstream and Downstream indicate that No. of stress-resistant SNPs located in the Intergenic, 5’UTR, Intronic, 3′ UTR, upstream and downstream regions of a gene
Fig. 3Number of abiotic and biotic stresses-resistant SNPs in splice region and other large effect SNPs. The large effect SNPs include start/stop gained, lost or retained, and splice donor and acceptor. Start and stop codon gained or lost SNP induce the gain or loss of the start/stop codon. SNP located in the stop codon may retain the stop codon function (Stop codon retained). The SNP generating a splice acceptor, donor or region are named splice acceptors, splice donor of splice region
Fig. 5The alternate allelic frequency distribution in different abiotic and biotic stress-resistant genes in rice. The X axis represents the allele frequencies of the non-reference alleles and the Y axis represents of the count of each allele frequency of the SNPs in these genes
Fig. 4The variants distribution by annotation in different abiotic and biotic stress-resistant genes in rice. The X-axis represents the variant annotation of the SNPs in these genes and the Y-axis represents the count of each annotation in these genes
Fig. 6Examples of search functions in RSRS database. Search Interface (a) allows users to search by region, gene ID and SNP ID. Additionally users can also set options, such as allele frequency, variant types. Search Results (b) retrieves the list of SNPs and displays the allele distribution in a given range. SNP detail view (c) displays the detail of each SNP and the genotype of each rice accession. Jbrowse visualization (d) displays the SNP detail in Jbrowse and gives the detail of the associated gene with the SNPs