| Literature DB >> 31687375 |
Rabiatul-Adawiah Zainal-Abidin1,2, Norliza Abu-Bakar2, Yun-Shin Sew2, Sanimah Simoh2, Zeti-Azura Mohamed-Hussein1,3.
Abstract
Recently, rice breeding program has shown increased interests on the pigmented rice varieties due to their benefits to human health. However, the genetic variation of pigmented rice varieties is still scarce and remains unexplored. Hence, we performed genome-wide SNP analysis from the genome resequencing of four Malaysian pigmented rice varieties, representing two black and two red rice varieties. The genome of four pigmented varieties was mapped against Nipponbare reference genome sequences, and 1.9 million SNPs were discovered. Of these, 622 SNPs with polymorphic sites were identified in 258 protein-coding genes related to metabolism, stress response, and transporter. Comparative analysis of 622 SNPs with polymorphic sites against six rice SNP datasets from the Ensembl Plants variation database was performed, and 70 SNPs were identified as novel SNPs. Analysis of SNPs in the flavonoid biosynthetic genes revealed 40 nonsynonymous SNPs, which has potential as molecular markers for rice seed colour identification. The highlighted SNPs in this study show effort in producing valuable genomic resources for application in the rice breeding program, towards the genetic improvement of new and improved pigmented rice varieties.Entities:
Year: 2019 PMID: 31687375 PMCID: PMC6811786 DOI: 10.1155/2019/4168045
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Figure 1Whole grains of Bali, Pulut Hitam 9, MRM16, and MRQ76. Pulut Hitam 9 has a darker black pigment compared to Bali, while MRM16 has a darker red pigment compared to MRQ100.
Summary of sequence reads and mapping statistics in Bali, PH9, MRM16, and MRQ100 genome.
| Bali | PH9 | MRM16 | MRQ100 | |
|---|---|---|---|---|
| Total reads (bp) | 101,710,572 | 99,980,328 | 98,764,058 | 99,998,624 |
| Number of clean reads (bp) | 99,865,228 (98.18%) | 99,380,446 (99.40%) | 98,078,122 (99.30%) | 94,428,632 (99.43%) |
| Genome coverage (30×) | 88.59% | 88.45% | 88.45% | 88.49% |
| Total mapped reads | 96,479,796 | 95,971,696 | 94,870,967 | 91,170,844 |
| Percentage of total mapped reads | 96.61% | 96.57% | 96.73% | 96.55% |
Summary of SNP identification and annotation in Bali, PH9, MRM16, and MRQ100 when compared against Nipponbare reference genome. The number of total annotated SNPs was higher than the total number of quality SNPs due to more than one annotation in a single SNP.
| Bali | PH9 | MRM16 | MRQ100 | Total | |
|---|---|---|---|---|---|
| Number of raw SNPs | 2,394,592 | 2,227,819 | 2,740,764 | 2,380,079 | 9,743,254 |
| Number of high-quality SNPs | 436,322 | 412,791 | 469,782 | 435,382 | 1,754,277 |
| Intergenic SNPs | 328,261 | 310,712 | 349,786 | 327,021 | 1,315,780 |
| Genic SNPs | 149,232 | 140,677 | 165,124 | 149,903 | 604,936 |
Figure 2Distribution of 662 SNPs with polymorphic sites on 12 rice chromosomes. Of these, 70 novel SNPs (10%) were detected when compared against Oryza sativa japonica Ensembl Plants variation database.
Biological process and molecular function GO terms associated with genes containing SNPs with polymorphic sites. False discovery rate (FDR < 0.05). Only the top 10 GO terms from biological process and molecular function were further discussed in this paper.
| Reactome pathway name | Molecular function GO terms | Frequency of genes containing SNPs with polymorphic sites | Biological process GO terms | Frequency of genes containing SNPs with polymorphic sites |
|---|---|---|---|---|
| (1) Metabolism and regulation (R-OSA-2744345) | Binding (GO:0005488) | 55 | Cellular process (GO:0009987) | 51 |
| Catalytic activity (GO:0003824) | 52 | Metabolic process (GO:0008152) | 49 | |
| Heterocyclic compound binding (GO:1901363) | 43 | Organic substance metabolic process (GO:0071704) | 44 | |
| Organic cyclic compound binding (GO:0097159) | 43 | Primary metabolic process (GO:0044238) | 41 | |
| Ion binding (GO:0043167) | 38 | Cellular metabolic process (GO:0044237) | 41 | |
| Small molecule binding (GO:0036094) | 24 | Nitrogen compound metabolic process (GO:0006807) | 37 | |
| Nucleotide binding (GO:0000166) | 24 | Macromolecule metabolic process (GO:0043170) | 34 | |
| Nucleoside phosphate binding (GO:1901265) | 24 | Cellular macromolecule metabolic process (GO:0044260) | 29 | |
| Purine nucleotide binding (GO:0017076) | 23 | Macromolecule modification (GO:0043412) | 20 | |
| Carbohydrate derivative binding (GO:0097367) | 23 | Cellular protein modification process (GO:0006464) | 18 |
Annotation of nonsynonymous SNPs with polymorhic sites in Pfam family.
| Functional gene classifications | Pfam name and ID | Number of SNPs |
|---|---|---|
| Stress responsive | AIG1 family (PF04548) | 5 |
| Metabolism | Glycosyltransferase family 43 | 2 |
| Transporter | Mitochondrial carrier protein | 2 |
| Transcription factor | Myb-like DNA-binding domain | 4 |
Overview of genic SNPs in the genes encoding enzyme of flavonoid biosynthetic pathway. All genes were categorized into general phenylpropanoid, early biosynthetic genes, late biosynthetic genes, and transcription factor (bHLH (Kala4 and Rc), R2R3-MYB).
| Group of genes | Genes name | Total SNPs | Total SNPs (%) |
|---|---|---|---|
| General phenylpropanoid genes | Phenylalanine ammonia-lyase (PAL) | 511 | 28 |
| Early biosynthetic genes (EBGs) | Chalcone synthase (CHS) | 463 | 26 |
| Late biosynthetic genes (LBGs) | Dihydroflavonol reductase (DFR) | 675 | 37 |
| Transcription factors (TFs) | Basic helix-loop-helix (bHLH) | 160 | 9 |
Figure 3Distribution of genic SNPs identified in the flavonoid biosynthesis-related genes of Bali, PH9, MRM16, and MRQ100.
Figure 4Physical positions of 40 nonsynonymous SNPs (nsSNPs) in the 39 flavonoid biosynthetic genes (FBGs) and one transcription factor. Blue circles represent black rice whereas green circles represent red rice. All nsSNPs were distributed on chromosome 1 to chromosome 11. None of the nonsynonymous SNPs reported in chromosome 12. SNP identifier (SNP ID) are listed on the right side of the blue and green circles.