| Literature DB >> 35712577 |
Shashi Rekha Thummala1, Haritha Guttikonda2, Shrish Tiwari1, Rajeshwari Ramanan1, Niranjan Baisakh3, Sarla Neelamraju2, Satendra K Mangrauthia2.
Abstract
The genomes of an elite rice restorer line KMR3 (salinity-sensitive) and its salinity-tolerant introgression line IL50-13, a popular variety of coastal West Bengal, India, were sequenced. High-quality paired-end reads were obtained for KMR3 (147.6 million) and IL50-13 (131.4 million) with a sequencing coverage of 30X-39X. Scaffolds generated from the pre-assembled contigs of each sequenced genome were mapped separately onto the reference genome of Oryza sativa ssp. japonica cultivar Nipponbare to identify genomic variants in terms of SNPs and InDels. The SNPs and InDels identified for KMR3 and IL50-13 were then compared with each other to identify polymorphic SNPs and InDels unique and common to both the genomes. Functional enrichment analysis of the protein-coding genes with unique InDels identified GO terms involved in protein modification, ubiquitination, deubiquitination, peroxidase activity, and antioxidant activity in IL50-13. Linoleic acid metabolism, circadian rhythm, and alpha-linolenic acid metabolism pathways were enriched in IL50-13. These GO terms and pathways are involved in reducing oxidative damage, thus suggesting their role in stress responses. Sequence analysis of QTL markers or genes known to be associated with grain yield and salinity tolerance showed polymorphism in 20 genes, out of which nine were not previously reported. These candidate genes encoded Nucleotide-binding adaptor shared by APAF-1, R proteins, and CED-4 (NB-ARC) domain-containing protein, cyclase, receptor-like kinase, topoisomerase II-associated protein PAT1 domain-containing protein, ion channel regulatory protein, UNC-93 domain-containing protein, subunit A of the heteromeric ATP-citrate lyase, and three conserved hypothetical genes. Polymorphism was observed in the coding, intron, and untranslated regions of the genes on chromosomes 1, 2, 4, 7, 11, and 12. Genes showing polymorphism between the two genomes were considered as sequence-based new candidates derived from Oryza rufipogon for conferring high yield and salinity tolerance in IL50-13 for further functional studies.Entities:
Keywords: NGS; Oryza sativa; deletion; insertion; restorer line; salt stress; wild rice
Year: 2022 PMID: 35712577 PMCID: PMC9197125 DOI: 10.3389/fpls.2022.810373
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Summary of the sequencing data for KMR3 and IL50-13 genomes.
| KMR3 | 50-13 | |
| Total Reads | 151,715,750 | 134,489,967 |
| HQ PE Reads | 147,663,910 | 131,414,378 |
| Assembled Reads | 116,903,196 | 99,847,962 |
| Percentage Assembled | 77.0541 | 74.2419 |
| Total No. of Contigs | 61,648 | 73,557 |
| Min. Contig Length (bp) | 300 | 300 |
| Max. Contig Length (bp) | 143,306 | 75,580 |
| N50 Contig size (bp) | 14,062 | 9,802 |
| Total No. of Scaffolds | 55,719 | 64,909 |
| Min. Scaffolds Size (bp) | 300 | 300 |
| Max. Scaffolds Size (bp) | 143,306 | 81,532 |
| N50 Scaffold Size | 14,563 | 10,489 |
| Mean coverage | 38.02 | 32.73 |
| Mean insert size (bp) | 295.43 | 296.62 |
Chromosome-wise distribution of SNPs and InDels in KMR3 and IL50-13 relative to Nipponbare.
| SNPs | Insertions | Deletions | ||||
|
|
|
| ||||
| KMR3 | IL50-13 | KMR3 | IL50-13 | KMR3 | IL50-13 | |
| Chr. 1 | 3,393 (75.4) | 3,082 (68.4) | 16 (0.3) | 19 (0.4) | 22 (0.4) | 31 (0.6) |
| Chr. 2 | 2,941 (80.1) | 2,476 (67.4) | 19 (0.5) | 18 (0.4) | 30 (0.8) | 16 (0.4) |
| Chr. 3 | 2,565 (68.7) | 1,835 (49.1) | 14 (0.3) | 9 (0.2) | 28 (0.7) | 18 (0.4) |
| Chr. 4 | 4,635 (128.7) | 3,420 (95.0) | 16 (0.4) | 8 (0.2) | 15 (0.4) | 13 (0.3) |
| Chr. 5 | 3,212 (106.7) | 2,841 (94.3) | 15 (0.4) | 15 (0.4) | 23 (0.7) | 21 (0.6) |
| Chr. 6 | 3,694 (115.0) | 2,249 (70.0) | 16 (0.4) | 9 (0.2) | 24 (0.7) | 20 (0.6) |
| Chr. 7 | 4,096 (135.1) | 3,417 (112.7) | 22 (0.7) | 16 (0.5) | 30 (0.9) | 19 (0.6) |
| Chr. 8 | 2,699 (94.7) | 2,440 (85.6) | 15 (0.5) | 10 (0.3) | 28 (0.9) | 15 (0.5) |
| Chr. 9 | 2,430 (102.1) | 1,656 (69.5) | 22 (0.9) | 20 (0.8) | 23 (0.9) | 18 (0.7) |
| Chr. 10 | 3,512 (148.1) | 3,146 (132.7) | 24 (1.0) | 11 (0.4) | 19 (0.8) | 22 (0.9) |
| Chr. 11 | 9,171 (293.9) | 7,961 (255.1) | 16 (0.5) | 18 (0.5) | 27 (0.8) | 31 (0.9) |
| Chr. 12 | 2,854 (103.4) | 2,671 (96.7) | 10 (0.3) | 12 (0.4) | 11 (0.3) | 16 (0.5) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Numbers in parenthesis show the average number of SNPs and InDels obtained per Mb, Chr. = chromosome.
FIGURE 1Venn diagrams showing the number of unique and common SNPs (A) and InDels (B) between KMR3 and IL50-13 genomes.
Chromosome-wise distribution of SNPs and InDels unique to KMR3 and IL50-13 genomes.
| SNPs | Insertions | Deletions | ||||
|
|
|
| ||||
| KMR3 | IL50-13 | KMR3 | IL50-13 | KMR3 | IL50-13 | |
| Chr. 1 | 2,075 (46.1) | 1,772 (39.3) | 10 (0.2) | 13 (0.2) | 18 (0.4) | 27 (0.6) |
| Chr. 2 | 1,743 (47.4) | 1,279 (34.8) | 10 (0.2) | 10 (0.2) | 24 (0.6) | 9 (0.2) |
| Chr. 3 | 1,839 (49.3) | 1,110 (29.7) | 12 (0.3) | 7 (0.1) | 21 (0.5) | 11 (0.2) |
| Chr. 4 | 2,500 (69.4) | 1,286 (35.7) | 14 (0.3) | 6 (0.1) | 14 (0.3) | 12 (0.3) |
| Chr. 5 | 1,891 (62.8) | 1,523 (50.5) | 9 (0.2) | 10 (0.3) | 18 (0.5) | 16 (0.5) |
| Chr. 6 | 2,711 (84.4) | 1,267 (39.4) | 9 (0.2) | 5 (0.1) | 18 (0.5) | 18 (0.5) |
| Chr. 7 | 2,179 (71.9) | 1,500 (49.5) | 0 (0.0) | 16 (0.5) | 22 (0.7) | 11 (0.3) |
| Chr. 8 | 1,743 (61.1) | 1,487 (52.1) | 12 (0.4) | 6 (0.2) | 20 (0.7) | 8 (0.2) |
| Chr. 9 | 1,577 (66.2) | 803 (33.7) | 15 (0.6) | 13 (0.5) | 17 (0.7) | 12 (0.5) |
| Chr. 10 | 2,977 (125.6) | 2,613 (110.2) | 20 (0.8) | 6 (0.2) | 19 (0.8) | 22 (0.9) |
| Chr. 11 | 3,748 (120.1) | 2,538 (81.3) | 12 (0.3) | 14 (0.4) | 20 (0.6) | 24 (0.7) |
| Chr. 12 | 1,759 (63.7) | 1,575 (57.0) | 8 (0.2) | 9 (0.3) | 6 (0.2) | 12 (0.4) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Density of SNPs and InDels are shown in parenthesis. Chr. = chromosome.
FIGURE 2Circos plots for the chromosome-wise distribution of unique SNPs and InDels in KMR3.
FIGURE 3Circos plots for the chromosome-wise distribution of unique SNPs and InDels in IL50-13.
FIGURE 4Annotation of SNPs obtained in KMR3 (A) and IL50-13 (B). The SNPs obtained were categorized based on their occurrence in the genome, into genic and intergenic, which were again categorized into coding, intron, and UTR. The coding SNPs were further categorized as synonymous and non-synonymous based on their annotation to the codon position.
FIGURE 5Annotation of InDels obtained in KMR3 (A) and IL50-13 (B) genome. The InDels obtained in the genomes were annotated to different locations of the genome and categorized into genic and intergenic. The InDels were categorized into coding, intron, and UTRs (both 5′ and 3′).
FIGURE 6Degree distribution and skewness of the non-synonymous SNPs in KMR3 (A) and IL50-13 (B). In KMR3, the degree distribution and skewness of the nsSNPs per 1-kb region was calculated for the 87 genes that were annotated to the 347 nsSNPs. The outlier value (dotted line) calculated suggests that 12 genes had a nsSNP density > 8.5 per 1 kb of the gene in KMR3 (all values corresponding to the blue bars represented at the right side of the dotted line). In IL50-13, the degree distribution and skewness of the nsSNPs per 1-kb region was calculated for the 72 genes that were annotated to the 254 nsSNPs. The outlier value (dotted line) calculated suggests that nine genes had a nsSNP density > 7.5 per 1 kb of the gene in IL50-13 (all values corresponding to the blue color bars represented at the right side of the dotted line).
FIGURE 7Frequency distribution of the length of InDels in KMR3 (A) and IL50-13 (B) genome. The x-axis represents the length of the deletions (histograms with red color fill) and insertions (histograms with cyan color fill) in the genome and y-axis represents the frequency of each of the InDels of the specified length.
Gene Ontology of unique InDels in the protein-coding regions of KMR3 and IL50-13.
| Gene Ontology | GO Term | |
|
| ||
| GO:0065008 | Regulation of biological quality | 0.0264 |
| GO:1901615 | Organic hydroxy compound metabolic process | 0.0338 |
| GO:1901617 | Organic hydroxy compound biosynthetic process | 0.0338 |
| GO:0005507 | Copper ion binding | 0.0375 |
| GO:0050896 | Response to stimulus | 0.0403 |
| GO:0004497 | Monooxygenase activity | 0.0448 |
| GO:0065007 | Biological regulation | 0.0457 |
| GO:0009055 | Electron carrier activity | 0.0484 |
|
| ||
| GO:0070646 | Protein modification by small protein removal | 0.0398 |
| GO:0070647 | Protein modification by small protein conjugation or removal | 0.0398 |
| GO:0004842 | Ubiquitin-protein transferase activity | 0.0398 |
| GO:0016579 | Protein deubiquitination | 0.0398 |
| GO:0019787 | Ubiquitin-like protein transferase activity | 0.0398 |
| GO:0004601 | Peroxidase activity | 0.0469 |
| GO:0016684 | Oxidoreductase activity, acting on peroxide as acceptor | 0.0469 |
| GO:0016209 | Antioxidant activity | 0.0504 |
KEGG pathway enrichment analysis of unique InDels in the protein-coding regions of KMR3 and IL50-13.
| KEGG ID | Pathway name | KEGG Genes Id | Gene description | |
|
| ||||
| osa00905 | Brassinosteroid biosynthesis | dosa:Os04t0469800-01 | D11, DWARF_SHINKANEAIKOKU_OR_NOHRIN_28; Cytochrome P450 724B1 (EC 1.14.-.-) (OsDWARF11) (Dwarf protein 11); | 0.0068 |
| osa03410 | Base excision repair | dosa:Os08t0304900-01 | Similar to AtMMH-1 protein (F6D8.28 protein); | 0.0297 |
|
| ||||
| osa00591 | Linoleic acid metabolism | dosa:Os08t0509100-01 | LOX8, LIPOXYGENASE_8; Similar to Lipoxygenase, chloroplast precursor (EC 1.13.11.12); | 0.0127 |
| osa04712 | Circadian rhythm – plant | dosa:Os05t0571000-01 | WD40 repeat-like domain-containing protein; | 0.0206 |
| osa00592 | alpha-Linolenic acid metabolism | dosa:Os08t0509100-01 | LOX8, LIPOXYGENASE_8; Similar to Lipoxygenase, chloroplast precursor (EC 1.13.11.12); | 0.0300 |
Effect of unique InDels in the downstream region of genes in KMR3 and IL50-13.
| KEGG ID | Gene Id | InDel position | InDel change | Downstream region (bp) | InDel effect |
|
| |||||
| osa00905 | OS04G0469800 | 23,465,198 | TG - TGG | 1,967 | Modifier |
| osa03410 | OS08G0304900 | 12,720,895 | TC - T | 1,963 | Modifier |
|
| |||||
| osa04712 | OS05G0571000 | 28,442,608 | TTTTA - T | 2,615 | Modifier |
| osa00591/osa00592 | OS08G0509100 | 25,254,561 | TTT - TTTCTTTCCTTCTT | 4,526 | Modifier |
List of 20 genes from the four datasets that showed polymorphism between KMR3 and IL50-13.
| S. No | Trait | Gene Id | Coordinates (strand) | Gene length (bp) | Gene description |
|
| |||||
| 1 | Yield | Os04g0480600 | chr04:24044380-24045225 | 846 | Similar to Cytochrome P450 71A1 (EC 1.14.-.-) (CYPLXXIA1) (ARP-2) (Os04t0480600-01). |
| 2 | Yield | Os04g0480650 | chr04:24046426-24047084 | 659 | Similar to OSIGBa0158F13.10 protein (Os04t0480650-00). |
| 3 | Yield | Os07g0669200 | chr07:28277299-28279408 | 1,899 | Similar to GTP1/OBG family protein (Os07t0669200-00). |
| 4 | Salt tolerance | Os12g0568200 | chr12:23383189-23384177 | 989 | Metallothionein-like protein type 1 (Os12t0568200-01). |
| 5 | Salt tolerance | Os12g0568500 | chr12:23390501-23391407 | 907 | Metallothionein-like protein type 1 (Os12t0568500-01). |
| 6 | Salt tolerance | Os12g0564800 | chr12:23167167-23171951 | 4,785 | NB-ARC domain-containing protein (Os12t0564800-01). |
| 7 | Salt tolerance | Os12g0565100 | chr12:23182563-23188498 | 5,936 | NB-ARC domain-containing protein (Os12t0565100-01); NB-ARC domain-containing protein (Os12t0565100-02). |
|
| |||||
| 1 | Salt tolerance | Os01g0350100 | chr01:13978048-13979525 | 1,478 | Hypothetical protein (Os01t0350100-00). |
| 2 | Salt tolerance | Os01g0362100 | chr01:14754835-14764507 | 9,673 | Esterase/lipase/thioesterase domain-containing protein (Os01t0362100-01). |
| 3 | Salt tolerance | Os02g0187100 | chr02:4831212-4833985 | 2,774 | Similar to cyclase (Os02t0187100-00). |
| 4 | Salt tolerance | Os02g0194400 | chr02:5259822-5266512 | 1,520 | Similar to Receptor-like kinase (fragment) (Os02t0194400-01); Leucine-rich repeat receptor-like kinase, Target gene of microRNA390, Cadmium stress response (Os02t0194400-02). |
| 5 | Salt tolerance | Os02g0294700 | chr02:11209135-11211943 | 2,809 | Topoisomerase II-associated protein PAT1 domain-containing protein (Os02t0294700-01). |
| 6 | Salt tolerance | Os11g0606800 | chr11:23415625-23418653 | 3,029 | Similar to NB-ARC domain-containing protein (Os11t0606800-00). |
| 7 | Salt tolerance | Os11g0618800 | chr11:24096264-24097002 | 739 | Hypothetical conserved gene (Os11t0618800-00). |
| 8 | Salt tolerance | Os12g0566800 | chr12:23302307-23306305 | 3,999 | Ion channel regulatory protein, UNC-93 domain-containing protein (Os12t0566800-01). |
| 9 | Salt tolerance | Os12g0566200 | chr12:23271514-23272915 | 1,402 | Conserved hypothetical protein (Os12t0566200-01). |
| 10 | Salt tolerance | Os12g0566300 | chr12:23275214-23279087 | 3,874 | Subunit A of the heteromeric ATP-citrate lyase, Negative regulation of cell death, Disease resistance (Os12t0566300-01). |
| 11 | Salt tolerance | Os12g0566500 | chr12:23290875-23292674 | 1,800 | Conserved hypothetical protein (Os12t0566500-01). |
| 12 | Salt tolerance | Os02g0729700 | chr02:30381303-30383486 | 2,184 | Similar to HAHB-7 (fragment) (Os02t0729700-01); Similar to HAHB-7 (fragment) (Os02t0729700-02). |
| 13 | Salt tolerance | Os04g0610900 | chr04:30974167-30979824 | 5,658 | Similar to EDR1 (Os04t0610900-01). |