| Literature DB >> 35970910 |
Muhammad-Redha Abdullah-Zawawi1,2, Nisha Govender3, Nor Azlan Nor Muhammad1, Norfarhan Mohd-Assaad4, Zamri Zainal1,4, Zeti-Azura Mohamed-Hussein1,4.
Abstract
Sulfur is an essential element required for plant growth and development, physiological processes and stress responses. Sulfur-encoding biosynthetic genes are involved in the primary sulfur assimilation pathway, regulating various mechanisms at the gene, cellular and system levels, and in the biosynthesis of sulfur-containing compounds (SCCs). In this study, the SCC-encoding biosynthetic genes in rice were identified using a sulfur-dependent model plant, the Arabidopsis. A total of 139 AtSCC from Arabidopsis were used as reference sequences in search of putative rice SCCs. At similarity index > 30%, the similarity search against Arabidopsis SCC query sequences identified 665 putative OsSCC genes in rice. The gene synteny analysis showed a total of 477 syntenic gene pairs comprised of 89 AtSCC and 265 OsSCC biosynthetic genes in Arabidopsis and rice, respectively. Phylogenetic tree of the collated (AtSCCs and OsSCCs) SCC-encoding biosynthetic genes were divided into 11 different clades of various sizes comprised of branches of subclades. In clade 1, nearing equal representation of OsSCC and AtSCC biosynthetic genes imply the most ancestral lineage. A total of 25 candidate Arabidopsis SCC homologs were identified in rice. The gene ontology enrichment analysis showed that the rice-Arabidopsis SCC homologs were significantly enriched in the following terms at false discovery rate (FDR) < 0.05: (i) biological process; sulfur compound metabolic process and organic acid metabolic processes, (ii) molecular function; oxidoreductase activity, acting on paired donors with incorporation or reduction of molecular oxygen and (iii) KEGG pathway; metabolic pathways and biosynthesis of secondary metabolites. At less than five duplicated blocks of separation, no tandem duplications were observed among the SCC biosynthetic genes distributed in rice chromosomes. The comprehensive rice SCC gene description entailing syntenic events with Arabidopsis, motif distribution and chromosomal mapping of the present findings offer a foundation for rice SCC gene functional studies and advanced strategic rice breeding.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35970910 PMCID: PMC9378745 DOI: 10.1038/s41598-022-18068-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1The Arabidopsis thaliana (At)-Oryza sativa (Os) sulfur-containing compound (SCC) encoding biosynthetic synteny gene pairs identified with MCScanX. There are 477 syntenic gene pairs (represented by connecting colour lines) between 89 AtSCC and 265 OsSCC biosynthetic genes. The numbering on AT and OS labels denotes the chromosome number.
Figure 2Phylogenetic analysis of collated sulfur-encoding biosynthetic genes in Arabidopsis thaliana and rice (Oryza sativa). The tree is constructed with MEGA software.
Figure 3Motif distribution structure of Arabidopsis thaliana and Oryza sativa sulfur-encoding biosynthetic genes grouped by clades. The A. thaliana (ATXXXXXXX) and O. sativa (OsXXXXXXXX) gene IDs are written in black and red, respectively. Detailed information on the motif sequence information and annotation is available in Supplementary 2.
Figure 4Illustration of the Arabidopsis thaliana and Oryza sativa sulfur-encoding biosynthetic gene structure. Genes are grouped according to clades. The A. thaliana (ATXXXXXXX) and O. sativa (OsXXXXXXXX) gene IDs are written in black and red, respectively. Exons are indicated as yellow round-corner rectangles and introns with solid black lines.
Mining for Oryza sativa sulfur-encoding biosynthetic genes (OsSCC) with Arabidopsis sulfur-encoding biosynthetic gene (AtSCC) input data. Selection criteria are described as following: (1) synteny events; (2) phylogenetic clade; (3) motif composition (Os/At); and (4) number of exon (EN) with AtSCC biosynthetic genes (Os/At).
| Selection of | ||||||
|---|---|---|---|---|---|---|
| Criteria | ||||||
| 1 | 2 | 3 | 4 | |||
| At3g44300 ( | LOC_Os02g42330 | 7.00E-158 | 1 | 3/5 | 5/5 | |
| At1g31180 ( | LOC_Os03g45320 | 0.00E-000 | 1 | 10/10 | 11/8 | |
| At5g14200 ( | LOC_Os03g45320 | 0.00E + 00 | 1 | 10/10 | 11/9 | |
| At1g47600 ( | LOC_Os09g31410 | 2.00E-149 | 4 | 7/9 | 6/13 | |
| LOC_Os08g39860 | 3.00E-158 | 4 | 9/9 | 13/13 | ||
| LOC_Os06g21570 | 1.00E-156 | 4 | 9/9 | 13/13 | ||
| LOC_Os04g39814 | 1.00E-82 | 4 | 7/9 | 7/13 | ||
| LOC_Os11g08120 | 2.00E-41 | 4 | 5/9 | 7/13 | ||
| At1g74090 ( | LOC_Os09g08190 | 8.00E-66 | 8 | 4/4 | 1/1 | |
| At1g03400 ( | LOC_Os06g14390 | 1.00E-84 | 9 | 7/9 | 2/3 | |
| LOC_Os02g17940 | 1.00E-38 | 9 | 9/9 | 4/3 | ||
| LOC_Os03g63900 | 5.00E-37 | 9 | 9/9 | 1/3 | ||
| At5g43440 ( | LOC_Os06g14390 | 5.00E-75 | 9 | 7/9 | 2/3 | |
| LOC_Os09g18450 | 3.00E-111 | 9 | 9/9 | 3/3 | ||
| LOC_Os01g24980 | 7.00E-50 | 9 | 9/9 | 4/3 | ||
| LOC_Os03g63900 | 4.00E-46 | 9 | 9/9 | 1/3 | ||
| LOC_Os03g32470 | 1.00E-35 | 9 | 9/9 | 3/3 | ||
| At5g23010 ( | LOC_Os11g04670 | 7.00E-173 | 9 | 1/1 | 12/10 | |
| LOC_Os12g04440 | 5.00E-173 | 9 | 1/1 | 12/10 | ||
| At3g61400 ( | LOC_Os02g17940 | 1.00E-34 | 9 | 9/9 | 4/3 | |
| LOC_Os01g24980 | 3.00E-39 | 9 | 9/9 | 4/3 | ||
| At2g25450 ( | LOC_Os03g32470 | 8.00E-23 | 9 | 9/9 | 3/3 | |
| At1g76790 ( | LOC_Os02g57760 | 1.00E-40 | 10 | 7/8 | 4/3 | |
| At1g21100 ( | LOC_Os08g06100 | 6.00E-92 | 10 | 8/8 | 2/3 | |
| LOC_Os04g09604 | 3.00E-74 | 10 | 8/8 | 4/3 | ||
| At2g43100 ( | LOC_Os02g43830 | 2.00E-067 | 11 | 1/1 | 1/1 | |
| At1g24100 ( | LOC_Os11g04860 | 4.00E-058 | 11 | 7/8 | 1/2 | |
| LOC_Os09g11290 | 2.00E-12 | 11 | 2/8 | 1/2 | ||
| At2g22330 ( | LOC_Os04g08824 | N/A | 6 | 9/10 | 3/3 | |
| At4g39950 ( | LOC_Os04g08824 | N/A | 6 | 9/10 | 3/3 | |
| At5g61290 ( | LOC_Os10g40570 | N/A | 8 | 8/8 | 7/7 | |
| At5g07800 ( | LOC_Os10g40570 | N/A | 8 | 8/8 | 7/7 | |
| At1g24100 ( | LOC_Os06g23800 | N/A | 11 | 1/1 | 6/2 | |
Figure 6Gene ontology (GO) and pathway enrichment analysis. The bubble plot represents the top 20 significantly enriched terms of the Arabidopsis-rice homologous SCC-encoding geens. The GO terms are presented in (i-ii) biological process and (iii-iv) molecular functions whereas the KEGG pathways are presented in (v-vii). Red arrows represent the terms shared among the Arabidopsis-rice orthologous genes. The results are visualized at P < 0.05 using ShinyGO v0.75 (http://bioinformatics.sdstate.edu/go75/).
Figure 5Sulfur-containing compound (SCC) encoding biosynthetic gene distribution in A. thaliana and O. sativa chromosomes. Grey bars represent the physical maps. The chromosomes are numbered accordingly: A. thaliana;1–5 and O. sativa;1–12. Short lines on grey bars represent the locations of SCCs biosynthetic genes (labelled in red) on each physical map. The different colour boxes expressed adjacent to the gene ID represent the clades.
Sulfur-encoding biosynthetic gene, chromosomal and protein level description in Arabidopsis and rice. Each gene is characterized according to its chromosome number, chromosomal loci, open reading frame (ORF) and physical characteristics of the encoding protein.
| Gene ID | Gene name | Chr | Location | ORF length (bp) | Protein | ||
|---|---|---|---|---|---|---|---|
| Length | PI | MW (kDa) | |||||
| AT1G03400 | 1 | 842,747–844,190 | 1056 | 351 | 6.15 | 39.13 | |
| AT1G21100 | 1 | 7,386,839–7,388,428 | 1122 | 373 | 5.01 | 40.869 | |
| AT1G24100 | 1 | 8,525,435–8,527,087 | 1383 | 460 | 4.63 | 51.002 | |
| AT1G31180 | 1 | 11,142,714–11,144,633 | 1215 | 404 | 5.55 | 43.847 | |
| AT1G47600 | 1 | 17,491,732–17,494,759 | 1536 | 511 | 8.21 | 57.542 | |
| AT1G74090 | 1 | 27,862,909–27,864,193 | 1053 | 350 | 5.5 | 40.465 | |
| AT1G76790 | 1 | 28,822,186–28,823,673 | 1104 | 367 | 4.76 | 40.222 | |
| AT2G22330 | 2 | 9,488,554–9,491,187 | 1632 | 543 | 8.17 | 61.437 | |
| AT2G25450 | 2 | 10,829,916–10,831,655 | 1080 | 359 | 6.24 | 40.351 | |
| AT2G43100 | 2 | 17,920,660–17,921,689 | 771 | 256 | 6.01 | 27.043 | |
| AT3G44300 | 3 | 15,983,311–15,985,535 | 1020 | 339 | 5.24 | 37.153 | |
| AT3G61400 | 3 | 22,718,956–22,720,397 | 1113 | 370 | 5.64 | 41.601 | |
| AT4G39950 | 4 | 18,525,246–18,527,579 | 1626 | 541 | 8.73 | 61.347 | |
| AT5G07800 | 5 | 2,486,576–2,489,296 | 1383 | 460 | 6.21 | 52.337 | |
| AT5G14200 | 5 | 4,576,202–4,578,402 | 1230 | 409 | 5.81 | 44.161 | |
| AT5G23010 | 5 | 7,703,092–7,706,896 | 1521 | 506 | 7.28 | 55.125 | |
| AT5G43440 | 5 | 17,455,233–17,456,657 | 1098 | 365 | 6.18 | 40.86 | |
| AT5G61290 | 5 | 24,648,558–24,650,815 | 1386 | 461 | 4.9 | 52.406 | |
| LOC_Os01g24980 | 1 | 14,077,629–14,080,716 | 1035 | 344 | 5.62 | 38.731 | |
| LOC_Os10g40570 | 10 | 21,724,416–21,727,181 | 1449 | 482 | 5.69 | 53.726 | |
| LOC_Os11g04670 | 11 | 1,989,201–1,995,087 | 1908 | 635 | 6.46 | 68.448 | |
| LOC_Os11g04860 | 11 | 2,067,727–2,069,430 | 1449 | 482 | 5.38 | 54.068 | |
| LOC_Os11g08120 | 11 | 4,262,908–4,265,304 | 579 | 197 | 9.81 | 22.062 | |
| LOC_Os12g04440 | 12 | 1,888,943–1,894,920 | 1908 | 635 | 6.46 | 68.461 | |
| LOC_Os02g17940 | 2 | 10,386,279–10,390,290 | 1056 | 351 | 5.1 | 40.118 | |
| LOC_Os02g42330 | 2 | 25,459,397–25,462,730 | 1074 | 357 | 5.75 | 37.985 | |
| LOC_Os02g43830 | 2 | 26,465,591–26,469,280 | 774 | 257 | 7.61 | 26.443 | |
| LOC_Os02g57760 | 2 | 35,370,515–35,373,858 | 1098 | 365 | 5.34 | 38.647 | |
| LOC_Os03g32470 | 3 | 18,570,651–18,572,508 | 1650 | 549 | 8.36 | 60.587 | |
| LOC_Os03g45320 | 3 | 25,586,205–25,590,717 | 1227 | 408 | 5.86 | 43.371 | |
| LOC_Os03g63900 | 3 | 36,103,513–36,105,068 | 1089 | 362 | 5.97 | 40.792 | |
| LOC_Os04g08824 | 4 | 4,869,932–4,872,151 | 1476 | 491 | 9.26 | 55.727 | |
| LOC_Os04g09604 | 4 | 5,161,917–5,167,494 | 1137 | 378 | 5.33 | 40.594 | |
| LOC_Os04g39814 | 4 | 23,715,443–23,721,731 | 951 | 316 | 6.3 | 35.548 | |
| LOC_Os06g14390 | 6 | 8,031,719–8,035,243 | 1098 | 365 | 5.23 | 39.169 | |
| LOC_Os06g21570 | 6 | 12,437,997–12,442,742 | 1515 | 504 | 7.18 | 57.756 | |
| LOC_Os06g23800 | 6 | 13,905,082–13,909,018 | 711 | 236 | 8.94 | 25.725 | |
| LOC_Os08g06100 | 8 | 3,337,751–3,340,959 | 1107 | 368 | 5.41 | 39.75 | |
| LOC_Os08g39860 | 8 | 25,250,314–25,254,656 | 1500 | 499 | 8.53 | 56.804 | |
| LOC_Os09g08190 | 9 | 4,250,758–4,251,917 | 843 | 280 | 6.8 | 31.922 | |
| LOC_Os09g11290 | 9 | 6,266,198–6,266,539 | 342 | 113 | 5.25 | 12.531 | |
| LOC_Os09g18450 | 9 | 11,309,063–11,310,776 | 1050 | 349 | 6.19 | 39.001 | |
| LOC_Os09g31410 | 9 | 18,889,721–18,893,801 | 1401 | 466 | 9.02 | 53.08 | |