| Literature DB >> 23762267 |
Juexin Wang1, Liang Chen, Yan Wang, Jingfen Zhang, Yanchun Liang, Dong Xu.
Abstract
Salinity is one of the most common abiotic stresses in agriculture production. Salt tolerance of rice (Oryza sativa) is an important trait controlled by various genes. The mechanism of rice salt tolerance, currently with limited understanding, is of great interest to molecular breeding in improving grain yield. In this study, a gene regulatory network of rice salt tolerance is constructed using a systems biology approach with a number of novel computational methods. We developed an improved volcano plot method in conjunction with a new machine-learning method for gene selection based on gene expression data and applied the method to choose genes related to salt tolerance in rice. The results were then assessed by quantitative trait loci (QTL), co-expression and regulatory binding motif analysis. The selected genes were constructed into a number of network modules based on predicted protein interactions including modules of phosphorylation activity, ubiquity activity, and several proteinase activities such as peroxidase, aspartic proteinase, glucosyltransferase, and flavonol synthase. All of these discovered modules are related to the salt tolerance mechanism of signal transduction, ion pump, abscisic acid mediation, reactive oxygen species scavenging and ion sequestration. We also predicted the three-dimensional structures of some crucial proteins related to the salt tolerance QTL for understanding the roles of these proteins in the network. Our computational study sheds some new light on the mechanism of salt tolerance and provides a systems biology pipeline for studying plant traits in general.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23762267 PMCID: PMC3676415 DOI: 10.1371/journal.pone.0064929
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Improved volcano plot of GSE14403.
The horizontal axis represents MergeValue obtained by bootstraps SVM-RFE. The vertical axis shows the –log(p-value) from t-test. Black dots indicate selected probes with MergeValue threshold of 0.5 and t-test p-value threshold of 0.05. Red stars indicate selected probes mapped on QTL region. Blue dots indicate unselected probes.
GO term enrichment analysis on gene selected from microarray by AgriGO.
| GO term | Ontology | Description | Number inselected gene set | Number inBackground | p-value | FDR |
| GO: 0034641 | P | cellular nitrogen compound metabolic process | 17 | 459 | 2.40E-05 | 0.011 |
| GO: 0005506 | F | iron ion binding | 19 | 432 | 7.10E-07 | 0.0002 |
| GO: 0016491 | F | oxidoreductase activity | 32 | 1141 | 4.30E-06 | 0.0006 |
| GO: 0043169 | F | cation binding | 51 | 2582 | 0.00014 | 0.01 |
| GO: 0043167 | F | ion binding | 51 | 2584 | 0.00015 | 0.01 |
| GO: 0020037 | F | heme binding | 9 | 205 | 0.00055 | 0.031 |
| GO: 0046906 | F | tetrapyrrole binding | 9 | 217 | 0.00082 | 0.039 |
| GO: 0031982 | C | Vesicle | 131 | 7454 | 3.40E-06 | 0.0001 |
| GO: 0016023 | C | cytoplasmic membrane-bounded vesicle | 131 | 7445 | 3.20E-06 | 0.0001 |
| GO: 0031988 | C | membrane-bounded vesicle | 131 | 7445 | 3.20E-06 | 0.0001 |
| GO: 0031410 | C | cytoplasmic vesicle | 131 | 7454 | 3.40E-06 | 0.0001 |
| GO: 0031224 | C | intrinsic to membrane | 23 | 820 | 8.10E-05 | 0.002 |
| GO: 0016021 | C | integral to membrane | 21 | 804 | 0.0004 | 0.0083 |
Ontology “P” indicates Biological Process, Ontology “F” indicates Molecular Function, and Ontology “C” indicates Cellular Component. “Number in selected gene set” is the number of genes in the query gene list. “Number in background” is the number of genes in the proteome. P-value represents the statistical significance of the gene enrichment test. FDR means False Discovery Rate.
Evaluation of choosing salt-tolerance genes by QTLs.
| Methods | Sample Size | Original QTL region | Flanking region with length of the QTL region | ||||
| Number of Hits | Percentage of Population | Microarray-QTLs testp-value | Number of Hits | Percentage of sample | Microarray-QTL testp-value | ||
| Improved Volcano Plot | 556 selected | 34 | 6.12% | 4.21e-7 | 94 | 16.91% | 7.44e-18 |
The Number of hits and the percentage of samples depict the number and percentage of the chosen genes that can be mapped into the (extended) QTLs regions. The Microarray-QTL test p-value means the statistical significance of these chosen genes according to the QTLs.
Figure 2Salt-tolerance protein interaction modules that are related to QTLs.
Red nodes represent the proteins located in QTL regions. Yellow nodes represent the proteins located in flanked regions with the lengths of the QTL regions.
Modules of salt-tolerance network and their potential roles in salt tolerance.
| NO | Function | Nodes | Edges | # of QTL hits | # of ExtendedQTL hits | Salt Tolerance Mechanism | QTL hits | Extended QTL hits |
| 1 | phosphorylation | 47 | 372 | 4 | 7 | Signal Transduction/ion homeostasis/ABA | LOC_Os01g61070.1, LOC_Os07g09420.1, LOC_Os07g12240.1, LOC_Os01g59580.1 | LOC_Os01g70840.1/LOC_Os01g70840.2, LOC_Os09g29510.1, LOC_Os01g52640.3 |
| 2 | cytochrome P450 | 17 | 136 | 1 | 1 | ABA | LOC_Os09g26960.1 | |
| 3 | NA | 10 | 11 | 1 | 3 | NA | LOC_Os07g08840.1 | LOC_Os09g20220.1, LOC_Os01g70990.1 |
| 4 | peroxidase | 10 | 45 | 0 | 4 | Antioxidate | LOC_Os06g33080.1, LOC_Os06g35480.1, LOC_Os06g33100.1, LOC_Os06g32990.1 | |
| 5 | aspartic proteinase nepenthesin | 9 | 36 | 1 | 1 | Agreement | LOC_Os01g64840.1 | |
| 6 | Ubiquitin | 8 | 19 | 2 | 4 | ABA/degradation pathway | LOC_Os01g64620.1, LOC_Os01g60730.1/LOC_Os01g60730.2 | LOC_Os09g31031.2, LOC_Os09g27930.1 |
| 7 | starch | 5 | 5 | 2 | 2 | Agreement | LOC_Os04g33920.1, LOC_Os01g63810.1 | |
| 8 | glucosyltransferase | 5 | 10 | 1 | 1 | Agreement | LOC_Os07g10190.1 | |
| 9 | flavonol synthase | 5 | 10 | 1 | 1 | ROS-scavenging | LOC_Os01g61610.1/LOC_Os01g61610.2/LOC_Os01g61610.3 | |
| 10 | glycosyl hydrolases family | 4 | 6 | 0 | 0 | Agreement | ||
| 11 | MYB family transcription factor | 4 | 6 | 0 | 0 | Agreement | ||
| 12 | gibberellin receptor GID1L2 | 4 | 6 | 2 | 2 | Agreement | LOC_Os07g06840.1, LOC_Os07g06860.1 | |
| 13 | O-methyltransferase | 4 | 6 | 0 | 0 | Sodium sequestration | ||
| 14 | Phosphatidylethanolamine-binding | 4 | 6 | 1 | 2 | Cell signalling | LOC_Os04g33570.1, LOC_Os04g41130.1 | |
| 15 | aldo/keto reductase family | 2 | 1 | 1 | 1 | Antioxidate | LOC_Os04g37490.1 | |
| 16 | cytokinin dehydrogenase | 2 | 1 | 0 | 2 | ABA | LOC_Os01g56810.1, LOC_Os06g35650.1 | |
| 17 | cysteine synthase/indole-3-glycerol phosphate lyase | 2 | 1 | 1 | 1 | Agreement | LOC_Os06g36850.1 |
The number of members and interactions of the module are shown in column of “Nodes and Edges.” “# of QTL hits” and “# of Extended QTL hits” depict the number of module members that can be mapped into the (extended) QTLs regions. “Salt tolerance mechanism” shows the process or essential component that could participate in salt tolerance response. “Agreement” means the component is only consistent with suggestions in other studies but there is no direct evidence to link it to known salt tolerance mechanism. “QTL hits” and “Extended QTL hits” illustrate the member of module mapped into the (extended) QTLs regions.
Figure 3The largest module in the salt tolerance protein interaction network.
Black nodes indicate genes covered by QTLs. Yellow nodes indicate genes covered by extended QTLs.
AgriGO term enrichment analysis on the 51 genes in the largest module.
| GO term | Ontology | Description | Number of genes | Number in Background | p-value | FDR |
| GO: 0016310 | P | Phosphorylation | 9 | 1080 | 4.9e-06 | 0.00033 |
| GO: 0006468 | P | protein amino acid phosphorylation | 8 | 887 | 8.8e-06 | 0.00033 |
| GO: 0006796 | P | phosphate metabolic process | 9 | 1206 | 1.2e-05 | 0.00033 |
| GO: 0006793 | P | phosphorus metabolic process | 9 | 1206 | 1.2e-05 | 0.00033 |
| GO: 0043687 | P | post-translational protein modification | 8 | 1236 | 8.8e-05 | 0.002 |
| GO: 0034641 | P | cellular nitrogen compound metabolic process | 5 | 459 | 0.00016 | 0.0027 |
| GO: 0006464 | P | protein modification process | 8 | 1359 | 0.00017 | 0.0027 |
| GO: 0043412 | P | macromolecule modification | 8 | 1406 | 0.00021 | 0.003 |
| GO: 0009069 | P | serine family amino acid metabolic process | 5 | 538 | 0.00034 | 0.0043 |
| GO: 0044267 | P | cellular protein metabolic process | 8 | 2166 | 0.0031 | 0.033 |
| GO: 0006520 | P | cellular amino acid metabolic process | 5 | 918 | 0.0034 | 0.033 |
| GO: 0044106 | P | cellular amine metabolic process | 5 | 918 | 0.0034 | 0.033 |
| GO: 0019538 | P | protein metabolic process | 9 | 2770 | 0.0041 | 0.036 |
| GO: 0009308 | P | amine metabolic process | 5 | 1002 | 0.0049 | 0.041 |
| GO: 0001883 | F | purine nucleoside binding | 9 | 1171 | 9.3e-06 | 0.00012 |
| GO: 0001882 | F | nucleoside binding | 9 | 1171 | 9.3e-06 | 0.00012 |
| GO: 0000166 | F | nucleotide binding | 11 | 1686 | 5.2e-06 | 0.00012 |
| GO: 0005524 | F | ATP binding | 9 | 1071 | 4.6e-06 | 0.00012 |
| GO: 0030554 | F | adenyl nucleotide binding | 9 | 1171 | 9.3e-06 | 0.00012 |
| GO: 0032559 | F | adenyl ribonucleotide binding | 9 | 1074 | 4.7e-06 | 0.00012 |
| GO: 0032555 | F | purine ribonucleotide binding | 9 | 1218 | 1.3e-05 | 0.00012 |
| GO: 0032553 | F | ribonucleotide binding | 9 | 1218 | 1.3e-05 | 0.00012 |
| GO: 0017076 | F | purine nucleotide binding | 9 | 1317 | 2.3e-05 | 0.00019 |
| GO: 0016301 | F | kinase activity | 9 | 1464 | 5.1e-05 | 0.00039 |
| GO: 0004672 | F | protein kinase activity | 7 | 1102 | 0.00026 | 0.0018 |
| GO: 0016773 | F | phosphotransferase activity, alcohol group as acceptor | 7 | 1238 | 0.00051 | 0.0032 |
| GO: 0004674 | F | protein serine/threonine kinase activity | 6 | 949 | 0.00069 | 0.004 |
| GO: 0016772 | F | transferase activity, transferring phosphorus-containing groups | 9 | 2197 | 0.00091 | 0.005 |
| GO: 0005488 | F | Binding | 21 | 8681 | 0.0015 | 0.0076 |
One gene can be in multiple GO families. In “Ontology”, “P” indicates biological process and “F” indicates molecular function.
Figure 4Detected motif in the upstream sequences of the largest module.
Figure 5The whole genome mapping of selected genes and their interactions.
Blue lines in the outer circle represent all the 556 selected genes. Red regions on the chromosome are the QTLs while the grey and yellow regions are the extended QTLs with one QTL length at each side of the flanking region. Inside the circle, green links show the protein-protein interactions among 51 genes in the largest module.
Figure 6Part of the Arabidopsis Plant-Pathogen Interaction pathway in KEGG ( http://www.genome.jp/kegg-bin/show_pathway?ath04626+AT1G18210+AT3G51850+AT1G01340), where white boxes indicate that no genes have been assigned, green boxes have known genes in Arabidopsis, and boxes highlighted in red show the three mapped genes in the largest module.
Figure 7Predicted structural model of protein Os01g0725800.
GSE14403 resistant samples description.
| No | Sample | Genotype | Repeats | Stress condition | Class Label |
| 1 | GSM359902 | FL478 | 1 | Control | +1 |
| 2 | GSM359903 | FL478 | 2 | Control | +1 |
| 3 | GSM359904 | FL478 | 3 | Control | +1 |
| 4 | GSM359905 | FL478 | 1 | Salt-treated | −1 |
| 5 | GSM359906 | FL478 | 2 | Salt-treated | −1 |
| 6 | GSM359907 | FL478 | 3 | Salt-treated | −1 |
| 7 | GSM359913 | IR63731 | 1 | Control | +1 |
| 8 | GSM359914 | IR63731 | 2 | Control | +1 |
| 9 | GSM359915 | IR63731 | 3 | Control | +1 |
| 10 | GSM359916 | IR63731 | 1 | Salt-treated | −1 |
| 11 | GSM359917 | IR63731 | 2 | Salt-treated | −1 |
| 12 | GSM359918 | IR63731 | 3 | Salt-treated | −1 |
| 13 | GSM359919 | Pokkali | 1 | Control | +1 |
| 14 | GSM359920 | Pokkali | 2 | Control | +1 |
| 15 | GSM359921 | Pokkali | 3 | Control | +1 |
| 16 | GSM359922 | Pokkali | 1 | Salt-treated | −1 |
| 17 | GSM359923 | Pokkali | 2 | Salt-treated | −1 |
| 18 | GSM359924 | Pokkali | 3 | Salt-treated | −1 |
QTLs related to salt tolerance in rice from the Gramene QTL library.
| No. | QTL name | Chr | Mapped positions | Note |
| 1 | AQEM001 | 1 | 33,956,950–37,713,775 | |
| 2 | AQEM006 | 1 | 9,820,009–11,232,822 | |
| 3 | AQEM008 | 1 | 33,956,950–37,713,775 | Same as 1 |
| 4 | AQGR001 | 1 | 38,530,957–38,531,467 | |
| 5 | AQGR002 | 3 | 22,798,284–22,830,744 | |
| 6 | AQCL001 | 3 | 484,860–485,333 | |
| 7 | AQCL002 | 4 | 33,663,984–33,664,487 | |
| 8 | AQEM009 | 4 | 19,928,370–22,355,854 | |
| 9 | AQCL003 | 5 | 18,874,932–18,875,558 | |
| 10 | AQEM002 | 6 | 21,605,889–24,919,236 | |
| 11 | AQCL004 | 6 | 22,862,400–22,862,821 | |
| 12 | AQEM003 | 7 | 4,573,316–7,739,951 | |
| 13 | AQEM004 | 7 | 2,633,784–4,575,215 | |
| 14 | AQEM010 | 7 | 2,633,784–4,575,215 | Same as 13 |
| 15 | AQEM005 | 7 | 2,633,784–4,575,215 | Same as 13 |
| 16 | AQEM011 | 7 | 2,633,784–4,575,215 | Same as 13 |
| 17 | AQEM007 | 9 | 14,362,062–17,837,010 |