| Literature DB >> 27492285 |
Eiru Kim1, Sohyun Hwang1, Insuk Lee2.
Abstract
Soybean (Glycine max) is a legume crop with substantial economic value, providing a source of oil and protein for humans and livestock. More than 50% of edible oils consumed globally are derived from this crop. Soybean plants are also important for soil fertility, as they fix atmospheric nitrogen by symbiosis with microorganisms. The latest soybean genome annotation (version 2.0) lists 56 044 coding genes, yet their functional contributions to crop traits remain mostly unknown. Co-functional networks have proven useful for identifying genes that are involved in a particular pathway or phenotype with various network algorithms. Here, we present SoyNet (available at www.inetbio.org/soynet), a database of co-functional networks for G. max and a companion web server for network-based functional predictions. SoyNet maps 1 940 284 co-functional links between 40 812 soybean genes (72.8% of the coding genome), which were inferred from 21 distinct types of genomics data including 734 microarrays and 290 RNA-seq samples from soybean. SoyNet provides a new route to functional investigation of the soybean genome, elucidating genes and pathways of agricultural importance.Entities:
Mesh:
Year: 2016 PMID: 27492285 PMCID: PMC5210602 DOI: 10.1093/nar/gkw704
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of SoyNet construction. Co-functional networks of soybean genes were inferred from various soybean-specific genomics data and from evolutionarily conserved links in other species. The individual networks were then integrated into a single network, SoyNet, using Bayesian statistical framework.
SoyNet and component networks inferred from 21 distinct data types
| Network | Description | Genes | Links |
|---|---|---|---|
| SoyNet | Integrated network | 40 812 | 1 940 284 |
| GM-CX | By co-expression of | 38 300 | 539 521 |
| GM-GN | By gene neighborhood of two bacterial orthologs of | 6072 | 211 084 |
| GM-PG | By phylogenetic profile of | 2665 | 30 695 |
| AT-CC | By co-citation of | 11 482 | 256 676 |
| AT-CX | By co-expression of | 10 494 | 125 109 |
| AT-HT | By high-throughput | 4966 | 22 140 |
| AT-LC | By literature curated | 4261 | 15 971 |
| CE-CC | By co-citation of | 900 | 30 166 |
| CE-CX | By co-expression of | 746 | 6694 |
| DM-CX | By co-expression of | 4244 | 63 988 |
| DM-HT | By high-throughput | 6162 | 29 617 |
| DM-LC | By literature curated | 1379 | 10 494 |
| DR-CX | By co-expression of | 5934 | 281 220 |
| HS-HT | By high-throughput | 4695 | 85 126 |
| HS-LC | By literature curated | 7993 | 124 281 |
| OS-CX | By co-expression of | 12 682 | 253 016 |
| SC-CC | By co-citation of | 7730 | 216 412 |
| SC-CX | By co-expression of | 6985 | 580 194 |
| SC-GT | By genetic interactions of | 5793 | 229 598 |
| SC-HT | By high-throughput | 6754 | 445 574 |
| SC-LC | By literature curated | 5760 | 103 626 |
Figure 2.Assessment of SoyNet and other soybean functional networks. (A) Accuracies of gene pairs for the same agriGO pathways for the given genome coverage of each network are indicated for every bin of 1000 links. The resultant plot indicates that SoyNet outperforms STRING v10 and PlaNet in accuracy for most ranges of genome coverage. (B) Assuming genes connected in the network are functionally associated, SoyNet was assessed for functional modularity of proteins that are differentially expressed during specific abiotic stresses: drought and flooding. For both stress response proteomes, SoyNet shows significantly higher within-group edge counts than the distribution of those by 1000 random protein sets. (C) An abiotic stress response network of soybean genes based on SoyNet. Gene networks that respond to two different abiotic stresses, drought and flooding, have only three common genes, yet they are well-connected, suggesting that pathways for responding to different types of abiotic stresses are functionally interlaced.
Figure 3.Network-based methods for functional predictions implemented in the SoyNet server. (A) Overview of three network-based functional prediction methods. (B) Assessment of pathway predictions by ‘Find new members of a pathway’ with different soybean gene networks, SoyNet, STRING v10, and PlaNet. True positive rate (TPR) was measured for the top 100, 1000 and 10 000 retrieved genes for each of 338 agriGO pathways that have at least four member genes. Similar analyses were also conducted for random gene sets with the same number of member genes for each pathway. (C) Networks of 44 genes that respond to phosphorus deficiency and their intermediate nodes. A network obtained from a z-score threshold of 43 contains four intermediate nodes, whereas that by lower z-score threshold, 41, contains 13 more intermediate nodes. Clicking each gene or edge of the network shows additional information. For example, an intermediate node Glyma.04G195100 is annotated for lignin metabolic process.