| Literature DB >> 27515999 |
Qi Liu1, Changjun Ding1, Yanguang Chu1, Jiafei Chen1, Weixi Zhang1, Bingyu Zhang1, Qinjun Huang1, Xiaohua Su1,2.
Abstract
Poplar is not only an important resource for the production of paper, timber and other wood-based products, but it has also emerged as an ideal model system for studying woody plants. To better understand the biological processes underlying various traits in poplar, e.g., wood development, a comprehensive functional gene interaction network is highly needed. Here, we constructed a genome-wide functional gene network for poplar (covering ~70% of the 41,335 poplar genes) and created the network web service PoplarGene, offering comprehensive functional interactions and extensive poplar gene functional annotations. PoplarGene incorporates two network-based gene prioritization algorithms, neighborhood-based prioritization and context-based prioritization, which can be used to perform gene prioritization in a complementary manner. Furthermore, the co-functional information in PoplarGene can be applied to other woody plant proteomes with high efficiency via orthology transfer. In addition to poplar gene sequences, the webserver also accepts Arabidopsis reference gene as input to guide the search for novel candidate functional genes in PoplarGene. We believe that PoplarGene (http://bioinformatics.caf.ac.cn/PoplarGene and http://124.127.201.25/PoplarGene) will greatly benefit the research community, facilitating studies of poplar and other woody plants.Entities:
Mesh:
Year: 2016 PMID: 27515999 PMCID: PMC4981870 DOI: 10.1038/srep31356
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The overall workflow of PoplarGene construction.
PoplarGene network construction included three main steps: (a) inferring functional gene pairs; (b) assigning likelihood ratio scores for network links and (c) integrating component network linkages. The PoplarGene web server was then developed based on network linkages and other related functionalities.
Summary of the PoplarGene network and 23 network components.
| (Network source) description | #Nodes (coding genes coverage, %) | #Links |
|---|---|---|
| PoplarGene network | 29 049 (70.3) | 1 967 631 |
| (PT-CX) Co-expression network of Poplar genes using microarray experiments | 7 930 (19.2) | 282 144 |
| (PT-DC) Protein domains co-occurrence between two Poplar genes | 3 022 (7.3) | 27 096 |
| (PT-GN) Neighborhood conservation of Poplar genes in prokaryotic genomes | 8 881(21.5) | 213 509 |
| (PT-PG) The similarity of phylogenetic profile between Poplar genes | 11 623 (28.1) | 305 305 |
| (AT-CC) Transfer of co-citation links in A. thaliana orthology network | 7 243 (17.5) | 65 474 |
| (AT-CX) Transfer of co-expression links in A. thaliana orthology network | 18 290 (44.2) | 418 367 |
| (AT-HT) Transfer of high-throughput PPI in A. thaliana orthology network | 3 442 (8.3) | 6 390 |
| (AT-LC) Transfer of literature curated PPI in A. thaliana orthology network | 2 290 (5.5) | 3 952 |
| (CE-CX) Transfer of co-expression links in C. elegans orthology network | 6 273 (15.2) | 104 876 |
| (CE-HT) Transfer of high-throughput PPI in C. elegans orthology network | 1 781 (4.3) | 5 296 |
| (CE-LC) Transfer of co-citation links in C. elegans orthology network | 1 243 (3.0) | 4 873 |
| (DM-CX) Transfer of co-expression links in D. melanogaster orthology network | 1 719 (4.2) | 15 033 |
| (DM-HT) Transfer of high-throughput PPI in D. melanogaster orthology network | 1 272 (3.1) | 3 120 |
| (DM-LC) Transfer of literature curated PPI in D. melanogaster orthology network | 104 (0.3) | 183 |
| (HS-CX) Transfer of co-expression links in H. sapiens orthology network | 5 100 (12.3) | 88 318 |
| (HS-HT) Transfer of high-throughput PPI in H. sapiens orthology network | 1 661(4.0) | 5 716 |
| (HS-LC) Transfer of literature curated PPI in H. sapiens orthology network | 5 176 (12.5) | 55 102 |
| (OS-CX) Transfer of co-expression links in O. sativa orthology network | 3 187 (7.7) | 30 275 |
| (OS-LC) Transfer of literature curated PPI in O. sativa orthology network | 28 (0.1) | 80 |
| (SC-CC) Transfer of co-citation links in S. cerevisiae orthology network | 6 396 (15.5) | 146 710 |
| (SC-CX) Transfer of co-expression links in S. cerevisiae orthology network | 4 866 (11.8) | 141 350 |
| (SC-HT) Transfer of high-throughput PPI in S. cerevisiae orthology network | 5 486 (13.3) | 274 397 |
| (SC-LC) Transfer of literature curated PPI in S. cerevisiae orthology network | 6 086 (14.7) | 147 250 |
Figure 2Summary of quality assessment of the PoplarGene network.
(A) The gene linkages derived from 23 diverse functional genomics data sets, representing millions of experimental or computational observations, were integrated into a comprehensive network with higher accuracy and genome coverage than any single data set. The integrated network contains 1,967,631 linkages and 29,049 genes (>70% of the P. trichocarpa coding genome). The x-axis represents the log-scaled coverage of the P. trichocarpa coding genome covered by linkages derived from the corresponding datasets (curves). The y-axis indicates the accuracy of functional linkages, measured as the cumulative log likelihood of linked genes to shared GO-BP term annotations tested using 0.632 bootstrapping and plotted for each bin of 1,000 linkages. The datasets were designated AA-BB, with AA indicating species of data origin (AT, A. thaliana; CE, C. elegans; DM, D. melanogaster; HS, H. sapiens; OS, O. sativa; PT, P. trichocarpa; SC, S. cerevisiae) and BB indicating data type (CC, co-citation; CX, mRNA coexpression; DC, domain co-occurrence; GN, gene neighbor; LC, literature curated protein interactions; HT, high-throughput experimental screening of interaction; PG, phylogenetic profiles). (B) Venn diagram of the gene linkages, indicating that the PoplarGene network contains many more linkages than those derived by orthology transfer from the Arabidopsis gene network AraNet12 and the rice gene network RiceNet32 and that they have higher linkage accuracy. Linkage accuracy was measured using an independent set of reference linkages obtained from the agriGO database. (C) Precision-recall analysis comparing the PoplarGene network to the AraNet-derived network and the RiceNet-derived network. (D) Box-and-whisker plot of network predictive power for 277 agriGO BP terms (with more than four annotated genes), as measured by the area under the curve from ROC analysis.
Figure 3Screenshots of the PoplarGene web service.
Five modules were included in the PoplarGene web service: (A) Neighborhood-based gene priority module; (B) Gene search module; (C) Context-based gene priority module; (D) Interaction transferring module and (E) Gene extensive annotation module. (F) Other tools provided by the PoplarGene web service.
Figure 4Case studies using PoplarGene.
(A) Validation of new candidate poplar genes for secondary xylem development based on the neighborhood-based gene priority method. (B) Orthology transfer of PoplarGene network linkage to Eucalyptus grandis.