| Literature DB >> 18503085 |
Chung-Yen Lin1, Chia-Hao Chin, Hsin-Hung Wu, Shu-Hwa Chen, Chin-Wen Ho, Ming-Tat Ko.
Abstract
One major task in the post-genome era is to reconstruct proteomic and genomic interacting networks using high-throughput experiment data. To identify essential nodes/hubs in these interactomes is a way to decipher the critical keys inside biochemical pathways or complex networks. These essential nodes/hubs may serve as potential drug-targets for developing novel therapy of human diseases, such as cancer or infectious disease caused by emerging pathogens. Hub Objects Analyzer (Hubba) is a web-based service for exploring important nodes in an interactome network generated from specific small- or large-scale experimental methods based on graph theory. Two characteristic analysis algorithms, Maximum Neighborhood Component (MNC) and Density of Maximum Neighborhood Component (DMNC) are developed for exploring and identifying hubs/essential nodes from interactome networks. Users can submit their own interaction data in PSI format (Proteomics Standards Initiative, version 2.5 and 1.0), tab format and tab with weight values. User will get an email notification of the calculation complete in minutes or hours, depending on the size of submitted dataset. Hubba result includes a rank given by a composite index, a manifest graph of network to show the relationship amid these hubs, and links for retrieving output files. This proposed method (DMNC || MNC) can be applied to discover some unrecognized hubs from previous dataset. For example, most of the Hubba high-ranked hubs (80% in top 10 hub list, and >70% in top 40 hub list) from the yeast protein interactome data (Y2H experiment) are reported as essential proteins. Since the analysis methods of Hubba are based on topology, it can also be used on other kinds of networks to explore the essential nodes, like networks in yeast, rat, mouse and human. The website of Hubba is freely available at http://hub.iis.sinica.edu.tw/Hubba.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18503085 PMCID: PMC2447731 DOI: 10.1093/nar/gkn257
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Inside Hubba. In ‘user mode’ (top), user submits PPI dataset in PSI format, tabulated format with/without weight values and fill the note for this submission on the input page. After the calculation, user will receive the notification to views the result on the result display page. The result will be included general properties of submitted network, list of hub proteins with the graph of interaction among them, Cytoscape's format output and lists of protein ranking scores by several algorithms. In ‘system mode’ (bottom), HubbaD controls the entire process and calls programs to process input data, send notification to users and generate output files.
The percentage of top n ranked proteins, identified from the yeast protein interaction dataset (yeast20070107.lst) by six scoring methods and by the double screening scheme (DMNC || MNC, Hubba algorithm), to be yeast essential proteins
| Scoring methods | DSS | ||||||
|---|---|---|---|---|---|---|---|
| Degree (%) | BN (%) | EPC (%) | SC (%) | MNC (%) | DMNC (%) | DMNC || MNC (%) | |
| Top10 | 60.00 | 50.00 | 30.00 | 30.00 | 40.00 | 80.00 | 100.00 |
| Top20 | 35.00 | 40.00 | 30.00 | 25.00 | 50.00 | 80.00 | 95.00 |
| Top30 | 43.33 | 46.67 | 30.00 | 23.33 | 46.67 | 76.67 | 83.33 |
| Top40 | 45.00 | 47.50 | 30.00 | 20.00 | 45.00 | 70.00 | 72.50 |
| Top50 | 50.00 | 46.00 | 26.00 | 18.00 | 48.00 | 66.00 | 76.00 |
| Top60 | 50.00 | 48.33 | 31.67 | 15.00 | 46.67 | 61.67 | 70.00 |
| Top70 | 48.57 | 47.14 | 34.29 | 14.29 | 48.57 | 57.14 | 67.14 |
| Top80 | 47.50 | 46.25 | 35.00 | 12.50 | 48.75 | 55.50 | 65.50 |
| Top90 | 47.78 | 46.67 | 37.78 | 12.22 | 47.78 | 56.67 | 63.33 |
| Top100 | 48.00 | 45.00 | 41.00 | 14.00 | 48.00 | 58.00 | 63.00 |
For example, 8 of the top 10 proteins found by DMNC has been identified as yeast essential proteins [% = (8/10) × 100%].
Figure 2.The top10 nodes of the dataset yeast20070107.lst explored in Hubba. A coloring scheme is used to display the ranking score of each node. When the node is redder, the ranking of it will be higher. Solid lines indicate the connected nodes interact to each other (direct connections), while dotted lines with a number indicate the shortest path (distance) between two linking nodes (indirect connections).