| Literature DB >> 22748121 |
Ruth Dannenfelser1, Neil R Clark, Avi Ma'ayan.
Abstract
BACKGROUND: Protein-protein, cell signaling, metabolic, and transcriptional interaction networks are useful for identifying connections between lists of experimentally identified genes/proteins. However, besides physical or co-expression interactions there are many ways in which pairs of genes, or their protein products, can be associated. By systematically incorporating knowledge on shared properties of genes from diverse sources to build functional association networks (FANs), researchers may be able to identify additional functional interactions between groups of genes that are not readily apparent.Entities:
Mesh:
Year: 2012 PMID: 22748121 PMCID: PMC3472228 DOI: 10.1186/1471-2105-13-156
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Process of creating FANs. The process of creating FANs involves gathering datasets and processing them into GMT files. Using these GMT files, networks are created using either the Jaccard index or a Binomial Proportion test. Large and dense networks are filtered using a declustering method and a cutoff is applied to produce the final FANs.
FAN properties
| CMAP co-expression | Binomial Proportion* | 130 | Connectivity Map Database | 8,924 | 62,382 |
| Transcription Factors (ChIP-X) | Binomial Proportion* | 27 | ChEA database | 13,223 | 70,347 |
| GeneRIF | Binomial Proportion* | 2000 | NCBI GeneRIF | 3,777 | 27,487 |
| GO Molecular Function | Binomial Proportion* | 160 | Gene Ontology | 2,944 | 23,356 |
| TRANSFAC | Binomial Proportion | 27 | TRANSFAC | 15,252 | 94,642 |
| GeneSigDB | Binomial Proportion | 350 | GeneSigDB | 10,536 | 65,776 |
| MicroRNA | Jaccard* | 0.3 | TargetScan | 6,590 | 46,161 |
| Mouse Phenotype | Jaccard* | 0.5 | MGI MP Browser | 7,553 | 52,637 |
| Metabolites | Jaccard* | 0.35 | Human Metabolome Database | 3,577 | 28,617 |
| Structural Domains | Jaccard* | 0.5 | Pfam and InterPro | 6,746 | 46,463 |
| GO Biological Process | Jaccard* | 0.99 | Gene Ontology | 4,287 | 29,988 |
| OMIM Expanded | Jaccard | 0.99 | OMIM Morbid Map | 2,051 | 23,191 |
| OMIM Disease | Jaccard | 0.99 | OMIM Morbid Map | 1,618 | 22,643 |
| Drug Target | Jaccard | 0.5 | DrugBank | 2,121 | 16,807 |
| PPI | None | N/A | 13 Databases | 15,548 | 64,741 |
Properties of all the FANs along with their scoring method, scoring cutoff, data source, edge and node totals. * indicates that the declustering method was applied.
Declustering Details
| CMAP co-expression | 2,000 | 8,924 | 8,924 | 119,420 | 61,362 |
| Transcription Factors (ChIP-X) | 1,500 | 13,223 | 13,223 | 110,901 | 70,347 |
| GeneRIFs | 2,000 | 3,777 | 3,777 | 52,512 | 27,487 |
| GO Molecular Function | 3,000 | 2,969 | 2,944 | 81,895 | 23,356 |
| MicroRNA | 3,000 | 6,590 | 6,590 | 176,766 | 46,161 |
| Mouse Phenotype | 3,300 | 7,795 | 7,553 | 290,381 | 52,637 |
| Metabolites | 3,500 | 3,692 | 3,577 | 205,468 | 28,617 |
| Structural Domains | 3,500 | 7,115 | 6,746 | 247,885 | 46,463 |
| GO Biological Process | 2,300 | 4,305 | 4,287 | 65,669 | 29,988 |
Declustering constants and node and edge counts before and after the declustering algorithm was applied on nine FANs.
Figure 2Heatmap of genes. Heatmap showing the similarity of the genes within each of the FANs and PPI network. Similarity was calculated using the Jaccard index.
Figure 3Heatmap of edges. Heatmap showing the similarity of the interactions connecting genes within each of the FANs and PPI network. Similarity was calculated using the Jaccard index.
Figure 4Topology of the FANs. The global structure of each of the FANs visualized with Cytoscape.
Figure 5Converting PubMed queries to lists of Entrez gene symbols. PubMed queries are first converted into a list of PubMed IDs using NCBI’s e-utilities. For each PubMed ID a list of genes is obtained using GeneRIF. Genes are tallied and sorted by their occurrence and the top N genes are uploaded automatically into Genes2FANs.
Figure 6The Genes2FANs web interface. A screenshot showing the results of running Genes2FANs with the query “eye color”. On the left side of the page users can enter a PubMed query or a gene list and customize the output settings. The resulting subnetwork and a table listing ranked intermediates are shown on the right. Users can also obtain all the functional and binding interactions for a specific gene.
Figure 7Distribution of edges for the disease gene lists. The distribution of edges for disease subnetworks created using genes directly from OMIM (A) and the disease terms with a maximum of 100 returned genes from the PubMed query tool of Genes2FANs (B). Diseases with a sum of PPI and functional edges less than 10 were omitted from both distribution plots.
Figure 8Correlation between subnetwork size and the edge ratio of PPIs to FANs. Scatterplots showing the correlation between the number of edges in the PPI subnetworks for each disease and the log of the ratio of PPI edges to functional edges. The red line depicts the mean of the data points (calculated by partitioning the points into groups of 10 for the OMIM disease gene lists (A) and 15 for the subnetworks made using the query PubMed function (B)). The blue dotted lines show one standard deviation away from the mean.
Figure 9Top diseases. The top 10 diseases with the greatest difference in edge counts for the PPI vs. FANs disease subnetworks made from the OMIM disease gene lists (A) and the top 20 diseases for the subnetworks made using the query PubMed function (B).
Comparison with Similar Tools
| Genes2FANs | 72.1 ± 51 | PPI, literature co-occurrence, miRNAs, co-regulation, domains, drug signatures & targets, gene signatures, metabolites, and phenotypes | 35,078 | |
| PIPs | 10.1 ± 25.2 | Co-expression, orthology, domains, co-localization, and PTMs | 5,338 | |
| HEFalMp | 681.3 ± 1123.2 | Functionally mapped data from microarray experiments and sequence comparisons | 24,433 | |
| GeneMania | 78.7 ± 39.2 | Co-expression, physical & genetic interactions, domains, co-localization, pathways, and orthology | 155,238 | |
| STRING 9.0 | 24.3 ± 14.4 | Co-localization, fusion, co-occurrence, co-expression, literature co-occurrence, and orthology | 5,214,234 | 1,133 Organisms |
| FunCoup | 47.7 ± 21.9 | PPI, orthology, co-expression, miRNA, co-localization, phylogenetics, co-regulation, genetic interactions, and domains | 1,800,000 |
Comparison of Genes2FANs with five similar tools; examining the average number of interactions returned for single gene queries, the types of background knowledge for each tool, the number of unique genes/proteins in each knowledgebase, and the supported organisms.