| Literature DB >> 17135207 |
H Craig Mak1, Mike Daly, Bianca Gruebel, Trey Ideker.
Abstract
CellCircuits (http://www.cellcircuits.org) is an open-access database of molecular network models, designed to bridge the gap between databases of individual pairwise molecular interactions and databases of validated pathways. CellCircuits captures the output from an increasing number of approaches that screen molecular interaction networks to identify functional subnetworks, based on their correspondence with expression or phenotypic data, their internal structure or their conservation across species. This initial release catalogs 2019 computationally derived models drawn from 11 journal articles and spanning five organisms (yeast, worm, fly, Plasmodium falciparum and human). Models are available either as images or in machine-readable formats and can be queried by the names of proteins they contain or by their enriched biological functions. We envision CellCircuits as a clearinghouse in which theorists may distribute or revise models in need of validation and experimentalists may search for models or specific hypotheses relevant to their interests. We demonstrate how such a repository of network models is a novel systems biology resource by performing several meta-analyses not currently possible with existing databases.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17135207 PMCID: PMC1751555 DOI: 10.1093/nar/gkl937
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1The need for a new type of database. The CellCircuits database is positioned between raw molecular interaction databases (left) and databases of rigorously validated cellular pathways (right). Interaction database icons represent (clockwise from top left) the Database of Interacting Proteins [DIP (14)]; the General Repository of Interaction Datasets [GRID (17)]; Molecular INTeractions Database [MINT (48)]; the IntAct molecular interactions database (18); the interaction database at the Munich Information Center for Protein Sequences [MIPS (15)]; and Biomolecular Interaction Network Database [BIND (16)]. Pathway database icons represent Reactome (19); Signal Transduction Knowledge Environment [STKE (22)]; Gene Networks database [GeNet (23)]; BioCarta (); Kyoto Encyclopedia of Genes and Genomes [KEGG (21)]; and CellMap ().
Figure 2Representative network models stored in CellCircuits. (a) A collection of linear regulatory pathways downstream of mating-type locus in yeast (31) (b) An interaction cluster of co-expressed proteins suggestive of a functional complex (34) (c) Parallel clusters conserved between P.falciparum and yeast (40). (d) Parallel clusters that are highly connected by genetic interactions (41).
Sources of data
aY = Yeast; W = Worm; F = Fly; H = Human; P = P.falciparum.
bCounts refer to total number of models across all organisms modeled.
cCounts refer to number of distinct genes in yeast only across all models.
dCounts refer to number of distinct interactions in yeast only across all models.
eFor gene expression, counts refer to number of profiles used.
Shading indicates which publications utilize particular types of Interaction data, State data, or Network patterns.
Figure 3Web interface (). Results using RAD* and ‘DNA binding’ as the search query (circle 1). A total of 274 subnetwork models are returned. The search output includes a graphical representation of the model (circle 7), the genes and GO terms from the model that match the query (circle 6), alternative gene names or synonyms matching the query (circle 9), the total number of matches (circle 8), enriched GO terms (circle 5 and 3), a link to view similar models (circle 4) and a link to example search queries (circle 2).
Figure 4Meta-analysis of models. (a) Histogram of the number of genes or proteins per model. (b) Histogram of the number of genes (y-axis) that are contained in a given number of models (x-axis). The inset is an expanded view of the genes that span over 50 models. (c) Overlap between network modeling publications. Thicker lines represent greater similarity between the sets of models published in two publications (see legend). Similarity is measured by the number of distinct models that share one or more interactions (yeast interactions only) divided by the total number of models in both publications. Interactions are shared between almost every pair of publications, but for clarity, similarity scores <0.05 are not shown.