Literature DB >> 18000002

The BioGRID Interaction Database: 2008 update.

Bobby-Joe Breitkreutz1, Chris Stark, Teresa Reguly, Lorrie Boucher, Ashton Breitkreutz, Michael Livstone, Rose Oughtred, Daniel H Lackner, Jürg Bähler, Valerie Wood, Kara Dolinski, Mike Tyers.   

Abstract

The Biological General Repository for Interaction Datasets (BioGRID) database (http://www.thebiogrid.org) was developed to house and distribute collections of protein and genetic interactions from major model organism species. BioGRID currently contains over 198 000 interactions from six different species, as derived from both high-throughput studies and conventional focused studies. Through comprehensive curation efforts, BioGRID now includes a virtually complete set of interactions reported to date in the primary literature for both the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe. A number of new features have been added to the BioGRID including an improved user interface to display interactions based on different attributes, a mirror site and a dedicated interaction management system to coordinate curation across different locations. The BioGRID provides interaction data with monthly updates to Saccharomyces Genome Database, Flybase and Entrez Gene. Source code for the BioGRID and the linked Osprey network visualization system is now freely available without restriction.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 18000002      PMCID: PMC2238873          DOI: 10.1093/nar/gkm1001

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Protein interactions underlie cell structure, biochemical activity and dynamic behavior; in turn, myriad genetic interactions reflect the vast functional interconnectivities of the protein network (1). High-throughput technologies now generate large datasets of protein and genetic interactions, which compliment more conventional detailed investigations of cellular processes (2). The collation of various types of interaction data is essential for interrogation of system-level attributes (3), and to this end a number of important interaction databases have been developed (4–8). Previously, we described a database called ‘Biological General Repository for Interaction Datasets’ (BioGRID) (www.thebiogrid.org) to archive and distribute comprehensive collections of physical and genetic interactions (9). The BioGRID has grown into a general resource for the research community with an average of 80 000 queries per month and millions of interactions downloaded per year. The 1 October 2007 version of BioGRID (v2.0.33) contains 198 791 (129 584 non-redundant) interaction records comprised of 137 834 (90 577 non-redundant) protein interactions and 60 957 (39 007 non-redundant) genetic interactions (Table 1). BioGRID provides full annotation support for 13 major model organism species (9), and currently houses interactions for Saccharomyces cerevisiae, Schizosaccharomyces pombe, Caenorhabditis elegans, Drosophila melanogaster, Mus musculus and Homo sapiens. Gene annotation tables for all supported species are routinely updated to prevent ambiguous search results. Sources of data in BioGRID include publications that report high-throughput interaction datasets and many focused individual studies curated from inspection of the primary literature (2). Each interaction record in BioGRID contains experimental evidence codes and is linked to the supporting publication. In addition to the BioGRID website, all interactions in BioGRID are available through the dynamically linked Osprey visualization system, which can be used to query network organization in a user-defined fashion (10). BioGRID currently holds observer status in the IMEx consortium of interaction databases (http://imex.sourceforge.net/).
Table 1.

Interactions in BioGRID

SpeciesSetNodesInteractions (redundant)Interactions (non-redundant)Publication sources
S. cerevisiaeP493664 79938 2984017
G374647 11633 3544339
P + G5196111 915716527193
S. pombeP84326311209595
G68922751769733
P + G1057490629781077
D. melanogasterP709122 83422 479140
G119910 68930201515
P + G754833 52325 4991626
C. elegansP2816467045578
G6548618573
P + G32675531541410
M. musculusP33553249949
G3222
P + G33753450151
H. sapiensP806242 87124 01112 288
G29201117
P + G806942 89124 02212 296
All interactionsP23 767137 83490 57716 972
G630960 95739 0076599
P + G25 148198 791129 58422 108

Redundant interactions include self edges, multiple sources and multiple experimental evidence types; non-redundant interactions exclude all duplicates. P, protein interactions; G, genetic interactions.

Interactions in BioGRID Redundant interactions include self edges, multiple sources and multiple experimental evidence types; non-redundant interactions exclude all duplicates. P, protein interactions; G, genetic interactions.

LITERATURE CURATION

Comprehensive manual curation of the entire S. cerevisiae literature for protein and genetic interactions yielded 35 224 (21 281 non-redundant) protein interactions and 19 172 (13 963 non-redundant) genetic interactions (2), all of which are accessible through BioGRID. Comparison of the literature-curated protein interaction dataset to recent high-throughput studies (11–13) reveals a considerable degree of non-overlap, suggesting that many interactions remain to be validated and discovered in this yeast (Figure 1). We have since continued to curate the current S. cerevisiae literature and have added 29 575 (17 017 non-redundant) protein and 27 994 (19 391 non-redundant) genetic interactions to BioGRID since the original curation effort. Updates to BioGRID are made on the first of every month; each release of BioGRID is date stamped and archived for comparative purposes. All S. cerevisiae interactions deposited in BioGRID are immediately imported into the Saccharomyces Genome Database (SGD) with associated citations and evidence codes (14). Additional interaction attributes, including post-translational modifications associated with protein interactions and specific phenotypes associated with each genetic interaction, are currently being annotated and will be released for the entire S. cerevisiae dataset in the near future.
Figure 1.

Overlap between various high-throughput protein interaction datasets (11–13) and the literature-curated interaction dataset as of 1 January 2007 (2). Dataset manipulation was performed with the Osprey visualization system (10). All datasets are available for download from www.thebiogrid.org. N, interaction node and E, interaction edge.

Overlap between various high-throughput protein interaction datasets (11–13) and the literature-curated interaction dataset as of 1 January 2007 (2). Dataset manipulation was performed with the Osprey visualization system (10). All datasets are available for download from www.thebiogrid.org. N, interaction node and E, interaction edge. To complement the comprehensive S. cerevisiae dataset, we have recently completed exhaustive manual curation of the S. pombe literature. Interactions were classified based on the same experimental evidence codes for protein and genetic interactions used previously (2). This effort yielded 2631 (1209 non-redundant) protein interactions and 2275 (1769 non-redundant) genetic interactions, as derived from 1077 publications. This new dataset has recently been deposited in BioGRID and, as for S. cerevisiae, will be updated on a monthly basis and provided to the S. pombe genome database (GeneDB) currently hosted by the Sanger Institute at www.genedb.org/genedb/pombe/ (15). Comparison of orthologous interactions between these evolutionary distant yeasts should prove informative for biological network structure and function. Imminent high-throughput studies in S. pombe should rapidly elaborate the cellular interaction network in this organism (16–18). In addition to systematic yeast curation, we have also undertaken partial interaction curation for higher species, for example, D. melanogaster and H. sapiens (Table 1). These curation efforts are often focused on specific aspects of biology and are in part guided by Gene Ontology inference codes for protein and genetic interactions (19) and the Textpresso text-mining algorithm (20). Other species interactions are added to BioGRID on an ongoing basis and, when available, released in monthly BioGRID updates. Contributions of curated interaction datasets from any species for deposition in the BioGRID are welcomed (www.thebiogrid.org).

DATABASE IMPROVEMENTS

We have expanded accessibility to interaction data in BioGRID via a primary mirror site at the SGD colony in Princeton (http://grid.princeton.edu/). In addition, source code for BioGRID and Osprey has been made available without restriction at SourceForge. BioGRID data files are currently linked to SGD, Flybase and NCBI, to which we provide automatic monthly updates. Analogous relationships are underway with the Arabidopsis Information Resource (TAIR) and S. pombe GeneDB (15,21). We will endeavor to fulfill all requests for custom datasets for export to other model organism databases; the download page at BioGRID contains examples of existing datasets created for export. The dataset download system for the BioGRID is now powered by Asynchronous JavaScript and XML (AJAX), such that downloads are available for every search result page and publication in the database. Users may thus download interactions associated with any gene or all interactions reported by a single publication. Further data subsets organized by experimental system or by organism, including a recently described multivalidated yeast–protein interaction dataset (22), are also available on the BioGRID download page. The tabular user interface of BioGRID has been improved through implementation of AJAX techniques. The interface now provides the option to narrow search results to quantitative datasets; this feature will soon be elaborated to enable user-defined search criteria according to data type, evidence codes and data source. The ability to expand hidden fields with a single mouse click to provide greater detail, such as for Gene Ontology classifications (23), has also been added. Search results now include bait and hit designations to indicate the directionality of interactions. Additional annotation features including phenotype, post-translational modification, domains and motifs are currently under construction.

THE INTERACTION MANAGEMENT SYSTEM

We have implemented an interaction management system (IMS) to support multiple simultaneous curators for each species supported by BioGRID. The IMS is a multiuser web-based application written in PHP that interfaces directly with the BioGRID. An intuitive graphical interface allows curators to quickly record interactions from an automatically updated list of publications. All interactions added via the IMS are verified against current annotation tables to eliminate errors and ambiguity in gene nomenclature. The IMS instantly commits new interactions to the BioGRID update pipeline, unless specified otherwise; interactions are collated each month and released as updates to the primary BioGRID and mirror sites, as well as model organism collaboration sites. Interactions may also be removed or modified in each monthly build, for example, in response to community feedback. All retired datasets are archived on the BioGRID downloads page in case the need for back-comparison arises.

FUTURE DEVELOPMENT

We will continue to curate interactions from major model organism species, with a view to comprehensive back-curation, as we have done for S. cerevisiae and S. pombe. Further refinement of tools and display features in the BioGRID graphical user interface based on a flexible record tag structure will enable greater control over data views and downloadable datasets by the user. New plugins for data visualization are under development for Osprey, Cytoscape (24) and the Edinburgh Pathway Editor (25), which will allow more sophisticated interrogation of interaction networks. In order to facilitate dissemination of our open source software tools, we will strive for compatibility with the Generic Model Organism Database (GMOD) project (26). Finally, we will continue to develop our record structures in compliance with the Proteomics Standards Initiative Molecular Interactions (PSI-MI) standard (27,28).
  27 in total

1.  The proteomics standards initiative.

Authors:  Sandra Orchard; Henning Hermjakob; Rolf Apweiler
Journal:  Proteomics       Date:  2003-07       Impact factor: 3.984

Review 2.  Harvesting the genome's bounty: integrative genomics.

Authors:  P Jorgensen; B J Breitkreutz; K Breitkreutz; C Stark; G Liu; M Cook; J Sharom; J L Nishikawa; T Ketela; D Bellows; A Breitkreutz; I Rupes; L Boucher; D Dewar; M Vo; M Angeli; T Reguly; A Tong; B Andrews; C Boone; M Tyers
Journal:  Cold Spring Harb Symp Quant Biol       Date:  2003

3.  Proteome survey reveals modularity of the yeast cell machinery.

Authors:  Anne-Claude Gavin; Patrick Aloy; Paola Grandi; Roland Krause; Markus Boesche; Martina Marzioch; Christina Rau; Lars Juhl Jensen; Sonja Bastuck; Birgit Dümpelfeld; Angela Edelmann; Marie-Anne Heurtier; Verena Hoffman; Christian Hoefert; Karin Klein; Manuela Hudak; Anne-Marie Michon; Malgorzata Schelder; Markus Schirle; Marita Remor; Tatjana Rudi; Sean Hooper; Andreas Bauer; Tewis Bouwmeester; Georg Casari; Gerard Drewes; Gitte Neubauer; Jens M Rick; Bernhard Kuster; Peer Bork; Robert B Russell; Giulio Superti-Furga
Journal:  Nature       Date:  2006-01-22       Impact factor: 49.962

4.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry.

Authors:  Yuen Ho; Albrecht Gruhler; Adrian Heilbut; Gary D Bader; Lynda Moore; Sally-Lin Adams; Anna Millar; Paul Taylor; Keiryn Bennett; Kelly Boutilier; Lingyun Yang; Cheryl Wolting; Ian Donaldson; Søren Schandorff; Juanita Shewnarane; Mai Vo; Joanne Taggart; Marilyn Goudreault; Brenda Muskat; Cris Alfarano; Danielle Dewar; Zhen Lin; Katerina Michalickova; Andrew R Willems; Holly Sassi; Peter A Nielsen; Karina J Rasmussen; Jens R Andersen; Lene E Johansen; Lykke H Hansen; Hans Jespersen; Alexandre Podtelejnikov; Eva Nielsen; Janne Crawford; Vibeke Poulsen; Birgitte D Sørensen; Jesper Matthiesen; Ronald C Hendrickson; Frank Gleeson; Tony Pawson; Michael F Moran; Daniel Durocher; Matthias Mann; Christopher W V Hogue; Daniel Figeys; Mike Tyers
Journal:  Nature       Date:  2002-01-10       Impact factor: 49.962

5.  Global landscape of protein complexes in the yeast Saccharomyces cerevisiae.

Authors:  Nevan J Krogan; Gerard Cagney; Haiyuan Yu; Gouqing Zhong; Xinghua Guo; Alexandr Ignatchenko; Joyce Li; Shuye Pu; Nira Datta; Aaron P Tikuisis; Thanuja Punna; José M Peregrín-Alvarez; Michael Shales; Xin Zhang; Michael Davey; Mark D Robinson; Alberto Paccanaro; James E Bray; Anthony Sheung; Bryan Beattie; Dawn P Richards; Veronica Canadien; Atanas Lalev; Frank Mena; Peter Wong; Andrei Starostine; Myra M Canete; James Vlasblom; Samuel Wu; Chris Orsi; Sean R Collins; Shamanta Chandran; Robin Haw; Jennifer J Rilstone; Kiran Gandi; Natalie J Thompson; Gabe Musso; Peter St Onge; Shaun Ghanny; Mandy H Y Lam; Gareth Butland; Amin M Altaf-Ul; Shigehiko Kanaya; Ali Shilatifard; Erin O'Shea; Jonathan S Weissman; C James Ingles; Timothy R Hughes; John Parkinson; Mark Gerstein; Shoshana J Wodak; Andrew Emili; Jack F Greenblatt
Journal:  Nature       Date:  2006-03-22       Impact factor: 49.962

6.  Human protein reference database--2006 update.

Authors:  Gopa R Mishra; M Suresh; K Kumaran; N Kannabiran; Shubha Suresh; P Bala; K Shivakumar; N Anuradha; Raghunath Reddy; T Madhan Raghavan; Shalini Menon; G Hanumanthu; Malvika Gupta; Sapna Upendran; Shweta Gupta; M Mahesh; Bincy Jacob; Pinky Mathew; Pritam Chatterjee; K S Arun; Salil Sharma; K N Chandrika; Nandan Deshpande; Kshitish Palvankar; R Raghavnath; R Krishnakanth; Hiren Karathia; B Rekha; Rashmi Nayak; G Vishnupriya; H G Mohan Kumar; M Nagini; G S Sameer Kumar; Rojan Jose; P Deepthi; S Sujatha Mohan; T K B Gandhi; H C Harsha; Krishna S Deshpande; Malabika Sarker; T S Keshava Prasad; Akhilesh Pandey
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

7.  BioGRID: a general repository for interaction datasets.

Authors:  Chris Stark; Bobby-Joe Breitkreutz; Teresa Reguly; Lorrie Boucher; Ashton Breitkreutz; Mike Tyers
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

8.  Comprehensive curation and analysis of global interaction networks in Saccharomyces cerevisiae.

Authors:  Teresa Reguly; Ashton Breitkreutz; Lorrie Boucher; Bobby-Joe Breitkreutz; Nizar N Batada; Gary C Hon; Chad L Myers; Ainslie Parsons; Helena Friesen; Rose Oughtred; Amy Tong; Chris Stark; Yuen Ho; David Botstein; Brenda Andrews; Charles Boone; Olga G Troyanskya; Trey Ideker; Kara Dolinski; Mike Tyers
Journal:  J Biol       Date:  2006-06-08

9.  Stratus not altocumulus: a new view of the yeast protein interaction network.

Authors:  Nizar N Batada; Teresa Reguly; Ashton Breitkreutz; Lorrie Boucher; Bobby-Joe Breitkreutz; Laurence D Hurst; Mike Tyers
Journal:  PLoS Biol       Date:  2006-10       Impact factor: 8.029

10.  Ontological visualization of protein-protein interactions.

Authors:  Harold J Drabkin; Christopher Hollenbeck; David P Hill; Judith A Blake
Journal:  BMC Bioinformatics       Date:  2005-02-11       Impact factor: 3.169

View more
  317 in total

1.  Improved methodology for the affinity isolation of human protein complexes expressed at near endogenous levels.

Authors:  Michal Domanski; Kelly Molloy; Hua Jiang; Brian T Chait; Michael P Rout; Torben Heick Jensen; John LaCava
Journal:  Biotechniques       Date:  2012-05       Impact factor: 1.993

Review 2.  HSP90 at the hub of protein homeostasis: emerging mechanistic insights.

Authors:  Mikko Taipale; Daniel F Jarosz; Susan Lindquist
Journal:  Nat Rev Mol Cell Biol       Date:  2010-06-09       Impact factor: 94.444

3.  Predicting genetic modifier loci using functional gene networks.

Authors:  Insuk Lee; Ben Lehner; Tanya Vavouri; Junha Shin; Andrew G Fraser; Edward M Marcotte
Journal:  Genome Res       Date:  2010-06-09       Impact factor: 9.043

4.  Deciphering the Arabidopsis floral transition process by integrating a protein-protein interaction network and gene expression data.

Authors:  Fei He; Yuan Zhou; Ziding Zhang
Journal:  Plant Physiol       Date:  2010-06-07       Impact factor: 8.340

5.  Statistical inference and reverse engineering of gene regulatory networks from observational expression data.

Authors:  Frank Emmert-Streib; Galina V Glazko; Gökmen Altay; Ricardo de Matos Simoes
Journal:  Front Genet       Date:  2012-02-03       Impact factor: 4.599

6.  Functional wiring of the yeast kinome revealed by global analysis of genetic network motifs.

Authors:  Sara Sharifpoor; Dewald van Dyk; Michael Costanzo; Anastasia Baryshnikova; Helena Friesen; Alison C Douglas; Ji-Young Youn; Benjamin VanderSluis; Chad L Myers; Balázs Papp; Charles Boone; Brenda J Andrews
Journal:  Genome Res       Date:  2012-01-26       Impact factor: 9.043

7.  Information flow in interaction networks II: channels, path lengths, and potentials.

Authors:  Aleksandar Stojmirović; Yi-Kuo Yu
Journal:  J Comput Biol       Date:  2012-03-12       Impact factor: 1.479

8.  The FLIGHT Drosophila RNAi database: 2010 update.

Authors:  David Sims; Borisas Bursteinas; Ekta Jain; Qiong Gao; Buzz Baum; Marketa Zvelebil
Journal:  Fly (Austin)       Date:  2010-10-01       Impact factor: 2.160

9.  Systems pharmacology of arrhythmias.

Authors:  Seth I Berger; Avi Ma'ayan; Ravi Iyengar
Journal:  Sci Signal       Date:  2010-04-20       Impact factor: 8.192

Review 10.  Profiling of protein interaction networks of protein complexes using affinity purification and quantitative mass spectrometry.

Authors:  Robyn M Kaake; Xiaorong Wang; Lan Huang
Journal:  Mol Cell Proteomics       Date:  2010-05-05       Impact factor: 5.911

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.