Andrew Chatr-Aryamontri, Bobby-Joe Breitkreutz, Rose Oughtred, Lorrie Boucher, Sven Heinicke, Daici Chen, Chris Stark, Ashton Breitkreutz, Nadine Kolas, Lara O'Donnell, Teresa Reguly, Julie Nixon, Lindsay Ramage, Andrew Winter, Adnane Sellam, Christie Chang, Jodi Hirschman, Chandra Theesfeld, Jennifer Rust, Michael S Livstone, Kara Dolinski, Mike Tyers.
Abstract
The Biological General Repository for Interaction Datasets (BioGRID: http://thebiogrid.org) is an open access database that houses genetic and protein interactions curated from the primary biomedical literature for all major model organism species and humans. As of September 2014, BioGRID contains 749,912 interactions drawn from 43,149 publications that represent 30 model organisms. This interaction count represents a 50% increase compared to our previous 2013 BioGRID update. BioGRID data are freely distributed through partner model organism databases and meta-databases and are directly downloadable in a variety of formats. In addition to general curation of the published literature for the major model species, BioGRID undertakes themed curation projects in areas of particular relevance for biomedical sciences, such as the ubiquitin-proteasome system and various human disease-associated interaction networks. BioGRID curation is coordinated through an Interaction Management System (IMS) that facilitates the compilation of interaction records through structured evidence codes, phenotype ontologies, and gene annotation. The BioGRID architecture has been improved to support a broader range of interaction and post-translational modification types, to allow the representation of more complex multi-gene/protein interactions, to account for cellular phenotypes through structured ontologies, to expedite curation through semi-automated text-mining approaches, and to enhance curation quality control.
Year: 2014 PMID: 25428363 PMCID: PMC4383984 DOI: 10.1093/nar/gku1204
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Increase in BioGRID data content
| Organism | Type | Nodes, August 2012 (3.1.92) | Edges, August 2012 (3.1.92) | Publications, August 2012 (3.1.92) | Nodes, August 2014 (3.2.115) | Edges, August 2014 (3.2.115) | Publications, August 2014 (3.2.115) |
|---|---|---|---|---|---|---|---|
| | PI | 5915 | 16 476 | 1118 | 7200 | 21 536 | 1414 |
| | GI | 107 | 188 | 62 | 112 | 192 | 66 |
| | PI | 2927 | 5010 | 93 | 3288 | 6345 | 178 |
| | GI | 1109 | 2326 | 22 | 1129 | 2344 | 30 |
| | PI | 7998 | 35 843 | 314 | 8076 | 37 606 | 416 |
| | GI | 1023 | 9934 | 1468 | 1042 | 9980 | 1483 |
| | PI | 14 896 | 123 436 | 17 134 | 18 435 | 237 498 | 23 388 |
| | GI | 1291 | 1609 | 237 | 1364 | 1678 | 273 |
| | PI | 6003 | 114 506 | 6601 | 6410 | 135 690 | 7402 |
| | GI | 5561 | 189 692 | 6686 | 5674 | 207 188 | 7257 |
| | PI | 1773 | 6019 | 968 | 2694 | 11 270 | 1146 |
| | GI | 1907 | 14 015 | 1158 | 3158 | 56 745 | 1359 |
| Other organisms | ALL | 8435 | 15 978 | 2724 | 16 470 | 35 347 | 5269 |
| Total | ALL | 44 515 | 527 569 | 33 858 | 55 528 | 749 912 | 43 149 |
Data drawn from monthly releases 3.1.92 and 3.2.115 of BioGRID.
Nodes refer to genes or proteins; edges refer to interactions.
PI, protein (physical) interactions; GI, genetic interactions.
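The growth between the two releases can be read directly off the Total row of the table. The short Python sketch below works through that arithmetic; the variable and function names are illustrative and are not part of any BioGRID tooling. These percentages compare releases 3.1.92 and 3.2.115 only, so they need not match the figure quoted in the abstract, which is stated relative to the previous BioGRID update and may use a different baseline.

```python
# Totals copied from the "Total" row of the table above.
totals_2012 = {"nodes": 44_515, "edges": 527_569, "publications": 33_858}  # release 3.1.92
totals_2014 = {"nodes": 55_528, "edges": 749_912, "publications": 43_149}  # release 3.2.115


def percent_increase(old: int, new: int) -> float:
    """Relative growth from old to new, expressed in percent."""
    return (new - old) / old * 100.0


# Growth of each total between the two releases.
growth = {
    key: percent_increase(totals_2012[key], totals_2014[key])
    for key in totals_2012
}

for key, value in growth.items():
    print(f"{key}: {value:.1f}%")
# nodes: 24.7%
# edges: 42.1%
# publications: 27.4%
```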
Figure 1. Growth of the BioGRID database. Increments in interaction records and source publications reported in BioGRID from July 2006 (release 2.0.18) to August 2014 (release 3.2.115). Left panel shows the increase of annotated protein interactions (PI, red), genetic interactions (GI, green) and total interactions (blue). Right panel shows the number of publications that report protein or genetic interactions and the total number of curated publications.
Figure 2. The Interaction Management System. Overview of the new database architecture that allows BioGRID to transition from a pairwise interaction format to an n-way interaction format for representation of complex protein or genetic interaction relationships. The database schema has also been extended to include support for post-translational modification (PTM) and phenotype curation. The central components illustrated (Interactors, Post-translational Modifications, Interactions, and Ontologies) represent the four major sectors of the IMS architecture. Partial representations of the child support tables that link to the main parent tables are shown but precise entity relationships are not indicated. In total, the IMS contains 57 interlinked tables. All controlled vocabularies (experimental systems, modifications and tags) have been converted into formal ontologies in order to remove redundancies present in the previous database architecture.
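To make the difference between a pairwise and an n-way interaction format concrete, here is a minimal Python sketch. It is not the IMS schema: the class names, field names, and identifiers are all hypothetical, and the all-pairs projection is only one possible way to flatten an n-way record, not a method described in the source.

```python
from dataclasses import dataclass, field
from itertools import combinations
from typing import List, Tuple


@dataclass(frozen=True)
class Participant:
    """One gene or protein taking part in an interaction (hypothetical model)."""
    identifier: str


@dataclass
class NWayInteraction:
    """A single record that can hold any number of participants."""
    participants: List[Participant]
    interaction_type: str      # "PI" or "GI", as in the table footnotes
    evidence_term: str         # placeholder for an ontology term
    source_publication: str    # placeholder for a publication identifier
    phenotype_terms: List[str] = field(default_factory=list)

    def as_pairs(self) -> List[Tuple[str, str]]:
        """Flatten the record into every unordered pair of participants."""
        ids = sorted({p.identifier for p in self.participants})
        return list(combinations(ids, 2))


# A three-member record stored once, instead of as three separate pairs.
complex_record = NWayInteraction(
    participants=[Participant("GENE_A"), Participant("GENE_B"), Participant("GENE_C")],
    interaction_type="PI",
    evidence_term="example-experimental-system",
    source_publication="example-publication",
)
pairs = complex_record.as_pairs()
print(pairs)
# [('GENE_A', 'GENE_B'), ('GENE_A', 'GENE_C'), ('GENE_B', 'GENE_C')]
```

Storing the participants in one record keeps the information that all three were observed together, which a list of independent pairs cannot express on its own.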
Figure 3. Snapshot of the new IMS curation interface. The main functionalities available to BioGRID curators in the new IMS for the annotation of protein and genetic interactions are shown (A–D). The new system is based on ontologies (E) for the annotation of gene function (Gene Ontology), cell type (Cell Type Ontology), tissue (BRENDA Tissue Ontology), small molecules (ChEBI), human disease (Human Disease Ontology), human phenotypes (Human Phenotype Ontology, Phenotypic Qualities Ontologies) or anatomical structures (Uberon) and accepts annotation of binary and n-way interactions (F).