| Literature DB >> 30357367 |
Madalina Giurgiu1, Julian Reinhard1, Barbara Brauner1, Irmtraud Dunger-Kaltenbach1, Gisela Fobo1, Goar Frishman1, Corinna Montrone1, Andreas Ruepp1.
Abstract
CORUM is a database that provides a manually curated repository of experimentally characterized protein complexes from mammalian organisms, mainly human (67%), mouse (15%) and rat (10%). Given the vital functions of these macromolecular machines, their identification and functional characterization is foundational to our understanding of normal and disease biology. The new CORUM 3.0 release encompasses 4274 protein complexes offering the largest and most comprehensive publicly available dataset of mammalian protein complexes. The CORUM dataset is built from 4473 different genes, representing 22% of the protein coding genes in humans. Protein complexes are described by a protein complex name, subunit composition, cellular functions as well as the literature references. Information about stoichiometry of subunits depends on availability of experimental data. Recent developments include a graphical tool displaying known interactions between subunits. This allows the prediction of structural interconnections within protein complexes of unknown structure. In addition, we present a set of 58 protein complexes with alternatively spliced subunits. Those were found to affect cellular functions such as regulation of apoptotic activity, protein complex assembly or define cellular localization. CORUM is freely accessible at http://mips.helmholtz-muenchen.de/corum/.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30357367 PMCID: PMC6323970 DOI: 10.1093/nar/gky973
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Annotation of protein complexes in CORUM. Presentation of the Ubiquitin E3 ligase (CUL3, KLHL20, RBX1) complex in CORUM. Information about functional annotation using term from Gene Ontology is automatically translated into respective terms from the FunCat annotation scheme.
Figure 2.Data growth in CORUM. The plot compares the data content of CORUM versions 1.0, 2.0 and 3.0. It includes the number of articles that were used to create the datasets, the number of core set complexes, the total number of protein complexes as well as the total number of different proteins that are found in the dataset. As we did not provide a core set in CORUM 1.0, a respective number is missing.
Figure 3.Protein–protein interactions between protein complex subunits. Based on data from the IntAct database, validated protein–protein interactions of the Fanconi anemia FAAP100 complex (complex 6884) are displayed with Cytoscape.