| Literature DB >> 34718729 |
Birgit H M Meldal1, Livia Perfetto1,2, Colin Combe3, Tiago Lubiana4, João Vitor Ferreira Cavalcante5, Hema Bye-A-Jee1, Andra Waagmeester6, Noemi Del-Toro1, Anjali Shrivastava1, Elisabeth Barrera1, Edith Wong7, Bernhard Mlecnik8,9,10,11, Gabriela Bindea8,9,10, Kalpana Panneerselvam1, Egon Willighagen12, Juri Rappsilber3,13, Pablo Porras1, Henning Hermjakob1, Sandra Orchard1.
Abstract
The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated, encyclopaedic database of macromolecular complexes with known function from a range of model organisms. It summarizes complex composition, topology and function along with links to a large range of domain-specific resources (i.e. wwPDB, EMDB and Reactome). Since the last update in 2019, we have produced a first draft complexome for Escherichia coli, maintained and updated that of Saccharomyces cerevisiae, added over 40 coronavirus complexes and increased the human complexome to over 1100 complexes that include approximately 200 complexes that act as targets for viral proteins or are part of the immune system. The display of protein features in ComplexViewer has been improved and the participant table is now colour-coordinated with the nodes in ComplexViewer. Community collaboration has expanded, for example by contributing to an analysis of putative transcription cofactors and providing data accessible to semantic web tools through Wikidata which is now populated with manually curated Complex Portal content through a new bot. Our data license is now CC0 to encourage data reuse. Users are encouraged to get in touch, provide us with feedback and send curation requests through the 'Support' link.Entities:
Mesh:
Substances:
Year: 2022 PMID: 34718729 PMCID: PMC8689886 DOI: 10.1093/nar/gkab991
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Top of the details page for SARS-CoV-2 Spike–human ACE2 complex displaying some of the new features available in the ComplexViewer (left side) and Participant table (right side).
Figure 2.Most E. coli proteins are found in only one or two different complexes (A) and most E. coli complexes contain five or fewer unique proteins (B).
Figure 3.Complex Portal and Wikidata. (A) example of an entry assertion in Wikidata with provenance pointing to Complex Portal (Q104836061). (B) Number of protein complexes in Wikidata per taxon (https://w.wiki/3ggX). (C) Subset of Wikidata connected to the SARS-CoV-2 polymerase complex (https://w.wiki/3eta)
Figure 4.Output of an enrichment analysis for Complex Portal complexes of S. cerevisiae proteins using ClueGO. Two different lists of proteins were compared. (A) functionally grouped complexes, (B) distribution (%) of the proteins from list 1 (red) and 2 (blue) within each of the complexes from (A), (C) relationship of complexes based on the Cellular Component part of the Gene Ontology Tree.