| Literature DB >> 33262342 |
Pablo Porras1, Elisabet Barrera1, Alan Bridge2, Noemi Del-Toro1, Gianni Cesareni3,4, Margaret Duesbury1,5, Henning Hermjakob1, Marta Iannuccelli3, Igor Jurisica6,7,8, Max Kotlyar6, Luana Licata3, Ruth C Lovering9, David J Lynn10,11, Birgit Meldal1, Bindu Nanduri12, Kalpana Paneerselvam1, Simona Panni13, Chiara Pastrello6, Matteo Pellegrini14, Livia Perfetto1, Negin Rahimzadeh5, Prashansa Ratan5, Sylvie Ricard-Blum15, Lukasz Salwinski5, Gautam Shirodkar5, Anjalia Shrivastava1,16, Sandra Orchard17.
Abstract
The International Molecular Exchange (IMEx) Consortium provides scientists with a single body of experimentally verified protein interactions curated in rich contextual detail to an internationally agreed standard. In this update to the work of the IMEx Consortium, we discuss how this initiative has been working in practice, how it has ensured database sustainability, and how it is meeting emerging annotation challenges through the introduction of new interactor types and data formats. Additionally, we provide examples of how IMEx data are being used by biomedical researchers and integrated in other bioinformatic tools and resources.Entities:
Mesh:
Year: 2020 PMID: 33262342 PMCID: PMC7708836 DOI: 10.1038/s41467-020-19942-z
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Number of binary protein–protein interactions for selected model organisms present in the IMEx dataset, May 2020.
| Binary pair | No. interactions IMEx/IMEx+legacy |
|---|---|
| Human–human (non-redundant) | 490,061/521,353 (259,962/278,983) |
| Human–mouse | 29,510/31,478 |
| Human–any mammala | 526,772/561,062 |
| Human–bacteria | 10,720/10,800 |
| Human–virus | 21,480/22,811 |
| Mouse–any mammal | 51,578/80,230 |
| Rat–any mammal | 11,236/14,105 |
| 48,067/51,741 | |
| 12,526/16,970 | |
| 52,131/55,661 | |
| 72,227/132,104 | |
| 19,318/28,513 |
aIncludes human–human.
Number of binary molecular interactions for different interactor types present in the IMEx dataset, May 2020 (human–human data only).
| Binary interaction | Interaction count |
|---|---|
| Protein–protein | 490,061 |
| Protein–protein complexa | 155 |
| Protein–small molecule | 6469 |
| Protein–DNA | 7649 |
| Protein–gene | 1055 |
| Protein–RNA (all types) | 3511 |
| RNA–RNA (all types) | 490 |
| miRNA–mRNA | 121 |
aProtein complex refers to a stable macromolecular functional unit which can be linked to a corresponding entry in the Complex Portal. It is used by curators when the interactor identification can only be made at the complex level.
Fig. 1Overlap between Unique human PPIs from different resources.
The intersection plot shows several PPI resources as well as legacy data curated by IMEx partners. Individual resource sizes are displayed as horizontal bars on the lower left corner of the image. Intersection sizes are shown as individual bars, with those including IMEx data highlighted in grey. Specific resources involved in each intersection are identified with connected solid black circles under the vertical bars, with unconnected circles representing pairs that are found exclusively in the corresponding database. Shading in the background of the circles helps differentiate between primary databases (green) and meta-databases (orange). Only the experimentally-derived content of the IID and STRING databases was used, excluding all predicted or text-mined interaction data. Only the 20 largest of all possible intersections are shown for clarity.
Fig. 2Number of unique interacting pairs in selected model organisms present in interaction databases.
Differently coloured bars indicate the type of studies from which the data were curated. Data were accessed in January 2020.
Fig. 3Coverage of UniProtKB model organism proteomes in interaction databases.
Shown is the fraction of all proteome entries of key model organisms represented in UniProtKB/Swiss-Prot (manually reviewed) and UniProtKB/TrEMBL (computationally inferred) that are found in primary interaction databases (IMEx, IMEx + legacy data and BioGRID).
Fig. 4Presentation of IMEx data in different resources.
a, b IMEx data for the human ZAP70 protein (P43403) represented in a Reactome and b UniProt. c IMEx-derived binding domain data for the S. cerevisiae vesicular SNARE complex SSO1-SEC9-SNC1 (CPX-1365) used in the Complex Portal.
Fig. 5Mutation-specific TP53 interaction network.
Survival-related patient mutation data[57] overlaid onto the mutation-specific TP53 network derived from interactions from IID (version 2020-05) and mutation-specific interaction data from IMEx. Survival-related mutations are shown on the network edges. It should be noted that many interactions are affected by many more mutations, but these are not necessarily linked to ovarian cancer. The network is visualised in NAViGaTOR[73] ver 3.0.13. Node colour corresponds to GO molecular function, edge colour represents the effect of the mutation on the interaction.