| Literature DB >> 35330478 |
Ekaterina V Ilgisonis1, Pavel V Pogodin1, Olga I Kiseleva1, Svetlana N Tarbeeva1, Elena A Ponomarenko1.
Abstract
Within the Human Proteome Project initiative framework for creating functional annotations of uPE1 proteins, the neXt-CP50 Challenge was launched in 2018. In analogy with the missing-protein challenge, each command deciphers the functional features of the proteins in the chromosome-centric mode. However, the neXt-CP50 Challenge is more complicated than the missing-protein challenge: the approaches and methods for solving the problem are clear, but neither the concept of protein function nor specific experimental and/or bioinformatics protocols have been standardized to address it. We proposed using a retrospective analysis of the key HPP repository, the neXtProt database, to identify the most frequently used experimental and bioinformatic methods for analyzing protein functions, and the dynamics of accumulation of functional annotations. It has been shown that the dynamics of the increase in the number of proteins with known functions are greater than the progress made in the experimental confirmation of the existence of questionable proteins in the framework of the missing-protein challenge. At the same time, the functional annotation is based on the guilty-by-association postulate, according to which, based on large-scale experiments on API-MS and Y2H, proteins with unknown functions are most likely mapped through "handshakes" to biochemical processes.Entities:
Keywords: CHPP; Human Proteome Project; missing proteins; neXt-MP50; neXtCP-50; neXtProt; protein function; text-mining; uPE1 proteins
Year: 2022 PMID: 35330478 PMCID: PMC8952229 DOI: 10.3390/jpm12030479
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Figure 1Change in the completeness of human protein data according to neXtProt. (a) Chronology of changes in the number of protein identifications; (b) chronology of replenishment of neXtProt with information on protein functions.
Categories of functional annotation and the number of records for 1441 proteins whose functions were first annotated in one of the versions of neXtProt over the past five years.
| Categories | Number of Records |
|---|---|
| catalytic-activity | 39 |
| function-info | 802 |
| go-biological-process | 2684 |
| go-molecular-function | 1571 |
| Pathway | 1303 |
| transport-activity | 98 |
Figure 2A cloud of biological functions for proteins. The data presented refer to 1392 proteins whose functional annotation appeared in neXtProt from the beginning of 2016 to the beginning of 2021.
Functions of proteins that began to be detected more/less frequently according to neXtProt over the past five years.
| Functions Detected More Frequently | Functions Detected Less Frequently |
|---|---|
| antigen binding | ATP binding |
| immunoglobulin production | DNA-binding transcription factor activity, RNA polymerase II-specific |
| immunoglobulin receptor binding | DNA binding |
| phagocytosis, recognition | DNA-binding transcription factor activity |
| positive regulation of B cell activation | regulation of transcription by RNA polymerase II |
| phagocytosis, engulfment | positive regulation of transcription, DNA-templated |
| complement activation, classical pathway | oxidation–reduction process |
| B cell receptor signaling pathway | positive regulation of transcription by RNA polymerase II |
| immune response | protein serine/threonine kinase activity |
| defense response to bacterium | regulation of transcription, DNA-templated |
| adaptive immune response |
The sequence of identification of protein functions related to binding to other proteins is estimated according to neXtProt using a selection of identified functions and evidence such as direct assay evidence used in manual assertion and physical interaction evidence used in manual assertion, referencing journal articles from 2016 to 2021.
| The Sequence of Identification of Protein Functions | Number of Cases |
|---|---|
| Protein binding and any other function have been identified in one publication | 79 |
| Protein binding has been shown prior to determining any other function | 58 |
| Protein binding has been shown after determining any other function | 14 |
| Protein binding has not been shown, but other function has been defined | 73 |
| To date, only protein binding has been shown | 442 |