| Literature DB >> 34767442 |
Lloyd M Smith1, Jeffrey N Agar2, Julia Chamot-Rooke3, Paul O Danis4, Ying Ge5, Joseph A Loo6, Ljiljana Paša-Tolić7, Yury O Tsybin8, Neil L Kelleher9.
Abstract
Proteins are the primary effectors of function in biology, and thus, complete knowledge of their structure and properties is fundamental to deciphering function in basic and translational research. The chemical diversity of proteins is expressed in their many proteoforms, which result from combinations of genetic polymorphisms, RNA splice variants, and posttranslational modifications. This knowledge is foundational for the biological complexes and networks that control biology yet remains largely unknown. We propose here an ambitious initiative to define the human proteome, that is, to generate a definitive reference set of the proteoforms produced from the genome. Several examples of the power and importance of proteoform-level knowledge in disease-based research are presented along with a call for improved technologies in a two-pronged strategy to the Human Proteoform Project.Entities:
Year: 2021 PMID: 34767442 PMCID: PMC8589312 DOI: 10.1126/sciadv.abk0734
Source DB: PubMed Journal: Sci Adv ISSN: 2375-2548 Impact factor: 14.136
Fig. 1.Proteoforms: Distinct protein forms arising from a single gene.
Fig. 2.Proteoforms in human disease.
Five important clinical areas of interest are depicted and serve as examples where proteoforms have been identified and linked to the progression of human disease; they are discussed at length in an extended preprint version of this Perspective (). mAb, monoclonal antibody.
Fig. 3.Approach to creating an integrated Human Proteoform Atlas.
The upper path illustrates the use of protein affinity reagents to capture proteoform families derived from targeted genes. The lower path illustrates the in-depth analysis of human cell types for proteoform discovery and characterization. Relative abundance refers to the ratio of a given proteoform to the sum of all proteoforms in that family.
Fig. 4.The most studied proteins have essential proteoforms that contain common PTMs such as phosphorylation, methylation, acetylation, and other important variations of primary structure such as disulfide bond formation, metal attachment, and proteolytic processing.
TNF, tumor necrosis factor; TAU, tubulin-associated unit; HB, hemoglobin subunits (HbA, HbB, etc.); CASP, cysteine-aspartic proteases (Casp1 to Casp9); SOD, superoxide dismutases (SOD-1, SOD-2, and SOD-3); EGFR, estrogen growth factor receptor; CYTC, cytochrome C; TN, troponin (Tn-C, Tn-I, and Tn-T); APOE, apolipoprotein E; CA, carbonic anhydrase; CREB, cyclic adenosine 5′-monophosphate response element–binding protein; TP53, cellular tumor antigen p53. Citations are from the Web of Science Core Collection from 1975 to 2020. Citations per year and a history of research trends have been chronicled for a subset of these proteins ().
Fig. 5.Once proteoforms have been identified, affinity reagents and targeted assays will enable emergent strategies to delineate their spatial distribution and temporal dynamics of proteoforms and their PTMs.
CyTOF, mass cytometry; CODEX, CO-Detection by indEXing.
Fig. 6.Projected interactions and impact from the Human Proteoform Project.