| Literature DB >> 30714210 |
Jonathan Catazaro1, Adam Caprez2, David Swanson3, Robert Powers1,4.
Abstract
The functional evolution of proteins advances through gene duplication followed by functional drift, whereas molecular evolution occurs through random mutational events. Over time, protein active-site structures or functional epitopes remain highly conserved, which enables relationships to be inferred between distant orthologs or paralogs. In this study, we present the first functional clustering and evolutionary analysis of the RCSB Protein Data Bank (RCSB PDB) based on similarities between active-site structures. All of the ligand-bound proteins within the RCSB PDB were scored using our Comparison of Protein Active-site Structures (CPASS) software and database (http://cpass.unl.edu/). Principal component analysis was then used to identify 4431 representative structures to construct a phylogenetic tree based on the CPASS comparative scores (http://itol.embl.de/shared/jcatazaro). The resulting phylogenetic tree identified a sequential, step-wise evolution of protein active-sites and provides novel insights into the emergence of protein function or changes in substrate specificity based on subtle changes in geometry and amino acid composition.Entities:
Keywords: CPASS; functional evolution; protein active-sites; proteins
Mesh:
Substances:
Year: 2019 PMID: 30714210 PMCID: PMC6462239 DOI: 10.1002/prot.25670
Source DB: PubMed Journal: Proteins ISSN: 0887-3585