Literature DB >> 27267620

Visualizing and Clustering Protein Similarity Networks: Sequences, Structures, and Functions.

Te-Lun Mai1, Geng-Ming Hu1, Chi-Ming Chen1.   

Abstract

Research in the recent decade has demonstrated the usefulness of protein network knowledge in furthering the study of molecular evolution of proteins, understanding the robustness of cells to perturbation, and annotating new protein functions. In this study, we aimed to provide a general clustering approach to visualize the sequence-structure-function relationship of protein networks, and investigate possible causes for inconsistency in the protein classifications based on sequences, structures, and functions. Such visualization of protein networks could facilitate our understanding of the overall relationship among proteins and help researchers comprehend various protein databases. As a demonstration, we clustered 1437 enzymes by their sequences and structures using the minimum span clustering (MSC) method. The general structure of this protein network was delineated at two clustering resolutions, and the second level MSC clustering was found to be highly similar to existing enzyme classifications. The clustering of these enzymes based on sequence, structure, and function information is consistent with each other. For proteases, the Jaccard's similarity coefficient is 0.86 between sequence and function classifications, 0.82 between sequence and structure classifications, and 0.78 between structure and function classifications. From our clustering results, we discussed possible examples of divergent evolution and convergent evolution of enzymes. Our clustering approach provides a panoramic view of the sequence-structure-function network of proteins, helps visualize the relation between related proteins intuitively, and is useful in predicting the structure and function of newly determined protein sequences.

Keywords:  protein similarity networks; sequence similarity; sequence−structure−function relationship; structure similarity

Mesh:

Substances:

Year:  2016        PMID: 27267620     DOI: 10.1021/acs.jproteome.5b01031

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  6 in total

1.  SeQuery: an interactive graph database for visualizing the GPCR superfamily.

Authors:  Geng-Ming Hu; M K Secario; Chi-Ming Chen
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

Review 2.  Nucleobase deaminases: a potential enzyme system for new therapies.

Authors:  Vandana Gaded; Ruchi Anand
Journal:  RSC Adv       Date:  2018-06-28       Impact factor: 4.036

3.  A Comparison of Rosetta Stones in Adapter Protein Families.

Authors:  Hulikal Shivashankara Santosh Kumar; Vadlapudi Kumar
Journal:  Bioinformation       Date:  2016-08-15

4.  Visualizing the GPCR Network: Classification and Evolution.

Authors:  Geng-Ming Hu; Te-Lun Mai; Chi-Ming Chen
Journal:  Sci Rep       Date:  2017-11-14       Impact factor: 4.379

5.  Biocuration in the structure-function linkage database: the anatomy of a superfamily.

Authors:  Gemma L Holliday; Shoshana D Brown; Eyal Akiva; David Mischel; Michael A Hicks; John H Morris; Conrad C Huang; Elaine C Meng; Scott C-H Pegg; Thomas E Ferrin; Patricia C Babbitt
Journal:  Database (Oxford)       Date:  2017-01-01       Impact factor: 3.451

6.  Scaffold Diversity Synthesis Delivers Complex, Structurally, and Functionally Distinct Tetracyclic Benzopyrones.

Authors:  Muthukumar G Sankar; Sayantani Roy; Tuyen Thi Ngoc Tran; Kathrin Wittstein; Jonathan O Bauer; Carsten Strohmann; Slava Ziegler; Kamal Kumar
Journal:  ChemistryOpen       Date:  2018-04-26       Impact factor: 2.911

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.