Literature DB >> 32070994

A Refined 3-in-1 Fused Protein Similarity Measure: Application in Threshold-Free Hub Detection.

Sudipta Acharya, Laizhong Cui, Yi Pan.   

Abstract

An exhaustive literature survey shows that finding protein/gene similarity is an important step towards solving widespread bioinformatics problems, such as predicting protein-protein interactions, analyzing Protein-Protein Interaction Networks (PPINs), gene prioritization, and disease gene/protein detection. In this article, we have proposed an improved 3-in-1 fused protein similarity measure called FuSim-II. It is built upon combining the weighted average of biological knowledge extracted from three potential genomic/ proteomic resources such as Gene Ontology (GO), PPIN, and protein sequence. Furthermore, we have shown the application of the proposed measure in detecting potential hub-proteins from a given PPIN. Aiming that, we have proposed a multi-objective clustering-based protein hub detection framework with FuSim-II working as the underlying proximity measure. The PPINs of H. Sapiens and M. Musculus organisms are chosen for experimental purposes. Unlike most of the existing hub-detection methods, the proposed technique does not require to follow any protein degree cut-off or threshold to define hubs. A thorough assessment of efficiency between proposed and existing eight protein similarity measures along with eight single/multi-objective clustering methods has been carried out. Internal cluster validity indices like Silhouette and Davies Bouldin (DB) are deployed to accomplish analytical study. Also, a comparative performance analysis between proposed and five existing hub-proteins detection algorithms is conducted through the enrichment of essentiality study. The reported results show the improved performance of FuSim-II over existing protein similarity measures in terms of identifying functionally related proteins as well as relevant hub-proteins. Supplementary material is available at http://csse.szu.edu.cn/staff/cuilz/eng/index.html.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 32070994     DOI: 10.1109/TCBB.2020.2973563

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  2 in total

1.  Multi-view feature selection for identifying gene markers: a diversified biological data driven approach.

Authors:  Sudipta Acharya; Laizhong Cui; Yi Pan
Journal:  BMC Bioinformatics       Date:  2020-12-30       Impact factor: 3.169

2.  A consensus multi-view multi-objective gene selection approach for improved sample classification.

Authors:  Sudipta Acharya; Laizhong Cui; Yi Pan
Journal:  BMC Bioinformatics       Date:  2020-09-17       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.