Literature DB >> 30676987

Graph-Based Hub Gene Selection Technique Using Protein Interaction Information: Application to Sample Classification.

Pratik Dutta, Sriparna Saha, Saurabh Gulati.   

Abstract

Classification of samples of gene expression profile plays a significant role in prediction and diagnosis of diseases. In the task of sample classification, a robust feature selection algorithm is very much essential to identify the important genes from the high dimensional gene expression data. This paper explores the information of protein-protein interaction with a graph mining technique for finding a proper subset of features (genes), which further takes part in sample classification. Here, our contribution for feature selection is three-fold: first, all the genes are grouped into different clusters based on the integrated information of the gene expression values and their protein interactions using a multi-objective optimization based clustering approach. Second, the confidence scores of the protein interactions are incorporated in a popular graph mining algorithm namely Goldberg algorithm to find out the relevant features. These features are the topologically and functionally significant genes, named as hub genes. Finally, these hub genes are identified varying the degrees of the nodes, and those are utilized for the sample classification task. Different machine learning classifiers are exploited for this purpose, and the classification performance is measured with respect to various performance metrics namely accuracy, sensitivity, specificity, precision, F-measure, and Mathews coefficient correlation. Comparative analysis with respect to two baselines and several existing approaches proves the efficiency of the proposed approach. Furthermore, the robustness of the identified hub-gene modules is endorsed using some strong biological significance analysis.

Entities:  

Year:  2019        PMID: 30676987     DOI: 10.1109/JBHI.2019.2894374

Source DB:  PubMed          Journal:  IEEE J Biomed Health Inform        ISSN: 2168-2194            Impact factor:   5.772


  3 in total

1.  R-JaunLab: Automatic Multi-Class Recognition of Jaundice on Photos of Subjects with Region Annotation Networks.

Authors:  Zheng Wang; Ying Xiao; Futian Weng; Xiaojun Li; Danhua Zhu; Fanggen Lu; Xiaowei Liu; Muzhou Hou; Yu Meng
Journal:  J Digit Imaging       Date:  2021-02-25       Impact factor: 4.056

2.  Weighted gene coexpression network analysis identifies a new biomarker of CENPF for prediction disease prognosis and progression in nonmuscle invasive bladder cancer.

Authors:  Jiawei Shi; Pu Zhang; Lilong Liu; Xiaobo Min; Yajun Xiao
Journal:  Mol Genet Genomic Med       Date:  2019-09-30       Impact factor: 2.183

3.  Determination of biomarkers from microarray data using graph neural network and spectral clustering.

Authors:  Kun Yu; Weidong Xie; Linjie Wang; Shoujia Zhang; Wei Li
Journal:  Sci Rep       Date:  2021-12-13       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.