Literature DB >> 30655072

Identifying condition specific key genes from basal-like breast cancer gene expression data.

Ankush Maind1, Shital Raut2.   

Abstract

Mining patterns of co-expressed genes across the subset of conditions help to narrow down the search space for the analysis of gene expression data. Identifying conditions specific key genes from the large-scale gene expression data is a challenging task. The conditions specific key gene signifies functional behavior of a group of co-expressed genes across the subset of conditions and can be act as biomarkers of the diseases. In this paper, we have propose a novel approach for identification of conditions specific key genes from Basal-Like Breast Cancer (BLBC) disease using biclustering algorithm and Gene Co-expression Network (GCN). The proposed approach is a two-stage approach. In the first stage, significant biclusters have been extracted with the help of 'runibic' biclustering algorithm. The second stage identifies conditions specific key genes from the extracted significant biclusters with the help of GCN. By using difference matrix and gene correlation matrix, we have constructed biologically meaningful and statistically strong GCN. Also, presented the proposed approach with the help of a process diagram and demonstrated the procedure with an example of bicluster number 93 (Bic93). From the experimental results, we observed that 95% and 85% of the extracted biclusters are found to be biologically significant at the p-values less than 0.05 and 0.01 respectively. We have compared proposed approach with the Weighted Gene Co-expression Network Analysis (WGCNA) based approach. From the comparison, our approach has performed effectively and extracted biologically significant biclusters. Also, identified conditions specific key genes which cannot be extracted using the WGCNA based approach. Some of the important identified known key genes are PIK3CA, SHC3, ERBB2, SHC4, PTOV1, STAG1, ZNF215 etc. These key genes can be used as a diagnostic and prognostic biomarker for the BLBC disease after the rigorous analysis. The identified conditions specific key genes can be helpful to reduce the analysis time and increase the accuracy of further research such as biomarker identification, drug target discovery etc.
Copyright © 2018 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  BLBC; Bicluster; Bioinformatics; Data mining; GCN; Gene expression data; Key gene

Mesh:

Substances:

Year:  2018        PMID: 30655072     DOI: 10.1016/j.compbiolchem.2018.12.022

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  3 in total

1.  COSCEB: Comprehensive search for column-coherent evolution biclusters and its application to hub gene identification.

Authors:  Ankush Maind; Shital Raut
Journal:  J Biosci       Date:  2019-06       Impact factor: 1.826

2.  DISA tool: Discriminative and informative subspace assessment with categorical and numerical outcomes.

Authors:  Leonardo Alexandre; Rafael S Costa; Rui Henriques
Journal:  PLoS One       Date:  2022-10-19       Impact factor: 3.752

3.  Data Mining in Healthcare: Applying Strategic Intelligence Techniques to Depict 25 Years of Research Development.

Authors:  Maikel Luis Kolling; Leonardo B Furstenau; Michele Kremer Sott; Bruna Rabaioli; Pedro Henrique Ulmi; Nicola Luigi Bragazzi; Leonel Pablo Carvalho Tedesco
Journal:  Int J Environ Res Public Health       Date:  2021-03-17       Impact factor: 3.390

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.