Literature DB >> 33550016

Machine learning integrated ensemble of feature selection methods followed by survival analysis for predicting breast cancer subtype specific miRNA biomarkers.

Jnanendra Prasad Sarkar1, Indrajit Saha2, Anasua Sarkar3, Ujjwal Maulik3.   

Abstract

Breast cancer is the second leading cancer type among females. In this regard, it is found that microRNAs play an important role by regulating the gene expressions at the post-transcriptional phase. However, identification of the most influencing miRNAs in breast cancer subtypes is a challenging task, while the recent advancement in Next Generation Sequencing techniques allows analyzing high throughput expression data of miRNAs. Thus, we have conducted this research with the help of NGS data of breast cancer in order to identify the most significant miRNA biomarkers. The selected miRNA biomarkers are highly associated with the multiple breast cancer subtypes. For this purpose, a two-phase technique, called Machine Learning Integrated Ensemble of Feature Selection Methods, followed by survival analysis, is proposed. In the first phase, we have selected the best among seven machine learning techniques based on classification accuracy using the entire set of features (in this case miRNAs). Subsequently, eight different feature selection methods are used separately in order to rank the features and validate each set of top features using the selected machine learning technique by considering a multi-class classification task of the breast cancer subtypes. In the second phase, based on the classification accuracy values, the top features from each feature selection method are considered to make an ensemble to provide further categorization of the miRNAs as 8*, 7* up to 1*. The 8* miRNAs provide the highest average classification accuracy of 86% after 10-fold cross-validation. Thereafter, 27 miRNAs are identified from the list that is confined within 8* to 4* miRNAs based on their importance in survival for breast cancer subtypes using Cox regression based survival analysis. Moreover, expression analysis, regulatory network analysis, protein-protein interaction analysis, KEGG pathway and gene ontology enrichment analysis are performed in order to validate biological significance of the proposed solution. Additionally, we have prepared a miRNA-protein-drug interaction network to identify possible drug for the selected miRNAs. Thus, our findings may be considered during a clinical trial for the treatment of breast cancer patients.
Copyright © 2021 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Breast cancer; Cox regression; Drug repurposing; Feature selection; Machine learning; miRNA sequencing

Year:  2021        PMID: 33550016     DOI: 10.1016/j.compbiomed.2021.104244

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  6 in total

1.  Interpretable machine learning for genomics.

Authors:  David S Watson
Journal:  Hum Genet       Date:  2021-10-20       Impact factor: 5.881

2.  Identification of miRNA biomarkers for breast cancer by combining ensemble regularized multinomial logistic regression and Cox regression.

Authors:  Juntao Li; Hongmei Zhang; Fugen Gao
Journal:  BMC Bioinformatics       Date:  2022-10-18       Impact factor: 3.307

3.  Feature Subset Selection with Optimal Adaptive Neuro-Fuzzy Systems for Bioinformatics Gene Expression Classification.

Authors:  Anwer Mustafa Hilal; Areej A Malibari; Marwa Obayya; Jaber S Alzahrani; Mohammad Alamgeer; Abdullah Mohamed; Abdelwahed Motwakel; Ishfaq Yaseen; Manar Ahmed Hamza; Abu Sarwar Zamani
Journal:  Comput Intell Neurosci       Date:  2022-05-14

4.  Detecting prognostic biomarkers of breast cancer by regularized Cox proportional hazards models.

Authors:  Lingyu Li; Zhi-Ping Liu
Journal:  J Transl Med       Date:  2021-12-20       Impact factor: 5.531

5.  A novel dynamic Bayesian network approach for data mining and survival data analysis.

Authors:  Ali Sheidaei; Abbas Rahimi Foroushani; Kimiya Gohari; Hojjat Zeraati
Journal:  BMC Med Inform Decis Mak       Date:  2022-09-22       Impact factor: 3.298

6.  Drug Repositioning and Subgroup Discovery for Precision Medicine Implementation in Triple Negative Breast Cancer.

Authors:  Zainab Al-Taie; Mark Hannink; Jonathan Mitchem; Christos Papageorgiou; Chi-Ren Shyu
Journal:  Cancers (Basel)       Date:  2021-12-14       Impact factor: 6.639

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.