Literature DB >> 33374969

Application of Biological Domain Knowledge Based Feature Selection on Gene Expression Data.

Malik Yousef1,2, Abhishek Kumar3,4, Burcu Bakir-Gungor5.   

Abstract

In the last two decades, there have been massive advancements in high throughput technologies, which resulted in the exponential growth of public repositories of gene expression datasets for various phenotypes. It is possible to unravel biomarkers by comparing the gene expression levels under different conditions, such as disease vs. control, treated vs. not treated, drug A vs. drug B, etc. This problem refers to a well-studied problem in the machine learning domain, i.e., the feature selection problem. In biological data analysis, most of the computational feature selection methodologies were taken from other fields, without considering the nature of the biological data. Thus, integrative approaches that utilize the biological knowledge while performing feature selection are necessary for this kind of data. The main idea behind the integrative gene selection process is to generate a ranked list of genes considering both the statistical metrics that are applied to the gene expression data, and the biological background information which is provided as external datasets. One of the main goals of this review is to explore the existing methods that integrate different types of information in order to improve the identification of the biomolecular signatures of diseases and the discovery of new potential targets for treatment. These integrative approaches are expected to aid the prediction, diagnosis, and treatment of diseases, as well as to enlighten us on disease state dynamics, mechanisms of their onset and progression. The integration of various types of biological information will necessitate the development of novel techniques for integration and data analysis. Another aim of this review is to boost the bioinformatics community to develop new approaches for searching and determining significant groups/clusters of features based on one or more biological grouping functions.

Entities:  

Keywords:  biological knowledge; clustering; feature ranking; feature selection; grouping

Year:  2020        PMID: 33374969     DOI: 10.3390/e23010002

Source DB:  PubMed          Journal:  Entropy (Basel)        ISSN: 1099-4300            Impact factor:   2.524


  10 in total

1.  Mate Analysis of Hepatocellular Carcinoma Immune Subtypes and Their Functional Effects Based on Fuzzy Logic and Evolutionary Algorithms.

Authors:  Qingge Chen; Chaoyi Liu; Yujia Huo
Journal:  Contrast Media Mol Imaging       Date:  2022-05-05       Impact factor: 3.009

2.  miRcorrNet: machine learning-based integration of miRNA and mRNA expression profiles, combined with feature grouping and ranking.

Authors:  Malik Yousef; Gokhan Goy; Ramkrishna Mitra; Christine M Eischen; Amhar Jabeer; Burcu Bakir-Gungor
Journal:  PeerJ       Date:  2021-05-19       Impact factor: 2.984

3.  Inflammatory bowel disease biomarkers of human gut microbiota selected via different feature selection methods.

Authors:  Burcu Bakir-Gungor; Hilal Hacılar; Amhar Jabeer; Ozkan Ufuk Nalbantoglu; Oya Aran; Malik Yousef
Journal:  PeerJ       Date:  2022-04-25       Impact factor: 3.061

4.  miRModuleNet: Detecting miRNA-mRNA Regulatory Modules.

Authors:  Malik Yousef; Gokhan Goy; Burcu Bakir-Gungor
Journal:  Front Genet       Date:  2022-04-12       Impact factor: 4.772

5.  Integrated Bioinformatics Analysis and Verification of Gene Targets for Myocardial Ischemia-Reperfusion Injury.

Authors:  Jianru Wang; Xiaohui Li; Guangcao Peng; Genhao Fan; Mengmeng Zhang; Jian Chen
Journal:  Evid Based Complement Alternat Med       Date:  2022-04-15       Impact factor: 2.650

6.  EGFAFS: A Novel Feature Selection Algorithm Based on Explosion Gravitation Field Algorithm.

Authors:  Lan Huang; Xuemei Hu; Yan Wang; Yuan Fu
Journal:  Entropy (Basel)       Date:  2022-06-25       Impact factor: 2.738

7.  TextNetTopics: Text Classification Based Word Grouping as Topics and Topics' Scoring.

Authors:  Malik Yousef; Daniel Voskergian
Journal:  Front Genet       Date:  2022-06-20       Impact factor: 4.772

8.  A manually curated compendium of expression profiles for the microbial cell factory Corynebacterium glutamicum.

Authors:  Angela Kranz; Tino Polen; Christian Kotulla; Annette Arndt; Graziella Bosco; Michael Bussmann; Ava Chattopadhyay; Annette Cramer; Cedric-Farhad Davoudi; Ursula Degner; Ramon Diesveld; Raphael Freiherr von Boeselager; Kim Gärtner; Cornelia Gätgens; Tobias Georgi; Christian Geraths; Sabine Haas; Antonia Heyer; Max Hünnefeld; Takeru Ishige; Armin Kabus; Nicolai Kallscheuer; Larissa Kever; Simon Klaffl; Britta Kleine; Martina Kočan; Abigail Koch-Koerfges; Kim J Kraxner; Andreas Krug; Aileen Krüger; Andreas Küberl; Mohamed Labib; Christian Lange; Christina Mack; Tomoya Maeda; Regina Mahr; Stephan Majda; Andrea Michel; Xenia Morosov; Olga Müller; Arun M Nanda; Jens Nickel; Jennifer Pahlke; Eugen Pfeifer; Laura Platzen; Paul Ramp; Doris Rittmann; Steffen Schaffer; Sandra Scheele; Stephanie Spelberg; Julia Schulte; Jens-Eric Schweitzer; Georg Sindelar; Ulrike Sorger-Herrmann; Markus Spelberg; Corinna Stansen; Apilaasha Tharmasothirajan; Jan van Ooyen; Philana van Summeren-Wesenhagen; Michael Vogt; Sabrina Witthoff; Lingfeng Zhu; Bernhard J Eikmanns; Marco Oldiges; Georg Schaumann; Meike Baumgart; Melanie Brocker; Lothar Eggeling; Roland Freudl; Julia Frunzke; Jan Marienhagen; Volker F Wendisch; Michael Bott
Journal:  Sci Data       Date:  2022-10-01       Impact factor: 8.501

9.  Transcriptional signatures of the BCL2 family for individualized acute myeloid leukaemia treatment.

Authors:  Chansub Lee; Sungyoung Lee; Eunchae Park; Junshik Hong; Dong-Yeop Shin; Ja Min Byun; Hongseok Yun; Youngil Koh; Sung-Soo Yoon
Journal:  Genome Med       Date:  2022-09-28       Impact factor: 15.266

10.  Long Non-Coding RNAs Might Regulate Phenotypic Switch of Vascular Smooth Muscle Cells Acting as ceRNA: Implications for In-Stent Restenosis.

Authors:  Alberto Arencibia; Fernando Lanas; Luis A Salazar
Journal:  Int J Mol Sci       Date:  2022-03-12       Impact factor: 5.923

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.