Literature DB >> 19576292

SoFoCles: feature filtering for microarray classification based on gene ontology.

Georgios Papachristoudis1, Sotiris Diplaris, Pericles A Mitkas.   

Abstract

Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.

Mesh:

Substances:

Year:  2009        PMID: 19576292     DOI: 10.1016/j.jbi.2009.06.002

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  3 in total

1.  Integrative Gene Selection on Gene Expression Data: Providing Biological Context to Traditional Approaches.

Authors:  Cindy Perscheid; Bastien Grasnick; Matthias Uflacker
Journal:  J Integr Bioinform       Date:  2018-12-22

2.  CogNet: classification of gene expression data based on ranked active-subnetwork-oriented KEGG pathway enrichment analysis.

Authors:  Malik Yousef; Ege Ülgen; Osman Uğur Sezerman
Journal:  PeerJ Comput Sci       Date:  2021-02-22

3.  Integrated analysis of numerous heterogeneous gene expression profiles for detecting robust disease-specific biomarkers and proposing drug targets.

Authors:  David Amar; Tom Hait; Shai Izraeli; Ron Shamir
Journal:  Nucleic Acids Res       Date:  2015-08-10       Impact factor: 16.971

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.