Literature DB >> 33500779

Recursive Cluster Elimination based Rank Function (SVM-RCE-R) implemented in KNIME.

Malik Yousef1, Burcu Bakir-Gungor2, Amhar Jabeer2, Gokhan Goy2, Rehman Qureshi3, Louise C Showe3.   

Abstract

In our earlier study, we proposed a novel feature selection approach, Recursive Cluster Elimination with Support Vector Machines (SVM-RCE) and implemented this approach in Matlab. Interest in this approach has grown over time and several researchers have incorporated SVM-RCE into their studies, resulting in a substantial number of scientific publications. This increased interest encouraged us to reconsider how feature selection, particularly in biological datasets, can benefit from considering the relationships of those genes in the selection process, this led to our development of SVM-RCE-R.  SVM-RCE-R, further enhances the capabilities of  SVM-RCE by the addition of  a novel user specified ranking function. This ranking function enables the user to  stipulate the weights of the accuracy, sensitivity, specificity, f-measure, area  under the curve and the precision in the ranking function This flexibility allows the user to select for greater sensitivity or greater specificity as needed for a specific project. The usefulness of SVM-RCE-R is further supported by development of the maTE tool which uses a similar approach to identify microRNA (miRNA) targets. We have also now implemented the SVM-RCE-R algorithm in Knime in order to make it easier to applyThe use of SVM-RCE-R in Knime is simple and intuitive and allows researchers to immediately begin their analysis without having to consult an information technology specialist. The input for the Knime implemented tool is an EXCEL file (or text or CSV) with a simple structure and the output is also an EXCEL file. The Knime version also incorporates new features not available in SVM-RCE. The results show that the inclusion of the ranking function has a significant impact on the performance of SVM-RCE-R. Some of the clusters that achieve high scores for a specified ranking can also have high scores in other metrics. Copyright:
© 2021 Yousef M et al.

Entities:  

Keywords:  KNIME; clustering; gene expression; grouping; machine learning; ranking; recursive

Year:  2020        PMID: 33500779      PMCID: PMC7802119          DOI: 10.12688/f1000research.26880.2

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


  8 in total

1.  A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments.

Authors:  Wei Pan
Journal:  Bioinformatics       Date:  2002-04       Impact factor: 6.937

2.  A survey on filter techniques for feature selection in gene expression microarray analysis.

Authors:  Cosmin Lazar; Jonatan Taminau; Stijn Meganck; David Steenhoff; Alain Coletta; Colin Molter; Virginie de Schaetzen; Robin Duque; Hugues Bersini; Ann Nowé
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2012 Jul-Aug       Impact factor: 3.710

3.  maTE: discovering expressed interactions between microRNAs and their targets.

Authors:  Malik Yousef; Loai Abdallah; Jens Allmer
Journal:  Bioinformatics       Date:  2019-10-15       Impact factor: 6.937

4.  The Gene Expression Omnibus Database.

Authors:  Emily Clough; Tanya Barrett
Journal:  Methods Mol Biol       Date:  2016

5.  NCBI GEO: archive for functional genomics data sets--update.

Authors:  Tanya Barrett; Stephen E Wilhite; Pierre Ledoux; Carlos Evangelista; Irene F Kim; Maxim Tomashevsky; Kimberly A Marshall; Katherine H Phillippy; Patti M Sherman; Michelle Holko; Andrey Yefanov; Hyeseung Lee; Naigong Zhang; Cynthia L Robertson; Nadezhda Serova; Sean Davis; Alexandra Soboleva
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

6.  ArrayExpress--a public repository for microarray gene expression data at the EBI.

Authors:  H Parkinson; U Sarkans; M Shojatalab; N Abeygunawardena; S Contrino; R Coulson; A Farne; G Garcia Lara; E Holloway; M Kapushesky; P Lilja; G Mukherjee; A Oezcimen; T Rayner; P Rocca-Serra; A Sharma; S Sansone; A Brazma
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

7.  Recursive cluster elimination (RCE) for classification and feature selection from gene expression data.

Authors:  Malik Yousef; Segun Jung; Louise C Showe; Michael K Showe
Journal:  BMC Bioinformatics       Date:  2007-05-02       Impact factor: 3.169

8.  Classification and biomarker identification using gene network modules and support vector machines.

Authors:  Malik Yousef; Mohamed Ketany; Larry Manevitz; Louise C Showe; Michael K Showe
Journal:  BMC Bioinformatics       Date:  2009-10-15       Impact factor: 3.169

  8 in total
  5 in total

1.  miRcorrNet: machine learning-based integration of miRNA and mRNA expression profiles, combined with feature grouping and ranking.

Authors:  Malik Yousef; Gokhan Goy; Ramkrishna Mitra; Christine M Eischen; Amhar Jabeer; Burcu Bakir-Gungor
Journal:  PeerJ       Date:  2021-05-19       Impact factor: 2.984

2.  Inflammatory bowel disease biomarkers of human gut microbiota selected via different feature selection methods.

Authors:  Burcu Bakir-Gungor; Hilal Hacılar; Amhar Jabeer; Ozkan Ufuk Nalbantoglu; Oya Aran; Malik Yousef
Journal:  PeerJ       Date:  2022-04-25       Impact factor: 3.061

3.  miRModuleNet: Detecting miRNA-mRNA Regulatory Modules.

Authors:  Malik Yousef; Gokhan Goy; Burcu Bakir-Gungor
Journal:  Front Genet       Date:  2022-04-12       Impact factor: 4.772

4.  TextNetTopics: Text Classification Based Word Grouping as Topics and Topics' Scoring.

Authors:  Malik Yousef; Daniel Voskergian
Journal:  Front Genet       Date:  2022-06-20       Impact factor: 4.772

5.  Minimizing features while maintaining performance in data classification problems.

Authors:  Surani Matharaarachchi; Mike Domaratzki; Saman Muthukumarana
Journal:  PeerJ Comput Sci       Date:  2022-09-14
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.