Literature DB >> 33557857

A novel gene selection method for gene expression data for the task of cancer type classification.

N Özlem Özcan ŞİmŞek1, Arzucan ÖzgÜr2, Fikret GÜrgen3.   

Abstract

Cancer is a poligenetic disease with each cancer type having a different mutation profile. Genomic data can be utilized to detect these profiles and to diagnose and differentiate cancer types. Variant calling provide mutation information. Gene expression data reveal the altered cell behaviour. The combination of the mutation and expression information can lead to accurate discrimination of different cancer types. In this study, we utilized and transferred the information of existing mutations for a novel gene selection method for gene expression data. We tested the proposed method in order to diagnose and differentiate cancer types. It is a disease specific method as both the mutations and expressions are filtered according to the selected cancer types. Our experiment results show that the proposed gene selection method leads to similar or improved performance metrics compared to classical feature selection methods and curated gene sets.

Entities:  

Keywords:  Cancer research; DNA mutations; Disease classification; Gene expression; Gene weighting; Information retrieval; Machine learning

Year:  2021        PMID: 33557857      PMCID: PMC7869482          DOI: 10.1186/s13062-020-00290-3

Source DB:  PubMed          Journal:  Biol Direct        ISSN: 1745-6150            Impact factor:   4.540


  15 in total

1.  Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.

Authors:  Hanchuan Peng; Fuhui Long; Chris Ding
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2005-08       Impact factor: 6.226

2.  Multi-class HingeBoost. Method and application to the classification of cancer types using gene expression data.

Authors:  Z Wang
Journal:  Methods Inf Med       Date:  2012-03-01       Impact factor: 2.176

3.  The Molecular Signatures Database (MSigDB) hallmark gene set collection.

Authors:  Arthur Liberzon; Chet Birger; Helga Thorvaldsdóttir; Mahmoud Ghandi; Jill P Mesirov; Pablo Tamayo
Journal:  Cell Syst       Date:  2015-12-23       Impact factor: 10.304

4.  Multi-class cancer classification via partial least squares with gene expression profiles.

Authors:  Danh V Nguyen; David M Rocke
Journal:  Bioinformatics       Date:  2002-09       Impact factor: 6.937

5.  Tumor gene expression data classification via sample expansion-based deep learning.

Authors:  Jian Liu; Xuesong Wang; Yuhu Cheng; Lin Zhang
Journal:  Oncotarget       Date:  2017-11-30

6.  Analyses of non-coding somatic drivers in 2,658 cancer whole genomes.

Authors:  Esther Rheinbay; Morten Muhlig Nielsen; Federico Abascal; Jeremiah A Wala; Ofer Shapira; Grace Tiao; Henrik Hornshøj; Julian M Hess; Randi Istrup Juul; Ziao Lin; Lars Feuerbach; Radhakrishnan Sabarinathan; Tobias Madsen; Jaegil Kim; Loris Mularoni; Shimin Shuai; Andrés Lanzós; Carl Herrmann; Yosef E Maruvka; Ciyue Shen; Samirkumar B Amin; Pratiti Bandopadhayay; Johanna Bertl; Keith A Boroevich; John Busanovich; Joana Carlevaro-Fita; Dimple Chakravarty; Calvin Wing Yiu Chan; David Craft; Priyanka Dhingra; Klev Diamanti; Nuno A Fonseca; Abel Gonzalez-Perez; Qianyun Guo; Mark P Hamilton; Nicholas J Haradhvala; Chen Hong; Keren Isaev; Todd A Johnson; Malene Juul; Andre Kahles; Abdullah Kahraman; Youngwook Kim; Jan Komorowski; Kiran Kumar; Sushant Kumar; Donghoon Lee; Kjong-Van Lehmann; Yilong Li; Eric Minwei Liu; Lucas Lochovsky; Keunchil Park; Oriol Pich; Nicola D Roberts; Gordon Saksena; Steven E Schumacher; Nikos Sidiropoulos; Lina Sieverling; Nasa Sinnott-Armstrong; Chip Stewart; David Tamborero; Jose M C Tubio; Husen M Umer; Liis Uusküla-Reimand; Claes Wadelius; Lina Wadi; Xiaotong Yao; Cheng-Zhong Zhang; Jing Zhang; James E Haber; Asger Hobolth; Marcin Imielinski; Manolis Kellis; Michael S Lawrence; Christian von Mering; Hidewaki Nakagawa; Benjamin J Raphael; Mark A Rubin; Chris Sander; Lincoln D Stein; Joshua M Stuart; Tatsuhiko Tsunoda; David A Wheeler; Rory Johnson; Jüri Reimand; Mark Gerstein; Ekta Khurana; Peter J Campbell; Núria López-Bigas; Joachim Weischenfeldt; Rameen Beroukhim; Iñigo Martincorena; Jakob Skou Pedersen; Gad Getz
Journal:  Nature       Date:  2020-02-05       Impact factor: 49.962

7.  Sparse logistic regression with a L1/2 penalty for gene selection in cancer classification.

Authors:  Yong Liang; Cheng Liu; Xin-Ze Luan; Kwong-Sak Leung; Tak-Ming Chan; Zong-Ben Xu; Hai Zhang
Journal:  BMC Bioinformatics       Date:  2013-06-19       Impact factor: 3.169

Review 8.  Approaches to working in high-dimensional data spaces: gene expression microarrays.

Authors:  Y Wang; D J Miller; R Clarke
Journal:  Br J Cancer       Date:  2008-02-19       Impact factor: 7.640

9.  A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification.

Authors:  Alexander Statnikov; Lily Wang; Constantin F Aliferis
Journal:  BMC Bioinformatics       Date:  2008-07-22       Impact factor: 3.169

10.  Hybrid Method Based on Information Gain and Support Vector Machine for Gene Selection in Cancer Classification.

Authors:  Lingyun Gao; Mingquan Ye; Xiaojie Lu; Daobin Huang
Journal:  Genomics Proteomics Bioinformatics       Date:  2017-12-12       Impact factor: 7.691

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.