Literature DB >> 23445528

TSG: a new algorithm for binary and multi-class cancer classification and informative genes selection.

Haiyan Wang1, Hongyan Zhang, Zhijun Dai, Ming-shun Chen, Zheming Yuan.   

Abstract

BACKGROUND: One of the challenges in classification of cancer tissue samples based on gene expression data is to establish an effective method that can select a parsimonious set of informative genes. The Top Scoring Pair (TSP), k-Top Scoring Pairs (k-TSP), Support Vector Machines (SVM), and prediction analysis of microarrays (PAM) are four popular classifiers that have comparable performance on multiple cancer datasets. SVM and PAM tend to use a large number of genes and TSP, k-TSP always use even number of genes. In addition, the selection of distinct gene pairs in k-TSP simply combined the pairs of top ranking genes without considering the fact that the gene set with best discrimination power may not be the combined pairs. The k-TSP algorithm also needs the user to specify an upper bound for the number of gene pairs. Here we introduce a computational algorithm to address the problems. The algorithm is named Chisquare-statistic-based Top Scoring Genes (Chi-TSG) classifier simplified as TSG.
RESULTS: The TSG classifier starts with the top two genes and sequentially adds additional gene into the candidate gene set to perform informative gene selection. The algorithm automatically reports the total number of informative genes selected with cross validation. We provide the algorithm for both binary and multi-class cancer classification. The algorithm was applied to 9 binary and 10 multi-class gene expression datasets involving human cancers. The TSG classifier outperforms TSP family classifiers by a big margin in most of the 19 datasets. In addition to improved accuracy, our classifier shares all the advantages of the TSP family classifiers including easy interpretation, invariant to monotone transformation, often selects a small number of informative genes allowing follow-up studies, resistant to sampling variations due to within sample operations.
CONCLUSIONS: Redefining the scores for gene set and the classification rules in TSP family classifiers by incorporating the sample size information can lead to better selection of informative genes and classification accuracy. The resulting TSG classifier offers a useful tool for cancer classification based on numerical molecular data.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23445528      PMCID: PMC3552704          DOI: 10.1186/1755-8794-6-S1-S3

Source DB:  PubMed          Journal:  BMC Med Genomics        ISSN: 1755-8794            Impact factor:   3.063


  29 in total

1.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning.

Authors:  Margaret A Shipp; Ken N Ross; Pablo Tamayo; Andrew P Weng; Jeffery L Kutok; Ricardo C T Aguiar; Michelle Gaasenbeek; Michael Angelo; Michael Reich; Geraldine S Pinkus; Tane S Ray; Margaret A Koval; Kim W Last; Andrew Norton; T Andrew Lister; Jill Mesirov; Donna S Neuberg; Eric S Lander; Jon C Aster; Todd R Golub
Journal:  Nat Med       Date:  2002-01       Impact factor: 53.440

2.  Multiclass cancer diagnosis using tumor gene expression signatures.

Authors:  S Ramaswamy; P Tamayo; R Rifkin; S Mukherjee; C H Yeang; M Angelo; C Ladd; M Reich; E Latulippe; J P Mesirov; T Poggio; W Gerald; M Loda; E S Lander; T R Golub
Journal:  Proc Natl Acad Sci U S A       Date:  2001-12-11       Impact factor: 11.205

3.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.

Authors:  U Alon; N Barkai; D A Notterman; K Gish; S Ybarra; D Mack; A J Levine
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-08       Impact factor: 11.205

4.  Molecular portraits of human breast tumours.

Authors:  C M Perou; T Sørlie; M B Eisen; M van de Rijn; S S Jeffrey; C A Rees; J R Pollack; D T Ross; H Johnsen; L A Akslen; O Fluge; A Pergamenschikov; C Williams; S X Zhu; P E Lønning; A L Børresen-Dale; P O Brown; D Botstein
Journal:  Nature       Date:  2000-08-17       Impact factor: 49.962

5.  Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses.

Authors:  A Bhattacharjee; W G Richards; J Staunton; C Li; S Monti; P Vasa; C Ladd; J Beheshti; R Bueno; M Gillette; M Loda; G Weber; E J Mark; E S Lander; W Wong; B E Johnson; T R Golub; D J Sugarbaker; M Meyerson
Journal:  Proc Natl Acad Sci U S A       Date:  2001-11-13       Impact factor: 11.205

6.  Molecular classification of human carcinomas by use of gene expression signatures.

Authors:  A I Su; J B Welsh; L M Sapinoso; S G Kern; P Dimitrov; H Lapp; P G Schultz; S M Powell; C A Moskaluk; H F Frierson; G M Hampton
Journal:  Cancer Res       Date:  2001-10-15       Impact factor: 12.701

7.  Simple decision rules for classifying human cancers from gene expression profiles.

Authors:  Aik Choon Tan; Daniel Q Naiman; Lei Xu; Raimond L Winslow; Donald Geman
Journal:  Bioinformatics       Date:  2005-08-16       Impact factor: 6.937

8.  Gene expression correlates of clinical prostate cancer behavior.

Authors:  Dinesh Singh; Phillip G Febbo; Kenneth Ross; Donald G Jackson; Judith Manola; Christine Ladd; Pablo Tamayo; Andrew A Renshaw; Anthony V D'Amico; Jerome P Richie; Eric S Lander; Massimo Loda; Philip W Kantoff; Todd R Golub; William R Sellers
Journal:  Cancer Cell       Date:  2002-03       Impact factor: 31.743

9.  A stable gene selection in microarray data analysis.

Authors:  Kun Yang; Zhipeng Cai; Jianzhong Li; Guohui Lin
Journal:  BMC Bioinformatics       Date:  2006-04-27       Impact factor: 3.169

10.  Gene selection for classification of microarray data based on the Bayes error.

Authors:  Ji-Gang Zhang; Hong-Wen Deng
Journal:  BMC Bioinformatics       Date:  2007-10-03       Impact factor: 3.169

View more
  16 in total

Review 1.  Contribution of bioinformatics prediction in microRNA-based cancer therapeutics.

Authors:  Jasjit K Banwait; Dhundy R Bastola
Journal:  Adv Drug Deliv Rev       Date:  2014-11-06       Impact factor: 15.470

2.  SINC: a scale-invariant deep-neural-network classifier for bulk and single-cell RNA-seq data.

Authors:  Chuanqi Wang; Jun Li
Journal:  Bioinformatics       Date:  2020-03-01       Impact factor: 6.937

Review 3.  Data analysis methods for defining biomarkers from omics data.

Authors:  Chao Li; Zhenbo Gao; Benzhe Su; Guowang Xu; Xiaohui Lin
Journal:  Anal Bioanal Chem       Date:  2021-12-24       Impact factor: 4.142

4.  Multi-class BCGA-ELM based classifier that identifies biomarkers associated with hallmarks of cancer.

Authors:  Vasily Sachnev; Saras Saraswathi; Rashid Niaz; Andrzej Kloczkowski; Sundaram Suresh
Journal:  BMC Bioinformatics       Date:  2015-05-20       Impact factor: 3.169

5.  Automatic context-specific subnetwork discovery from large interaction networks.

Authors:  Ashis Saha; Aik Choon Tan; Jaewoo Kang
Journal:  PLoS One       Date:  2014-01-01       Impact factor: 3.240

6.  Informative gene selection and direct classification of tumor based on Chi-square test of pairwise gene interactions.

Authors:  Hongyan Zhang; Lanzhi Li; Chao Luo; Congwei Sun; Yuan Chen; Zhijun Dai; Zheming Yuan
Journal:  Biomed Res Int       Date:  2014-07-23       Impact factor: 3.411

7.  Discovering Pair-wise Synergies in Microarray Data.

Authors:  Yuan Chen; Dan Cao; Jun Gao; Zheming Yuan
Journal:  Sci Rep       Date:  2016-07-29       Impact factor: 4.379

8.  Stratification of patients with clear cell renal cell carcinoma to facilitate drug repositioning.

Authors:  Xiangyu Li; Woonghee Kim; Kajetan Juszczak; Muhammad Arif; Yusuke Sato; Haruki Kume; Seishi Ogawa; Hasan Turkez; Jan Boren; Jens Nielsen; Mathias Uhlen; Cheng Zhang; Adil Mardinoglu
Journal:  iScience       Date:  2021-06-12

9.  The current trend of genomics research for human diseases.

Authors:  Ke K Zhang; Hamid R Arabnia; Yunliang Wang; Youping Deng
Journal:  BMC Med Genomics       Date:  2013-01-23       Impact factor: 3.063

10.  Informative gene selection and the direct classification of tumors based on relative simplicity.

Authors:  Yuan Chen; Lifeng Wang; Lanzhi Li; Hongyan Zhang; Zheming Yuan
Journal:  BMC Bioinformatics       Date:  2016-01-20       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.