Literature DB >> 12217913

Multi-class cancer classification via partial least squares with gene expression profiles.

Danh V Nguyen1, David M Rocke.   

Abstract

MOTIVATION: Discrimination between two classes such as normal and cancer samples and between two types of cancers based on gene expression profiles is an important problem which has practical implications as well as the potential to further our understanding of gene expression of various cancer cells. Classification or discrimination of more than two groups or classes (multi-class) is also needed. The need for multi-class discrimination methodologies is apparent in many microarray experiments where various cancer types are considered simultaneously.
RESULTS: Thus, in this paper we present the extension to the classification methodology proposed earlier Nguyen and Rocke (2002b; Bioinformatics, 18, 39-50) to classify cancer samples from multiple classes. The methodologies proposed in this paper are applied to four gene expression data sets with multiple classes: (a) a hereditary breast cancer data set with (1) BRCA1-mutation, (2) BRCA2-mutation and (3) sporadic breast cancer samples, (b) an acute leukemia data set with (1) acute myeloid leukemia (AML), (2) T-cell acute lymphoblastic leukemia (T-ALL) and (3) B-cell acute lymphoblastic leukemia (B-ALL) samples, (c) a lymphoma data set with (1) diffuse large B-cell lymphoma (DLBCL), (2) B-cell chronic lymphocytic leukemia (BCLL) and (3) follicular lymphoma (FL) samples, and (d) the NCI60 data set with cell lines derived from cancers of various sites of origin. In addition, we evaluated the classification algorithms and examined the variability of the error rates using simulations based on randomization of the real data sets. We note that there are other methods for addressing multi-class prediction recently and our approach is along the line of Nguyen and Rocke (2002b; Bioinformatics, 18, 39-50). CONTACT: dnguyen@stat.tamu.edu; dmrocke@ucdavis.edu

Entities:  

Mesh:

Year:  2002        PMID: 12217913     DOI: 10.1093/bioinformatics/18.9.1216

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  34 in total

1.  Sparse partial least squares classification for high dimensional data.

Authors:  Dongjun Chung; Sunduz Keles
Journal:  Stat Appl Genet Mol Biol       Date:  2010-03-03

2.  Two-step paretial least square regression classifiers in brain-state decoding using functional magnetic resonance imaging.

Authors:  Zhiying Long; Yubao Wang; Xuanping Liu; Li Yao
Journal:  PLoS One       Date:  2019-04-10       Impact factor: 3.240

3.  Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes.

Authors:  Jeff W Chou; Tong Zhou; William K Kaufmann; Richard S Paules; Pierre R Bushel
Journal:  BMC Bioinformatics       Date:  2007-11-02       Impact factor: 3.169

4.  Multi-view manifold regularized compact low-rank representation for cancer samples clustering on multi-omics data.

Authors:  Juan Wang; Cong-Hai Lu; Xiang-Zhen Kong; Ling-Yun Dai; Shasha Yuan; Xiaofeng Zhang
Journal:  BMC Bioinformatics       Date:  2022-01-20       Impact factor: 3.169

5.  Gene expression profiling of mouse p53-deficient epidermal carcinoma defines molecular determinants of human cancer malignancy.

Authors:  Ramón García-Escudero; Ana B Martínez-Cruz; Mirentxu Santos; Corina Lorz; Carmen Segrelles; Guillermo Garaulet; Cristina Saiz-Ladera; Clotilde Costa; Agueda Buitrago-Pérez; Marta Dueñas; Jesús M Paramio
Journal:  Mol Cancer       Date:  2010-07-14       Impact factor: 27.401

6.  Dimension reduction with gene expression data using targeted variable importance measurement.

Authors:  Hui Wang; Mark J van der Laan
Journal:  BMC Bioinformatics       Date:  2011-07-29       Impact factor: 3.169

7.  Mining for genotype-phenotype relations in Saccharomyces using partial least squares.

Authors:  Tahir Mehmood; Harald Martens; Solve Saebø; Jonas Warringer; Lars Snipen
Journal:  BMC Bioinformatics       Date:  2011-08-03       Impact factor: 3.169

8.  SlimPLS: a method for feature selection in gene expression-based disease classification.

Authors:  Michael Gutkin; Ron Shamir; Gideon Dror
Journal:  PLoS One       Date:  2009-07-29       Impact factor: 3.240

9.  Non-gaussian distributions affect identification of expression patterns, functional annotation, and prospective classification in human cancer genomes.

Authors:  Nicholas F Marko; Robert J Weil
Journal:  PLoS One       Date:  2012-10-31       Impact factor: 3.240

10.  Stable feature selection and classification algorithms for multiclass microarray data.

Authors:  Sebastian Student; Krzysztof Fujarewicz
Journal:  Biol Direct       Date:  2012-10-02       Impact factor: 4.540

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.