Literature DB >> 19782232

Classification from microarray data using probabilistic discriminant partial least squares with reject option.

Cristina Botella1, Joan Ferré, Ricard Boqué.   

Abstract

Microarrays are used to simultaneously determine the expressions of thousands of genes. An important application of microarrays is in the classification of samples into classes of interest (e.g. either healthy cells or tumour cells). Discriminant partial least squares (DPLS) has often been used for this purpose. In this paper, we describe an improvement to DPLS that uses kernel-based probability density functions and the Bayes rule to classify samples whilst keeping the option of not classifying the sample if this cannot be done with sufficient confidence. With this approach, those samples outside the boundaries of the known classes or from the ambiguity region between classes are rejected and only samples with a high probability of being correctly classified are indeed classified. The optimal model is found by simultaneously minimizing the misclassification and rejection costs. The method (p-DPLS with reject option) was tested with two datasets. For the human cancers dataset the accuracy (obtained by leave-one-out cross-validation) was improved from 97% to 99% when compared to p-DPLS without reject option. For the breast cancer dataset, p-DPLS with reject option was able to reject 100% of the test samples that did not belong to any of the modelled classes. These samples would have been misclassified if the reject option had not been considered.

Entities:  

Mesh:

Year:  2009        PMID: 19782232     DOI: 10.1016/j.talanta.2009.06.072

Source DB:  PubMed          Journal:  Talanta        ISSN: 0039-9140            Impact factor:   6.057


  1 in total

1.  So you think you can PLS-DA?

Authors:  Daniel Ruiz-Perez; Haibin Guan; Purnima Madhivanan; Kalai Mathee; Giri Narasimhan
Journal:  BMC Bioinformatics       Date:  2020-12-09       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.