Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Robust sparse hyperplane classifiers: application to uncertain molecular profiling data.

Literature DB >> 15662199

Robust sparse hyperplane classifiers: application to uncertain molecular profiling data.

C Bhattacharyya¹, L R Grate, M I Jordan, L El Ghaoui, I S Mian.

Abstract

Molecular profiling studies can generate abundance measurements for thousands of transcripts, proteins, metabolites, or other species in, for example, normal and tumor tissue samples. Treating such measurements as features and the samples as labeled data points, sparse hyperplanes provide a statistical methodology for classifying data points into one of two categories (classification and prediction) and defining a small subset of discriminatory features (relevant feature identification). However, this and other extant classification methods address only implicitly the issue of observed data being a combination of underlying signals and noise. Recently, robust optimization has emerged as a powerful framework for handling uncertain data explicitly. Here, ideas from this field are exploited to develop robust sparse hyperplanes, i.e., classification and relevant feature identification algorithms that are resilient to variation in the data. Specifically, each data point is associated with an explicit data uncertainty model in the form of an ellipsoid parameterized by a center and covariance matrix. The task of learning a robust sparse hyperplane from such data is formulated as a second order cone program (SOCP). Gaussian and distribution-free data uncertainty models are shown to yield SOCPs that are equivalent to the SCOP based on ellipsoidal uncertainty. The real-world utility of robust sparse hyperplanes is demonstrated via retrospective analysis of breast cancer related transcript profiles. Data-dependent heuristics are used to compute the parameters of each ellipsoidal data uncertainty model. The generalization performance of a specific implementation, designated "robust LIKNON," is better than its nominal counterpart. Finally, the strengths and limitations of robust sparse hyperplanes are discussed.

Entities: Disease

Mesh：

Year: 2004 PMID： 15662199 DOI： 10.1089/cmb.2004.11.1073

Source DB: PubMed Journal: J Comput Biol ISSN： 1066-5277 Impact factor: 1.479

Keyword Cloud
Cited

3 in total

1. Feature selection and molecular classification of cancer using genetic programming.

Authors: Jianjun Yu; Jindan Yu; Arpit A Almal; Saravana M Dhanasekaran; Debashis Ghosh; William P Worzel; Arul M Chinnaiyan
Journal: Neoplasia Date: 2007-04 Impact factor: 5.715

2. A hierarchical Naïve Bayes Model for handling sample heterogeneity in classification problems: an application to tissue microarrays.

Authors: Francesca Demichelis; Paolo Magni; Paolo Piergiorgi; Mark A Rubin; Riccardo Bellazzi
Journal: BMC Bioinformatics Date: 2006-11-24 Impact factor: 3.169

3. Many accurate small-discriminatory feature subsets exist in microarray transcript data: biomarker discovery.

Authors: Leslie R Grate
Journal: BMC Bioinformatics Date: 2005-04-13 Impact factor: 3.169

3 in total