Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Supervised distance matrices.

Literature DB >> 19049489

Supervised distance matrices.

Katherine S Pollard¹, Mark J van der Laan.

Abstract

We introduce a novel statistical concept, called a supervised distance matrix, which quantifies pairwise similarity between variables in terms of their association with an outcome. Supervised distance matrices are derived in two stages. First, the observed data is transformed based on particular working models for association. Examples of transformations include residuals or influence curves from regression models. In the second stage, a choice of distance measure is used to compute all pairwise distances between variables in the transformed data. We present consistent estimators of the resulting distance matrix, including an inverse probability of censoring weighted estimator for use with right-censored outcomes. Supervised distance matrices can be used with standard (unsupervised) clustering algorithms to identify groups of similarly predictive variables and to discover subpopulations of related samples. This approach is illustrated using simulations and an analysis of gene expression data with a censored survival outcome. The proposed methods are widely applicable in genomics and other fields where high-dimensional data is collected on each subject.

Mesh：

Year: 2008 PMID： 19049489 DOI： 10.2202/1544-6115.1404

Source DB: PubMed Journal: Stat Appl Genet Mol Biol ISSN： 1544-6115

Keyword Cloud
Cited

1 in total

1. Mapping forest fuels through vegetation phenology: the role of coarse-resolution satellite time-series.

Authors: Sofia Bajocco; Eleni Dragoz; Ioannis Gitas; Daniela Smiraglia; Luca Salvati; Carlo Ricotta
Journal: PLoS One Date: 2015-03-30 Impact factor: 3.240

1 in total