| Literature DB >> 23690879 |
Keita Mori1, Tomonori Oura, Hisashi Noma, Shigeyuki Matsui.
Abstract
Molecular heterogeneity of cancer, partially caused by various chromosomal aberrations or gene mutations, can yield substantial heterogeneity in gene expression profile in cancer samples. To detect cancer-related genes which are active only in a subset of cancer samples or cancer outliers, several methods have been proposed in the context of multiple testing. Such cancer outlier analyses will generally suffer from a serious lack of power, compared with the standard multiple testing setting where common activation of genes across all cancer samples is supposed. In this paper, we consider information sharing across genes and cancer samples, via a parametric normal mixture modeling of gene expression levels of cancer samples across genes after a standardization using the reference, normal sample data. A gene-based statistic for gene selection is developed on the basis of a posterior probability of cancer outlier for each cancer sample. Some efficiency improvement by using our method was demonstrated, even under settings with misspecified, heavy-tailed t-distributions. An application to a real dataset from hematologic malignancies is provided.Entities:
Mesh:
Year: 2013 PMID: 23690879 PMCID: PMC3649281 DOI: 10.1155/2013/693901
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1ROC curves that plot TPR versus FDR for normally distributed gene expression data.
Figure 2ROC curves that plot TPR versus FDR for t-distributed gene expression data.
The number of overlaps in selected genes between the gene selection methods in the example of hematologic malignancies. Top 200 genes were selected by each method.
|
| COPA | OS | ORT | MOST | Proposed | |
|---|---|---|---|---|---|---|
|
| — | 13 | 14 | 50 | 56 | 56 |
| COPA | 13 | — | 150 | 0 | 99 | 51 |
| OS | 14 | 150 | — | 139 | 108 | 86 |
| ORT | 50 | 0 | 139 | — | 151 | 89 |
| MOST | 56 | 99 | 108 | 151 | — | 75 |
| Proposed | 56 | 51 | 86 | 89 | 75 | — |
Figure 3Histograms of the standardized expression values of three genes selected by our method, but not by the other methods.