| Literature DB >> 15307894 |
James Lyons-Weiler1, Satish Patel, Michael J Becich, Tony E Godfrey.
Abstract
BACKGROUND: Microarray studies in cancer compare expression levels between two or more sample groups on thousands of genes. Data analysis follows a population-level approach (e.g., comparison of sample means) to identify differentially expressed genes. This leads to the discovery of 'population-level' markers, i.e., genes with the expression patterns A > B and B > A. We introduce the PPST test that identifies genes where a significantly large subset of cases exhibit expression values beyond upper and lower thresholds observed in the control samples.Entities:
Mesh:
Year: 2004 PMID: 15307894 PMCID: PMC514539 DOI: 10.1186/1471-2105-5-110
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Number of Genes with Significant Differences between Tumor and Normal Class in Various Cancer Types under the PPST and ABA tests and the parametric t-test (for comparison)
| Data Set | |||||
| breast7 | 572 | 40/5 | 326 | 28/49 | 313 |
| melanoma15 | 662 | 60/202 | 312 | 38/133 | 347 |
| colon45 | 1788 | 55/153 | 1558 | 46/55 | 1378 |
| ovarian16 | 3344 | 253/63 | 2060 | 189/22 | 1813 |
| lymphoma18 | 2077 | 286/42 | 1114 | 194/30 | 1370 |
| lung19 | 614 | 40/3 | 506 | 35/3 | 389 |
*A = tumor sample group, B = normal sample group **pooled variance t
Figure 1Conceptual representation of AB, BA, and ABA patterns of differential expression. The colored tails represent the placement of expression values of a given gene in tumors when compared to the distribution of expression values in normal samples. Standard AB and BA patterns are represented by red and black, respectively. Cases in which a surprising number of samples are distributed in both tails for a given gene are represented here as green (BA) and red (AB), respectively, and are painted similarly in Fig. 2 for specific samples.
Exemplar Genes Found to the Significant (p < 0.05) under the PPST test in the lymphoma data set[18], but missed under the t-test.
| B-cell growth factor (IL-4) | 13 | ABA | 0.301 |
| CCND1 Cyclin D1 (PRAD1; parathyroid adenomatosis 1) | 13 | AB | 0.087 |
| homeobox protein Cdx2 mRNA | 12 | AB | 0.232 |
| LIF Leukemia inhibitory factor (cholinergic differentiation factor) | 10 | AB | 0.181 |
| tumor susceptiblity protein (TSG101) mRNA | -16 | BA | 0.526 |
| carcinoembryonic antigen | -18 | BA | 0.328 |
| CCND2 Cyclin D2 | -18 | BA | 0.076 |
| VIM Vimentin | -18 | BA | 0.976 |
*PPST Score = s1 or -s2 (for AB or BA pattern) and s3 (for ABA pattern)
Summary of the overlap study of the two astrocytoma progression marker data sets. A. Internal consistency of the methods under comparison. k = Khatua et al. data set51; vdb = van den Boom et al. data set52 B. Number of significant genes that overlap between the two data sets in the significant gene list for each method. C. Comparison (% overlap) of methods in the k data set. D. Comparison (% overlap) of methods in the vdb data set
| % overlap | k | ||||
| test | t-test (p) | t-test (np) | PPST | ABA | |
| vdb | t-test | 5.307 | |||
| t-test (np) | 11.248 | ||||
| PPST | |||||
| ABA | 3.211 | ||||
| overlap | k | ||||
| vdb | test | t-test (p) | t-test (np) | PPST | ABA |
| t-test | 35 | ||||
| t-test (np) | 118 | ||||
| PPST | |||||
| ABA | 11 | ||||
| Test 1 | |||||
| k | t-test (p) | t-test (np) | PPST | ABA | |
| Test 2 | t-test (p) | 1 | 78.527 | 74.394 | 2.579 |
| t-test (np) | 1 | 4.678 | |||
| PPST | 1 | 1.28 | |||
| ABA | 1 | ||||
| Test 1 | |||||
| v | t-test (p) | t-test (np) | PPST | ABA | |
| Test 2 | t-test (p) | 1 | 46.382 | 26.649 | 0 |
| t-test (np) | 1 | 5.656 | |||
| PPST | 1 | 7.268 | |||
| ABA | 1 |