| Literature DB >> 20202973 |
Jennifer Clarke1, Pearl Seo, Bertrand Clarke.
Abstract
MOTIVATION: Global expression patterns within cells are used for purposes ranging from the identification of disease biomarkers to basic understanding of cellular processes. Unfortunately, tissue samples used in cancer studies are usually composed of multiple cell types and the non-cancerous portions can significantly affect expression profiles. This severely limits the conclusions that can be made about the specificity of gene expression in the cell-type of interest. However, statistical analysis can be used to identify differentially expressed genes that are related to the biological question being studied.Entities:
Mesh:
Year: 2010 PMID: 20202973 PMCID: PMC2853690 DOI: 10.1093/bioinformatics/btq097
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Rank-sorted ratios (R) from ‘electronic’ data across values of p
Fig. 2.Rank-sorted ratios (R) from ‘electronic’ titration data (dark) and observed titration data (light) for proportion values p = 0.25, 0.5 and 0.75. Note the qualitatively different curves caused by noise in the observed data.
Fig. 3.Values of and md(tR(α)) as functions of α for (a) MDA231/mouse titration data at p=0.5 and (b) MAQC human titration data at p=0.75. The vertical line indicates the correct value of α.
Bootstrap estimates of p
| Source | Norm | nb | Prop | Est | SE | 90% CI |
|---|---|---|---|---|---|---|
| UM-MDA231 | Qspline | 39 | 0.75 | 0.788 | 0.023 | (0.746, 0.819) |
| 37 | 0.50 | 0.529 | 0.065 | (0.396, 0.604) | ||
| 10 | 0.25 | 0.304 | 0.108 | (0.180, 0.437) | ||
| UM-MCF7 | Quantile | 100 | 0.75 | 0.722 | 0.086 | (0.596, 0.863) |
| 0.50 | 0.448 | 0.057 | (0.375, 0.553) | |||
| 0.286 | 0.031 | |||||
| MAQC-ILM | Cubic | 40 | 0.75 | 0.776 | 0.041 | (0.710, 0.842) |
| 55 | 0.303 | 0.021 | ||||
| MAQC-AFFX | MAS5 | 14 | 0.75 | 0.763 | 0.040 | (0.688, 0.805) |
| 68 | 0.25 | 0.270 | 0.027 | (0.232, 0.317) | ||
| BIIB500 | MAS5 | 100 | 0.80 | 0.761 | 0.031 | (0.697, 0.800) |
| 0.60 | 0.576 | 0.048 | (0.508, 0.659) | |||
| 0.40 | 0.493 | 0.053 | (0.388, 0.567) | |||
| 0.20 | 0.208 | 0.101 | (0.092, 0.381) | |||
| BIIB100 | MAS5 | 100 | 0.752 | 0.021 | ||
| 0.518 | 0.050 | |||||
| 0.40 | 0.443 | 0.050 | (0.365, 0.527) | |||
| 0.20 | 0.190 | 0.093 | (0.067, 0.347) |
Source, data source; Norm, normalization; nb, number of bootstrap samples; Prop, true value of p; Est, bootstrap point estimate, SE, bootstrap standard error; 90% CI , 90% bootstrap confidence interval. Bold values denotes cases where the true p is not in the interval.
Available datasets
| Source | Type | Platform | Proportion | Norm | GEO | |
|---|---|---|---|---|---|---|
| UMiami | MDA231 | ILM | 0:100:25 | 3 | None | |
| Mouse lung | cubic | |||||
| qspline | ||||||
| UMiami | MCF7 | ILM | 0:100:25 | 1 | None | |
| Mouse lung | quantile | |||||
| qspline | ||||||
| MAQC Site 3 | Univ human | ILM | 100/75/25/0 | 5 | Cubic | GSE5350 |
| brain | ||||||
| MAQC Site 1 | Univ human | AFFX | 100/75/25/0 | 5 | MAS5 | GSE5350 |
| brain | ||||||
| BIIB 500 | Mouse T cells | AFFX | 0:100:20 | 3 | MAS5 | GSE5130 |
| Mouse B cells | ||||||
| BIIB 100 | Mouse T cells | AFFX | 0:100:20 | 1 | MAS5 | GSE5130 |
| Mouse B cells |
Source, data source; type, tissue/cell types; platform, expression platform; proportion, p; n = number of samples at each proportion; Norm, normalization; GEO, GEO accession number. See text for further details.
Fig. 4.Bootstrap estimates of p with 90% confidence intervals. Boxes indicate the point estimates of p; light grey vertical lines indicate the true values of p. (a) MDA231 qspline-normalized data; (b) MCF7 quantile-normalized data; (c) MAQC ILM cubic spline-normalized data; (d) MAQC Affymetrix MAS5 data; (e) BIIB 500 Affymetrix MAS5 data; and (f) BIIB 100 Affymetrix MAS5 data.