| Literature DB >> 12967349 |
Sven Bilke1, Thomas Breslin, Mikael Sigvardsson.
Abstract
BACKGROUND: The availability of high throughput methods for measurement of mRNA concentrations makes the reliability of conclusions drawn from the data and global quality control of samples and hybridization important issues. We address these issues by an information theoretic approach, applied to discretized expression values in replicated gene expression data.Entities:
Mesh:
Substances:
Year: 2003 PMID: 12967349 PMCID: PMC222958 DOI: 10.1186/1471-2105-4-40
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Schematic diagram illustrating the transition from underlying to observed distributions of states, in the case of m = 4 samples. The underlying distribution on the left hand side can be described by the probabilities for each underlying state, P(σ1), P(σ0), and P(σr) (see text). This distribution is then distorted by sample specific errors, and , resulting in an experimentally observed distribution, depicted on the right hand side.
Figure 2The average error in the estimation of the parameters P(σ1), P(σ0), P(σr) are given as a function of correlation factor between the third and fourth bit. For correlation factors above 0.2 the error in P(σr) rises considerably.
Figure 3The average error in the estimation of the error probabilities . For correlation factors above 0.2 and are notably raised. Patterns were these bits deviate from the other two are then not considered as random but rather caused by an error. This effect could only be avoided by introducing extra parameters for correlation between bits.
Typical paramter values. Summary of the estimated parameter values for the B-cell data. Prefers to the set of error probabilities, i.e., .
| Max | Min | Median | ||||
| Pro B-cells | 0.405 | 0.460 | 0.135 | 0.035 | 0.0002 | 0.020 |
| Pre B-cells | 0.395 | 0.450 | 0.155 | 0.047 | 0.003 | 0.028 |
| Full B-cells | 0.343 | 0.471 | 0.186 | 0.073 | 0.0007 | 0.022 |
t-test vs. probabilistic analysis of gene expression levels. The three groups I, II and III indicate the expressional changes between Pro-B to Pre-B, Pre-B to Mature-B, and Pro-B to Mature-B respectively. U stands for accepting the hypothesis up, D for down, and S (stable) if no hypothesis could be accepted on the 95% confidence level.
| Discrete | Continuous | |||||||
| Gene | Id | I | II | III | I | II | III | Literature |
| - | B-Cell specific | |||||||
| CD20 | 99446_at | S | U | U | S | U | U | |
| Spi | 93657_at | S | U | U | S | S | S | |
| - | Pre-B specific | |||||||
| Sox-4 | 160109_at | U | D | S | U | D | S | |
| lef-1 | 103628_at | U | D | S | U | S | S | |
| rag-1 | 93683_at | U | S | S | S | S | U | |
| VpreB | 92972_at | U | D | S | U | D | S | |
| Lambda-5 | 99429_at | U | D | S | U | D | S | |
| I1-7 receptor | 99030_at | U | D | S | U | S | S | |
| TdT | 99030_at | U | D | S | U | S | S | |
| - | B-lineage Specific | |||||||
| Bob-1 | 93915_at | U | S | U | U | S | U | |
| CD 19 | 99945_at | U | S | U | U | D | U | |
| Blnk | 100771_at | U | S | U | U | S | U | |
| Pax-5 | 96993_at | U | S | U | U | S | U | |
| Blk | 92359_at | U | S | U | U | S | U | |
| Mb-1 | 102778_at | U | D | S | U | D | S | |
| B29 | 161012_at | S | S | S | U | S | U | |
| CD24 | 100600_at | U | S | U | U | S | U | |
| - | Not in B-lineage | |||||||
| Id-1 | 100050_at | D | S | D | D | S | D | |
| Fag-1 | 97974_at | S | D | D | D | S | D | |
| I1-3 receptor | 94747_at | D | S | D | D | S | D | |
| CD 63 | 160493_at | D | S | D | D | S | D | |
| Gata-2 | 102789_at | D | S | D | D | D | D | |