| Literature DB >> 16144520 |
Hai-Yan Pan1, Jun Zhu, Dan-Fu Han.
Abstract
Microarray has become a popular biotechnology in biological and medical research. However, systematic and stochastic variabilities in microarray data are expected and unavoidable, resulting in the problem that the raw measurements have inherent "noise" within microarray experiments. Currently, logarithmic ratios are usually analyzed by various clustering methods directly, which may introduce bias interpretation in identifying groups of genes or samples. In this paper, a statistical method based on mixed model approaches was proposed for microarray data cluster analysis. The underlying rationale of this method is to partition the observed total gene expression level into various variations caused by different factors using an ANOVA model, and to predict the differential effects of GV (gene by variety) interaction using the adjusted unbiased prediction (AUP) method. The predicted GV interaction effects can then be used as the inputs of cluster analysis. We illustrated the application of our method with a gene expression dataset and elucidated the utility of our approach using an external validation.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16144520 PMCID: PMC5172465 DOI: 10.1016/s1672-0229(05)03005-6
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Variance Component Estimates and Their Proportions to Total Variance for Yeast Sporulation Data
| Parameter | Estimate | Proportion |
|---|---|---|
| 0.002 | 0.001 | |
| 0.119 | 0.045 | |
| 1.194 | 0.452 | |
| 0.262 | 0.099 | |
| 0.962 | 0.364 | |
| 0.102 | 0.039 | |
Comparisons of Three Clustering Methods with log2(Ratios) and GV Effects for Yeast Sporulation Data Model (2)
| Method | Pearson | Euclidian | ||
|---|---|---|---|---|
| log2(Ratios) | log2(Ratios) | |||
| Complete-linkage | 0.369 | 0.420 | 0.390 | 0.487 |
| UPGMA-linkage | 0.291 | 0.395 | 0.338 | 0.315 |
| DIANA | 0.301 | 0.311 | 0.391 | 0.500 |