| Literature DB >> 19958478 |
Changwon Keum1, Jung Hoon Woo, Won Seok Oh, Sue-Nie Park, Kyoung Tai No.
Abstract
BACKGROUND: Gene expression similarity measuring methods were developed and applied to search rapidly growing public microarray databases. However, current expression similarity measuring methods need to be improved to accurately measure similarity between gene expression profiles from different platforms or different experiments.Entities:
Mesh:
Year: 2009 PMID: 19958478 PMCID: PMC2788367 DOI: 10.1186/1471-2164-10-S3-S15
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Cell type classification accuracy. Cell type classification accuracies using CGSEP and PEPC for three different search databases, overall, cross-platform and cross-experiment.
Top 10 scoring profiles for test profile GSM18935 (thalamus) with overall search database.
| Scoring method | Similarity score | Profile ID | Cell type | Platform ID | Study ID |
|---|---|---|---|---|---|
| CGSEP | 0.62 | GSM12688 | brain | GPL8300 | GSE803 |
| 0.61 | GSM12708 | brain | GPL8300 | GSE803 | |
| 0.6 | GSM12703 | spinal | GPL8300 | GSE803 | |
| 0.59 | GSM12753 | spinal | GPL8300 | GSE803 | |
| 0.58 | GSM2885 | caudate | GPL91 | GSE96 | |
| 0.58 | GSM2820 | cerebral | GPL91 | GSE96 | |
| 0.58 | GSM2886 | caudate | GPL91 | GSE96 | |
| 0.58 | GSM2881 | thalamus | GPL91 | GSE96 | |
| 0.58 | GSM18443 | cardiac | GPL570 | GSE1145 | |
| 0.57 | GSM2897 | thalamus | GPL91 | GSE96 | |
| PEPC | 0.86 | GSM2881 | thalamus | GPL91 | GSE96 |
| 0.86 | GSM2897 | thalamus | GPL91 | GSE96 | |
| 0.84 | GSM2885 | caudate | GPL91 | GSE96 | |
| 0.83 | GSM2886 | caudate | GPL91 | GSE96 | |
| 0.83 | GSM2884 | amygdala | GPL91 | GSE96 | |
| 0.83 | GSM2820 | cerebral | GPL91 | GSE96 | |
| 0.83 | GSM2828 | brain | GPL91 | GSE96 | |
| 0.82 | GSM2874 | amygdala | GPL91 | GSE96 | |
| 0.82 | GSM12708 | brain | GPL8300 | GSE803 | |
| 0.82 | GSM12688 | brain | GPL8300 | GSE803 |
Top 10 scoring profiles for test profile GSM12641(liver) with cross-platform search database
| Scoring method | Similarity score | Profile ID | Cell type | Platform ID | Study ID |
|---|---|---|---|---|---|
| CGSEP | 0.31 | GSM19143 | colon | GPL97 | GSE1152 |
| 0.3 | GSM19142 | ileum | GPL97 | GSE1152 | |
| 0.28 | GSM11827 | kidney | GPL97 | GSE781 | |
| 0.25 | GSM11810 | kidney | GPL97 | GSE781 | |
| 0.24 | GSM4230 | skeletal | GPL246 | GSE465 | |
| 0.24 | GSM4231 | skeletal | GPL246 | GSE465 | |
| 0.23 | GSM18443 | cardiac | GPL570 | GSE1145 | |
| 0.22 | GSM18809 | placenta | GPL1074 | GSE1133 | |
| 0.22 | GSM18798 | prostate | GPL1074 | GSE1133 | |
| 0.22 | GSM18711 | leukocyte | GPL1074 | GSE1133 | |
| PEPC | 0.43 | GSM2831 | liver | GPL91 | GSE96 |
| 0.43 | GSM12640 | liver | GPL92 | GSE803 | |
| 0.43 | GSM2854 | trachea | GPL91 | GSE96 | |
| 0.42 | GSM2858 | spleen | GPL91 | GSE96 | |
| 0.42 | GSM18949 | lung | GPL96 | GSE1133 | |
| 0.42 | GSM18950 | lung | GPL96 | GSE11750 | |
| 0.42 | GSM2835 | spleen | GPL91 | GSE96 | |
| 0.42 | GSM2844 | liver | GPL91 | GSE96 | |
| 0.41 | GSM2834 | salivary | GPL91 | GSE96 | |
| 0.41 | GSM12718 | liver | GPL8300 | GSE803 |
Average similarity scores of top scoring results
| Scoring method | Type of top hit | Overall | Cross-platform | Cross-study |
|---|---|---|---|---|
| CGSEP | Correct | 0.91 ± 0.04 | 0.54 ± 0.12 | 0.89 ± 0.05 |
| Incorrect | 0.82 ± 0.05 | 0.54 ± 0.17 | 0.78 ± 0.11 | |
| PEPC | Correct | 0.97 ± 0.03 | 0.74 ± 0.17 | 0.91 ± 0.06 |
| Incorrect | 0.84 ± 0 | 0.69 ± 0.18 | 0.84 ± 0.09 |
Cross-experiment classification results with different search spaces
| Scoring Method | Cross-experiment (Same platform) | Cross-experiment (Different platform) | ||||
|---|---|---|---|---|---|---|
| CGSEP | 17.5 | 0.89 ± 0.05 | 0.73 ± 0.24 | 10 | 0.54 ± 0.12 | 0.54 ± 0.17 |
| PEPC | 15 | 0.95 ± 0.06 | 0.77 ± 0.25 | 55 | 0.74 ± 0.17 | 0.7 ± 0.18 |