| Literature DB >> 24134721 |
Jaehee Kim1, Robert Todd Ogden, Haseong Kim.
Abstract
BACKGROUND: Time course gene expression experiments are an increasingly popular method for exploring biological processes. Temporal gene expression profiles provide an important characterization of gene function, as biological systems are both developmental and dynamic. With such data it is possible to study gene expression changes over time and thereby to detect differential genes. Much of the early work on analyzing time series expression data relied on methods developed originally for static data and thus there is a need for improved methodology. Since time series expression is a temporal process, its unique features such as autocorrelation between successive points should be incorporated into the analysis.Entities:
Mesh:
Year: 2013 PMID: 24134721 PMCID: PMC4015127 DOI: 10.1186/1471-2105-14-310
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Comparison of screening and clustering results (low noise)
| AR(1) parameter | Method | Error | Sil | ARI | Error | Sil | ARI | Sensitivity | Specificity | FDR | FNR | |
| FC* | 2 | .037 | .509 | .909 | .015 | .560 | .918 | .878 | .723 | .121 | .276 | |
| | 3 | .020 | .471 | .921 | .016 | .484 | .932 | .860 | .783 | .139 | .216 | |
| | 4 | .015 | .430 | .963 | .017 | .438 | .937 | .863 | .842 | .136 | .157 | |
| | 5 | .015 | .388 | .964 | .014 | .403 | .944 | .854 | .798 | .145 | .201 | |
| | 8 | .015 | .305 | .964 | .017 | .317 | .940 | .851 | .836 | .148 | .163 | |
| GPR** | | | | .855 | .779 | .220 | .144 | |||||
| FC | 2 | .052 | .471 | .871 | .021 | .523 | .875 | .871 | .722 | .128 | .277 | |
| | 3 | .036 | .423 | .912 | .026 | .443 | .888 | .846 | .783 | .153 | .217 | |
| | 4 | .029 | .386 | .931 | .028 | .398 | .895 | .847 | .839 | .152 | .160 | |
| | 5 | .027 | .348 | .935 | .022 | .366 | .906 | .837 | .798 | .162 | .205 | |
| | 8 | .028 | .274 | .936 | .029 | .287 | .895 | .830 | .836 | .169 | .163 | |
| GPR | | | | .826 | 678 | .321 | .173 | |||||
| FC | 2 | .073 | .430 | .822 | .030 | .487 | .815 | .863 | .723 | .136 | .276 | |
| | 3 | .056 | .380 | .865 | .042 | .402 | .814 | .828 | .783 | .171 | .217 | |
| | 4 | .052 | .339 | .875 | .045 | .356 | .825 | .827 | .834 | .172 | .165 | |
| | 5 | .049 | .306 | .883 | .036 | .326 | .845 | .817 | .790 | .182 | .209 | |
| | 8 | .047 | .244 | .888 | .049 | .257 | .823 | .803 | .832 | .196 | .167 | |
| GPR | | | | .798 | .571 | .428 | .201 | |||||
| FC | 2 | .159 | .340 | .610 | .056 | .414 | .633 | .835 | .717 | .165 | .201 | |
| | 3 | .139 | .287 | .663 | .093 | .329 | .591 | .775 | .768 | .224 | .231 | |
| | 4 | .124 | .255 | .702 | .113 | .280 | .578 | .766 | .811 | .233 | .188 | |
| | 5 | .132 | .226 | .682 | .093 | .259 | .615 | .762 | .773 | .237 | .226 | |
| | 8 | .143 | .181 | .649 | .134 | .205 | .562 | .730 | .815 | .269 | .184 | |
| GPR | | | | .756 | .410 | .589 | .244 | |||||
| FC | 2 | .266 | .287 | .345 | .088 | .357 | .351 | .755 | .704 | .244 | .295 | |
| | 3 | .264 | .224 | .347 | .153 | .272 | .314 | .682 | .738 | .317 | .261 | |
| | 4 | .258 | .190 | .370 | .186 | .230 | .303 | .668 | .771 | .331 | .228 | |
| | 5 | .258 | .171 | .375 | .161 | .211 | .317 | .676 | .745 | .324 | .255 | |
| | 8 | .267 | .137 | .339 | .220 | .172 | .287 | .641 | .769 | .358 | .230 | |
| GPR | .731 | .335 | .664 | .268 | ||||||||
* FC: proposed method with Fourier coefficients, **GPR: Gaussian process regression.
Comparison of estimation error rate (E), Silhouette width (S) and Adjusted Rand Index (ARI) values of model-based clustering without screening vs with screening with J Fourier coefficients including sensitivity, specificity, FDR and FNR with m = 20 time points. These summaries are based on 500 repetitions of each consisting of 800 curves with AR(1) parameter ρ’s with the noise standard deviation σ = 0.5.
Comparison of screening and clustering results (high noise)
| AR(1) parameter | Method | Error | Sil | ARI | Error | Sil | ARI | Sensitivity | Specificity | FDR | FNR | |
| FC* | 2 | .326 | .259 | .200 | .235 | .292 | .149 | .589 | .708 | .411 | .291 | |
| 3 | .321 | .192 | .199 | .298 | .214 | .151 | .561 | .716 | .439 | .283 | ||
| 4 | .321 | .151 | .194 | .320 | .166 | .156 | .553 | .722 | .447 | .278 | ||
| 5 | .323 | .125 | .185 | .269 | .142 | .156 | .571 | .714 | .428 | .285 | ||
| 8 | .324 | .084 | .175 | .324 | .094 | .148 | .552 | .722 | .448 | .277 | ||
| GPR** | | | | | | .483 | .779 | .221 | .517 | |||
| FC | 2 | .343 | .253 | .164 | .234 | .291 | .117 | .567 | .697 | .432 | .302 | |
| 3 | .339 | .185 | .155 | .287 | .208 | .117 | .545 | .703 | .454 | .296 | ||
| 4 | .338 | .149 | .151 | .306 | .160 | .120 | .539 | .708 | .461 | .291 | ||
| 5 | .337 | .125 | .146 | .261 | .138 | .120 | .555 | .702 | .445 | .297 | ||
| 8 | .338 | .086 | .132 | .307 | .094 | .113 | .538 | .706 | .462 | .293 | ||
| GPR | | | | | | .536 | .677 | .323 | .463 | |||
| FC | 2 | .359 | .248 | .128 | .284 | .290 | .090 | .546 | .681 | .453 | .318 | |
| 3 | .351 | .185 | .119 | .329 | .208 | .089 | .531 | .683 | .468 | .316 | ||
| 4 | .350 | .148 | .115 | .347 | .159 | .092 | .526 | .685 | .473 | .314 | ||
| 5 | .350 | .127 | .108 | .304 | .137 | .088 | .537 | .681 | .462 | .318 | ||
| 8 | .357 | .083 | .089 | .351 | .091 | .079 | .526 | .685 | .474 | .314 | ||
| GPR | | | | | | .584 | .572 | .427 | .415 | |||
| FC | 2 | .383 | .246 | .073 | .330 | .284 | .053 | .517 | .632 | .482 | .367 | |
| 3 | .375 | .183 | .066 | .356 | .198 | .051 | .512 | .634 | .488 | .365 | ||
| 4 | .369 | .151 | .062 | .365 | .158 | .052 | .510 | .633 | .490 | .366 | ||
| 5 | .369 | .126 | .056 | .338 | .137 | .046 | .514 | .633 | .485 | .367 | ||
| 8 | .370 | .086 | .046 | .370 | .092 | .042 | .509 | .634 | .490 | .365 | ||
| GPR | | | | | | .646 | .409 | .590 | .353 | |||
| FC | 2 | .395 | .248 | .035 | .356 | .275 | .030 | .505 | .504 | .495 | .422 | |
| 3 | .384 | .186 | .034 | .368 | .193 | .028 | .503 | .503 | .496 | .424 | ||
| 4 | .383 | .148 | .031 | .373 | .155 | .026 | .502 | .502 | .497 | .421 | ||
| 5 | .381 | .125 | .028 | .358 | .134 | .024 | .504 | .504 | .496 | .419 | ||
| 8 | .377 | .092 | .027 | .370 | .097 | .023 | .503 | .502 | .497 | .415 | ||
| GPR | .679 | .337 | .662 | .320 | ||||||||
* FC: proposed method with Fourier coefficients, **GPR: Gaussian process regression
Comparison of estimation error rate (E), Silhouette width (S) and Adjusted Rand Index (ARI) values of model-based clustering without screening vs with screening with J Fourier coefficients including sensitivity, specificity, FDR and FNR with m = 20 time points. These summaries are based on 500 repetitions of each consisting of 800 curves with AR(1) parameter ρ s with the noise standard deviation σ = 1.5.
Silhouette values for model-based clustering with Fourier coefficients of yeast data
| J = 2 | 5 | 4 | 1715 | .160 | .112 | .451 | .388 |
| J = 3 | 5 | 4 | 1735 | .204 | .146 | .505 | .426 |
| J = 4 | 5 | 4 | 2227 | .174 | .119 | .552 | .485 |
| J = 5 | 5 | 4 | 2792 | .029 | .015 | .048 | .028 |
| J = 6 | 5 | 4 | 3071 | .196 | .119 | .041 | .043 |
| J = 8 | 5 | 4 | 3050 | .136 | .055 | .032 | .024 |
| J = 10 | 5 | 4 | 3142 | .153 | .092 | .031 | .010 |
Median and average silhouette values for model-based clustering with Fourier coefficients using Euclidean distance with screening vs. without screening.
Figure 1Means of J = 4 sample Fourier coefficients with yeast data. The mean profiles of Fourier coefficients in the four clusters and one cluster with genes screened out.
Figure 2Average gene curves in four clusters and one screened-out cluster. The mean curves of gene curves in 4 clusters and one mean curve with genes screened out.
Number of genes in each cluster with J = 4
| Number of genes | 2262 | 51 | 29 | 2077 | 70 |
| Filtered genes | 2041 | 46 | 28 | 1881 | 64 |
| Significant GO terms | 1 | 36 | 17 | 6 | 37 |
The number of genes screened, screened out and significant genes with respect to GO.
Figure 3GO graph of each cluster of yeast data.
GO terms connected sequentially in their GO relationship graph (C: cluster number, S: subset number)
| 1 | 1 | GO:0006334 | Nucleosome assembly | 2 | GO:0000280 | Nuclear division |
| GO:0031497 | Chromatin assembly | GO:0000087 | M phase of mitotic cell cycle | |||
| GO:0006323 | DNA packaging | GO:0048285 | Organelle fission | |||
| GO:0034728 | Nucleosome organization | GO:0022402 | Cell cycle process | |||
| GO:0006333 | Chromatin assembly or disassembly | GO:0000278 | Mitotic cell cycle | |||
| GO:0016043 | Cellular component organization | GO:0007049 | Cell cycle | |||
| GO:0051276 | Chromosome organization | GO:0007067 | Mitosis | |||
| GO:0006325 | Chromatin organization | GO:0000279 | M phase | |||
| GO:0006996 | Crganelle organization | GO:0022403 | Cell cycle phase | |||
| 2 | 1 | GO:0007109 | Cytokinesis, completion of separation | 2 | GO:0071554 | Cell wall organization or biogenesis |
| GO:0000920 | Cell separation during cytokinesis | GO:0007047 | Cellular cell wall organization | |||
| GO:0032506 | Cytokinetic process | GO:0071555 | Cell wall organization | |||
| GO:0000910 | Cytokinesis | GO:0070882 | Cellular cell wall organization or biogenesis | |||
| | | GO:0031505 | Cungal-type cell wall organization | |||
| 4 | 1 | GO:0051716 | Cellular response to stimulus | 2 | GO:0006302 | Couble-strand break repair |
| GO:0050896 | Response to stimulus | GO:0006281 | DNA repair | |||
| GO:0033554 | Cellular response to stress | GO:0000724 | Couble-strand break repair via homologous recombination | |||
| GO:0006950 | Response to stress | GO:0006974 | Cesponse to DNA damage stimulus | |||
| GO:0034605 | Cellular response to heat | 3 | GO:0006260 | DNA replication | ||
| GO:0009408 | Response to heat | GO:0006273 | Cagging strand elongation | |||
| GO:0009628 | Response to abiotic stimulus | GO:0006261 | DNA-dependent DNA replication | |||
| GO:0009266 | Response to temperature stimulus | GO:0006271 | DNA strand elongation during DNA replication | |||
| | | GO:0022616 | DNA strand elongation | |||
| GO:0006259 | DNA metabolic process |
Cluster 1, 2 and 4 have biologically meaningful genes as shown in the table.