| Literature DB >> 26250443 |
Fang Yu1, Ming-Hui Chen2, Lynn Kuo3, Heather Talbott4, John S Davis5.
Abstract
BACKGROUND: Recently, the Bayesian method becomes more popular for analyzing high dimensional gene expression data as it allows us to borrow information across different genes and provides powerful estimators for evaluating gene expression levels. It is crucial to develop a simple but efficient gene selection algorithm for detecting differentially expressed (DE) genes based on the Bayesian estimators.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26250443 PMCID: PMC4527130 DOI: 10.1186/s12859-015-0664-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Graphical illustration of the confident difference criterion method. The figure on the left panel and the right panel uses γ 0=0.5 and γ 0=0.7 separately. The μ −μ measures the difference in the mean intensities of gene g between the two conditions. Both figures are drawn based on an assumption that the posterior mean of μ −μ are positive. The shaded area in both figures measures the posterior probability for having a positive difference μ −μ
Performance evaluation under Study I (Setting 1), (G=5000, 500 DE gene)
| Cut-off | Method | CDE | CCDE | CCEE | FNR | FPR | FDR | FNDR |
|---|---|---|---|---|---|---|---|---|
|
| I | 796.7(14.5) | 466.7(4.7) | 4169.9(13.9) | 0.067(0.009) | 0.073(0.003) | 0.414(0.011) | 0.008(0.001) |
| (0.4) | II | 864.4(13.8) | 499.2(1.1) | 4134.8(13.8) | 0.002(0.002) | 0.081(0.003) | 0.422(0.009) | 0.000(0.000) |
|
| I | 526.5(9.8) | 419.2(6.7) | 4392.7(6.9) | 0.162(0.013) | 0.024(0.002) | 0.204(0.011) | 0.018(0.001) |
| (0.6) | II | 582.3(7.3) | 493.3(3.0) | 4411.0(6.9) | 0.013(0.006) | 0.020(0.002) | 0.153(0.010) | 0.002(0.001) |
| FDR | I | 296.4(13.7) | 283.3(12.9) | 4486.9(2.9) | 0.433(0.026) | 0.003(0.001) | 0.044(0.009) | 0.046(0.003) |
| (0.05) | II | 469.2(13.1) | 450.7(10.2) | 4481.5(4.4) | 0.099(0.020) | 0.004(0.001) | 0.039(0.009) | 0.011(0.002) |
| SAM | 330.5(16.0) | 314.0(13.5) | 4483.4(4.7) | 0.372(0.027) | 0.004(0.001) | 0.050(0.013) | 0.040(0.003) | |
| LIMMA | 320.2(15.2) | 304.9(13.7) | 4484.7(4.1) | 0.390(0.027) | 0.003(0.001) | 0.048(0.012) | 0.042(0.003) | |
| SPH | 192.0(10.6) | 188.1(10.0) | 4496.1(2.1) | 0.624(0.020) | 0.001(0.000) | 0.020(0.011) | 0.065(0.002) | |
| EBA | 166.4(14.1) | 158.8(13.3) | 4492.3(2.2) | 0.682(0.027) | 0.002(0.000) | 0.046(0.012) | 0.071(0.003) |
#Empirical estimates of the standard deviation were reported in the parentheses
Performance evaluation under Study I (Setting 2), (G=5000, 500 DE gene)
| Cut-off | Method | CDE | CCDE | CCEE | FNR | FPR | FDR | FNDR |
|---|---|---|---|---|---|---|---|---|
|
| I | 1086.5(22.5) | 476.3(4.4) | 3890.8(21.9) | 0.045(0.009) | 0.135(0.005) | 0.561(0.009) | 0.006(0.001) |
| (0.4) | II | 1388.1(25.7) | 499.7(0.5) | 3611.6(25.7) | 0.001(0.001) | 0.197(0.006) | 0.640(0.007) | 0.000(0.000) |
|
| I | 656.8(13.8) | 448.2(5.5) | 4291.4(13.1) | 0.104(0.011) | 0.046(0.003) | 0.317(0.014) | 0.012(0.001) |
| (0.6) | II | 749.3(15.0) | 496.0(1.8) | 4246.7(15.1) | 0.008(0.004) | 0.056(0.003) | 0.338(0.014) | 0.001(0.000) |
| FDR | I | 357.1(8.7) | 342.7(8.0) | 4485.6(3.9) | 0.315(0.016) | 0.003(0.001) | 0.040(0.011) | 0.034(0.002) |
| (0.05) | II | 480.7(10.5) | 458.7(7.7) | 4478.0(5.7) | 0.083(0.015) | 0.005(0.001) | 0.046(0.011) | 0.009(0.002) |
| SAM | 326.9(12.9) | 312.0(11.2) | 4485.1(4.5) | 0.376(0.022) | 0.003(0.001) | 0.045(0.013) | 0.040(0.002) | |
| LIMMA | 329.5(51.9) | 309.9(21.6) | 4480.4(31.8) | 0.380(0.043) | 0.004(0.007) | 0.053(0.045) | 0.041(0.004) | |
| EBA | 190.4(6.7) | 184.2(6.7) | 4493.7(1.9) | 0.632(0.013) | 0.001(0.000) | 0.033(0.010) | 0.066(0.001) |
#Empirical estimates of the standard deviation were reported in the parentheses
Performance evaluation under Study II (Setting 1), (G=5000, 500 DE gene)
| Method | CDE | CCDE | CCEE | FNR | FPR | FDR | FNDR |
|---|---|---|---|---|---|---|---|
| twocri. ( | 785.1(8.3) | 465.6(2.5) | 4180.5(7.6) | 0.069(0.005) | 0.071(0.002) | 0.407(0.006) | 0.008 (0.001) |
| ( | 509.4(6.8) | 421.3(4.6) | 4411.9(4.1) | 0.157(0.009) | 0.019(0.001) | 0.173(0.007) | 0.018(0.001) |
| ( | 344.2(8.2) | 328.8(6.3) | 4484.6(2.6) | 0.342(0.013) | 0.003(0.001) | 0.044(0.007) | 0.037(0.001) |
| edgeR 1( | 289.7(17.6) | 278.1(16.5) | 4488.3(3.6) | 0.444(0.033) | 0.003(0.001) | 0.040(0.011) | 0.047(0.003) |
| edgeR 2( | 290.6(18.1) | 276.4(16.8) | 4485.8(3.8) | 0.447(0.034) | 0.003(0.001) | 0.049(0.012) | 0.047(0.003) |
| DESeq( | 297.2(21.3) | 265.9(18.4) | 4468.7(5.6) | 0.468(0.037) | 0.007(0.001) | 0.105(0.016) | 0.050(0.002) |
| BaySeq( | 203.1(22.8) | 203.0(22.8) | 4499.9(0.2) | 0.594(0.046) | 0.000(0.000) | 0.000(0.001) | 0.062(0.004) |
| NBPSeq( | 248.3(20.5) | 239.8(19.3) | 4491.5(4.0) | 0.520(0.039) | 0.002(0.001) | 0.034(0.015) | 0.055(0.004) |
| EBSeq( | 303.7(18.8) | 257.7(14.9) | 4454.0(6.8) | 0.485(0.030) | 0.010(0.002) | 0.151(0.017) | 0.052(0.003) |
| NOISeq( | 303.1(19.1) | 294.4(17.6) | 4491.3(3.2) | 0.411(0.035) | 0.002(0.001) | 0.028(0.010) | 0.044(0.004) |
| SAMSeq( | 134.2(45.2) | 126.1(43.2) | 4491.9(3.4) | 0.748(0.086) | 0.002(0.001) | 0.061(0.022) | 0.077(0.008) |
| TSPM( | 85.4 (19.2) | 58.7 (15.4) | 4473.3(6.5) | 0.883(0.031) | 0.006(0.001) | 0.316(0.056) | 0.090(0.003) |
#Empirical estimates of the standard deviation were reported in the parentheses
edgeR 1estimates the common dispersion parameter for all tags; edgeR 2 estimates the tag-wise dispersion parameters
denotes the FDR
Performance evaluation under Study II (Setting II), (G=5000, 500 DE gene)
| Method | CDE | CCDE | CCEE | FNR | FPR | FDR | FNDR |
|---|---|---|---|---|---|---|---|
| twocri.( | 654.6(5.4) | 460.9(2.3) | 4306.3(5.2) | 0.078(0.005) | 0.043(0.001) | 0.295(0.006) | 0.009(0.000) |
| ( | 490.0(3.9) | 434.1(2.4) | 4444.2(3.4) | 0.132(0.005) | 0.012(0.001) | 0.114(0.006) | 0.014(0.001) |
| ( | 415.7(5.0) | 400.6(3.5) | 4484.9(2.7) | 0.199(0.007) | 0.003(0.001) | 0.036(0.006) | 0.022(0.001) |
| edgeR 1( | 420.0(8.6) | 411.7(8.4) | 4491.7(2.8) | 0.177(0.017) | 0.002(0.001) | 0.020(0.001) | 0.020(0.002) |
| edgeR 2( | 399.4(10.2) | 386.3(9.8) | 4486.9(4.6) | 0.227(0.020) | 0.003(0.001) | 0.033(0.011) | 0.025(0.002) |
| DESeq( | 443.6(15.9) | 409.3(15.1) | 4465.8(5.3) | 0.181(0.030) | 0.008(0.001) | 0.077(0.011) | 0.020(0.003) |
| BaySeq( | 331.0(15.5) | 327.0(14.9) | 4496.0(2.2) | 0.346(0.030) | 0.001(0.000) | 0.012(0.006) | 0.037(0.003) |
| NBPSeq( | 422.3(7.9) | 412.7(7.9) | 4490.4(3.1) | 0.175(0.016) | 0.002(0.001) | 0.023(0.007) | 0.019(0.002) |
| EBSeq( | 332.9(14.0) | 248.4(11.1) | 4415.6(9.4) | 0.503(0.022) | 0.019(0.002) | 0.253(0.023) | 0.054(0.002) |
| NOISeq( | 196.8(11.0) | 191.4(10.8) | 4494.6(2.5) | 0.617(0.022) | 0.001(0.001) | 0.028(0.013) | 0.064(0.002) |
| SAMSeq( | 274.3(15.7) | 212.1(7.7) | 4437.8(10.9) | 0.576(0.015) | 0.014(0.002) | 0.226(0.029) | 0.061(0.001) |
| TSPM( | 129.9(10.5) | 80.1 (8.9) | 4450.2(7.1) | 0.840(0.018) | 0.011(0.002) | 0.383(0.046) | 0.086(0.002) |
#Empirical estimates of the standard deviation were reported in the parentheses
edgeR 1 estimates common dispersion parameter for all tags; edgeR 2 estimats tag-wise dispersion parameters
denotes the false discovery rate
Fig. 2Number of identified DE genes out of 16471 genes from real data analysis. Two venndiagrams present the overlapping among the DE genes identified separately by Method I/II, SAM, and EBarrays with the false discovery rate controlled at 0.05 from the real data