Literature DB >> 26166210

A quantum leap in the reproducibility, precision, and sensitivity of gene expression profile analysis even when sample size is extremely small.

Kevin Lim1, Zhenhua Li2, Kwok Pui Choi3, Limsoon Wong1.   

Abstract

Transcript-level quantification is often measured across two groups of patients to aid the discovery of biomarkers and detection of biological mechanisms involving these biomarkers. Statistical tests lack power and false discovery rate is high when sample size is small. Yet, many experiments have very few samples (≤ 5). This creates the impetus for a method to discover biomarkers and mechanisms under very small sample sizes. We present a powerful method, ESSNet, that is able to identify subnetworks consistently across independent datasets of the same disease phenotypes even under very small sample sizes. The key idea of ESSNet is to fragment large pathways into smaller subnetworks and compute a statistic that discriminates the subnetworks in two phenotypes. We do not greedily select genes to be included based on differential expression but rely on gene-expression-level ranking within a phenotype, which is shown to be stable even under extremely small sample sizes. We test our subnetworks on null distributions obtained by array rotation; this preserves the gene-gene correlation structure and is suitable for datasets with small sample size allowing us to consistently predict relevant subnetworks even when sample size is small. For most other methods, this consistency drops to less than 10% when we test them on datasets with only two samples from each phenotype, whereas ESSNet is able to achieve an average consistency of 58% (72% when we consider genes within the subnetworks) and continues to be superior when sample size is large. We further show that the subnetworks identified by ESSNet are highly correlated to many references in the biological literature. ESSNet and supplementary material are available at: http://compbio.ddns.comp.nus.edu.sg:8080/essnet .

Entities:  

Keywords:  FSNet; GSEA; Microarray data analysis; PFSNet; SNet; biological pathways; gene expression profiling; subnetworks

Mesh:

Year:  2015        PMID: 26166210     DOI: 10.1142/S0219720015500183

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  4 in total

1.  Bayesian inference with historical data-based informative priors improves detection of differentially expressed genes.

Authors:  Ben Li; Zhaonan Sun; Qing He; Yu Zhu; Zhaohui S Qin
Journal:  Bioinformatics       Date:  2015-10-30       Impact factor: 6.937

2.  Protein complex-based analysis is resistant to the obfuscating consequences of batch effects --- a case study in clinical proteomics.

Authors:  Wilson Wen Bin Goh; Limsoon Wong
Journal:  BMC Genomics       Date:  2017-03-14       Impact factor: 3.969

3.  Significant random signatures reveals new biomarker for breast cancer.

Authors:  Elnaz Saberi Ansar; Changiz Eslahchii; Mahsa Rahimi; Lobat Geranpayeh; Marzieh Ebrahimi; Rosa Aghdam; Gwenneg Kerdivel
Journal:  BMC Med Genomics       Date:  2019-11-08       Impact factor: 3.063

4.  Weighted-SAMGSR: combining significance analysis of microarray-gene set reduction algorithm with pathway topology-based weights to select relevant genes.

Authors:  Suyan Tian; Howard H Chang; Chi Wang
Journal:  Biol Direct       Date:  2016-09-29       Impact factor: 4.540

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.