Literature DB >> 21397683

Towards better accuracy for missing value estimation of epistatic miniarray profiling data by a novel ensemble approach.

Xiao-Yong Pan1, Ye Tian, Yan Huang, Hong-Bin Shen.   

Abstract

Epistatic miniarray profiling (E-MAP) is a powerful tool for analyzing gene functions and their biological relevance. However, E-MAP data suffers from large proportion of missing values, which often results in misleading and biased analysis results. It is urgent to develop effective missing value estimation methods for E-MAP. Although several independent algorithms can be applied to achieve this goal, their performance varies significantly on different datasets, indicating different algorithms having their own advantages and disadvantages. In this paper, we propose a novel ensemble approach EMDI based on the high-level diversity to impute missing values that consists of two global and four local base estimators. Experimental results on five E-MAP datasets show that EMDI outperforms all single base algorithms, demonstrating an appropriate combination providing complementarity among different methods. Comparison results between several fusion strategies also demonstrate that the proposed high-level diversity scheme is superior to others. EMDI is freely available at www.csbio.sjtu.edu.cn/bioinf/EMDI/.
Copyright © 2011 Elsevier Inc. All rights reserved.

Mesh:

Year:  2011        PMID: 21397683     DOI: 10.1016/j.ygeno.2011.03.001

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  9 in total

1.  Data Imputation in Epistatic MAPs by Network-Guided Matrix Completion.

Authors:  Marinka Žitnik; Blaž Zupan
Journal:  J Comput Biol       Date:  2015-02-06       Impact factor: 1.479

2.  An Algorithm for Creating Virtual Controls Using Integrated and Harmonized Longitudinal Data.

Authors:  William B Hansen; Shyh-Huei Chen; Santiago Saldana; Edward H Ip
Journal:  Eval Health Prof       Date:  2018-05-03       Impact factor: 2.651

3.  SAWRPI: A Stacking Ensemble Framework With Adaptive Weight for Predicting ncRNA-Protein Interactions Using Sequence Information.

Authors:  Zhong-Hao Ren; Chang-Qing Yu; Li-Ping Li; Zhu-Hong You; Yong-Jian Guan; Yue-Chao Li; Jie Pan
Journal:  Front Genet       Date:  2022-02-28       Impact factor: 4.599

4.  RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.

Authors:  Xiaoyong Pan; Hong-Bin Shen
Journal:  BMC Bioinformatics       Date:  2017-02-28       Impact factor: 3.169

5.  A Deep Learning Framework for Robust and Accurate Prediction of ncRNA-Protein Interactions Using Evolutionary Information.

Authors:  Hai-Cheng Yi; Zhu-Hong You; De-Shuang Huang; Xiao Li; Tong-Hai Jiang; Li-Ping Li
Journal:  Mol Ther Nucleic Acids       Date:  2018-03-09       Impact factor: 8.886

6.  LncRBase V.2: an updated resource for multispecies lncRNAs and ClinicLSNP hosting genetic variants in lncRNAs for cancer patients.

Authors:  Troyee Das; Aritra Deb; Sibun Parida; Sudip Mondal; Sunirmal Khatua; Zhumur Ghosh
Journal:  RNA Biol       Date:  2020-10-28       Impact factor: 4.652

7.  A hybrid imputation approach for microarray missing value estimation.

Authors:  Huihui Li; Changbo Zhao; Fengfeng Shao; Guo-Zheng Li; Xiao Wang
Journal:  BMC Genomics       Date:  2015-08-17       Impact factor: 3.969

8.  Searching for synergies: matrix algebraic approaches for efficient pair screening.

Authors:  Philip Gerlee; Linnéa Schmidt; Naser Monsefi; Teresia Kling; Rebecka Jörnsten; Sven Nelander
Journal:  PLoS One       Date:  2013-07-25       Impact factor: 3.240

9.  IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction.

Authors:  Xiaoyong Pan; Yong-Xian Fan; Junchi Yan; Hong-Bin Shen
Journal:  BMC Genomics       Date:  2016-08-09       Impact factor: 3.969

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.