Literature DB >> 24718098

Imputing missing values for genetic interaction data.

Yishu Wang1, Lin Wang1, Dejie Yang2, Minghua Deng3.   

Abstract

BACKGROUND: Epistatic Miniarray Profiles (EMAP) enable the research of genetic interaction as an important method to construct large-scale genetic interaction networks. However, a high proportion of missing values frequently poses problems in EMAP data analysis since such missing values hinder downstream analysis. While some imputation approaches have been available to EMAP data, we adopted an improved SVD modeling procedure to impute the missing values in EMAP data which has resulted in a higher accuracy rate compared with existing methods.
RESULTS: The improved SVD imputation method adopts an effective soft-threshold to the SVD approach which has been shown to be the best model to impute genetic interaction data when compared with a number of advanced imputation methods. Imputation methods also improve the clustering results of EMAP datasets. Thus, after applying our imputation method on the EMAP dataset, more meaningful modules, known pathways and protein complexes could be detected.
CONCLUSION: While the phenomenon of missing data unavoidably complicates EMAP data, our results showed that we could complete the original dataset by the Soft-SVD approach to accurately recover genetic interactions.
Copyright © 2014 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  EMAP; Genetic interaction; Imputation; Soft-SVD

Mesh:

Year:  2014        PMID: 24718098     DOI: 10.1016/j.ymeth.2014.03.032

Source DB:  PubMed          Journal:  Methods        ISSN: 1046-2023            Impact factor:   3.608


  1 in total

1.  Low-Rank and Sparse Matrix Decomposition for Genetic Interaction Data.

Authors:  Yishu Wang; Dejie Yang; Minghua Deng
Journal:  Biomed Res Int       Date:  2015-07-26       Impact factor: 3.411

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.