Literature DB >> 17920334

Sequential imputation for missing values.

Sabine Verboven1, Karlien Vanden Branden, Peter Goos.   

Abstract

As missing values are often encountered in gene expression data, many imputation methods have been developed to substitute these unknown values with estimated values. Despite the presence of many imputation methods, these available techniques have some disadvantages. Some imputation techniques constrain the imputation of missing values to a limited set of genes, whereas other imputation methods optimise a more global criterion whereby the computation time of the method becomes infeasible. Others might be fast but inaccurate. Therefore in this paper a new, fast and accurate estimation procedure, called SEQimpute, is proposed. By introducing the idea of minimisation of a statistical distance rather than a Euclidean distance the method is intrinsically different from the thus far existing imputation methods. Moreover, this newly proposed method can be easily embedded in a multiple imputation technique which is better suited to highlight the uncertainties about the missing value estimates. A comparative study is performed to assess the estimation of the missing values by different imputation approaches. The proposed imputation method is shown to outperform some of the existing imputation methods in terms of accuracy and computation speed.

Mesh:

Year:  2007        PMID: 17920334     DOI: 10.1016/j.compbiolchem.2007.07.001

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  6 in total

1.  ImShot: An Open-Source Software for Probabilistic Identification of Proteins In Situ and Visualization of Proteomics Data.

Authors:  Wasim Aftab; Shibojyoti Lahiri; Axel Imhof
Journal:  Mol Cell Proteomics       Date:  2022-05-13       Impact factor: 7.381

2.  Identification of Rare Variants in Metabolites of the Carnitine Pathway by Whole Genome Sequencing Analysis.

Authors:  Akram Yazdani; Azam Yazdani; Xiaoming Liu; Eric Boerwinkle
Journal:  Genet Epidemiol       Date:  2016-06-03       Impact factor: 2.135

3.  DrImpute: imputing dropout events in single cell RNA sequencing data.

Authors:  Wuming Gong; Il-Youp Kwak; Pruthvi Pota; Naoko Koyano-Nakagawa; Daniel J Garry
Journal:  BMC Bioinformatics       Date:  2018-06-08       Impact factor: 3.169

4.  Characterization of the Secretome, Transcriptome, and Proteome of Human β Cell Line EndoC-βH1.

Authors:  Maria Ryaboshapkina; Kevin Saitoski; Ghaith M Hamza; Andrew F Jarnuczak; Séverine Pechberty; Claire Berthault; Kaushik Sengupta; Christina Rye Underwood; Shalini Andersson; Raphael Scharfmann
Journal:  Mol Cell Proteomics       Date:  2022-04-02       Impact factor: 7.381

5.  Using Machine Learning to Identify Biomarkers Affecting Fat Deposition in Pigs by Integrating Multisource Transcriptome Information.

Authors:  Huatao Liu; Kai Xing; Yifan Jiang; Yibing Liu; Chuduan Wang; Xiangdong Ding
Journal:  J Agric Food Chem       Date:  2022-08-11       Impact factor: 5.895

Review 6.  A Review of Integrative Imputation for Multi-Omics Datasets.

Authors:  Meng Song; Jonathan Greenbaum; Joseph Luttrell; Weihua Zhou; Chong Wu; Hui Shen; Ping Gong; Chaoyang Zhang; Hong-Wen Deng
Journal:  Front Genet       Date:  2020-10-15       Impact factor: 4.599

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.