Literature DB >> 34472591

Comparison of imputation and imputation-free methods for statistical analysis of mass spectrometry data with missing data.

Sandra Taylor1, Matthew Ponzini1, Machelle Wilson1, Kyoungmi Kim1.   

Abstract

Missing values are common in high-throughput mass spectrometry data. Two strategies are available to address missing values: (i) eliminate or impute the missing values and apply statistical methods that require complete data and (ii) use statistical methods that specifically account for missing values without imputation (imputation-free methods). This study reviews the effect of sample size and percentage of missing values on statistical inference for multiple methods under these two strategies. With increasing missingness, the ability of imputation and imputation-free methods to identify differentially and non-differentially regulated compounds in a two-group comparison study declined. Random forest and k-nearest neighbor imputation combined with a Wilcoxon test performed well in statistical testing for up to 50% missingness with little bias in estimating the effect size. Quantile regression imputation accompanied with a Wilcoxon test also had good statistical testing outcomes but substantially distorted the difference in means between groups. None of the imputation-free methods performed consistently better for statistical testing than imputation methods.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  imputation; mass spectrometry; metabolomics; missing data; sample size

Mesh:

Year:  2022        PMID: 34472591      PMCID: PMC8769695          DOI: 10.1093/bib/bbab353

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   13.994


  23 in total

1.  The influence of missing value imputation on detection of differentially expressed genes from microarray data.

Authors:  Ida Scheel; Magne Aldrin; Ingrid K Glad; Ragnhild Sørum; Heidi Lyng; Arnoldo Frigessi
Journal:  Bioinformatics       Date:  2005-10-10       Impact factor: 6.937

2.  A statistical framework for protein quantitation in bottom-up MS-based proteomics.

Authors:  Yuliya Karpievitch; Jeff Stanley; Thomas Taverner; Jianhua Huang; Joshua N Adkins; Charles Ansong; Fred Heffron; Thomas O Metz; Wei-Jun Qian; Hyunjin Yoon; Richard D Smith; Alan R Dabney
Journal:  Bioinformatics       Date:  2009-06-17       Impact factor: 6.937

3.  Protein quantification in label-free LC-MS experiments.

Authors:  Timothy Clough; Melissa Key; Ilka Ott; Susanne Ragg; Gunther Schadow; Olga Vitek
Journal:  J Proteome Res       Date:  2009-11       Impact factor: 4.466

Review 4.  Review, evaluation, and discussion of the challenges of missing value imputation for mass spectrometry-based label-free global proteomics.

Authors:  Bobbie-Jo M Webb-Robertson; Holli K Wiberg; Melissa M Matzke; Joseph N Brown; Jing Wang; Jason E McDermott; Richard D Smith; Karin D Rodland; Thomas O Metz; Joel G Pounds; Katrina M Waters
Journal:  J Proteome Res       Date:  2015-04-22       Impact factor: 4.466

5.  Proper imputation of missing values in proteomics datasets for differential expression analysis.

Authors:  Mingyi Liu; Ashok Dongre
Journal:  Brief Bioinform       Date:  2021-05-20       Impact factor: 11.622

6.  Missing value imputation for microarray data: a comprehensive comparison study and a web tool.

Authors:  Chia-Chun Chiu; Shih-Yao Chan; Chung-Ching Wang; Wei-Sheng Wu
Journal:  BMC Syst Biol       Date:  2013-12-13

7.  A multiple imputation method based on weighted quantile regression models for longitudinal censored biomarker data with missing values at early visits.

Authors:  MinJae Lee; Mohammad H Rahbar; Matthew Brown; Lianne Gensler; Michael Weisman; Laura Diekman; John D Reveille
Journal:  BMC Med Res Methodol       Date:  2018-01-11       Impact factor: 4.615

8.  Plasma metabolites and lipids associate with kidney function and kidney volume in hypertensive ADPKD patients early in the disease course.

Authors:  Kyoungmi Kim; Josephine F Trott; Guimin Gao; Arlene Chapman; Robert H Weiss
Journal:  BMC Nephrol       Date:  2019-02-25       Impact factor: 2.388

9.  Mealtime, temporal, and daily variability of the human urinary and plasma metabolomes in a tightly controlled environment.

Authors:  Kyoungmi Kim; Christine Mall; Sandra L Taylor; Stacie Hitchcock; Chen Zhang; Hiromi I Wettersten; A Daniel Jones; Arlene Chapman; Robert H Weiss
Journal:  PLoS One       Date:  2014-01-24       Impact factor: 3.240

10.  Missing Value Imputation Approach for Mass Spectrometry-based Metabolomics Data.

Authors:  Runmin Wei; Jingye Wang; Mingming Su; Erik Jia; Shaoqiu Chen; Tianlu Chen; Yan Ni
Journal:  Sci Rep       Date:  2018-01-12       Impact factor: 4.379

View more
  1 in total

1.  Imputation of Missing Values for Multi-Biospecimen Metabolomics Studies: Bias and Effects on Statistical Validity.

Authors:  Machelle D Wilson; Matthew D Ponzini; Sandra L Taylor; Kyoungmi Kim
Journal:  Metabolites       Date:  2022-07-21
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.