Literature DB >> 32520347

Proper imputation of missing values in proteomics datasets for differential expression analysis.

Mingyi Liu, Ashok Dongre.   

Abstract

Label-free shotgun proteomics is an important tool in biomedical research, where tandem mass spectrometry with data-dependent acquisition (DDA) is frequently used for protein identification and quantification. However, the DDA datasets contain a significant number of missing values (MVs) that severely hinders proper analysis. Existing literature suggests that different imputation methods should be used for the two types of MVs: missing completely at random or missing not at random. However, the simulated or biased datasets utilized by most of such studies offer few clues about the composition and thus proper imputation of MVs in real-life proteomic datasets. Moreover, the impact of imputation methods on downstream differential expression analysis-a critical goal for many biomedical projects-is largely undetermined. In this study, we investigated public DDA datasets of various tissue/sample types to determine the composition of MVs in them. We then developed simulated datasets that imitate the MV profile of real-life datasets. Using such datasets, we compared the impact of various popular imputation methods on the analysis of differentially expressed proteins. Finally, we make recommendations on which imputation method(s) to use for proteomic data beyond just DDA datasets.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  data-dependent acquisition; differential expression analysis; imputation; mass spectrometry; missing values; proteomics

Year:  2021        PMID: 32520347     DOI: 10.1093/bib/bbaa112

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  8 in total

1.  HarmonizR enables data harmonization across independent proteomic datasets with appropriate handling of missing values.

Authors:  Hannah Voß; Simon Schlumbohm; Philip Barwikowski; Marcus Wurlitzer; Matthias Dottermusch; Philipp Neumann; Hartmut Schlüter; Julia E Neumann; Christoph Krisp
Journal:  Nat Commun       Date:  2022-06-20       Impact factor: 17.694

2.  Assessment of label-free quantification and missing value imputation for proteomics in non-human primates.

Authors:  Zeeshan Hamid; Kip D Zimmerman; Hector Guillen-Ahlers; Cun Li; Peter Nathanielsz; Laura A Cox; Michael Olivier
Journal:  BMC Genomics       Date:  2022-07-08       Impact factor: 4.547

3.  Comparison of imputation and imputation-free methods for statistical analysis of mass spectrometry data with missing data.

Authors:  Sandra Taylor; Matthew Ponzini; Machelle Wilson; Kyoungmi Kim
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 13.994

4.  Dissecting platelet proteomics to understand the pathophysiology of immune thrombocytopenia: studies in mouse models.

Authors:  Patricia Martínez-Botía; Marjolein Meinders; Iris M De Cuyper; Johannes A Eble; John W Semple; Laura Gutiérrez
Journal:  Blood Adv       Date:  2022-06-14

5.  iTRAQ-based proteomic analysis of differentially expressed proteins in sera of seronegative and seropositive rheumatoid arthritis patients.

Authors:  Yujue He; Junyu Lin; Jifeng Tang; Ziqing Yu; Qishui Ou; Jinpiao Lin
Journal:  J Clin Lab Anal       Date:  2021-11-23       Impact factor: 2.352

6.  Spatial proteomics reveals subcellular reorganization in human keratinocytes exposed to UVA light.

Authors:  Hellen Paula Valerio; Felipe Gustavo Ravagnani; Angela Paola Yaya Candela; Bruna Dias Carvalho da Costa; Graziella Eliza Ronsein; Paolo Di Mascio
Journal:  iScience       Date:  2022-03-16

7.  Comparative assessment and novel strategy on methods for imputing proteomics data.

Authors:  Minjie Shen; Yi-Tan Chang; Chiung-Ting Wu; Sarah J Parker; Georgia Saylor; Yizhi Wang; Guoqiang Yu; Jennifer E Van Eyk; Robert Clarke; David M Herrington; Yue Wang
Journal:  Sci Rep       Date:  2022-01-20       Impact factor: 4.379

8.  Multiple Imputation Approaches Applied to the Missing Value Problem in Bottom-Up Proteomics.

Authors:  Miranda L Gardner; Michael A Freitas
Journal:  Int J Mol Sci       Date:  2021-09-06       Impact factor: 5.923

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.