Literature DB >> 26896791

Effects of imputation on correlation: implications for analysis of mass spectrometry data from multiple biological matrices.

Sandra L Taylor1, L Renee Ruhaak2, Karen Kelly3, Robert H Weiss4, Kyoungmi Kim5.   

Abstract

With expanded access to, and decreased costs of, mass spectrometry, investigators are collecting and analyzing multiple biological matrices from the same subject such as serum, plasma, tissue and urine to enhance biomarker discoveries, understanding of disease processes and identification of therapeutic targets. Commonly, each biological matrix is analyzed separately, but multivariate methods such as MANOVAs that combine information from multiple biological matrices are potentially more powerful. However, mass spectrometric data typically contain large amounts of missing values, and imputation is often used to create complete data sets for analysis. The effects of imputation on multiple biological matrix analyses have not been studied. We investigated the effects of seven imputation methods (half minimum substitution, mean substitution, k-nearest neighbors, local least squares regression, Bayesian principal components analysis, singular value decomposition and random forest), on the within-subject correlation of compounds between biological matrices and its consequences on MANOVA results. Through analysis of three real omics data sets and simulation studies, we found the amount of missing data and imputation method to substantially change the between-matrix correlation structure. The magnitude of the correlations was generally reduced in imputed data sets, and this effect increased with the amount of missing data. Significant results from MANOVA testing also were substantially affected. In particular, the number of false positives increased with the level of missing data for all imputation methods. No one imputation method was universally the best, but the simple substitution methods (Half Minimum and Mean) consistently performed poorly.
© The Author 2016. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

Keywords:  imputation; mass spectrometry; metabolomics; missing data; multivariate analysis; within-subject correlation

Mesh:

Year:  2017        PMID: 26896791      PMCID: PMC5862252          DOI: 10.1093/bib/bbw010

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  23 in total

1.  A Bayesian missing value estimation method for gene expression profile data.

Authors:  Shigeyuki Oba; Masa-aki Sato; Ichiro Takemasa; Morito Monden; Ken-ichi Matsubara; Shin Ishii
Journal:  Bioinformatics       Date:  2003-11-01       Impact factor: 6.937

2.  MissForest--non-parametric missing value imputation for mixed-type data.

Authors:  Daniel J Stekhoven; Peter Bühlmann
Journal:  Bioinformatics       Date:  2011-10-28       Impact factor: 6.937

3.  Kidney tumor biomarkers revealed by simultaneous multiple matrix metabolomics analysis.

Authors:  Sheila Ganti; Sandra L Taylor; Omran Abu Aboud; Joy Yang; Christopher Evans; Michael V Osier; Danny C Alexander; Kyoungmi Kim; Robert H Weiss
Journal:  Cancer Res       Date:  2012-05-24       Impact factor: 12.701

4.  The influence of missing value imputation on detection of differentially expressed genes from microarray data.

Authors:  Ida Scheel; Magne Aldrin; Ingrid K Glad; Ragnhild Sørum; Heidi Lyng; Arnoldo Frigessi
Journal:  Bioinformatics       Date:  2005-10-10       Impact factor: 6.937

5.  Serum and tissue metabolomics of head and neck cancer.

Authors:  Koichiro Yonezawa; Shin Nishiumi; Shin Nishiumii; Junko Kitamoto-Matsuda; Takeshi Fujita; Koichi Morimoto; Daisuke Yamashita; Miki Saito; Naoki Otsuki; Yasuhiro Irino; Masakazu Shinohara; Masaru Yoshida; Ken-ichi Nibu
Journal:  Cancer Genomics Proteomics       Date:  2013 Sep-Oct       Impact factor: 4.069

6.  Which missing value imputation method to use in expression profiles: a comparative study and two selection schemes.

Authors:  Guy N Brock; John R Shaffer; Richard E Blakesley; Meredith J Lotz; George C Tseng
Journal:  BMC Bioinformatics       Date:  2008-01-10       Impact factor: 3.169

7.  Comparison of squamous cell carcinoma and adenocarcinoma of the lung by metabolomic analysis of tissue-serum pairs.

Authors:  K W Jordan; C B Adkins; L Su; E F Halpern; E J Mark; D C Christiani; L L Cheng
Journal:  Lung Cancer       Date:  2009-06-25       Impact factor: 5.705

8.  Metabolomic biomarkers in serum and urine in women with preeclampsia.

Authors:  Marie Austdal; Ragnhild Bergene Skråstad; Astrid Solberg Gundersen; Rigmor Austgulen; Ann-Charlotte Iversen; Tone Frost Bathen
Journal:  PLoS One       Date:  2014-03-17       Impact factor: 3.240

9.  A four-compartment metabolomics analysis of the liver, muscle, serum, and urine response to polytrauma with hemorrhagic shock following carbohydrate prefeed.

Authors:  Nancy Witowski; Elizabeth Lusczek; Charles Determan; Daniel Lexcen; Kristine Mulier; Beverly Ostrowski; Greg Beilman
Journal:  PLoS One       Date:  2015-04-14       Impact factor: 3.240

10.  Influence of microarrays experiments missing values on the stability of gene groups by hierarchical clustering.

Authors:  Alexandre G de Brevern; Serge Hazout; Alain Malpertuy
Journal:  BMC Bioinformatics       Date:  2004-08-23       Impact factor: 3.169

View more
  17 in total

1.  Metabolomic profiles of plasma, exhaled breath condensate, and saliva are correlated with potential for air toxics detection.

Authors:  Chandresh Nanji Ladva; Rachel Golan; Roby Greenwald; Tianwei Yu; Stefanie Ebelt Sarnat; W Dana Flanders; Karan Uppal; Douglas I Walker; ViLinh Tran; Donghai Liang; Dean P Jones; Jeremy A Sarnat
Journal:  J Breath Res       Date:  2017-12-06       Impact factor: 3.262

2.  Multivariate two-part statistics for analysis of correlated mass spectrometry data from multiple biological specimens.

Authors:  Sandra L Taylor; L Renee Ruhaak; Robert H Weiss; Karen Kelly; Kyoungmi Kim
Journal:  Bioinformatics       Date:  2016-09-04       Impact factor: 6.937

3.  Missing value imputation for LC-MS metabolomics data by incorporating metabolic network and adduct ion relations.

Authors:  Zhuxuan Jin; Jian Kang; Tianwei Yu
Journal:  Bioinformatics       Date:  2018-05-01       Impact factor: 6.937

Review 4.  Use of Metabolomics in Improving Assessment of Dietary Intake.

Authors:  Marta Guasch-Ferré; Shilpa N Bhupathiraju; Frank B Hu
Journal:  Clin Chem       Date:  2017-10-16       Impact factor: 8.327

5.  Machine learning algorithm-generated and multi-center validated melanoma prognostic signature with inspiration for treatment management.

Authors:  Zaoqu Liu; Hui Xu; Siyuan Weng; Chunguang Guo; Qin Dang; Yuyuan Zhang; Yuqing Ren; Long Liu; Libo Wang; Xiaoyong Ge; Zhe Xing; Jian Zhang; Peng Luo; Xinwei Han
Journal:  Cancer Immunol Immunother       Date:  2022-08-23       Impact factor: 6.630

6.  A multiple phenotype imputation method for genetic diversity and core collection in Taiwanese vegetable soybean.

Authors:  Yen-Hsiang Huang; Hsin-Mei Ku; Chong-An Wang; Ling-Yu Chen; Shan-Syue He; Shu Chen; Po-Chun Liao; Pin-Yuan Juan; Chung-Feng Kao
Journal:  Front Plant Sci       Date:  2022-09-02       Impact factor: 6.627

7.  Metabolomic Profiling of the Malaria Box Reveals Antimalarial Target Pathways.

Authors:  Erik L Allman; Heather J Painter; Jasmeet Samra; Manuela Carrasquilla; Manuel Llinás
Journal:  Antimicrob Agents Chemother       Date:  2016-10-21       Impact factor: 5.191

8.  Distribution based nearest neighbor imputation for truncated high dimensional data with applications to pre-clinical and clinical metabolomics studies.

Authors:  Jasmit S Shah; Shesh N Rai; Andrew P DeFilippis; Bradford G Hill; Aruni Bhatnagar; Guy N Brock
Journal:  BMC Bioinformatics       Date:  2017-02-20       Impact factor: 3.169

Review 9.  Metabolomics and Multi-Omics Integration: A Survey of Computational Methods and Resources.

Authors:  Tara Eicher; Garrett Kinnebrew; Andrew Patt; Kyle Spencer; Kevin Ying; Qin Ma; Raghu Machiraju; And Ewy A Mathé
Journal:  Metabolites       Date:  2020-05-15

10.  A MicroRNA Expression Signature as Prognostic Marker for Oropharyngeal Squamous Cell Carcinoma.

Authors:  Xinyi Liu; Ping Liu; Rebecca D Chernock; Zhenming Yang; Krystle A Lang Kuhs; James S Lewis; Jingqin Luo; Hua Li; Hiram A Gay; Wade L Thorstad; Xiaowei Wang
Journal:  J Natl Cancer Inst       Date:  2021-06-01       Impact factor: 13.506

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.