Literature DB >> 24246290

Accounting for undetected compounds in statistical analyses of mass spectrometry 'omic studies.

Sandra L Taylor, Gary S Leiserowitz, Kyoungmi Kim.   

Abstract

Mass spectrometry is an important high-throughput technique for profiling small molecular compounds in biological samples and is widely used to identify potential diagnostic and prognostic compounds associated with disease. Commonly, this data generated by mass spectrometry has many missing values resulting when a compound is absent from a sample or is present but at a concentration below the detection limit. Several strategies are available for statistically analyzing data with missing values. The accelerated failure time (AFT) model assumes all missing values result from censoring below a detection limit. Under a mixture model, missing values can result from a combination of censoring and the absence of a compound. We compare power and estimation of a mixture model to an AFT model. Based on simulated data, we found the AFT model to have greater power to detect differences in means and point mass proportions between groups. However, the AFT model yielded biased estimates with the bias increasing as the proportion of observations in the point mass increased while estimates were unbiased with the mixture model except if all missing observations came from censoring. These findings suggest using the AFT model for hypothesis testing and mixture model for estimation. We demonstrated this approach through application to glycomics data of serum samples from women with ovarian cancer and matched controls.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24246290      PMCID: PMC3905689          DOI: 10.1515/sagmb-2013-0021

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  14 in total

1.  Comparisons of two-part models with competitors.

Authors:  P A Lachenbruch
Journal:  Stat Med       Date:  2001-04-30       Impact factor: 2.373

2.  Processing and analysis of GC/LC-MS-based metabolomics data.

Authors:  Elizabeth Want; Perrine Masson
Journal:  Methods Mol Biol       Date:  2011

3.  A statistical framework for protein quantitation in bottom-up MS-based proteomics.

Authors:  Yuliya Karpievitch; Jeff Stanley; Thomas Taverner; Jianhua Huang; Joshua N Adkins; Charles Ansong; Fred Heffron; Thomas O Metz; Wei-Jun Qian; Hyunjin Yoon; Richard D Smith; Alan R Dabney
Journal:  Bioinformatics       Date:  2009-06-17       Impact factor: 6.937

4.  Bioinformatics for mass spectrometry-based metabolomics.

Authors:  David P Enot; Bernd Haas; Klaus M Weinberger
Journal:  Methods Mol Biol       Date:  2011

5.  A mixed gamma model for regression analyses of quantitative assay data.

Authors:  L H Moulton; N A Halsey
Journal:  Vaccine       Date:  1996-08       Impact factor: 3.641

6.  A mixture model with detection limits for regression analyses of antibody response to vaccine.

Authors:  L H Moulton; N A Halsey
Journal:  Biometrics       Date:  1995-12       Impact factor: 2.571

Review 7.  Regulatory networks of glucosinolates shape Arabidopsis thaliana fitness.

Authors:  Meike Burow; Barbara A Halkier; Daniel J Kliebenstein
Journal:  Curr Opin Plant Biol       Date:  2010-03-11       Impact factor: 7.834

8.  Hypothesis tests for point-mass mixture data with application to 'omics data with many zero values.

Authors:  Sandra Taylor; Katherine Pollard
Journal:  Stat Appl Genet Mol Biol       Date:  2009-02-04

9.  Use of log-skew-normal distribution in analysis of continuous data with a discrete component at zero.

Authors:  High Seng Chai; Kent R Bailey
Journal:  Stat Med       Date:  2008-08-15       Impact factor: 2.373

10.  Normalization and missing value imputation for label-free LC-MS analysis.

Authors:  Yuliya V Karpievitch; Alan R Dabney; Richard D Smith
Journal:  BMC Bioinformatics       Date:  2012-11-05       Impact factor: 3.169

View more
  12 in total

1.  Integrated analysis of proteome and transcriptome changes in the mucopolysaccharidosis type VII mouse hippocampus.

Authors:  Michael K Parente; Ramona Rozen; Steven H Seeholzer; John H Wolfe
Journal:  Mol Genet Metab       Date:  2016-03-07       Impact factor: 4.797

2.  Effects of imputation on correlation: implications for analysis of mass spectrometry data from multiple biological matrices.

Authors:  Sandra L Taylor; L Renee Ruhaak; Karen Kelly; Robert H Weiss; Kyoungmi Kim
Journal:  Brief Bioinform       Date:  2017-03-01       Impact factor: 11.622

3.  Multivariate two-part statistics for analysis of correlated mass spectrometry data from multiple biological specimens.

Authors:  Sandra L Taylor; L Renee Ruhaak; Robert H Weiss; Karen Kelly; Kyoungmi Kim
Journal:  Bioinformatics       Date:  2016-09-04       Impact factor: 6.937

4.  Comparison of imputation and imputation-free methods for statistical analysis of mass spectrometry data with missing data.

Authors:  Sandra Taylor; Matthew Ponzini; Machelle Wilson; Kyoungmi Kim
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 13.994

5.  Distribution based nearest neighbor imputation for truncated high dimensional data with applications to pre-clinical and clinical metabolomics studies.

Authors:  Jasmit S Shah; Shesh N Rai; Andrew P DeFilippis; Bradford G Hill; Aruni Bhatnagar; Guy N Brock
Journal:  BMC Bioinformatics       Date:  2017-02-20       Impact factor: 3.169

6.  PolySTest: Robust Statistical Testing of Proteomics Data with Missing Values Improves Detection of Biologically Relevant Features.

Authors:  Veit Schwämmle; Christina E Hagensen; Adelina Rogowska-Wrzesinska; Ole N Jensen
Journal:  Mol Cell Proteomics       Date:  2020-05-18       Impact factor: 5.911

7.  Missing Value Imputation Approach for Mass Spectrometry-based Metabolomics Data.

Authors:  Runmin Wei; Jingye Wang; Mingming Su; Erik Jia; Shaoqiu Chen; Tianlu Chen; Yan Ni
Journal:  Sci Rep       Date:  2018-01-12       Impact factor: 4.379

8.  Differential Abundance Analysis with Bayes Shrinkage Estimation of Variance (DASEV) for Zero-Inflated Proteomic and Metabolomic Data.

Authors:  Zhengyan Huang; Andrew N Lane; Teresa W-M Fan; Richard M Higashi; Heidi L Weiss; Xiangrong Yin; Chi Wang
Journal:  Sci Rep       Date:  2020-01-21       Impact factor: 4.379

9.  BayesMetab: treatment of missing values in metabolomic studies using a Bayesian modeling approach.

Authors:  Jasmit Shah; Guy N Brock; Jeremy Gaskins
Journal:  BMC Bioinformatics       Date:  2019-12-20       Impact factor: 3.169

10.  High-resolution metabolomic profiling of Alzheimer's disease in plasma.

Authors:  Megan M Niedzwiecki; Douglas I Walker; Jennifer Christina Howell; Kelly D Watts; Dean P Jones; Gary W Miller; William T Hu
Journal:  Ann Clin Transl Neurol       Date:  2019-12-11       Impact factor: 4.511

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.