Literature DB >> 29743963

A MIXED-EFFECTS MODEL FOR INCOMPLETE DATA FROM LABELING-BASED QUANTITATIVE PROTEOMICS EXPERIMENTS.

Lin S Chen1, Jiebiao Wang1, Xianlong Wang2, Pei Wang3.   

Abstract

In mass spectrometry (MS) based quantitative proteomics research, the emerging iTRAQ (isobaric tag for relative and absolute quantitation) and TMT (tandem mass tags) techniques have been widely adopted for high throughput protein profiling. In a typical iTRAQ/TMT proteomics study, samples are grouped into batches, and each batch is processed by one multiplex experiment, in which the abundances of thousands of proteins/peptides in a batch of samples can be measured simultaneously. The multiplex labeling technique greatly enhances the throughput of protein quantification. However, the technical variation across different iTRAQ/TMT multiplex experiments is often large due to the dynamic nature of MS instruments. This leads to strong batch effects in the iTRAQ/TMT data. Moreover, the iTRAQ/TMT data often contain substantial batch-level nonignorable missing entries. Specifically, the abundance measures of a given protein/peptide are often either observed or missing altogether in all the samples from the same batch, with the missing probability depending on the combined batch-level abundances. We term this unique missing-data mechanism as the Batch-level Abundance-Dependent Missing-data Mechanism (BADMM). We introduce a new method- mixEMM-for analyzing iTRAQ/TMT data with batch effects and batch-level nonignorable missingness. The mixEMM method employs a linear mixed-effects model and explicitly models the batch effects and the BADMM. With simulation studies, we showed that, compared with existing approaches that utilize relative abundances and ignore the missing batches under the missing-completely-at-random assumption, the mixEMM method achieves more accurate parameter estimation and inference. We applied the method to an iTRAQ proteomics data from a breast cancer study and identified phosphopeptides differentially expressed between different breast cancer subtypes. The method can be applied to general clustered data with cluster-level nonignorable missing-data mechanisms.

Entities:  

Keywords:  Batch-level Abundance-Dependent Missing-data Mechanism (BADMM); Mixed-effects models; the expectation-conditional-maximization ( ECM) algorithm

Year:  2017        PMID: 29743963      PMCID: PMC5937554          DOI: 10.1214/16-AOAS994

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  29 in total

1.  Comparison of microarray designs for class comparison and class discovery.

Authors:  K Dobbin; R Simon
Journal:  Bioinformatics       Date:  2002-11       Impact factor: 6.937

2.  Addressing accuracy and precision issues in iTRAQ quantitation.

Authors:  Natasha A Karp; Wolfgang Huber; Pawel G Sadowski; Philip D Charles; Svenja V Hester; Kathryn S Lilley
Journal:  Mol Cell Proteomics       Date:  2010-04-10       Impact factor: 5.911

3.  Thermal proteome profiling for unbiased identification of direct and indirect drug targets using multiplexed quantitative mass spectrometry.

Authors:  Holger Franken; Toby Mathieson; Dorothee Childs; Gavain M A Sweetman; Thilo Werner; Ina Tögel; Carola Doce; Stephan Gade; Marcus Bantscheff; Gerard Drewes; Friedrich B M Reinhard; Wolfgang Huber; Mikhail M Savitski
Journal:  Nat Protoc       Date:  2015-09-17       Impact factor: 13.491

4.  Protein labeling by iTRAQ: a new tool for quantitative mass spectrometry in proteome research.

Authors:  Sebastian Wiese; Kai A Reidegeld; Helmut E Meyer; Bettina Warscheid
Journal:  Proteomics       Date:  2007-02       Impact factor: 3.984

5.  Statistical analysis of relative labeled mass spectrometry data from complex samples using ANOVA.

Authors:  Ann L Oberg; Douglas W Mahoney; Jeanette E Eckel-Passow; Christopher J Malone; Russell D Wolfinger; Elizabeth G Hill; Leslie T Cooper; Oyere K Onuma; Craig Spiro; Terry M Therneau; H Robert Bergen
Journal:  J Proteome Res       Date:  2008-01-04       Impact factor: 4.466

6.  Ion coalescence of neutron encoded TMT 10-plex reporter ions.

Authors:  Thilo Werner; Gavain Sweetman; Maria Fälth Savitski; Toby Mathieson; Marcus Bantscheff; Mikhail M Savitski
Journal:  Anal Chem       Date:  2014-03-11       Impact factor: 6.986

7.  A penalized EM algorithm incorporating missing data mechanism for Gaussian parameter estimation.

Authors:  Lin S Chen; Ross L Prentice; Pei Wang
Journal:  Biometrics       Date:  2014-01-28       Impact factor: 2.571

8.  Neural crest transcription factor Sox10 is preferentially expressed in triple-negative and metaplastic breast carcinomas.

Authors:  Ashley Cimino-Mathews; Andrea P Subhawong; Hillary Elwood; Hind Nassar Warzecha; Rajni Sharma; Ben Ho Park; Janis M Taube; Peter B Illei; Pedram Argani
Journal:  Hum Pathol       Date:  2012-12-20       Impact factor: 3.466

9.  STRING v10: protein-protein interaction networks, integrated over the tree of life.

Authors:  Damian Szklarczyk; Andrea Franceschini; Stefan Wyder; Kristoffer Forslund; Davide Heller; Jaime Huerta-Cepas; Milan Simonovic; Alexander Roth; Alberto Santos; Kalliopi P Tsafou; Michael Kuhn; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2014-10-28       Impact factor: 16.971

Review 10.  Isobaric labeling-based relative quantification in shotgun proteomics.

Authors:  Navin Rauniyar; John R Yates
Journal:  J Proteome Res       Date:  2014-11-04       Impact factor: 4.466

View more
  8 in total

1.  ESTIMATION AND INFERENCE IN METABOLOMICS WITH NON-RANDOM MISSING DATA AND LATENT FACTORS.

Authors:  Chris McKennan; Carole Ober; Dan Nicolae
Journal:  Ann Appl Stat       Date:  2020-06-29       Impact factor: 2.083

2.  Integrative Proteo-genomic Analysis to Construct CNA-protein Regulatory Map in Breast and Ovarian Tumors.

Authors:  Weiping Ma; Lin S Chen; Umut Özbek; Sung Won Han; Chenwei Lin; Amanda G Paulovich; Hua Zhong; Pei Wang
Journal:  Mol Cell Proteomics       Date:  2019-07-07       Impact factor: 5.911

3.  Identifying candidate genes and drug targets for Alzheimer's disease by an integrative network approach using genetic and brain region-specific proteomic data.

Authors:  Andi Liu; Astrid M Manuel; Yulin Dai; Brisa S Fernandes; Nitesh Enduru; Peilin Jia; Zhongming Zhao
Journal:  Hum Mol Genet       Date:  2022-09-29       Impact factor: 5.121

4.  Estimating and accounting for unobserved covariates in high-dimensional correlated data.

Authors:  Chris McKennan; Dan Nicolae
Journal:  J Am Stat Assoc       Date:  2020-06-30       Impact factor: 4.369

5.  Using multivariate mixed-effects selection models for analyzing batch-processed proteomics data with non-ignorable missingness.

Authors:  Jiebiao Wang; Pei Wang; Donald Hedeker; Lin S Chen
Journal:  Biostatistics       Date:  2019-10-01       Impact factor: 5.899

6.  Evaluation of Differential Peptide Loading on Tandem Mass Tag-Based Proteomic and Phosphoproteomic Data Quality.

Authors:  James A Sanford; Yang Wang; Joshua R Hansen; Marina A Gritsenko; Karl K Weitz; Tyler J Sagendorf; Cristina E Tognon; Vladislav A Petyuk; Wei-Jun Qian; Tao Liu; Brian J Druker; Karin D Rodland; Paul D Piehowski
Journal:  J Am Soc Mass Spectrom       Date:  2021-11-23       Impact factor: 3.109

7.  Assessment of TMT Labeling Efficiency in Large-Scale Quantitative Proteomics: The Critical Effect of Sample pH.

Authors:  Chelsea Hutchinson-Bunch; James A Sanford; Joshua R Hansen; Marina A Gritsenko; Karin D Rodland; Paul D Piehowski; Wei-Jun Qian; Joshua N Adkins
Journal:  ACS Omega       Date:  2021-05-06

8.  The Mount Sinai cohort of large-scale genomic, transcriptomic and proteomic data in Alzheimer's disease.

Authors:  Minghui Wang; Noam D Beckmann; Panos Roussos; Erming Wang; Xianxiao Zhou; Qian Wang; Chen Ming; Ryan Neff; Weiping Ma; John F Fullard; Mads E Hauberg; Jaroslav Bendl; Mette A Peters; Ben Logsdon; Pei Wang; Milind Mahajan; Lara M Mangravite; Eric B Dammer; Duc M Duong; James J Lah; Nicholas T Seyfried; Allan I Levey; Joseph D Buxbaum; Michelle Ehrlich; Sam Gandy; Pavel Katsel; Vahram Haroutunian; Eric Schadt; Bin Zhang
Journal:  Sci Data       Date:  2018-09-11       Impact factor: 6.444

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.