Literature DB >> 16675466

Integrated analysis of transcriptomic and proteomic data of Desulfovibrio vulgaris: zero-inflated Poisson regression models to predict abundance of undetected proteins.

Lei Nie1, Gang Wu, Fred J Brockman, Weiwen Zhang.   

Abstract

MOTIVATION: Integrated analysis of global scale transcriptomic and proteomic data can provide important insights into the metabolic mechanisms underlying complex biological systems. However, because the relationship between protein abundance and mRNA expression level is complicated by many cellular and physical processes, sophisticated statistical models need to be developed to capture their relationship.
RESULTS: In this study, we describe a novel data-driven statistical model to integrate whole-genome microarray and proteomic data collected from Desulfovibrio vulgaris grown under three different conditions. Based on the Poisson distribution pattern of proteomic data and the fact that a large number of proteins were undetected (excess zeros), zero-inflated Poisson (ZIP)-based models were proposed to define the correlation pattern between mRNA and protein abundance. In addition, by assuming that there is a probability mass at zero representing unexpressed genes and expressed proteins that were undetected owing to technical limitations, a Potential ZIP model was established. Two significant improvements introduced by this approach are (1) the predicted protein abundance level values for experimentally detected proteins are corrected by considering their mRNA levels and (2) protein abundance values can be predicted for undetected proteins (in the case of this study, approximately 83% of the proteins in the D.vulgaris genome) for better biological interpretation. We demonstrated the use of these statistical models by comparatively analyzing proteomic and microarray results from D.vulgaris grown on lactate-based versus formate-based media. These models correctly predicted increased expression of Ech hydrogenase and decreased expression of Coo hydrogenase for D.vulgaris grown on formate.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16675466     DOI: 10.1093/bioinformatics/btl134

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  18 in total

1.  Reductions in drug use among young people living with HIV.

Authors:  W Scott Comulada; Robert E Weiss; William Cumberland; Mary Jane Rotheram-Borus
Journal:  Am J Drug Alcohol Abuse       Date:  2007       Impact factor: 3.829

2.  Proteomic and transcriptomic analyses of "Candidatus Pelagibacter ubique" describe the first PII-independent response to nitrogen limitation in a free-living Alphaproteobacterium.

Authors:  Daniel P Smith; J Cameron Thrash; Carrie D Nicora; Mary S Lipton; Kristin E Burnum-Johnson; Paul Carini; Richard D Smith; Stephen J Giovannoni
Journal:  MBio       Date:  2013-11-26       Impact factor: 7.867

Review 3.  Global signatures of protein and mRNA expression levels.

Authors:  Raquel de Sousa Abreu; Luiz O Penalva; Edward M Marcotte; Christine Vogel
Journal:  Mol Biosyst       Date:  2009-10-01

4.  Correlation of mRNA expression and protein abundance affected by multiple sequence features related to translational efficiency in Desulfovibrio vulgaris: a quantitative analysis.

Authors:  Lei Nie; Gang Wu; Weiwen Zhang
Journal:  Genetics       Date:  2006-10-08       Impact factor: 4.562

5.  Integration of transcriptomic and proteomic data identifies biological functions in cell populations from human infant lung.

Authors:  Yina Du; Geremy C Clair; Denise Al Alam; Soula Danopoulos; Daniel Schnell; Joseph A Kitzmiller; Ravi S Misra; Soumyaroop Bhattacharya; David Warburton; Thomas J Mariani; Gloria S Pryhuber; Jeffrey A Whitsett; Charles Ansong; Yan Xu
Journal:  Am J Physiol Lung Cell Mol Physiol       Date:  2019-07-03       Impact factor: 5.464

6.  Integrative analysis of transcriptomic and proteomic data of Desulfovibrio vulgaris: a non-linear model to predict abundance of undetected proteins.

Authors:  Wandaliz Torres-García; Weiwen Zhang; George C Runger; Roger H Johnson; Deirdre R Meldrum
Journal:  Bioinformatics       Date:  2009-05-15       Impact factor: 6.937

7.  A Bayesian Semi-parametric Approach for the Differential Analysis of Sequence Counts Data.

Authors:  Michele Guindani; Nuno Sepúlveda; Carlos Daniel Paulino; Peter Müller
Journal:  J R Stat Soc Ser C Appl Stat       Date:  2014-04       Impact factor: 1.864

8.  Correlating gene and protein expression data using Correlated Factor Analysis.

Authors:  Chuen Seng Tan; Agus Salim; Alexander Ploner; Janne Lehtiö; Kee Seng Chia; Yudi Pawitan
Journal:  BMC Bioinformatics       Date:  2009-09-01       Impact factor: 3.169

9.  Transcriptome and proteome exploration to model translation efficiency and protein stability in Lactococcus lactis.

Authors:  Clémentine Dressaire; Christophe Gitton; Pascal Loubière; Véronique Monnet; Isabelle Queinnec; Muriel Cocaign-Bousquet
Journal:  PLoS Comput Biol       Date:  2009-12-18       Impact factor: 4.475

10.  Integrated analysis of transcriptomic and proteomic data.

Authors:  Saad Haider; Ranadip Pal
Journal:  Curr Genomics       Date:  2013-04       Impact factor: 2.236

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.