Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Treatment of missing values for multivariate statistical analysis of gel-based proteomics data.

Literature DB >> 18383008

Treatment of missing values for multivariate statistical analysis of gel-based proteomics data.

Romina Pedreschi¹, Maarten L A T M Hertog, Sebastien C Carpentier, Jeroen Lammertyn, Johan Robben, Jean-Paul Noben, Bart Panis, Rony Swennen, Bart M Nicolaï.

Abstract

The presence of missing values in gel-based proteomics data represents a real challenge if an objective statistical analysis is pursued. Different methods to handle missing values were evaluated and their influence is discussed on the selection of important proteins through multivariate techniques. The evaluated methods consisted of directly dealing with them during the multivariate analysis with the nonlinear estimation by iterative partial least squares (NIPALS) algorithm or imputing them by using either k-nearest neighbor or Bayesian principal component analysis (BPCA) before carrying out the multivariate analysis. These techniques were applied to data obtained from gels stained with classical postrunning dyes and from DIGE gels. Before applying the multivariate techniques, the normality and homoscedasticity assumptions on which parametric tests are based on were tested in order to perform a sound statistical analysis. From the three tested methods to handle missing values in our datasets, BPCA imputation of missing values showed to be the most consistent method.

Mesh：

Year: 2008 PMID： 18383008 DOI： 10.1002/pmic.200700975

Source DB: PubMed Journal: Proteomics ISSN： 1615-9853 Impact factor: 3.984

Keyword Cloud
Cited

18 in total

Treatment of missing values for multivariate statistical analysis of gel-based proteomics data.

Review 1. Image analysis tools and emerging algorithms for expression proteomics.

2. Normalization and statistical analysis of quantitative proteomics data generated by metabolic labeling.

3. Detecting Significant Changes in Protein Abundance.

Review 4. Quality assessment for clinical proteomics.

5. Statistical analysis of variation in the human plasma proteome.

Review 6. Proteomics of plant pathogenic fungi.

7. Multivariate meta-analysis of proteomics data from human prostate and colon tumours.

8. Metabolomic Analysis of the Effect of Postnatal Hypoxia on the Retina in a Newly Born Piglet Model.

9. Putative glycosyltransferases and other plant Golgi apparatus proteins are revealed by LOPIT proteomics.

10. Kernel weighted least square approach for imputing missing values of metabolomics data.