Literature DB >> 23929862

GENESHIFT: a nonparametric approach for integrating microarray gene expression data based on the inner product as a distance measure between the distributions of genes.

Cosmin Lazar1, Jonatan Taminau, Stijn Meganck, David Steenhoff, Alain Coletta, David Y Weiss Solís, Colin Molter, Robin Duque, Hugues Bersini, Ann Nowé.   

Abstract

The potential of microarray gene expression (MAGE) data is only partially explored due to the limited number of samples in individual studies. This limitation can be surmounted by merging or integrating data sets originating from independent MAGE experiments, which are designed to study the same biological problem. However, this process is hindered by batch effects that are study-dependent and result in random data distortion; therefore numerical transformations are needed to render the integration of different data sets accurate and meaningful. Our contribution in this paper is two-fold. First we propose GENESHIFT, a new nonparametric batch effect removal method based on two key elements from statistics: empirical density estimation and the inner product as a distance measure between two probability density functions; second we introduce a new validation index of batch effect removal methods based on the observation that samples from two independent studies drawn from a same population should exhibit similar probability density functions. We evaluated and compared the GENESHIFT method with four other state-of-the-art methods for batch effect removal: Batch-mean centering, empirical Bayes or COMBAT, distance-weighted discrimination, and cross-platform normalization. Several validation indices providing complementary information about the efficiency of batch effect removal methods have been employed in our validation framework. The results show that none of the methods clearly outperforms the others. More than that, most of the methods used for comparison perform very well with respect to some validation indices while performing very poor with respect to others. GENESHIFT exhibits robust performances and its average rank is the highest among the average ranks of all methods used for comparison.

Mesh:

Year:  2013        PMID: 23929862     DOI: 10.1109/TCBB.2013.12

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  3 in total

1.  Data harmonisation for information fusion in digital healthcare: A state-of-the-art systematic review, meta-analysis and future research directions.

Authors:  Yang Nan; Javier Del Ser; Simon Walsh; Carola Schönlieb; Michael Roberts; Ian Selby; Kit Howard; John Owen; Jon Neville; Julien Guiot; Benoit Ernst; Ana Pastor; Angel Alberich-Bayarri; Marion I Menzel; Sean Walsh; Wim Vos; Nina Flerin; Jean-Paul Charbonnier; Eva van Rikxoort; Avishek Chatterjee; Henry Woodruff; Philippe Lambin; Leonor Cerdá-Alberich; Luis Martí-Bonmatí; Francisco Herrera; Guang Yang
Journal:  Inf Fusion       Date:  2022-06       Impact factor: 17.564

2.  Development of a Drug-Response Modeling Framework to Identify Cell Line Derived Translational Biomarkers That Can Predict Treatment Outcome to Erlotinib or Sorafenib.

Authors:  Bin Li; Hyunjin Shin; Georgy Gulbekyan; Olga Pustovalova; Yuri Nikolsky; Andrew Hope; Marina Bessarabova; Matthew Schu; Elona Kolpakova-Hart; David Merberg; Andrew Dorner; William L Trepicchio
Journal:  PLoS One       Date:  2015-06-24       Impact factor: 3.240

3.  Integrative omics analysis. A study based on Plasmodium falciparum mRNA and protein data.

Authors:  Oana A Tomescu; Diethard Mattanovich; Gerhard G Thallinger
Journal:  BMC Syst Biol       Date:  2014-03-13
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.