Literature DB >> 16873512

Integrating structured biological data by Kernel Maximum Mean Discrepancy.

Karsten M Borgwardt1, Arthur Gretton, Malte J Rasch, Hans-Peter Kriegel, Bernhard Schölkopf, Alex J Smola.   

Abstract

MOTIVATION: Many problems in data integration in bioinformatics can be posed as one common question: Are two sets of observations generated by the same distribution? We propose a kernel-based statistical test for this problem, based on the fact that two distributions are different if and only if there exists at least one function having different expectation on the two distributions. Consequently we use the maximum discrepancy between function means as the basis of a test statistic. The Maximum Mean Discrepancy (MMD) can take advantage of the kernel trick, which allows us to apply it not only to vectors, but strings, sequences, graphs, and other common structured data types arising in molecular biology.
RESULTS: We study the practical feasibility of an MMD-based test on three central data integration tasks: Testing cross-platform comparability of microarray data, cancer diagnosis, and data-content based schema matching for two different protein function classification schemas. In all of these experiments, including high-dimensional ones, MMD is very accurate in finding samples that were generated from the same distribution, and outperforms its best competitors.
CONCLUSIONS: We have defined a novel statistical test of whether two samples are from the same distribution, compatible with both multivariate and structured data, that is fast, easy to implement, and works well, as confirmed by our experiments. AVAILABILITY: http://www.dbs.ifi.lmu.de/~borgward/MMD.

Entities:  

Mesh:

Year:  2006        PMID: 16873512     DOI: 10.1093/bioinformatics/btl242

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  45 in total

1.  Role of bacterial peptidase F inferred by statistical analysis and further experimental validation.

Authors:  Liliana Lopez Kleine; Véronique Monnet; Christine Pechoux; Alain Trubuil
Journal:  HFSP J       Date:  2008-01-07

2.  Multimodal manifold-regularized transfer learning for MCI conversion prediction.

Authors:  Bo Cheng; Mingxia Liu; Heung-Il Suk; Dinggang Shen; Daoqiang Zhang
Journal:  Brain Imaging Behav       Date:  2015-12       Impact factor: 3.978

3.  Selective Transfer Machine for Personalized Facial Expression Analysis.

Authors:  Fernando De la Torre; Jeffrey F Cohn
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2016-03-28       Impact factor: 6.226

4.  Maximum Mean Discrepancy Based Multiple Kernel Learning for Incomplete Multimodality Neuroimaging Data.

Authors:  Xiaofeng Zhu; Kim-Han Thung; Ehsan Adeli; Yu Zhang; Dinggang Shen
Journal:  Med Image Comput Comput Assist Interv       Date:  2017-09-04

5.  Transfer Extreme Learning Machine with Output Weight Alignment.

Authors:  Shaofei Zang; Yuhu Cheng; Xuesong Wang; Yongyi Yan
Journal:  Comput Intell Neurosci       Date:  2021-02-11

6.  Selective Transfer Machine for Personalized Facial Action Unit Detection.

Authors:  Wen-Sheng Chu; Fernando De la Torre; Jeffery F Cohn
Journal:  Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit       Date:  2013

7.  Batch Mode Active Sampling based on Marginal Probability Distribution Matching.

Authors:  Rita Chattopadhyay; Zheng Wang; Wei Fan; Ian Davidson; Sethuraman Panchanathan; Jieping Ye
Journal:  KDD       Date:  2012

8.  Distribution-free tests of independence in high dimensions.

Authors:  Fang Han; Shizhe Chen; Han Liu
Journal:  Biometrika       Date:  2017-10-03       Impact factor: 2.445

9.  Gating mass cytometry data by deep learning.

Authors:  Huamin Li; Uri Shaham; Kelly P Stanton; Yi Yao; Ruth R Montgomery; Yuval Kluger
Journal:  Bioinformatics       Date:  2017-11-01       Impact factor: 6.937

10.  Adversarial deconfounding autoencoder for learning robust gene expression embeddings.

Authors:  Ayse B Dincer; Joseph D Janizek; Su-In Lee
Journal:  Bioinformatics       Date:  2020-12-30       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.