Literature DB >> 26096134

Agglomerative joint clustering of metabolic data with spike at zero: A Bayesian perspective.

Vahid Partovi Nia1, Mostafa Ghannad-Rezaie2.   

Abstract

In many biological applications, for example high-dimensional metabolic data, the measurements consist of several continuous measurements of subjects or tissues over multiple attributes or metabolites. Measurement values are put in a matrix with subjects in rows and attributes in columns. The analysis of such data requires grouping subjects and attributes to provide a primitive guide toward data modeling. A common approach is to group subjects and attributes separately, and construct a two-dimensional dendrogram tree, once on rows and then on columns. This simple approach provides a grouping visualization through two separate trees, which is difficult to interpret jointly. When a joint grouping of rows and columns is of interest, it is more natural to partition the data matrix directly. Our suggestion is to build a dendrogram on the matrix directly, thus generalizing the two-dimensional dendrogram tree to a three-dimensional forest. The contribution of this research to the statistical analysis of metabolic data is threefold. First, a novel spike-and-slab model in various hierarchies is proposed to identify discriminant rows and columns. Second, an agglomerative approach is suggested to organize joint clusters. Third, a new visualization tool is invented to demonstrate the collection of joint clusters. The new method is motivated over gas chromatography mass spectrometry (GCMS) metabolic data, but can be applied to other continuous measurements with spike at zero property.
© 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Keywords:  Agglomerative clustering; Bayesian clustering; Dendrogram; Metabolic data; Spike-and-slab model

Mesh:

Year:  2015        PMID: 26096134     DOI: 10.1002/bimj.201400110

Source DB:  PubMed          Journal:  Biom J        ISSN: 0323-3847            Impact factor:   2.207


  2 in total

1.  Group spike-and-slab lasso generalized linear models for disease prediction and associated genes detection by incorporating pathway information.

Authors:  Zaixiang Tang; Yueping Shen; Yan Li; Xinyan Zhang; Jia Wen; Chen'ao Qian; Wenzhuo Zhuang; Xinghua Shi; Nengjun Yi
Journal:  Bioinformatics       Date:  2018-03-15       Impact factor: 6.937

2.  The Spike-and-Slab Lasso Generalized Linear Models for Prediction and Associated Genes Detection.

Authors:  Zaixiang Tang; Yueping Shen; Xinyan Zhang; Nengjun Yi
Journal:  Genetics       Date:  2016-10-31       Impact factor: 4.562

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.