Literature DB >> 19040366

A Bayesian nonparametric approach for comparing clustering structures in EST libraries.

Antonio Lijoi1, Ramsés H Mena, Igor Prünster.   

Abstract

Inference for Expressed Sequence Tags (ESTs) data is considered. We focus on evaluating the redundancy of a cDNA library and, more importantly, on comparing different libraries on the basis of their clustering structure. The numerical results we achieve allow us to assess the effect of an error correction procedure for EST data and to study the compatibility of single EST libraries with respect to merged ones. The proposed method is based on a Bayesian nonparametric approach that allows to understand the clustering mechanism that generates the observed data. As specific nonparametric model we use the two parameter Poisson-Dirichlet (PD) process. The PD process represents a tractable nonparametric prior which is a natural candidate for modeling data arising from discrete distributions. It allows prediction and testing in order to analyze the clustering structure featured by the data. We show how a full Bayesian analysis can be performed and describe the corresponding computational algorithm.

Entities:  

Mesh:

Year:  2008        PMID: 19040366     DOI: 10.1089/cmb.2008.0043

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  2 in total

1.  Sparse graphs using exchangeable random measures.

Authors:  François Caron; Emily B Fox
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2017-09-23       Impact factor: 4.488

2.  A Bayesian Semi-parametric Approach for the Differential Analysis of Sequence Counts Data.

Authors:  Michele Guindani; Nuno Sepúlveda; Carlos Daniel Paulino; Peter Müller
Journal:  J R Stat Soc Ser C Appl Stat       Date:  2014-04       Impact factor: 1.864

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.