
Statistical analysis of unlabeled point sets: comparing molecules in chemoinformatics.

Ian L Dryden, Jonathan D Hirst, James L Melville.

Abstract

We consider Bayesian methodology for comparing two or more unlabeled point sets. Application of the technique to a set of steroid molecules illustrates its potential utility involving the comparison of molecules in chemoinformatics and bioinformatics. We initially match a pair of molecules, where one molecule is regarded as random and the other fixed. A type of mixture model is proposed for the point set coordinates, and the parameters of the distribution are a labeling matrix (indicating which pairs of points match) and a concentration parameter. An important property of the likelihood is that it is invariant under rotations and translations of the data. Bayesian inference for the parameters is carried out using Markov chain Monte Carlo simulation, and it is demonstrated that the procedure works well on the steroid data. The posterior distribution is difficult to simulate from, due to multiple local modes, and we also use additional data (partial charges on atoms) to help with this task. An approximation is considered for speeding up the simulation algorithm, and the approximating fast algorithm leads to essentially identical inference to that under the exact method for our data. Extensions to multiple molecule alignment are also introduced, and an algorithm is described which also works well on the steroid data set. After all the steroid molecules have been matched, exploratory data analysis is carried out to examine which molecules are similar. Also, further Bayesian inference for the multiple alignment problem is considered.
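The invariance property mentioned in the abstract can be illustrated with a small numerical sketch. The code below is not the authors' mixture model or their MCMC sampler. It only assumes that, for a fixed labeling (a list of matched index pairs), the fit between two point sets is measured by the residual sum of squares after the best rigid-body superposition, computed here with the standard SVD-based (Kabsch) solution. The names `matched_rss` and `rotation_z` are hypothetical helpers introduced for illustration.

```python
import numpy as np


def matched_rss(x, y, pairs):
    """Minimal sum of squared distances between matched points after the
    best rotation and translation of y onto x.

    x, y  : arrays of shape (n, 3) and (m, 3) with point coordinates
    pairs : list of (i, j) meaning x[i] is matched to y[j]
    """
    idx_x = [i for i, _ in pairs]
    idx_y = [j for _, j in pairs]
    # Centering the matched points removes the effect of translation.
    a = x[idx_x] - x[idx_x].mean(axis=0)
    b = y[idx_y] - y[idx_y].mean(axis=0)
    # Singular values of the cross-covariance give the optimal rotation fit.
    u, s, vt = np.linalg.svd(b.T @ a)
    # Exclude reflections: flip the smallest singular value if required.
    s[-1] *= np.sign(np.linalg.det(u @ vt))
    rss = (a ** 2).sum() + (b ** 2).sum() - 2.0 * s.sum()
    return max(0.0, float(rss))


def rotation_z(theta):
    """Rotation matrix for an angle theta (radians) about the z axis."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(6, 3))
    y = x + 0.1 * rng.normal(size=(6, 3))      # noisy copy of x
    pairs = [(i, i) for i in range(6)]         # assumed labeling
    y_moved = y @ rotation_z(0.7).T + np.array([1.0, -2.0, 0.5])
    # The two values agree up to floating-point error.
    print(matched_rss(x, y, pairs), matched_rss(x, y_moved, pairs))
```

Because the score depends only on the centered, optimally rotated coordinates, rigidly moving either point set leaves it unchanged, which is the kind of invariance the abstract attributes to the likelihood. A sampler over labelings would compare such scores across candidate matchings; that part is not sketched here.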


Year:  2007        PMID: 17447950     DOI: 10.1111/j.1541-0420.2006.00622.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  4 in total

1.  Bayesian protein structure alignment.

Authors:  Abel Rodriguez; Scott C Schmidler
Journal:  Ann Appl Stat       Date:  2014-12-19       Impact factor: 2.083

2.  Bayesian alignment of similarity shapes.

Authors:  Kanti V Mardia; Christopher J Fallaize; Stuart Barber; Richard M Jackson; Douglas L Theobald
Journal:  Ann Appl Stat       Date:  2013       Impact factor: 2.083

3.  Scaling of form and function in the xenarthran femur: a 100-fold increase in body mass is mitigated by repositioning of the third trochanter.

Authors:  Nick Milne; Paul O'Higgins
Journal:  Proc Biol Sci       Date:  2012-06-06       Impact factor: 5.349

4.  Efficient representation of uncertainty in multiple sequence alignments using directed acyclic graphs.

Authors:  Joseph L Herman; Ádám Novák; Rune Lyngsø; Adrienn Szabó; István Miklós; Jotun Hein
Journal:  BMC Bioinformatics       Date:  2015-04-01       Impact factor: 3.169

