| Literature DB >> 26924916 |
A Emre Onuk1, Murat Akcakaya2, Jaydeep P Bardhan3, Deniz Erdogmus1, Dana H Brooks1, Lee Makowski4.
Abstract
In this paper, we describe a model for maximum likelihood estimation (MLE) of the relative abundances of different conformations of a protein in a heterogeneous mixture from small angle X-ray scattering (SAXS) intensities. To consider cases where the solution includes intermediate or unknown conformations, we develop a subset selection method based on k-means clustering and the Cramér-Rao bound on the mixture coefficient estimation error to find a sparse basis set that represents the space spanned by the measured SAXS intensities of the known conformations of a protein. Then, using the selected basis set and the assumptions on the model for the intensity measurements, we show that the MLE model can be expressed as a constrained convex optimization problem. Employing the adenylate kinase (ADK) protein and its known conformations as an example, and using Monte Carlo simulations, we demonstrate the performance of the proposed estimation scheme. Here, although we use 45 crystallographically determined experimental structures and we could generate many more using, for instance, molecular dynamics calculations, the clustering technique indicates that the data cannot support the determination of relative abundances for more than 5 conformations. The estimation of this maximum number of conformations is intrinsic to the methodology we have used here.Entities:
Year: 2015 PMID: 26924916 PMCID: PMC4767180 DOI: 10.1109/TSP.2015.2455515
Source DB: PubMed Journal: IEEE Trans Signal Process ISSN: 1053-587X Impact factor: 4.931