| Literature DB >> 25247055 |
Bo Shuang1, David Cooper1, J Nick Taylor2, Lydia Kisley1, Jixin Chen1, Wenxiao Wang3, Chun Biu Li2, Tamiki Komatsuzaki2, Christy F Landes4.
Abstract
We introduce a step transition and state identification (STaSI) method for piecewise constant single-molecule data with a newly derived minimum description length equation as the objective function. We detect the step transitions using the Student's t test and group the segments into states by hierarchical clustering. The optimum number of states is determined based on the minimum description length equation. This method provides comprehensive, objective analysis of multiple traces requiring few user inputs about the underlying physical models and is faster and more precise in determining the number of states than established and cutting-edge methods for single-molecule data analysis. Perhaps most importantly, the method does not require either time-tagged photon counting or photon counting in general and thus can be applied to a broad range of experimental setups and analytes.Entities:
Year: 2014 PMID: 25247055 PMCID: PMC4167035 DOI: 10.1021/jz501435p
Source DB: PubMed Journal: J Phys Chem Lett ISSN: 1948-7185 Impact factor: 6.475
Figure 3Performance of STaSI using simulated five FRET states traces with fast dynamics. Only the first 200 (out of about 15 000) bin time (corresponding to 2000 sampling time for raw data in panel a) data points are shown for illustration. (a) Simulated raw data analyzed by STaSI and vbFRET. (b) Corresponding histograms of the STaSI fit, vbFRET fit, and the true states for raw data. (c) Simulated ten-point binned data analyzed by STaSI and vbFRET. (d) Corresponding histograms of the STaSI fit, vbFRET fit, and the true states for binned data.
Figure 1Demonstration of STaSI using a simulated three-state FRET efficiency trace with added Gaussian noise. (a) Recursive process to detect step transitions using the Student’s t test. The step transition identified in each recursion is highlighted by the black arrows. The number of segments is indicated in the upper-left corners. (b) The iterative method to group the identified segments into states begins from the final result of step detection process and continues until only a single state remains. The merged segments from five to four states and from four to three states are highlighted by the black arrows. The number of assumed states is indicated in the upper-left corners. (c) The calculated MDL value for each state set. Clearly, the three-state set is the optimum number of states, with the global minimum MDL value. (d) The determined three-state fit (red) compared with true states (blue).
Figure 2Comparison between the L1 norm and the L2 norm for data with different noise levels and mean lifetime of the states. The horizontal axis labels the four different noise levels, and the vertical axis labels the five different mean lifetimes of the states. The simulation uses five FRET states: 0.2, 0.25, 0.35, 0.5, and 0.7; a sampling time of 1 ms; and a binning time of 10 ms. Under each condition, 100 simulations are repeated. The different colors represent the success rates of correctly identifying the number of states. (a) Using the L1 norm analyzing raw data. (b) Using the L2 norm analyzing raw data. (c) Using the L1 norm analyzing binned data. (d) Using the L2 norm analyzing binned data.