Literature DB >> 34276862

A Python Toolbox for Unbiased Statistical Analysis of Fluorescence Intermittency of Multilevel Emitters.

Isabelle M Palstra^1,2, A Femius Koenderink².

Abstract

We report on a Python toolbox for unbiased statistical analysis of fluorescence intermittency properties of single emitters. Intermittency, that is, step-wise temporal variations in the instantaneous emission intensity and fluorescence decay rate properties, is common to organic fluorophores, II-VI quantum dots, and perovskite quantum dots alike. Unbiased statistical analysis of intermittency switching time distributions, involved levels, and lifetimes are important to avoid interpretation artifacts. This work provides an implementation of Bayesian changepoint analysis and level clustering applicable to time-tagged single-photon detection data of single emitters that can be applied to real experimental data and as a tool to verify the ramifications of hypothesized mechanistic intermittency models. We provide a detailed Monte Carlo analysis to illustrate these statistics tools and to benchmark the extent to which conclusions can be drawn on the photophysics of highly complex systems, such as perovskite quantum dots that switch between a plethora of states instead of just two.

Entities: Chemical Disease Species

Year: 2021 PMID： 34276862 PMCID： PMC8282189 DOI： 10.1021/acs.jpcc.1c01670

Source DB: PubMed Journal: J Phys Chem C Nanomater Interfaces ISSN： 1932-7447 Impact factor: 4.126

Introduction

Since the seminal first observation of single molecule emitters in fluorescence microscopy three decades ago,[1] single quantum emitter photophysics has taken center stage in a large body of research. On the one hand, single quantum emitters as single photon sources[2] are held to be an essential part of quantum communication networks and are deemed essential for building optically addressed and cavity-quantum electrodynamics-based quantum computing nodes.[3] This has particularly spurred research in III–V semiconductor quantum dots,[4,5] color centers in diamond, silicon carbide and 2D materials,[6−8] and organic molecules at low temperatures.[9] On the other hand, classical applications of ensembles of emitters for displays, lighting, lasers, and as microscopy tags drive the continuous development of new types of emitters, such as II–VI self-assembled quantum dots 20 years ago[10−12] and inorganic perovskite quantum dots just recently.[13−20] For all these systems, understanding the photophysics on the single emitter level is instrumental, whether the intended use is at the single or ensemble level. A common challenge for almost all types of emitters is that they exhibit intermittency, also known as blinking.[21,22] Under constant pumping, emitters switch, seemingly at random, between brighter and dimmer states, often corresponding with higher and low quantum yields and different fluorescence decay rates. Frequently, the switching behavior also shows peculiar, power-law distributed, random distributions of durations of events. Determining the mechanisms through which emitters blink, that is, the origin of the involved states, the power-law distribution of residence times, and the cause of switching, have been the topic of a large number of studies particularly for II–VI quantum dots as recently reviewed by Efros and Nesbitt.[23] Recent studies on inorganic perovskite quantum dots uncover intermittency behavior that does not fit common models for intermittency in their II–VI counterparts.[14−20] Schematic overview of the working of the toolbox. (A) Illustration of the two methods available to obtain TCSPC data either by photoluminescence TCSPC measurements of a single emitter (top panel) or through simulation of a single emitter (bottom panel). The latter is provided in the toolbox. Both will result in a stream of time stamps that can then be further analyzed by the toolkit. The simulation part of the toolbox simulates dots of m0 levels, with associated count rates and fluorescence decay rates I and γ, with the dot visiting levels in random order, and with residence times for each segment chosen drawn according to specified power law exponents α. The output simulated data consist of time-stamped photon arrivals over a total time span T, where for each photon k = 1..., a time stamp s is recorded. These time stamps are randomly distributed over two detector channels SA, SB. The delays of each of the photon time stamps relative to the time stamp in the third channel SR, representing the periodic pump laser pulse train, is chosen in accordance with the set emitter decay rate. (B) Starting out with this stream of photon events, using CPA (C), the changepoints are found and with these, the instantaneous intensities. (D) Subsequently, these events are grouped in order to find the most likely underlying intensity levels between the behavior. Following this, a number of analyses can be done, such as (E) ascertaining whether the time between switching events is power law-distributed, (F) the visible separation of states in FDIDs/FLIDs, and (G) the presence of memory in the switching behavior. Here, we show an example of a simulated, four-level emitter, with simulation parameters chosen for clarity of illustration. In order to quantify intermittent behavior, the simplest and most commonly employed method is to subdivide a measurement stream of individual photon-arrival times into short bins of a few milliseconds to calculate the intensity (in counts/second) of each bin. Every bin can then be assigned to a state (on, off, or gray) according to its brightness so that on/off times as well as intensity levels can be defined and analyzed.[21] For pulsed laser excitation, also quasi-instantaneous fluorescence decay rates can be obtained.[24,25] However, it is well known that this method of binning time streams and histogramming binned intensities causes detrimental artifacts.[21,26−32] Retrieved parameters of the quantum dot behavior often exhibit a dependency on the choice of the bin width, which affect estimates of switching time distributions and power laws and also the objective assignment of intensities to intrinsic levels. Narrower bin widths in principle allow better resolution but run into shot noise limits, while conversely choosing larger bins suppresses noise but will render the analysis blind to fast events. To overcome these issues, Watkins and Yang[26] proposed changepoint analysis (CPA) as a Bayesian statistics approach for the unbiased determination of switching times that is optimal, that is, gives the best performance given the constraints of shot noise in the data. CPA and clustering are examples of Bayesian inference methods to determine the transitions and underlying levels in single photon trajectories. In the domain of high-throughput single-molecule analysis,[33] many methods to process single photon trajectories have appeared, which one can classify as supervised learning methods with a priori model assumptions on one hand and unsupervised approaches on the other hand. The so-called hidden Markov model (HMM) methods[34] that view photon data streams as an experimentally measured output of transitions between hidden transition states are prominent. Bayesian inference can then estimate parameters such as transition probabilities if one a priori postulates the number of levels and the allowed transitions. As this underlying model is a priori often not known, one can apply HMM with different possible models and rank them according to probabilistic criteria, such as the Bayesian information criterion (BIC). Also, the requirement for a priori known models is relaxed in so-called aggregated Markov models[35] and non-Markov memory kernel models.[36] Juxtaposed to such supervised analysis methods are unsupervised approaches. Such methods apply CPA to partition data into time segments between jumps and subsequent clustering of intensity levels. The CPA method pioneered by Watkins and Yang is essentially such a combination of changepoint detection and hierarchical agglomerative clustering using the BIC to determine the best clustering of measured intensities in distinct levels (states) with as sole assumption that in each segment of time wherein the emitter is in a given level, the counts are Poisson distributed.[26,29] A main drawback is that particularly the clustering is slow. We refer to ref (37) for recent developments in machine learning to mitigate this problem. A main advantage of CPA is that no underlying model is required and that the data are segmented and clustered to the level that the data allow, given that the data is Poisson-distributed in intensity and given a required confidence level stipulated by the user. Variations on CP for other types of noise, such as Gaussian noise, have also appeared.[38,39] Despite the well-documented superior performance over binning of photon counting data, in the domain of single photon counting data from quantum dots, only very few groups have adopted these methods.[17,26,28,29,32,40,41] In this paper, we provide, benchmark, and document a Python toolbox for CPA, state clustering, and analysis of fluorescence–intensity–decay rate correlations (Figure ) that is posted on GitHub.[42] A main motivation lies in the emergence of new quantum emitter systems with complex photophysics. While II–VI quantum dots for which CPA was originally developed are generally understood to switch between just two or three states, the problem of accurate analysis of intermittency is gaining in prominence with the advent of novel emitters such as perovskite quantum dots, which appear to switch not between just two but instead a multitude of states.[14−20] There is hence a large need for a toolbox that provides unbiased, model-free analysis of photon counting data, for which reason we provide a CPA implementation and benchmark it for complex multilevel emitters. Our toolbox is both applicable to real quantum dot data and valuable as a testbed for both testing models and analysis techniques on synthetic, that is, numerically generated, data. Indeed, the code toolbox we supply includes a code to generate numerically random photon time arrival data streams for “synthetic” quantum dots that jump between an arbitrary set of intensity levels and decay rates, with jump time statistics and photon budgets that can be set by the user. As results, we provide benchmarks on the performance of CPA for detecting changepoints in the function of the number of intensity levels and total photon budget, and we explore the limits to the number of distinct states that the clustering analysis can reliably separate. Moreover, the toolbox allows to us test the accuracy of jump time statistic such as power law statistics, for such multilevel dots. Finally, the test suite also allows to benchmark the accuracy of fluorescence decay model fitting with maximum likelihood estimation, and we discuss the construction of fluorescence–intensity–decay rate correlations from CPA-partitioned data. The applicability of the toolbox to experimental data is illustrated by the supplied data sets, which correspond to a related paper in this journal.[43] We note that some other approaches such as HMM methods may be more suited for processes where more knowledge on the underlying physical processes is available. In contrast, our toolbox is ideal for cases in which one wants to make no a priori assumption on the physical mechanism behind intermittency. Furthermore, the CPA method in the toolbox operates on the finest level of information available in photon counting, that is, on the distribution of individual photon arrival times, as opposed to methods that are optimized to work on camera frame data.[33] This paper is structured as follows. In the Methods section, we summarize the Bayesian statistics tools we implemented to analyze all aspects of our data. Next, we benchmark the performance of CPA to pinpoint intensity jumps and of level clustering to identify the number of levels between which a dot switches on the basis of Monte Carlo simulations. Next, we present considerations on the dependence of on–off time distributions, decay rate fit, and so-called “fluorescence decay rate intensity diagrams” (FDIDs) on count rates.

Figure 1

Schematic overview of the working of the toolbox. (A) Illustration of the two methods available to obtain TCSPC data either by photoluminescence TCSPC measurements of a single emitter (top panel) or through simulation of a single emitter (bottom panel). The latter is provided in the toolbox. Both will result in a stream of time stamps that can then be further analyzed by the toolkit. The simulation part of the toolbox simulates dots of m0 levels, with associated count rates and fluorescence decay rates I and γ, with the dot visiting levels in random order, and with residence times for each segment chosen drawn according to specified power law exponents α. The output simulated data consist of time-stamped photon arrivals over a total time span T, where for each photon k = 1..., a time stamp s is recorded. These time stamps are randomly distributed over two detector channels SA, SB. The delays of each of the photon time stamps relative to the time stamp in the third channel SR, representing the periodic pump laser pulse train, is chosen in accordance with the set emitter decay rate. (B) Starting out with this stream of photon events, using CPA (C), the changepoints are found and with these, the instantaneous intensities. (D) Subsequently, these events are grouped in order to find the most likely underlying intensity levels between the behavior. Following this, a number of analyses can be done, such as (E) ascertaining whether the time between switching events is power law-distributed, (F) the visible separation of states in FDIDs/FLIDs, and (G) the presence of memory in the switching behavior. Here, we show an example of a simulated, four-level emitter, with simulation parameters chosen for clarity of illustration.

Methods

In this section, we present all the methods implemented in our Python toolbox as well as the methods for benchmarking them. Benchmark results are presented in the Results section. We refer to the Supporting Information for a manual to the code and the code itself.

Changepoint Analysis

First, we summarize CPA, a Bayesian statistics method for the unbiased determination of jumps or “changepoints” in time traces of discrete events.[17,26,28,29,32,39,41,44−48] Bayesian statistics is a paradigm that reverses the usual standpoint of probability theory. Usual probability theory views a data set as a random draw from a probability distribution, given a hypothesis on the parameters of the underlying physical process. In this framework, one can calculate the likelihood of drawing the specific measured data set. Bayesian statistics, on the other hand, compares the likelihood of distinct hypotheses, given a measured data set and assumptions on the underlying measurement noise. We consider time-tagged single photon counting data consisting of an ordered list of measured photon arrival times s, collected over a measurement time T. For a single emitter with no memory that emits at a count rate of N photons in a time T, the waiting times—that is, the times between photon arrivals—are exponentially distributed with waiting time τw = T/N. In order to determine whether there is a changepoint in some segment q, CPA compares the likelihood of two distinct hypotheses (1) there is a jump in emission intensity (i.e., the average waiting time τw jumping from some value to another) against (2) there is the same intensity throughout the measurement interval. When testing for a jump at a photon detection event k at time s in this trajectory q with time duration T containing N photon events, this leads to a log-likelihood ratio, or “Bayes factor””[26,28,29]where V = s/T. Derivation of this log-likelihood ratio involves several steps. First, it incorporates the assumption that in between jumps, the waiting time between photons is exponentially distributed, on the basis of which one can assess the likelihood of measuring the given data set for a given hypothesis on the exponential waiting time τw. Second, it uses maximally non-informative priors for to compare the hypothesis of presence versus absence of a changepoint without further restrictive assumptions on the involved intensity levels. It should be noted that there are other ways to arrive at the same log-likelihood ratio test. One alternative starting point is a binary time series in which there is an underlying uniform and small probability distribution of photon detection per bin [e.g., imagining the time axis binned in by the timing card resolution (of order 0.1 ns for typical hardware)].[40] Such a uniform distribution would emerge as a direct consequence of exponential waiting time distributions. In this case, one should start from a binomial distribution and ultimately arrive at the same formula after application of Stirling’s formula. Another starting point is CPA applied to binned data with wider bins with multiple counts, that is, to series of Poisson-distributed intensities instead of discrete events.[26,29] However, the binning would introduce an undesirable time scale through the chosen bin width. Of these three methods, working with photon arrival times is the most data-efficient approach and introduces no artificial partitioning whatsoever. We refer to ref (29) for a derivation of the log-likelihood ratio in all these three scenarios, which includes a precise description of the use of maximally non-informative priors. Following Watkins and Yang[26] and Ensign,[29] the most likely location of a changepoint, if any, is at the k that maximizes the Bayes factor . The hypothesis that this most likely changepoint is indeed a real event is accepted if exceeds a critical threshold value for or “skepticism”. This value is chosen to balance false positives against missed events. A full data set is partitioned recursively, that is, by recursively checking if data sets between two accepted changepoints themselves contain further changepoints. This results in a division of the data set into segments, each of which starts and ends at an accepted changepoint and with the level of skepticism as the stop criterion for the recursion. The resulting segmentation provides the most likely description of data as consisting of segments within which the intensity is constant, given the value chosen for the degree of “skepticism” and given the amount of data collected. Since the algorithm works with the list of individual photon arrival times, this segmentation entails no arbitrary partitioning. An accepted rule of thumb is that if the Bayes factor exceeds a “skepticism” value of just between 1 and 3, the evidence for a changepoint is highly ambiguous, whereas values in the range 7–10 are deemed strong evidence. The toolbox is supplied with a default value of skepticism of 8, set following the analysis of refs (26) and (29). The reader is warned that for a given photophysics scenario (intensity levels, segment duration statistics), it is advisable to set the level of “skepticism” on the basis of simulations in order to optimize the trade-off between missing changepoints altogether (false negatives) and precision (avoiding false positives). Our Result section provides an example of such an optimization.

Clustering

CPA splits the data in segments separated by jumps (a list of Q jumps delineate Q – 1 segments). One can now ask what the statistical properties are of the segmentation, that is, what the statistics is of the length of segments, the intensity levels most likely corresponding to the segments, and the fluorescence decay times associated with the segments. For instance, it is a nontrivial question how many distinct constant intensity levels or states mr actually underlie the N – 1 found segments, with intensities I1...I. To answer this question, Watkins and Yang[26] proposed a clustering approach. The recent work of Li and Yang[39] provides a detailed explanation of the reasoning involved, though quoting results for Gaussian instead of Poissonian distributed data. The idea is that with the Q – 1 found segments, each with their associated recorded intensities I, one can use expectation maximization to calculate for a hypothesized and fixed number of levels nG what the most likely underlying intensity levels are (with m ∈ 1...nG) and how probable it is that each segment is ascribed to a given level (probability p). Subsequently, Bayesian inference is used to establish the most likely number of levels (i.e., states) mr and associated intensities , with m ∈ {1...mr} describing the data. Following refs (26) and (39), the expectation minimization in our toolbox is implemented as an interactive algorithm started by a first guess of the segmentation. This guess is obtained by a hierarchical clustering of Q – 1 segments in m = 1, 2, ..., Q – 1 levels that proceeds recursively. In each step, it identifies the two segments in the list with the most similar intensity levels as belonging to the same level. This provides an initial clustering of the measured data in any number m = 1, 2, ..., Q – 1 of levels. For the expectation maximization, the idea is to simultaneously and iteratively optimize the probability p for segment q to belong to the mth level as well as an estimate of the intensities of these levels . In each iteration, the intensities of all levels are estimated from the level assignment from p. Following this, the probability distribution p is updated to redistribute the segments over the levels. In this calculation, it is important to understand the type of noise statistics the data obey. In the case of single-photon measurements and for the purpose of this discussion, the intensities are Poisson-distributed. The iteration is repeated until p converges (practically also capped by a maximum number of iterations). The final outcome is a most likely assignment of the measured segments into nG levels. Next, for each value of nG one assesses the BIC. This criterion is a measure for how good the description of the segmented intensity trace is with nG intensity levels, given the assumption of Poisson counting statistics for each fixed intensity level. Beyond a mere “goodness of fit” metric that would simply improve with improved number of parameters available to describe the data, this metric is penalized for the number of parameters to avoid overfitting. For Poisson-distributed data, the criterion is derived in ref (26) aswhere Q again is the number of changepoints detected and nG is the number of available levels. The term is the log-likelihood function optimized in the expectation maximization step, that is, with the Poisson probability function at mean λ, p the probability of drawing level m. The second term in the BIC is the term penalizing the BIC for overfitting. The accepted best description of an emitter in nG-levels is taken to be the value of nG where the BIC peaks.

Intensity Cross-/Autocorrelation and Maximum Likelihood Lifetime Fitting

Many single photon counting experiments are set up with pulsed laser excitation for fluorescence decay rate measurements and with multiple detectors to collect intensity autocorrelations (e.g., to verify antibunching in g(2)(τ) for time intervals τ comparable to the fluorescence decay rate and shorter than the commonly longer detector dead time). In a typical absolute-time tagging setup, this results in multiple data streams SA, SB, SR of time stamps corresponding to the detection events on each detector and the concomitant laser pulses that created them, respectively. Our Python toolbox contains an implementation of the correlation algorithm of Wahl et al.[49] that operates on time stamp series, and returns, for any combination of channels S1, S2 (1, 2 ∈ {A, B, R}), the cross-correlation C(τ)Δτ, that is, the number of events in the time series S1 and the time series S2 that coincided when shifted over τ, within a precision Δτ. Cross-correlating detected photons and laser arrival times, taking Δτ to be the binning precision of the counting electronics and the range of τ equal to the laser pulse repetition rate, returns a histogram of the delay times between photon detection events and laser pulses. To obtain g(2)(τ) to investigate antibunching, streams of photon events from two detectors in a Hanbury–Brown Twiss setup are cross-correlated. Δτ is taken to be the binning precision of the counting electronics and the sampled range of τ as an interval is taken symmetrically around τ = 0 and several times the laser pulse interval. Finally auto- or cross-correlating detector streams over τ-ranges from nanoseconds to seconds, coarsening both τ and Δτ to obtain equidistant sampling on a logarithmic time axis, results in long-time intensity autocorrelations of use in intermittency analysis.[31] Our toolbox also provides this logarithmic time-step coarsening version of the correlation algorithm of Wahl et al.[49] Of particular interest for intermittent single emitters is the analysis of fluorescence decay rates in short segments of data as identified by CPA, which may be so short as to contain only 20 to 1000 photons. For each of the photon detection events in a single CPA segment, cross-correlation with the laser pulse train yields a histogram of the N photons in segment q. In each of the bins (with width Δτ), the photon counts are expected to be Poisson-distributed. Therefore, the optimum fit procedure to extract decay rates employs the maximum likelihood estimate procedure for Poisson-distributed data, as described by Bajzer et al.[50] In brief, for a decay trace sampled at time points τ relative to the laser excitation, with counts per bin D(τ), the merit function reads Assuming a chosen fit function F(t), the parameter set A that minimizes this merit function provides the parameter values that most likely correspond to the data. The estimated errors in these parameters then follow from the diagonal elements of the inverse of the Hessian of M relative to the parameters A. Importantly, the fact that the Poisson distribution is tied to absolute numbers of counts implies that this approach requires that the data are neither scaled nor background-subtracted. Instead, the background should be part of the fit function either as a free parameter or as a known constant. Furthermore, it should be noted that time bins with zero counts are as informative to the fit as non-empty ones and should not be left out.

Generating Synthetic Quantum Dot Data

To benchmark the CPA and clustering method and to test its limits, our toolbox provides an example routine to generate artificial data mimicking quantum dot intermittency. To obtain mimicked quantum dot data, we first choose a number m0 of intensity levels between which we assume the dot to switch. Next, we generate switching times for each of the states. In this work, we choose all switching times from a power law distribution. For benchmark purposes, we will present results with power law exponent α = 1.5, though any exponent can be set in the code. On the assumption that intensity levels appear in a random and uncorrelated order, this segments the time axis in a list of switching events T0,, j = 1, 2, ..., m0, where for each segment, we randomly assign one of the nominal intensities . Next, to mimic a pulsed excitation experiment, we imagine each of these segments to be subdivided in intervals of length τL equivalent to a laser repetition rate (τL = 100 ns in the examples in this work). We assign each of these intervals to be populated with one photon at probability . This ensures that the number of photons N in every segment is drawn from a Poisson distribution at mean . By removal of all empty bins, the binary list is translated into a list S = (t1, t2, t3, ...) of photon arrival time stamps at resolution τL to which one can directly apply CPA to attempt a retrieval of switching times and apply clustering to retrieve the number of states. To also enable fluorescence decay rate analysis, we further refine the photon arrival time list. Recalling that we have generated switching events T0, between intensity states , we now also assume fluorescence decay rates γ0,. As each segment q was already chosen to correspond to some level m, we now impose the decay rate γ0, on the photon arrival times. To do so, for each of the photon events k = 1...N already generated at resolution τL, we now randomly draw a delay time Δ relative to its exciting laser from an exponential distribution characterized by rate γ0,. To mimic the behavior of typical TCSPC counting equipment, the delay time is discretized at a finite time resolution Δτ [in this work, it was chosen as 165 ps to match the hardware in the provided example experimental data measured in our lab (Becker & Hickl DPC-230), though of course in the toolbox, the value can be set to match that of any time-correlated single photon counting (TCSPC) card vendor]. Testing of fluorescence decay trace fitting can operate directly on the generated list of delay times, or alternatively, one can synthesize a TCSPC experiment by re-assigning S to represent laser-pulse arrival times SR and defining photon arrival times as the events in SA, SB each shifted by its delay time, that is, SX = (t1 + Δ1, t1 + Δ2, ...), with X ∈ A, B. Cross-correlation of SR and SX returns the delay time list. We note that although our work does not focus on antibunching, our quantum dot simulation routine provides data distributed over two detector channels, where emission events antibunch, while an uncorrelated background noise level of the detectors can also be set.

Practical Implementation

We have implemented the toolbox ingredients in Python 3.8. As time stamp data can be substantial in size, we use the “parquet” binary format to store time stamps as 64-bit integers. Processing and plotting the data is dependent on Pythons’ standard libraries numpy, matplotlib, while we use Numba, a just-in-time compiler, to accelerate the time stamp correlation algorithms. An example script to generate synthetic data and to run the entire workflow on simulated data is provided. We refer to the Supporting Information for a guide to the practical implementation and use of our toolbox. The toolbox comes also with example experimental data on single CsPbBr3 quantum dots from a recent experiment.[43]Table lists scaling and performance metrics for the algorithms contained in the toolbox.

Table 1

Algorithm Scaling and Computation Time as a Function of Number of Photons N and Changepoints Qa

algorithm	scaling	timing
CPA		ca. 10 s for N = 10⁶
grouping	initial clustering dominates over iterative algorithm
initializationb		0.5 s for Q = 3 × 10²
iterative optimizationc	overhead dominated	0.75 ms per iteration (Q < 500)
g⁽²⁾d		8 ms per plot point at N = 10⁶
long-time autocorrelatione		ca. 2 s for full plot

For a large Q, this is accelerated by first clustering subsets, merging, and continuing clustering.

Typically, 5 to 10 iterations are required when nG ≈ m.

Numba JIT acceleration assuming int64 provides over 2 orders of magnitude acceleration. A typical g(2) plot has ca. nplot points = 2000 and hence requires 10 to 20 s to evaluate.

Essentially repeating, g(2) and logarithmic coarsening every ncascade points.

Timings were obtained on a standard desktop (Intel I7 4790 at 3.6 GHz, with 16 Gb of DDR3 RAM) and are obtained on the basis of 5 × 100 photon trajectories (100 independent draws for 5 different trajectory record lengths from ca. 104 to 3 × 106 photon events). For a large Q, this is accelerated by first clustering subsets, merging, and continuing clustering. Typically, 5 to 10 iterations are required when nG ≈ m. Numba JIT acceleration assuming int64 provides over 2 orders of magnitude acceleration. A typical g(2) plot has ca. nplot points = 2000 and hence requires 10 to 20 s to evaluate. Essentially repeating, g(2) and logarithmic coarsening every ncascade points.

Results

The remainder of this work is devoted to presenting benchmarks of the provided methods. Benchmarks for emitters with “binary”” switching, that is, two well-separated intensity levels as is typical for II–VI quantum dots, have already been presented in literature.[26,32,40] However, emitters under the current study, such as perovskite quantum dots, appear to have a multitude or perhaps even a continuum of intensity levels. Our tests hence focus on determining the performance of CPA and level clustering for many-level single photon emitters.

Precision of Identifying Individual Changepoints

Figure A,B shows examples of CPA applied to a simulated quantum dot with a single jump in its behavior, from an intensity level of 104 to 103 cts/s resp. from 4.5 × 102 to 2.25 × 102, with ∼900 and ∼150 photons left and right of the changepoint, respectively. Purely for visualization purposes, the data are plotted in a binned format as the analysis itself does not make use of any binning. Alongside the binned intensity trace, we also show the log-likelihood ratio . In both cases, the log-likelihood ratio clearly peaks at or close to the point where there is a changepoint in the data. Since the Bayes factor is actually a logarithmic measure for the comparison of hypotheses, the algorithm indeed identifies the changepoint with high probability and to within just a few photon events, even where the jump is far smaller than the shot noise in the binned representation in the plot at a relative intensity contrast of just a factor of 2. Generally, the probability with which the algorithm identifies or misses the changepoint is dependent on the total number of photons recorded both before and after the changepoint and on the contrast in intensities, consistent with the findings of Watkins, and Ensign.[26]

Figure 2

Demonstration of changepoint detection applied to a synthesized data set with a single changepoint, with equal photon counts before and after the changepoint. In (A,B) the contrast between intensities is a factor of 10 and 2, respectively, while the total photon budget is approximately 2000 and 300. The bottom panels show the log-likelihood ratio test, which clearly peaks at the changepoint in both cases. The y-axis unit cts/ms stands for counts per millisecond. The robustness of the method is demonstrated in (C,D), where we show the likelihood of detecting a changepoint in such a series for different intensity contrasts, and the variance of the found times, respectively, as a function of the total photon numbers. To gather accurate statistics, 104 photon traces were generated for each data point. The data are plotted in a binned fashion (ms bins) for visualization purposes only. To identify the limits of CPA,[29,32] we consider the feasibility of identifying changepoints of contrast I2/I1 as a function of the total number of photons in the time record. The results are shown in Figure C for the likelihood of detecting a changepoint and Figure D for the error in identifying the precise event k at which the changepoint that is identified occurred. Here, we only consider the case where there are roughly an equal number of photon events before and after the changepoint. These data are obtained by simulating 104 switching events of the type as shown in Figure A,B for each contrast and mean photon count shown. The range of contrasts is chosen commensurate with reported on–off contrasts for typical quantum dots in literature, which generally fall in the 1.5- to 5-fold contrast range. At a high intensity contrast, exceeding a factor 5, a total photon count as low as 300 is enough for near-unity detection. Moreover, for sufficiently high photon count left and right of the changepoint, even very small changes in intensity have a high likelihood of being accurately detected, even if in binned data representations, the jump is not visible within the shot noise. Figure D provides a metric for the accuracy within which changepoints are pinpointed. CPA returns the most likely photon event k in which the jump occurred, which in our analysis can be compared to the actual photon event index k0 at which we programmed the Monte Carlo simulation to show a jump. Figure d reports the mean error as a metric of accuracy. At jump contrasts above a factor 2, changepoints are identified to within an accuracy of almost one photon event even with just 102 photon counts in the total event record. At very small contrasts, the error in determining the location of a changepoint is generally on the level of one or two photon events, only worsening when there are fewer than 200 counts. This observation highlights the fact that if the photon record has just a few counts in total, the error in estimating the count rate before and after the jump becomes comparable to the magnitude of the jump.

Intermittency and On–Off Time Histograms

As the next step in our Monte Carlo benchmarking, we turn to time series with many, instead of a single, jumps. Figure A shows a representative example for a simulated intermittent quantum dot with two states, assuming a contrast ratio between states of 2 × 104 and 5 × 103 s–1. We generally observe that the recursive CPA algorithm accurately identifies switching events, barring a number of missed events of very short duration. From the CPA, we retrieve the time duration between switching events. Figure C shows a histogram of time durations, plotted as a probability density function obtained from a whole series of Monte Carlo simulated time traces of varying contrasts between states (see legend). Notably, if we simulate quantum dots that have switching times that are power-law distributed, the retrieved distribution indeed follows the assumed power-law particularly for long times. At shorter times, the histogram remains significantly below the power law, particularly at low intensity ratios between the two assumed states. This indicates that CPA misses fast switching events and is consistent with the observation from Figure C that a minimum photon count is required to observe switching events of a given contrast. As a rule of thumb, usual II–VI colloidal quantum dots have a contrast between dark and bright states of around 5, meaning that of order 200 photons are required to detect a changepoint with near-certainty. At the assumed count rates (2 × 104 s–1 for the bright state), this means one expects CPA to fail for switching times below 10 ms, where the on–off time histogram indeed shows a distinct roll-off. This result suggests that one should interpret on–off time histograms from changepoint detection with care: one can generally rely on the long-time tail but should determine the shortest time scale below which the histogram is meaningless on basis of the intensity levels present in the data.

Figure 3

(A) Typical time trace of a simulated quantum dot. The intensity duty cycle switches between 0.5 × 104 and 2 × 104 counts/s. It shows an on/off input duty cycle generated with a power-law distribution (orange), the duty cycle with Poissonian noise (purple), and the retrieved duty cycle (green). Overall, the original intensities and lifetimes (B) are retrieved well. (C) Histogram of the number of switching events as a function of their duration. Each data point represents 10.000, 10 s power-law distributed time traces. The input shows the initial power-law distribution, the lighter colors show the number of retrieved changepoints, after applying Poisson noise and CPA, at different contrasts, with I1 = 105. counts/s. We can see that even at low contrast, events with long times between switching are retrieved, but each contrast has a characteristic duration below which changepoints cannot be accurately retrieved. This puts a fundamental limit on the information that can be extracted from a given data set. (D) Occupancy diagram illustrating the behavior of the clustering algorithm for a system with m0 = 4 intensity levels. The color scale indicates the amount of time spent in each state m after the assignment of states for a given number of available states nG. We see that when nG > m0, effectively all segments are distributed across only nG ≤ m0 intensity levels.

Error Analysis for Trajectories with Multiple Jumps

The CPA results in Figure C essentially quantify the algorithm performance in terms of the fraction of correctly identified changepoints (true positives) for traces with a single step in intensity as a function of contrast and photon budget. Actual single emitter photon trajectories have a plethora of steps, where CPA is mainly likely to miss short segments because only those changepoints are accepted for which the evidence in the data is compelling relative to the shot noise in it. Indeed, the short-time roll-off in Figure C highlights exactly this tendency of CPA to under-report on closely spaced changepoints (false negative rates high for short segments). The level of skepticism set as a parameter for running CPA sets the overall accuracy of the algorithm, essentially trading off the rates of false positives and false negatives. When using the toolbox for a particular photophysical scenario, the reader is recommended to study the error rates as a function of skepticism. To demonstrate that type of study, here we report on algorithm performance as a function of skepticism using the error metrics accuracy, precision, and recall. To this end, we generate synthetic data and match the list of nominal changepoints and retrieved changepoints to determine the rate TP of true positives, the rate FP (false positives) of detected transitions for which no transition was actually present, and the rate FN of false negatives, in which a true transition is not detected by CPA. The standard definition for the error metrics reads[34,37] The accuracy benchmarks overall performance, whereas precision measures the false positive error rate and recall quantifies the false negative rate. Since, first, changepoint detection is not accurate to the level of a single photon arrival time and, second, the set of stored nominal switching times in our toolbox may fall in between synthesized photon events, such a comparison requires a tolerance range to be meaningful. Figure presents the algorithm performance as a function of the level of skepticism (vertical axis) and as a function of the tolerance range within which changepoints are accepted as true positives, measured in milliseconds. The results are for a more challenging case than a two-level dot, namely, a four-level system with mean count rate 5 × 104 counts per second, four equidistant intensity levels (2, 4, 6, and 8 × 104 counts per second), and power law-distributed segment residence times (exponent 1.5, with shortest residence time of 10 ms in a segment). The presented results are obtained from 200 photon trajectories with on average 5 × 105 photons and 102 changepoints each. If the time axis for the tolerance is chosen as short as the inverse mean count rate, the apparent algorithm precision is low, indicating that changepoints are generally found close to, but not quite at, the moment where the switching event occurs. At tolerances of 2 to 5 ms (containing of order 50–250 photons typically at the given rate and for the various assumed intensity levels), the error rate saturates at above 90%. The accuracy for this example peaks at a skepticism of ca. 7.0 (Figure A). At higher levels of skepticism, the precision increases, that is, the number of false positives reduces further (Figure B). However, this is at the expense of recall, that is, the number of missed changepoints. The false negatives rate decreases only with skepticism lowered to below 10 (Figure C).

Figure 4

Accuracy (A), precision (B), and recall (C) for CPA as a function of tolerance in milliseconds for which nominal and detected changepoints are matched as equal and as a function of the level of “skepticism” which the Bayes factor needs to exceed for a changepoint to be accepted.

Performance of Level Clustering

Next, we consider the performance of the grouping algorithm applied to the segmentation of simulated time traces. For reference, Figure A shows the BIC versus nG for the example of simulated dots with m0 = 2, 3, 4 intensity levels. Generally, the BIC rapidly rises as nG approaches the actual number of levels with which the data were simulated and gently decreases once nG exceeds the actual number of levels in the data m0. The fast rise indicates that within the assumption of Poisson-distributed intensities, the data cannot at all be described by fewer than m0 levels. The slow decrease is due to the penalization of the BIC by the number of fit parameters. Since the BIC criterion actually relates to the logarithm of the probability with which nG states are the appropriate description of the data, even an apparently gentle maximum in BIC actually coincides with an accurate, unique determination of m0.

Figure 5

(A) Examples of the BIC for simulated quantum dots, with m0 = 2, 3, 4. The most likely number of levels is indicated by a small peak in L. In the inset, we show that the distributions indeed peak at their respective m0. It should be noted that the BIC shows a very sharp rise and then, from m0 onward appears almost flat. There is in fact a shallow downward slope. Here, one should keep in mind that the BIC is a logarithmic metric. On a linear scale, the maximum is significant. (B–E) Likelihood P of finding mr levels for a simulated system, given m0 initial intensity levels and ⟨N⟩ the mean number of photon counts. We see that the photon budget plays a defining role in the total number of states that one can reliably resolve. At low photon budgets, the number of levels is systematically underestimated, whereas at high photon budgets, P(m = m0) remains high even at high m0. We see that mr is never overestimated. To gauge the accuracy of the retrieval of the number of states for multi-state quantum dots, we simulated quantum dot data with power law-distributed (α = 1.5) switching events, assuming switching from m̃0 = 2 to 10 equally likely levels, where we assumed intensity levels to be assigned to segments randomly and where we assumed levels for simplicity evenly spaced from dark to bright. Lastly, all segments were reassigned an intensity according to Poisson statistics. In other words, we added shot noise. For many random realizations with different m0 and ⟨N⟩, we determine the most likely number of states mr (BIC(mr) = max(BIC)) according to the clustering algorithm and construct histograms of outcomes. The total photon budget is set by the product of assumed record length and the mean count rates of the different levels. The outcomes of these calculations are shown in Figure B–E, where each panel corresponds to a different photon budget. A plot with only diagonal entries signifies that the number of levels retrieved by the clustering algorithm always corresponds to the number of levels assumed, so mr = m0. At high photon budgets (Figure D,E), the retrieval of the number of states is indeed robust, even for simulated dots that switch between as many as 10 intensity levels. At low total photon budgets (Figure B,C), we see that mr is often underestimated. This signifies that there is high uncertainty due to shot noise in the assigned intensity levels, so that levels cannot be discriminated within the photon budget. It is remarkable that at photon budgets of 106 photons, as many as 10 intensity levels can be robustly discerned even though the smallest contrast between levels is as small as 10% in intensity in view of Figure , where it is evident that detecting changepoints for small intensity jumps is difficult. Here, however, one should realize that in contrast to Figure C where single small-contrast changepoints are studied, here many levels are visited in a random order. Thus, the correct detection of small level differences is not reliant on the detection of small changepoint contrasts but on having sufficient photon statistics to resolve the segment count rates of already identified segments. A secondary metric, additional to the BIC, is in how the clustering algorithm assigns occupancy to the levels. The clustering algorithm assigns to each data segment the most likely intensity level that it was drawn from. Occupancy is a metric for how often each of the nG levels available to the algorithm is actually visited by the measured intensity sequence. We find that if one allows the clustering algorithm to use more levels than originally used to synthesize the quantum dot data (nG > m0), the additional levels take essentially no occupancy. We show this in Figure D for an example of a dot assumed to have four intensity levels with a total photon count of 5 × 105. As soon as additional states (m ≥ 5) are offered to the grouping algorithm, these additional states take no occupancy and do not change the distribution of segments over the states found at the correct m. Thus, we confirm again that the grouping algorithm does not over-estimate the number of states.

Accuracy of Decay Rates versus Photon Budget

Figure A shows examples of fitted simulated data for slow and fast decays as examples of the Monte Carlo simulations we have performed to benchmark the accuracy of decay rate fitting in function of photon budget and decay rate (panel (B)). We find that the error in γ very roughly follows roughly a power law with an exponent of 0.9–1.1, with slower decay rates showing higher errors. Consistent with ref (51), we find by Monte Carlo simulation that one requires approx. 200 (50) counts to obtain an error below 10% (20%) in decay rate if one fits mono-exponential decay with free parameters. A problem intrinsic to the use of CPA is that fast switching events may be missed, leading to an averaging of short time intervals with others. This leads to decay traces that are in fact not attributable to a single exponential decay.

Figure 6

(A) Two examples of decay traces (γ = 0.1, 1 ns–1) with low photon count (N = 300) fitted to a single-exponential decay. (B) Standard error of the fitted decay rate w.r.t. the input decay rate as a function of the total photon count for different decay rates. Each data point is the average of 103 simulated decay traces.

FDID Diagrams

Correlative diagrams that plot correlations between intensity levels and fluorescence lifetime[24,25,32] form a powerful visualization of quantum dot photophysics. Our toolbox contains the code to generate both FDIDs and fluorescence lifetime intensity diagrams (FLIDs). The considerations in this section hold equally for FDID and FLID diagrams, although the provided example is for a FDID analysis. Essential to the construction of FDID/FLID diagrams is that for each detected photon also the delay time to the laser pulse that generated it is known so that decay rates can be fitted even to short segments of a time trace segmented by CPA. Here, we discuss the construction of FDID diagrams derived from CPA, again illustrated by examining simulated data for a quantum dot that switches between two states of distinct intensity and lifetime. FDID diagrams are conventionally constructed from time-binned data, where it is interpreted as a simple histogram in which each time-bin contributes a single histogram count to one single intensity-decay rate bin. It is not trivial to extend this notion to CPA-segmented data since CPA segments intrinsically have very different time durations instead of having equal width as in conventional time-binning. We propose two modifications to the construction of a FDID as a histogram. First, instead of representing FDID entries as a single binary entry in just one histogram bin (one time segment contributes one count to a single pixel in an FDID), we propose to incorporate the uncertainty in intensity and decay rate that is associated with each time segment. To this end, each segment contributes to the FDID diagram according to a 2D Gaussian function centered at the CPA-segment decay rate and intensity (total counts C in segment j divided by segment duration T), where the width is given by the fit error in the decay rate and the shot noise error in the segment . If one would apply this logic to regular time-binned data, giving each Gaussian contributor the same integrated weight, one obtains a diagram similar to a regular FDID histogram except that the results are smooth and with less dependency on a chosen histogram bin width. Instead, the feature size in FDID represents the actual uncertainties in intensity and rate. As a second modification, we propose to reconsider the weights of the Gaussians—that is, the integrated contribution to each entry in the FDID. For time-binned data, one assigns each segment equal weight so that equal lengths of time contribute to equal weight. Since CPA results in segments of unequal length, several choices for constructing FDID diagrams are possible, which to our knowledge have not been discussed in CPA literature. Giving equal weight to each CPA fragment will lead to FDID diagrams from those obtained from binned data since effectively long time segments are then underrepresented compared to short segments. This leads to under-representation of states with steeper power law distributions in their switching times. The direct equivalent to regular FDID-weighting for CPA-segmented data is that a segment of duration T has a weight proportional to T (henceforth “duration-weighted FDIDs”). Alternatively one could argue that since time-averaged intensity and fluorescence decay traces are rather set by the contribution in emitted photons, one could instead use the total number of photons C in each segment as weight (henceforth “count-weighted FDID”). It has been established in a multitude of studies of II–VI quantum dots that the distribution of on–off times follows a power law (exponents αon,off) truncated by an exponential with specific time τl, giving the distribution[21−23]t–α e–. We analyze Monte Carlo-simulated FDIDs to establish if there are conditions of the truncation time under which a two-state quantum dot would appear not as a bimodal distribution. Figure shows duration-weighted FDIDs for simulated quantum dot data. For (A), we consider a two-state quantum dot with power law-distributed switching times. In (B–D), we show a quantum dot simulated with similar parameters but with the maximum duration of the segments Tc = 10, 1, 0.1 s. Evidently, the bi-modal nature of the quantum dot is faithfully represented by the FDID diagram constructed through CPA. This remains true also for power laws with a long time truncation (t–α e–), unless truncation times τl are as short as 20 ms, so that there are no segments with over approximately 102 counts. This limit of our benchmarking space zooms in on the regime where CPA intrinsically fails (Figure C). In this regime, the originally assumed bimodal quantum dot behavior no longer results in a bi-modal FDID. Instead a significant broadening is evident. We can conclude that for most realistic quantum dot systems, CPA-generated FDIDs will not suffer from this artificial broadening artifact.

Figure 7

Four FDID diagrams of a simulated bimodal quantum dot with (I, γ) = (2 × 104 s–1, 0.1 ns–1) and (0.5 × 104 s–1, 0.4 ns–1). In each diagram, we apply a different cutoff time Tc to the simulated power law at ∞, 10, 1, and 0.1 s. We find that a shorter cutoff causes an increasingly strong smearing effect between the two states.

Conclusions

In this work, we have provided a Python toolbox for CPA and for determining the most likely intensity level assignment for intermittent multilevel emitters. We investigated the limits of CPA and clustering as fundamentally set by the photon budget and for the case of many state emitters. We have shown that for long switching times, the typical power law behavior of many quantum emitters can be accurately retrieved. We also show that in the case of many-state emitters, the number of intensity levels can be retrieved with high fidelity, provided the photon count is high enough. At low photon counts, the number of states is systematically underestimated. This shows in which way the photon budget puts a fundamental limit on the amount of information that can be retrieved from a given TCSPC data set. We show that the photon budget also poses a limitation on the accuracy at which the slope of a single-exponential decay can be retrieved. Additionally, we investigate the commonly used intensity-decay time diagrams. We show that with CPA, a two-state simulated quantum dot is well-represented in an FDID, but when a cutoff is introduced to match that commonly found in literature, the states become increasingly poorly defined in an FDID representation. While the Bayesian inference algorithms in this toolbox were reported earlier for application to quantum dots with just two or three intensity states, this toolbox and the provided benchmarks point at the applicability even to emitters that the jump between many closely spaced intensity levels will, in our view, be of large practical use for many workers analyzing the complex photophysics of, for example, perovskite quantum dots. Also, the toolbox can be used for theory development, following a workflow in which hypotheses are cast in synthetic photon counting data, which in turn can be subjected to the CPA suite to assess how hypothesized mechanisms express in observables and how far they are testable. The limit of this testability generally depends on an interplay of total photon budgets, residence time in each level, intensity contrast between levels, and segment durations. For a given photophysics scenario, the user can easily deploy the toolbox to directly assess data segmentation in terms of accuracy, precision, and recall error rates. These rates will depend on the level of skepticism that the user wishes to apply in order to accept assertions regarding the segmentation of data in segments and intensity levels. These error rates, and hence the testability of a hypothesized photophysics scenario, are ultimately limited by the evidence in the counting statistics and not the segmentation algorithm.

32 in total

1. Room Temperature Single-Photon Emission from Individual Perovskite Quantum Dots.

Authors: Young-Shin Park; Shaojun Guo; Nikolay S Makarov; Victor I Klimov
Journal: ACS Nano Date: 2015-09-08 Impact factor: 15.881

2. Fast calculation of fluorescence correlation data with asynchronous time-correlated single-photon counting.

Authors: Michael Wahl; Ingo Gregor; Mattias Patting; Jörg Enderlein
Journal: Opt Express Date: 2003-12-29 Impact factor: 3.894

3. The quantum internet.

Authors: H J Kimble
Journal: Nature Date: 2008-06-19 Impact factor: 49.962

4. Direct measurement of off-state trapping rate fluctuations in single quantum dot fluorescence.

Authors: Amy A Cordones; Teresa J Bixby; Stephen R Leone
Journal: Nano Lett Date: 2011-07-08 Impact factor: 11.189

5. Autocorrelation analysis for the unbiased determination of power-law exponents in single-quantum-dot blinking.

Authors: Julien Houel; Quang T Doan; Thomas Cajgfinger; Gilles Ledoux; David Amans; Antoine Aubret; Agnès Dominjon; Sylvain Ferriol; Rémi Barbier; Michel Nasilowski; Emmanuel Lhuillier; Benoît Dubertret; Christophe Dujardin; Florian Kulzer
Journal: ACS Nano Date: 2015-01-08 Impact factor: 15.881

6. Single Photon Sources in Atomically Thin Materials.

Authors: Milos Toth; Igor Aharonovich
Journal: Annu Rev Phys Chem Date: 2019-02-08 Impact factor: 12.703

7. Excitons and Biexciton Dynamics in Single CsPbBr₃ Perovskite Quantum Dots.

Authors: Bin Li; He Huang; Guofeng Zhang; Changgang Yang; Wenli Guo; Ruiyun Chen; Chengbing Qin; Yan Gao; Vasudevan P Biju; Andrey L Rogach; Liantuan Xiao; Suotang Jia
Journal: J Phys Chem Lett Date: 2018-11-30 Impact factor: 6.475

8. Memories in the photoluminescence intermittency of single cesium lead bromide nanocrystals.

Authors: Lei Hou; Chen Zhao; Xi Yuan; Jialong Zhao; Franziska Krieg; Philippe Tamarat; Maksym V Kovalenko; Chunlei Guo; Brahim Lounis
Journal: Nanoscale Date: 2020-03-17 Impact factor: 7.790

9. Nanocrystals of Cesium Lead Halide Perovskites (CsPbX₃, X = Cl, Br, and I): Novel Optoelectronic Materials Showing Bright Emission with Wide Color Gamut.

Authors: Loredana Protesescu; Sergii Yakunin; Maryna I Bodnarchuk; Franziska Krieg; Riccarda Caputo; Christopher H Hendon; Ruo Xi Yang; Aron Walsh; Maksym V Kovalenko
Journal: Nano Lett Date: 2015-02-02 Impact factor: 11.189

10. Single molecule conformational memory extraction: p5ab RNA hairpin.

Authors: Steve Pressé; Jack Peterson; Julian Lee; Phillip Elms; Justin L MacCallum; Susan Marqusee; Carlos Bustamante; Ken Dill
Journal: J Phys Chem B Date: 2014-06-05 Impact factor: 2.991