
A short walk in quantum probability.

Abstract

This is a personal survey of aspects of quantum probability related to the Heisenberg commutation relation for canonical pairs. Using the failure, in general, of non-negativity of the Wigner distribution for canonical pairs to motivate a more satisfactory quantum notion of joint distribution, we visit a central limit theorem for such pairs and a resulting family of quantum planar Brownian motions which deform the classical planar Brownian motion, together with a corresponding family of quantum stochastic areas. This article is part of the themed issue 'Hilbert's sixth problem'.


Keywords: quantum central limit theorem; quantum planar Brownian motion; quantum probability

Year: 2018 PMID: 29555800 PMCID: PMC5897838 DOI: 10.1098/rsta.2017.0226

Source DB: PubMed Journal: Philos Trans A Math Phys Eng Sci ISSN: 1364-503X Impact factor: 4.226

Introduction

Kolmogorov’s great book [1], which provided a rigorous mathematical foundation for the classical theory of probability, was published in 1933. In Kolmogorovian probability theory, as refined by his disciples, the fundamental notion is that of a probability space $(\Omega,\mathcal{F},\mathbb{P})$, that is, a triple comprising a non-empty set $\Omega$, a σ-field $\mathcal{F}$ of subsets of $\Omega$, and a probability measure $\mathbb{P}$ on $\mathcal{F}$. Real-valued random variables, or observables in physicists’ language, are represented as real-valued functions $F$ on $\Omega$ which are measurable with respect to the pair of σ-fields $(\mathcal{F},\mathcal{B}(\mathbb{R}))$, where $\mathcal{B}(\mathbb{R})$ is the σ-field of Borel subsets of $\mathbb{R}$, so that $F^{-1}(B)\in\mathcal{F}$ for each $B\in\mathcal{B}(\mathbb{R})$.

Such random variables admit a functional calculus; for a bounded, $\mathcal{B}(\mathbb{R})$-measurable $\mathbb{R}$-valued function $f$ on $\mathbb{R}$, $f(F)$ is the random variable given by $(f(F))(\omega)=f(F(\omega))$. This extends to complex-valued functions on $\mathbb{R}$ by separating real and imaginary parts. The usual rules of functional calculus hold, e.g. $(fg)(F)=f(F)\,g(F)$. The probability distribution of $F$ can be defined as the probability measure $\mathbb{P}_F$ on the Borel subsets of $\mathbb{R}$ given by $\mathbb{P}_F(B)=\mathbb{P}(F^{-1}(B))$. Thus, if it exists, the expectation or mean of $F$ is given by $\mathbb{E}[F]=\int_\Omega F\,d\mathbb{P}=\int_{\mathbb{R}}\lambda\,\mathbb{P}_F(d\lambda)$. The probability distribution is characterized by the family of expectations it yields for bounded measurable functions $f$ of $F$: $\mathbb{E}[f(F)]=\int_{\mathbb{R}}f(\lambda)\,\mathbb{P}_F(d\lambda)$. In fact, the exponential functions $\lambda\mapsto e^{ix\lambda}$, $x\in\mathbb{R}$, are sufficient for this characterization, hence the characteristic function $\phi(x)=\mathbb{E}[e^{ixF}]$ of $F$ is well named.

One can imagine that, if Kolmogorov had been familiar with another book [2] published slightly earlier, he might have wished to have written a different book. But we must assume that he was unaware of the existence of quantum probability. Quantum probability is, in the first place, a non-commutative extension of classical probability, in which random variables are represented as self-adjoint operators $S$ acting in a complex Hilbert space $\mathcal{H}$, and the underlying probability measure by a unit vector $\psi\in\mathcal{H}$. The pair $(\mathcal{H},\psi)$ is an example of a quantum probability space (but a more general notion will be needed below).
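According to the spectral theorem, $S$ admits a unique spectral resolution, $S=\int_{\mathbb{R}}\lambda\,E(d\lambda)$, in terms of a projection-valued measure $E$ on the Borel field. Using this, the quantum random variable $S$ acquires a probability distribution
$$\mathbb{P}_S(B)=\langle\psi,E(B)\psi\rangle,\quad B\in\mathcal{B}(\mathbb{R}). \tag{1.1}$$
The expectation, if it exists, is given by $\mathbb{E}[S]=\int_{\mathbb{R}}\lambda\,\mathbb{P}_S(d\lambda)=\langle\psi,S\psi\rangle$. Alternatively, using the bounded Borel measurable functional calculus for $S$, we find that $\mathbb{E}[f(S)]=\int_{\mathbb{R}}f(\lambda)\,\mathbb{P}_S(d\lambda)=\langle\psi,f(S)\psi\rangle$, which is always well defined. Hence, we can write the corresponding characteristic function as $\phi(x)=\langle\psi,e^{ixS}\psi\rangle$.

Kolmogorovian classical probability based on a probability space $(\Omega,\mathcal{F},\mathbb{P})$ can be included in the quantum framework by taking the complex Hilbert space to be $L^2(\Omega,\mathcal{F},\mathbb{P})$ and the unit vector $\psi$ to be the element $\psi(\omega)=1$, $\omega\in\Omega$. The classical random variable $F$ can then be represented by the self-adjoint operator $\mathrm{mult}_F$ of multiplication by $F$, acting on the domain on which this action yields a function still within $L^2(\Omega,\mathcal{F},\mathbb{P})$, which is the whole of $L^2(\Omega,\mathcal{F},\mathbb{P})$ if $F$ is bounded. Conversely, any individual quantum random variable $S$ can be realized classically by the function $F(\lambda)=\lambda$ on the probability space $(\mathbb{R},\mathcal{B}(\mathbb{R}),\mathbb{P}_S)$, where $\mathbb{P}_S$ is the probability distribution (1.1).

Now suppose we are given two quantum random variables $R$ and $S$. If these are bounded self-adjoint operators, then by definition they commute if $RS=SR$. This is the case if and only if any of the following three equivalent conditions hold, where $E_R$ and $E_S$ denote the spectral measures of $R$ and $S$:
(i) $E_R(A)E_S(B)=E_S(B)E_R(A)$ for arbitrary Borel sets $A$ and $B$;
(ii) $f(R)g(S)=g(S)f(R)$ for arbitrary bounded Borel measurable functions $f$ and $g$;
(iii) $e^{ixR}e^{iyS}=e^{iyS}e^{ixR}$ for arbitrary real $x$ and $y$.
For unbounded $R$ and $S$, we define commutativity to mean that these three equivalent conditions hold.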
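The spectral recipe above can be checked in finite dimensions, where a self-adjoint operator is a Hermitian matrix and the projection-valued measure is a sum of eigenprojections. The following is a minimal sketch under that assumption; the matrix `S`, the vector `psi` and the helper names are illustrative choices, not taken from the article.

```python
import numpy as np

def spectral_distribution(S, psi):
    """Eigenvalues of Hermitian S and the probabilities <psi, E({lambda_k}) psi>."""
    eigvals, eigvecs = np.linalg.eigh(S)        # S = sum_k lambda_k |v_k><v_k|
    amplitudes = eigvecs.conj().T @ psi         # components <v_k, psi>
    return eigvals, np.abs(amplitudes) ** 2

def exp_i_x_S(S, x, terms=60):
    """e^{ixS} by a truncated Taylor series (adequate for small matrices)."""
    result = np.eye(S.shape[0], dtype=complex)
    term = np.eye(S.shape[0], dtype=complex)
    for k in range(1, terms):
        term = term @ (1j * x * S) / k
        result = result + term
    return result

# Hypothetical 2x2 observable and unit vector.
S = np.array([[1.0, 1.0j], [-1.0j, -1.0]])
psi = np.array([1.0, 0.0], dtype=complex)

lam, prob = spectral_distribution(S, psi)

# Expectation computed from the distribution and directly as <psi, S psi>.
mean_from_distribution = float(np.sum(lam * prob))
mean_from_inner_product = float(np.real(np.vdot(psi, S @ psi)))

# Characteristic function computed from the distribution and as <psi, e^{ixS} psi>.
x = 0.7
phi_from_distribution = np.sum(prob * np.exp(1j * x * lam))
phi_from_operator = np.vdot(psi, exp_i_x_S(S, x) @ psi)
```

Both pairs of quantities agree to rounding error, which is the finite-dimensional content of the statement that the distribution (1.1) encodes the expectations of all functions of $S$.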
We may then define the joint probability distribution of the commuting random variables $R$ and $S$ as the probability measure $\mathbb{P}_{R,S}$ on $\mathbb{R}^2$ for which
$$\int_{\mathbb{R}^2}e^{i(x\lambda+y\mu)}\,\mathbb{P}_{R,S}(d\lambda,d\mu)=\langle\psi,e^{ixR}e^{iyS}\psi\rangle,\quad x,y\in\mathbb{R}.$$
That such a measure exists and is unique can be inferred from the two-dimensional form of Bochner’s theorem, after checking that the function $\phi(x,y)=\langle\psi,e^{ixR}e^{iyS}\psi\rangle$ satisfies the conditions of that theorem: $\phi$ is continuous, $\phi(0,0)=1$, and $\sum_{j,k=1}^{n}\bar z_j z_k\,\phi(x_k-x_j,y_k-y_j)\ge 0$ for all $n$, all complex $z_1,\dots,z_n$ and all real $x_1,\dots,x_n,y_1,\dots,y_n$. Generalizing the case of a single random variable, the two commuting quantum random variables $R$ and $S$ can be realized classically by the functions $D(\lambda,\mu)=\lambda$ and $F(\lambda,\mu)=\mu$ on the probability space $(\mathbb{R}^2,\mathcal{B}(\mathbb{R}^2),\mathbb{P}_{R,S})$.

But if $R$ and $S$ do not commute, there is usually no sensible notion of joint probability distribution and they cannot be simultaneously realized classically. In quantum mechanics, it is not possible to measure simultaneously the values of the observables represented by $R$ and $S$. Thus, there is no way of empirically constructing a joint probability distribution and no obligation on quantum probability to say what it will be. Quantum probability really is different.

In quantum probability, the observables belonging to a particular physical system are often taken to be the self-adjoint operators affiliated to a von Neumann algebra $\mathcal{N}$, that is, to a unital sub-*-algebra of the algebra $B(\mathcal{H})$ of bounded operators on the Hilbert space $\mathcal{H}$, which is closed in the strong operator topology, in which a sequence $(S_n)$ converges to a limit $S$ if and only if $S_n\phi$ converges to $S\phi$ for every $\phi\in\mathcal{H}$. Here, a self-adjoint operator $S$ is affiliated to $\mathcal{N}$ if and only if either it is bounded and belongs to $\mathcal{N}$ or (equivalently in the bounded case) all its spectral projections belong to $\mathcal{N}$ (or $f(S)\in\mathcal{N}$ for every bounded measurable function $f$, or $e^{ixS}\in\mathcal{N}$ for all $x\in\mathbb{R}$). Another assumption which is commonly made, though it does not hold in some important examples, is that the unit vector $\psi$ is cyclic, that is, $\mathcal{N}\psi$ is dense in $\mathcal{H}$, and separating, that is, $S\psi=0$ for $S\in\mathcal{N}$ implies that $S=0$.
A commutative von Neumann algebra is always isomorphic to the algebra $L^\infty(\Omega,\mathcal{F},\mathbb{P})$ of bounded measurable complex-valued functions acting by multiplication on the Hilbert space $L^2(\Omega,\mathcal{F},\mathbb{P})$ for some probability space $(\Omega,\mathcal{F},\mathbb{P})$; thus quantum probability is in a very precise sense a non-commutative generalization of Kolmogorovian probability. This situation may be compared with non-commutative geometry [3].

Canonical pairs

Much of my life has been concerned with probabilistic aspects of the Heisenberg commutation relation
$$[p,q]=pq-qp=-\frac{ih}{2\pi}.$$
Here, $q$ is the position observable and $p$ is the canonically conjugate momentum observable of a particle localizable in one dimension, and $h$ is Planck’s constant, whose value in mks units is approximately $6.626\times10^{-34}$ J s. Despite the smallness of this number, in order to harmonize some probabilistic conventions with physics, we will find it convenient to take $h=4\pi$ and to define a canonical pair as a pair of self-adjoint operators $(p,q)$ satisfying the commutation relation
$$[p,q]=-2i \tag{2.1}$$
in the sense that the corresponding families of unitary operators $(e^{ixp})_{x\in\mathbb{R}}$ and $(e^{iyq})_{y\in\mathbb{R}}$ satisfy the formally equivalent but mathematically rigorous Weyl commutation relations
$$e^{ixp}e^{iyq}=e^{2ixy}e^{iyq}e^{ixp},\quad x,y\in\mathbb{R}. \tag{2.2}$$

An example of such a pair, called the Schrödinger pair $(p_{\rm Schr},q_{\rm Schr})$, can be constructed in the Hilbert space $L^2(\mathbb{R})$ by defining the two families of operators
$$(e^{ixp_{\rm Schr}}f)(s)=f(s+2x),\qquad (e^{iyq_{\rm Schr}}f)(s)=e^{iys}f(s),$$
having first verified that the families of unitary operators defined by these actions are indeed both continuous unitary representations of the additive group $\mathbb{R}$, so that by Stone’s theorem they determine unique self-adjoint operators $p_{\rm Schr}$ and $q_{\rm Schr}$, and that (2.2) holds.

In fact, by the Stone–von Neumann uniqueness theorem [4-6], the Schrödinger pair is essentially unique. More precisely, given an arbitrary canonical pair $(p,q)$ acting in a Hilbert space $\mathcal{H}$, there exist a Hilbert space $\mathcal{K}$ and a Hilbert space isomorphism (i.e. a unitary transformation) $U$ from $\mathcal{H}$ to $L^2(\mathbb{R})\otimes\mathcal{K}$, which intertwines each $e^{ixp}$ with $e^{ixp_{\rm Schr}}\otimes I$ and each $e^{iyq}$ with $e^{iyq_{\rm Schr}}\otimes I$. Another useful canonical pair is in the Hilbert space $\ell^2$ of square-summable complex sequences, which it is convenient to regard as the Fock space $\Gamma(\mathbb{C})$ over the Hilbert space $\mathbb{C}$, in the sense of the following definition.
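The Weyl relation for the Schrödinger pair can be verified pointwise on any test function. The sketch below assumes the realization in which $e^{ixp}$ acts as the shift $f(s)\mapsto f(s+2x)$ and $e^{iyq}$ as multiplication by $e^{iys}$ (the convention matching $[p,q]=-2i$); the test function `f0` and the sample points are arbitrary illustrative choices.

```python
import cmath
import math

def shift(x):
    """Action of e^{ixp}: f -> f(. + 2x) (assumed Schrodinger realization)."""
    return lambda f: (lambda s: f(s + 2.0 * x))

def phase(y):
    """Action of e^{iyq}: f -> e^{iys} f(s)."""
    return lambda f: (lambda s: cmath.exp(1j * y * s) * f(s))

def f0(s):
    """An arbitrary smooth, rapidly decaying test function."""
    return math.exp(-s * s / 4.0) * (1.0 + 0.5 * s)

x, y = 0.3, -1.1
lhs = shift(x)(phase(y)(f0))            # e^{ixp} e^{iyq} f0
rhs = phase(y)(shift(x)(f0))            # e^{iyq} e^{ixp} f0
weyl_factor = cmath.exp(2j * x * y)     # the phase e^{2ixy} in the Weyl relation

sample_points = [0.25 * k for k in range(-20, 21)]
max_err = max(abs(lhs(s) - weyl_factor * rhs(s)) for s in sample_points)
```

Here `max_err` is at the level of floating-point rounding, because the identity $e^{iy(s+2x)}f(s+2x)=e^{2ixy}\,e^{iys}f(s+2x)$ holds exactly at every point $s$.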

Definition 2.1.

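Given a complex Hilbert space $\mathfrak{h}$, the Fock space over $\mathfrak{h}$ is a Hilbert space $\Gamma(\mathfrak{h})$ equipped with a generating family of exponential vectors $e(f)$, $f\in\mathfrak{h}$, satisfying
$$\langle e(f),e(g)\rangle=\exp\langle f,g\rangle. \tag{2.3}$$

That such a Hilbert space exists and is unique, in the sense that, given any two candidate Fock spaces over $\mathfrak{h}$, there is a unique Hilbert space isomorphism which exchanges the two candidate exponential vectors corresponding to each $f\in\mathfrak{h}$, follows from the kernel theorem [7] and the non-negative definiteness of the kernel $(f,g)\mapsto\exp\langle f,g\rangle$ over $\mathfrak{h}$. Physicists usually realize the Fock space explicitly as the infinite direct sum of symmetrized tensor products, in which case the exponential vectors are given by
$$e(f)=\Big(1,\,f,\,\frac{f\otimes f}{\sqrt{2!}},\,\frac{f\otimes f\otimes f}{\sqrt{3!}},\,\dots\Big).$$
But our more abstract view of Fock spaces lends itself better to probabilistic aspects. By the uniqueness property, given a Hilbert space automorphism $U$ on $\mathfrak{h}$ there exists a unique automorphism $\Gamma(U)$ on $\Gamma(\mathfrak{h})$, called, for physical reasons, the second quantization of $U$, such that, for each exponential vector $e(f)$, $\Gamma(U)e(f)=e(Uf)$.

Another useful family of unitary operators on $\Gamma(\mathfrak{h})$ are the so-called Weyl operators $W(f)$, $f\in\mathfrak{h}$, which are conveniently defined by their actions on the exponential vectors,
$$W(f)e(g)=\exp\big(-\tfrac12\|f\|^2-\langle f,g\rangle\big)\,e(f+g).$$
They satisfy the Weyl relation
$$W(f)W(g)=\exp\big(-i\,\mathrm{Im}\langle f,g\rangle\big)\,W(f+g). \tag{2.4}$$
Now take $\mathfrak{h}=\mathbb{C}$ and consider the one-parameter groups of unitary operators $(W(ix))_{x\in\mathbb{R}}$ and $(W(y))_{y\in\mathbb{R}}$. From (2.4), $W(ix)W(ix')=W(i(x+x'))$ and $W(y)W(y')=W(y+y')$, whereas $W(ix)W(y)=e^{2ixy}W(y)W(ix)$, so that the Weyl relations (2.2) hold, and we obtain a new canonical pair $(p_{\rm Fock},q_{\rm Fock})$ by writing $e^{ixp_{\rm Fock}}=W(ix)$ and $e^{iyq_{\rm Fock}}=W(y)$.

In fact, the pair $(p_{\rm Fock},q_{\rm Fock})$ is irreducible and the isomorphism $U$ of the Stone–von Neumann uniqueness theorem is thus from $\Gamma(\mathbb{C})$ itself to $L^2(\mathbb{R})$. Its inverse can be constructed explicitly as follows. First, apply the isomorphism from $L^2(\mathbb{R})$ to the Hilbert space $\ell^2$ determined by the orthonormal basis of Hermite functions. Then compose this with the isomorphism from $\ell^2$ to $\Gamma(\mathbb{C})$ in which the successive tensor powers of $\mathbb{C}$ are all identified with $\mathbb{C}$ itself by multiplication of complex numbers, and the preimage of each exponential vector $e(z)$ is its physicists’ realization, $(1,z,z^2/\sqrt{2!},z^3/\sqrt{3!},\dots)$.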
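For the one-dimensional case $\mathfrak{h}=\mathbb{C}$, the physicists’ realization of an exponential vector is the square-summable sequence $(z^n/\sqrt{n!})_{n\ge0}$, and the defining inner-product property of exponential vectors can be checked numerically after truncation. This is a minimal sketch; the truncation level `n_max` and the sample points `f`, `g` are assumptions chosen so that the neglected tail is negligible.

```python
import numpy as np
from math import factorial

def exponential_vector(z, n_max=60):
    """Truncated sequence (z^n / sqrt(n!)) representing e(z) in Fock space over C."""
    n = np.arange(n_max)
    sqrt_factorials = np.sqrt(np.array([float(factorial(k)) for k in range(n_max)]))
    return np.power(complex(z), n) / sqrt_factorials

f = 0.8 + 0.3j
g = -0.5 + 1.2j

# Inner product conjugate-linear in the first argument, as np.vdot computes it.
inner_numeric = np.vdot(exponential_vector(f), exponential_vector(g))
inner_expected = np.exp(np.conj(f) * g)      # exp<f, g> with <f, g> = conj(f) * g

# The vacuum vector e(0) is (1, 0, 0, ...), a unit vector.
vacuum_norm = np.linalg.norm(exponential_vector(0.0))
```

The two values of the inner product agree to rounding error because $\sum_n \bar f^{\,n}g^n/n!=\exp(\bar f g)$ and the tail beyond 60 terms is far below machine precision for arguments of this size.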

Joint distributions for canonical pairs

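Can we construct a joint probability distribution for a canonical pair $(p,q)$ acting in a quantum probability space $(\mathcal{H},\psi)$? One tempting route to such a construction is based on the observation that, for all real $x$ and $y$, the self-adjoint operator $xp+yq$ can be defined unambiguously as the generator of the one-parameter group $(e^{-it^2xy}e^{itxp}e^{ityq})_{t\in\mathbb{R}}$. Hence, the unitary operator $e^{i(xp+yq)}$ is well defined. We can then write $\langle\psi,e^{i(xp+yq)}\psi\rangle$ as a Fourier–Stieltjes transform,
$$\langle\psi,e^{i(xp+yq)}\psi\rangle=\int_{\mathbb{R}^2}e^{i(xu+yv)}\,W(du,dv),$$
and hope that $W$ is a plausible joint distribution. Sometimes, this works [8]; sometimes, it does not [9].

To see this, let us first write $xp+yq=i(\bar z a^\dagger-za)$, where $z=x+iy$ and $a^\dagger$ and $a$ are the so-called creation and annihilation operators, respectively,
$$a^\dagger=\tfrac12(q-ip),\qquad a=\tfrac12(q+ip),\qquad [a,a^\dagger]=1.$$
By formal manipulation using the Baker–Hausdorff formula (which can be made rigorous [7]), one finds that $e^{i(xp+yq)}=e^{-\frac12|z|^2}e^{-\bar z a^\dagger}e^{za}$, and so
$$\langle\psi,e^{i(xp+yq)}\psi\rangle=e^{-\frac12|z|^2}\langle e^{-za}\psi,e^{za}\psi\rangle. \tag{3.1}$$
Now, using the Schrödinger realization, $(p,q)=(p_{\rm Schr},q_{\rm Schr})$, take $\psi$ to be the zero-order Hermite function in $L^2(\mathbb{R})$, $\psi(s)=(2\pi)^{-1/4}e^{-s^2/4}$. Then $a\psi=0$. So $e^{za}\psi=\psi$ and it follows from (3.1) that our candidate characteristic function reduces to $e^{-\frac12(x^2+y^2)}$, corresponding to a joint distribution of isotropic Gaussian form with unit variance.

But, if $\psi_1=a^\dagger\psi$, we find that $\|\psi_1\|^2=\langle\psi,aa^\dagger\psi\rangle=\langle\psi,(a^\dagger a+1)\psi\rangle=1$, so $\psi_1$ is a unit vector which we can use to generate probabilities, and, using the commutation relation $e^{za}a^\dagger=(a^\dagger+z)e^{za}$ and the relation $\langle a^\dagger\psi,\psi\rangle=\langle\psi,a\psi\rangle=0$,
$$\langle\psi_1,e^{i(xp+yq)}\psi_1\rangle=(1-x^2-y^2)\,e^{-\frac12(x^2+y^2)}.$$
This is the Fourier transform of
$$W(u,v)=\frac{1}{2\pi}(u^2+v^2-1)\,e^{-\frac12(u^2+v^2)},$$
which may perhaps look like a plausible joint probability density until one uses it to compute the probability that $(p,q)$ lies in the disc $\{(u,v):u^2+v^2\le1\}$, which turns out to be negative. In fact, the appearance of such negative probabilities is the rule rather than the exception; in the Schrödinger realization, only when the probability vector is itself of essentially Gaussian form do they not appear [9]. Nowadays, the ‘negative probabilities’ which always appear otherwise are widely used in quantum optics as a measure of ‘quantumness’.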
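The disc computation mentioned above is short enough to carry out numerically. The sketch below assumes the candidate density is $W(u,v)=\frac{1}{2\pi}(u^2+v^2-1)e^{-(u^2+v^2)/2}$, the Wigner function of the first excited state in the convention $h=4\pi$; the function names and the midpoint-rule resolution are illustrative.

```python
import numpy as np

def candidate_density(u, v):
    """Assumed candidate joint 'density' for the vector psi_1 = a-dagger psi."""
    r2 = u * u + v * v
    return (r2 - 1.0) * np.exp(-r2 / 2.0) / (2.0 * np.pi)

def mass_of_disc(radius, n=200_000):
    """Integrate the density over a centred disc, using polar coordinates
    and the midpoint rule: 2*pi * integral of W(r) r dr over [0, radius]."""
    dr = radius / n
    r = (np.arange(n) + 0.5) * dr
    return float(np.sum(2.0 * np.pi * candidate_density(r, 0.0) * r) * dr)

disc_mass = mass_of_disc(1.0)        # 'probability' of the unit disc
total_mass = mass_of_disc(12.0)      # effectively the whole plane
exact_disc_mass = 1.0 - 2.0 * np.exp(-0.5)   # closed form of the same integral
```

The total mass is 1, as a probability density requires, but the mass assigned to the unit disc is $1-2e^{-1/2}$, which is negative, so $W$ cannot be a genuine joint distribution.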
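To find a truly quantum substitute for the joint probability distribution of a canonical pair $(p,q)$, recall that, in the commutative case, the joint probability distribution encodes all expectations of bounded measurable functions of the two observables, that is, elements of the von Neumann algebra that they generate. Let us find a way of similarly encoding the expectations of elements of the von Neumann algebra generated by a canonical pair $(p,q)$. In accordance with the Stone–von Neumann theorem, we write
$$\mathcal{H}=L^2(\mathbb{R})\otimes\mathcal{K},\qquad p=p_{\rm Schr}\otimes I,\qquad q=q_{\rm Schr}\otimes I,$$
and denote by $\phi$ the unit vector in $L^2(\mathbb{R})\otimes\mathcal{K}$ corresponding to $\psi$. Then, because the Schrödinger representation is irreducible, the von Neumann algebra generated by $(p,q)$ is $B(L^2(\mathbb{R}))\otimes I$. For each element $S\otimes I$ of this algebra, we write
$$\langle\phi,(S\otimes I)\phi\rangle=\mathrm{tr}(\rho S),$$
where $\rho$ is the partial trace of the one-dimensional projector $|\phi\rangle\langle\phi|$ over the auxiliary Hilbert space $\mathcal{K}$. The operator $\rho$ is a non-negative trace-class operator of unit trace on $L^2(\mathbb{R})$, called the distribution operator of $(p,q)$. It is our substitute for a classical joint probability distribution for $p$ and $q$. Thus, two canonical pairs are identically distributed if they have the same distribution operator.

Now suppose that we are given two canonical pairs $(p,q)$ and $(p',q')$ which commute with each other in the sense that each of $p$ and $q$ commutes with each of $p'$ and $q'$, that is, for arbitrary real $x,y$,
$$e^{ixp}e^{iyp'}=e^{iyp'}e^{ixp},\quad e^{ixp}e^{iyq'}=e^{iyq'}e^{ixp},\quad e^{ixq}e^{iyp'}=e^{iyp'}e^{ixq},\quad e^{ixq}e^{iyq'}=e^{iyq'}e^{ixq}. \tag{3.2}$$
Equivalently, the two pairs generate von Neumann algebras contained in each other’s commutants. The two-dimensional form of the Stone–von Neumann theorem allows us to define, in a similar way, a joint distribution operator $\rho_{(p,q),(p',q')}$, which is an operator on $L^2(\mathbb{R}^2)$ which encodes the expectations of elements of the von Neumann algebra generated jointly by $p,q,p',q'$. The two pairs are stochastically independent if $\rho_{(p,q),(p',q')}=\rho_{(p,q)}\otimes\rho_{(p',q')}$, in so far as $L^2(\mathbb{R}^2)$ is canonically identified with $L^2(\mathbb{R})\otimes L^2(\mathbb{R})$, or equivalently if and only if $\langle\psi,SS'\psi\rangle=\langle\psi,S\psi\rangle\langle\psi,S'\psi\rangle$ for arbitrary $S$ and $S'$ belonging to the von Neumann algebras generated by the pairs $(p,q)$ and $(p',q')$, respectively.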
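The partial-trace construction of the distribution operator has an exact finite-dimensional analogue, which the following sketch illustrates. It replaces $L^2(\mathbb{R})$ and the auxiliary space by $\mathbb{C}^m$ and $\mathbb{C}^k$; the dimensions, the random vector `phi` and the random observable `S` are hypothetical stand-ins.

```python
import numpy as np

rng = np.random.default_rng(0)
m, k = 3, 4                                   # dim of 'system' and auxiliary spaces

# Unit vector phi in C^m (x) C^k, stored as an m x k coefficient matrix.
phi = rng.normal(size=(m, k)) + 1j * rng.normal(size=(m, k))
phi /= np.linalg.norm(phi)

# Partial trace of |phi><phi| over the auxiliary factor:
# rho[a, a'] = sum_b phi[a, b] * conj(phi[a', b]).
rho = phi @ phi.conj().T

# A random Hermitian 'observable' S on the first factor.
A = rng.normal(size=(m, m)) + 1j * rng.normal(size=(m, m))
S = (A + A.conj().T) / 2.0

phi_vec = phi.reshape(-1)                     # row-major: index a * k + b
expectation_full = np.vdot(phi_vec, np.kron(S, np.eye(k)) @ phi_vec)
expectation_reduced = np.trace(rho @ S)

rho_trace = np.trace(rho).real
rho_min_eigenvalue = np.linalg.eigvalsh(rho).min()
```

The reduced operator `rho` has unit trace, is non-negative, and reproduces every expectation of the form $\langle\phi,(S\otimes I)\phi\rangle$, which is exactly the role the distribution operator plays for a canonical pair.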

A quantum central limit theorem

Let $(p_j,q_j)$, $j=1,2,\dots$, be a sequence of canonical pairs, any two of which commute with each other in the sense of (3.2). Then it makes sense to demand also that they are independent and identically distributed. Assuming also that the means are zero, $\langle\psi,p_j\psi\rangle=\langle\psi,q_j\psi\rangle=0$, and that the second moments are all finite, then by applying a common unimodular linear transformation $(p_j,q_j)\mapsto(\alpha p_j+\beta q_j,\ \gamma p_j+\delta q_j)$, where $\alpha,\beta,\gamma,\delta\in\mathbb{R}$ with $\alpha\delta-\beta\gamma=1$, we may assume without loss of generality that the covariance matrix takes the canonical form in which
$$\langle\psi,p_j^2\psi\rangle=\langle\psi,q_j^2\psi\rangle=\sigma^2,\qquad \tfrac12\langle\psi,(p_jq_j+q_jp_j)\psi\rangle=0. \tag{4.1}$$
The variance parameter satisfies $\sigma^2\ge1$ in view of the Heisenberg uncertainty inequality.

The sequence $(p_j)$ consists of mutually commuting independent, identically distributed random variables of mean 0 and variance $\sigma^2$. So by the classical de Moivre–Laplace central limit theorem, the sequence $\bar p_n=n^{-1/2}\sum_{j=1}^np_j$ converges in distribution to the normal limit distribution $N(0,\sigma^2)$ of mean zero and variance $\sigma^2$. Similarly, $\bar q_n=n^{-1/2}\sum_{j=1}^nq_j$ converges in distribution to $N(0,\sigma^2)$. So far, so classical. Now consider the commutator
$$[\bar p_n,\bar q_n]=\frac1n\Big(\sum_{j<k}[p_j,q_k]+\sum_{j>k}[p_j,q_k]+\sum_{j=1}^n[p_j,q_j]\Big).$$
By our mutual commutativity assumption, the first two sums vanish, leaving only $\frac1n\sum_{j=1}^n[p_j,q_j]=-2i$. So, for each $n=1,2,\dots$,
$$(\bar p_n,\bar q_n)=\Big(n^{-1/2}\sum_{j=1}^np_j,\ n^{-1/2}\sum_{j=1}^nq_j\Big) \tag{4.2}$$
is another canonical pair! The reader is invited to rigorously reformulate and prove this statement, using only the one-parameter groups generated by the various unbounded self-adjoint operators.

Is there a quantum central limit theorem behind this? Note that such a theorem is not just the two-dimensional version of the de Moivre–Laplace theorem, as in general the canonical pairs (4.2) do not have a joint distribution in the classical sense. Let us first construct the limit distribution operator, which must have the properties that the individual distributions of $p$ and $q$ are both $N(0,\sigma^2)$, and that the covariance matrix is given by (4.1), which is inherited unchanged from each of the approximands (4.2). First, we consider the case of minimal variance allowed by the Heisenberg uncertainty principle, namely $\sigma^2=1$.
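In this case, the distribution operator is the ‘pure state density operator’ $\rho=|\psi_0\rangle\langle\psi_0|$, where $\psi_0(s)=(2\pi)^{-1/4}e^{-s^2/4}$ is the zero-order Hermite function. Indeed, passing to the equivalent Fock representation, in which $\psi_0$ corresponds to $e(0)$, we find that, for $x\in\mathbb{R}$, $\langle e(0),e^{ixp_{\rm Fock}}e(0)\rangle=e^{-x^2/2}$, so that $p$ (and similarly $q$) is indeed $N(0,1)$ distributed. Moreover, $\langle\psi_0,pq\,\psi_0\rangle=-i$ and similarly $\langle\psi_0,qp\,\psi_0\rangle=i$, so that the symmetrized covariance in (4.1) vanishes.

Now consider the case $\sigma^2>1$. Define positive real numbers $\alpha$ and $\beta$ by
$$\alpha=\sqrt{\tfrac12(\sigma^2+1)},\qquad \beta=\sqrt{\tfrac12(\sigma^2-1)}, \tag{4.3}$$
so that $\sigma^2=\alpha^2+\beta^2$ and $\alpha^2-\beta^2=1$. Given a complex Hilbert space $\mathfrak{h}$, denote by $\bar{\mathfrak{h}}$ the Banach dual space of $\mathfrak{h}$, for $f\in\mathfrak{h}$ by $\bar f$ the bounded linear functional $g\mapsto\langle f,g\rangle$ and, for $S\in B(\mathfrak{h})$, by $\bar S$ the operator given by $\bar S\bar f=\overline{Sf}$. Now, equip the Hilbert space tensor product $\Gamma(\mathbb{C})\otimes\overline{\Gamma(\mathbb{C})}$ with the unit vector $e(0)\otimes\overline{e(0)}$ and define a canonical pair $(p,q)$ informally by
$$p=\alpha\,p_{\rm Fock}\otimes I+\beta\,I\otimes\overline{p_{\rm Fock}},\qquad q=\alpha\,q_{\rm Fock}\otimes I+\beta\,I\otimes\overline{q_{\rm Fock}}.$$
The random variable $p$ is the sum of independent, mutually commuting random variables $\alpha\,p_{\rm Fock}\otimes I$ and $\beta\,I\otimes\overline{p_{\rm Fock}}$, which are both normally distributed with zero mean and with variances $\alpha^2$ and $\beta^2$, respectively. Hence, $p$ is $N(0,\alpha^2+\beta^2)$, that is, $N(0,\sigma^2)$. The same is true of $q$. Moreover, because conjugation reverses the sign of $i$, $[p,q]=\alpha^2(-2i)+\beta^2(2i)=-2i(\alpha^2-\beta^2)=-2i$, so $(p,q)$ is indeed a canonical pair as claimed. Finally, because the means are zero, the cross terms vanish and $\langle pq\rangle=\alpha^2(-i)+\beta^2(i)=-i$, and similarly $\langle qp\rangle=i$, in agreement with (4.1).

What does it mean to say that the sequence of canonical pairs (4.2) converges in distribution to $(p,q)$? The simplest notion of convergence is that of distribution operators in the Hilbert–Schmidt norm. This would correspond to convergence in the $L^2$ sense of the densities of joint distributions to the corresponding Gaussian density, in contrast with the ‘weak convergence’ in the probabilistic sense [10] of the two-dimensional classical central limit theorem. But it can be shown [11] that in the quantum case convergence is in the stronger sense that, writing $\rho_n$ for the distribution operator of (4.2) and $\rho$ for that of $(p,q)$,
$$\lim_{n\to\infty}\mathrm{tr}(\rho_nS)=\mathrm{tr}(\rho S)$$
for an arbitrary bounded operator $S$ on $L^2(\mathbb{R})$. As bounded operators are non-commutative ‘bounded measurable functions’, our quantum central limit theorem, in which convergence is in the sense of expectations of such ‘functions’, is actually stronger than the classical central limit theorem in which only expectations of bounded continuous functions must converge.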
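The two constraints $\sigma^2=\alpha^2+\beta^2$ and $\alpha^2-\beta^2=1$ determine the positive numbers $\alpha$ and $\beta$ uniquely, and the bookkeeping for the variance and the commutator can be captured in a few lines. This is a minimal sketch; the function names are hypothetical and the commutator is represented only by its coefficient of $i$.

```python
import math

def alpha_beta(sigma2):
    """Solve alpha^2 + beta^2 = sigma2, alpha^2 - beta^2 = 1 for alpha > 0, beta >= 0."""
    if sigma2 < 1.0:
        raise ValueError("the Heisenberg uncertainty inequality requires sigma^2 >= 1")
    return math.sqrt((sigma2 + 1.0) / 2.0), math.sqrt((sigma2 - 1.0) / 2.0)

def variance_and_commutator(sigma2):
    """Variance of p = alpha*p1 + beta*p2 (independent unit-variance summands) and
    the coefficient c in [p, q] = c*i, given [p1, q1] = -2i and [p2, q2] = +2i."""
    alpha, beta = alpha_beta(sigma2)
    variance = alpha ** 2 + beta ** 2
    commutator_coefficient = -2.0 * alpha ** 2 + 2.0 * beta ** 2
    return variance, commutator_coefficient

variance, commutator_coefficient = variance_and_commutator(3.0)
```

For $\sigma^2=3$ this gives $\alpha=\sqrt2$ and $\beta=1$; the variance is 3 while the commutator coefficient stays at $-2$, so the pair remains canonical for every admissible $\sigma^2$.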
This is just as well because it is not obvious how to find an explicit non-commutative substitute in this context for the continuous functions similar to the von Neumann algebra generated for the measurable ones. For many years, the quantum central limit theorem of [11] found few applications, though as we will see below it was crucial in suggesting theoretical advances such as quantum Brownian motion. Recently, however, it has found an increasing number of applications in quantum statistics and estimation problems and elsewhere [12,13].

Quantum planar Brownian motion [14]

Donsker’s theorem [10], also known variously as the functional central limit theorem and as the invariance principle, is a generalization of the well-known construction of Brownian motion as a limit of Bernoulli random walks.

Theorem 5.1.

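Let $(X_j)$, $j=1,2,\dots$, be a sequence of independent identically distributed random variables of mean zero and variance $\sigma^2$. Then the sequence of random processes $X^{(n)}$ on $[0,1]$, obtained by linear interpolation of the rescaled partial sums $\frac{1}{\sigma\sqrt n}\sum_{j\le nt}X_j$, converges weakly to a standard one-dimensional Brownian motion $X$.

Here, weak convergence is in the sense of probability measures on the metric space of continuous functions vanishing at 0 on $[0,1]$ equipped with the sup norm. In particular, the distributions of continuous functions of $X^{(n)}$ such as the supremum converge with $n$ to their values on $X$. Thus, the convergence is stronger than that of all multi-dimensional joint distributions; for example, the supremum depends on infinitely many values.

Now let $(p_j,q_j)$, $j=1,2,\dots$, be a sequence of independent identically distributed canonical pairs. By the invariance principle, we can expect informally that, individually, each of the sequences of processes $P^{(n)}$ and $Q^{(n)}$, where
$$P^{(n)}(t)=\frac{1}{\sigma\sqrt n}\sum_{j\le nt}p_j,\qquad Q^{(n)}(t)=\frac{1}{\sigma\sqrt n}\sum_{j\le nt}q_j,$$
converge to Brownian motions $P$ and $Q$, respectively. To what do they converge jointly? To answer this question, let us observe that, as follows from (2.1),
$$[P^{(n)}(s),Q^{(n)}(t)]=\frac{1}{\sigma^2n}\sum_{j\le n\min(s,t)}[p_j,q_j]=-\frac{2i}{\sigma^2}\,\frac{\lfloor n\min(s,t)\rfloor}{n},$$
suggesting that the limit processes $P$ and $Q$ should satisfy
$$[P(s),Q(t)]=-\frac{2i}{\sigma^2}\min(s,t). \tag{5.1}$$
How do we construct two unit variance Brownian motions satisfying this commutation relation?

Consider first the case of minimal variance, $\sigma^2=1$. We work in the Fock space $\Gamma(L^2[0,1])$. Fix a real parameter $\theta\in[0,2\pi[$. Let $\chi_{[0,t[}$ denote the indicator function of $[0,t[\,\subset[0,1]$ and consider the family of Weyl operators $(W(xe^{i\theta}\chi_{[0,t[}))_{x\in\mathbb{R}}$. As $\mathrm{Im}\langle xe^{i\theta}\chi_{[0,t[},ye^{i\theta}\chi_{[0,t[}\rangle=0$, the Weyl relation (2.4) gives the one-parameter group property
$$W(xe^{i\theta}\chi_{[0,t[})\,W(ye^{i\theta}\chi_{[0,t[})=W((x+y)e^{i\theta}\chi_{[0,t[}).$$
So, for each fixed $\theta\in[0,2\pi[$ and $t\in[0,1]$, there is a unique self-adjoint operator $\Xi_\theta(t)$ such that $e^{ix\Xi_\theta(t)}=W(xe^{i\theta}\chi_{[0,t[})$ for all real $x$. As similarly $\mathrm{Im}\langle xe^{i\theta}\chi_{[0,s[},ye^{i\theta}\chi_{[0,t[}\rangle=0$, the family $(\Xi_\theta(s))_{s\in[0,1]}$ is commutative. Using the explicit action of the Weyl operators on $e(0)$, it can be verified that, with this probability vector, the process $\Xi_\theta$ is Gaussian, with stationary independent increments, and that $\langle e(0),e^{ix\Xi_\theta(t)}e(0)\rangle=e^{-\frac12x^2t}$, so that $\Xi_\theta(t)$ has variance $t$. This suffices for us to recognize it as a standard (unit variance) Brownian motion.

Now define processes $P$ and $Q$ by $P=\Xi_{\pi/2}$, $Q=\Xi_0$. Then, by (2.4), $W(ix\chi_{[0,s[})W(y\chi_{[0,t[})=e^{2ixy\min(s,t)}W(y\chi_{[0,t[})W(ix\chi_{[0,s[})$, so
$$e^{ixP(s)}e^{iyQ(t)}=e^{2ixy\min(s,t)}e^{iyQ(t)}e^{ixP(s)},$$
which, as required, is the rigorous Weyl form of (5.1) with $\sigma^2=1$.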
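The classical half of this picture, Theorem 5.1, is easy to probe by simulation. The sketch below uses Bernoulli steps of $\pm1$ (so $\sigma^2=1$) and checks only two second-order consequences of convergence to standard Brownian motion, namely $\mathrm{Var}\,X(1)=1$ and $\mathrm{Cov}(X(1/2),X(1))=1/2$; the path counts, step counts and seed are arbitrary assumptions.

```python
import numpy as np

def scaled_random_walk(n_steps, n_paths, rng):
    """Rescaled Bernoulli random walks S_k / sqrt(n) at the grid times k / n."""
    steps = 2.0 * rng.integers(0, 2, size=(n_paths, n_steps)) - 1.0   # +/-1 steps
    return np.cumsum(steps, axis=1) / np.sqrt(n_steps)

rng = np.random.default_rng(12345)
n_steps, n_paths = 200, 10_000
paths = scaled_random_walk(n_steps, n_paths, rng)

x_half = paths[:, n_steps // 2 - 1]     # value at time t = 1/2
x_one = paths[:, -1]                    # value at time t = 1

variance_at_one = float(np.var(x_one))
covariance_half_one = float(np.mean(x_half * x_one))   # means are zero

# A path functional that depends on the whole path, not on finitely many times.
mean_of_supremum = float(np.mean(np.max(paths, axis=1)))
```

The supremum is included as an example of a functional covered by weak convergence on path space; for standard Brownian motion on $[0,1]$ its mean is $\sqrt{2/\pi}\approx0.80$ by the reflection principle, and the simulated value approaches this from below as `n_steps` grows.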
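Note that we could equally well have taken $P=\Xi_{\theta+\pi/2}$, $Q=\Xi_\theta$. Denoting the latter candidates by $P_\theta$ and $Q_\theta$, it can be shown that this unit variance quantum planar Brownian motion has rotational invariance: informally, $(P_\theta,Q_\theta)=(\cos\theta\,P-\sin\theta\,Q,\ \sin\theta\,P+\cos\theta\,Q)$ is again a quantum planar Brownian motion with the same distribution as $(P,Q)$.

When $\sigma^2>1$, we construct $(P,Q)$ in the Hilbert space $\Gamma(L^2[0,1])\otimes\overline{\Gamma(L^2[0,1])}$ by defining the corresponding one-parameter unitary groups as
$$e^{ixP(t)}=W\big(\tfrac{\alpha}{\sigma}ix\chi_{[0,t[}\big)\otimes\overline{W\big(\tfrac{\beta}{\sigma}ix\chi_{[0,t[}\big)},\qquad e^{iyQ(t)}=W\big(\tfrac{\alpha}{\sigma}y\chi_{[0,t[}\big)\otimes\overline{W\big(\tfrac{\beta}{\sigma}y\chi_{[0,t[}\big)},$$
where $\alpha$ and $\beta$ are defined by (4.3), and taking the unit probability vector as $e(0)\otimes\overline{e(0)}$. Then $P$ and $Q$ are both sums of two independent Brownian motions of variances $\alpha^2/\sigma^2$ and $\beta^2/\sigma^2$, and hence are both Brownian motions of variance $\alpha^2/\sigma^2+\beta^2/\sigma^2=1$. But they satisfy (5.1) as required. This may be seen using the Weyl relation and the rules $\overline{ST}=\bar S\,\bar T$ and $\overline{\lambda S}=\bar\lambda\,\bar S$. The resulting quantum planar Brownian motion inherits rotational invariance from the case $\sigma^2=1$.

An important joint property of the Brownian motions $P$ and $Q$ is that, despite their non-commutativity, increments of $P$ commute with increments of $Q$ over disjoint intervals. Indeed, if $0\le v\le u\le t\le s\le1$, then, informally,
$$[P(u)-P(v),\,Q(s)-Q(t)]=-\frac{2i}{\sigma^2}\big(\min(u,s)-\min(u,t)-\min(v,s)+\min(v,t)\big)=-\frac{2i}{\sigma^2}(u-u-v+v)=0.$$
In fact, not only do the increments generate commuting von Neumann algebras but also they are independent in the sense that, for arbitrary real $x,y$,
$$\big\langle\psi,e^{ix(P(u)-P(v))}e^{iy(Q(s)-Q(t))}\psi\big\rangle=\big\langle\psi,e^{ix(P(u)-P(v))}\psi\big\rangle\,\big\langle\psi,e^{iy(Q(s)-Q(t))}\psi\big\rangle,$$
where $\psi$ denotes the probability vector, as is seen by ‘splitting’ [7,15] the Fock space between $u$ and $t$. This strong independent increments property leads on to a corresponding genuinely non-commutative strong Markov property for planar quantum Brownian motion [16] and a corresponding splitting at each Markov time [17].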
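The cancellation that makes increments over disjoint intervals commute is pure arithmetic on the function $\min(s,t)$, and can be tabulated directly. The sketch below assumes the formal relation $[P(s),Q(t)]=-(2i/\sigma^2)\min(s,t)$ and tracks only the real coefficient of $i$; the function names are hypothetical.

```python
def commutator_coefficient(s, t, sigma2=1.0):
    """Coefficient c with, formally, [P(s), Q(t)] = c * i (assumed relation)."""
    return -2.0 * min(s, t) / sigma2

def increment_commutator(p_interval, q_interval, sigma2=1.0):
    """Coefficient of i in [P(b) - P(a), Q(d) - Q(c)] by bilinearity,
    where p_interval = (a, b) and q_interval = (c, d)."""
    a, b = p_interval
    c, d = q_interval
    coeff = commutator_coefficient
    return (coeff(b, d, sigma2) - coeff(b, c, sigma2)
            - coeff(a, d, sigma2) + coeff(a, c, sigma2))

disjoint = increment_commutator((0.1, 0.3), (0.5, 0.9))       # intervals do not overlap
overlapping = increment_commutator((0.1, 0.5), (0.3, 0.9))    # overlap of length 0.2
```

For disjoint intervals the four terms cancel exactly, while for overlapping intervals the coefficient is $-2/\sigma^2$ times the length of the overlap, here $-0.4$.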

Quantum Lévy area

The minimal variance quantum planar Brownian motion has found many applications, most of which depend on the corresponding quantum stochastic calculus [7,15]. The case $\sigma^2>1$ has so far proved less versatile even though the corresponding quantum stochastic calculus [18,19] is in some ways technically simpler.

One of the most interesting constructions based on classical planar Brownian motion is Lévy’s stochastic area [20]. Informally, this is the signed area between the chord joining two points on the Brownian path and the path itself. Rigorously, it can be constructed either as a martingale limit of polygonal approximations [20], or as an equivalent iterated stochastic integral [21]. Though the result can no longer be understood as an area (there is no path), both methods of construction can be imitated for quantum planar Brownian motion [22,23].

The classical Lévy area is related to other areas of mathematics; for example, its moments are essentially the well-known Euler numbers which evaluate the Riemann zeta function at even natural numbers. The corresponding quantum moments have been calculated recently [24]; except when $\sigma=1$, when they all vanish [22], they involve the combinatorial Eulerian numbers, and correctly approach the corresponding classical values in the classical limit as $\sigma\to\infty$.

More generally, the so-called Lévy area formula (6.1) gives an explicit expression for the conditional characteristic function $\mathbb{E}[e^{izS}\mid(B_1(t),B_2(t))=(x_1,x_2)]$ of the Lévy area $S$ over the time interval $[0,t]$, given the value $(x_1,x_2)$ of the planar Brownian motion $(B_1,B_2)$ at time $t$. It has many such applications [25] and has recently been applied to simplified proofs of Apéry’s celebrated theorem that $\zeta(3)$ is irrational. A direct quantum analogue of (6.1) cannot be defined as it involves a joint value of the two mutually non-commuting components of quantum planar Brownian motion.
But, using the rotational invariance of the Brownian motion, we can rewrite (6.1) as a formula for the characteristic function of $S$ conditioned only on the squared radius $|B(t)|^2=x_1^2+x_2^2$. In this form, it is evident that the information in (6.1) is implicitly contained in the joint probability distribution for $S$ and the squared Brownian radius $|B|^2$. A quantum Lévy area formula would similarly contain the same information as a joint distribution for the quantum Lévy area and the quantum squared Brownian radius $P(t)^2+Q(t)^2$. Unfortunately, these two processes do not commute with each other or with themselves at different times.

However, help is at hand in the observation that $P(t)^2+Q(t)^2$ is, like the Lévy area, itself an iterated stochastic integral; in fact,
$$P(t)^2+Q(t)^2=2t+2\int_{0\le x<y<t}\big(dP(x)\,dP(y)+dQ(x)\,dQ(y)\big),$$
where, following the notation of [22,24], $\int_{a\le x<y<b}$ means integrate the multi-differential which follows as an iterated integral over $[a,b]$. Having removed the time integral, which commutes with the rest, we may find a substitute for the joint characteristic function, which is the exponential of a double integral and thus problematic in a non-commutative context, as the corresponding double product integral, whose definition does not require commutativity. A start has been made on this procedure [26].
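The classical objects of this section can be simulated directly. The sketch below assumes the normalization $S=\tfrac12\int_0^t(B_1\,dB_2-B_2\,dB_1)$ for the Lévy area, which may differ by a constant factor from the convention of [22,24]; with this normalization the unconditional characteristic function is the classical $\mathbb{E}[e^{izS}]=1/\cosh(zt/2)$. It also checks the discrete Itô identity behind the representation of the squared radius as $2t$ plus an iterated integral. Step counts, path counts and the seed are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(2018)
n_steps, n_paths, t = 200, 10_000, 1.0
dt = t / n_steps

# Increments and left-endpoint values of two independent Brownian motions.
dB1 = rng.normal(scale=np.sqrt(dt), size=(n_paths, n_steps))
dB2 = rng.normal(scale=np.sqrt(dt), size=(n_paths, n_steps))
B1_left = np.cumsum(dB1, axis=1) - dB1
B2_left = np.cumsum(dB2, axis=1) - dB2

# Ito-sum approximation of S = (1/2) * integral (B1 dB2 - B2 dB1).
levy_area = 0.5 * np.sum(B1_left * dB2 - B2_left * dB1, axis=1)

z = 1.0
empirical_cf = np.mean(np.exp(1j * z * levy_area))
exact_cf = 1.0 / np.cosh(z * t / 2.0)
empirical_variance = float(np.var(levy_area))       # exact value is t^2 / 4

# Discrete Ito identity: |B(t)|^2 = 2 * sum(B dB) + sum(dB^2), with sum(dB^2) ~ 2t.
radius_squared = np.sum(dB1, axis=1) ** 2 + np.sum(dB2, axis=1) ** 2
iterated_part = 2.0 * np.sum(B1_left * dB1 + B2_left * dB2, axis=1)
quadratic_variation = np.sum(dB1 ** 2 + dB2 ** 2, axis=1)
ito_residual = float(np.max(np.abs(radius_squared - iterated_part - quadratic_variation)))
mean_quadratic_variation = float(np.mean(quadratic_variation))
```

The residual of the Itô identity is zero up to rounding because it is an algebraic identity for each discretized path, and the quadratic variation term concentrates at $2t$, the time integral that the text removes before passing to product integrals.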


1. Linear Transformations in Hilbert Space: III. Operational Methods and Group Theory.

Authors: M H Stone
Journal: Proc Natl Acad Sci U S A Date: 1930-02 Impact factor: 11.205
