
Holevo Capacity of Discrete Weyl Channels.

Junaid Ur Rehman¹, Youngmin Jeong², Jeong San Kim³, Hyundong Shin⁴.

Abstract

Holevo capacity is the maximum rate at which a quantum channel can reliably transmit classical information without entanglement. However, calculating the Holevo capacity of arbitrary quantum channels is a nontrivial and computationally expensive task since it requires numerical optimization over all possible input quantum states. In this paper, we consider discrete Weyl channels (DWCs) and exploit their symmetry properties to model a DWC as a classical symmetric channel. We characterize lower and upper bounds on the Holevo capacity of DWCs using simple computational formulae. Then, we provide a necessary and sufficient condition under which the upper and lower bounds coincide. The framework in this paper enables us to characterize the exact Holevo capacity for most of the known special cases of DWCs.


Year: 2018; PMID: 30498198; PMCID: PMC6265333; DOI: 10.1038/s41598-018-35777-7

Source DB: PubMed; Journal: Sci Rep; ISSN: 2045-2322; Impact factor: 4.379

Introduction

One of the fundamental tasks in information theory is to compute the maximum rate at which information can be reliably transmitted[1,2]. Classical channels can transmit classical information only. In contrast, quantum channels are richer in terms of communication tasks[3,4]. Trivially, quantum channels are capable of transmitting quantum information. However, due to the versatile nature and unique features of quantum mechanics, it is possible to associate multiple communication tasks with a quantum channel[5]. Thus, we have the classical capacity, quantum capacity, private classical capacity, and entanglement-assisted classical capacity of a quantum channel. All of these correspond to different information communication tasks[6-9]. The calculation of the various capacities involves an optimization task that is not easy to perform. For example, the capacity of a classical channel is given by a single-letter formula (the mutual information between the input and output of the channel) maximized over the probability distribution of the input random variable[10]. Efficient methods exist that can perform this maximization[11,12]. In contrast, the capacities of a quantum channel (except the entanglement-assisted classical capacity) are given in terms of a regularization over asymptotically many channel uses. These regularized formulae are mathematically intractable in general and lead to an optimization problem that is unsolvable in practice[13]. Simplification of these formulae is not possible due to the nonadditive and nonconvex nature of the capacities of quantum channels[14-16]. The need for regularization, however, can be removed either (1) if the capacity of the channel is additive, or (2) if we restrict the optimization to individual channel uses. For example, unital qubit channels[17] and entanglement breaking channels[18] are known to be additive, and thus their classical capacity can be computed without the need for regularization.
Similarly, for the task of classical communication over a quantum channel, one can prohibit the use of input states correlated over multiple uses of the channel (effectively allowing optimization over individual channel uses only) to obtain a lower bound on the classical capacity of a quantum channel. This notion of capacity is known as the Holevo capacity. Even with such a simplification of the problem, the calculation remains considerably demanding. As a matter of fact, the calculation of the Holevo capacity falls in the category of NP-complete problems[15,19]. This multilayer difficulty has stimulated a good amount of research in the field of quantum information theory, and researchers have taken different routes to address it. To deal with the problem of regularization, different definitions of capacities have been proposed[20], analytical expressions for special channels have been found[21], and bounds that are additive and easier to calculate have been computed[22]. To deal with the difficulty of calculation, special properties of a given channel have been exploited[23], and methods that approximate the capacity up to a fixed a posteriori error have been proposed[24]. In this work we give easy-to-compute lower and upper bounds on the Holevo capacity of discrete Weyl channels (DWCs). Our approach involves modeling the DWC as a classical symmetric channel and using existing results from classical information theory to lower bound the Holevo capacity of a DWC. The upper bound is based on the majorization relation of any possible output state of a DWC with the most ordered state based on the channel parameters. We give a necessary and sufficient condition under which the two bounds coincide. We find that this condition is met for the known special cases of the DWC (the Pauli qubit channel and the qudit depolarizing channel), and hence we can recover the exact capacity expression for these cases.
Through numerical examples we show that the coincidence of two bounds is sufficient but not necessary for the lower bound to give exact capacity.

Discrete Weyl Channel

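A quantum state on the Hilbert space $\mathcal{H}$ is a positive operator $\rho$ with unit trace (i.e., a density operator). We consider a Hilbert space of finite dimension d. The state is said to be pure if it has the form $\rho = |\psi\rangle\langle\psi|$. We usually denote a pure state simply by a ket, e.g., $|\psi\rangle$, which is a column vector in the Hilbert space. A quantum channel $\mathcal{N}$ is a completely positive trace preserving (CPTP) map transforming the input state $\rho$ to an output state $\mathcal{N}(\rho)$. The map can be specified in terms of Kraus operators $\{A_k\}$ as $\mathcal{N}(\rho) = \sum_k A_k \rho A_k^{\dagger}$, where $\sum_k A_k^{\dagger} A_k = I_d$ and $I_d$ is the identity operator on the d-dimensional Hilbert space. For a random unitary channel, it is possible to represent the Kraus operators as $A_k = \sqrt{p_k}\, U_k$, such that the channel applies a unitary operator $U_k$ on the input state with probability $p_k$[25]. Let $\sigma_0 = I_2$ be the 2 × 2 identity matrix, and

$$\sigma_1 = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}, \quad \sigma_2 = \begin{bmatrix} 0 & -i \\ i & 0 \end{bmatrix}, \quad \sigma_3 = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix} \tag{1}$$

be the Pauli matrices. The Pauli qubit channel, denoted by $\mathcal{N}_{\mathrm{P}}$, is then defined as

$$\mathcal{N}_{\mathrm{P}}(\rho) = \sum_{i=0}^{3} p_i\, \sigma_i\, \rho\, \sigma_i^{\dagger}, \tag{2}$$

which is a random unitary channel. Discrete Weyl operators are a non-Hermitian generalization of Pauli operators for dimension d[26]. A Weyl operator $W_{nm}$ on the d-dimensional Hilbert space is defined as[27]

$$W_{nm} = \sum_{k=0}^{d-1} \omega^{kn}\, |k\rangle\langle (k+m) \bmod d| \tag{3}$$

for $n, m \in \{0, 1, \ldots, d-1\}$; $\omega = e^{2\pi i/d}$; and $|k\rangle$ is the kth basis vector in the computational basis (for notational convenience, the indexing of entries of vectors and matrices starts from 0). A general structure of a d-dimensional Weyl operator is shown in Fig. 1.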
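The definition of the Weyl operators can be made concrete with a minimal sketch. It assumes the convention $W_{nm} = \sum_k \omega^{kn} |k\rangle\langle (k+m) \bmod d|$ with $\omega = e^{2\pi i/d}$; the helper name `weyl` is illustrative and not taken from the paper.

```python
import numpy as np

def weyl(n, m, d):
    """Weyl operator W_nm = sum_k omega^(k n) |k><(k+m) mod d| (assumed convention)."""
    omega = np.exp(2j * np.pi / d)
    W = np.zeros((d, d), dtype=complex)
    for k in range(d):
        W[k, (k + m) % d] = omega ** (k * n)
    return W

# All d^2 operators for d = 3 are unitary.
d = 3
ops = [weyl(n, m, d) for n in range(d) for m in range(d)]
all_unitary = all(np.allclose(W @ W.conj().T, np.eye(d)) for W in ops)

# For d = 2 the operators reduce to the Pauli matrices up to a phase:
# W_01 = X, W_10 = Z and W_11 = ZX = iY.
pauli_x = np.array([[0, 1], [1, 0]], dtype=complex)
pauli_z = np.array([[1, 0], [0, -1]], dtype=complex)
matches_pauli = (np.allclose(weyl(0, 1, 2), pauli_x)
                 and np.allclose(weyl(1, 0, 2), pauli_z)
                 and np.allclose(weyl(1, 1, 2), pauli_z @ pauli_x))
```

Under this convention $W_{11}$ in d = 2 equals $i\sigma_2$ rather than $\sigma_2$, which illustrates the non-Hermitian character of the generalization.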

Figure 1

The general structure of a Weyl operator in an arbitrary dimension d.

Property 1.

A Weyl operator $W_{nm}$, when applied on a d-dimensional vector α, up-shifts the entries of α by m locations and rotates the ith entry (according to the new indexing) by a phase of $\omega^{in}$. We refer to this property as the shift and phase operation of Weyl operators. The eigenvalues $\lambda_s$ of a Weyl operator $W_{nm}$ are given in closed form by (4) (see supplementary material). A schematic illustration for the Weyl operator $W_{31}$ on a 4-dimensional Hilbert space is given in Fig. 2. Note that Weyl operators operating on a prime dimensional Hilbert space have d distinct eigenvalues, except for $W_{00}$. On the other hand, some Weyl operators of a composite dimension may have repeated eigenvalues. This repetition of eigenvalues prevents us from deriving general forms of our results directly. We circumvent this problem by first presenting our results for a Hilbert space of prime dimension, and then showing that an alternate formulation of our results can be applied to the case of a composite dimensional Hilbert space as well.
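Property 1 and the remark on repeated eigenvalues can be checked numerically. The sketch below assumes the convention $W_{nm} = \sum_k \omega^{kn} |k\rangle\langle (k+m) \bmod d|$; the helpers `weyl` and `count_distinct` are illustrative names, and the dimensions 5 and 4 are chosen only as examples of a prime and a composite d.

```python
import numpy as np

def weyl(n, m, d):
    """W_nm = sum_k omega^(k n) |k><(k+m) mod d| (assumed convention)."""
    omega = np.exp(2j * np.pi / d)
    W = np.zeros((d, d), dtype=complex)
    for k in range(d):
        W[k, (k + m) % d] = omega ** (k * n)
    return W

def count_distinct(values, tol=1e-8):
    """Number of numerically distinct complex values."""
    distinct = []
    for v in values:
        if all(abs(v - u) > tol for u in distinct):
            distinct.append(v)
    return len(distinct)

# Shift and phase: entry k of W_nm alpha is omega^(k n) * alpha[(k + m) mod d].
d, n, m = 4, 3, 1
alpha = np.arange(1, d + 1, dtype=complex)
omega = np.exp(2j * np.pi / d)
expected = np.array([omega ** (k * n) * alpha[(k + m) % d] for k in range(d)])
shift_and_phase_ok = np.allclose(weyl(n, m, d) @ alpha, expected)

# Prime d = 5: every W_nm except W_00 has 5 distinct eigenvalues.
prime_counts = [count_distinct(np.linalg.eigvals(weyl(a, b, 5)))
                for a in range(5) for b in range(5) if (a, b) != (0, 0)]

# Composite d = 4: W_20 = diag(1, -1, 1, -1) has repeated eigenvalues.
repeated_in_d4 = count_distinct(np.linalg.eigvals(weyl(2, 0, 4)))
```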

Figure 2

A schematic illustration for the structure of the discrete Weyl operator $W_{31}$ on a 4-dimensional Hilbert space. Each eigenvalue $\lambda_s$ and eigenvector $|\lambda_s\rangle$ can be found using (4) and (30), respectively.

A DWC, denoted by $\mathcal{N}$, is a generalization of the Pauli qubit channel[1], defined in terms of discrete Weyl operators as

$$\mathcal{N}(\rho) = \sum_{n=0}^{d-1} \sum_{m=0}^{d-1} p_{nm}\, W_{nm}\, \rho\, W_{nm}^{\dagger}, \tag{5}$$

where $W_{nm}$ acts on the input state with probability $p_{nm}$. The Holevo capacity of a quantum channel $\mathcal{N}$ is defined as[6,28]

$$\chi(\mathcal{N}) = \max_{\{p_i, \rho_i\}} \left[ S\!\left( \sum_i p_i\, \mathcal{N}(\rho_i) \right) - \sum_i p_i\, S\big(\mathcal{N}(\rho_i)\big) \right], \tag{6}$$

where $p_i$ is the a priori probability of the input state $\rho_i$; $S(\rho) = -\mathrm{tr}(\rho \log_2 \rho)$ is the von Neumann entropy, and $\mathcal{N}(\rho_i)$ is the output state produced by the action of the channel $\mathcal{N}$ on the input state $\rho_i$. The Holevo capacity corresponds to the maximum rate of classical information when input states are restricted to be separable, i.e., the inputs of the channel are not entangled over multiple uses.
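As a rough illustration of these two definitions, the following sketch applies a DWC to a state and evaluates the Holevo quantity of one fixed ensemble. The capacity itself is the maximum of this quantity over all ensembles, which the sketch does not attempt. The Weyl convention and all function names (`weyl`, `dwc`, `von_neumann_entropy`, `holevo_quantity`) are assumptions made for the example.

```python
import numpy as np

def weyl(n, m, d):
    """W_nm = sum_k omega^(k n) |k><(k+m) mod d| (assumed convention)."""
    omega = np.exp(2j * np.pi / d)
    W = np.zeros((d, d), dtype=complex)
    for k in range(d):
        W[k, (k + m) % d] = omega ** (k * n)
    return W

def dwc(rho, p):
    """Apply N(rho) = sum_{n,m} p[n, m] W_nm rho W_nm^dagger; p is a d x d array."""
    d = rho.shape[0]
    out = np.zeros((d, d), dtype=complex)
    for n in range(d):
        for m in range(d):
            W = weyl(n, m, d)
            out += p[n, m] * (W @ rho @ W.conj().T)
    return out

def von_neumann_entropy(rho):
    """S(rho) in bits, ignoring numerically zero eigenvalues."""
    evals = np.linalg.eigvalsh(rho)
    evals = evals[evals > 1e-12]
    return float(-np.sum(evals * np.log2(evals)))

def holevo_quantity(priors, states, p):
    """Holevo quantity of a single ensemble {priors, states} sent through the DWC."""
    outputs = [dwc(rho, p) for rho in states]
    average = sum(w * out for w, out in zip(priors, outputs))
    return von_neumann_entropy(average) - sum(
        w * von_neumann_entropy(out) for w, out in zip(priors, outputs))

# Example: a noiseless qubit channel with the computational basis as signal states.
basis = [np.diag([1.0, 0.0]).astype(complex), np.diag([0.0, 1.0]).astype(complex)]
noiseless = np.array([[1.0, 0.0], [0.0, 0.0]])
chi_noiseless = holevo_quantity([0.5, 0.5], basis, noiseless)
```

For the noiseless example the quantity equals one bit, and for a channel with all four $p_{nm}$ equal to 1/4 it drops to zero.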

Lemma 1.

If the input state of a DWC operating on a d-dimensional Hilbert space is an eigenstate of a d-dimensional Weyl operator $W_{nm}$, then the output state is diagonal in the eigenbasis of $W_{nm}$.

Proof.

See Methods section.◽

As a consequence of the above Lemma, we can choose the set of input states to be d orthogonal eigenvectors of some Weyl operator $W_{nm}$, and measure the output in the eigenbasis of $W_{nm}$. The uncertainty at the output of the channel in this case is purely classical in nature. In this sense, a DWC behaves as a classical channel, transitioning a distinguishable state into an unknown but perfectly distinguishable state. We completely characterize the simulated classical channel in terms of its channel transition matrix in the following Proposition.
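Lemma 1 can be checked numerically for a small case. The sketch below is an illustration under stated assumptions, not the authors' procedure: it uses the convention $W_{nm} = \sum_k \omega^{kn} |k\rangle\langle (k+m) \bmod d|$, draws the channel probabilities from a Dirichlet distribution (an arbitrary choice), and takes d = 3 with the eigenstates of $W_{21}$ as inputs.

```python
import numpy as np

def weyl(n, m, d):
    """W_nm = sum_k omega^(k n) |k><(k+m) mod d| (assumed convention)."""
    omega = np.exp(2j * np.pi / d)
    W = np.zeros((d, d), dtype=complex)
    for k in range(d):
        W[k, (k + m) % d] = omega ** (k * n)
    return W

def dwc(rho, p):
    """Apply N(rho) = sum_{n,m} p[n, m] W_nm rho W_nm^dagger."""
    d = rho.shape[0]
    out = np.zeros((d, d), dtype=complex)
    for n in range(d):
        for m in range(d):
            W = weyl(n, m, d)
            out += p[n, m] * (W @ rho @ W.conj().T)
    return out

rng = np.random.default_rng(0)
d, n, m = 3, 2, 1
p = rng.dirichlet(np.ones(d * d)).reshape(d, d)  # random channel probabilities

# Columns of V are eigenvectors of W_21; its eigenvalues are distinct for prime d,
# so the eigenvectors form an orthonormal basis.
_, V = np.linalg.eig(weyl(n, m, d))

max_offdiag = 0.0
for s in range(d):
    v = V[:, s]
    out = dwc(np.outer(v, v.conj()), p)
    in_eigenbasis = V.conj().T @ out @ V  # output expressed in the eigenbasis of W_21
    off = in_eigenbasis - np.diag(np.diag(in_eigenbasis))
    max_offdiag = max(max_offdiag, float(np.abs(off).max()))
```

The off-diagonal entries vanish up to floating-point error, so the output carries only a classical probability distribution over the eigenstates of $W_{21}$.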

Proposition 1

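A DWC of a prime dimension d with the orthonormal eigenstates of $W_{nm}$, $(n, m) \neq (0, 0)$, as the input states behaves as a classical symmetric channel with the following transition matrix

$$\mathbf{P}_{nm} = \begin{bmatrix} P_0 & P_1 & \cdots & P_{d-1} \\ P_{d-1} & P_0 & \cdots & P_{d-2} \\ \vdots & \vdots & \ddots & \vdots \\ P_1 & P_2 & \cdots & P_0 \end{bmatrix}, \tag{7}$$

where

$$P_k = \sum_{(i,j):\, (mi - nj) \bmod d = k} p_{ij}. \tag{8}$$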

Proof.

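See Methods section.◽

As an example, a DWC driven by the eigenstates of $W_{21}$ with d = 3 is shown in Fig. 3. In this example, the three transition probabilities are the sums of $p_{ij}$ over the index classes {(0, 0), (1, 2), (2, 1)}, {(0, 1), (1, 0), (2, 2)}, and {(0, 2), (1, 1), (2, 0)}.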

Figure 3

An example DWC for d = 3 driven by the eigenstates of $W_{21}$.
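The simulated classical channel of Proposition 1 can be built numerically for this d = 3 example. The sketch assumes the convention $W_{nm} = \sum_k \omega^{kn} |k\rangle\langle (k+m) \bmod d|$ and a randomly drawn probability array `p`; the function `transition_matrix` is a hypothetical helper that measures each output in the eigenbasis of $W_{nm}$.

```python
import numpy as np

def weyl(n, m, d):
    """W_nm = sum_k omega^(k n) |k><(k+m) mod d| (assumed convention)."""
    omega = np.exp(2j * np.pi / d)
    W = np.zeros((d, d), dtype=complex)
    for k in range(d):
        W[k, (k + m) % d] = omega ** (k * n)
    return W

def dwc(rho, p):
    """Apply N(rho) = sum_{i,j} p[i, j] W_ij rho W_ij^dagger."""
    d = rho.shape[0]
    out = np.zeros((d, d), dtype=complex)
    for i in range(d):
        for j in range(d):
            W = weyl(i, j, d)
            out += p[i, j] * (W @ rho @ W.conj().T)
    return out

def transition_matrix(p, n, m):
    """P[s, t] = probability that eigenstate s of W_nm is received as eigenstate t."""
    d = p.shape[0]
    vals, V = np.linalg.eig(weyl(n, m, d))
    V = V[:, np.argsort(np.angle(vals))]  # order eigenstates along the unit circle
    P = np.zeros((d, d))
    for s in range(d):
        out = dwc(np.outer(V[:, s], V[:, s].conj()), p)
        for t in range(d):
            P[s, t] = np.real(V[:, t].conj() @ out @ V[:, t])
    return P

rng = np.random.default_rng(0)
p = rng.dirichlet(np.ones(9)).reshape(3, 3)
P = transition_matrix(p, 2, 1)

# Symmetric channel: every row and every column is a permutation of the first row.
rows_are_permutations = all(np.allclose(np.sort(P[s]), np.sort(P[0])) for s in range(3))
cols_are_permutations = all(np.allclose(np.sort(P[:, t]), np.sort(P[0])) for t in range(3))
```

The entries of each row coincide with the sums of `p` over the three index classes {(0, 0), (1, 2), (2, 1)}, {(0, 1), (1, 0), (2, 2)}, and {(0, 2), (1, 1), (2, 0)}.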

Results

Based on Proposition 1, we give the following simple and natural lower bound on the Holevo capacity of a DWC:

Theorem 1

The Holevo capacity of the channel in (5) with a prime d is bounded as

$$\chi(\mathcal{N}) \ge \log_2 d - \min_{(n,m) \neq (0,0)} H(\mathbf{r}_{nm}), \tag{9}$$

where $\mathbf{r}_{nm}$ is any row of $\mathbf{P}_{nm}$, the channel transition matrix of the (n, m)th symmetric channel obtained by fixing the eigenstates of $W_{nm}$ as the signal states, and $H(\cdot)$ is the Shannon entropy.

Proof.

See Methods section.◽

The restriction on d to be a prime number is primarily because the repetition of eigenvalues of $W_{nm}$ for a composite d does not allow us to construct the channel transition matrix $\mathbf{P}_{nm}$. The following remark provides an alternative approach to lower bound the Holevo capacity of a DWC of any d.
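The lower bound of Theorem 1 needs no eigendecomposition once the rows of the transition matrices are written as class sums of the channel probabilities. The sketch below assumes a prime d, the convention $W_{nm} = \sum_k \omega^{kn} |k\rangle\langle (k+m) \bmod d|$, and the resulting grouping rule $(mi - nj) \bmod d$; the function names are illustrative.

```python
import numpy as np

def shannon(r):
    """Shannon entropy in bits, ignoring zero entries."""
    r = np.asarray(r, dtype=float)
    r = r[r > 1e-15]
    return float(-np.sum(r * np.log2(r)))

def class_sums(p, n, m):
    """One row of the transition matrix for input eigenstates of W_nm (assumed convention)."""
    d = p.shape[0]
    row = np.zeros(d)
    for i in range(d):
        for j in range(d):
            row[(m * i - n * j) % d] += p[i, j]
    return row

def holevo_lower_bound(p):
    """log2(d) minus the smallest row entropy over all (n, m) != (0, 0); d assumed prime."""
    d = p.shape[0]
    best = min(shannon(class_sums(p, n, m))
               for n in range(d) for m in range(d) if (n, m) != (0, 0))
    return float(np.log2(d) - best)

# Example: a d = 3 channel with phase errors only (W_00, W_10, W_20).
p = np.zeros((3, 3))
p[0, 0], p[1, 0], p[2, 0] = 0.7, 0.2, 0.1
lb_dephasing = holevo_lower_bound(p)
```

In this example the eigenstates of $W_{10}$ (the computational basis) are unaffected by the phase errors, so the bound reaches $\log_2 3$.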

Remark 1.

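It is straightforward to show that when d is prime, $H(\mathbf{r}_{nm}) = S\big(\mathcal{N}(\rho_{nm})\big)$, where $\mathbf{r}_{nm}$ is a row of the transition matrix $\mathbf{P}_{nm}$ and $\rho_{nm}$ is the density matrix of any eigenstate of $W_{nm}$. Therefore, we can equivalently calculate

$$\chi_{\mathrm{LB}} = \log_2 d - \min_{(n,m) \neq (0,0)} S\big(\mathcal{N}(\rho_{nm})\big) \tag{10}$$

for prime d. Then, we can extend (10) to any d by replacing the optimization over any $\rho$ in (20) with the optimization over the eigenstates of $W_{nm}$ only.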

Theorem 2.

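Let us define a vector $\mathbf{q}$ such that

$$\mathbf{q} = \mathbf{M}\, \mathbf{p}^{\downarrow}, \tag{11}$$

where the elements of $\mathbf{p}^{\downarrow}$ are the elements of the vector $\mathbf{p}$ in descending order; the matrix $\mathbf{M}$ is given by

$$\mathbf{M} = \begin{bmatrix} \mathbf{1}^T & \mathbf{0}^T & \cdots & \mathbf{0}^T \\ \mathbf{0}^T & \mathbf{1}^T & \cdots & \mathbf{0}^T \\ \vdots & \vdots & \ddots & \vdots \\ \mathbf{0}^T & \mathbf{0}^T & \cdots & \mathbf{1}^T \end{bmatrix}, \tag{12}$$

where $(\cdot)^T$ denotes the transpose operation, and $\mathbf{1}$ and $\mathbf{0}$ are all-one and all-zero vectors of d elements, respectively. Then, the Holevo capacity of a DWC is upper bounded as

$$\chi(\mathcal{N}) \le \log_2 d - H(\mathbf{q}), \tag{13}$$

where $\mathbf{p}$ is the vector of length $d^2$ whose elements $p_{nm}$ are the probabilities associated with the respective Weyl operators $W_{nm}$.

Proof.

See Methods section.◽

In a d-dimensional Hilbert space, $d^2$ Weyl operators are defined, whose indices are given in the form of 2-tuples, e.g., (i, j). We define a set $\mathcal{I}$ that contains all the $d^2$ indices of the defined Weyl operators. We call a set $\mathcal{D} = \{D_0, \ldots, D_{d-1}\}$ a d-set if all its elements $D_k$ for $k \in \{0, \ldots, d-1\}$ are non-overlapping d-element subsets of $\mathcal{I}$, i.e., $D_k \subset_d \mathcal{I}$ and $D_k \cap D_l = \emptyset$ for $k \neq l$, where $A \subset_d B$ means that A is a d-element subset of B, $\emptyset$ is the empty set, and $A \cap B$ gives the set whose elements are the common elements of A and B. In the d-dimensional Hilbert space, there are

$$\frac{1}{d!} \prod_{k=0}^{d-1} \binom{d^2 - kd}{d}$$

different possible d-sets, where $\binom{a}{b} = \frac{a!}{b!\,(a-b)!}$ are the binomial coefficients. A d-set whose elements all satisfy the property

$$(mi - nj) \bmod d = k_l \quad \text{for all } (i, j) \in D_l \tag{16}$$

for some n, m, and some constants $k_l$ is called an achievable d-set. For example, for d = 3, the d-set {{(0, 0), (1, 2), (2, 1)}, {(0, 1), (1, 0), (2, 2)}, {(0, 2), (1, 1), (2, 0)}} is an achievable d-set for (n, m) = (2, 1), whereas a d-set for which no pair (n, m) satisfies (16) on all of its elements is not achievable.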
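The upper bound of Theorem 2 amounts to sorting the channel probabilities and summing them in consecutive blocks of d. A minimal sketch follows, with `q_vector` and `holevo_upper_bound` as illustrative names and the block-summing matrix built as a Kronecker product (one way, assumed here, to realize the block structure described for the matrix in the theorem).

```python
import numpy as np

def shannon(r):
    """Shannon entropy in bits, ignoring zero entries."""
    r = np.asarray(r, dtype=float)
    r = r[r > 1e-15]
    return float(-np.sum(r * np.log2(r)))

def q_vector(p):
    """Sort the d^2 probabilities in nonincreasing order and sum consecutive blocks of d."""
    d = p.shape[0]
    p_sorted = np.sort(p.ravel())[::-1]
    M = np.kron(np.eye(d), np.ones((1, d)))  # d x d^2 matrix with rows [1^T 0^T ...], ...
    return M @ p_sorted

def holevo_upper_bound(p):
    """log2(d) - H(q) for the block sums q of the sorted probabilities."""
    d = p.shape[0]
    return float(np.log2(d) - shannon(q_vector(p)))

# Example: a Pauli qubit channel with probabilities 0.5, 0.3, 0.15, 0.05.
p = np.array([[0.5, 0.3], [0.15, 0.05]])
q = q_vector(p)
ub = holevo_upper_bound(p)
```

Here `q` collects 0.5 + 0.3 and 0.15 + 0.05, so the bound is one minus the binary entropy of 0.2.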

Theorem 3.

We arrange the elements of $\mathbf{p}$ in nonincreasing order and collect the indices of $\mathbf{p}$ while preserving the order to form a d-set. The bounds of Theorem 1 and Theorem 2 coincide if and only if (resp. only if) the obtained d-set is achievable and d is a prime number (resp. a composite number).

Proof.

See Methods section.◽
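The condition of Theorem 3 can be tested mechanically for a prime d. The sketch below is one possible reading of it, assuming that a d-set is achievable when some pair (n, m) other than (0, 0) makes $(mi - nj) \bmod d$ constant on each of its elements; this rule follows from the convention $W_{nm} = \sum_k \omega^{kn} |k\rangle\langle (k+m) \bmod d|$ and may differ in labeling from the paper's own statement. The names `sorted_d_set` and `achieving_pair`, and both example d-sets, are hypothetical.

```python
from itertools import product

def sorted_d_set(p):
    """Indices of p in nonincreasing order of probability, cut into d blocks of d indices."""
    d = len(p)
    order = sorted(((i, j) for i in range(d) for j in range(d)),
                   key=lambda ij: -p[ij[0]][ij[1]])
    return [set(order[k * d:(k + 1) * d]) for k in range(d)]

def achieving_pair(d_set, d):
    """Return a pair (n, m) making (m*i - n*j) mod d constant on every block, else None."""
    for n, m in product(range(d), repeat=2):
        if (n, m) == (0, 0):
            continue
        if all(len({(m * i - n * j) % d for (i, j) in block}) == 1 for block in d_set):
            return (n, m)
    return None

# d = 3 probabilities that are constant on the classes (i + j) mod 3.
p = [[0.2 if (i + j) % 3 == 0 else 0.1 if (i + j) % 3 == 1 else 1 / 30
      for j in range(3)] for i in range(3)]
good = sorted_d_set(p)
pair_for_good = achieving_pair(good, 3)

# A d-set that no pair (n, m) can produce.
bad = [{(0, 0), (0, 1), (1, 0)}, {(0, 2), (1, 1), (1, 2)}, {(2, 0), (2, 1), (2, 2)}]
pair_for_bad = achieving_pair(bad, 3)
```

The search returns the first pair that generates the partition, which need not be (2, 1) itself because multiples of a pair generate the same classes. When several probabilities are equal, the sorted order is not unique, and a different tie-breaking rule may be needed to find an achievable ordering.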

Remark 2.

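If the two bounds coincide, we have $\chi(\mathcal{N}) = \chi_{\mathrm{LB}} = \chi_{\mathrm{UB}}$, where $\chi_{\mathrm{LB}}$ and $\chi_{\mathrm{UB}}$ denote the lower bound of Theorem 1 and the upper bound of Theorem 2, respectively. However, the converse is not true, as will be shown by the numerical examples in the next section.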

Discussion

An efficient approximation for the capacity of classical-quantum channels has previously been discussed without exploiting any special properties of a given channel[24]. For example, it takes 40,154 seconds to approximate the Holevo capacity of a Pauli qubit channel with an a posteriori error of $1.940 \times 10^{-3}$. In contrast to existing methods, the average time to calculate the (lower) bound in this paper is of the order of $10^{-4}$ seconds even for large d, by virtue of the use of special properties of DWCs. We have strong numerical evidence that the lower bound is the tighter of the two and is often saturated even when the two bounds do not coincide, as shown in Fig. 4(a–c), where the upper (χUB) and the lower (χLB) bounds (normalized by log2(d)) are plotted for 1200 random channel realizations for d = 3, 4, and 5, respectively. In these figures, the Holevo capacity obtained by using[23]

$$\chi(\mathcal{N}) = \log_2 d - \min_{\rho} S\big(\mathcal{N}(\rho)\big) \tag{20}$$

with the optimization performed via a genetic algorithm (χGA) is also presented. A comparison of χLB, χUB, and χGA shows that both the frequency of coincidence of the two bounds and the frequency of saturation of the lower bound are higher for d = 3 than for d = 4 and 5.
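A small experiment in the spirit of this comparison can be sketched as follows. It only compares the two bounds with each other (it does not compute χGA), it draws random channels from a flat Dirichlet distribution, which is an assumption since the sampling used for Fig. 4 is not stated here, and it relies on the class-sum rule $(mi - nj) \bmod d$ for prime d that follows from the assumed Weyl convention.

```python
import numpy as np

def shannon(r):
    """Shannon entropy in bits, ignoring zero entries."""
    r = np.asarray(r, dtype=float)
    r = r[r > 1e-15]
    return float(-np.sum(r * np.log2(r)))

def lower_bound(p):
    """log2(d) minus the smallest class-sum entropy over (n, m) != (0, 0); d assumed prime."""
    d = p.shape[0]
    entropies = []
    for n in range(d):
        for m in range(d):
            if (n, m) == (0, 0):
                continue
            row = np.zeros(d)
            for i in range(d):
                for j in range(d):
                    row[(m * i - n * j) % d] += p[i, j]
            entropies.append(shannon(row))
    return float(np.log2(d) - min(entropies))

def upper_bound(p):
    """log2(d) minus the entropy of the block sums of the sorted probabilities."""
    d = p.shape[0]
    q = np.sort(p.ravel())[::-1].reshape(d, d).sum(axis=1)
    return float(np.log2(d) - shannon(q))

rng = np.random.default_rng(1)
d, trials = 3, 200
gaps = np.array([
    upper_bound(p) - lower_bound(p)
    for p in (rng.dirichlet(np.ones(d * d)).reshape(d, d) for _ in range(trials))
])
coincide_fraction = float(np.mean(gaps < 1e-9))  # share of realizations with equal bounds
```

Every gap is nonnegative, as expected, because each row of class sums is majorized by the block sums of the sorted probabilities.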

Figure 4

χUB, χLB, and χGA of random channel realizations (in decreasing order of χUB) when d = 3, 4, 5.

Our bounds not only ease the requirement of optimization for the calculation of tight bounds for a general DWC, but also allow us to recover the analytic expressions for the special cases of the DWC. For example, here we recover the analytic expression for the classical capacity of a qudit depolarizing channel using the approach developed above. A quantum depolarizing channel transforms an input state $\rho$ to the output state according to the map (21), a mixture of $\rho$ and $\pi$, where $\pi = I_d/d$ is the maximally mixed state on the output Hilbert space. In terms of Weyl operators,

$$\pi = \frac{1}{d^2} \sum_{n=0}^{d-1} \sum_{m=0}^{d-1} W_{nm}\, \rho\, W_{nm}^{\dagger}. \tag{22}$$

Thus, we can rewrite equation (21) as a DWC in which all the Weyl operators other than $W_{00}$ act with the same probability. Therefore, all $p_{nm}$ with $(n, m) \neq (0, 0)$ are equal, which shows that all d-sets (whether achievable or not) are equivalent in terms of the summation of $\mathbf{p}$ over their elements. Therefore, we can choose an ordering of $\mathbf{p}$ such that the condition of Theorem 3 is satisfied and we can use equation (13) to calculate the Holevo capacity. From equation (21) and the vector $\mathbf{q}$ of (11), we see that all elements of $\mathbf{q}$ except the first are equal. Thus, the Holevo capacity of this channel is given by (13) evaluated at this $\mathbf{q}$, i.e., by (26), which is equal to the classical capacity of the quantum depolarizing channel[21]. Additionally, it is easy to see that for a Pauli qubit channel (d = 2), there are 3 possible d-sets, which are all achievable. Therefore, both bounds are exact for the Pauli qubit channel (and all its special cases). With simple algebraic manipulations one can obtain the analytic expressions for the capacities of any of the special cases of the Pauli qubit channel[24]. From Theorem 3, we can also define special channels for which the two bounds always coincide. This approach gives us a class of quantum channels whose exact Holevo capacity can readily be calculated. We define two such channels here and call them the one-parameter and two-parameter depolarizing-like channels, respectively. The one-parameter depolarizing-like channel is defined in (27); its exact Holevo capacity is the same as (26) with the depolarizing parameter ξ.
The two-parameter depolarizing-like channel is defined in (28). This channel is a further generalization of the one-parameter depolarizing-like channel. The exact Holevo capacity of this channel can readily be calculated by Theorem 3.

In this work we modeled a DWC as a classical symmetric channel for the task of classical communication. Through this modeling, we presented a simple-to-compute lower bound on the Holevo capacity of a given DWC of arbitrary dimension. We also gave an intuitive upper bound which coincides with the lower bound under a certain condition. This condition (necessary and sufficient for a prime d, and necessary for a composite d), however, is not frequently met, even though the lower bound frequently coincides with the actual Holevo capacity, as shown by the numerical examples. The lower bound was derived by noting the similarity of a quantum channel with a classical channel. An interesting future direction is to find similar cases where the results of classical information theory (which is more mature despite being a special case of quantum information theory) can be applied to the problems of quantum information theory with little or no modification. Similarly, based on the equality of the upper and lower bounds, one can define special channels for which these bounds always coincide. Such a characterization of quantum channels can give us a class of channels whose exact Holevo capacity can readily be calculated.
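The depolarizing example from the discussion can be reproduced numerically. The sketch assumes the parametrization $\mathcal{N}(\rho) = (1 - \xi)\rho + \xi I_d/d$, which may differ from the one used in (21), together with the class-sum rule $(mi - nj) \bmod d$ for prime d. The closed form is the known classical capacity of the depolarizing channel written in this assumed parametrization.

```python
import numpy as np

def shannon(r):
    """Shannon entropy in bits, ignoring zero entries."""
    r = np.asarray(r, dtype=float)
    r = r[r > 1e-15]
    return float(-np.sum(r * np.log2(r)))

def depolarizing_probs(xi, d):
    """Weyl probabilities of N(rho) = (1 - xi) rho + xi I/d (assumed parametrization)."""
    p = np.full((d, d), xi / d**2)
    p[0, 0] += 1.0 - xi
    return p

def lower_bound(p):
    """Best class-sum bound over (n, m) != (0, 0); d assumed prime."""
    d = p.shape[0]
    best = np.inf
    for n in range(d):
        for m in range(d):
            if (n, m) == (0, 0):
                continue
            row = np.zeros(d)
            for i in range(d):
                for j in range(d):
                    row[(m * i - n * j) % d] += p[i, j]
            best = min(best, shannon(row))
    return float(np.log2(d) - best)

def upper_bound(p):
    """log2(d) minus the entropy of the block sums of the sorted probabilities."""
    d = p.shape[0]
    q = np.sort(p.ravel())[::-1].reshape(d, d).sum(axis=1)
    return float(np.log2(d) - shannon(q))

def closed_form(xi, d):
    """log2(d) - H(q) with q = (1 - xi + xi/d, xi/d, ..., xi/d), valid for 0 < xi <= 1."""
    a, b = 1.0 - xi + xi / d, xi / d
    return float(np.log2(d) + a * np.log2(a) + (d - 1) * b * np.log2(b))

d, xi = 3, 0.3
p = depolarizing_probs(xi, d)
lb, ub, exact = lower_bound(p), upper_bound(p), closed_form(xi, d)
```

All three numbers agree, which is the numerical counterpart of the statement that both bounds are exact for the depolarizing channel.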

Methods

Proof of Lemma 1

Since the DWC is a random unitary channel, the output of the channel is merely the state obtained by randomly applying one of the $d^2$ Weyl operators on the input. Thus, we need to show that the operation of $W_{ij}$ on any eigenstate of $W_{nm}$ results in an eigenstate of $W_{nm}$. Let

$$|\lambda\rangle = [\alpha_0, \alpha_1, \ldots, \alpha_{d-1}]^T \tag{29}$$

be a normalized eigenvector of $W_{nm}$ with the corresponding eigenvalue λ. From the eigenvalue relation $W_{nm}|\lambda\rangle = \lambda|\lambda\rangle$, and due to Property 1, we get the following relation among the entries of the vector of (29):

$$\omega^{kn}\, \alpha_{(k+m) \bmod d} = \lambda\, \alpha_k, \tag{30}$$

where the eigenvalues λ are equidistant points on the unit circle (see Fig. 2). Since we have obtained this relation from the eigenvector condition, any vector satisfying the above relation is an eigenvector of $W_{nm}$. Now let us consider the effect of any $W_{ij}$ on the vector of (29). To this end, we let $|\beta\rangle = W_{ij}|\lambda\rangle$, and recall Property 1 again to write

$$\beta_k = \omega^{ki}\, \alpha_{(k+j) \bmod d},$$

i.e., the kth entry of $|\beta\rangle$ is $\omega^{ki} \alpha_{(k+j) \bmod d}$. If the elements of $|\beta\rangle$ exhibit a relation similar to (30), $|\beta\rangle$ is also an eigenvector of $W_{nm}$. Using (30) gives the following relation between the entries of $|\beta\rangle$:

$$\omega^{kn}\, \beta_{(k+m) \bmod d} = \omega^{mi - nj}\, \lambda\, \beta_k,$$

which bears essentially the same form as (30), because $\omega^{mi - nj}\lambda$ is another eigenvalue of $W_{nm}$. Hence the vector $|\beta\rangle$ is an eigenvector of $W_{nm}$. Since the output state is a statistical mixture of orthonormal eigenstates of $W_{nm}$, it is diagonal in the same basis, i.e., in the eigenbasis of $W_{nm}$.
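The same conclusion can be reached in one line from a commutation relation between Weyl operators. The derivation below is a sketch that assumes the convention $W_{nm} = \sum_k \omega^{kn} |k\rangle\langle (k+m) \bmod d|$ with all ket indices taken modulo d; it is offered as a complementary view rather than as the argument used in the proof above.

```latex
W_{nm} W_{ij}
  = \sum_{k=0}^{d-1} \omega^{kn + (k+m)i}\, |k\rangle\langle k+m+j|
  = \omega^{mi - nj} \sum_{k=0}^{d-1} \omega^{ki + (k+j)n}\, |k\rangle\langle k+j+m|
  = \omega^{mi - nj}\, W_{ij} W_{nm},
\qquad\text{so that}\qquad
W_{nm}\bigl(W_{ij}|\lambda\rangle\bigr)
  = \omega^{mi - nj}\, W_{ij} W_{nm} |\lambda\rangle
  = \omega^{mi - nj}\, \lambda\, \bigl(W_{ij}|\lambda\rangle\bigr).
```

The middle step uses $kn + (k+m)i = ki + (k+j)n + (mi - nj)$, so $W_{ij}$ maps an eigenvector of $W_{nm}$ with eigenvalue λ to an eigenvector with eigenvalue $\omega^{mi - nj}\lambda$.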

Proof of Proposition 1

Let the input state be an eigenstate |λ〉 of $W_{nm}$ corresponding to the eigenvalue λ. From the proof of Lemma 1, the application of $W_{ij}$ transforms the input state to the eigenstate of $W_{nm}$ corresponding to the eigenvalue $\omega^{mi - nj}\lambda$. Since $\omega^d = 1$, the exponent $(mi - nj) \bmod d$ is always from the set $\{0, 1, \ldots, d-1\}$. Therefore, we can define

$$P_k = \sum_{(i,j):\, (mi - nj) \bmod d = k} p_{ij}$$

as the transition probability of |λ〉 to the eigenstate with eigenvalue $\omega^{k}\lambda$. We can define the complete set of transition probabilities $P_k$, for $k \in \{0, 1, \ldots, d-1\}$, only if $W_{nm}$ does not have any repeated eigenvalues, which is guaranteed only if d is prime and $(n, m) \neq (0, 0)$ (note the similarity between this exponent and the expression for s in the definition of the eigenvalues). Furthermore, we notice that the rows of $\mathbf{P}_{nm}$ are permutations of each other and its columns are permutations of each other. Therefore, $\mathbf{P}_{nm}$ in (7) defines a classical symmetric channel.

Proof of Theorem 1

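From Proposition 1 we know that in this setting the DWC acts as a classical symmetric channel. The capacity of a symmetric channel with d inputs and outputs is given by[2]

$$C = \log_2 d - H(\mathbf{r}),$$

where $\mathbf{r}$ is any row of the channel transition matrix. Since we have restricted our input states to be the eigenstates of Weyl operators,

$$\chi(\mathcal{N}) \ge \max_{(n,m) \neq (0,0)} \left[ \log_2 d - H(\mathbf{r}_{nm}) \right],$$

where $\mathbf{r}_{nm}$ is a row of $\mathbf{P}_{nm}$, and the condition $(n, m) \neq (0, 0)$, along with the condition that d is prime, ensures that we can model the given DWC as a classical symmetric channel with the channel transition matrix $\mathbf{P}_{nm}$ by virtue of Proposition 1.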

Proof of Theorem 2

For a vector $\mathbf{x} \in \mathbb{R}^n$, we denote by $\mathbf{x}^{\downarrow} = [x_1^{\downarrow}, \ldots, x_n^{\downarrow}]$ the vector of the elements of $\mathbf{x}$ rearranged in nonincreasing order. We write $\mathbf{x} \prec \mathbf{y}$ and say that $\mathbf{x}$ is majorized by $\mathbf{y}$ if

$$\sum_{i=1}^{k} x_i^{\downarrow} \le \sum_{i=1}^{k} y_i^{\downarrow}, \quad k = 1, \ldots, n,$$

with equality when k = n. For two Hermitian operators A and B, we write $A \prec B$ if $\lambda(A) \prec \lambda(B)$, where $\lambda(A)$ is the vector of eigenvalues of A. Let $\rho^{*}$ be the optimal input state; then the Holevo capacity of a DWC is[23]

$$\chi(\mathcal{N}) = \log_2 d - S\big(\mathcal{N}(\rho^{*})\big). \tag{37}$$

We can rewrite (13) as

$$\chi(\mathcal{N}) \le \log_2 d - S(\sigma), \tag{38}$$

where $\sigma$ is some state with the eigenvalues given by the elements of $\mathbf{q}$. Comparing (37) and (38), our claim simplifies to

$$S(\sigma) \le S\big(\mathcal{N}(\rho^{*})\big),$$

or, from the Schur concavity of the von Neumann entropy[29],

$$\mathcal{N}(\rho^{*}) \prec \sigma. \tag{40}$$

The eigendecomposition of $\sigma$ can be written in terms of some pure states and some unitary operators relating them. We note that we are free to choose any $\sigma$ as long as it has the eigenvalues $\mathbf{q}$; this freedom translates to the choice of these pure states, and hence of the unitary operators. Equation (40) is true if and only if[30, Theorem 5]

$$\mathcal{N}(\rho^{*}) = \sum_j s_j\, U_j\, \sigma\, U_j^{\dagger}$$

for some probability vector $\mathbf{s}$ with elements $s_j$ and some unitary matrices $U_j$. Writing both sides in terms of pure states and using[30, Theorem 4], we can assume without loss of generality that the pure states involved form orthonormal bases, one of which is the computational basis. Under these assumptions, the remaining unitary is the change-of-basis unitary to the computational basis. An explicit choice of the unitaries and of the probability vector (whose indexing is arbitrary except for j = 0) then satisfies the required relation as well as the orthogonality of the basis states. Therefore, (13) is an upper bound on the Holevo capacity of a DWC.
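The majorization relation and the Schur concavity of entropy used in this proof can be illustrated on probability vectors. The sketch below checks that a vector of class sums is majorized by the block sums of the sorted probabilities; the helper names and the class-sum rule $(mi - nj) \bmod d$ for (n, m) = (2, 1) in d = 3 are assumptions made for the example.

```python
import numpy as np

def is_majorized_by(x, y, tol=1e-12):
    """True if x is majorized by y: sorted partial sums of y dominate those of x."""
    xs = np.sort(np.asarray(x, dtype=float))[::-1]
    ys = np.sort(np.asarray(y, dtype=float))[::-1]
    if xs.size != ys.size or abs(xs.sum() - ys.sum()) > tol:
        return False
    return bool(np.all(np.cumsum(xs) <= np.cumsum(ys) + tol))

def shannon(r):
    """Shannon entropy in bits, ignoring zero entries."""
    r = np.asarray(r, dtype=float)
    r = r[r > 1e-15]
    return float(-np.sum(r * np.log2(r)))

rng = np.random.default_rng(2)
d = 3
p = rng.dirichlet(np.ones(d * d)).reshape(d, d)

# q: block sums of the sorted probabilities (the vector behind the upper bound).
q = np.sort(p.ravel())[::-1].reshape(d, d).sum(axis=1)

# r: class sums for input eigenstates of W_21 (one row of the transition matrix).
r = np.zeros(d)
for i in range(d):
    for j in range(d):
        r[(1 * i - 2 * j) % d] += p[i, j]

r_majorized_by_q = is_majorized_by(r, q)
entropy_order_ok = shannon(r) >= shannon(q) - 1e-12  # Schur concavity
```

Because the entropy of `r` cannot be smaller than that of `q`, the lower bound built from `r` never exceeds the upper bound built from `q`.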

Proof of Theorem 3

We first observe that the condition on the summation in (8) for the lower bound and the condition (16) for a d-set to be achievable are essentially the same and result in the same d-element partitioning and ordering of $\mathbf{p}$. Thus, in a prime dimension d, every achievable d-set corresponds to a classical symmetric channel that can be simulated by the DWC for some n, m. On the other hand, the upper bound is obtained by ordering the elements of $\mathbf{p}$ in nonincreasing order. Therefore, the achievability of the d-set formed by the indices of $\mathbf{p}$, when its elements are arranged in nonincreasing order, is sufficient for the existence of a simulated classical symmetric channel of prime dimension that achieves the upper bound. Similarly, since the correspondence of achievable d-sets to simulated classical symmetric channels is bijective, the coincidence of the two bounds necessarily implies the achievability of the d-set formed above. For a composite d, the correspondence between the simulated classical symmetric channels and the achievable d-sets is only injective. Therefore, the above condition is necessary but no longer sufficient for the coincidence of the two bounds.
