| Literature DB >> 34985960 |
Abhinav Deshpande1,2,3, Arthur Mehta4,5, Trevor Vincent4, Nicolás Quesada4,6, Marcel Hinsche7, Marios Ioannou7, Lars Madsen4, Jonathan Lavoie4, Haoyu Qi4, Jens Eisert7,8,9, Dominik Hangleiter1,7, Bill Fefferman10, Ish Dhand11.
Abstract
Photonics is a promising platform for demonstrating a quantum computational advantage (QCA) by outperforming the most powerful classical supercomputers on a well-defined computational task. Despite this promise, existing proposals and demonstrations face challenges. Experimentally, current implementations of Gaussian boson sampling (GBS) lack programmability or have prohibitive loss rates. Theoretically, there is a comparative lack of rigorous evidence for the classical hardness of GBS. In this work, we make progress in improving both the theoretical evidence and experimental prospects. We provide evidence for the hardness of GBS, comparable to the strongest theoretical proposals for QCA. We also propose a QCA architecture we call high-dimensional GBS, which is programmable and can be implemented with low loss using few optical components. We show that particular algorithms for simulating GBS are outperformed by high-dimensional GBS experiments at modest system sizes. This work thus opens the path to demonstrating QCA with programmable photonic processors.Entities:
Year: 2022 PMID: 34985960 PMCID: PMC8730598 DOI: 10.1126/sciadv.abi7894
Source DB: PubMed Journal: Sci Adv ISSN: 2375-2548 Impact factor: 14.136
Fig. 1.Different representations of a D = 2D optical delay GBS instance with lattice size a = 3.
(A) Circuit representation. The vertical lines with dots at the end represent beam splitters. (B) Bidimensional lattice representation. The vertices of the lattice represent the modes, while edges represent beam splitters. (C) Optical circuit representation. The modes are defined by time-bins traveling in a waveguide. The horizontal gray slabs at the bottom of the delays represent the beam splitters. The number of cycles C in a high-dimensional GBS instance corresponds to applying multiple times the gates contained in the green-dotted box in (A). This action physically maps to using concatenating C copies of the delays encircled in the green box in (C). Note that for simplicity, we have not shown the photon-number detectors used to probe the quantum state at the end of the circuit.
Fig. 2.Absolute values of the entries of the unitary matrices associated with two high-dimensional GBS instances drawn from U.
On the left, we show an (a = 6, D = 3, C = 1) instance, and on the right, we show an (a = 15, D = 2, C = 2) instance. Note that we explicitly color the zero entries of the unitary white; thus, the color scale is discontinuous at this end.
Fig. 3.Distribution of the total photon number for M = 216 single mode–squeezed states with squeezing parameter r = 0.8.
We assume a total transmission of η = 0.5 (corresponding to roughly 3 dB of loss) for the lossy distribution. Note that the lossless distribution has no support on odd numbers of photons, which explains why visually it looks as if it has more area under the curve.
Fig. 4.The time cost of calculating a Hafnian of size n in double precision.
The stars indicate actual sizes computed in the Niagara supercomputer (). The blue line is a fit to tNiagara(n) = cNiagaran32 with the only fitting parameter cNiagara = 5.42 × 10−15 s. The standard deviation of fitting parameter cNiagara is 1.2 × 10−16 s, which would give error bands thinner than the width of the line. We find an equivalent expected time in Fugaku, among the most powerful supercomputers, by considering the ratio of their Rmax scores (maximal LINPACK performance achieved) giving their performance in number of floating point operations per second. The conversion factor between the left scale for Niagara and the right scale for Fugaku is the ratio of Rmax values of Fugaku and Niagara, or equivalently, cNiagara/cFugaku = 122.8. Note that since the computation of Hafnians can be broken into the independent calculation of an exponential number of summands (known as an embarrassingly parallel computation), this scaling is expected to be quite accurate.
Benchmarks for a D = 3 high-dimensional GBS instance with minimal Fock space cutoff c = 4.
The first column gives the number of lattice points, from which the number of modes follows M = a3. The second column is the expected run time in Fugaku. This time is obtained by estimating the number of floating point operations required to contract the tensor using cotengra () and converting this into a time by using the Rmax floating point operation per second score for Fugaku. Note that cotengra implements randomized algorithms; thus, for each problem size, we run it 200 times and confirm that, after the first 100 runs, there is no significant variation in the best score found. The last column gives the number of elements of the largest tensor ever needed to be stored in memory during the contraction. Note that this places restrictions on the RAM available in each of the nodes of a supercomputer. In particular, the nodes in Fugaku have up to 32 gigabyte of RAM, allowing to store on the order of 4 × 109 64-bit floating point numbers; thus, an a = 6 instance will far exceed the required capacity of a single node requiring distributed storage and thus subsequent hit in efficiency due to communication complexity.
|
|
|
|
| 4 | 1.65 × 10−1 | 4.39 × 1012 |
| 5 | 4.56 × 105 | 4.61 × 1018 |
| 6 | 2.11 × 1014 | 7.92 × 1028 |