| Literature DB >> 32704032 |
Karol Bartkiewicz1,2, Clemens Gneiting3, Antonín Černoch4, Kateřina Jiráková5, Karel Lemr6, Franco Nori3,7.
Abstract
We implement an all-optical setup demonstrating kernel-based quantum machine learning for two-dimensional classification problems. In this hybrid approach, kernel evaluations are outsourced to projective measurements on suitably designed quantum states encoding the training data, while the model training is processed on a classical computer. Our two-photon proposal encodes data points in a discrete, eight-dimensional feature Hilbert space. In order to maximize the application range of the deployable kernels, we optimize feature maps towards the resulting kernels' ability to separate points, i.e., their "resolution," under the constraint of finite, fixed Hilbert space dimension. Implementing these kernels, our setup delivers viable decision boundaries for standard nonlinear supervised classification tasks in feature space. We demonstrate such kernel-based quantum machine learning using specialized multiphoton quantum optical circuits. The deployed kernel exhibits exponentially better scaling in the required number of qubits than a direct generalization of kernels described in the literature.Entities:
Year: 2020 PMID: 32704032 PMCID: PMC7378258 DOI: 10.1038/s41598-020-68911-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Kernel family (2) for different amplitude choices. (a) We find that the resolution-optimized kernel (blue solid) exhibits suppressed side maxima as compared to the MSI kernel (red dashed), while the TSQ kernel (with squeezing factor , black dotted) maintains a nonvanishing plateau at all x values. For comparison, we also display the respective squeezed-state kernel for (gray dotted) and CK (purple dash-dotted). (b) Characteristic amplitude progressions for the example of and . (c) The optimized kernel exhibits a significantly improved resolution progression with N, as compared to the MSI or the TSQ kernel (here with ).
Figure 2Training results on a random inseparable data set of 40 samples (up/down-tipped triangles). The performance on a test set (left/right-tipped triangles) of 60 points (the fraction of correctly classified samples that were not used in the QML process) is given in the bottom right corner of each respective subplot. We find that the optimal variance/resolution choice for the Gaussian kernel is . For we deal with overfitting. Shown are the simulation results both for an exact Gaussian kernel and for the truncated FM (8) comprising 4 terms (). The learned classification boundaries are given as contour plots. The slight difference in performance compared to the theoretical prediction is due to statistical fluctuations in the experimental data and the relatively small test set (misclassification of a single near-boundary point results in a 0.02 performance drop).
Figure 4Training results on a random inseparable data set of 40 samples (up/down-tipped triangles). The performance on a test set (left/right-tipped triangles) of 60 points (the fraction of correctly classified samples that were not used in the QML process) is given in the bottom right corner of each respective subplot. We see that the best choice of CK is . For we deal with overfitting and for the kernel is too coarse to give as good results as for . The learned classification boundaries are given as contour plots. The slight difference in performance of KQML in relation to the theoretical prediction is due to statistical fluctuations of the experimental data and relatively small test set (misclassification of a single near-boundary point results in 0.02 performance drop).
Figure 3Optical circuit implementing both the FM and the model circuits. The performance of the setup in QML is shown in Fig. 4 for and The experimental setup consists of polarizing beam splitters (PBSs), beam dividers (BDs), quarter-wave and half-wave plates (QWPs and HWPs, respectively), and single photon detectors for . and are H/V polarization resolving (implemented as a PBS and two standard detectors). The kernel is given as a ratio of coincidences registered by photon detectors and to the total number of photons.