| Literature DB >> 35610272 |
Roberto Leporini1, Davide Pastorello2.
Abstract
Optimal measurements for the discrimination of quantum states are useful tools for classification problems. In order to exploit the potential of quantum computers, feature vectors have to be encoded into quantum states represented by density operators. However, quantum-inspired classifiers based on nearest mean and on Helstrom state discrimination are implemented on classical computers. We show a geometric approach that improves the efficiency of quantum-inspired classification in terms of space and time acting on quantum encoding and allows one to compare classifiers correctly in the presence of multiple preparations of the same quantum state as input. We also introduce the nearest mean classification based on Bures distance, Hellinger distance and Jensen-Shannon distance comparing the performance with respect to well-known classifiers applied to benchmark datasets.Entities:
Year: 2022 PMID: 35610272 PMCID: PMC9130267 DOI: 10.1038/s41598-022-12392-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Average accuracy with the first 2 features.
| Dataset | Helstrom | Euclide | Bures | Hellinger | Jensen | Nearest neighbors | Gaussian process | Linear SVM | RBF SVM | Neural net | QDA | Decision tree | Random Forest | AdaBoost | Naive Bayes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Moons | 0.529 | 0.842 | 0.8445 | 0.8425 | 0.842 | 0.952 | 0.9365 | 0.8325 | 0.944 | 0.844 | 0.834 | 0.894 | 0.9035 | 0.9135 | 0.8385 |
| Cicles | 0.4855 | 0.631 | 0.509 | 0.5555 | 0.6525 | 0.854 | 0.8895 | 0.4065 | 0.8905 | 0.8765 | 0.853 | 0.835 | 0.8375 | 0.8285 | 0.86 |
| Linearly separable | 0.929 | 0.933 | 0.936 | 0.935 | 0.933 | 0.9425 | 0.93 | 0.9285 | 0.942 | 0.939 | 0.93 | 0.9065 | 0.9125 | 0.896 | 0.936 |
| Analcatdata aids | 0.382 | 0.31 | 0.312 | 0.306 | 0.308 | 0.262 | 0.103 | 0.386 | 0.095 | 0.261 | 0.252 | 0.093 | 0.105 | 0.205 | 0.279 |
| Analcatdata asbestos | 0.606471 | 0.714706 | 0.725882 | 0.720588 | 0.714118 | 0.722941 | 0.744706 | 0.695294 | 0.721176 | 0.748235 | 0.728824 | 0.748235 | 0.755294 | 0.695882 | 0.713529 |
| Analcatdata boxing2 | 0.548889 | 0.524815 | 0.547778 | 0.536667 | 0.531111 | 0.450741 | 0.521111 | 0.532222 | 0.494444 | 0.522593 | 0.528519 | 0.434815 | 0.44037 | 0.455926 | 0.539259 |
| Hill valley with noise | 0.481317 | 0.499835 | 0.502634 | 0.504938 | 0.50465 | 0.497531 | 0.516872 | 0.478189 | 0.517654 | 0.488971 | 0.499383 | 0.51535 | 0.509012 | 0.51465 | 0.489547 |
| Hill valley without noise | 0.489712 | 0.514486 | 0.516049 | 0.508066 | 0.509259 | 0.503909 | 0.49679 | 0.492222 | 0.493868 | 0.501193 | 0.513827 | 0.503292 | 0.505391 | 0.518189 | 0.507942 |
| Lupus | 0.773333 | 0.735 | 0.733333 | 0.733333 | 0.734444 | 0.706111 | 0.757222 | 0.756667 | 0.722222 | 0.753333 | 0.742778 | 0.707778 | 0.721667 | 0.665556 | 0.717778 |
| Prnn synth | 0.455 | 0.8566 | 0.832 | 0.8506 | 0.8558 | 0.854 | 0.8622 | 0.8362 | 0.868 | 0.8516 | 0.8424 | 0.8232 | 0.8468 | 0.8298 | 0.8362 |
The best result for each dataset is marked in bold.
F1-score with the first 2 features.
| Dataset | Helstrom | Euclide | Bures | Hellinger | Jensen | Nearest neighbors | Gaussian process | Linear SVM | RBF SVM | Neural net | QDA | Decision tree | Random forest | AdaBoost | Naive Bayes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Moons | 0.464922 | 0.840323 | 0.841519 | 0.84047 | 0.840323 | 0.953685 | 0.936021 | 0.829191 | 0.946382 | 0.841357 | 0.83168 | 0.892442 | 0.901073 | 0.91238 | 0.837082 |
| Cicles | 0.629838 | 0.646017 | 0.667201 | 0.680357 | 0.700705 | 0.855677 | 0.886392 | 0.30833 | 0.887022 | 0.873499 | 0.843021 | 0.829894 | 0.833722 | 0.82728 | 0.850513 |
| Linearly separable | 0.925752 | 0.927311 | 0.930675 | 0.929727 | 0.927311 | 0.94043 | 0.928188 | 0.92364 | 0.939444 | 0.935823 | 0.92697 | 0.904343 | 0.908898 | 0.894871 | 0.931908 |
| Analcatdata aids | 0.312119 | 0.290014 | 0.290706 | 0.286587 | 0.28699 | 0.230239 | 0.111767 | 0.360506 | 0.101949 | 0.25471 | 0.252926 | 0.039859 | 0.093086 | 0.191513 | 0.263386 |
| Analcatdata asbestos | 0.423265 | 0.678411 | 0.703593 | 0.689334 | 0.678303 | 0.653684 | 0.686772 | 0.652546 | 0.644262 | 0.696681 | 0.686944 | 0.689379 | 0.696547 | 0.635218 | 0.67861 |
| Analcatdata boxing2 | 0.683352 | 0.597178 | 0.637601 | 0.617339 | 0.606975 | 0.491697 | 0.607008 | 0.649924 | 0.568273 | 0.60897 | 0.615534 | 0.474085 | 0.507492 | 0.530448 | 0.630684 |
| Hill valley with noise | 0.348074 | 0.337724 | 0.291553 | 0.317597 | 0.327122 | 0.498334 | 0.393383 | 0.306991 | 0.372891 | 0.366183 | 0.380321 | 0.453604 | 0.47774 | 0.444733 | 0.430896 |
| Hill valley without noise | 0.555846 | 0.617317 | 0.636632 | 0.624886 | 0.621875 | 0.512918 | 0.531402 | 0.585316 | 0.604029 | 0.591217 | 0.660424 | 0.506052 | 0.526377 | 0.525226 | 0.642497 |
| Lupus | 0.602344 | 0.669221 | 0.666688 | 0.666656 | 0.667856 | 0.579922 | 0.656767 | 0.582944 | 0.547894 | 0.647052 | 0.630765 | 0.562144 | 0.591334 | 0.546564 | 0.600368 |
| Prnn synth | 0.458623 | 0.858884 | 0.840259 | 0.855394 | 0.858714 | 0.851627 | 0.862972 | 0.841066 | 0.868374 | 0.852702 | 0.845555 | 0.819956 | 0.845004 | 0.829847 | 0.838924 |
Average accuracy with 2 features mapped into high-dimensional feature space .
| Dataset | Helstrom | Euclide | Bures | Nearest neighbors | Gaussian process | Linear SVM | RBF SVM | Neural net | QDA | Decision tree | Random Forest | AdaBoost | Naive Bayes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Moons | 0.761 | 0.8355 | 0.838 | 0.9365 | 0.928 | 0.7995 | 0.931 | 0.907 | 0.9235 | 0.8055 | 0.8385 | 0.788 | 0.819 |
| Cicles | 0.757 | 0.805 | 0.7805 | 0.8315 | 0.8805 | 0.4635 | 0.855 | 0.886 | 0.872 | 0.845 | 0.8685 | 0.8565 | 0.8945 |
| Linearly separable | 0.831 | 0.9325 | 0.7525 | 0.948 | 0.9275 | 0.921 | 0.938 | 0.9355 | 0.938 | 0.92 | 0.9155 | 0.9055 | 0.941 |
| Analcatdata aids | 0.25 | 0.236 | 0.232 | 0.172 | 0.089 | 0.335 | 0.099 | 0.217 | 0.153 | 0.09 | 0.11 | 0.085 | 0.214 |
| Analcatdata asbestos | 0.761765 | 0.732941 | 0.708824 | 0.724118 | 0.738235 | 0.612941 | 0.732941 | 0.747647 | 0.6 | 0.741176 | 0.753529 | 0.751176 | 0.73 |
| Analcatdata boxing2 | 0.541111 | 0.518148 | 0.537037 | 0.45963 | 0.507778 | 0.536667 | 0.487407 | 0.516296 | 0.479259 | 0.433333 | 0.445185 | 0.435926 | 0.516296 |
| Hill valley with noise | 0.483128 | 0.495226 | 0.490535 | 0.494938 | 0.511852 | 0.477325 | 0.527407 | 0.48856 | 0.492593 | 0.512428 | 0.505926 | 0.503169 | 0.487449 |
| Hill valley without noise | 0.501852 | 0.509547 | 0.138148 | 0.505761 | 0.496132 | 0.486049 | 0.516749 | 0.500905 | 0.54284 | 0.607078 | 0.591975 | 0.58214 | 0.511646 |
| Lupus | 0.737778 | 0.715 | 0.720556 | 0.73 | 0.745556 | 0.619444 | 0.749444 | 0.746667 | 0.747222 | 0.681667 | 0.704444 | 0.657222 | 0.696111 |
| Prnn synth | 0.8084 | 0.8438 | 0.8386 | 0.8588 | 0.8678 | 0.8466 | 0.8576 | 0.857 | 0.8576 | 0.8182 | 0.8346 | 0.8042 | 0.8404 |
Average accuracy with 2 features mapped into high-dimensional feature space .
| Dataset | Helstrom | Euclide | Bures | Nearest neighbors | Gaussian process | Linear SVM | RBF SVM | Neural net | QDA | Decision tree | Random Forest | AdaBoost | Naive Bayes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Moons | 0.8395 | 0.882 | 0.8835 | 0.9495 | 0.928 | 0.479 | 0.933 | 0.932 | 0.9175 | 0.889 | 0.7975 | 0.8915 | 0.8025 |
| Cicles | 0.628 | 0.841 | 0.846 | 0.847 | 0.878 | 0.4585 | 0.872 | 0.8785 | 0.857 | 0.832 | 0.8565 | 0.845 | 0.8675 |
| Linearly separable | 0.9045 | 0.9095 | 0.9165 | 0.935 | 0.9375 | 0.489 | 0.9415 | 0.9205 | 0.9435 | 0.91 | 0.87 | 0.9145 | 0.918 |
| Analcatdata aids | 0.172 | 0.191 | 0.187 | 0.173 | 0.181 | 0.327 | 0.093 | 0.198 | 0.318 | 0.093 | 0.105 | 0.084 | 0.189 |
| Analcatdata asbestos | 0.725882 | 0.724706 | 0.715882 | 0.722941 | 0.732353 | 0.561176 | 0.731176 | 0.738824 | 0.665294 | 0.728824 | 0.743529 | 0.698235 | 0.682941 |
| Analcatdata boxing2 | 0.515926 | 0.50037 | 0.522963 | 0.467407 | 0.488148 | 0.536667 | 0.486667 | 0.506667 | 0.492963 | 0.429259 | 0.435556 | 0.445556 | 0.497407 |
| Hill valley with noise | 0.483333 | 0.488807 | 0.495556 | 0.496008 | 0.51786 | 0.479053 | 0.529383 | 0.492016 | 0.500165 | 0.510206 | 0.504198 | 0.499671 | 0.493333 |
| Hill valley without noise | 0.496626 | 0.508148 | 0.506461 | 0.505226 | 0.49284 | 0.481728 | 0.518025 | 0.500247 | 0.551564 | 0.561687 | 0.531399 | 0.58037 | 0.505432 |
| Lupus | 0.773333 | 0.715 | 0.71 | 0.720556 | 0.747778 | 0.622222 | 0.747778 | 0.745 | 0.699444 | 0.693333 | 0.68 | 0.68 | 0.693889 |
| Prnn synth | 0.8466 | 0.8564 | 0.8594 | 0.8622 | 0.8674 | 0.7806 | 0.863 | 0.873 | 0.8478 | 0.8342 | 0.816 | 0.833 | 0.7902 |
Average accuracy with 2 features mapped into high-dimensional feature space .
| Dataset | Helstrom | Euclide | Bures | Nearest neighbors | Gaussian process | Linear SVM | RBF SVM | Neural net | QDA | Decision tree | Random Forest | AdaBoost | Naive Bayes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Moons | 0.9165 | 0.903 | 0.9105 | 0.945 | 0.93 | 0.4805 | 0.9345 | 0.9355 | 0.9 | 0.871 | 0.728 | 0.8985 | 0.7675 |
| Cicles | 0.7605 | 0.855 | 0.8785 | 0.851 | 0.881 | 0.453 | 0.8745 | 0.8805 | 0.853 | 0.822 | 0.8545 | 0.829 | 0.88 |
| Linearly separable | 0.9325 | 0.8765 | 0.8875 | 0.938 | 0.949 | 0.483 | 0.9295 | 0.9195 | 0.9375 | 0.935 | 0.8425 | 0.9405 | 0.8315 |
| Analcatdata aids | 0.121 | 0.176 | 0.173 | 0.183 | 0.284 | 0.341 | 0.148 | 0.29 | 0.084 | 0.095 | 0.117 | 0.084 | 0.195 |
| Analcatdata asbestos | 0.731765 | 0.715294 | 0.714118 | 0.723529 | 0.72 | 0.557647 | 0.732941 | 0.733529 | 0.6 | 0.742353 | 0.745882 | 0.747647 | 0.661176 |
| Analcatdata boxing2 | 0.51 | 0.49 | 0.508889 | 0.467037 | 0.49963 | 0.536667 | 0.494074 | 0.507037 | 0.495185 | 0.427037 | 0.442222 | 0.436667 | 0.496667 |
| Hill valley with noise | 0.483251 | 0.487984 | 0.493457 | 0.500206 | 0.510905 | 0.479053 | 0.529218 | 0.485885 | 0.506502 | 0.508642 | 0.500206 | 0.493169 | 0.498025 |
| Hill valley without noise | 0.498477 | 0.507984 | 0.509342 | 0.504609 | 0.493992 | 0.483621 | 0.520082 | 0.497119 | 0.562593 | 0.520082 | 0.508601 | 0.546132 | 0.50963 |
| Lupus | 0.773333 | 0.703333 | 0.695556 | 0.701667 | 0.746667 | 0.622222 | 0.748333 | 0.745556 | 0.636111 | 0.678889 | 0.659444 | 0.668889 | 0.639444 |
| Prnn synth | 0.8538 | 0.8546 | 0.8628 | 0.857 | 0.8668 | 0.488 | 0.8708 | 0.8756 | 0.8552 | 0.8372 | 0.7952 | 0.8344 | 0.7462 |
Average accuracy with 2 features mapped into high-dimensional feature space .
| Dataset | Helstrom | Euclide | Bures | Nearest neighbors | Gaussian process | Linear SVM | RBF SVM | Neural net | QDA | Decision tree | Random Forest | AdaBoost | Naive Bayes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Moons | 0.92 | 0.8915 | 0.909 | 0.9375 | 0.931 | 0.4665 | 0.9265 | 0.9375 | 0.88 | 0.859 | 0.682 | 0.896 | 0.6465 |
| Cicles | 0.807 | 0.8445 | 0.887 | 0.8525 | 0.882 | 0.445 | 0.871 | 0.8855 | 0.7695 | 0.823 | 0.857 | 0.8555 | 0.892 |
| Linearly separable | 0.9365 | 0.8375 | 0.856 | 0.9005 | 0.9505 | 0.4735 | 0.8925 | 0.908 | 0.9135 | 0.9295 | 0.7665 | 0.9225 | 0.713 |
| Analcatdata aids | 0.099 | 0.205 | 0.211 | 0.201 | 0.257 | 0.37 | 0.284 | 0.348 | 0.084 | 0.093 | 0.121 | 0.084 | 0.205 |
| Analcatdata asbestos | 0.732353 | 0.698235 | 0.708235 | 0.723529 | 0.729412 | 0.553529 | 0.732353 | 0.731176 | 0.737647 | 0.711176 | 0.742353 | 0.727647 | 0.642353 |
| Analcatdata boxing2 | 0.502593 | 0.486296 | 0.502222 | 0.467778 | 0.500741 | 0.536667 | 0.495556 | 0.507037 | 0.538148 | 0.43 | 0.45 | 0.447778 | 0.498889 |
| Hill valley with noise | 0.484362 | 0.480988 | 0.494774 | 0.499012 | 0.504979 | 0.479053 | 0.531193 | 0.480082 | 0.50642 | 0.515885 | 0.504321 | 0.518642 | 0.495103 |
| Hill valley without noise | 0.500494 | 0.504321 | 0.503251 | 0.50428 | 0.49749 | 0.483621 | 0.521029 | 0.490453 | 0.569218 | 0.52572 | 0.501852 | 0.53214 | 0.508889 |
| Lupus | 0.772778 | 0.684444 | 0.685 | 0.686667 | 0.726111 | 0.622222 | 0.748333 | 0.74 | 0.652222 | 0.651667 | 0.578333 | 0.642778 | 0.582778 |
| Prnn synth | 0.8556 | 0.8174 | 0.8602 | 0.8582 | 0.8616 | 0.4834 | 0.874 | 0.8772 | 0.8392 | 0.8286 | 0.7712 | 0.824 | 0.6968 |
F1-score with 2 features mapped into high-dimensional feature space .
| Dataset | Helstrom | Euclide | Bures | Nearest neighbors | Gaussian process | Linear SVM | RBF SVM | Neural net | QDA | Decision tree | Random Forest | AdaBoost | Naive Bayes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Moons | 0.919654 | 0.888403 | 0.90364 | 0.940338 | 0.931047 | 0.355928 | 0.927139 | 0.940196 | 0.874958 | 0.856571 | 0.679505 | 0.893739 | 0.637247 |
| Cicles | 0.78895 | 0.814057 | 0.884429 | 0.848254 | 0.876753 | 0.286566 | 0.856554 | 0.877363 | 0.743484 | 0.816633 | 0.849931 | 0.849835 | 0.884564 |
| Linearly separable | 0.9331 | 0.811657 | 0.840846 | 0.898957 | 0.949107 | 0.352685 | 0.887571 | 0.90338 | 0.908466 | 0.928714 | 0.764568 | 0.920865 | 0.681532 |
| Analcatdata aids | 0.094768 | 0.191961 | 0.197471 | 0.197468 | 0.218508 | 0.291432 | 0.241194 | 0.292872 | 0.072162 | 0.048727 | 0.09554 | 0.083706 | 0.184656 |
| Analcatdata asbestos | 0.646487 | 0.664024 | 0.685391 | 0.648116 | 0.667969 | 0.03001 | 0.678992 | 0.667294 | 0.653541 | 0.643194 | 0.686401 | 0.674249 | 0.589861 |
| Analcatdata boxing2 | 0.607337 | 0.508385 | 0.559737 | 0.507701 | 0.612686 | 0.652648 | 0.574432 | 0.612221 | 0.657725 | 0.465764 | 0.525866 | 0.518559 | 0.549566 |
| Hill valley with noise | 0.385468 | 0.415765 | 0.410437 | 0.504632 | 0.4108 | 0.338354 | 0.385776 | 0.396176 | 0.335648 | 0.292807 | 0.465071 | 0.451033 | 0.326622 |
| Hill valley without noise | 0.590686 | 0.590196 | 0.622812 | 0.513252 | 0.547225 | 0.545744 | 0.60283 | 0.564534 | 0.666941 | 0.467788 | 0.521789 | 0.524884 | 0.638644 |
| Lupus | 0.632978 | 0.581279 | 0.585018 | 0.54156 | 0.584137 | 0 | 0.582675 | 0.593176 | 0.534378 | 0.527788 | 0.390944 | 0.523898 | 0.460007 |
| Prnn synth | 0.856229 | 0.795313 | 0.860908 | 0.856145 | 0.861114 | 0.366932 | 0.87431 | 0.877672 | 0.839585 | 0.823271 | 0.77101 | 0.819984 | 0.70899 |
Death/alive laryngeal cancer patients and case-control marks of cancer cases in North Liverpool.
| Helstrom | Linear | RadialBasisFunction | Polynomial | Sigmoid | RandomForest | NaiveBayes | NearestNeighbors | LogisticRegression | |
|---|---|---|---|---|---|---|---|---|---|
| LarynxCancer | 0.52 | 0.965 | 0.928333 | 0.93 | 0.888333 | 0.791111 | 0.712222 | 0.747778 | 0.946667 |
| LiverpoolCancer | 0.637017 | 0.799501 | 0.799501 | 0.799501 | 0.790197 | 0.769658 | 0.799001 | 0.799606 | 0.799501 |
Case-control study of esophageal cancer.
| PrettyGood | GeometricHelstrom | Linear | RadialBasisFunction | Polynomial | Sigmoid | RandomForest | NaiveBayes | NearestNeighbors | LogisticRegression | |
|---|---|---|---|---|---|---|---|---|---|---|
| EsophagealCancer | 0.336111 | 0.4 | 0.293333 | 0.238333 | 0.236667 | 0.218333 | 0.493889 | 0.457222 | 0.241667 | 0.347222 |