| Literature DB >> 35061684 |
Abstract
When keystroke dynamics are used for authentication, users tend to get different levels of security due to differences in the quality of their templates. This paper addresses this issue by proposing a metric to quantify the quality of keystroke dynamic templates. That is, in behavioral biometric verification, the user's templates are generally constructed using multiple enrolled samples to capture intra-user variation. This variation is then used to normalize the distance between a set of enrolled samples and a test sample. Then a normalized distance is compared against a predefined threshold value to derive a verification decision. As a result, the coverage area for accepted samples in the original space of vector representation is discrete. Therefore, users with the higher intra-user variation suffer higher false acceptance rates (FAR). This paper proposes a metric that can be used to reflect the verification performance of individual keystroke dynamic templates in terms of FAR. Specifically, the metric is derived from statistical information of user-specific feature variations, and it has a non-decreasing property when a new feature is added to a template. The experiments are performed based on two public keystroke dynamic datasets comprising of two main types of keystroke dynamics: constrained-text and free-text, namely the CMU keystroke dynamics dataset and the Web-Based Benchmark for keystroke dynamics dataset. Experimental results based on multiple classifiers demonstrate that the proposed metric can be a good indicator of the template's false acceptance rate. Thus, it can be used to enhance the security of the user authentication system based on keystroke dynamics.Entities:
Mesh:
Year: 2022 PMID: 35061684 PMCID: PMC8782503 DOI: 10.1371/journal.pone.0261291
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1A false acceptance rate of a keystroke template of each user at EER threshold according to Manhattan classifier described in [8].
Fig 2Imposter score distributions of three template groups according to their distinctiveness derived from the Manhattan distance classifier.
Fig 3The CMU keystroke’s performance trade-off curves between false acceptance rate and the threshold value for each template group based on three classifiers: (a) the Manhattan distance, (b) the Euclidean distance, and (c) the Mahalanobis distance.
Fig 7The Web-Based Benchmark (free-text password field) performance trade-off curves between false acceptance rate and the threshold value for each template group based on two classifiers: (a)the Manhattan distance and (b) the Euclidean distance.
The Spearman rank correlation (ρ) between the proposed distinctiveness score and the FAR along with the EER(%) of the system for the CMU dataset.
| Classifier |
| p-value | EER |
|---|---|---|---|
| Manhattan | -0.63 | 5.71e-07 | 16.00 |
| Euclidean | -0.72 | 3.22e-09 | 16.73 |
| Mahalanobis | -0.81 | 0 | 17.16 |
The Spearman rank correlation (ρ) between the proposed distinctiveness score and the FAR along with the EER(%) of the system for the Web-based keystroke dataset.
| Field | Classifier |
| p-value | EER |
|---|---|---|---|---|
| Constrain text | ||||
| Username ‘laboratoire greyc’ | Manhattan | -0.77 | 0 | 13.28 |
| Euclidean | -0.83 | 0 | 16.86 | |
| Password ’S2SAME’ | Manhattan | -0.62 | 1.65e-06 | 24.40 |
| Euclidean | -0.70 | 2.42e-08 | 24.93 | |
| Mahalanobis | -0.66 | 2.53e-07 | 25.16 | |
| Free-text | ||||
| Username | Manhattan | -0.68 | 1.84e-08 | 18.19 |
| Euclidean | -0.70 | 6.21e-09 | 17.94 | |
| Password | Manhattan | -0.67 | 1.15e-04 | 16.99 |
| Euclidean | -0.67 | 1.44e-04 | 18.91 | |
FAR and FRR of each template group from the CMU dataset at various threshold settings.
| Classifier | Group | Thresholding | |||||
|---|---|---|---|---|---|---|---|
| EER | 5% FRR | 10% FRR | |||||
| FAR | FRR | FAR | FRR | FAR | FRR | ||
| Manhattan | All | 16.01% | 15.99% | 35.72% | 5.00% | 23.55% | 10.00% |
| GOOD | 3.88% | 12.44% | 13.72% | 4.53% | 6.98% | 7.94% | |
| MID | 19.42% | 19.00% | 41.86% | 5.97% | 28.05% | 12.24% | |
| BAD | 24.74% | 16.53% | 51.57% | 4.50% | 35.61% | 9.82% | |
| Euclidean | All | 16.73% | 16.74% | 46.85% | 5.00% | 28.77% | 10.00% |
| GOOD | 3.80% | 15.12% | 20.80% | 5.21% | 9.03% | 9.53% | |
| MID | 18.52% | 19.32% | 51.23% | 5.35% | 31.64% | 11.12% | |
| BAD | 27.88% | 15.76% | 68.52% | 4.44% | 45.64% | 9.35% | |
| Mahalanobis | All | 17.16% | 17.16% | 47.65% | 5.00% | 29.77% | 10.00% |
| GOOD | 3.52% | 15.24% | 20.58% | 5.59% | 8.69% | 9.50% | |
| MID | 16.41% | 20.12% | 49.48% | 5.26% | 29.39% | 11.18% | |
| BAD | 31.56% | 16.12% | 72.90% | 4.15% | 51.21% | 9.32% | |
FAR and FRR of each template group from the Web-Based Benchmark dataset at various threshold settings.
| Classifier | Group | EER | 5% FRR | 10% FRR | |||
|---|---|---|---|---|---|---|---|
| FAR | FRR | FAR | FRR | FAR | FRR | ||
| Constraint text–Username (’laboratoire greyc’) | |||||||
| Manhattan | All | 13.11% | 13.34% | 45.03% | 5.00% | 20.97% | 10.04% |
| Good | 2.34% | 21.70% | 13.65% | 7.53% | 4.13% | 15.94% | |
| Mid | 12.31% | 11.31% | 46.75% | 4.34% | 20.14% | 8.70% | |
| Bad | 25.42% | 8.04% | 73.05% | 3.50% | 39.41% | 6.13% | |
| Euclidean | All | 16.63% | 16.87% | 72.20% | 5.02% | 40.20% | 10.07% |
| Good | 2.78% | 23.19% | 40.48% | 6.50% | 10.85% | 13.52% | |
| Mid | 14.17% | 15.71% | 77.94% | 4.95% | 39.34% | 9.69% | |
| Bad | 35.27% | 12.12% | 92.79% | 3.49% | 71.21% | 6.94% | |
| Constraint text–Password (’S2SAME’) | |||||||
| Manhattan | All | 24.21% | 24.01% | 67.98% | 5.00% | 50.42% | 9.82% |
| Good | 12.36% | 35.50% | 46.54% | 8.16% | 31.17% | 17.85% | |
| Mid | 22.28% | 21.45% | 68.53% | 4.61% | 49.15% | 7.88% | |
| Bad | 40.08% | 15.21% | 88.28% | 1.80% | 72.30% | 4.03% | |
| Euclidean | All | 24.80% | 24.49% | 72.57% | 4.97% | 55.03% | 9.82% |
| Good | 11.69% | 36.47% | 49.96% | 7.31% | 32.87% | 16.73% | |
| Mid | 22.57% | 21.85% | 74.60% | 4.50% | 53.91% | 8.04% | |
| Bad | 42.56% | 15.24% | 90.94% | 3.04% | 79.51% | 5.09% | |
| Mahalanobis | All | 24.93% | 24.99% | 68.37% | 4.97% | 52.20% | 9.95% |
| Good | 12.11% | 35.20% | 49.99% | 6.04% | 32.81% | 12.92% | |
| Mid | 23.11% | 22.41% | 70.01% | 4.88% | 52.48% | 9.67% | |
| Bad | 41.53% | 17.92% | 83.35% | 3.79% | 71.00% | 6.73% | |
| Free-text –Username | |||||||
| Manhattan | All | 17.86% | 18.04% | 43.27% | 4.98% | 26.88% | 9.89% |
| Good | 5.61% | 23.92% | 16.13% | 5.16% | 7.91% | 11.92% | |
| Mid | 15.14% | 16.77% | 42.39% | 4.53% | 24.68% | 8.86% | |
| Bad | 35.96% | 14.21% | 72.31% | 5.89% | 50.57% | 10.00% | |
| Euclidean | All | 17.58% | 17.80% | 49.34% | 4.96% | 28.67% | 9.90% |
| Good | 4.07% | 23.77% | 19.15% | 4.84% | 7.31% | 12.10% | |
| Mid | 14.77% | 16.46% | 49.68% | 4.77% | 26.19% | 9.19% | |
| Bad | 37.16% | 14.03% | 78.81% | 5.56% | 55.38% | 9.02% | |
| Free-text –Password) | |||||||
| Manhattan | All | 16.32% | 17.87% | 51.32% | 5.45% | 27.66% | 10.24% |
| Good | 4.32% | 18.51% | 27.15% | 4.13% | 11.91% | 9.12% | |
| Mid | 14.34% | 18.02% | 51.81% | 6.20% | 27.26% | 11.43% | |
| Bad | 39.46% | 16.11% | 83.59% | 4.73% | 51.03% | 7.43% | |
| Euclidean | All | 18.25% | 19.81% | 59.09% | 5.15% | 37.98% | 10.18% |
| Good | 4.69% | 21.34% | 39.32% | 3.97% | 18.46% | 8.03% | |
| Mid | 17.19% | 19.55% | 59.44% | 5.46% | 36.12% | 11.07% | |
| Bad | 40.59% | 18.18% | 85.67% | 5.98% | 71.28% | 10.43% | |
Fig 8Verification performance of free-text username and password combination for each group in terms of FAR and FRR.
Experiments were performed on the Web-based Benchmark dataset based on: (a) Manhattan distance and (b) Euclidean distance.
Verification performance comparison of existing keystroke dynamic approaches.
| Ref. | Dataset | Number of users | Keystroke length | Algorithm | EER (%) |
|---|---|---|---|---|---|
| [ | SUNY | 75 | 10 | CNN plus RNN | 9.75 |
| [ | CMU | 51 | 10 | Dependence Clustering | 7.70 |
| [ | CMU | 51 | 10 | Nearest Neighborg | 8.40 |
| [ | CMU | 51 | 10 | Manhattan (scaled with average absolute deviation) | 9.60 |
| This work | CMU | 51 | 10 | Manhattan (scaled with standard deviation) | 9.16 |
| • Good users | 4.50 | ||||
| • Mid users | 10.27 | ||||
| • bad users | 12.70 |
Entropy measures of different authentication factors.
| Ref. | Authentication factor | Dataset description | Entropy (bits) |
|---|---|---|---|
| [ | 4-digits PIN | Dodonew, CSDN, Rockyou, Yahoo (total 3.4 M) | 8.41 |
| [ | 6-digits PIN | Dodonew, CSDN, Rockyou, Yahoo (total 6.4 M) | 13.21 |
| [ | Password | 14 datasets (total 113.3 M) | 20-23 |
| [ | Iris | ICE (High quality set—374 iris, 10 samples each) | 8.9-10 |
| [ | Finger Vein | VERA (220 fingers, 2 samples each) | 4.2—13.2 |
| UTFVP (360 fingers, 4 images each) | 18.9-19.5 | ||
| This work | Keystroke | CMU (51 typists, 400 samples each) | 3.48 |
| • Good user | 4.62 bits |