| Literature DB >> 33265185 |
Lara Ortiz-Martin1, Pablo Picazo-Sanchez2, Pedro Peris-Lopez1, Juan Tapiador1.
Abstract
The proliferation of wearable and implantable medical devices has given rise to an interest in developing security schemes suitable for these systems and the environment in which they operate. One area that has received much attention lately is the use of (human) biological signals as the basis for biometric authentication, identification and the generation of cryptographic keys. The heart signal (e.g., as recorded in an electrocardiogram) has been used by several researchers in the last few years. Specifically, the so-called Inter-Pulse Intervals (IPIs), which is the time between two consecutive heartbeats, have been repeatedly pointed out as a potentially good source of entropy and are at the core of various recent authentication protocols. In this work, we report the results of a large-scale statistical study to determine whether such an assumption is (or not) upheld. For this, we have analyzed 19 public datasets of heart signals from the Physionet repository, spanning electrocardiograms from 1353 subjects sampled at different frequencies and with lengths that vary between a few minutes and several hours. We believe this is the largest dataset on this topic analyzed in the literature. We have then applied a standard battery of randomness tests to the extracted IPIs. Under the algorithms described in this paper and after analyzing these 19 public ECG datasets, our results raise doubts about the use of IPI values as a good source of randomness for cryptographic purposes. This has repercussions both in the security of some of the protocols proposed up to now and also in the design of future IPI-based schemes.Entities:
Keywords: authentication; biometric; implantable medical devices; inter-pulse intervals; privacy; randomness
Year: 2018 PMID: 33265185 PMCID: PMC7512659 DOI: 10.3390/e20020094
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1A typical electrocardiogram (ECG) signal and its main features: peaks (P, Q, R, S, T, U), waves, segments and intervals. IPI, Inter-Pulse Interval.
Figure 2Architecture of a generic biometric recognition system.
Datasets and number of test runs used by related work.
| Work | Dataset | Randomness Test |
|---|---|---|
| [ | 50 subjects from the MIMICII Waveform | Shannon’s Entropy |
| [ | 99 subjects from a private dataset | NIST STS (5/15) |
| [ | 50 subjects from a private dataset | NIST STS (15/15) |
| [ | Not specified | NIST STS (6/15) |
| [ | 47 subjects from | NIST STS (8/15) |
| [ | Shannon’s entropy | |
| [ | Shannon’s entropy | |
| [ | ENT | |
| [ | Rényi’s entropy | |
| [ | PhysioNet | NIST STS (9/15) |
| [ | 84 subjects from a private dataset and European ST-T | NIST STS (5/15) |
| [ | 18 subjects from MIT-BIHand 79 from the European ST-T | NIST STS (10/15) |
It is not specified in the paper.
The 19 datasets used in this work. For each dataset, the table provides the number of records (subjects), the sampling frequency, the median value of IPIs per database and the pathology (if any) of the subjects involved in each dataset.
| Dataset | #Records | Frequency (Hz) | Median (IPIs) | Pathology |
|---|---|---|---|---|
| 54 | 5000 | 175 | Healthy volunteers | |
| 545 | 1000 | 68 | Myocardial problems and healthy controls | |
| 5 | 500 | 87 | Myocardial problems | |
| 5 | 1000 | 37 | Atrial fibrillation or flutter | |
| 53 | 250 | 12 | Holter recordings | |
| 14 | 360 | 1246 | Physically-active volunteers | |
| 46 | 360 | 1113 | Arrhythmia | |
| 104 | 250 | 520.5 | Holter recordings | |
| 28 | 360 | 1243 | Stress tests | |
| 9 | 250 | 415 | Ventricular problems | |
| 10 | 720 | 48.5 | Tachycardia | |
| 47 | 128 | 1192 | Partial epilepsy | |
| 17 | 250 | 1800 | Tachycardia | |
| 7 | 200 | 4439 | Partial epilepsy | |
| 17 | 250 | 11,517 | Sleep apnea syndrome | |
| 90 | 250 | 4405 | Myocardial and hypertension | |
| 202 | 360 | 2426 | Unstable patients in critical care units | |
| 77 | 100 | 15,786 | Tachycardia | |
| 23 | 128 | 46,910 | Hypertension |
Figure 3Statistical analysis of beat streams (in bits) and time (in seconds).
ENT tests: optimal values, thresholds used to consider that a sequence passes the test and results obtained for a counting sequence.
| Test | Optimal Value | Threshold | Counter |
|---|---|---|---|
| Entropy | 1.0 | >0.85 | 0.99 |
| Optimum compression | <0% | <5% | 0% |
| Chi square | 1% | ||
| Arithmetic mean | 0.5 | 0.4 | 0.46 |
| Monte Carlo value for | error = 0% | error < 5% | 12.38% |
| Serial correlation coefficient | 0 | 0.012 |
Results of the ENT tests expressed as the percentage of subjects that pass each test per database.
| Dataset | Entropy | Optimum Compression | Chi Square | Arithmetic Mean | Monte Carlo Value for | Serial Correlation |
|---|---|---|---|---|---|---|
| 100% | 100% | 0% | 50% | 10% | 60% | |
| 99.82% | 100% | 0% | 97.98% | 22.20% | 99.63% | |
| 100% | 100% | 0% | 80% | 0% | 100% | |
| 100% | 100% | 0% | 100% | 40% | 100% | |
| 100% | 100% | 0% | 81.13% | 1.89% | 96.23% | |
| 100% | 100% | 0% | 92.86% | 35.71% | 100% | |
| 100% | 100% | 0% | 97.83% | 47.83% | 97.83% | |
| 99.04% | 100% | 0% | 96.15% | 38.46% | 100% | |
| 100% | 100% | 0% | 100% | 35.71% | 100% | |
| 100% | 100% | 0% | 44.44% | 11.11% | 100% | |
| 80% | 100% | 0% | 50% | 10% | 60% | |
| 100% | 100% | 0% | 97.87% | 42.55% | 97.87% | |
| 83% | 100% | 0% | 17% | 6% | 94% | |
| 85.71% | 100% | 0% | 85.71% | 71.43% | 85.71% | |
| 100% | 100% | 0% | 100% | 74.47% | 100% | |
| 98.89% | 100% | 0% | 98.89% | 60% | 100% | |
| 72.28% | 100% | 0% | 59.41% | 22.28% | 86.14% | |
| 75.32% | 100% | 0% | 62.34% | 29.87% | 81.82% | |
| 95.65% | 100% | 0% | 95.65% | 55.52% | 100% |
NIST STS requirements in terms of length [76].
| Test Name |
| |
|---|---|---|
| Frequency (Monobit) | - | |
| Frequency Test within a Block | - | |
| Run | - | |
| Longest Run of Ones in a Block | ||
| Binary Matrix Rank | - | |
| Discrete Fourier Transform (Spectral) | - | |
| Non-Overlapping Template Matching | ||
| Overlapping Template Matching | ||
| Maurer’s “Universal Statistical” Test | ||
| Linear Complexity | ||
| Serial | ||
| Approximate Entropy | ||
| Cumulative Sums | ||
| Random Excursions | ||
| Random Excursions Variant |
Results of the NIST STS tests expressed as the percentage of subjects that pass each test.
| Dataset | Monobit Frequency | Block Frequency | Runs | Longest Run Ones | Binary Matrix Rank | Spectral | Non Overlapping Template Matching | Overlapping Template Matching | Universal Statistic | Linear Complexity | Serial | Approximate Entropy | Cumulative Sums | Random Excursions | Random Excursions Variant |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 94% | 81% | 85% | 87% | 98% | 100% | 96% | 72% | 100% | 20% | 85% | 96% | 96% | 98% | 100% | |
| 89% | 89% | 89% | 89% | 85% | 92% | 98% | 84% | 1% | 0% | 100% | 85% | 98% | 99% | 65% | |
| 80% | 60% | 80% | 100% | 80% | 80% | 100% | 80% | 20% | 0% | 100% | 60% | 100% | 80% | 80% | |
| 100% | 100% | 100% | 80% | 100% | 100% | 100% | 100% | 20% | 0% | 100% | 100% | 80% | 100% | 40% | |
| 94% | 25% | 92% | 94% | 100% | 94% | 96% | 81% | 0% | 0% | 100% | 77% | 100% | 100% | 0% | |
| 14% | 21% | 7% | 64% | 21% | 50% | 71% | 57% | 86% | 57% | 7% | 93% | 93% | 71% | 100% | |
| 46% | 33% | 33% | 50% | 35% | 59% | 91% | 52% | 89% | 39% | 9% | 87% | 96% | 85% | 100% | |
| 47% | 41% | 44% | 54% | 25% | 53% | 92% | 56% | 89% | 0% | 0% | 77% | 98% | 81% | 95% | |
| 50% | 7% | 29% | 54% | 21% | 21% | 64% | 32% | 64% | 21% | 4% | 71% | 100% | 32% | 100% | |
| 0% | 11% | 0% | 56% | 11% | 22% | 11% | 67% | 67% | 0% | 11% | 56% | 100% | 22% | 78% | |
| 10% | 20% | 30% | 40% | 20% | 10% | 90% | 90% | 0% | 0% | 100% | 60% | 100% | 0% | 40% | |
| 23% | 11% | 19% | 28% | 6% | 9% | 77% | 43% | 77% | 21% | 0% | 77% | 100% | 28% | 94% | |
| 29% | 12% | 12% | 41% | 18% | 12% | 29% | 29% | 88% | 18% | 0% | 71% | 100% | 24% | 100% | |
| 14% | 0% | 0% | 29% | 0% | 0% | 43% | 29% | 71% | 0% | 0% | 86% | 100% | 0% | 86% | |
| 24% | 0% | 6% | 35% | 6% | 12% | 76% | 12% | 24% | 0% | 0% | 76% | 94% | 6% | 94% | |
| 23% | 1% | 14% | 29% | 3% | 7% | 62% | 21% | 43% | 9% | 0% | 86% | 100% | 2% | 94% | |
| 22% | 14% | 14% | 33% | 11% | 12% | 35% | 34% | 34% | 7% | 19% | 57% | 99% | 15% | 60% | |
| 5% | 4% | 4% | 17% | 1% | 0% | 27% | 26% | 23% | 0% | 9% | 68% | 96% | 1% | 75% | |
| 9% | 0% | 0% | 4% | 0% | 0% | 17% | 0% | 0% | 0% | 0% | 48% | 100% | 0% | 96% | |
| 36.8% | 21.0% | 26.3% | 52.6% | 26.3% | 42.1% | 68.4% | 52.6% | 47.3% | 5.2% | 31.5% | 94.7% | 100% | 42.1% | 84.2% |
Characteristics vs. success rate datasets.
| Dataset | ENT | NIST STS | Avg. No. Samples | Median (IPI) | Pathology |
|---|---|---|---|---|---|
| 66.6% | 93.3% | 4,968,780 | 175 | Healthy volunteers | |
| 66.6% | 86.6% | 108,818 | 68 | Myocardial problems and Healthy controls | |
| 66.6% | 86.6% | 59,770 | 87 | Myocardial problems | |
| 66.6% | 80.0% | 19,707,034 | 37 | Atrial fibrillation or flutter | |
| 66.6% | 73.3% | 5,120 | 12 | Holter recordings | |
| 66.6% | 66.6% | 650,000 | 1246 | Physically active volunteers | |
| 66.6% | 60.0% | 650,000 | 1113 | Arrhythmia | |
| 66.6% | 60.0% | 224,999 | 520.5 | Holter recordings | |
| 66.6% | 46.6% | 624,166 | 1243 | Stress tests | |
| 50.0% | 40.0% | 127,232 | 415 | Ventricular problems | |
| 66.6% | 33.3% | 55,522 | 48.5 | Tachycardia | |
| 66.6% | 33.3% | 230,400 | 1192 | Partial epilepsy | |
| 50.0% | 26.6% | 525,000 | 1800 | Tachycardia | |
| 83.3% | 26.6% | 17,245,701 | 4439 | Partial epilepsy | |
| 83.3% | 26.6% | 4,188,530 | 11,517 | Sleep apnea syndrome | |
| 83.3% | 26.6% | 1,800,000 | 4405 | Myocardial and hypertension | |
| 66.6% | 20.0% | 1,479,358 | 2426 | Unstable patients in critical care units | |
| 66.6% | 20.0% | 11,930 | 15,786 | Tachycardia | |
| 83.3% | 13.3% | 10,553,116 | 46,910 | Hypertension |
Figure 4Distribution of the fraction of tests passed for the mitdb dataset as a function of the number of bits used. (a) ENT suite; and (b) NIST STS suite.