| Literature DB >> 32455686 |
Alex Barros1, Paulo Resque1, João Almeida1, Renato Mota1, Helder Oliveira1, Denis Rosário1, Eduardo Cerqueira1.
Abstract
The rapid spread of wearable technologies has motivated the collection of a variety of signals, such as pulse rate, electrocardiogram (ECG), electroencephalogram (EEG), and others. As those devices are used to do so many tasks and store a significant amount of personal data, the concern of how our data can be exposed starts to gain attention as the wearable devices can become an attack vector or a security breach. In this context, biometric also has expanded its use to meet new security requirements of authentication demanded by online applications, and it has been used in identification systems by a large number of people. Existing works on ECG for user authentication do not consider a population size close to a real application. Finding real data that has a big number of people ECG's data is a challenge. This work investigates a set of steps that can improve the results when working with a higher number of target classes in a biometric identification scenario. These steps, such as increasing the number of examples, removing outliers, and including a few additional features, are proven to increase the performance in a large data set. We propose a data improvement model for ECG biometric identification (user identification based on electrocardiogram-DETECT), which improves the performance of the biometric system considering a greater number of subjects, which is closer to a security system in the real world. The DETECT model increases precision from 78% to 92% within 1500 subjects, and from 90% to 95% within 100 subjects. Moreover, good False Rejection Rate (i.e., 0.064003) and False Acceptance Rate (i.e., 0.000033) were demonstrated. We designed our proposed method over PhysioNet Computing in Cardiology 2018 database.Entities:
Keywords: ECG; authentication; biometric; random forest; security; wearables
Mesh:
Year: 2020 PMID: 32455686 PMCID: PMC7284328 DOI: 10.3390/s20102920
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Related Works Summary.
| Paper | # of | # of | Acquisition | Classifier | Metrics | Observation |
|---|---|---|---|---|---|---|
| Biel et al. [ | 20 | 10 to 360 | Siemens | PCA built | 100% | Fiducial |
| Camara et al. [ | 10 | 256 | IMDs | DT, SVM, | 93.5% | Non-fiducial |
| Zhang and Wu [ | 85 | 9 | PTB, MITDB, | SVM, NN, | 96.6% | Used voting |
| Zhang, Y et al. [ | 100 | 10 + Feature | MIT-BH | PCA + LDA | 99% | Used hybrid |
| Zhang, Q et al. [ | 220 | Wavelet | CEBSDB, | 1D-CNN | 96.5% | Used 1D-CNN |
| Labati et al. [ | 237 | V vectors | E-HOL-03- | 1D-CNN | 100%acc | Preferred use |
| Zhang, Y et al. [ | 319 | 400 | PTB, CEBSDB, | SVM, NNC | 96.5–99.1% | Used |
| Cao et al. [ | N/A | 8528 | Computing in | RNN, LSTM | F1 score: | |
| Pouryayevali et al. [ | 1012 | UofTDB | LDA | Evaluate | collected | |
| Alotaiby et al. [ | 290 | 11 | PTB | RF | 99.61%, | used direct |
Figure 1Overview of all processing steps of DETECT.
Figure 2Unfiltered and filtered signal.
Features captured directly from ECG stream.
| No. | Features |
|---|---|
| 1 | Mean Q Peak amplitude ( |
| 2 | Mean R Peak amplitude ( |
| 3 | Mean S Peak amplitude ( |
| 4 | Q Peak Standard Deviation |
| 5 | R Peak standard deviation |
| 6 | S Peak standard deviation |
| 7 | QRS amplitude ( |
| 8 | R wave duration (ms) |
| 9 | R-R Interval (ms) |
| 10 | Mean P Peak amplitude ( |
| 11 | Mean T Peak amplitude ( |
| 12 | Mean QS Distance |
| 13 | Mean QT Distance |
| 14 | Mean QRS onset amplitude ( |
| 15 | Mean QRS offset amplitude ( |
| 16 | Mean QT interval |
| 17 | Mean ST interval |
| 18 | Mean T wave |
| 19 | Mean PQ segment |
| 20 | Mean ST segment |
| 21 | Mean TP segment |
| 22 | Mean PP interval |
Figure 3Example of P-QRS-T cycle, with the peaks of all the relevant waveforms.
Figure 4Part of a decision tree from one of the one-vs-all model random forest base estimators.
Figure 5FAR results for each scenario of data improvement models.
Figure 6FRR results for each scenario of data improvement models.
Mean FAR and FRR values and standard deviation in each data improvement models.
| Scenario | FAR | Standard Deviation | FRR | Standard Deviation |
|---|---|---|---|---|
| Scenario 1 | 0.000194 | 0.000207 | 0.383813 | 0.312530 |
| Scenario 2 | 0.000079 | 0.000069 | 0.156959 | 0.134858 |
| Scenario 3 | 0.000080 | 0.000071 | 0.153801 | 0.143330 |
| DETECT | 0.000033 | 0.000042 | 0.064003 | 0.084876 |
Figure 7Precision distribution for the different user identification models.
Figure 8Precision results for each scenario of data improvement.
Figure 9Recall results for each scenario of data improvement.
Figure 10F1-score results for each scenario of data improvement.
Figure 11Results for different dataset size for the DETECT and standard model.