| Literature DB >> 30210533 |
Junying Zeng1, Xiaoxiao Zhao1, Junying Gan1, Chaoyun Mai1, Yikui Zhai1, Fan Wang1.
Abstract
Face recognition (FR) with single sample per person (SSPP) is a challenge in computer vision. Since there is only one sample to be trained, it makes facial variation such as pose, illumination, and disguise difficult to be predicted. To overcome this problem, this paper proposes a scheme combined traditional and deep learning (TDL) method to process the task. First, it proposes an expanding sample method based on traditional approach. Compared with other expanding sample methods, the method can be used easily and conveniently. Besides, it can generate samples such as disguise, expression, and mixed variation. Second, it uses transfer learning and introduces a well-trained deep convolutional neural network (DCNN) model and then selects some expanding samples to fine-tune the DCNN model. Third, the fine-tuned model is used to implement experiment. Experimental results on AR face database, Extend Yale B face database, FERET face database, and LFW database demonstrate that TDL achieves the state-of-the-art performance in SSPP FR.Entities:
Mesh:
Year: 2018 PMID: 30210533 PMCID: PMC6126063 DOI: 10.1155/2018/3803627
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1The framework of the proposed method.
Figure 2The basic principle diagram of the expanding sample method.
The algorithm of generating intraclass variation set.
| Input: an extrafrontal face dataset |
| Output: intraclass variation set |
| (1) calculate: |
|
|
| where |
| (2) calculate: |
|
|
| (3) output intraclass variation set, as follows: |
|
|
Figure 3The framework of generating intraclass variation set.
Figure 4The framework of expanding sample.
Figure 5The architecture of the lightened CNN.
The algorithm of measuring similarity between expanding samples and actual images.
| Input: expanding samples |
| Output: the similarity of the |
| 1. Calculate |
| 2. Calculate |
| 3. Initialize |
| 4. for ( |
|
|
|
|
|
|
|
|
|
|
| 5. Calculate |
The thresholds of intraclass variation.
| Number | 2 | 3 | 4 | 5 | 6 |
| Threshold | 802.3 | 814.1 | 873.0 | 839.5 | 804.3 |
| Number | 7 | 8 | 9 | 10 | 11 |
| Threshold | 834.0 | 855.1 | 914.1 | 898.5 | 636.6 |
| Number | 12 | 13 | 14 | 15 | 16 |
| Threshold | 780.9 | 835.8 | 815.2 | 848.9 | 880.1 |
| Number | 17 | 18 | 19 | 20 | 21 |
| Threshold | 889.9 | 864.3 | 850.0 | 856.3 | 895.4 |
| Number | 22 | 23 | 24 | 25 | 26 |
| Threshold | 953.5 | 945.2 | 614.4 | 793.9 | 804.4 |
The similarities between expanding database and AR database.
| Number | 1 | 2 | 3 | 4 | 5 | 6 |
| Similarity | 100% | 100% | 100% | 99% | 100% | 100% |
| Number | 7 | 8 | 9 | 10 | 11 | 12 |
| Similarity | 100% | 88% | 98% | 98% | 4% | 26% |
| Number | 13 | 14 | 15 | 16 | 17 | 18 |
| Similarity | 40% | 98% | 97% | 98% | 94% | 98% |
| Number | 19 | 20 | 21 | 22 | 23 | 24 |
| Similarity | 98% | 97% | 80% | 94% | 93% | 1% |
| Number | 25 | 26 | — | — | — | — |
| Similarity | 29% | 29% | — | — | — | — |
Figure 6The accuracies in session 1 by using different parts to fine-tune the lightened CNN.
Figure 7The accuracies in session 2 by using different parts to fine-tune the lightened CNN.
Figure 8The losses in session 1 by using different parts to fine-tune the lightened CNN.
Figure 9The losses in session 2 by using different parts to fine-tune the lightened CNN.
Accuracy (%) on AR face database (session 1).
| Method | Illu | Exp | Dis | Disill |
|---|---|---|---|---|
| SRC [ | 80.8 | 85.4 | 55.6 | 25.3 |
| CRC [ | 80.5 | 80.4 | 58.1 | 23.8 |
| AGL [ | 93.3 | 77.9 | 70.0 | 53.8 |
| DMMA [ | 92.1 | 81.4 | 46.9 | 30.9 |
| PNN [ | 84.6 | 86.7 | 90.0 | 72.5 |
| PCRC [ | 95.0 | 86.7 | 95.6 | 81.3 |
| ESRC [ | 99.6 | 85.0 | 83.1 | 68.6 |
| SVDL [ | 98.3 | 86.3 | 86.3 | 79.4 |
| LGR [ | 100 | 97.9 | 98.8 | 96.3 |
| TLC [ | 100 | 98.3 | 99.4 | 98.1 |
| TDL | 100 | 100 | 99.5 | 99.3 |
Illu, illumination; Exp, expression; Dis, disguise; Disill, illumination + disguise.
Accuracy (%) on AR face database (session 2).
| Method | Illu | Exp | Dis | Disill |
|---|---|---|---|---|
| SRC [ | 55.8 | 68.8 | 29.4 | 12.8 |
| CRC [ | 55.8 | 69.6 | 35.0 | 13.5 |
| AGL [ | 70.8 | 55.8 | 40.6 | 30.7 |
| DMMA [ | 77.9 | 61.7 | 28.1 | 21.9 |
| PNN [ | 77.5 | 73.8 | 71.9 | 52.8 |
| PCRC [ | 88.8 | 71.7 | 81.8 | 63.1 |
| ESRC [ | 87.9 | 70.4 | 59.4 | 45.0 |
| SVDL [ | 87.1 | 74.2 | 61.3 | 54.1 |
| LGR [ | 97.5 | 85.0 | 93.8 | 88.8 |
| TLC [ | 99.2 | 87.1 | 96.3 | 91.9 |
| TDL | 100 | 100 | 100 | 99.3 |
Accuracy on Extend Yale B face database.
| Method | Accuracy (%) |
|---|---|
| SRC [ | 49.2 |
| CRC [ | 51.2 |
| AGL [ | 59.5 |
| DMMA [ | 61.7 |
| PNN [ | 67.5 |
| PCRC [ | 77.8 |
| ESRC [ | 67.9 |
| SSAE [ | 82.2 |
| SVDL [ | 85.0 |
| LGR [ | 86.6 |
| TDL | 88.3 |
Accuracy on FERET database.
| Method | Accuracy (%) |
|---|---|
| PCA [ | 84.0 |
| (PC)2A [ | 84.5 |
| E (PC)2A [ | 85.5 |
| 2DPCA [ | 84.5 |
| (2D)2PCA [ | 85.0 |
| SOM [ | 91.0 |
| LPP [ | 84.0 |
| SVD - LDA [ | 85.5 |
| Block PCA [ | 84.5 |
| Block LDA [ | 86.5 |
| UP [ | 90.0 |
| DMMA [ | 93.0 |
| Fast DMMA [ | 91.0 |
| TDL | 93.9 |
Accuracy on LFW database.
| Method | Accuracy (%) |
|---|---|
| SRC [ | 20.4 |
| CRC [ | 19.8 |
| AGL [ | 19.2 |
| DMMA [ | 17.8 |
| PNN [ | 17.6 |
| PCRC [ | 24.2 |
| ESRC [ | 27.3 |
| SVDL [ | 28.6 |
| LGR [ | 30.4 |
| TDL | 74 |