| Literature DB >> 30836714 |
Marcos Quintana1, Sezer Karaoglu2,3, Federico Alvarez4, Jose Manuel Menendez5, Theo Gevers6,7.
Abstract
Latest advances of deep learning paradigm and 3D imaging systems have raised the necessity for more complete datasets that allow exploitation of facial features such as pose, gender or age. In our work, we propose a new facial dataset collected with an innovative RGB⁻D multi-camera setup whose optimization is presented and validated. 3DWF includes 3D raw and registered data collection for 92 persons from low-cost RGB⁻D sensing devices to commercial scanners with great accuracy. 3DWF provides a complete dataset with relevant and accurate visual information for different tasks related to facial properties such as face tracking or 3D face reconstruction by means of annotated density normalized 2K clouds and RGB⁻D streams. In addition, we validate the reliability of our proposal by an original data augmentation method from a massive set of face meshes for facial landmark detection in 2D domain, and by head pose classification through common Machine Learning techniques directed towards proving alignment of collected data.Entities:
Keywords: 3D data collection; 3D face modelling; deep learning; face landmark detection; head pose classification
Year: 2019 PMID: 30836714 PMCID: PMC6427725 DOI: 10.3390/s19051103
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Graphical description of the proposed scenario to capture the 3DWF dataset. Coordinates of the markers are expressed in meters.
Evaluation of the distance from the model to the frontal camera.
| Distance (cm) | Cloud Points |
|---|---|
| 80 | 7754 |
| 100 | 5871 |
| 120 | 4342 |
Main features of the light source employed in 3DWF setup.
| Feature | Value |
|---|---|
| Power | 36 W |
| Color temperature | 5800 K ± 300 K |
| Luminous flux | 4200 lm |
Evaluation of the influence of the light source luminous flux.
| Lm | Points of the Cloud | ||
|---|---|---|---|
| Cloud Cam. 1 | Cloud Cam. 2 | Cloud Cam. 3 | |
| 5000 | 5859 | 7693 | 7460 |
| 2625 | 6029 | 7896 | 7986 |
| 1435 | 6084 | 8221 | 7562 |
| 250 | 6621 | 8841 | 8935 |
Evaluation of the orientation of the light sources.
| Focus | Angles | Cam 1 | Cam 2 | Cam 3 | ||||
|---|---|---|---|---|---|---|---|---|
| Light | Cams |
| MAD |
| MAD |
| MAD | |
| Front | 30 | 40 | 189 | 56 | 87 | 58 | 124 | 73 |
| Side | 40 | 30 | 138 | 51 | 89 | 51 | 128 | 62 |
| Front | 30 | 60 | 164 | 50 | 100 | 68 | 122 | 81 |
Evaluation of the orientation of the cameras.
| Angle | Cam 1 | Cam 2 | Cam 3 | |||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| 30 | 129.35 | 62.69 | 94.22 | 66.38 | 118.35 | 72.43 |
| 40 | 172.00 | 71.09 | 98.29 | 71.37 | 122.51 | 75.80 |
| 60 | 171.50 | 71.90 | 108.674 | 79.94 | 124.57 | 85.75 |
Figure 2Graphics showing the most important features of the subjects included in 3DWF dataset: age and gender.
Figure 3Raycasting geometry model with a plane and a pinhole. Extracted from [11] and reproduced with permission from Prof. House.
Figure 4Block diagram of the different steps involved in 3D reconstruction of faces.
Figure 5Graphical detailing of the procedure followed to reconstruct the 3D models upon the depth and information captured by the three RGB–D devices.
Figure 62D images extracted from the final face clouds proposed by this work.
Figure 72D images extracted from the final face clouds for Marker 1 without texture mapping proposed by this work.
Parameters for training 3DU dataset.
| Dataset | Subjects | Learning Rate | ||
|---|---|---|---|---|
| Train. | Val. | RCN | Vanilla | |
| 3DU | 35 | 12 |
|
|
Parameters for training AFLW dataset.
| Dataset | Images | Learning Rate | |||
|---|---|---|---|---|---|
| Train | Val. | Test | RCN | Vanilla | |
| AFLW | 9000 | 3000 | 1000 |
|
|
Figure 8Results obtained for both pipelines of face landmark detection with different combinations of datasets.
Representation of the last layer of convolution from VainillaCNN.
| Conv 3DU | Conv 3DU + AFLW |
|
|
|
Representation of the last layer of max-pooling from VainillaCNN.
| Pool 3DU | Pool 3DU + AFLW |
|
|
|
Figure 9Graphical plots of Euler Angles obtained for the different markers of 3DWF.
Results for head pose classification.
| Method | Training Samples | Testing Samples | Training Accuracy | Testing Accuracy |
|---|---|---|---|---|
| LDA | 80% | 20% | 82% | 83% |
| GNB | 80% | 20% | 84% | 84% |
Figure 10Confusion Matrix calculated for Head Pose validation method.