| Literature DB >> 35242945 |
Houssam Halmaoui1,2, Abdelkrim Haqiq2.
Abstract
In a previous publication [1], we created a dataset of feature patches for detection model training. In this paper, we use the same patches to create a new large synthetic dataset of feature pairs, similar and different, in order to perform, thanks to a siamese convolutional model, the description and matching of the detected features. We thus complete the entire matching pipeline. The accurate manual labeling of image features being very difficult because of their large number and the various associated parameters of position, scale and rotation, recent deep learning models use the result of handcrafted methods for training. Compared to existing datasets, ours avoids model training with false detections of the extraction of feature patches by other algorithms, or with inaccuracy errors of manual labeling. The other advantage of synthetic patches is that we can control their content (corners, edges, etc.), as well as their geometric and photometric parameters, and therefore we control the invariance of the model. The proposed datasets thus allow a new approach to train the different matching modules without using traditional methods. To our knowledge, these are the first feature datasets based on generated synthetic patches for image matching.Entities:
Keywords: Feature descriptors; Feature pairs; Feature patch dataset; Interest points; Keypoints; Learned features; Matching model; Matching pipeline
Year: 2022 PMID: 35242945 PMCID: PMC8873551 DOI: 10.1016/j.dib.2022.107965
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Random samples from the the patch dataset with different geometric and photometric transformations. Labels 1 and 0 correspond, respectively, to features (corners) and non-features (lines and uniform regions).
Fig. 3Example of a mosaic image (top left) and the corresponding feature map before (top right) and after adding the features at the intersection (bottom right). Illustration of the new features potentially generated at the intersection of four neighboring patches (bottom left).
Fig. 5Random samples from the dataset of feature pairs. Similar pairs have a label 1 and different pairs have a label 0.
Fig. 2Convolutional sliding window model for fast feature detection.
Fig. 4Training of the convolutional sliding window model on mosaic dataset: MSE Loss and cosine similarity metric, according to the number of iterations, for training set and validation set.
Values of MSE and cosine similarity after training the detection model on the mosaic dataset.
| Loss (MSE) | Cosine Similarity | ||||
|---|---|---|---|---|---|
| Train | Validation | Test | Train | Validation | Test |
| 0.0011 | 0.0022 | 0.0023 | 0.964 | 0.9137 | 0.9395 |


Fig. 6Siamese convolutional model (top) and CNN architecture used (bottom).
Fig. 7Training of the siamese model on feature pairs dataset: Loss and accuracy, according to the number of iterations, for training set and validation set.
Contrastive loss and accuracy after learning the matching model on the feature pairs dataset.
| Contrastive loss | Accuracy | ||||
|---|---|---|---|---|---|
| Train | Validation | Test | Train | Validation | Test |
| 0.011 | 0.0146 | 0.0148 | 98.59% | 98.13% | 98.09% |
Fig. 8From top to bottom: Simulation of several transformations of 30× rotation, scale 0.8, noise , motion blur with kernel and linear change of luminance. Left: matching of the detected features. Right: estimation of the homography from the matched features.
| Subject | Computer Vision and Pattern Recognition |
| Specific subject area | Image matching |
| Type of data | Image |
| How data were acquired | Datasets of feature patches, feature mosaic images and feature pairs were generated using Matlab and Python programming languages on an Intel i7 computer, then, used for model training with Python on GPU (Tesla P100-PCIE-16GB). |
| Data format | Raw |
| Parameters for data collection | The images were carefully selected based on the different geometric and photometric variables of rotation, angle, luminance, contrast, blur and noise. |
| Description of data collection | The patch dataset consists of about |
| Data source location | Institution: ISMAC - Higher Institute of Audiovisual and Film Professions |
| Data accessibility | Repository name: Mendeley Data |
| Related research article | H. Halmaoui, A. Haqiq, Convolutional sliding window based model and synthetic dataset for fast feature detection, in: The International Conference on Artificial Intelligence and Computer Vision, Springer, 2021, pp. 101-111. |