| Literature DB >> 35480229 |
Huang Chengcheng1,2, Yuan Jian1,3, Qin Xiao1.
Abstract
With the rapid development of apparel e-commerce, the variety of apparel is increasing, and it becomes more and more important to classify the apparel according to its collar design. Traditional image processing methods have been difficult to cope with the increasingly complex image backgrounds. To solve this problem, an EMRes-50 classification algorithm is proposed to solve the problem of garment collar image classification, which is designed based on the ECA-ResNet50 model combined with the MC-Loss loss function method. Applying the improved algorithm to the Coller-6 dataset, and the classification accuracy obtained was 73.6%. To further verify the effectiveness of the algorithm, it was applied to the DeepFashion-6 dataset, and the classification accuracy obtained was 86.09%. The experimental results show that the improved model has higher accuracy than the existing CNN model, and the model has better feature extraction ability, which is helpful to solve the problem of the difficulty of fine-grained collar classification and promote the further development of clothing product image classification.Entities:
Keywords: attention mechanism; clothing classification; collar classification; convolutional neural network; loss function
Year: 2022 PMID: 35480229 PMCID: PMC9035927 DOI: 10.3389/fncom.2021.766284
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 3.387
Figure 1ECA module (Wang Q. et al., 2020).
Figure 2A is the residual block, B is the residual block introduced into the ECA module.
Figure 3Components of the interpass loss function.
EMRes-50 network structure.
| EMRes-50 network structure | ||
|---|---|---|
| 7 × 7 conv 64 | ||
| 3 × 3 conv 64 | ||
| C:1 × 1 | Conv 64 |
|
| C:3 × 3 | Conv 64 | |
| C:1 × 1 | Conv 256 | |
| ECA module | 256 | |
| C:1 × 1 | Conv 128 |
|
| C:3 × 3 | Conv 128 | |
| C:1 × 1 | Conv 256 | |
| ECA module | 256 | |
| C:1 × 1 | Conv 256 |
|
| C:3 × 3 | Conv 256 | |
| C:1 × 1 | Conv 1,024 | |
| ECA module | 1,024 | |
| C:1 × 1 | Conv 512 |
|
| C:3 × 3 | Conv 512 | |
| C:1 × 1 | Conv 2,048 | |
| ECA module | 2,048 | |
| C:1 × 1 | Conv 2,200 | |
Average pool, 6d, fc, softmax.
Figure 4Schematic diagram of EMRes-50 training phase architecture.
EMRes-50 weight update process.
|
|
|---|
| Input: Let the network have N layers, |
| 1: |
| 2: |
| 3: |
| 4: |
| 5: |
| 6: Updata by |
| 7: Updata by |
| 8: End |
Collar-6 experimental data distribution.
|
|
|
|
|
|---|---|---|---|
| Round collar | 2,480 | 620 | 3,100 |
| Lapel collar | 2,608 | 652 | 3,260 |
| Stand collar | 2,464 | 616 | 3,080 |
| Hooded collar | 2,560 | 640 | 3,200 |
| V collar | 2,468 | 617 | 3,085 |
| Fur lapels | 3,122 | 625 | 3,122 |
Figure 5Some images from the Collar-6 dataset.
Distribution of experimental data of DeepFashion-6.
|
|
|
|
|
|---|---|---|---|
| Dress | 2,555 | 639 | 3,194 |
| Jacket | 2,505 | 627 | 3,132 |
| Jeans | 2,412 | 603 | 3,015 |
| Shorts | 2,541 | 636 | 3,177 |
| Tank | 2,528 | 632 | 3,160 |
| Tee | 2,439 | 610 | 3,049 |
Comparison with variants of ResNet or improved models based on ResNet.
|
|
|
|---|---|
| ResNeXt50 | 73.05 |
| CBAEMRes-50 | 63.68 |
| SE-ResNet50 | 63.44 |
| SCNet50 | 66.07 |
| Res2Net50 | 73.44 |
|
|
|
Comparison with lightweight model.
|
|
|
|---|---|
| MobileNetV3_large | 68.48 |
| MobileNetV3_small | 63.23 |
| GhostNet | 60.98 |
| SqueezeNet1_0 | 63.76 |
|
|
|
Comparison with other models.
|
|
|
|---|---|
| AlexNet | 67.53 |
| Xception | 70.53 |
| VGG16 | 63.68 |
| VGG19 | 65.62 |
|
|
|
Comparison of model accuracy in the DeepFashion-6 dataset.
|
|
|
|---|---|
| AlexNet | 83.50 |
| ResNet50_CBAM | 82.78 |
| GhostNet | 82.84 |
| InceptionV3 | 73.36 |
| MobileNet_large | 83.77 |
| MobileNet_small | 83.40 |
| Res2Net | 85.13 |
| SCNet | 79.57 |
| SqueezeNet1_0 | 82.03 |
| Xception | 85.01 |
|
|
|
Models for ablation experiments on the Collar-6 dataset.
|
|
|
|---|---|
| ResNet50 | 59.36 |
| ResNet50+MC-Loss | 66.84 |
| ResNet50+ECA | 57.50 |
|
|
|
Ablation experiments of the model on the DeepFashion-6 dataset.
|
|
|
|---|---|
| ResNet50 | 81.47 |
| ResNet50+MC-Loss | 84.53 |
| ResNet50+ECA | 80.94 |
|
|
|
Different effects of different k on the ECA module on the Collar-6 dataset.
|
|
|
|---|---|
| 5 | 70.82 |
| 7 | 67.37 |
|
|
|