| Literature DB >> 36236693 |
Kaixuan Du1,2, Xianghong Che2, Yong Wang2, Jiping Liu2, An Luo2, Ruiyuan Ma2, Shenghua Xu2.
Abstract
There is a critical need for detection of administrative regions through network map pictures in map censorship tasks, which can be implemented by target detection technology. However, on map images there tend to be numerous administrative regions overlaying map annotations and symbols, thus making it difficult to accurately detect each region. Using a RetinaNet-based target detection model integrating ResNet50 and a feature pyramid network (FPN), this study built a multi-target model and a single-target cascading model from three single-target models by taking Taiwan, Tibet, and the Chinese mainland as target examples. Two models were evaluated both in classification and localization accuracy to investigate their administrative region detection performance. The results show that the single-target cascading model was able to detect more administrative regions, with a higher f1_score of 0.86 and mAP of 0.85 compared to the multi-target model (0.56 and 0.52, respectively). Furthermore, location box size distribution from the single-target cascading model looks more similar to that of manually annotated box sizes, which signifies that the proposed cascading model is superior to the multi-target model. This study is promising in providing support for computer map reading and intelligent map censorship.Entities:
Keywords: RetinaNet; administrative regions; map pictures; multi-target model; single-target cascading model
Mesh:
Year: 2022 PMID: 36236693 PMCID: PMC9572589 DOI: 10.3390/s22197594
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1Basic network model structure of RetinaNet.
Figure 2Single-target cascading and multi-target detection model.
Figure 3Different thematic maps: (a) seismic zone; (b) climate; (c) transportation and mountain terrain.
Sample distribution of different targets.
| Region of Interest | Training Dataset | Test Dataset | Total | |
|---|---|---|---|---|
| Target 1 | Taiwan | 2151 | 538 | 2689 |
| Target 2 | Tibet | 582 | 146 | 728 |
| Target 3 | Chinese mainland | 459 | 115 | 574 |
| Total | 3192 | 799 | 3991 |
Target annotation format.
| No. | path_img_file | box_x | box_y | Width | Height | Label |
|---|---|---|---|---|---|---|
| 1 | dataset/image_0001.jpg | 890 | 659 | 944 | 743 | Taiwan |
| 2 | dataset/ image_0002.jpg | 775 | 631 | 845 | 721 | Taiwan |
| 3 | dataset/ image_0003.jpg | 36 | 57 | 762 | 535 | Xizang |
| 4 | dataset/ image_0004.jpg | 5 | 51 | 536 | 316 | Xizang |
| 5 | dataset/ image_0005.jpg | 5 | 2 | 341 | 289 | Chinese mainland |
| 6 | dataset/ image_0006.jpg | 95 | 93 | 666 | 546 | Chinese mainland |
Figure 4Training loss of single-target and multi-target detection models: (a) Training loss of model for target 1 (Taiwan); (b) Training loss of model for target 2 (Tibet); (c) Training loss of model for target 3 (Chinese mainland); (d) Training loss of model for all three targets.
Accuracy statistics of different targets with the single-target cascading detection model.
| Single-Target Model | Single-Target Cascading Detection Models | |||
|---|---|---|---|---|
| Taiwan | Tibet | Chinese Mainland | Taiwan, Tibet, and Chinese Mainland | |
| Precision (P) | 0.92 | 0.77 | 0.52 | 0.80 |
| Recall (R) | 0.91 | 0.96 | 0.94 | 0.93 |
| f1_socre | 0.92 | 0. 86 | 0.67 | 0.86 |
Accuracy statistics of different targets with the multi-target detection model.
| Multi-Target Model | ||||
|---|---|---|---|---|
| Taiwan | Tibet | Chinese Mainland | Taiwan, Tibet, and Chinese Mainland | |
| Precision (P) | 0.94 | 0.53 | 0.3 | 0.77 |
| Recall (R) | 0.78 | 0.31 | 0.05 | 0.44 |
| f1_socre | 0.85 | 0.39 | 0.10 | 0.56 |
Figure 5Annotations of different targets in maps (top row) and detected results from the single-target cascading model (middle row) and multi-target model (bottom row). (Different colors of the boxes indicate different types of targets, and labels indicate the target types; score is for classification confidence; IOU is for localization confidence).
Figure 6P–R curves of two categories of models: (a) single-target cascading model; (b) multi-target model.
Figure 7Comparison of AP between the single-target cascading model and the multi-target model.
Figure 8Distribution of ground truth and prediction: (a) Distribution of box size in ground truth; (b) Distribution of box size in prediction of single-target cascading model; (c) Distribution of box size in prediction of multi-target model.