| Literature DB >> 35632314 |
Baoquan Shi1,2,3, Weichen Gu1,2,3, Xudong Sun1,2,3.
Abstract
A low-cost and power-efficient video surveillance system, named XDMOM, is developed for real-time moving object detection outdoors or in the wild. The novel system comprises four parts: imaging subsystem, video processing unit, power supply, and alarm device. The imaging subsystem, which consists of a dual-spectrum camera and rotary platform, can realize 360-degree and all-day monitoring. The video processing unit uses a power-efficient NVIDIA GeForce GT1030 chip as the processor, which ensures the power consumption of the whole system maintains a low level of 60~70 W during work. A portable lithium battery is employed to supply power so that the novel system can be used anywhere. The work principle is also studied in detail. Once videos are recorded, the single-stage neural network YOLOv4-tiny is employed to detect objects in a single frame, and an adaptive weighted moving pipeline filter is developed to remove pseudo-targets in the time domain, thereby reducing false alarms. Experimental results show that the overall correct alarm rate of the novel system could reach 85.17% in the daytime and 81.79% at night when humans are monitored in real outdoor environments. The good performance of the novel system is demonstrated by comparison with state-of-the-art video surveillance systems.Entities:
Keywords: all-day monitoring; in the wild; moving object detection; outdoors; real-time; video surveillance system
Mesh:
Year: 2022 PMID: 35632314 PMCID: PMC9144562 DOI: 10.3390/s22103905
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Performance analysis of the state-of-the-art video surveillance systems.
| System | Resolution | Speed/fps | Power-Efficiency | All-Day Monitoring | Long-Term Monitoring |
|---|---|---|---|---|---|
| REDBEE [ | 640 × 480 | 16.2 | × | × | × |
| Mori et al. [ | 800 × 480 | 26.6 | √ | × | √ |
| Wang et al. [ | 720 × 576 | 22 | √ | × | √ |
| Soc system [ | 640 × 480 | 15 | √ | × | √ |
| Dong et al. [ | 640 × 480 | 25 | × | × | √ |
| Iqbal et al. [ | 800 × 600 | 15.1 | × | × | × |
| Alam et al. [ | 1280 × 720 | 30 | × | × | × |
| AURORA [ | 640 × 480 | 5 | × | × | × |
| DATMO [ | 640 × 480 | 10 | × | × | × |
|
|
|
|
|
|
|
Figure 1Hardware composition of the novel system. In practical use, the LCD is unnecessary.
Figure 2Details of (a) dual-spectrum camera (VS and IR camera) and (b) industrial computer.
Figure 3Software of the novel system in visible-spectrum mode.
Figure 4Software of the novel system in infrared mode.
Core indicators of the novel system.
| Item | Value |
|---|---|
| Resolution | 720 × 576 |
| Frame rate | 25 fps |
| Field of view | 360-degree |
| Monitoring range | 100 m |
| Monitoring interval | All-day monitoring |
| Power consumption | 60~70 W |
| Correct alarm rate | 85.17% during the day and 81.79% at night |
| Application scene | Outdoors or in the wild |
Figure 5Workflow of the novel system.
Figure 6Detailed structure of YOLOv4-tiny.
Figure 7Two classical pipelines: (a) circular pipeline and (b) rectangular pipeline.
Figure 8Schematic diagram of the adaptive weighted moving pipeline filtering (AWMPF).
Figure 9AWMPF applied to (a) VS frames and (b) IR frames. The rectangles with the same color represent the pipeline of a person. The red arrows are pointing to pseudo-targets.
Comparison of moving object detection results with and without applying AWMPF.
| Videos | Total Alarms | False Alarms | |
|---|---|---|---|
| Without Applying AWMPFM | With Applying AWMPFM | ||
| VS | 81 | 18 (22.22%) | 1 (1.23%) |
| IR | 139 | 19 (13.67%) | 8 (5.76%) |
Moving objects retrained in the novel system.
| Object Type | VS Mode | IR Mode |
|---|---|---|
| Vehicles | car, bus, truck | |
| Humans | person | person/people |
| Animals | cat, dog, horse, bird, sheep, cow |
Figure 10The relationship between loss and iteration during training.
Figure 11The novel system in the testing scenario.
Figure 12Some screenshots of (a–c) VS videos and (d–f) IR videos. Detected moving objects (people) are labeled.
Statistics of monitoring results of VS videos.
| Videos | Total Alarms | Correct Alarms | False Alarms | Missed Alarms |
|---|---|---|---|---|
| 1 | 38 | 32 (84.21%) | 0 (0.0%) | 6 (15.79%) |
| 2 | 95 | 83 (87.37%) | 0 (0.0%) | 12 (12.63%) |
| 3 | 39 | 32 (82.05%) | 0 (0.0%) | 7 (17.95%) |
| 4 | 37 | 32 (86.49%) | 0 (0.0%) | 5 (13.51%) |
| 5 | 81 | 68 (83.95%) | 1 (1.23%) | 13 (16.05%) |
| total | 290 | 247 (85.17%) | 1 (0.34%) | 43 (14.83%) |
Statistics of monitoring results of IR videos.
| Videos | Total Alarms | Correct Alarms | False Alarms | Missed Alarms |
|---|---|---|---|---|
| 1 | 139 | 109 (78.42%) | 8 (5.76%) | 30 (21.58%) |
| 2 | 143 | 110 (76.92%) | 5 (3.50%) | 33 (23.08%) |
| 3 | 104 | 91 (87.50%) | 6 (5.77%) | 13 (12.50%) |
| 4 | 174 | 148 (85.06%) | 10 (5.75%) | 26 (14.94%) |
| total | 560 | 458 (81.79%) | 29 (5.18%) | 102 (18.21%) |
The processing time analysis with the novel system running offline. (Note that the system works online at a frame rate of 25 fps).
| Videos | Number of Frames | Processing Time (s) | Frame Rate (fps) |
|---|---|---|---|
| VS1 | 2475 | 70.714 | 35 |
| VS2 | 4775 | 136.429 | 35 |
| IR1 | 2350 | 67.143 | 35 |
| IR2 | 2500 | 71.429 | 35 |
Power consumption analysis of the novel system.
| System | Voltage (V) | Current (A) | Power (W) |
|---|---|---|---|
| XDMOM | 12 | 5.0~5.8 | 60~69.6 |
| Dong et al. [ | - | - | >180 |
| Alam et al. [ | - | - | >400 |
| AURORA [ | - | - | >720 |
The values listed in the table for the equipment presented by Dong et al. [18], for the equipment developed by Alam et al. [20], and for the system AURORA [21], comprise only the power consumption of key components; the whole system consumes more power.
Detection precision comparison of our system with state-of-the-art systems.
| Systems | VS Videos | IR Videos |
|---|---|---|
| Iqbal et al. [ | 79.0% | -- |
| Alam et al. [ | 83.19% | -- |
| Wang et al. [ | -- | 66.0% |
| Zhang et al. [ | 79.0% | 79.68% |
|
|
|
|