| Literature DB >> 35368446 |
Devashree R Patrikar1, Mayur Rajaram Parate1.
Abstract
The current concept of smart cities influences urban planners and researchers to provide modern, secured and sustainable infrastructure and gives a decent quality of life to its residents. To fulfill this need, video surveillance cameras have been deployed to enhance the safety and well-being of the citizens. Despite technical developments in modern science, abnormal event detection in surveillance video systems is challenging and requires exhaustive human efforts. In this paper, we focus on evolution of anomaly detection followed by survey of various methodologies developed to detect anomalies in intelligent video surveillance. Further, we revisit the surveys on anomaly detection in the last decade. We then present a systematic categorization of methodologies for anomaly detection. As the notion of anomaly depends on context, we identify different objects-of-interest and publicly available datasets in anomaly detection. Since anomaly detection is a time-critical application of computer vision, we explore the anomaly detection using edge devices and approaches explicitly designed for them. The confluence of edge computing and anomaly detection for real-time and intelligent surveillance applications is also explored. Further, we discuss the challenges and opportunities involved in anomaly detection using the edge devices.Entities:
Keywords: Anomaly detection; Edge computing; Machine learning; Video surveillance
Year: 2022 PMID: 35368446 PMCID: PMC8963404 DOI: 10.1007/s13735-022-00227-8
Source DB: PubMed Journal: Int J Multimed Inf Retr
Fig. 1Anomaly detection in video surveillance scenes. a A truck moving on the footpath (UCSD Dataset); b Pedestrian walking on the lawn (UCSD Dataset); c A person throwing an object (Avenue); d a person carrying a suspicious bag (Avenue); e Incorrect parking of vehicle (MDVD); f people fighting (MDVD); g a person catching a bag (ShanghaiTech); h vehicles moving on the footpath (ShanghaiTech)
Fig. 2General block diagram of anomaly detection
Evolution of anomaly detection techniques over the recent years
| Year | Handcrafted methods | CNN | DNN | LSTM | GAN |
|---|---|---|---|---|---|
| 2021 | [ | [ | [ | [ | |
| [ | |||||
| 2020 | [ | [ | [ | [ | |
| [ | |||||
| 2019 | [ | [ | [ | [ | |
| 2018 | [ | [ | [ | [ | |
| [ | [ | ||||
| 2017 | [ | [ | |||
| 2016 | [ | ||||
| 2015 | [ | [ | |||
| 2014 | [ |
Fig. 3Timeline for evolution of anomaly detection techniques
Recent surveys for anomalies in different context
| Year | Existing work | Broad topics |
|---|---|---|
| 2021 | Khosro Rezaee et al. [ | DL-based real-time crowd anomaly detection |
| 2021 | Rashmiranjan Nayak et al. [ | DL-based methods for video anomaly detection |
| 2020 | Bharathkumar Ramachandra et al. [ | A survey of single-scene video anomaly detection |
| 2019 | Gaurav Tripathi et al. [ | CNN for crowd behavior |
| 2019 | Ahmed et al. [ | Trajectory-based surveillance |
| 2018 | Shobha et al. [ | Vehicle detection, Recognition and tracking |
| 2016 | Yuan et al. [ | Moving object trajectory clustering |
| 2016 | Xiaoli Li et al. [ | Anomaly detection techniques |
| 2015 | Li et al. [ | Crowded scene analysis |
| 2014 | Tian et al. [ | Vehicle surveillance |
| 2013 | Sivaraman et al. [ | Vehicle detection, Tracking, |
| 2012 | Popoola et al. [ | Abnormal human behavior recognition |
| 2012 | Angela A. Sodemann [ | Human behavior detection |
Fig. 4Correlation of surveillance, surveillance targets, and associated anomalies
Fig. 5Anomaly classification
Anomaly detection classification
| Type of learning | Reference |
|---|---|
| Supervised | [ |
| Unsupervised | [ |
| [ | |
| Semi-supervised | [ |
Categorization of anomaly detection techniques
| Approach | Ref | Technique | Highlights |
|---|---|---|---|
| Statistical based | [ | Gaussian process regression | Hierarchical feature representation, Gaussian process regression. Spatiotemporal interest points (STIP) is used to detect local and global video anomaly detection; Dataset: Subway, UCSD, Behave, QMUL Junction; Parameters: AUC, EER (AUC: area under curve), EER: equal error rate) |
| [ | Histogram-based model | HOG, Histograms of oriented swarms (HOS) KLT interest point tracking are used to detect anomalous event in crowd; Aims at achieving high accuracy and low computational cost; Dataset used: UCSD, UMN; Parameters: ROI, EER, DR; (ROI: region of interest, DR: detection rate) | |
| [ | Histogram based | Crowd anomaly detection using histogram of magnitude and momentum (HoMM); background removal, feature extraction (optical flow), anomaly detection (using K-means clustering); Dataset: UCSD, UMN; Parameters: AUC, EER, RD (rate reduction) true positive rate (TPR), false positive rate (FPR) | |
| [ | Bayesian model | Bayesian nonparametric (BNP) approach, hidden Markov model (HMM) and Bayesian nonparametric factor analysis is employed for data segmentation and pattern discovery of abnormal events. Dataset: MIT parameters: energy distribution | |
| [ | Bayesian model | Bayesian nonparametric dynamic topic model is used. Hierarchical Dirichlet process (HDP) is used to detect anomaly. Dataset: QMUL-junction; Paremeters: ROC, AUC (ROC: receiver operating characteristics) | |
| [ | Gaussian classifier | Fully convolutional neural network, Gaussian classifier is used. It extracts distinctive features of video regions to detect anomaly. Dataset: UCSD, Subway; Parameters: ROC, EER, AUC; Aims: to increase the speed and accuracy. | |
| [ | Gaussian model | 3D CNN model used to extract spatiotemporal characteristics, Scene background modelled by Gaussian model Dataset: Subway, UCSD ped2, Parameters: AUC, EER | |
| [ | Histogram-based model | Structural context descriptor (SCD) selective histogram of optical flow (SHOF), 3D discrete cosine transform (DCT) object tracker, Spatiotemporal analysis for abnormal detection is crowd using Energy Function Dataset: UMN and UCSD; Parameters: ROC, AUC, TPR, FPR | |
| Proximity based | [ | Histogram | Histogram of optical flow and motion entropy (HOFME) is used to detect the pixel level features diverse anomalous events in crowd anomaly scenarios as compared with conventional features. Nearest neighbor threshold is used by HOFME. Dataset: UCSD, Subway, Badminton; Parameters: AUC, EER |
| [ | Accumulated relative density (ARD) | ARD method is used for large-scale traffic data and detect outliers; Dataset: self-deployed parameter: detection success rate (DSR) | |
| Proximity Based | [ | Density based | A weighted neighborhood density estimation is used to detect anomalies. Hierarchical context-based local kernel regression. Dataset: KDD, Shuttle; Parameters: precision, recall |
| [ | Density based | Trajectory-based DBSCAN clustering; Foreground segmentation by using active contouring; Dataset: UCF Web, Collective Motion, Violent Flows Parameters: MAE (mean absolute error) and F-score | |
| [ | Trajectory extraction flow analysis | Foreground segmentation by using active contouring density-based DBSCAN clustering. Dataset: UCF Web, collective motion, and violent flows parameters: MAE (mean absolute error) and F-score | |
| Classification based | [ | Adaptive sparse representation | Trajectory-based video anomaly detection using joint sparsity model. Dataset: CAVIAR. Parameters: ROC |
| [ | One-class classification | OCC-based anomaly detection techniques using SVM. Pixel level features are extracted.Dataset: PETS2009 and UMN; Parameters: ROC | |
| [ | Harr-Cascade, HOG feature extraction | Smart Surveillance as an Edge Network using Harr-Cascade, SVM, lightweight-CNN. Provides fast object detection and tracking. Dataset: VOC07, ImageNet; Parameters: accuracy | |
| [ | Stacked Sparse Coding | Intraframe classification strategy; Dataset: UCSD, Avenue, Subway; Parameters: EER, AUC, accuracy | |
| [ | Sequential deep trajectory descriptor | Dense trajectories are projected into 2D plane, long term motion CNN-RNN network is employed. Dataset: KTH, HMDB51, UCF101; Parameters: accuracy | |
| [ | Autoencoders, data reduction | Feature learning with deep learning, autoencoder is placed on the edge, decoder part is placed on the cloud Dataset: HAR, MHEALTH; Parameters: accuracy | |
| [ | Kanade–Lucas–Tomasi (KLT) tracker | k-means clustering, connected graphs and traffic flow theory are used estimate real-time parameters from aerial videos; Dataset: Self-Deployed; Parameters: speed, density, volume of vehicles | |
| [ | Deep neural network | Face mask identification; video restoration, face detection using DNN; edge computing-based dataset: bus drive monitoring and public dataset; Parameters: accuracy | |
| [ | 3D convolutional neural network (3D CNN) | Spatiotemporal features are extracted using temporal 3D CNN; Dataset:UCF-Crime dataset; Parameters: AUC; | |
| Reconstruction | [ | Hyperspectral image (HSI) Analysis | Discriminative reconstruction method for HSI anomaly detection with spectral learning (SLDR). Loss function of SLDR model is generated. Dataset: ABU, San Diego Parameters: ROC, AUC |
| [ | Hyperspectral imagery (HSI) | Low-rank sparse matrix decomposition (LRaSMD)-based dictionary reconstruction is used for anomaly detection. Dataset: STONE, AVIRIS Parameters: ROC | |
| [ | 3D convolutional network (C3D) | C3D network is used to perform feature extraction and detect anomaly using sparse coding, DL. Dataset: Avenue, Subway, UCSD ; Parameters: AUC, EER | |
| Reconstruction based | [ | Deep OC neural network | One stage model is used to learn compact features and train a DeepOC (Deep One Class)classifier. Dataset: UCSD, Avenue, Live Video; Parameters: ROC |
| [ | Generative adversarial network (GAN) | Anomaly detection using generative adversarial network for hyperspectral images (HADGAN). Dataset: ABU, San Diego, HYDICE; Parameters: ROC, computing time | |
| [ | Adversarial attention based, auto-encoder GAN | Normal patterns are learnt through adversarial attention-based auto-encoder and anomaly is detected. Dataset: ShanghaiTech, Avenue, UCSD, Subway; Parameters: AUC EER | |
| [ | Adversarial 3D Conv,Autoencoder | Spatiotemporal patterns are learnt using adversarial 3D Conv, Autoencoder to detect abnormal events in videos; Dataset: Subway, UCSD, Avenue, ShanghaiTech; Parameters: AUC/EER | |
| [ | Sparse reconstruction | Sparsity-based reconstruction method is used with low rank property to determine abnormal events. Datasets: UCSD, Avenue; Parameters: ROC, AUC, EER | |
| [ | Sparsity-based method | Abnormal event detection in traffic surveillance using low-rank sparse representation (CLSR). Dataset: UCSD, Subway, Avenue; Parameters: AUC, EER | |
| [ | GAN | Singular value decomposition (SVD) loss function for reconstruction; Dataset: UCSD, ShanghaiTech, Avenue Parameters: AUC, EER | |
| [ | GAN | Auto-encoder/generator = dense residual networks + self-attention; Discriminator= introduces self attention Dataset: CUHK Avenue, Shanghai Tech Parameters: AUC | |
| [ | GAN | Proposed GAN: Multi-scale U-Net unsupervised anomaly detection Dataset: UCSD, CHUK Avenue, ShanghaiTech Parameters: AUC | |
| [ | GAN | Deep spatiotemporal translation network based on GAN; edge wrapping: to reduce the noise Dataset: UCSD, UMN, CUHK Avenue | |
| [ | Dual discriminator-based GAN | Semi-supervised; dual discriminator-based GAN to generate and distinguish motion data; Dataset: CUHK, UCSD, Shanghai; Parameters: AUC, EER | |
| [ | GAN | Bidirectional retrospective GAN with frame prediction model; 3D CNN to capture temporal relations between frames; Dataset: Avenue, CUHK, UCSD, Shanghai; Parameters: AUC | |
| [ | GAN | Combination of prediction based and reconstruction based methods to construct a U-shaped GAN. Dataset: UCSD, CUHK Avenue, ShanghaiTech; Parameters: AUC | |
| Prediction Based | [ | Video prediction framework | Spatial/motion constraints are used for future frame prediction for normal events and identifies abnormal events; Dataset: CUHK, UCSD, ShanghaiTech; Parameters: AUC |
| [ | Incremental spatiotemporal learner | ISTL, an unsupervised deep learning approach with fuzzy aggregation is used to distinguish between anomalies that evolve over time in assistance with spatiotemporal autoencoder, ConvLSTM. Dataset: CUHK, Avenue, UCSD; Parameters: AUC, EER | |
| [ | LSTM, Cross Entropy | Recognizing industrial equipment in manufacturing system using edge computing; Big Data, smart meter Dataset: Self deployed ; Parameter: Accuracy | |
| [ | Optical flow, GMM, HoF | Detection accuracy, less computational time Dataset: UMN, UCSD, Subway, LV; Parameter: AUC, EER; | |
| [ | Spatiotemporal feature extraction | CNN, Adaptive ISTA and LSTM Dataset: CUHK Avenue, UCSD, UMN ; Parameter:AUC EER | |
| [ | Predictive convolutional attentive block | Integration of reconstruction and predictive approach to predict masked information Dataset: MVTec AD, Avenue, ShanghaiTech Parameter:AUC | |
| [ | Temporal attention network | Event score prediction for each video; TANet to learn features and temporal values Dataset: UCSD; Parameters: accuracy, specificity, sensitivity | |
| [ | Bi-directional LSTM (BD-LSTM) | Frame level prediction using BD-LSTM and CNN Dataset: UCSD; Parameters: accuracy, AUC; | |
| [ | Deep BD-LSTM | Action recognition using Deep BD-LSTM; Redundancy reduction; Capable of analyzing long videos Dataset: UCF-101, YouTube 11 Actions, HMDB51; Parameters: accuracy; | |
| [ | LSTM | Feature extraction using lightweight CNN; Residual attention-based LSTM for abnormal event detection; Dataset: UCF-Crime, UMN, Avenue; Parameters: accuracy; | |
| [ | BD-LSTM | Deep learning for segmentation; BD-LSTM to analyze anomalies in UAV real-world aerial datasets; Dataset: real-world aerial dataset; Parameters: mean precision, mean, recall, F1 mean | |
| Other Approaches | [ | Fuzzy theory | Anomaly detection in road traffic using fuzzy theory. The Gaussian distribution model is trained. Dataset: SNA2014-Nomal; Parameters: accuracy, false detection rate |
| [ | Sparse reconstruction | Joint sparsity model for abnormality detection, multi-object anomaly detection for real world scenarios. Dataset: CAVIAR Parameters:ROC | |
| [ | Sparsity based | Video surveillance of traffic. Background subtraction. Non-convex optimization; generalized shrinkage thresholding operator (GSTO), Joint estimation Dataset: I2R, CDnet2014 Parameters: F-measure | |
| [ | Frequency domain (power spectral density; correlation) | Detection of vehicle anomaly using edge computing high-frequency correlation, sensors, reduces computation overhead, privacy; Dataset: Open Source Platform; Parameters: FPR, TPR, ROC | |
| [ | Prediction model | Particle filtering used to detect anomaly in frames; of particle filtering less processing time; Dataset: UCSD, LIVE Parameters: EER | |
| [ | Magnitude of motion detection (MoMD) | MoMD based on spatiotemporal interest points (STIP); segmentation of video into key frames; elimination of redundant frames; CNN: YOLO; Dataset: UA-DETRAC, crossroad in Beijing city; Parameters: precision, recall, F-measure |
List of Datasets
| Dataset | Usage in literature | Surveillance target |
|---|---|---|
| UCSD | [ | Automobile, Individual, Crowd (Public Places) |
| [ | ||
| [ | ||
| Avenue, CUHK | [ | Individual, Crowd, Object (Public Places) |
| [ | ||
| UMN | [ | Individual, Crowd (Public Places) |
| Subway | [ | Individual, Crowd (Entrance and Exit of Subway |
| [ | Stations) | |
| Uturn | [ | Objects (Vehicles), Individuals (Pedestrians) |
| Vanaheim | [ | Individual, Crowd (Metro Stations) |
| Mind’s Eye | [ | Individual, Crowd, Objects (Vehicles) (Parking Area) |
| UVSD, DAVIS | [ | Individuals, Objects (Vehicles) |
| Shanghai | [ | Individual, Crowd, Objects (Vehicles) (Public Places) |
| Badminton | [ | Individual, Crowd (Badminton Game) |
| Behave | [ | Individual, Crowd Public Places) |
| ABU dataset | [ | Objects (Aerial View) |
| MDVD | [ | Objects (Aerial View) |
| IEEE SP Cup-2020 | [ | Objects (Lobby, Parking Area) |
| AU-AIR | [ | Objects (Aerial View) |
| Facial expression | [ | Individuals (Facial Expressions) |
| 2013 (FER-2013) | ||
| ISLD-A | [ | Individuals, Objects (Human Action Recognition) |
| Self-deployed | [ | Individuals, Objects, Crowd, Automobiles |
| [ | ||
| [ | ||
| [ | ||
| STONE | [ | Objects (Grassy Scene) |
| AVIRIS | [ | Objects (Aerial View) |
| SkyEye | [ | Objects (Aerial View) |
| UCF101 | [ | Individual (Action Videos) |
| HMDB51 | [ | Individual (Action Videos) |
| THUMOS14 | [ | Individual (Action Videos) |
| BP4D | [ | Individual (Facial Expressions) |
| AFEW | [ | Individual (Facial Expressions) |
| CAVIAR | [ | Individuals, Objects (Vehicles) |
| PASCAL VOC | [ | Objects (Vehicles) |
| DS1, DS2 | [ | Events (Fire/Smoke/Fog) |
| SFpark | [ | Object (Taxis) |
| LISA 2010 | [ | Objects (Vehicles) |
| ImageNet | [ | Events (Fire/Smoke/Fog) |
| QMUL junction | [ | Objects (Vehicles) |
| HockeyFight | [ | Individuals |
| ViolentFlow | [ | Individuals |
Breakthroughs and attainments on different datasets (FL=frame level; PL = pixel level, NA = not applicable, Ent=Entrance)
| Dataset | Year | Ref | AUC | EER | Method |
|---|---|---|---|---|---|
| UCSD (Ped1) | 2021 | [ | 94.8% | NA | LSTM |
| 2021 | [ | 85.3% | 23.6% | GAN | |
| 2021 | [ | 73.26% | 28.75% | GAN | |
| 2021 | [ | 84.7% | 22.5% | GAN | |
| 2020 | [ | 90.2% | 11.6% | Adversarial 3D CNN Autoencoder | |
| 2020 | [ | 82.31% | 21.43% | HoMM | |
| 2020 | [ | FL = 98.5%; PL = 77.4% | FL = 5.2%; PL = 27.3% | Deep GAN | |
| 2019 | [ | 75.2% | 29.8% | Deep Learning ISTL Autoencoder | |
| 2019 | [ | FL = 83.5% ; PL = 45.2 % | 25.2% | LSTM | |
| 2019 | [ | 90.5% | 11.9% | GAN | |
| 2019 | [ | FL = 83.5; PL = 63.1% | 23.4% | DeepOC | |
| 2018 | [ | 90.6% | 16.2% | Deep Learning | |
| 2018 | [ | FL = 94.1% ; PL = 74.8% | FL = 11.4; PL = 28.8% | SVM | |
| 2017 | [ | 99.6% | FL = 9.1; PL = 15.18 | Deep Cascade | |
| 2016 | [ | 72.7% | 33.1% | HOF | |
| UCSD (Ped2) | 2021 | [ | 96.5% | NA | LSTM |
| 2021 | [ | 95.7% | 12% | GAN | |
| 2021 | [ | 76.98% | 23.46% | GAN | |
| 2021 | [ | 97.1% | NA | GAN | |
| 2021 | [ | 94.6% | 7.6% | GAN | |
| 2020 | [ | 91.1% | 10.9% | Adversarial 3D CNN Autoencoder | |
| 2020 | [ | 94.16% | 13.25% | HoMM | |
| 2020 | [ | FL = 95.5%; PL = 83.1% | FL = 9.4%; PL = 21.8% | Deep GAN | |
| 2020 | [ | NA | FL = 10.5%; PL = 13.8% | CNN | |
| 2020 | [ | 95.6% | NA | GAN | |
| 2019 | [ | 91.1% | 8.9% | Deep Learning ISTL Autoencoder | |
| 2019 | [ | FL = 94.4% ; PL = 52.8% | 10.3% | LSTM | |
| 2019 | [ | 90.7% | 11.3% | GAN | |
| 2019 | [ | FL = 96.9%; PL = 95.0% | 8.8% | DeepOC | |
| 2018 | [ | 90.2% | 17.3% | Deep Learning | |
| 2018 | [ | PL = 89.2% | PL = 16.7% | SVM | |
| 2018 | [ | NA | FL = 11%; PL = 15% | Fully CNN | |
| 2017 | [ | NA | FL = 8.2; PL = 19 | Deep Cascade | |
| 2016 | [ | 87.5% | 20.0% | HOF | |
| Avenue | 2021 | [ | 89.2% | NA | GAN |
| 2021 | [ | 86.9% | 20.2% | GAN | |
| 2021 | [ | 98% | NA | LSTM | |
| 2021 | [ | 85.8% | NA | GAN | |
| 2021 | [ | 88.6% | NA | GAN | |
| 2021 | [ | 89.82 % | 21.55% | GAN | |
| 2020 | [ | 88.9% | 18.2% | Adversarial 3D Autoencoder | |
| 2020 | [ | 87.9% | 20.2% | DeepGAN | |
| 2020 | [ | 84.9 % | NA | DNN | |
| 2019 | [ | 76.8% | 29.2% | DL ISTL Autoencoder | |
| Avenue | 2019 | [ | FL = 86.1%; PL = 94.1% | 22% | LSTM |
| 2019 | [ | 89.2% | 17.6 | GAN | |
| 2019 | [ | 88.6% | 18.5% | DeepOC | |
| 2018 | [ | FL = 82.1; PL = 93.7% | NA | Deep Learning | |
| ShanghaiTech | 2021 | [ | 75.7% | NA | GAN |
| 2021 | [ | 73% | 32.3% | GAN | |
| 2021 | [ | 78.43% | 25.16% | GAN | |
| 2021 | [ | 73.7% | NA | GAN | |
| 2021 | [ | 74.5% | 31.6% | GAN | |
| 2020 | [ | 73.7% | 32% | GAN | |
| 2020 | [ | 74.6% | 27.6% | Adversarial 3D Convolution Autoencoder | |
| 2019 | [ | 70% | 36.5% | GAN | |
| Subway | 2020 | [ | Ent = 90.5%; Exit = 94.8% | Ent = 22.7%; Exit = 9.6% | Adversarial 3D |
| 2020 | [ | Ent = NA; Exit = NA | Ent = 10.1%; Exit = 15.9% | 3D CNN | |
| 2019 | [ | Ent = 90.2%; Exit = 94.6% | Ent = 22.67%; Exit = 9.3% | GAN | |
| 2019 | [ | Ent = 91.1%; Exit = 89.5% | Ent = NA; Exit = NA | DeepOC | |
| 2018 | [ | Ent = 90.4%; Exit = 90.2% | Ent = 17%; Exit = 16% | FCNN | |
| 2016 | [ | Ent = 81.6%; Exit = 84.9% | Ent = 22.8%; Exit = 17.8% | HOF |
Fig. 6Architectural overview of edge computing
Anomaly detection at the edge FP=false positive; FN= false negative; FPR=false positive rate; TPR=true positive rate; ROC=receiver operating characteristics; MAE=maximum absolute error
| Method | Ref/Year | Features | Learning | Anomaly criteria | Dataset | Parameters |
|---|---|---|---|---|---|---|
| Hand-crafted Features | [ | Low computational cost with high accuracy, performance | HOG,SVM,KCF | Human object tracking | Self deployed | Speed, Performance |
| [ | Real-time, good accuracy, with limited resources | LCNN, Kerman (KCF, KF, BS) Objects | Tracking human | VOC07, VOC12 | Accuracy Precision | |
| [ | Accuracy | Binary Neural Network; FPGA system design | Recognition of facial emotions | JAFFE | Accuracy | |
| CNN | [ | Processing requires less memory | Harr-Cascade, SVM, L-CNN | Smart surveillance | ImageNet | Accuracy precision, recall, FP, FN F-measure |
| [ | Processing requires less memory | LCNN | Smoke detection in foggy surveillance | ImageNet | FP, FN, Accuracy precision, recall, F-measure | |
| [ | Real-time low complexity | CNN | Detecting pedestrian | Multi pedestrian | Accuracy | |
| [ | Reduced Bandwidth,Security, Real-time | Video motion magnitude | Vehicle anomaly | UA-DETRAC, crossroad in Beijing city | Precision Recall F-measure | |
| [ | Real-time feasible | Spatiotemporal feature extraction, trajectory association | Pedestrian identification | UCSD Ped1 UMN | Accuracy | |
| DNN | [ | Improved efficiency | CNN, DL | Industry manufacture inspection | Self deployed | FPR, TPR, ROC |
| [ | Smart parking | CNN, HOG | Parking surveillance | Self deployed | Accuracy | |
| [ | Balances computational power and workload | DIVS | Vehicle classification | Self deployed | Efficiency | |
| [ | Reduced computation , overhead privacy | high-frequency correlation, sensors | Vehicle Anomaly | Open Source Platform | FPR, TPR, ROC | |
| [ | Resource optimization | Deep learning, Video analytics | Video optimization | VIRAT 2.0 Ground | F1 Score accuracy | |
| [ | Accuracy time efficiency | DNN | Face Mask Identification | Bus Drive Monitoring, public dataset | Accuracy | |
| DNN | [ | Reduce network occupancy and system response delay | Intelligent Edge Surveillance | Cloud, DL, Edge | Self deployed | Accuracy Model loss |
| LSTM | [ | Parallel computing to improve efficiency | LSTM, Cross entropy | Industrial electrical equipment | Self deployed | Accuracy |
| [ | Effective utilization of resources | BD-LSTM, L-CNN in videos | Crime scene | UCF-Crime,RWF-2000 | Accuracy | |
| GAN | [ | Increases computing performance | SaliencyGAN Deep SOD CNN Adversarial learning semi-supervised | Object detection/Tracking | PASCALS | MAE F-Measure precision recall |