| Literature DB >> 36258895 |
Anmol Pattanaik1, Rakesh Chandra Balabantaray1.
Abstract
The current breakthroughs in the highway research sector have resulted in a greater awareness and focus on the construction of an effective Intelligent Transportation System (ITS). One of the most actively researched areas is Vehicle Licence Plate Recognition (VLPR), concerned with determining the characters contained in a vehicle's Licence Plate (LP). Many existing methods have been used to deal with different environmental complexity factors but are limited to motion deblurring. The aim of our research is to provide an effective and robust solution for recognizing characters present in license plates in complex environmental conditions. Our proposed approach is capable of handling not only the motion-blurred LPs but also recognizing the characters present in different types of low resolution and blurred license plates, illegible vehicle plates, license plates present in different weather and light conditions, and various traffic circumstances, as well as high-speed vehicles. Our research provides a series of different approaches to execute different steps in the character recognition process. The proposed approach presents the concept of Generative Adversarial Networks (GAN) with Discrete Cosine Transform (DCT) Discriminator (DCTGAN), a joint image super resolution and deblurring approach that uses a discrete cosine transform with low computational complexity to remove various types of blur and complexities from licence plates. License Plates (LPs) are detected using the Improved Bernsen Algorithm (IBA) with Connected Component Analysis(CCA). Finally, with the aid of the proposed Xception model with transfer learning, the characters in LPs are recognised. Here we have not used any segmentation technique to split the characters. Four benchmark datasets such as Stanford Cars, FZU Cars, HumAIn 2019 Challenge datasets, and Application-Oriented License Plate (AOLP) dataset, as well as our own collected dataset, were used for the validation of our proposed algorithm. This dataset includes the images of vehicles captured in different lighting and weather conditions such as sunny, rainy, cloudy, blurred, low illumination, foggy, and night. The suggested strategy does better than the current best practices in both numbers and quality.Entities:
Keywords: Connected Component Analysis; Generative Adversarial Networks; Improved Bernsen Algorithm; Transfer Learning; Xception
Year: 2022 PMID: 36258895 PMCID: PMC9560886 DOI: 10.1007/s11042-022-13922-9
Source DB: PubMed Journal: Multimed Tools Appl ISSN: 1380-7501 Impact factor: 2.577
Fig. 1Flowchart of our Proposed Work
Fig. 2Generator Architecture
Fig. 3Discriminator Architecture
Fig. 4Xception Model Architecture
Fig. 5Transfer Learning
Prameters of the DCTGAN Algorithm
| Parameter used | Description |
|---|---|
| ReLU Activation function | It is used in generator for the purpose of updation of the ideal parameters during the training and to avoid overfitting when the number of hidden layers are more. |
| Leaky ReLU Activation function | Discriminator uses Leaky ReLU activation function to make the layer much more optimised and to get better performance result. |
Prameters of the Proposed Xception Algorithm
| Parameter used | Name/Value | Description |
|---|---|---|
| Activation function | Mish | Due to the smooth and non monotonic nature of Mish activation function, we replaced the ReLU activation of the original model with Mish activation function so that its unbounded above and bounded below property can improve the performance of the proposed model. |
| Classifier | Softmax Classifier | We have used softmax classifier in the output layer of the proposed Xception framework which is otherwise called as multinomial logistic regression as it performs better in multi-class classification problem by giving probability for each class label. |
| Loss function | categorical cross entropy | Due to the multi-category nature of the challenge, categorical cross entropy is employed as the model’s loss function. |
| Optimization algorithm | Adam optimization algorithm | Model’s loss function is minimised by implementing the Adam [ |
| Learning rate | 0.001 | It has been choosen heuristically |
| Dropout rate | 0.5 | It has been choosen heuristically |
| Total number of epochs | 50 | It has been choosen heuristically |
Fig. 6Stanford cars dataset,FZU cars dataset, HumAIn 2019 challenge dataset
Fig. 7Sample of AOLP dataset with three subsets: AC, LE and RP
Fig. 8Our own dataset
Comparision of State-Of-The-Art ANPR systems with the proposed ANPR System
| system | Plate | Character | Character | Overall | Character type |
|---|---|---|---|---|---|
| detection | segmentation | recognition | accuracy | ||
| Ref [ | NR | 0.99 | NR | NR | Korean |
| Ref [ | 0.971 | 0.983 | 0.978 | 0.935 | English Japanese |
| Ref [ | 0.965 | NR | 0.891 | 0.86 | English |
| Ref [ | 0.973 | NR | 0.945 | 0.919 | Persian |
| Ref [ | 0.969 | 0.987 | 0.945 | 0.904 | Persian |
| Ref [ | 0.959 | NR | 0.923 | 0.90 | Chinese English |
| Ref [ | 0.993 | NR | 0.966 | 0.96 | Persian |
| Ref [ | 0.973 | NR | 0.957 | 0.931 | English |
| Ref [ | 0.971 | NR | 0.964 | 0.936 | English |
| Ref [ | NR | NR | NR | 0.91 | English |
| Ref [ | 0.979 | NR | 0.956 | 0.937 | English |
| Proposed system | 0.993 | - | 0.990 | 0.983 | English |
NR: Not Reported
Result analysis of existing models with proposed model on FZU Car dataset
| Model | Precision | Recall | F1-Score |
|---|---|---|---|
| VGG16 | 0.942 | 0.920 | 0.940 |
| ResNet50 | 0.953 | 0.935 | 0.971 |
| InceptionV3 | 0.954 | 0.907 | 0.963 |
| MobileNetV2 | 0.973 | 0.929 | 0.970 |
| Faster RCNN | 0.970 | 0.902 | 0.965 |
| YOLO 9000 | 0.974 | 0.873 | 0.969 |
| Proposed Model | 0.978 | 0.966 | 0.972 |
Result analysis of existing models with proposed model on stanford cars dataset
| Model | Precision | Recall | F1-Score |
|---|---|---|---|
| VGG16 | 0.907 | 0.850 | 0.876 |
| ResNet50 | 0.911 | 0.872 | 0.918 |
| InceptionV3 | 0.932 | 0.908 | 0.964 |
| MobileNetV2 | 0.941 | 0.929 | 0.951 |
| Faster RCNN | 0.945 | 0.931 | 0.959 |
| YOLO 9000 | 0.951 | 0.905 | 0.962 |
| Proposed Model | 0.979 | 0.969 | 0.974 |
Result analysis of existing models with proposed model on HumAIn 2019 dataset
| Model | Precision | Recall | F1-Score |
|---|---|---|---|
| VGG16 | 0.898 | 0.870 | 0.884 |
| ResNet50 | 0.891 | 0.879 | 0.907 |
| InceptionV3 | 0.850 | 0.830 | 0.920 |
| MobileNetV2 | 0.921 | 0.913 | 0.893 |
| Faster RCNN | 0.934 | 0.898 | 0.912 |
| YOLO 9000 | 0.946 | 0.876 | 0.922 |
| Proposed Model | 0.972 | 0.961 | 0.967 |
Result analysis of existing models with proposed model on AOLP dataset
| Model | Precision | Recall | F1-Score |
|---|---|---|---|
| VGG16 | 0.856 | 0.815 | 0.910 |
| ResNet50 | 0.907 | 0.887 | 0.909 |
| InceptionV3 | 0.935 | 0.916 | 0.943 |
| MobileNetV2 | 0.927 | 0.876 | 0.928 |
| Faster RCNN | 0.957 | 0.941 | 0.966 |
| YOLO 9000 | 0.968 | 0.929 | 0.963 |
| Proposed Model | 0.982 | 0.958 | 0.971 |
Result analysis of existing models with proposed models on our own dataset
| Model | Precision | Recall | F1-Score | Execution time |
|---|---|---|---|---|
| VGG16 | 0.954 | 0.912 | 0.925 | 2 min 15 sec |
| ResNet50 | 0.943 | 0.916 | 0.907 | 62 min |
| InceptionV3 | 0.949 | 0.945 | 0.946 | 37 min |
| MobileNetV2 | 0.961 | 0.955 | 0.972 | 29 min 15 sec |
| Faster RCNN | 0.952 | 0.922 | 0.907 | 9 min 78 sec |
| YOLO 9000 | 0.965 | 0.911 | 0.975 | 3 min 22 sec |
| Proposed Model | 0.995 | 0.962 | 0.991 | 116 min 667 sec |
Fig. 9Step wise Recognition Process
Fig. 10Comparative Result Analysis of Existing and Proposed methods on differnt Datasets
Fig. 11Accuracy and Loss plot of our proposed model