| Literature DB >> 33805765 |
Rami Ahmed1, Mandar Gogate2, Ahsen Tahir2,3, Kia Dashtipour2, Bassam Al-Tamimi4, Ahmad Hawalah5, Mohammed A El-Affendi6, Amir Hussain2.
Abstract
Offline Arabic Handwriting Recognition (OAHR) has recently become instrumental in the areas of pattern recognition and image processing due to its application in several fields, such as office automation and document processing. However, OAHR continues to face several challenges, including high variability of the Arabic script and its intrinsic characteristics such as cursiveness, ligatures, and diacritics, the unlimited variation in human handwriting, and the lack of large public databases. In this paper, we introduce a novel context-aware model based on deep neural networks to address the challenges of recognizing offline handwritten Arabic text, including isolated digits, characters, and words. Specifically, we propose a supervised Convolutional Neural Network (CNN) model that contextually extracts optimal features and employs batch normalization and dropout regularization parameters. This aims to prevent overfitting and further enhance generalization performance when compared to conventional deep learning models. We employ a number of deep stacked-convolutional layers to design the proposed Deep CNN (DCNN) architecture. The model is extensively evaluated and shown to demonstrate excellent classification accuracy when compared to conventional OAHR approaches on a diverse set of six benchmark databases, including MADBase (Digits), CMATERDB (Digits), HACDB (Characters), SUST-ALT (Digits), SUST-ALT (Characters), and SUST-ALT (Names). A further experimental study is conducted on the benchmark Arabic databases by exploiting transfer learning (TL)-based feature extraction which demonstrates the superiority of our proposed model in relation to state-of-the-art VGGNet-19 and MobileNet pre-trained models. Finally, experiments are conducted to assess comparative generalization capabilities of the models using another language database , specifically the benchmark MNIST English isolated Digits database, which further confirm the superiority of our proposed DCNN model.Entities:
Keywords: Arabic handwritten; DCNN; batch normalization; databases; dropout
Year: 2021 PMID: 33805765 PMCID: PMC8001675 DOI: 10.3390/e23030340
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Some of the characteristics of Arabic script writing.
Summary of reviewed related works.
| Literature/ Year | Findings | Outline |
|---|---|---|
| Elleuch et al. [ | The DBNN method obtained an ECR of 2.1% and an accuracy of 97.9% on the HACDB characters database. | Generalization was not tested for Arabic digits and words. Accuracy requires enhancement. |
| Elleuch et al. [ | The DBN method obtained an ECR of 1.67% and 3.64% and an accuracy of 98.33% and 96.36% on the HACDB database with 24 characters and the HACDB database with 66 characters, respectively. | Generalization was not tested for Arabic digits and words. Accuracy requires enhancement. |
| Elleuch et al. [ | The CNN method obtained an ECR of 5% and 14.71% and an accuracy of 95% and 85.29% on the HACDB database with 24 characters and the HACDB database with 66 characters, respectively. | Generalization was not tested for Arabic digits and words. Accuracy requires enhancement. |
| ElAdel et al. [ | The DCNWN method obtained an ECR of 2.1% and an accuracy of 93.92% on the IESKarDB characters database. | Generalization was not tested for Arabic digits and words. Accuracy requires enhancement. |
| Elleuch et al. [ | The DBN method obtained an ECR of 6.08% and an accuracy of 97.9% on the HACDB database with 66 characters. | Generalization was not tested for Arabic digits and words. Accuracy requires enhancement. |
| Elleuch et al. [ | The CDBN method obtained an ECR of 1.82% and 16.3% and an accuracy of 98.18% and 83.7% on the HACDB (24) characters and the IFN/ENIT words databases, respectively. | Generalization was not tested for Arabic digits and characters. Accuracy requires enhancement. |
| El-Sawy et al. [ | The CNN method obtained an ECR of 12% and an accuracy of 88% on the MADBase digits database. | Generalization wa not tested for Arabic characters and words. Accuracy requires enhancement. |
| Elleuch et al. [ | The DSVM method obtained an ECR of 5.68% and an accuracy of 94.32% on the HACDB (66) characters database. | Generalization was not tested for Arabic digits and words. Accuracy requires enhancement. |
| Elleuch et al. [ | The CNN-SVM method obtained an ECR of 2.09%, 5.83%, and 7.05% and an accuracy of 97.91%, 94.17%, and 92.95% on the HACDB database with 24 characters, the HACDB database with 66 characters, the IFN/ENIT (56) words databases, respectively. | Generalization was not tested for Arabic digits. Accuracy requires enhancement. |
| Loey et al. [ | The SAE method obtained an ECR of 2.6% and an accuracy of 98.5% on the CMATERDB 3.3.1 digits database. | Generalization was not tested for Arabic characters and words. Accuracy requires enhancement. |
| Ashiquzzaman et al. [ | The CNN method obtained an ECR of 1.5% and an accuracy of 97.4% on the MADBase digits database. | Generalization was not tested for Arabic characters and words. Accuracy requires enhancement. |
| Chen et al. [ | The RRN-GRU method obtained an ECR of 13.51% and an accuracy of 86.49% on the IFN/ENIT words database. | Generalization was not tested for Arabic digits and characters. Accuracy requires enhancement. |
| M. Amrouch et al. [ | The CNN-based HMM method obtained an ECR of 11.05% and 10.77% and an accuracy of 88.95% and 89.23% on the IFN/ENIT words database with “abd-e” protocol and the IFN/ENIT words database with “abcd-e”, respectively. | Generalization was not tested for Arabic digits and characters. Accuracy requires enhancement. |
| Elbashir et al. [ | The CNN method obtained an ECR of 6.5% and an accuracy of 93.5% on the SUST-ALT characters database. | Generalization was not tested for Arabic digits and words. Accuracy requires enhancement. |
| Elleuch et al. [ | The CDBN method obtained an ECR of 1.14%, 8.45%, and 7.1% and an accuracy of 98.86%, 91.55%, and 92.9% on HACDB database with 66 characters, the IFN/ENIT words database with “abd-e” protocol, and the IFN/ENIT words database with “abc-d” protocol, respectively. | Generalization was not tested for Arabic digits. Accuracy requires enhancement. |
| Ashiquzzaman et al. [ | The CNN method obtained an ECR of 0.6% and an accuracy of 99.4% on the CMATERDB 3.3.1digits database. | Generalization was not tested for Arabic characters and words. |
| Mustafa et al. [ | The CNN method obtained an ECR of 0.86% and an accuracy of 99.14% on the SUST-ALT words database. | Generalization was not tested for Arabic digits and characters. |
Figure 2Offline Arabic handwritten recognition system general framework.
Figure 3The proposed DCNN architecture for oahr.
Raw samples of digits from Arabic handwritten databases.
| Number | Machine | MADBase | CMATERDB | SUST-ALT |
|---|---|---|---|---|
| Zero |
|
|
|
|
| One |
|
|
|
|
| Two |
|
|
|
|
| Three |
|
|
|
|
| Four |
|
|
|
|
| Five |
|
|
|
|
| Six |
|
|
|
|
| Seven |
|
|
|
|
| Eight |
|
|
|
|
| Nine |
|
|
|
|
Raw samples of characters from Arabic handwritten databases.
| Character | Machine | HACDB | SUST-ALT |
|---|---|---|---|
| Alif |
|
|
|
| Raa |
|
|
|
| Seen |
|
|
|
| Saad |
|
|
|
| Ayn |
|
|
|
| Faa |
|
|
|
| Meem |
|
|
|
| Noon |
|
|
|
| Haa |
|
|
|
| Waw |
|
|
|
Raw samples of words from Arabic handwritten databases.
| Name in English | Machine Form | SUST-ALT Names Database |
|---|---|---|
| Ahmed |
|
|
| Ali |
|
|
| Ebraheem |
|
|
| Taha |
|
|
| Soliman |
|
|
| Eman |
|
|
| Fatema |
|
|
| Rian |
|
|
| Marwa |
|
|
| Samah |
|
|
The block-wise performance metrics of the proposed DCNN model on the MADBase digits database.
| Stacked Blocks Count | Training Time (in Minutes) | Precision | Recall | F-Measure | Accuracy |
|---|---|---|---|---|---|
| One | 586 | 0.991 | 0.991 | 0.991 | 0.9982 |
| Two | 628 | 0.994 | 0.994 | 0.994 | 0.9988 |
| Three | 654 | 0.9945 | 0.9945 | 0.9945 | 0.9989 |
| Four | 717 | 0.9951 | 0.9951 | 0.9951 | 0.99902 |
| Five (Final Model) | 548 | 0.9953 | 0.9953 | 0.9953 | 0.99906 |
The block-wise performance metrics of the proposed DCNN model on the CMATERDB digits database.
| Stacked Blocks Count | Training Time (in Minutes) | Precision | Recall | F-Measure | Accuracy |
|---|---|---|---|---|---|
| One | 24 | 0.96833 | 0.96833 | 0.96833 | 0.99367 |
| Two | 25 | 0.975 | 0.975 | 0.975 | 0.99 |
| Three | 26 | 0.97 | 0.97 | 0.97 | 0.994 |
| Four | 27 | 0.98542 | 0.98542 | 0.98542 | 0.99708 |
| Five (Final Model) | 22 | 0.98608 | 0.98608 | 0.98608 | 0.99722 |
The block-wise performance metrics of the proposed DCNN model on the SUST-ALT digits database.
| Stacked Blocks Count | Training Time (in Minutes) | Precision | Recall | F-Measure | Accuracy |
|---|---|---|---|---|---|
| One | 288 | 0.96061 | 0.96061 | 0.96061 | 0.99212 |
| Two | 350 | 0.9885 | 0.9885 | 0.9885 | 0.9977 |
| Three | 491 | 0.99283 | 0.99283 | 0.99283 | 0.99857 |
| Four | 343 | 0.99391 | 0.99391 | 0.99391 | 0.99878 |
| Five (Final Model) | 282 | 0.99107 | 0.99107 | 0.99107 | 0.99821 |
The block-wise performance metrics of the proposed DCNN model on the HACDB characters database.
| Stacked Blocks Count | Training Time (in Minutes) | Precision | Recall | F-Measure | Accuracy |
|---|---|---|---|---|---|
| One | 52 | 0.70833 | 0.70833 | 0.70833 | 0.99116 |
| Two | 53 | 0.93561 | 0.93561 | 0.93561 | 0.99805 |
| Three | 59 | 0.95909 | 0.95909 | 0.95909 | 0.99876 |
| Four | 60 | 0.97197 | 0.97197 | 0.97197 | 0.99915 |
| Five (Final Model) | 53 | 0.96967 | 0.96967 | 0.96967 | 0.99908 |
The block-wise performance metrics of the proposed DCNN model on the SUST-ALT characters database.
| Stacked Blocks Count | Training Time (in Minutes) | Precision | Recall | F-Measure | Accuracy |
|---|---|---|---|---|---|
| One | 393 | 0.82687 | 0.82687 | 0.82687 | 0.98982 |
| Two | 573 | 0.95338 | 0.95338 | 0.95338 | 0.99726 |
| Three | 400 | 0.9733 | 0.9733 | 0.9733 | 0.99843 |
| Four | 611 | 0.97799 | 0.97799 | 0.97799 | 0.99871 |
| Five (Final Model) | 344 | 0.97591 | 0.97591 | 0.97591 | 0.99858 |
The block-wise performance metrics of the proposed DCNN Model on the SUST-ALT Words (Names) Database.
| Stacked Blocks Count | Training Time (in Minutes) | Precision | Recall | F-Measure | Accuracy |
|---|---|---|---|---|---|
| One | 1079 | 0.67788 | 0.67788 | 0.67788 | 0.98389 |
| Two | 1345 | 0.96213 | 0.96213 | 0.96213 | 0.99811 |
| Three | 504 | 0.98725 | 0.98725 | 0.98725 | 0.99936 |
| Four | 508 | 0.9895 | 0.9895 | 0.9895 | 0.99948 |
| Five (Final Model) | 534 | 0.99038 | 0.99038 | 0.99038 | 0.99952 |
Summary of the performance measurement factors of the four-block model and the five-block model.
| Database/Type | Four Blocks | Five Blocks | ||||
|---|---|---|---|---|---|---|
| Best Accuracy | Best Precision | Less Training | Best Accuracy | Best Precision | Less Training | |
| MADBase (Digits) | No | No | No | Yes | Yes | Yes |
| CMATERDB (Digits) | No | No | No | Yes | Yes | Yes |
| SUST-ALT (Digits) | Yes | Yes | No | No | No | Yes |
| HACDB (Characters) | Yes | Yes | No | No | No | Yes |
| SUST-ALT (Characters) | Yes | Yes | No | No | No | Yes |
| SUST-ALT (Words) | No | No | Yes | Yes | Yes | No |
Results of the proposed DCNN model’s experiments on different offline handwritten Arabic databases.
| Database | Training Time | Training | Training | Validation | Validation | ECR | Accuracy |
|---|---|---|---|---|---|---|---|
| MADBase/Digits | 548 | 0.46 | 99.88 | 1.41 | 99.73 | 0.09 | 99.91 |
| CMATERDB/ | 22 | 3.43 | 98.85 | 6.66 | 98.96 | 0.28 | 99.72 |
| SUST-ALT/ | 282 | 1.15 | 99.65 | 3.24 | 99.34 | 0.18 | 99.82 |
| HACDB/ | 53 | 6.84 | 97.49 | 9.25 | 96.97 | 0.09 | 99.91 |
| SUST-ALT/ | 344 | 3.71 | 98.73 | 9.18 | 97.97 | 0.14 | 99.86 |
| SUST-ALT/ | 534 | 1.86 | 99.48 | 3.97 | 99.13 | 0.05 | 99.95 |
Proposed DCNN model’s performances metrics with respect to different offline Arabic handwritten databases.
| DatabaseName | DatabaseType | Precision | Recall | F-Measure | Accuracy |
|---|---|---|---|---|---|
| MADBase | Digits | 0.9953 | 0.9953 | 0.9953 | 0.99906 |
| CMATERDB | Digits | 0.98608 | 0.98608 | 0.98608 | 0.99722 |
| SUST-ALT | Digits | 0.99107 | 0.99107 | 0.99107 | 0.99821 |
| HACDB | Characters | 0.96967 | 0.96967 | 0.96967 | 0.99908 |
| SUST-ALT | Characters | 0.97591 | 0.97591 | 0.97591 | 0.99858 |
| SUST-ALT | Words | 0.99038 | 0.99038 | 0.99038 | 0.99952 |
VGGNet-19 model’s performance metrics of different offline Arabic handwritten databases.
| Database Name | Database Type | Training Time | Precision | Recall | F-Measure | Accuracy |
|---|---|---|---|---|---|---|
| MADBase | Digits | 964 | 0.9921 | 0.9921 | 0.9921 | 0.99842 |
| CMATERDB | Digits | 32 | 0.97667 | 0.97667 | 0.97667 | 0.99533 |
| SUST-ALT | Digits | 423 | 0.98755 | 0.98755 | 0.98755 | 0.99751 |
| HACDB | Characters | 70 | 0.91439 | 0.91439 | 0.91439 | 0.99741 |
| SUST-ALT | Characters | 606 | 0.93002 | 0.93002 | 0.93002 | 0.99588 |
| SUST-ALT | Words | 892 | 0.866 | 0.866 | 0.866 | 0.9933 |
MobileNet model’s performance metrics with respect to different offline Arabic handwritten databases.
| Database Name | Database Type | Training Time | Precision | Recall | F-Measure | Accuracy |
|---|---|---|---|---|---|---|
| MADBase | Digits | 548 | 0.8221 | 0.8221 | 0.8221 | 0.96442 |
| CMATERDB | Digits | 19 | 0.41167 | 0.41167 | 0.41167 | 0.88233 |
| SUST-ALT | Digits | 263 | 0.29088 | 0.29088 | 0.29088 | 0.85818 |
| HACDB | Characters | 42 | 0.19697 | 0.19697 | 0.19697 | 0.97567 |
| SUST-ALT | Characters | 330 | 0.1456 | 0.1456 | 0.1456 | 0.94974 |
| SUST-ALT | Words | 275 | 0.915 | 0.915 | 0.915 | 0.95458 |
Performance comparisons with other state-of-the-art approaches.
| Literature | Method Name | Database Name | Database Type | ECR/ | Accuracy |
|---|---|---|---|---|---|
| Elleuch et al. [ | DBNN | HACDB (66) | Characters | 2.10% | 97.90% |
| Elleuch et al. [ | DBN | HACDB (66) | Characters | 2.10% | 97.90% |
| Elleuchet al. [ | CDBN | HACDB (66) | Characters | 1.82% | 98.18% |
| Elleuch et al. [ | CNN | HACDB (66) | Characters | 14.71 | 85.29% |
| Elleuch et al. [ | DBN | HACDB (66) | Characters | 3.64% | 96.36% |
| Elleuch et al. [ | CNN based-SVM | HACDB (66) | Characters | 5.83% | 94.17% |
| Elleuch et al. [ | DSVM | HACDB (66) | Characters | 5.68% | 94.32% |
| Elleuch et al. [ | CDBN | HACDB (66) | Characters | 1.14% | 98.86% |
|
|
|
|
|
|
|
| El-Sawy et al. [ | CNN | MADBase (10) | Digits | 12% | 88% |
| Loey et al. [ | SAE | MADBase (10) | Digits | 1.50% | 98.50% |
|
|
|
|
|
|
|
| Ashiquzzaman et al. [ | CNN | CMATERDB (10) | Digits | 2.60% | 97.40% |
| Ashiquzzaman et al. [ | CNN | CMATERDB (10) | Digits | 0.60% | 99.40% |
|
|
|
|
|
|
|
| Elbashir et al. [ | CNN | SUST-ALT (40) | Words | 6.50% | 93.50% |
| Mustafa et al. [ | CNN | SUST-ALT (20) | Words | 0.86% | 99.14% |
|
|
|
|
|
|
|