| Literature DB >> 26368566 |
Riaz Ahmad1, Saeeda Naz2, Muhammad Zeshan Afzal3, Sayed Hassan Amin4, Thomas Breuel3.
Abstract
The presence of a large number of unique shapes called ligatures in cursive languages, along with variations due to scaling, orientation and location provides one of the most challenging pattern recognition problems. Recognition of the large number of ligatures is often a complicated task in oriental languages such as Pashto, Urdu, Persian and Arabic. Research on cursive script recognition often ignores the fact that scaling, orientation, location and font variations are common in printed cursive text. Therefore, these variations are not included in image databases and in experimental evaluations. This research uncovers challenges faced by Arabic cursive script recognition in a holistic framework by considering Pashto as a test case, because Pashto language has larger alphabet set than Arabic, Persian and Urdu. A database containing 8000 images of 1000 unique ligatures having scaling, orientation and location variations is introduced. In this article, a feature space based on scale invariant feature transform (SIFT) along with a segmentation framework has been proposed for overcoming the above mentioned challenges. The experimental results show a significantly improved performance of proposed scheme over traditional feature extraction techniques such as principal component analysis (PCA).Entities:
Mesh:
Year: 2015 PMID: 26368566 PMCID: PMC4569441 DOI: 10.1371/journal.pone.0133648
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Pashto Characters.
Pashto has 44 basic Alphabets [13].
Fig 2Shape variation of characters within Pashto character set [13].
Fig 3Location variation; Ligature in (a) is in test set, while (b) shows the same ligature in training set.
Fig 4Scale and orientation variations in Pashto image dataset.
Fig 5Illustration of Ligature matching with all available ligatures against SIFT-S1 descriptor.
Confusion matrix.
| S1 | S2 | S3 | S4 | Average | |
|---|---|---|---|---|---|
|
| 92.40% | 59.80% | 42.20% | 49.0% | 61.0% |
|
| 59.40% | 96.90% | 66.10% | 74.40% | 74.0% |
|
| 44.20% | 57.00% | 95.70% | 63.60% | 65.0% |
|
| 54.20% | 75.10% | 59.20% | 97.78% | 72.0% |
Ligatures having sizes S1, S2, S3 and S4 are shown along y-axis, as a test data, and along x-axis as training data. The correct classification rate (%) is shown at intersections of training and test data [16].
Confusion matrix.
| S1R | S2R | S3R | S4R | Average | |
|---|---|---|---|---|---|
|
| 92.70% | 59.10% | 41.50% | 48.0% | 60.0% |
|
| 57.30% | 96.90% | 66.80% | 72.50% | 73.0% |
|
| 42.10% | 59.50% | 95.20% | 64.10% | 65.0% |
|
| 51.70% | 74.80% | 59.70% | 97.50% | 71.0% |
Sizes S1R, S2R, S3R and S4R with rotated ligatures are shown along x-axis, as a test data. The correct classification rate (%) is shown along x-axis for each distinct SIFT descriptor [16].
Fig 6In case of PCA, the recognition rate has been obtained using font size S2 and S3, whereas in case of SIFT approach recognition rate has been obtained using font size S1 and S4.
Fig 7Base and Special Ligatures.
Confusion matrix.
| Plain Text | S1 | S2 | S3 | S4 | Average |
|---|---|---|---|---|---|
|
| 95.90% | 68.90% | 54.90% | 59.8% | 69.87% |
|
| 71.20% | 99.70% | 77.90% | 77.3% | 81.52% |
|
| 60.50% | 76.10% | 99.40% | 74.90% | 77.72% |
|
| 61.10% | 81.20% | 72.30% | 99.80% | 78.60% |
Sizes S1, S2, S3 and S4 with non-rotated ligatures are shown along y-axis, as a test data. The correct classification rate (%) is shown along x-axis for each distinct SCBC descriptor.
Confusion matrix.
| Rotated Text | S1R | S2R | S3R | S4R | Average |
|---|---|---|---|---|---|
|
| 98.10% | 69.00% | 54.90% | 58.60% | 70.15% |
|
| 70.80% | 99.50% | 77.30% | 76.60% | 81.05% |
|
| 58.70% | 79.10% | 99.40% | 76.20% | 78.35% |
|
| 60.90% | 80.50% | 72.40% | 99.80% | 78.40% |
Sizes S1, S2, S3 and S4 with non- rotated ligatures are shown along y-axis, as a test data. The correct classification rate (%) is shown at intersection of two axis for each distinct SCBC descriptor.