| Literature DB >> 29350328 |
Shancheng Fang1,2, Hongtao Xie3, Zhineng Chen4, Yizhi Liu5, Yan Li6.
Abstract
How to read Uyghur text from biomedical graphic images is a challenge problem due to the complex layout and cursive writing of Uyghur. In this paper, we propose a system that extracts text from Uyghur biomedical images, and matches the text in a specific lexicon for semantic analysis. The proposed system possesses following distinctive properties: first, it is an integrated system which firstly detects and crops the Uyghur text lines using a single fully convolutional neural network, and then keywords in the lexicon are matched by a well-designed matching network. Second, to train the matching network effectively an online sampling method is applied, which generates synthetic data continually. Finally, we propose a GPU acceleration scheme for matching network to match a complete Uyghur text line directly rather than a single window. Experimental results on benchmark dataset show our method achieves a good performance of F-measure 74.5%. Besides, our system keeps high efficiency with 0.5s running time for each image due to the GPU acceleration scheme.Keywords: Text detection; Text extracting; Text recognition; Uyghur
Mesh:
Year: 2018 PMID: 29350328 DOI: 10.1007/s12021-017-9350-0
Source DB: PubMed Journal: Neuroinformatics ISSN: 1539-2791