| Literature DB >> 31193721 |
Hafsa Masroor1, Muhammad Saeed1, Maryam Feroz1, Kamran Ahsan2, Khawar Islam2.
Abstract
Advances in machine and language translation immerge new fields and research opportunities for researchers, whereas Natural Language Processing and Computational Linguistics deal with communication between natural languages and their interaction. The objective of this research is to develop and test a novel tactic to solve the issue of translation from Roman Urdu to the English language. The approach used to construct this practical model is divided into three stages; each stage works out to achieve its desired task. Self-maintained corpus alongwith its corresponding tag-set is used for tokenization. The syntactical structure is covered by writing Urdu POS tagger based on grammatical rules. We prepared the grammatical structures of different sentences for Roman Urdu to English translation. Since Roman script can be expressed in numerous ways, our grammatical structures fulfill the maximum possible needs of writing and produce the best possible English translation. We entered a sentence in Roman Urdu which gave the best possible translation in the English language. In comparison with Google Translator, Transtech worked better and gives more accurate results.Entities:
Keywords: Computer science; Linguistics
Year: 2019 PMID: 31193721 PMCID: PMC6538981 DOI: 10.1016/j.heliyon.2019.e01780
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Fig. 1A systematic overview of Roman Urdu Translator.
Corpus collection of Roman Urdu data.
| Tum konse bazar jati thi | |||||
|---|---|---|---|---|---|
| English | You | which | market | go | did |
| Roman Urdu | Tum | konse | bazar | jati | thi |
Fig. 2Internal view of Roman Urdu Translator.
The list of tag set for Urdu POS tagger.
| S. No | Categories | Types | POS tag |
|---|---|---|---|
| 1 | Noun | Common | NN |
| Proper | NNP | ||
| 2 | Verb | Main Verb Infinite | VBI |
| Main Verb Finite | VBF | ||
| 3 | Auxiliary | Aspectual | AUXA |
| Progressive | AUXP | ||
| Tense | AUXT | ||
| Modals | AUXM | ||
| Present Tense | AUXIT | ||
| Past Tense | AUCTP | ||
| Future Tense | AUXTF | ||
| Perfect Tense | AUXTC | ||
| Continuous Tense | AUXTR | ||
| 4 | Pronoun | Personal | PRP |
| Demonstrative | PDM | ||
| Possessive | PRS | ||
| Relative Demonstrative | PRD | ||
| Relative Personal | PRR | ||
| Reflexive | PRF | ||
| Reflexive APNA | APNA | ||
| 5 | Nominal Modifier | Adjective | JJ |
| Quantifier | Q | ||
| Cardinal | CD | ||
| Ordinal | OD | ||
| Fraction | FR | ||
| Multiplicative | QM | ||
| 6 | Adverb | Common | RB |
| Negative | NEG | ||
| 7 | Ad Position | Preposition | PRE |
| Postposition | PSP | ||
| 8 | Interrogative | WH Question | WH |
Comparison between Google translator & transtech.
| Roman Urdu | Google 2017 | Google 2019 | Transtech |
|---|---|---|---|
| Tum konse bazar jati thi | You are what the market | What market did you go? | Which market did you go |
| Wo bohat achay kapre pehnti hai | She wears nice clothes many | Wear good clothes | She wears very good clothes |
| Imran waqt par ghar nahi pohanchta hai | He does not come home on time | Imran does not know home at time | Imran do not reach home on time |
| Ali ajkal bohat pareshan hai | Many consignment Ali today | Eli is a booming trend today | Ali is very upset now-a-days |
| Areeba khamoshi se apna kaam kar rahi hai | Areeba quietly doing its job | Aurabagh is doing his job quietly | Areeba is doing work silently |
Check speech tagging parts. Make division in chunks and generate a parse tree. Find an appropriate set of grammatical rules. Rearrange the English words based on rules. |