An Ocr system for printed Nasta'liq script: a segmentation based approach
Version 2 2024-06-05, 06:29Version 2 2024-06-05, 06:29
Version 1 2019-11-26, 10:12Version 1 2019-11-26, 10:12
conference contribution
posted on 2024-06-05, 06:29 authored by S Naz, AI Umar, SB Ahmed, SH Shirazi, Imran Razzak, I Siddiqi© 2014 IEEE. Machine simulation of human reading has been a subject of intensive research for almost four decades. Automatic Urdu character recognition remains a challenging task due to its cursive nature despite the fact that the latest improvements in recognition methods and systems for Latin script are very promising. This work introduces a robust approach based on statistical models that provide solution for recognition of Urdu text Nasta'liq style. Contrary to classical approaches which segment text into words, ligatures or characters, we intend to employ an implicit segmentation where text lines are recognized during segmentation. The developed system will be evaluated on standard Urdu text databases and compared with the state-ofthe- art recognition techniques proposed till date.
History
Pagination
255-259Location
Karachi, PakistanPublisher DOI
Start date
2014-12-08End date
2014-12-10ISBN-13
9781479957545Language
engPublication classification
E1.1 Full written paper - refereedTitle of proceedings
17th IEEE International Multi Topic Conference 2014Event
Multi Topic. International Conference (17th : 2014 : Karachi, Pakistan)Publisher
IEEEPlace of publication
Piscataway, N.J.Usage metrics
Categories
No categories selectedKeywords
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC