The optical character recognition of Urdu-like cursive scripts
Version 2 2024-06-05, 06:29Version 2 2024-06-05, 06:29
Version 1 2019-11-26, 10:44Version 1 2019-11-26, 10:44
conference contribution
posted on 2024-06-05, 06:29 authored by S Naz, K Hayat, Imran Razzak, M Waqas Anwar, SA Madani, SU KhanWe survey the optical character recognition (OCR) literature with reference to the Urdu-like cursive scripts. In particular, the Urdu, Pushto, and Sindhi languages are discussed, with the emphasis being on the Nasta'liq and Naskh scripts. Before detaining the OCR works, the peculiarities of the Urdu-like scripts are outlined, which are followed by the presentation of the available text image databases. For the sake of clarity, the various attempts are grouped into three parts, namely: (a) printed, (b) handwritten, and (c) online character recognition. Within each part, the works are analyzed par rapport a typical OCR pipeline with an emphasis on the preprocessing, segmentation, feature extraction, classification, and recognition. © 2013 Elsevier Ltd. All rights reserved.
History
Volume
47Pagination
1229-1248Location
Bari, ItalyPublisher DOI
Start date
2012-09-18End date
2012-09-20ISSN
0031-3203Language
engNotes
Published in the journal Pattern Recognition, vol. 47, iss. 3, 2014, pp.1229-1248Publication classification
E1.1 Full written paper - refereedTitle of proceedings
ICFHR 2012 : International Conference on Frontiers in Handwriting RecognitionEvent
Frontiers in Handwriting Recognition. International Conference (2012 : Bari, Italy)Issue
3Publisher
ElsevierPlace of publication
Amsterdam, The NetherlandsUsage metrics
Categories
Keywords
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC