Deakin University
Browse

The optical character recognition of Urdu-like cursive scripts

Version 2 2024-06-05, 06:29
Version 1 2019-11-26, 10:44
conference contribution
posted on 2024-06-05, 06:29 authored by S Naz, K Hayat, Imran Razzak, M Waqas Anwar, SA Madani, SU Khan
We survey the optical character recognition (OCR) literature with reference to the Urdu-like cursive scripts. In particular, the Urdu, Pushto, and Sindhi languages are discussed, with the emphasis being on the Nasta'liq and Naskh scripts. Before detaining the OCR works, the peculiarities of the Urdu-like scripts are outlined, which are followed by the presentation of the available text image databases. For the sake of clarity, the various attempts are grouped into three parts, namely: (a) printed, (b) handwritten, and (c) online character recognition. Within each part, the works are analyzed par rapport a typical OCR pipeline with an emphasis on the preprocessing, segmentation, feature extraction, classification, and recognition. © 2013 Elsevier Ltd. All rights reserved.

History

Volume

47

Pagination

1229-1248

Location

Bari, Italy

Start date

2012-09-18

End date

2012-09-20

ISSN

0031-3203

Language

eng

Notes

Published in the journal Pattern Recognition, vol. 47, iss. 3, 2014, pp.1229-1248

Publication classification

E1.1 Full written paper - refereed

Title of proceedings

ICFHR 2012 : International Conference on Frontiers in Handwriting Recognition

Event

Frontiers in Handwriting Recognition. International Conference (2012 : Bari, Italy)

Issue

3

Publisher

Elsevier

Place of publication

Amsterdam, The Netherlands

Usage metrics

    Research Publications

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC