UCOM offline dataset-an Urdu handwritten dataset generation

Bin Ahmed, S; Naz, S; Swati, S; Razzak, Imran; Umar, AI; Ali Khan, A

UCOM offline dataset-an Urdu handwritten dataset generation

journal contribution

posted on 2024-06-05, 06:29 authored by S Bin Ahmed, S Naz, S Swati, Imran Razzak, AI Umar, A Ali Khan

© 2017, Zarka Private University. All rights reserved. A benchmark database for character recognition is an essential part for efficient and robust development. Unfortunately, there is no comprehensive handwritten dataset for Urdu language that would be used to compare the state of the art techniques in the field of optical character recognition. In this paper, we present a new and publically available dataset comprising 600 pages of handwritten Urdu text written in Nasta’liq style in conjunction with detailed ground truth for the evaluation of handwritten Urdu character recognition. This dataset contains text lines written in Nasta’liq style by limited individuals on A4 size paper. The acquired data on page was scanned and text lines were segmented. UCOM database covers all Urdu characters and ligatures with different variation in addition to Urdu numeric data. We have considered that ligature consists of up to five characters in this dataset. The UCOM dataset can be used for handwritten character recogntition as well as writer identification. We proposed and evaluated the strength of Recurrent Neural Networks (RNN) on UCOM offline database sample text line.

History

Journal

The international Arab journal of information technology

Volume

14

Pagination

239-245

Location

Zarqa, Jordan

Open access

Yes

Link to full text

https://iajit.org/PDF/Vol 14, No. 2/8721.pdf

ISSN

1683-3198

eISSN

2309-4524

Language

eng

Publication classification

C1.1 Refereed article in a scholarly journal

Issue

2

Publisher

Zarqa Private University

Publication URL

https://iajit.org/PDF/Vol 14, No. 2/8721.pdf

Usage metrics

Keywords

4603 Computer vision and multimedia computation

UCOM offline dataset-an Urdu handwritten dataset generation

History

Journal

Volume

Pagination

Location

Open access

Link to full text

ISSN

eISSN

Language

Publication classification

Issue

Publisher

Publication URL

Usage metrics

Categories

Keywords

Licence

Exports