Deakin University
Browse

File(s) under permanent embargo

Evaluation of cursive and non-cursive scripts using recurrent neural networks

journal contribution
posted on 2016-04-01, 00:00 authored by S B Ahmed, S Naz, Imran RazzakImran Razzak, S F Rashid, M Z Afzal, T M Breuel
Character recognition has been widely used since its inception in applications involved processing of scanned or camera-captured documents. There exist multiple scripts in which the languages are written. The scripts could broadly be divided into cursive and non-cursive scripts. The recurrent neural networks have been proved to obtain state-of-the-art results for optical character recognition. We present a thorough investigation of the performance of recurrent neural network (RNN) for cursive and non-cursive scripts. We employ bidirectional long short-term memory (BLSTM) networks, which is a variant of the standard RNN. The output layer of the architecture used to carry out our investigation is a special layer called connectionist temporal classification (CTC) which does the sequence alignment. The CTC layer takes as an input the activations of LSTM and aligns the target labels with the inputs. The results were obtained at the character level for both cursive Urdu and non-cursive English scripts are significant and suggest that the BLSTM technique is potentially more useful than the existing OCR algorithms.

History

Journal

Neural computing and applications

Volume

27

Issue

3

Pagination

603 - 613

Publisher

Springer

Location

London, Eng.

ISSN

0941-0643

Language

eng

Publication classification

C1.1 Refereed article in a scholarly journal

Copyright notice

2015, The Natural Computing Applications Forum