Evaluation of cursive and non-cursive scripts using recurrent neural networks
Version 2 2024-06-05, 06:29Version 2 2024-06-05, 06:29
Version 1 2019-11-27, 09:42Version 1 2019-11-27, 09:42
journal contribution
posted on 2024-06-05, 06:29authored bySB Ahmed, S Naz, Imran RazzakImran Razzak, SF Rashid, MZ Afzal, TM Breuel
Character recognition has been widely used since its inception in applications involved processing of scanned or camera-captured documents. There exist multiple scripts in which the languages are written. The scripts could broadly be divided into cursive and non-cursive scripts. The recurrent neural networks have been proved to obtain state-of-the-art results for optical character recognition. We present a thorough investigation of the performance of recurrent neural network (RNN) for cursive and non-cursive scripts. We employ bidirectional long short-term memory (BLSTM) networks, which is a variant of the standard RNN. The output layer of the architecture used to carry out our investigation is a special layer called connectionist temporal classification (CTC) which does the sequence alignment. The CTC layer takes as an input the activations of LSTM and aligns the target labels with the inputs. The results were obtained at the character level for both cursive Urdu and non-cursive English scripts are significant and suggest that the BLSTM technique is potentially more useful than the existing OCR algorithms.