Authors: Hui Li, Peng Wang, Chunhua Shen, Guyu Zhang
Description: We propose an easy-to-implement strong baseline for irregular scene text recognition, using off- the-shelf neural network components and only word-level annotations. It is composed of a 31-layer ResNet, an LSTM- based encoder-decoder framework and a 2-dimensional attention module. Despite its simplicity, the proposed method is robust. It achieves state-of-the-art performance on irregular text recognition benchmarks and comparable results on regular text datasets.