method: TencentAILab2018-04-24

Authors: Jingchao Zhou, Tianlin Gao, Zheng Zhou, Zhifeng Li

Description: We train a network to recognize the word images. First, we correct the oblique and vertical arranged text lines using tranditional OCR technologies. Second, we generate several batches of synthesized images with similar style and arrangement as training samples. Last, we adopt DenseNet as the backbone to extract features, Bi-direction LSTM to learn sequential information, and CTC as the transcription layer.