method: Clova AI / Lens2018-09-17

Authors: Seolki Baek, Geonmo Gu, Jeongo Seo

Description: Our model is featured by CNN/RNN-based encoder and Hybrid CTC/Attention decoder. Moreover we proposed new text synthesis tools to make our model robust and high performance in the wild.

method: TencentAILab2018-04-24

Authors: Jingchao Zhou, Tianlin Gao, Zheng Zhou, Zhifeng Li

Description: We train a network to recognize the word images. First, we correct the oblique and vertical arranged text lines using tranditional OCR technologies. Second, we generate several batches of synthesized images with similar style and arrangement as training samples. Last, we adopt DenseNet as the backbone to extract features, Bi-direction LSTM to learn sequential information, and CTC as the transcription layer.

method: Tencent-OCR+2017-06-30

Authors: Chunchao Guo, Weichen Zhang, Yi Li, Hui Song, Ming Liu, Hongfa Wang, Lei Xiao

Description: Data Platform Department, Tencent. We adapt CNN-LSTM-CTC architecture to recognize the text line. In addition, a knowledge-based post processing is used for adjusting the result.

Ranking Table

Description Paper Source Code
DateMethodTotal Edit distance (case sensitive)Correctly Recognised Words (case sensitive)T.E.D. (case insensitive)C.R.W. (case insensitive)
2018-09-17Clova AI / Lens101.354694.60%92.772994.93%
2018-04-24TencentAILab112.802692.34%101.509093.09%
2017-06-30Tencent-OCR+158.341889.83%121.766791.24%
2017-07-01HIK_OCR198.648388.54%179.419089.17%
2017-06-29baseline1,749.873634.51%1,501.228042.91%
2017-06-26textminer2,399.817924.80%1,577.139550.08%
2017-06-30onceAgain2,744.681313.68%2,038.880029.53%

Ranking Graphic

Ranking Graphic