method: CLOVA-AI v22019-02-18

Authors: Jeonghun Baek, Junyeop Lee, Sungrae Park, Moonbin Yim, Seonghyeon Kim, Hwalsuk Lee

Description: We used Thin-plate-spline (TPS) based Spatial transformer network (STN) which normalizes the input text images, ResNet based feature extractor, BiLSTM, and attention mechanism.
This model was developed based on the analysis of scene text recognition modules.
See our paper and source code.

Authors: Rachit S Munjal, Arun D Prabhu, Sukumar Moharana, Nikhil Arora

Affiliation: Samsung R & D, Bangalore

Description: SRIB_STRIDE -Scene Text Recognition In Device is a very lightweight STR model designed for real-time on device usage, with selective-rotation, CNN with Global Squeeze-Excite Modules , Bi-LSTM with Projection and CTC Decoder.
Total Parameters: 0.85M supporting Latin diacritic characters as well.
Execution Time on S10 + :- 5ms ( dimension 16*128)

Ranking Table

Description Paper Source Code
DateMethodTotal Edit distance (case sensitive)Correctly Recognised Words (case sensitive)T.E.D. (case insensitive)C.R.W. (case insensitive)
2019-02-18CLOVA-AI v226.951595.98%23.483696.35%
2020-09-30SRIB-STRIDE (Scene Text Recognition In Device)72.792986.67%62.631487.58%

Ranking Graphic

Ranking Graphic