method: TPS-ResNet2019-06-04

Authors: Jeonghun Baek, Youngmin Baek, Seung Shin, Bado Lee, Chae Young Lee, and Hwalsuk Lee

Description: we used Thin-plate-spline (TPS) based Spatial transformer network (STN) which normalizes the input text images, ResNet based feature extractor, BiLSTM, and attention mechanism.
This model was developed based on the analysis of scene text recognition modules.
See our paper and source code.

Clova AI OCR Team, NAVER/LINE Corp.

Confusion Matrix

Detection
ArabicLatinChineseJapaneseKoreanBanglaHindiSymbolsNone
GTArabic470730138581522190
Latin121591334283752443372960
Chinese4741537694722841050
Japanese81149412335132164199250
Korean841012290168113622440120
Bangla1110629162023392310
Hindi81032611922404050
Symbols3092866311233326510
None000000000