Description: We reimplemented the standalone recognition method based on the end-to-end text spotting code released with Mask TextSpotter [TPAMI]. It is a sequence-to-sequence method based on 2D attention. We synthesize curved text images for pretraining using the VGG SynthText method. For fine-tuning we add public datasets, including ICDAR 2013-2015, CUTE, SVT, IIIT5k, RCTW2017, and LSVT; no private data is used.
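As a rough illustration of what one decoding step of 2D attention can look like, the sketch below applies additive attention over every spatial position of a feature map and returns a weighted context vector. It is not the Mask TextSpotter code or this submission's implementation: the function name, the parameter names (`w_f`, `w_h`, `v`), the additive scoring form, and all shapes are assumptions chosen for the example.

```python
import numpy as np

def attention_2d_step(features, hidden, w_f, w_h, v):
    """One hypothetical decoding step of additive attention over a 2D feature map.

    features: (H, W, C) feature map from the image encoder
    hidden:   (D,) current decoder state
    w_f: (C, A), w_h: (D, A), v: (A,) projection parameters (assumed shapes)
    Returns the context vector (C,) and the attention map (H, W).
    """
    h, w, c = features.shape
    flat = features.reshape(h * w, c)                  # one row per spatial position
    scores = np.tanh(flat @ w_f + hidden @ w_h) @ v    # (H*W,) unnormalized scores
    scores = scores - scores.max()                     # stabilize the softmax
    weights = np.exp(scores)
    weights = weights / weights.sum()                  # softmax over all positions
    context = weights @ flat                           # weighted sum of features
    return context, weights.reshape(h, w)
```

Because the weights are normalized over both height and width, the decoder can attend to characters that do not lie on a single horizontal line, which is what makes this kind of attention suitable for curved text.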
Description: An instance-segmentation-based method for text detection and an attention-based method for text recognition, with a threshold of 0.5 and 5435 classes. Data augmentation and extra datasets, including LSVT, ICDAR2017, COCO-Text, and RECTS, are used for text recognition.