method: NSTD-MCEM-iFLYTEK

Date: 2019-10-12

Authors: iFLYTEK

Affiliation: iFLYTEK

Description: The natural scene text detector (NSTD-iFLYTEK) is based on Mask R-CNN with a ResNet-101 backbone. Only ICDAR 2019 datasets are used for training: ReCTS, LSVT, MLT, and ArT. The detection result comes from a single model trained with multi-scale inputs and tested at a single scale, with no model ensemble. The recognition stage is an ensemble of attention-based text recognizers. The final results are obtained by fusing the outputs of the different models, which use different channel information.
Xiangxiang Wang (王翔翔), iFLYTEK (科大讯飞)
Jian Dong (董健), iFLYTEK (科大讯飞)
Fengren Wang (王烽人), iFLYTEK (科大讯飞)
Jiajia Wu (吴嘉嘉), iFLYTEK (科大讯飞)
Yin Lin (林垠), iFLYTEK (科大讯飞)
Lou Shun (娄舜), iFLYTEK (科大讯飞)
Jinshui Hu (胡金水), iFLYTEK (科大讯飞)