Authors: VIS-VAR Team, Baidu Inc.*
Affiliation: VIS-VAR Team, Baidu Inc.*
Description: We are from the Department of Computer Vison, Baidu Inc. Our method mainly composes of three parts:Text detection, Script identification and Text recognition. Text detection mainly relies on LOMO and EAST, Multi-scale testing is adopted and the final result is boosted with Resnet-50 and Inception-v4 as different backbones. Next, all text lines are recognized by the unified language classification model to identify the script of the text. Eight single-language text recognition models based on Res-SENet are used to finally recognize the text line images.