method: NXB OCR2019-05-27

Authors: Yupeng Cao(X), Qiufeng Wang*(X), Qi Qu(B), Jing Li(X), Cheng Cheng*(N), Kaizhu Huang*(X) (Equal Contribution)

Description: A CNN-based method is used for training script identification classifier in cropped word images. We use VGG-19 architecture as the training model. For each convoluntional layer, we add the batch normalization and choose max pooling as the pooling layer.

P.S.Affiliation of Authors
(N:Institute of Nanotechnology and Nano-Bionics, Chinese Academy of Sciences ;
X:Xi’an Jiaotong-liverpool University ;
B:Beijing Babel Tenchnology Co., Ltd.)

Confusion Matrix

Detection
ArabicLatinChineseJapaneseKoreanBanglaHindiSymbolsNone
GTArabic455546217502796160
Latin316578024048316591951303000
Chinese1445330791062832516180
Japanese982329126139423475869530
Korean104264552362789556544290
Bangla6227172316210614820
Hindi2107511484400920
Symbols5211992216729101325230
None000000000