method: NXB OCR2019-05-27
Authors: Yupeng Cao(X), Qiufeng Wang*(X), Qi Qu(B), Jing Li(X), Cheng Cheng*(N), Kaizhu Huang*(X) (Equal Contribution)
Description: A CNN-based method is used for training script identification classifier in cropped word images. We use VGG-19 architecture as the training model. For each convoluntional layer, we add the batch normalization and choose max pooling as the pooling layer.
P.S.Affiliation of Authors
(N:Institute of Nanotechnology and Nano-Bionics, Chinese Academy of Sciences ;
X:Xi’an Jiaotong-liverpool University ;
B:Beijing Babel Tenchnology Co., Ltd.)
Confusion Matrix
Detection | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
Arabic | Latin | Chinese | Japanese | Korean | Bangla | Hindi | Symbols | None | ||
GT | Arabic | 4555 | 462 | 17 | 50 | 27 | 9 | 6 | 16 | 0 |
Latin | 316 | 57802 | 404 | 831 | 659 | 195 | 130 | 300 | 0 | |
Chinese | 14 | 453 | 3079 | 1062 | 83 | 25 | 16 | 18 | 0 | |
Japanese | 98 | 2329 | 1261 | 3942 | 347 | 58 | 69 | 53 | 0 | |
Korean | 104 | 2645 | 523 | 627 | 8955 | 65 | 44 | 29 | 0 | |
Bangla | 6 | 227 | 17 | 23 | 16 | 2106 | 148 | 2 | 0 | |
Hindi | 2 | 107 | 5 | 11 | 4 | 84 | 4009 | 2 | 0 | |
Symbols | 52 | 1199 | 22 | 167 | 29 | 10 | 13 | 2523 | 0 | |
None | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |