method: TH-CNN2017-07-01

Authors: Yejun Tang, Haoyu Qin, Liangrui Peng, Department of Electronic Engineering, Tsinghua University, Beijing, China

Description: A simplified GoogLeNet is used (Caffe implementation). The network is trained by using augmented samples. The original samples in the training set are rotated, blurred, mirrored and inverted. The numbers of training sam- ples of different scripts are balanced. The input images are resized into 256x256 pixels and cropped into 227x227 pixels.

Confusion Matrix

Detection
ArabicLatinChineseJapaneseKoreanBanglaSymbolsMixedNone
GTArabic346329919448246225710200
Latin4394395921856538450972821139300
Chinese400314713939736119311300
Japanese722521924274968237916400
Korean103485073881148119440631500
Bangla159175988195199836200
Symbols2642316932852911588900
Mixed000000000
None000000000