method: An approach towards Word-Level Multi-Script Identification using Deep Transfer Features and SVM2017-07-01

Authors: Arindam Das, Saikat Roy

Description: An approach towards Word-Level Multi-Script Identification using Deep Transfer Features and SVM Method description: A pre-trained model of VGG16 is used where weights are adapted the problem of script identification. Each labeled image is initially resized to 224x224 and passed through this deep CNN as a 3D matrix to extract features. The images in each set are first normalized based on mean and standard deviation of the training set. The CNN was not trained further, but the features (4096 sized vectors) are extracted from the last fully connected layer through forward propagation (for each dataset). An SVM with RBF Kernel is used as classifier and trained on the training set. An accuracy of 85.03% was achieved on the validation set, the same hyper-parameters are used to predict the scripts in the test set.

Confusion Matrix

Detection
ArabicLatinChineseJapaneseKoreanBanglaSymbolsMixedNone
GTArabic3024191428516950600
Latin2245919725434428412810600
Chinese691585227365215019200
Japanese1295328548186125725900
Korean20466814969934495118500
Bangla428593563371508100
Symbols142797194466700
Mixed000000000
None000000000