method: GS_HUST2019-05-28
Authors: Changxu Cheng, Qiuhui Huang, Wuheng Xu and Hao Wang at Huazhong University of Science and Technology
Description: We use a simple VGG16 followed by Global Average Pooling and a linear classifier. Data augmentation, including random crop, changing hsv and perspective transformation, is imposed on the training data. Besides, we adopt grouping resizing strategy to deal with the trainging images. Specifically, we resize the images to a certain size according to their aspect ratios. In this way, only images with similar aspect ratios has the same final size.
Confusion Matrix
Detection | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
Arabic | Latin | Chinese | Japanese | Korean | Bangla | Hindi | Symbols | None | ||
GT | Arabic | 4810 | 233 | 23 | 17 | 23 | 11 | 9 | 16 | 0 |
Latin | 228 | 57663 | 902 | 519 | 645 | 158 | 132 | 390 | 0 | |
Chinese | 7 | 135 | 4207 | 326 | 44 | 9 | 9 | 13 | 0 | |
Japanese | 37 | 1528 | 1965 | 4213 | 271 | 28 | 63 | 52 | 0 | |
Korean | 29 | 1309 | 553 | 230 | 10763 | 35 | 49 | 24 | 0 | |
Bangla | 9 | 138 | 27 | 12 | 8 | 2227 | 118 | 6 | 0 | |
Hindi | 3 | 21 | 5 | 1 | 0 | 14 | 4177 | 3 | 0 | |
Symbols | 49 | 532 | 44 | 105 | 70 | 6 | 11 | 3198 | 0 | |
None | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |