- Task 2 - Script identification - Method: CNN based method 7
- Method info
- Samples list
- Per sample details
method: CNN based method 72017-07-02
Authors: Yash Patel, Michal Bušta, Lukáš Neumann, Jiri Matas
Description: A CNN-based approach is used for script- identification in cropped word images. The convolutional lay- ers from VGG-16 architecture are used along with a Global- Average-Pooling and two fully connected layers. To preserve the aspect ratio of input images in both training and testing, the images are resized into fixed-height (64) and variable-width tensors. For training, the convolutional layers are initialized with ImageNet weights. The categorical-cross-entropy loss is utilized, and all the layers (both convolutional and fully connected) are updated during back-propagation.
Confusion Matrix
Detection | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
Arabic | Latin | Chinese | Japanese | Korean | Bangla | Symbols | Mixed | None | ||
GT | Arabic | 4751 | 302 | 12 | 33 | 19 | 9 | 16 | 0 | 0 |
Latin | 183 | 58849 | 245 | 565 | 467 | 98 | 130 | 0 | 0 | |
Chinese | 22 | 271 | 3540 | 817 | 80 | 11 | 9 | 0 | 0 | |
Japanese | 51 | 2039 | 980 | 4528 | 488 | 50 | 21 | 0 | 0 | |
Korean | 46 | 2302 | 436 | 519 | 9626 | 56 | 7 | 0 | 0 | |
Bangla | 3 | 213 | 19 | 25 | 19 | 2266 | 0 | 0 | 0 | |
Symbols | 65 | 907 | 9 | 38 | 33 | 8 | 2436 | 0 | 0 | |
Mixed | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
None | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |