method: Multi_scale_v12019-09-29
Authors: Wuheng Xu, Changxu Cheng, Bohan Li
Description: We used area block feature information using images on multiple scales.This model has 4 scales and 8 branches.We also used some data augments and improved ROI pooling.Finally, we used three training sets(mlt17, mlt19_train, mlt19_val).
Confusion Matrix
Detection | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
Arabic | Latin | Chinese | Japanese | Korean | Bangla | Hindi | Symbols | None | ||
GT | Arabic | 4814 | 258 | 6 | 22 | 22 | 5 | 2 | 13 | 0 |
Latin | 141 | 59544 | 84 | 321 | 329 | 50 | 21 | 147 | 0 | |
Chinese | 7 | 310 | 3535 | 796 | 75 | 6 | 4 | 17 | 0 | |
Japanese | 32 | 1862 | 547 | 5403 | 262 | 12 | 12 | 27 | 0 | |
Korean | 37 | 1432 | 119 | 234 | 11110 | 30 | 19 | 11 | 0 | |
Bangla | 1 | 177 | 1 | 21 | 23 | 2247 | 74 | 1 | 0 | |
Hindi | 5 | 50 | 2 | 2 | 1 | 17 | 4146 | 1 | 0 | |
Symbols | 34 | 745 | 1 | 36 | 31 | 4 | 0 | 3164 | 0 | |
None | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |