Method: 4Paradigm-Data-Intelligence

Date: 2019-05-30

Authors: ACVG

Description: Recognition model: a Transformer-based recognizer with a ResNet50 backbone. The language of each recognized transcript is identified by a voting process. Training sets: the 2017 MLT Task 2 train set, the 2019 MLT Task 2 train set, and the 2019 MLT synthetic dataset.
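The description does not say how the voting process works. One plausible reading is that each character of the recognized transcript casts a vote for its script and the majority wins. The sketch below illustrates that reading only. The names `SCRIPT_PREFIXES`, `char_script`, and `vote_script` are hypothetical, and so are the choices to count digits and punctuation as "Symbols" and to treat both CJK ideographs as "Chinese". None of this is taken from the authors' implementation.

```python
import unicodedata
from collections import Counter

# Hypothetical mapping from Unicode character-name prefixes to the script
# labels used in the confusion matrix (an assumption, not the authors' table).
SCRIPT_PREFIXES = {
    "ARABIC": "Arabic",
    "LATIN": "Latin",
    "CJK": "Chinese",
    "HIRAGANA": "Japanese",
    "KATAKANA": "Japanese",
    "HANGUL": "Korean",
    "BENGALI": "Bangla",
}


def char_script(ch):
    """Return the script label for one character, or None for whitespace."""
    if ch.isspace():
        return None
    name = unicodedata.name(ch, "")
    for prefix, script in SCRIPT_PREFIXES.items():
        if name.startswith(prefix):
            return script
    # Assumption: digits, punctuation, and anything unmapped count as Symbols.
    return "Symbols"


def vote_script(transcript):
    """Pick the script that receives the most per-character votes."""
    votes = Counter(
        script for script in map(char_script, transcript) if script is not None
    )
    if not votes:
        return "None"
    return votes.most_common(1)[0][0]
```

With this sketch, `vote_script("abc1")` returns `"Latin"` because three Latin letters outvote one digit. A transcript that mixes kanji with kana is labeled by whichever group has more characters, which is one reason Chinese and Japanese are easy to confuse under a scheme like this.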

Confusion Matrix

Detection (columns): Arabic, Latin, Chinese, Japanese, Korean, Bangla, Symbols, Mixed, None
GT (rows):
Arabic 49801145111291100
Latin 423590219919741014324400
Chinese 263093481632253193000
Japanese 192150812424438613828200
Korean 11012352573451080414110000
Bangla 108427562383300
Symbols 12640666815286600
Mixed 0 0 0 0 0 0 0 0 0
None 0 0 0 0 0 0 0 0 0
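For a confusion matrix laid out like this one, with ground-truth (GT) classes as rows and detected classes as columns, per-class recall and overall accuracy can be computed as in the sketch below. The function names and the small 3x3 matrix are hypothetical. The values are made up for illustration and are not the results reported above.

```python
def per_class_recall(matrix, labels):
    """Recall per GT class: diagonal count divided by the row total."""
    recall = {}
    for i, label in enumerate(labels):
        total = sum(matrix[i])
        # A class with no ground-truth samples has undefined recall.
        recall[label] = matrix[i][i] / total if total else None
    return recall


def overall_accuracy(matrix):
    """Fraction of all samples that fall on the diagonal."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total if total else None


# Illustrative values only (rows = GT, columns = detection).
toy_labels = ["Arabic", "Latin", "Mixed"]
toy_matrix = [
    [8, 2, 0],
    [1, 9, 0],
    [0, 0, 0],
]
```

An all-zero row, such as the Mixed and None rows, means the class has no ground-truth samples, so its recall is undefined rather than zero.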