Method: CBL_OCR (2022-01-14)

Authors: Guokun Wang(王国坤), Jingyi Shen(沈静逸), Yue Wu(吴岳), Chang Zhou(周昌), Jianqiang Huang(黄建强)

Affiliation: Alibaba

Description: The training method is based on a transformer, which is used in both the encoder and the decoder; multiple losses are combined for better accuracy. Our training data consists of several public datasets, including CTW, LSVT, RCTW, ReCTS, and the Baidu Scene Text Recognition contest data. We first train the model on the whole dataset and then fine-tune it on ReCTS for several epochs.
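
The description does not say which losses are combined or how they are weighted. As an illustration only, one common way to combine several losses is a weighted sum. In the sketch below, the loss names (`decoder_ce`, `aux_ce`), the weights, and the probability vectors are all hypothetical and are not taken from the submission.

```python
import math


def cross_entropy(probs, target_index):
    """Negative log-likelihood of the target class given a probability vector."""
    return -math.log(probs[target_index])


def combine_losses(losses, weights):
    """Weighted sum of named loss values; each loss needs a matching weight."""
    return sum(weights[name] * value for name, value in losses.items())


# Hypothetical outputs of two prediction heads for one character position.
losses = {
    "decoder_ce": cross_entropy([0.1, 0.7, 0.2], 1),   # main decoder head (assumed)
    "aux_ce": cross_entropy([0.25, 0.5, 0.25], 1),     # auxiliary head (assumed)
}
weights = {"decoder_ce": 1.0, "aux_ce": 0.5}           # assumed weights

total = combine_losses(losses, weights)
print(round(total, 4))
```

In practice the weights would be tuned on a validation set; the values here only show the mechanics.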

Method: Unis_OCR (2021-05-08)

Authors: Jie.Li(李杰), BaoLin.Zhang(张保林), KeJie.Liu(刘克捷), Yuan.Hu(胡源)

Affiliation: UNISINSIGHT

Description: The training method is based on the CRNN framework, with an SE-ResNet with multi-scale features as the backbone; BiLSTM and an attention mechanism are used to integrate multiple model results. The training data includes LSVT, ReCTS, RCTW, ArT, and other free public data.
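
For readers unfamiliar with the "SE" in SE-ResNet, the squeeze-and-excitation idea can be sketched in plain Python as follows. This is a minimal illustration, not the team's implementation: the helper names (`se_block`, `matvec`), the tiny weight matrices, and the omission of bias terms are all assumptions made to keep the example short.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]


def se_block(feature_maps, w_reduce, w_expand):
    """Apply squeeze-and-excitation to C feature maps, each an H x W nested list."""
    # Squeeze: global average pooling gives one scalar per channel.
    squeezed = [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in feature_maps
    ]
    # Excitation: bottleneck FC -> ReLU -> FC -> sigmoid (biases omitted here).
    hidden = [max(0.0, h) for h in matvec(w_reduce, squeezed)]
    gates = [sigmoid(g) for g in matvec(w_expand, hidden)]
    # Scale: reweight every channel by its gate.
    scaled = [
        [[gate * v for v in row] for row in ch]
        for ch, gate in zip(feature_maps, gates)
    ]
    return scaled, gates


# Two 2x2 channels and hypothetical weights (2 -> 1 -> 2 channels).
features = [[[1.0, 1.0], [1.0, 1.0]], [[2.0, 2.0], [2.0, 2.0]]]
w_reduce = [[0.5, 0.5]]
w_expand = [[0.0], [1.0]]

scaled, gates = se_block(features, w_reduce, w_expand)
print(gates)
```

The gates lie in (0, 1), so the block can only attenuate channels relative to each other; in a real network the two weight matrices are learned.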

Method: DH_OCR (2021-05-08)

Authors: Qiang Zeng(曾强), Zhaolin You(游照林), Yuanyuan Chen(陈媛媛), Jianping Xiong(熊剑平)

Affiliation: ZHEJIANG DAHUA TECHNOLOGY CO.,LTD

Description: Our training data included ReCTS, LSVT, RCTW, ArT, and some high-quality synthetic data. We used the CRNN framework for text recognition, with different multi-scale feature extraction backbones such as SA-ResNet and SE-ResNet. We used an efficient shuffle attention method, which combines spatial attention with channel attention. We also used multi-model fusion to predict the final result.
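
The description does not specify how the model outputs are fused. One simple possibility, shown here purely as an assumption, is confidence-weighted voting over the text strings predicted by each model. The function name `fuse_predictions` and the sample predictions are hypothetical.

```python
from collections import defaultdict


def fuse_predictions(predictions):
    """Pick the text with the highest summed confidence.

    predictions: list of (text, confidence) pairs, one per model.
    """
    scores = defaultdict(float)
    for text, confidence in predictions:
        scores[text] += confidence
    # max() returns the first key with the top score, so ties go to the
    # text that was seen first.
    return max(scores, key=scores.get)


# Hypothetical outputs from three recognizers for the same text line.
model_outputs = [("HOTPOT", 0.91), ("HOTPOT", 0.88), ("H0TPOT", 0.95)]
fused = fuse_predictions(model_outputs)
print(fused)
```

Here two moderately confident models outvote a single highly confident one. Character-level alignment or score averaging over logits are alternative fusion schemes.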

Ranking Table

Date        Method                                               Result
2022-01-14  CBL_OCR                                              97.40%
2021-05-08  Unis_OCR                                             97.01%
2021-05-08  DH_OCR                                               97.00%
2020-12-23  Sogou_OCR                                            96.84%
2020-10-28  Eleme-AI-V1                                          96.81%
2020-10-20  PingAn_VisualComputing                               96.62%
2020-08-10  HIK_OCR                                              96.59%
2019-10-15  MCEM_v3-iFLYTEK                                      95.75%
2019-04-30  SANHL_v1                                             95.55%
2020-10-12  transformer_v1                                       95.10%
2019-10-30  Encoder_Decoder_v1                                   95.09%
2019-04-30  Tencent-DPPR Team                                    94.86%
2019-04-30  HUST_VLRGROUP                                        94.83%
2019-04-30  TPS-ResNet v1                                        94.77%
2019-04-23  baseline                                             94.37%
2019-04-30  MCEM v2                                              94.22%
2021-04-19  Aster                                                93.33%
2023-04-06  svtr_v2                                              93.15%
2019-04-30  VOCR                                                 92.50%
2019-04-29  CLTDR                                                92.33%
2019-04-29  凉凉                                                 92.03%
2019-04-30  Task2-re5                                            91.45%
2019-04-30  KyrieNet                                             90.26%
2019-04-28  Amap-CVLab                                           89.55%
2019-04-30  HUST_Reg                                             89.01%
2020-05-22  My method                                            88.48%
2019-04-29  ReCTS_HWY                                            87.65%
2019-04-24  ocr_densenet                                         86.35%
2020-04-11  My method                                            85.19%
2019-04-29  Ssubm190429_nonchar_thres00                          82.17%
2019-04-30  LCT_OCR (中国科学院信息工程研究所)                   80.96%
2019-04-27  resnet101lstm                                        78.26%
2019-11-23  test                                                 77.50%
2019-10-31  test                                                 76.89%
2019-04-30  baseline                                             75.50%
2019-04-30  Scene text detection of polar coordinate regression  73.79%
2019-04-30  jxl_ocr                                              66.46%
2019-04-30  ECUST_dpc                                            66.46%
2020-11-24  cjsoft                                               65.48%
2019-04-30  ECUST_OCR                                            63.96%
2019-04-30  ReCTS-task2                                          57.48%
2019-04-30  task2                                                55.52%

Ranking Graphic