method: CBL_OCR2022-01-30

Authors: Jingyi Shen(沈静逸), Guokun Wang(王国坤), Yue Wu(吴岳), Chang Zhou(周昌), Jianqiang Huang(黄建强)

Affiliation: Alibaba

Description: Anchor-free based detection framework with poly regression and text segmentation was used here. The model was firstly pretrained on LSVT、MLT、ReCTS,and then finetuned on ReCTS. Weighted loss was used to enhance small text instances. At testing phase, single-model-multi-scale with post-processing was used to generate the final results.

method: Unis_OCR2021-04-27

Authors: BaoLin.Zhang(张保林),Jie.Li(李杰),KeJie.Liu(刘克捷),Yuan.Hu(胡源)

Affiliation: UNISINSIGHT

Description: First, the training method is based on SAST framework, which takes RES101 with multi-scale features as the backbone, and adds attention and lamdba method . At the same time, the training data used include LSVT,LSVT Weak, ReCTS, Weak, icdar and other free data from internet.

method: NSTD-iFLYTEK2019-10-10

Authors: iFLYTEK

Affiliation: iFLYTEK

Description: Natural scene text detector(NSTD-iFLYTEK) is based on MaskRcnn with resnet-101. Only ICDAR2019 datasets are used for training, including Rects, LSVT, MLT and Art. Multi-scale training and single-scale testing are used to generate the final result, no model ensemble.
name and organization:
Jian Dong(董健) iFLYTEK(科大讯飞)
Fengren Wang(王烽人) iFLYTEK(科大讯飞)
Jiajia Wu(吴嘉嘉) iFLYTEK(科大讯飞)
Yin Lin(林垠) iFLYTEK(科大讯飞)
Lou Shun(娄舜) iFLYTEK(科大讯飞)
Jinshui Hu(胡金水) iFLYTEK(科大讯飞)

Ranking Table

Description Paper Source Code
DateMethodRecallPrecisionHmean
2022-01-30CBL_OCR93.27%94.08%93.67%
2021-04-27Unis_OCR94.66%92.51%93.57%
2019-10-10NSTD-iFLYTEK93.17%93.62%93.40%
2019-05-01SANHL_v493.97%92.76%93.36%
2019-05-01Tencent-DPPR Team93.46%92.59%93.03%
2019-04-29Amap-CVLab93.41%91.62%92.50%
2021-04-26ZJUT94.10%90.46%92.25%
2019-04-30HUST_VLRGROUP93.51%89.15%91.27%
2019-04-30maskrcnn_text91.96%90.09%91.02%
2019-04-30Task3-re590.03%91.65%90.83%
2019-04-22oo91.56%90.08%90.81%
2019-04-23A region proposal and fcn model based method 88.64%92.72%90.64%
2019-04-30Mask R-CNN89.84%91.41%90.62%
2019-04-30COLD AND COOL90.99%89.59%90.28%
2019-04-26baseline_0.793.66%86.35%89.86%
2019-04-30pursuer86.13%92.72%89.31%
2019-04-29CLTDR88.92%88.70%88.81%
2020-05-18MMTD86.63%89.92%88.25%
2019-04-30CRAFT85.33%89.38%87.31%
2019-04-30FRCC84.67%89.53%87.03%
2019-04-25EAST检测网络82.27%88.49%85.27%
2019-04-26JDIVA_Textboxes++87.02%81.23%84.03%
2019-04-30FFLOVE88.52%79.32%83.66%
2019-04-29Subm19042985.18%79.66%82.33%
2019-04-23PSENet_v183.16%80.77%81.94%
2019-04-30Sogou_MM96.17%69.20%80.48%
2019-04-30WHUT79.53%79.36%79.45%
2019-04-30PixelBased Prediction86.02%70.68%77.60%
2019-10-31Cluster75.80%77.05%76.42%
2019-04-28gd method73.05%78.35%75.61%
2019-04-28CornerNet Multi Scale70.35%80.19%74.95%
2019-04-30Textboxes++ detects arbitrary-oriented scene text in a single network forward pass60.66%90.87%72.76%
2019-04-25The improved CTPN66.83%75.87%71.07%
2019-04-30Scene text detection of polar coordinate regression72.54%56.44%63.48%
2019-04-30Multi-scale Pixellink50.57%32.98%39.92%
2019-04-29task37.82%8.14%7.98%

Ranking Graphic