method: I3CL2021-07-05

Authors: Jian Ye, Jing Zhang, Juhua Liu, Bo Du and Dacheng Tao

Affiliation: Wuhan University SigmaLab, JD Explore Academy

Email: leaf-yej@whu.edu.cn

Description: A arbitrary-shaped scene text detector based on Mask R-CNN. In this result, we use ResNeSt-101 as the backbone. Multi-scale training and testing are applied to get the final result. Our training datasets contain SynthText (pretrain), ArT, ICDAR2019-MLT, and part of LSVT.

method: DuXiaoman_OCR2020-05-21

Authors: Hang Yang, Yangchun Wan

Affiliation: Du Xiaoman Financial

Description: Our method is based on Mask RCNN. ResNeXt-152 as our backbone, we first pretrain the model on synthtext 800k, and then finetune on ArT2019,MLT2019 and part of LSVT. Multi-scale training and testing are used to get the final results.
AI-Lab, Du Xiaoman Financial

method: Tencent TEG OCR2019-12-17

Authors: Pei Xu, Hongzhen Wang, Shan Huang, Shen Huang, Qi Ju

Description: This method is based on Mask RCNN. We use resnet152 as backbone and don't use any ensemble methods. We train and test the model in multi scales. We synthesized curved data to pretrain the model. MLT2017 and a small part of LSVT data are used in training.

Ranking Table

Description Paper Source Code
DateMethodRecallPrecisionHmean
2021-07-05I3CL81.03%87.26%84.03%
2020-05-21DuXiaoman_OCR79.35%87.81%83.36%
2019-12-17Tencent TEG OCR81.16%85.64%83.34%
2019-11-04Sogou_OCR78.49%87.94%82.95%
2019-04-30MEGVII_Detection76.68%89.64%82.65%
2020-04-22Mask R-CNN78.55%86.43%82.30%
2019-05-01NJU-ImagineLab74.21%87.35%80.24%
2019-04-29ArtDet-v273.54%86.45%79.48%
2019-04-26baseline_polygon75.38%82.51%78.79%
2020-10-01TextFuseNet (ResNeXt-101)72.77%85.42%78.59%
2019-04-30CUTeOCR71.56%86.57%78.36%
2019-04-29Sg_ptd70.41%85.98%77.42%
2019-04-28 Alibaba-PAI73.25%79.18%76.10%
2021-03-26TextFuseNet (ResNet-50)69.42%82.59%75.44%
2019-04-30Fudan-Supremind Detection v371.61%79.26%75.24%
2019-04-29SRCB_Art70.30%80.41%75.02%
2019-04-30A scene text detection method based on maskrcnn66.25%85.69%74.72%
2019-04-30DMText_art66.15%85.09%74.43%
2021-04-28NN_Chinese_and_euro666.51%82.74%73.74%
2019-04-30TEXT_SNIPER71.45%76.17%73.74%
2019-04-28CLTDR65.92%82.58%73.32%
2021-04-08AutoCV69.59%77.25%73.22%
2019-04-29CRAFT68.93%77.25%72.85%
2019-04-30MaskRCNN_Text67.28%79.06%72.69%
2019-04-30QAQ63.45%83.76%72.21%
2019-04-30MaskDet67.04%76.47%71.44%
2019-04-24fdu_ai61.61%82.11%70.40%
2019-04-30CCISTD60.72%81.16%69.47%
2019-04-30Mask RCNN73.20%65.16%68.95%
2019-05-01TextMask_V170.58%67.33%68.92%
2019-04-22MFTD: Mask Filters for Text Detection63.05%72.09%67.27%
2021-04-23HOCRA64.35%69.75%66.94%
2019-04-25Art detect by vivo57.15%80.72%66.92%
2019-04-29PAT-S.Y59.64%75.72%66.72%
2019-04-16Art_test_baseline_task162.27%71.38%66.51%
2019-04-30DMCA64.01%69.08%66.45%
2019-04-30TMIS53.49%86.19%66.01%
2021-04-28NN_euro651.76%85.50%64.48%
2019-04-22mask rcnn55.61%74.83%63.81%
2019-05-01Unicamp-SRBR-PN-157.59%68.02%62.37%
2019-04-26TP51.62%78.18%62.18%
2019-04-28Improved Progressive scale expansion Net52.24%75.88%61.88%
2019-04-23159.04%57.38%58.20%
2019-04-27TextCohesion_143.66%68.08%53.20%
2019-04-30EM-DATA45.11%61.34%51.99%
2021-04-29HOCRA_base33.63%83.00%47.87%
2019-04-26RAST: Robust Arbitrary Shape Text Detector35.44%71.08%47.30%
2021-03-31inception baseline27.68%54.89%36.80%
2019-04-30MSR0.46%0.55%0.50%

Ranking Graphic