method: NJU-ImagineLab, 2019-06-02

Authors: Xiaoge Song, Yao Xiao, Wenhai Wang, Enze Xie, Tong Lu

Description: We are from ImagineLab at Nanjing University and from Tongji University. Our method is a Mask R-CNN based framework.

method: GNNets (single scale), 2019-03-29

Authors: Jiaqi Duan; Youjiang Xu; Zhanghui Kuang; Hongbin Sun; Yue Guan; Wei Zhang

Description: Large geometry (e.g., orientation) variances are the key challenges in scene text detection. In this work, we first conduct experiments to investigate the capacity of networks to learn geometry variances when detecting scene text, and find that networks can handle only limited text geometry variances. We then put forward a novel Geometry Normalization Module (GNM) with multiple branches, each composed of one Scale Normalization Unit and one Orientation Normalization Unit, to normalize each text instance to a desired canonical geometry range through at least one branch. The GNM is general and can be readily plugged into existing convolutional neural network based text detectors to construct end-to-end Geometry Normalization Networks (GNNets).

method: Baidu-VIS, 2019-05-08

Authors: Baidu-VIS

Description: We are from the Department of Computer Vision, Baidu Inc. Our approach follows the LOMO framework and mainly upgrades the backbone network. For more details about LOMO, please refer to the paper "Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes", accepted at CVPR 2019 (https://128.84.21.199/abs/1904.06535).

Ranking Table

| Date | Method | Hmean | Precision | Recall | Average Precision |
|---|---|---|---|---|---|
| 2019-06-02 | NJU-ImagineLab | 47.22% | 32.46% | 86.61% | 68.80% |
| 2019-03-29 | GNNets (single scale) | 47.15% | 33.90% | 77.45% | 51.73% |
| 2019-05-08 | Baidu-VIS | 46.60% | 32.89% | 79.96% | 26.26% |
| 2019-03-23 | PMTD | 45.64% | 32.09% | 78.99% | 52.51% |
| 2018-11-20 | Pixel-Anchor | 44.36% | 32.15% | 71.54% | 27.03% |
| 2018-01-22 | FOTS_v2 | 43.40% | 29.89% | 79.20% | 41.42% |
| 2019-03-19 | ccnet single scale | 43.18% | 30.21% | 75.68% | 45.22% |
| 2019-06-11 | 4Paradigm-Data-Intelligence | 43.11% | 29.07% | 83.37% | 24.42% |
| 2019-05-23 | 4Paradigm-Data-Intelligence | 42.96% | 29.06% | 82.32% | 24.00% |
| 2017-06-28 | SCUT_DLVClab1 | 42.20% | 29.86% | 71.92% | 52.71% |
| 2018-10-29 | Amap-CVLab | 41.86% | 28.52% | 78.63% | 51.64% |
| 2018-12-04 | Mask R-CNN -multi scale | 41.12% | 28.02% | 77.24% | 55.40% |
| 2018-11-28 | CRAFT | 40.43% | 28.37% | 70.34% | 19.96% |
| 2017-11-09 | EAST++ | 40.08% | 27.29% | 75.47% | 34.10% |
| 2018-05-18 | PSENet_NJU_ImagineLab (single-scale) | 39.63% | 27.08% | 73.87% | 20.21% |
| 2018-11-15 | USTC-NELSLIP | 38.09% | 24.99% | 80.04% | 46.23% |
| 2018-12-22 | PKU_VDIG | 37.80% | 24.42% | 83.58% | 65.44% |
| 2018-12-02 | Shape-Aware Based Scene Text Detector (single scale) | 36.42% | 24.01% | 75.41% | 18.22% |
| 2019-01-08 | ALGCD_CP | 36.41% | 23.88% | 76.63% | 32.94% |
| 2018-08-23 | Sogou_MM | 36.00% | 23.34% | 78.67% | 57.38% |
| 2018-12-04 | SPCNet_TongJi & UESTC (multi scale) | 35.96% | 23.95% | 72.19% | 17.42% |
| 2018-12-05 | EPTN-SJTU | 33.30% | 21.52% | 73.52% | 33.46% |
| 2019-05-30 | Thesis-SE | 32.77% | 21.27% | 71.39% | 30.12% |
| 2018-03-12 | ATL Cangjie OCR | 32.11% | 20.84% | 69.98% | 35.50% |
| 2019-07-15 | stela | 32.07% | 22.31% | 57.05% | 28.65% |
| 2017-06-29 | SARI_FDU_RRPN_v1 | 30.15% | 19.23% | 69.71% | 40.70% |
| 2018-12-13 | AutoCV | 29.06% | 17.81% | 78.78% | 36.49% |
| 2018-12-03 | SPCNet_TongJi & UESTC (single scale) | 27.97% | 17.16% | 75.68% | 13.15% |
| 2017-06-28 | SARI_FDU_RRPN_v0 | 25.27% | 15.60% | 66.48% | 36.44% |
| 2017-06-30 | TH-DL | 25.09% | 16.98% | 48.08% | 20.81% |
| 2019-01-03 | YY AI OCR Group | 23.12% | 14.40% | 58.51% | 15.26% |
| 2017-06-30 | linkage-ER-Flow | 17.34% | 10.30% | 54.72% | 12.37% |
| 2017-06-30 | Sensetime OCR | 14.98% | 8.27% | 80.04% | 40.38% |
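
The Hmean column in the ranking table above appears to be the harmonic mean of precision and recall. The page does not state the formula, so that reading is an assumption. The short sketch below recomputes it for the first three rows; the function name `hmean` is purely illustrative. Small differences in the last digit are expected because the table shows precision and recall already rounded to two decimals.

```python
def hmean(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall.

    Both inputs must use the same scale (both percentages or both fractions).
    Assumption: this is how the table's Hmean column is derived.
    """
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both scores are zero
    return 2 * precision * recall / (precision + recall)


# (method, precision %, recall %) copied from the first three table rows.
rows = [
    ("NJU-ImagineLab", 32.46, 86.61),
    ("GNNets (single scale)", 33.90, 77.45),
    ("Baidu-VIS", 32.89, 79.96),
]

for name, p, r in rows:
    print(f"{name}: Hmean = {hmean(p, r):.2f}%")
```

Because the harmonic mean is dominated by the smaller of its two inputs, every entry's Hmean sits much closer to its precision than to its recall, which is why methods with recall near 80% still score below 50%.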

Ranking Graphic
