method: Sogou_OCR2019-11-11

Authors: Xudong Rao, Lulu Xu, Long Ma, Xuefeng Su

Description: An arbitrary-shaped text detection method based on Mask R-CNN, we use resnext-152 as our backbone, multi-scale training and testing are adopted to get the final results.

method: Tencent-DPPR Team2019-06-04

Authors: Longhuang Wu, Shangxuan Tian, Chang Liu, Wenjie Cai, Jiachen Li, Sicong Liu, Haoxi Li, Chunchao Guo, Hongfa Wang, Hongkai Chen, Qinglin lu, Chun Yang, Xucheng Yin, Lei Xiao

Description: We are Tencent-DPPR (Data Platform Precision Recommendation) team. Our method follows the framework of Mask R-CNN that employs mask to detect multi-oriented scene texts. We use the MLT-19 and the MSRA-TD500 dataset to train our text detector, and we also apply a multi-scale training approach during training. To obtain the final ensemble results, we combined two different backbones and different multi-scale testing approaches.

Authors: Longhuang Wu, Shangxuan Tian, Chang Liu, Wenjie Cai, Jiachen Li, Sicong Liu, Haoxi Li, Chunchao Guo, Hongfa Wang, Hongkai Chen, Qinglin lu, Chun Yang, Xucheng Yin, Lei Xiao

Description: We are Tencent-DPPR (Data Platform Precision Recommendation) team. Our method follows the framework of Mask R-CNN that employs mask to detect multi-oriented scene texts. We use the MLT-19 and the MSRA-TD500 dataset to train our text detector, and we also apply a multi-scale training approach during training. To obtain the final ensemble results, we combined two different backbones and different multi-scale testing approaches.

Ranking Table

Description Paper Source Code
DateMethodHmeanPrecisionRecallAverage Precision
2019-11-11Sogou_OCR83.72%89.60%78.57%76.23%
2019-06-04Tencent-DPPR Team83.61%87.52%80.05%77.33%
2019-06-04Tencent-DPPR Team (Method_v0.3)83.61%87.57%79.99%77.28%
2019-06-04multi-stage_text_detector_v483.59%87.75%79.80%70.00%
2019-06-03Tencent-DPPR Team (Method_v0.2)83.55%87.78%79.72%77.05%
2019-06-04multi-stage_text_detector_v283.32%87.04%79.91%69.51%
2019-06-04multi-stage_text_detector_v383.30%87.15%79.78%69.49%
2019-06-03multi-stage_text_detector83.25%87.47%79.42%69.44%
2019-06-03NJU-ImagineLab(v3)83.07%87.85%78.79%76.21%
2019-05-30PMTD82.53%87.47%78.12%75.80%
2019-05-27Tencent-DPPR Team (Method_v0.1)81.88%89.41%75.52%72.87%
2019-05-29maskrcnn++ result80.35%82.64%78.19%64.64%
2019-05-29IC_RL80.11%82.97%77.44%64.29%
2019-06-02A two-stage text detector based on cascade rcnn(using total 10000 images of mlt19)78.38%82.26%74.85%71.27%
2019-05-31A two-stage text detector based on cascade rcnn78.11%82.89%73.85%70.31%
2019-06-03mm-maskrcnn_v276.79%84.73%70.21%67.44%
2019-06-04TH-DL-v276.64%84.55%70.09%64.44%
2019-06-03TH-DL-v176.59%84.51%70.03%64.35%
2019-05-27TH-DL76.53%84.70%69.80%64.07%
2019-05-26two stage text detector75.04%82.61%68.74%65.29%
2019-06-03sot74.24%79.96%69.28%65.94%
2019-06-02DISTILLED CRAFT72.94%81.22%66.19%59.16%
2019-06-03text-mountain71.95%72.12%71.77%51.90%
2019-06-03CRAFTS70.86%81.42%62.73%56.63%
2019-06-04Unicamp-SRBR-MLT2019-PELEETEXT70.81%81.58%62.54%59.01%
2019-06-03RRPN69.56%77.71%62.95%58.07%
2019-06-04Unicamp-SRBR-MLT2019-FUSION-PSENET-PELEETEXT68.56%77.00%61.79%56.03%
2019-05-28CRAFTS(Initial)68.11%79.51%59.56%54.50%
2019-06-04Lomin OCR67.65%71.62%64.09%57.95%
2019-06-03 NXB OCR65.96%70.59%61.90%43.72%
2019-05-24PSENet_v165.83%73.52%59.59%52.73%
2019-05-27MLT2019 ETD64.36%78.71%54.44%42.93%
2019-05-27CLTDR63.53%77.20%53.97%41.63%
2019-05-27NXB OCR61.31%71.84%53.48%38.48%
2019-06-03TP58.01%77.59%46.32%37.26%
2019-05-28Unicamp-SRBR-MLT2019-S151.00%75.22%38.58%35.30%
2019-06-04Cyberspace47.09%69.48%35.61%26.17%
2019-05-28PydBox-TextDetector29.79%59.56%19.86%11.83%
2019-05-05AAAA0.02%0.07%0.01%0.00%
2019-05-27Unicamp-SRBR-MLT2019-S10.00%0.03%0.00%0.00%
2019-06-01tsinghuaee51_MLT20190.00%0.04%0.00%0.00%
2019-05-274Paradigm-Data-Intelligence0.00%0.00%0.00%0.00%

Ranking Graphic

Ranking Graphic