method: Tencent-DPPR Team2019-06-04

Authors: Longhuang Wu, Shangxuan Tian, Chang Liu, Wenjie Cai, Jiachen Li, Sicong Liu, Haoxi Li, Chunchao Guo, Hongfa Wang, Hongkai Chen, Qinglin lu, Chun Yang, Xucheng Yin, Lei Xiao

Description: We are Tencent-DPPR (Data Platform Precision Recommendation) team. Our method follows the framework of Mask R-CNN that employs mask to detect multi-oriented scene texts. We use the MLT-19 and the MSRA-TD500 dataset to train our text detector, and we also apply a multi-scale training approach during training. To obtain the final ensemble results, we combined two different backbones and different multi-scale testing approaches.

Authors: Longhuang Wu, Shangxuan Tian, Chang Liu, Wenjie Cai, Jiachen Li, Sicong Liu, Haoxi Li, Chunchao Guo, Hongfa Wang, Hongkai Chen, Qinglin lu, Chun Yang, Xucheng Yin, Lei Xiao

Description: We are Tencent-DPPR (Data Platform Precision Recommendation) team. Our method follows the framework of Mask R-CNN that employs mask to detect multi-oriented scene texts. We use the MLT-19 and the MSRA-TD500 dataset to train our text detector, and we also apply a multi-scale training approach during training. To obtain the final ensemble results, we combined two different backbones and different multi-scale testing approaches.

Authors: Pengfei Wang~*, Mengyi En*, Xiaoqiang Zhang*, Chengquan Zhang*

Affiliation: VIS-VAR Team, Baidu Inc.*; Xidian University~

Description: The method mainly relies on a two-stage text detector, namely LOMO [1], which is inspired by Mask-R-CNN and where an iterative refinement module is introduced to refine the boundary of text region once or more times during testing to get the more accurate detection results. As extra data sets, ICDAR15 and partial KAIST are also used in the training phase. Multi-scale testing is adopted and the final result is boosted from LOMOs with Resnet-50 and Inception-v4 as different backbones.

*This work is done when Pengfei Wang is an intern at Baidu Inc.

Ranking Table

Description Paper Source Code
DateMethodHmeanPrecisionRecallAverage Precision
2019-06-04Tencent-DPPR Team83.61%87.52%80.05%77.33%
2019-06-04Tencent-DPPR Team (Method_v0.3)83.61%87.57%79.99%77.28%
2019-06-04multi-stage_text_detector_v483.59%87.75%79.80%70.00%
2019-06-03Tencent-DPPR Team (Method_v0.2)83.55%87.78%79.72%77.05%
2019-06-04multi-stage_text_detector_v283.32%87.04%79.91%69.51%
2019-06-04multi-stage_text_detector_v383.30%87.15%79.78%69.49%
2019-06-03multi-stage_text_detector83.25%87.47%79.42%69.44%
2019-06-03NJU-ImagineLab(v3)83.07%87.85%78.79%76.21%
2019-05-30PMTD82.53%87.47%78.12%75.80%
2019-05-27Tencent-DPPR Team (Method_v0.1)81.88%89.41%75.52%72.87%
2019-05-29maskrcnn++ result80.35%82.64%78.19%64.64%
2019-05-29IC_RL80.11%82.97%77.44%64.29%
2019-05-294Paradigm-Data-Intelligence79.84%83.44%76.54%63.94%
2019-06-02A two-stage text detector based on cascade rcnn(using total 10000 images of mlt19)78.38%82.26%74.85%71.27%
2019-05-31A two-stage text detector based on cascade rcnn78.11%82.89%73.85%70.31%
2019-06-03mm-maskrcnn_v276.79%84.73%70.21%67.44%
2019-06-04TH-DL-v276.64%84.55%70.09%64.44%
2019-06-03TH-DL-v176.59%84.51%70.03%64.35%
2019-05-27TH-DL76.53%84.70%69.80%64.07%
2019-05-26two stage text detector75.04%82.61%68.74%65.29%
2019-06-03sot74.24%79.96%69.28%65.94%
2019-06-02DISTILLED CRAFT72.94%81.22%66.19%59.16%
2019-06-03text-mountain71.95%72.12%71.77%51.90%
2019-06-03CRAFTS70.86%81.42%62.73%56.63%
2019-06-04Unicamp-SRBR-MLT2019-PELEETEXT70.81%81.58%62.54%59.01%
2019-06-03RRPN69.56%77.71%62.95%58.07%
2019-06-04Unicamp-SRBR-MLT2019-FUSION-PSENET-PELEETEXT68.56%77.00%61.79%56.03%
2019-05-28CRAFTS(Initial)68.11%79.51%59.56%54.50%
2019-06-04Lomin OCR67.65%71.62%64.09%57.95%
2019-06-03 NXB OCR65.96%70.59%61.90%43.72%
2019-05-24PSENet_v165.83%73.52%59.59%52.73%
2019-05-27MLT2019 ETD64.36%78.71%54.44%42.93%
2019-05-27CLTDR63.53%77.20%53.97%41.63%
2019-05-27NXB OCR61.31%71.84%53.48%38.48%
2019-06-03TP58.01%77.59%46.32%37.26%
2019-05-28Unicamp-SRBR-MLT2019-S151.00%75.22%38.58%35.30%
2019-06-04Cyberspace47.09%69.48%35.61%26.17%
2019-05-28PydBox-TextDetector29.79%59.56%19.86%11.83%
2019-05-05AAAA0.02%0.07%0.01%0.00%
2019-05-27Unicamp-SRBR-MLT2019-S10.00%0.03%0.00%0.00%
2019-06-01tsinghuaee51_MLT20190.00%0.04%0.00%0.00%
2019-05-274Paradigm-Data-Intelligence0.00%0.00%0.00%0.00%

Ranking Graphic

Ranking Graphic