method: TH2020-04-19

Authors: Tsinghua University and Hyundai Motor Group AIRS Company

Email: Shanyu Xiao: xiaosy19@mails.tsinghua.edu.cn

Description: We have built an end-to-end scene text spotter based on Mask R-CNN & Transformer. The ResNeXt-101 backbone and multiscale training/testing are used.

method: Sogou_OCR2019-11-11

Authors: Xudong Rao, Lulu Xu, Long Ma, Xuefeng Su

Description: An arbitrary-shaped text detection method based on Mask R-CNN, we use resnext-152 as our backbone, multi-scale training and testing are adopted to get the final results.

method: Tencent-DPPR Team2019-06-04

Authors: Longhuang Wu, Shangxuan Tian, Chang Liu, Wenjie Cai, Jiachen Li, Sicong Liu, Haoxi Li, Chunchao Guo, Hongfa Wang, Hongkai Chen, Qinglin lu, Chun Yang, Xucheng Yin, Lei Xiao

Description: We are Tencent-DPPR (Data Platform Precision Recommendation) team. Our method follows the framework of Mask R-CNN that employs mask to detect multi-oriented scene texts. We use the MLT-19 and the MSRA-TD500 dataset to train our text detector, and we also apply a multi-scale training approach during training. To obtain the final ensemble results, we combined two different backbones and different multi-scale testing approaches.

Ranking Table

Description Paper Source Code
DateMethodHmeanPrecisionRecallAverage Precision
2020-04-19TH84.09%86.67%81.65%78.40%
2019-11-11Sogou_OCR82.99%86.15%80.05%76.55%
2019-06-04Tencent-DPPR Team82.75%84.26%81.30%77.56%
2019-06-04Tencent-DPPR Team (Method_v0.3)82.75%84.29%81.25%77.51%
2019-06-03Tencent-DPPR Team (Method_v0.2)82.69%84.50%80.96%77.25%
2019-06-04multi-stage_text_detector_v482.65%84.62%80.76%68.43%
2019-06-03NJU-ImagineLab(v3)82.42%84.06%80.84%77.21%
2019-06-03multi-stage_text_detector82.37%84.39%80.45%68.04%
2019-06-04multi-stage_text_detector_v382.33%84.04%80.69%67.96%
2019-06-04multi-stage_text_detector_v282.29%83.91%80.74%67.90%
2019-05-30PMTD81.79%83.96%79.74%76.65%
2019-05-27Tencent-DPPR Team (Method_v0.1)81.21%86.65%76.41%72.62%
2021-03-11SituTech_OCR80.51%89.31%73.30%65.35%
2019-05-29IC_RL79.19%79.06%79.32%62.97%
2019-05-29maskrcnn++ result79.03%78.32%79.75%62.74%
2019-06-02A two-stage text detector based on cascade rcnn(using total 10000 images of mlt19)78.21%79.31%77.13%72.54%
2021-02-04NCU_MSP78.09%80.42%75.90%60.81%
2019-05-31A two-stage text detector based on cascade rcnn77.90%79.94%75.96%71.42%
2022-11-02ESTextSpotter77.34%79.33%75.45%71.17%
2019-05-27TH-DL76.78%83.33%71.19%65.06%
2019-06-04TH-DL-v276.70%82.36%71.76%65.23%
2019-06-03TH-DL-v176.59%82.34%71.59%65.14%
2019-06-03mm-maskrcnn_v275.86%81.49%70.96%67.15%
2020-10-16Drew75.71%81.06%71.02%66.56%
2019-06-02DISTILLED CRAFT75.61%81.81%70.29%63.82%
2023-05-22DeepSolo++ (ResNet-50)74.93%82.89%68.36%65.63%
2020-05-30NCU74.29%73.90%74.68%54.64%
2019-05-26two stage text detector74.08%78.70%69.97%65.38%
2019-06-03CRAFTS72.49%80.63%65.84%59.84%
2019-06-03sot72.10%75.57%68.93%64.28%
2023-05-30TD-PPIoU71.10%68.45%73.97%68.66%
2019-06-03text-mountain71.02%69.67%72.43%50.95%
2019-06-04Unicamp-SRBR-MLT2019-PELEETEXT68.25%76.04%61.91%56.83%
2019-06-03RRPN68.01%73.62%63.19%57.02%
2019-05-28CRAFTS(Initial)67.88%76.95%60.72%56.01%
2019-06-04Unicamp-SRBR-MLT2019-FUSION-PSENET-PELEETEXT65.87%72.57%60.31%53.05%
2019-06-04Lomin OCR65.84%67.23%64.51%56.70%
2019-05-24PSENet_v165.77%73.21%59.69%52.45%
2019-06-03 NXB OCR63.38%64.04%62.73%40.13%
2019-05-27CLTDR61.73%73.94%52.98%39.35%
2019-05-27MLT2019 ETD60.43%72.45%51.83%37.73%
2020-10-07MEAST_V2_8_oct59.72%66.15%54.44%38.78%
2020-10-23MEAST_V3_23_Oct59.62%64.91%55.13%38.74%
2019-05-27NXB OCR59.26%67.52%52.81%36.99%
2019-06-03TP54.98%74.40%43.61%34.10%
2019-05-28Unicamp-SRBR-MLT2019-S146.05%71.89%33.88%30.19%
2019-06-04Cyberspace42.47%58.18%33.44%21.05%
2019-05-28PydBox-TextDetector35.92%66.64%24.58%16.63%
2020-12-15DSIT-UOA21.12%21.03%21.20%5.85%
2019-05-05AAAA0.02%0.03%0.01%0.00%
2019-05-274Paradigm-Data-Intelligence0.00%0.00%0.00%0.00%
2019-05-27Unicamp-SRBR-MLT2019-S10.00%0.00%0.00%0.00%
2019-06-01tsinghuaee51_MLT20190.00%0.00%0.00%0.00%

Ranking Graphic

Ranking Graphic