method: SituTech_OCR2021-03-11

Authors: Kui Lyu, Chuanhe Liu

Affiliation: Beijing Situ Vision Technologies Co. Ltd

Email: lvkui@situdata.com

Description: In this work, we design an elegant text detection model. Our detector is similar to DBNet, but there are some difference. More specifically, we have introduced an advanced detector backbone, a classic network EfficientDet, with flexible scales and stronger ability to extract features. Another breakthrough is that we optimized the label generation strategy in DBNet. In the original work, the positive area generation and the expansion of the positive area to the bounding box used the Vatti clipping algorithm, which is less robust with different area perimeter ratios. We optimized this function to make the label transform between positive area and bounding box more reasonable.

If you have any questions, please contact us.
SituAIgorithm Team, Beijing Situ Vision Technologies Co. Ltd

method: CPN (multi-scale)2024-05-30

Authors: Longhuang Wu, Shangxuan Tian, Youxin Wang, Pengfei Xiong

Email: wlonghuang@gmail.com

Description: We propose a Complementary Proposal Network (CPN) that seamlessly and parallelly integrates semantic and geometric information for superior performance. This Result is achieved with single Swin-L backbone and multi-scale testing policy. No model ensemble is used.

method: TH2020-04-19

Authors: Tsinghua University and Hyundai Motor Group AIRS Company

Email: Shanyu Xiao: xiaosy19@mails.tsinghua.edu.cn

Description: We have built an end-to-end scene text spotter based on Mask R-CNN & Transformer. The ResNeXt-101 backbone and multiscale training/testing are used.

Ranking Table

Description Paper Source Code
DateMethodHmeanPrecisionRecallAverage Precision
2021-03-11SituTech_OCR62.45%54.60%72.93%39.06%
2024-05-30CPN (multi-scale)58.98%47.28%78.37%58.25%
2020-04-19TH57.64%48.07%71.96%48.51%
2019-06-04multi-stage_text_detector_v456.37%45.16%74.99%35.34%
2019-06-03multi-stage_text_detector55.69%44.49%74.44%34.47%
2019-06-04multi-stage_text_detector_v355.43%43.99%74.90%34.32%
2019-06-04multi-stage_text_detector_v255.31%43.77%75.11%34.18%
2019-05-27Tencent-DPPR Team (Method_v0.1)55.18%47.80%65.26%43.16%
2019-11-11Sogou_OCR54.73%45.24%69.27%46.57%
2019-06-03Tencent-DPPR Team (Method_v0.2)54.49%43.78%72.16%47.86%
2019-06-04Tencent-DPPR Team (Method_v0.3)54.29%43.45%72.34%47.80%
2019-06-04Tencent-DPPR Team54.29%43.43%72.38%47.78%
2019-06-03NJU-ImagineLab(v3)53.62%42.44%72.80%48.78%
2019-05-30PMTD53.02%42.11%71.56%49.26%
2022-11-02ESTextSpotter48.42%38.30%65.82%42.13%
2019-05-27TH-DL47.92%40.60%58.47%28.12%
2019-06-04TH-DL-v247.91%39.96%59.81%29.77%
2019-06-03TH-DL-v147.84%39.99%59.52%29.25%
2019-06-03mm-maskrcnn_v246.98%38.00%61.51%38.58%
2019-05-31A two-stage text detector based on cascade rcnn46.31%36.13%64.47%40.73%
2019-06-02A two-stage text detector based on cascade rcnn(using total 10000 images of mlt19)45.75%35.15%65.54%40.48%
2019-05-29IC_RL45.55%33.60%70.70%24.80%
2021-02-04NCU_MSP45.51%35.00%65.05%22.77%
2023-05-22DeepSolo++ (ResNet-50)45.45%40.17%52.32%32.32%
2019-05-29maskrcnn++ result45.18%32.88%72.16%24.76%
2019-06-02DISTILLED CRAFT44.71%37.51%55.34%26.73%
2020-10-16Drew43.92%35.16%58.47%32.58%
2019-05-26two stage text detector42.58%33.37%58.83%34.28%
2019-06-03CRAFTS42.10%36.28%50.15%21.36%
2019-06-03sot39.88%29.95%59.64%34.85%
2020-05-30NCU39.87%28.27%67.60%19.22%
2019-05-28CRAFTS(Initial)38.98%31.03%52.41%17.55%
2019-06-03text-mountain37.01%25.64%66.47%17.82%
2019-06-04Unicamp-SRBR-MLT2019-PELEETEXT36.70%28.28%52.24%26.22%
2019-06-03RRPN36.11%26.81%55.28%27.88%
2019-05-24PSENet_v134.47%27.67%45.69%22.64%
2023-05-30TD-PPIoU34.42%22.91%69.17%39.54%
2019-06-04Unicamp-SRBR-MLT2019-FUSION-PSENET-PELEETEXT33.89%25.10%52.14%21.80%
2019-05-27MLT2019 ETD33.77%26.20%47.52%12.57%
2019-05-27CLTDR33.68%26.83%45.25%12.27%
2019-06-04Lomin OCR30.29%21.20%53.02%22.43%
2019-05-27NXB OCR29.75%21.98%46.01%14.44%
2019-06-03TP28.94%26.06%32.55%9.80%
2019-06-03 NXB OCR28.86%19.26%57.55%11.20%
2019-05-28Unicamp-SRBR-MLT2019-S128.07%26.13%30.33%16.13%
2020-10-07MEAST_V2_8_oct27.49%19.78%45.04%13.80%
2020-10-23MEAST_V3_23_Oct26.82%18.88%46.23%13.82%
2019-06-04Cyberspace26.02%19.38%39.59%8.62%
2019-05-28PydBox-TextDetector11.13%11.63%10.67%1.40%
2020-12-15DSIT-UOA2.62%1.54%8.79%0.10%
2019-05-05AAAA0.01%0.01%0.01%0.00%
2019-05-274Paradigm-Data-Intelligence0.00%0.00%0.00%0.00%
2019-05-27Unicamp-SRBR-MLT2019-S10.00%0.00%0.00%0.00%
2019-06-01tsinghuaee51_MLT20190.00%0.00%0.00%0.00%

Ranking Graphic

Ranking Graphic