method: CPN (multi-scale)2024-05-30

Authors: Longhuang Wu, Shangxuan Tian, Youxin Wang, Pengfei Xiong

Email: wlonghuang@gmail.com

Description: We propose a Complementary Proposal Network (CPN) that seamlessly and parallelly integrates semantic and geometric information for superior performance. This Result is achieved with single Swin-L backbone and multi-scale testing policy. No model ensemble is used.

method: CBL_OCR2022-01-30

Authors: Jingyi Shen(沈静逸), Guokun Wang(王国坤), Yue Wu(吴岳), Chang Zhou(周昌), Jianqiang Huang(黄建强)

Affiliation: Alibaba

Description: Anchor-free based detection framework with poly regression and text segmentation was used here. The model was firstly pretrained on LSVT、MLT、ReCTS,and then finetuned on ReCTS. Weighted loss was used to enhance small text instances. At testing phase, single-model-multi-scale with post-processing was used to generate the final results.

method: Unis_OCR2021-04-27

Authors: BaoLin.Zhang(张保林),Jie.Li(李杰),KeJie.Liu(刘克捷),Yuan.Hu(胡源)

Affiliation: UNISINSIGHT

Description: First, the training method is based on SAST framework, which takes RES101 with multi-scale features as the backbone, and adds attention and lamdba method . At the same time, the training data used include LSVT,LSVT Weak, ReCTS, Weak, icdar and other free data from internet.

Ranking Table

Description Paper Source Code
DateMethodRecallPrecisionHmean
2024-05-30CPN (multi-scale)94.50%94.61%94.56%
2022-01-30CBL_OCR93.27%94.08%93.67%
2021-04-27Unis_OCR94.66%92.51%93.57%
2019-10-10NSTD-iFLYTEK93.17%93.62%93.40%
2019-05-01SANHL_v493.97%92.76%93.36%
2019-05-01Tencent-DPPR Team93.46%92.59%93.03%
2019-04-29Amap-CVLab93.41%91.62%92.50%
2021-04-26ZJUT94.10%90.46%92.25%
2019-04-30HUST_VLRGROUP93.51%89.15%91.27%
2019-04-30maskrcnn_text91.96%90.09%91.02%
2019-04-30Task3-re590.03%91.65%90.83%
2019-04-22oo91.56%90.08%90.81%
2019-04-23A region proposal and fcn model based method 88.64%92.72%90.64%
2019-04-30Mask R-CNN89.84%91.41%90.62%
2019-04-30COLD AND COOL90.99%89.59%90.28%
2019-04-26baseline_0.793.66%86.35%89.86%
2019-04-30pursuer86.13%92.72%89.31%
2019-04-29CLTDR88.92%88.70%88.81%
2020-05-18MMTD86.63%89.92%88.25%
2019-04-30CRAFT85.33%89.38%87.31%
2019-04-30FRCC84.67%89.53%87.03%
2019-04-25EAST检测网络82.27%88.49%85.27%
2019-04-26JDIVA_Textboxes++87.02%81.23%84.03%
2019-04-30FFLOVE88.52%79.32%83.66%
2019-04-29Subm19042985.18%79.66%82.33%
2019-04-23PSENet_v183.16%80.77%81.94%
2019-04-30Sogou_MM96.17%69.20%80.48%
2019-04-30WHUT79.53%79.36%79.45%
2019-04-30PixelBased Prediction86.02%70.68%77.60%
2019-10-31Cluster75.80%77.05%76.42%
2019-04-28gd method73.05%78.35%75.61%
2019-04-28CornerNet Multi Scale70.35%80.19%74.95%
2019-04-30Textboxes++ detects arbitrary-oriented scene text in a single network forward pass60.66%90.87%72.76%
2019-04-25The improved CTPN66.83%75.87%71.07%
2019-04-30Scene text detection of polar coordinate regression72.54%56.44%63.48%
2019-04-30Multi-scale Pixellink50.57%32.98%39.92%
2019-04-29task37.82%8.14%7.98%

Ranking Graphic