method: HIT (2020-05-13)

Authors: Sihwan Kim and Taejang Park

Affiliation: Hana Institute of Technology

Description: We present a network architecture that maximizes the conditional log-likelihood by optimizing a lower bound with a suitable approximate posterior, an approach that has shown impressive performance in several generative models. In addition, by extending the latent variables from a single layer to multiple layers, the network learns scale-robust features without task-specific regularization or data augmentation. We provide a detailed analysis and report results on three public benchmarks to confirm the efficiency and reliability of the proposed algorithm.
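
The bound mentioned in this description is not spelled out in the entry. As a hedged illustration only, the standard conditional variational lower bound has the form below; the symbols are assumptions introduced here (x for the input image, y for the prediction target, z for the latent variables, q_phi for the approximate posterior, p_theta for the model) and may not match the authors' notation or exact objective.

```latex
\log p_\theta(y \mid x) \;\ge\;
\mathbb{E}_{q_\phi(z \mid x, y)}\bigl[\log p_\theta(y \mid x, z)\bigr]
\;-\; \mathrm{KL}\bigl(q_\phi(z \mid x, y) \,\|\, p_\theta(z \mid x)\bigr)
```

With multiple layers of latent variables, z would stand for the whole stack (z_1, ..., z_L). How the authors factorize the prior and posterior across layers is not stated here.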

method: Craft++ (2020-05-19)

Authors: Xiangyuan Ren, Anjie Song, Zikun Zhou

Affiliation: Shanghai Jiao Tong University, ShannonAi

Email: xiangyuan_ren@shannonai.com

Description: Our method is based on CRAFT, with self-supervised learning for pretraining and stroke-level segmentation for multi-task training.

method: CRAFT (2018-11-07)

Authors: Youngmin Baek, Bado Lee, Dongyoon Han, Sangdoo Yun, and Hwalsuk Lee

Description: We propose a novel text detector called CRAFT. The proposed method effectively detects text areas by exploring each character and the affinity between characters. To overcome the lack of individual character-level annotations, our framework exploits pseudo character-level bounding boxes acquired by the learned interim model in a weakly supervised manner.

Affiliation: Clova AI OCR Team, NAVER/LINE Corp.

Ranking Table

| Date | Method | Recall | Precision | Hmean |
|---|---|---|---|---|
| 2020-05-13 | HIT | 95.14% | 98.48% | 96.78% |
| 2020-05-19 | Craft++ | 94.56% | 96.32% | 95.43% |
| 2018-11-07 | CRAFT | 93.06% | 97.43% | 95.20% |
| 2020-12-15 | VARCO | 92.82% | 97.56% | 95.13% |
| 2020-07-31 | TextFuseNet | 91.96% | 97.38% | 94.60% |
| 2020-01-20 | VARCO | 92.47% | 95.02% | 93.73% |
| 2020-11-10 | Hancom Vision | 89.81% | 97.26% | 93.39% |
| 2018-01-22 | FOTS | 90.41% | 95.36% | 92.82% |
| 2018-12-03 | SPCNet_TongJi & UESTC (single scale) | 90.76% | 94.15% | 92.42% |
| 2017-08-10 | SRC-B-MachineLearningLab-v4 | 90.50% | 94.26% | 92.34% |
| 2017-03-16 | Ali-Amap-xlab-v2 | 91.54% | 92.16% | 91.85% |
| 2019-07-12 | stela | 89.37% | 93.80% | 91.53% |
| 2017-12-15 | EPTN-SJTU | 88.95% | 93.55% | 91.19% |
| 2016-12-16 | RRPN-4 | 87.31% | 95.19% | 91.08% |
| 2016-12-04 | Ali-Amap-xlab | 90.30% | 91.26% | 90.78% |
| 2017-03-22 | MCLAB_TextBoxes_v2 | 85.57% | 91.87% | 88.61% |
| 2018-01-04 | crpn | 84.04% | 92.12% | 87.89% |
| 2016-11-08 | CTPN | 82.98% | 92.98% | 87.69% |
| 2017-03-05 | WeText | 83.07% | 91.06% | 86.88% |
| 2018-12-08 | Unicamp-SRBR-v2 | 82.06% | 92.15% | 86.82% |
| 2016-03-16 | TextConv+WordGraph | 81.02% | 93.38% | 86.76% |
| 2016-06-23 | SRC-B-TextProcessingLab | 81.52% | 91.11% | 86.04% |
| 2016-08-31 | MCLAB_TextBoxes | 83.00% | 89.00% | 85.89% |
| 2015-11-04 | MSER_Binary_CNN | 82.37% | 89.12% | 85.61% |
| 2015-04-03 | StradVision | 80.15% | 90.93% | 85.20% |
| 2015-03-26 | VGGMaxNet_cmb | 77.32% | 92.18% | 84.10% |
| 2015-03-23 | VGGMaxNet_013 | 76.38% | 93.00% | 83.88% |
| 2015-04-02 | VGGMaxNet_025 | 79.76% | 88.42% | 83.87% |
| 2015-03-23 | VGGMaxNet_1.6 | 75.62% | 93.45% | 83.59% |
| 2018-12-08 | Unicamp-SRBR-v3 | 76.16% | 91.51% | 83.14% |
| 2014-06-10 | IWRR2014 | 78.65% | 85.89% | 82.11% |
| 2014-11-12 | HUST_MCLAB | 76.05% | 87.96% | 81.58% |
| 2019-06-26 | std(single-scale) | 78.05% | 85.02% | 81.38% |
| 2016-11-13 | RRPN-3 | 72.00% | 90.97% | 80.38% |
| 2015-01-01 | BUCT_YST | 73.88% | 84.64% | 78.90% |
| 2013-08-29 | UMD_IntegratedDisrimination | 69.97% | 89.45% | 78.52% |
| 2013-04-07 | USTB_TexStar | 69.28% | 88.80% | 77.83% |
| 2013-04-08 | Text_detector_CASIA | 67.27% | 84.97% | 75.09% |
| 2015-07-22 | ZText | 67.05% | 85.01% | 74.97% |
| 2013-04-05 | TextSpotter | 64.97% | 87.49% | 74.56% |
| 2018-12-08 | Unicamp-SRBR-v1 | 64.40% | 87.40% | 74.16% |
| 2013-04-08 | CASIA_NLPR | 68.82% | 79.26% | 73.67% |
| 2013-04-09 | I2R_NUS_FAR | 70.92% | 75.71% | 73.24% |
| 2017-10-12 | TextFCN V2 | 77.77% | 67.98% | 72.55% |
| 2014-08-18 | DetectText | 68.99% | 75.93% | 72.29% |
| 2013-04-08 | I2R_NUS | 69.84% | 73.29% | 71.52% |
| 2015-03-23 | VGGMaxNet_10 | 55.11% | 97.35% | 70.38% |
| 2013-04-08 | TH-TextLoc | 69.95% | 70.47% | 70.21% |
| 2013-04-06 | Text Detection | 66.05% | 74.50% | 70.02% |
| 2018-12-29 | fast_ret_sh_02 | 54.76% | 77.09% | 64.03% |
| 2015-08-18 | MSER with LocalSWT | 48.11% | 65.93% | 55.63% |
| 2013-04-10 | Inkam | 42.54% | 31.73% | 36.35% |
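
The Hmean column is consistent with the harmonic mean of recall and precision, which is also the key the table is sorted by. The sketch below recomputes it for the top three entries. The function name `hmean` and the `rows` list are illustrative names introduced here, not part of the original page. The published figures were presumably computed from unrounded recall and precision, so a recomputed value can differ by 0.01 in the last digit.

```python
def hmean(recall: float, precision: float) -> float:
    """Harmonic mean of recall and precision, both given in percent."""
    if recall + precision == 0:
        return 0.0  # avoid division by zero when both scores are zero
    return 2 * recall * precision / (recall + precision)


# (method, recall %, precision %) copied from the ranking table
rows = [
    ("HIT", 95.14, 98.48),
    ("Craft++", 94.56, 96.32),
    ("CRAFT", 93.06, 97.43),
]

for method, recall, precision in rows:
    print(f"{method}: Hmean = {hmean(recall, precision):.2f}%")
```

Because the harmonic mean is pulled toward the smaller of its two inputs, a method with very high precision but low recall (for example VGGMaxNet_10 at 55.11% recall and 97.35% precision) still ranks near the bottom.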
