method: HIT2020-05-13

Authors: Sihwan Kim and Taejang Park

Affiliation: Hana Institute of Technology

Description: we present the network architecture to maximize conditional log-likelihood by optimizing the lower bound with a proper approximate posterior that has shown impressive performance in several generative model. In addition, by extending layer of latent variables to multiple layers, the network is able to learn scale robust features with no task specific regularization or data augmentation. We provide a detailed analysis and show the results of three public benchmarks to confirm the efficiency and reliability of the proposed algorithm.

method: Craft++2020-05-19

Authors: Xiangyuan Ren, Anjie Song, Zikun Zhou

Affiliation: Shanghai Jiao Tong University, ShannonAi

Email: xiangyuan_ren@shannonai.com

Description: Out Method is based on CRAFT, with Self Supervised Learning for pretraining and stroke level segmentation for multi-task training

method: VARCO2020-12-15

Authors: Jaemyung Lee, Jusung Lee, Younghyun Lee, Joonsoo Lee

Affiliation: NCSOFT

Description: This work was supported by Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government(MSIT) (No.1711117050, Text Localization and Recognition for Efficient Digital Contents Analysis)

Ranking Table

Description Paper Source Code
DateMethodRecallPrecisionHmean
2020-05-13HIT94.36%98.48%96.38%
2020-05-19Craft++94.67%96.44%95.54%
2020-12-15VARCO92.62%97.54%95.02%
2018-11-07CRAFT92.40%97.67%94.96%
2020-07-31TextFuseNet92.09%97.27%94.61%
2020-01-20VARCO92.49%94.74%93.60%
2020-11-10Hancom Vision88.27%97.24%92.54%
2018-01-22FOTS90.47%94.63%92.50%
2018-12-03SPCNet_TongJi & UESTC (single scale)90.59%93.77%92.16%
2017-08-10SRC-B-MachineLearningLab-v489.77%93.73%91.71%
2019-07-12stela89.66%93.74%91.65%
2016-12-16RRPN-487.85%94.91%91.25%
2017-12-15EPTN-SJTU89.02%93.17%91.05%
2017-03-16Ali-Amap-xlab-v287.40%91.80%89.54%
2016-12-04Ali-Amap-xlab86.16%90.89%88.46%
2017-03-22MCLAB_TextBoxes_v284.38%91.21%87.67%
2018-01-04crpn83.80%91.90%87.66%
2018-12-08Unicamp-SRBR-v280.97%91.80%86.05%
2016-08-31MCLAB_TextBoxes82.59%87.73%85.08%
2016-06-23SRC-B-TextProcessingLab80.18%90.50%85.03%
2015-04-03StradVision78.85%90.21%84.15%
2015-04-02VGGMaxNet_02579.58%88.04%83.59%
2015-03-26VGGMaxNet_cmb77.57%90.47%83.53%
2015-11-04MSER_Binary_CNN78.67%88.79%83.42%
2016-03-16TextConv+WordGraph75.23%93.28%83.29%
2015-03-23VGGMaxNet_01376.49%91.36%83.27%
2017-03-05WeText76.40%91.16%83.13%
2015-03-23VGGMaxNet_1.675.71%91.85%83.00%
2018-12-08Unicamp-SRBR-v374.92%91.38%82.34%
2016-11-08CTPN73.72%92.77%82.15%
2019-06-26std(single-scale)77.13%84.48%80.64%
2014-11-12HUST_MCLAB74.28%87.68%80.43%
2016-11-13RRPN-371.89%90.22%80.02%
2015-01-01BUCT_YST72.84%84.43%78.21%
2014-06-10IWRR201470.01%85.61%77.03%
2013-04-07USTB_TexStar66.45%88.47%75.89%
2013-04-05TextSpotter64.84%87.51%74.49%
2013-08-29UMD_IntegratedDisrimination62.26%89.17%73.33%
2013-04-08CASIA_NLPR68.24%78.89%73.18%
2018-12-08Unicamp-SRBR-v162.56%86.81%72.71%
2015-07-22ZText63.51%84.73%72.60%
2013-04-08Text_detector_CASIA62.85%84.70%72.16%
2013-04-09I2R_NUS_FAR69.00%75.08%71.91%
2017-10-12TextFCN V275.53%67.75%71.43%
2014-08-18DetectText66.05%75.82%70.60%
2015-03-23VGGMaxNet_1054.87%97.35%70.18%
2013-04-08I2R_NUS66.17%72.54%69.21%
2013-04-08TH-TextLoc65.19%69.96%67.49%
2018-12-29fast_ret_sh_0255.32%76.09%64.06%
2013-04-06Text Detection53.42%74.15%62.10%
2015-08-18MSER with LocalSWT45.37%65.05%53.46%
2013-04-10Inkam35.27%31.20%33.11%

Ranking Graphic