method: Shopee MMU OCR (2022-10-31)

Authors: Jianqiang Liu, Hanfei Xu, Bin Zheng, Eric W, Ronnie T, Alex X

Affiliation: Shopee MMU OCR

Description: Our method adopts a transformer-based context-aware framework. We utilize a hybrid architecture encoder and a context-aware autoregressive decoder to construct the recognition pipeline. Finally, a simple but effective multi-model fusion strategy is adopted.
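The description does not say how the multi-model fusion works. Purely as an illustration, the sketch below assumes one common and simple option: each model outputs a transcription with a confidence score, and the fused result is the transcription with the highest summed confidence. The function name `fuse_predictions` and the (text, confidence) input format are hypothetical and not taken from the authors' system.

```python
from collections import defaultdict


def fuse_predictions(candidates):
    """Pick one transcription from several models' outputs.

    `candidates` is a list of (text, confidence) pairs, one per model.
    ASSUMPTION: confidence-weighted voting; the actual fusion strategy
    used by the Shopee MMU OCR entry is not described in the source.
    """
    scores = defaultdict(float)
    for text, confidence in candidates:
        # Identical transcriptions from different models pool their confidence.
        scores[text] += confidence
    # Return the transcription with the largest pooled confidence.
    return max(scores, key=scores.get)


# Two models agree on "SALE", so it outweighs one confident "SALF".
fused = fuse_predictions([("SALE", 0.6), ("SALF", 0.9), ("SALE", 0.5)])
```

Under this assumption, agreement between weaker models can override a single confident but wrong model, which is one reason voting schemes are popular for ensembles.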

method: CLOVA-AI v2 (2019-02-18)

Authors: Jeonghun Baek, Junyeop Lee, Sungrae Park, Moonbin Yim, Seonghyeon Kim, Hwalsuk Lee

Description: We used a thin-plate-spline (TPS) based spatial transformer network (STN), which normalizes the input text images, followed by a ResNet-based feature extractor, a BiLSTM, and an attention mechanism.
This model was developed based on the analysis of scene text recognition modules.
See our paper and source code.

method: VARCO_v2 (2020-12-15)

Authors: Jusung Lee, Jaemyung Lee, Younghyun Lee, Joonsoo Lee

Affiliation: VARCO

Description: This work was supported by an Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government (MSIT) (No. 1711097855, Text Localization and Recognition for Efficient Digital Contents Analysis).

Ranking Table

| Date | Method | Total Edit Distance (case sensitive) | Correctly Recognised Words (case sensitive) | T.E.D. (case insensitive) | C.R.W. (case insensitive) |
|---|---|---|---|---|---|
| 2022-10-31 | Shopee MMU OCR | 15.9567 | 96.53% | 10.9539 | 97.17% |
| 2019-02-18 | CLOVA-AI v2 | 26.9515 | 95.98% | 23.4836 | 96.35% |
| 2020-12-15 | VARCO_v2 | 36.8797 | 94.61% | 26.2131 | 95.89% |
| 2018-11-12 | CLOVA-AI / PAPAGO | 37.7934 | 94.52% | 30.7018 | 95.34% |
| 2020-09-04 | Hancom Vision | 39.0466 | 93.52% | 33.6133 | 93.97% |
| 2018-09-12 | Clova AI / Lens | 39.1422 | 94.25% | 36.3422 | 94.61% |
| 2020-01-20 | VARCO | 39.8552 | 93.79% | 28.8885 | 95.07% |
| 2020-09-04 | Huawei_GDE_AI | 40.7449 | 90.32% | 32.5216 | 91.60% |
| 2017-08-14 | TencentAILab | 42.0003 | 95.07% | 39.3454 | 95.34% |
| 2017-07-28 | Tencent Youtu | 48.1239 | 92.42% | 40.3711 | 93.42% |
| 2021-05-14 | Baseline | 62.0971 | 88.49% | 29.6273 | 94.52% |
| 2017-02-24 | HIK_OCR | 64.9531 | 90.78% | 42.3134 | 93.33% |
| 2016-06-23 | Baidu IDL | 70.3918 | 88.31% | 57.5299 | 89.95% |
| 2020-09-30 | SRIB-STRIDE (Scene Text Recognition In Device) | 72.7929 | 86.67% | 62.6314 | 87.58% |
| 2016-01-25 | SRC-B-TextProcessingLab | 74.4549 | 87.40% | 63.1787 | 88.95% |
| 2019-09-05 | juxinli | 78.9369 | 88.58% | 49.8043 | 91.32% |
| 2015-12-29 | SRC-B-TextProcessingLab | 88.2727 | 84.66% | 78.2512 | 86.12% |
| 2017-05-31 | CVTE_OCR | 93.4391 | 80.55% | 81.8130 | 82.19% |
| 2015-11-09 | Megvii-Image++ | 115.9124 | 82.83% | 94.0676 | 86.03% |
| 2013-04-06 | PhotoOCR | 122.7483 | 82.83% | 109.9012 | 85.30% |
| 2018-06-04 | English Test | 136.7660 | 77.72% | 123.9121 | 79.18% |
| 2016-12-05 | SemaMediaData&HPI_Real-time-VideoOCR | 151.5163 | 82.37% | 123.2516 | 84.75% |
| 2016-11-29 | CNN-WRDF | 209.1936 | 72.66% | 193.4747 | 74.14% |
| 2017-04-11 | Dycn | 270.7607 | 66.21% | 234.4990 | 70.68% |
| 2013-04-08 | PicRead | 332.3699 | 57.99% | 290.8327 | 61.92% |
| 2013-04-05 | NESP | 355.2411 | 64.20% | 340.3159 | 64.84% |
| 2013-04-05 | PLT | 379.0842 | 62.37% | 362.2196 | 63.11% |
| 2013-04-05 | MAPS | 396.6110 | 62.74% | 380.8603 | 63.29% |
| 2013-04-09 | Feild's Method | 422.1187 | 47.95% | 390.6199 | 52.33% |
| 2013-04-06 | PIONEER | 479.8208 | 53.70% | 426.8353 | 55.71% |
| 2013-05-08 | Baseline | 538.9557 | 45.30% | 517.9223 | 46.58% |
| 2013-04-05 | TextSpotter | 606.2886 | 26.85% | 597.2748 | 28.13% |
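
The two metrics in the table can be illustrated with a short sketch. It assumes that Total Edit Distance is the sum, over all test words, of the Levenshtein distance between prediction and ground truth divided by the ground-truth length, and that Correctly Recognised Words is the percentage of exact matches. The normalisation and the helper names `levenshtein` and `score` are assumptions made for illustration; the official evaluation script may differ in detail.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit cost for insertion, deletion and substitution."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                # delete ca
                            curr[j - 1] + 1,            # insert cb
                            prev[j - 1] + (ca != cb)))  # substitute or match
        prev = curr
    return prev[-1]


def score(pairs, case_sensitive=True):
    """Return (total edit distance, % correctly recognised words).

    `pairs` is a list of (ground_truth, prediction) strings.
    ASSUMPTION: each word's distance is normalised by ground-truth length.
    """
    total, correct = 0.0, 0
    for gt, pred in pairs:
        if not case_sensitive:
            gt, pred = gt.lower(), pred.lower()
        total += levenshtein(gt, pred) / max(len(gt), 1)
        correct += gt == pred
    return total, 100.0 * correct / len(pairs)


pairs = [("Hello", "hello"), ("World", "Word"), ("OCR", "OCR")]
ted_cs, crw_cs = score(pairs, case_sensitive=True)   # 0.4 and about 33.33
ted_ci, crw_ci = score(pairs, case_sensitive=False)  # 0.2 and about 66.67
```

Because the table is sorted by case-sensitive Total Edit Distance, a method with many exact matches but a few badly wrong words can rank below one with a lower word accuracy; TencentAILab (95.07% C.R.W., ninth by T.E.D.) is an example.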

Ranking Graphic
