method: Samsung Life Insurance (2020-10-16)

Authors: Dongyoung Kim, Myungsung Kwak

Affiliation: Data Analytics Laboratory (DA Lab), Samsung Life Insurance

Description: A document text localization model, Text Localization Generative Adversarial Nets (TLGAN), is used to perform text localization on the SROIE data set. TLGAN learns text-image features adversarially through an ImageNet-pre-trained VGG network and outputs the text locations. Note that the images were scaled by an arbitrary ratio, and the detected coordinates were re-scaled into the original image space for submission.
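The re-scaling step mentioned above can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `rescale_box`, the eight-value quadrilateral layout, and the example scale factors are all assumptions.

```python
def rescale_box(box, scale_x, scale_y):
    """Map a box detected on a resized image back to original-image pixels.

    box: flat list [x1, y1, x2, y2, ...] in resized-image coordinates
         (assumed layout; x values at even indices, y values at odd indices).
    scale_x, scale_y: factors the original image was multiplied by when resized.
    """
    return [
        round(value / (scale_x if index % 2 == 0 else scale_y))
        for index, value in enumerate(box)
    ]


# Hypothetical example: the image was shrunk to half size before detection.
detected = [50, 100, 150, 100, 150, 120, 50, 120]
original = rescale_box(detected, 0.5, 0.5)
print(original)
```

Dividing by the resize factor undoes the scaling; rounding keeps the output in integer pixel coordinates.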

method: NetEase OCR (2020-12-11)

Authors: John

Affiliation: NetEase

Description: Our method breaks the task down into two stages, text area detection and text segmentation, similar to Mask R-CNN. The difference is that the first stage of our model detects only the text area rather than individual text lines, while the second stage, similar to DBNet, segments the text area into lines. We merge the two stages into one model and use data augmentation together with multi-scale training and testing.

method: BOE_AIoT_CTO (2020-08-10)

Authors: Guangwei Huang, Yue Li, Xiaojun Tang

Description: We trained PANNet and multi-oriented corner text detectors and ensembled their detection results over multi-scale images. In addition, we pre-processed the training and testing images to make them clearer. Multi-scale training and training-data augmentation were also used.
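The description does not say how the multi-scale results are combined. One common option is non-maximum suppression (NMS) over the pooled boxes, sketched below purely as an assumption. The names `iou` and `merge_detections`, the axis-aligned `(x1, y1, x2, y2)` box format, and the 0.5 threshold are illustrative and not taken from the submission.

```python
def iou(a, b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    inter_w = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def merge_detections(detections, iou_threshold=0.5):
    """Greedy NMS over (box, score) pairs pooled from all scales.

    Boxes are assumed to be already mapped back to original-image coordinates.
    """
    kept = []
    for box, score in sorted(detections, key=lambda d: d[1], reverse=True):
        # Keep a box only if it does not heavily overlap a higher-scoring one.
        if all(iou(box, kept_box) < iou_threshold for kept_box, _ in kept):
            kept.append((box, score))
    return kept


# Hypothetical detections of the same receipt at two scales.
pooled = [
    ((10, 10, 110, 30), 0.9),  # text line found at scale 1
    ((12, 11, 108, 31), 0.8),  # same line found at scale 2
    ((10, 50, 90, 70), 0.7),   # a different line
]
merged = merge_detections(pooled)
print(len(merged))
```

Here the two near-duplicate boxes collapse into the higher-scoring one, while the non-overlapping line is kept.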

Ranking Table

Date | Method | Recall | Precision | Hmean
2020-10-16 | Samsung Life Insurance | 98.64% | 99.83% | 99.23%
2020-12-11 | NetEase OCR | 98.37% | 99.59% | 98.98%
2019-04-22 | Ping An Property & Casualty Insurance Company | 98.60% | 98.40% | 98.50%
2019-04-22 | H&H Lab | 97.93% | 97.95% | 97.94%
2020-09-27 | only PAN | 96.51% | 96.80% | 96.66%
2021-01-28 | 58 OCR10000 | 97.48% | 95.43% | 96.45%
2019-04-22 | GREAT-OCR Shanghai University | 96.62% | 96.21% | 96.42%
2019-05-10 | Clova OCR | 96.04% | 95.79% | 95.92%
2019-04-22 | A Single-Shot Model for Robust Text Localization | 93.93% | 94.80% | 94.37%
2019-04-23 | SROIE Fourth Submission | 92.98% | 94.99% | 93.97%
2020-06-15 | EfficientDet and EAST | 91.91% | 95.68% | 93.76%
2019-04-22 | Pixellink multi-scale Detection | 93.07% | 92.84% | 92.95%
2019-04-19 | BiLSTM Based on CTPN | 91.40% | 94.03% | 92.69%
2020-06-25 | EAST modified | 90.94% | 92.63% | 91.78%
2019-04-22 | CITlab Argus Textline Detection | 92.02% | 91.34% | 91.68%
2019-04-19 | Unet and Morphology Prediction | 93.28% | 89.43% | 91.31%
2019-04-17 | Textline detection | 89.85% | 92.72% | 91.26%
2019-04-20 | A Text Localization Method Based on CTPN | 85.23% | 88.73% | 86.94%
2019-04-22 | YOLO Text Detector | 77.29% | 79.32% | 78.29%
2019-04-17 | Improved yolov3 model | 68.52% | 78.23% | 73.06%
2019-04-22 | Task 1 - Scanned Receipt Text Localisation (Submitted by Intuit Inc.) | 71.14% | 63.76% | 67.25%
2019-04-17 | scene text detection weapon | 49.61% | 64.75% | 56.18%
2019-04-22 | Unet Segmentation and Watershed | 56.31% | 53.46% | 54.85%
2019-04-22 | Receipt Info Extracting Task1 zone-dividing | 32.62% | 46.48% | 38.33%
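
In each row, the third percentage is consistent with the harmonic mean of the first two, which appears to be the score the table is sorted by. A minimal check, using a helper named `hmean` that is introduced here only for illustration:

```python
def hmean(first, second):
    """Harmonic mean of two scores; returns 0.0 when both are zero."""
    total = first + second
    return 0.0 if total == 0 else 2 * first * second / total


# Values copied from the top two rows of the ranking table.
print(round(hmean(98.64, 99.83), 2))  # 99.23
print(round(hmean(98.37, 99.59), 2))  # 98.98
```

Because the harmonic mean is symmetric, this check holds whichever of the two input columns is taken first. Published values can differ from a recomputation in the last digit, since the inputs shown in the table are themselves rounded.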
