method: SituTech_OCR2021-03-11

Authors: Kui Lyu, Chuanhe Liu

Affiliation: Beijing Situ Vision Technologies Co. Ltd


Description: In this work, we design an elegant text detection model. Our detector is similar to DBNet, but there are some difference. More specifically, we have introduced an advanced detector backbone, a classic network EfficientDet, with flexible scales and stronger ability to extract features. Another breakthrough is that we optimized the label generation strategy in DBNet. In the original work, the positive area generation and the expansion of the positive area to the bounding box used the Vatti clipping algorithm, which is less robust with different area perimeter ratios. We optimized this function to make the label transform between positive area and bounding box more reasonable.

If you have any questions, please contact us.
SituAIgorithm Team, Beijing Situ Vision Technologies Co. Ltd

method: PMTD2019-05-30

Authors: Xuebo Liu,Jingchao Liu, Ding Liang

Description: Pyramid Mask Text Detector predicts a soft mask for every text instance, and then uses the plane clustering algorithm to get the final text box. See for detail. Trained model and inference code will be released. If you have questions, please feel free to contact Jingchao Liu ( and Xuebo Liu (

Authors: Yudong Liu, Yongtao Wang

Description: We modify the regression of cascade rcnn at the third stage, we train with 9000 images of mlt19 data.The rest 1000 images are used for validation. Multi scale and large model are used.

Ranking Table

Description Paper Source Code
DateMethodHmeanPrecisionRecallAverage Precision
2019-06-02A two-stage text detector based on cascade rcnn(using total 10000 images of mlt19)78.38%82.26%74.85%71.27%
2019-05-31A two-stage text detector based on cascade rcnn78.11%82.89%73.85%70.31%
2019-05-26two stage text detector75.04%82.61%68.74%65.29%

Ranking Graphic

Ranking Graphic