method: TH-DL2022-07-20

Authors: Changyuan Li, Zhuxuan Liang, Liangrui Peng, Pei Tang, Haodong Shi, Gang Yao, Ning Ding

Affiliation: Tsinghua University

Description: For detection, a modified Mask-RCNN model with an extra branch predicting text boundaries is designed. For recognition, a Transformer-based encoder-decoder model is adopted.