Method: TH-DL
Date: 2022-07-20
Authors: Changyuan Li, Zhuxuan Liang, Liangrui Peng, Pei Tang, Haodong Shi, Gang Yao, Ning Ding
Affiliation: Tsinghua University
Description: Detection uses a modified Mask R-CNN with an extra branch that predicts text boundaries. Recognition uses a Transformer-based encoder-decoder model.
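
The description does not say how the boundary branch is supervised. One common option, shown here only as an assumption, is to derive a boundary target from each binary text mask by marking foreground pixels that touch background. The function name `boundary_from_mask` and the 4-neighbour rule are illustrative choices, not details taken from the submission.

```python
from typing import List

Mask = List[List[int]]


def boundary_from_mask(mask: Mask) -> Mask:
    """Mark foreground pixels that touch background (hypothetical target).

    A foreground pixel (1) counts as a boundary pixel if any of its
    4-neighbours is background (0) or lies outside the image.
    """
    h = len(mask)
    w = len(mask[0]) if h else 0
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if mask[y][x] != 1:
                continue  # background pixels are never boundary pixels
            for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                ny, nx = y + dy, x + dx
                outside = not (0 <= ny < h and 0 <= nx < w)
                if outside or mask[ny][nx] == 0:
                    out[y][x] = 1
                    break
    return out


if __name__ == "__main__":
    # A 3x3 text region centred in a 5x5 image.
    text_mask = [
        [0, 0, 0, 0, 0],
        [0, 1, 1, 1, 0],
        [0, 1, 1, 1, 0],
        [0, 1, 1, 1, 0],
        [0, 0, 0, 0, 0],
    ]
    for row in boundary_from_mask(text_mask):
        print(row)
```

In a Mask R-CNN style model, a target like this could be paired with a second per-pixel head alongside the mask head; whether the submission does so is not stated.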