Authors: yuchen su, yongkun du, tianlun zheng, zhineng chen, yi gan, zhineng chen
Affiliation: Fudan University, Paddle OCR
Description: Our method is based on PAN, ResNet-50 pre-trained on ImageNet as our backbone. We only use the training images of ReST for training. For data augmentation, we apply random scale, random flip, random rotation and random crop on training images, and manually select difficult samples from training images for crop, color jitter, contrast jitter and occlusion data augmentation.