method: Dao Xianghu light of TianQuan
Date: 2023-03-19
Authors: Kai Yang, Ye Wang, Bin Wang, Wentao Liu, Xiaolu Ding, Jun Zhu, Ming Chen, Peng Yao, Zhixin Qiu
Affiliation: CCB Financial Technology Co. Ltd, China
Description:
1. Data Analysis
The organizers provided 5,000 training images for this competition. Analyzing them, we found that the seals fall into four shape categories: round, oval, square, and triangular, with round and oval seals making up most of the data. The training set covers a range of conditions, including rotation in multiple directions, uneven color, overlapping seals, and indistinct seal patterns.
2. Data Processing
For data processing, we first re-annotated the training set images and enlarged them to squares. We then rotated the images, producing a total of 15,000 images. We also generated synthetic data for difficult cases, such as overlapping or blurry stamps. Before generating the seal data, we gathered a large number of company and organization names from the internet. For each name, we computed the rotation angle and position of every character based on the length of the name and merged the characters into the seal's background image. We also output the coordinates of the outer edge points of the text. To make the generated seals more realistic, we varied colors, fonts, backgrounds, and textures. The base image for each seal was created by randomly cropping backgrounds, and we used the RGBA format during generation so that the transparency channel could control the color depth of the seal. We also included two types of seal borders: solid and fragmented.
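The per-character placement described above can be sketched as follows. This is a minimal illustration and not the team's generator: the function name `layout_title_chars`, the 240-degree default arc, and the angle convention (degrees clockwise from the top of the seal) are all assumptions made for the example.

```python
import math

def layout_title_chars(text, center, radius, arc_degrees=240.0):
    """Place each character of `text` on a circular arc (hypothetical helper).

    Returns a list of (char, x, y, rotation_deg) tuples. Angles are measured
    in degrees clockwise from the 12 o'clock position, and the arc is centred
    on the top of the seal. The spacing depends on the length of the text.
    """
    n = len(text)
    cx, cy = center
    if n == 1:
        angles = [0.0]  # a single character sits at the top of the seal
    else:
        step = arc_degrees / (n - 1)  # even spacing across the arc
        angles = [-arc_degrees / 2.0 + i * step for i in range(n)]

    placements = []
    for ch, theta in zip(text, angles):
        rad = math.radians(theta)
        x = cx + radius * math.sin(rad)
        y = cy - radius * math.cos(rad)  # image y axis points down
        # The glyph is tilted by the same angle so that it stays
        # perpendicular to the radius through its position.
        placements.append((ch, x, y, theta))
    return placements
```

Each returned tuple could then be passed to an image library that draws the rotated glyph onto an RGBA layer, with the alpha channel controlling how dark the ink appears.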
3. Model Introduction
In this segmentation task, we used a "voting ensemble" method to detect the seal title. The ensemble consists of five models: Mask R-CNN, K-Net, SegFormer, Segmenter, and UPerNet. Each model generates a mask, and a majority vote over the five masks yields the final mask, from which we locate the seal title area.
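A pixel-wise majority vote over binary masks can be sketched as below. This is an illustrative sketch, assuming each model's output has already been converted to a binary mask of the same shape; the function name `majority_vote` is hypothetical, and the description does not say how the team thresholded or aligned the individual masks.

```python
import numpy as np

def majority_vote(masks):
    """Combine binary masks by pixel-wise majority vote (illustrative helper).

    `masks` is a sequence of equally shaped arrays, one per model. A pixel is
    kept in the final mask when more than half of the models mark it.
    """
    stack = np.stack([np.asarray(m, dtype=bool) for m in masks])
    votes = stack.sum(axis=0)        # number of models voting for each pixel
    needed = len(masks) // 2 + 1     # strict majority, e.g. 3 of 5
    return votes >= needed
```

With five models, a pixel belongs to the final mask when at least three of them agree, so an error made by one or two models is outvoted.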
method: SPDB LAB
Date: 2023-03-16
Authors: Jie Li, Wei Wang, Yuqi Zhang, Ruixue Zhang, Yiru Zhao, Danya Zhou, Di Wang, Dong Xiang, Hui Wang, Min Xu, Pengyu Chen, Bin Zhang, Chao Li, Shiyu Hu, Songtao Li, Yunxin Yang
Affiliation: Shanghai Pudong Development Bank
Email: zhangyq26@outlook.com, wangdee0805@139.com, lij131@spdb.com.cn
Description: In Task 1, circle, ellipse, rectangle, and triangle seals were handled with separately trained models. The seal title detection model is PANNet, trained on the provided training data together with synthetic data produced by the team. The synthetic data is based on a style analysis of the training data, and more than 20,000 training samples were synthesized in total. Two PANNet models were trained, one for circle and ellipse seals and one for rectangle and triangle seals, and both were used for inference on the test set.
method: PAN_ReST_4
Date: 2023-03-21
Authors: yuchen su, yongkun du, tianlun zheng, zhineng chen, yi gan, zhineng chen
Affiliation: Fudan University, Paddle OCR
Description: Our method is based on PAN, with a ResNet-50 pre-trained on ImageNet as the backbone. We use only the ReST training images for training. For data augmentation, we apply random scaling, random flipping, random rotation, and random cropping to the training images. We also manually select some hard samples from the training images and apply cropping, color jitter, contrast jitter, and occlusion augmentation to them.
Date | Method | Precision-0.7 | Recall-0.7 | Hmean-0.7 | Precision | Recall | Hmean
---|---|---|---|---|---|---|---
2023-03-19 | Dao Xianghu light of TianQuan | 99.06% | 99.06% | 99.06% | 99.92% | 99.92% | 99.92%
2023-03-16 | SPDB LAB | 97.60% | 97.60% | 97.60% | 99.92% | 99.92% | 99.92%
2023-03-21 | PAN_ReST_4 | 96.86% | 96.86% | 96.86% | 99.70% | 99.70% | 99.70%
2023-03-21 | DB with SegFormer | 98.11% | 95.42% | 96.75% | 99.65% | 96.92% | 98.27%
2023-03-14 | Aaaaa_v1 | 96.72% | 96.64% | 96.68% | 99.16% | 99.08% | 99.12%
2023-03-21 | DB with SegFormer | 97.37% | 95.68% | 96.52% | 99.17% | 97.44% | 98.30%
2023-03-16 | DB with SegFormer | 97.72% | 95.34% | 96.52% | 99.61% | 97.18% | 98.38%
2023-03-21 | DB with SegFormer | 97.04% | 95.70% | 96.36% | 99.01% | 97.64% | 98.32%
2023-03-20 | ratio_4.0 | 95.96% | 95.96% | 95.96% | 99.38% | 99.38% | 99.38%
2023-03-21 | DB with SegFormer | 95.95% | 95.78% | 95.87% | 98.00% | 97.82% | 97.91%
2023-03-19 | PAN++ with Res101 | 92.22% | 92.22% | 92.22% | 97.48% | 97.48% | 97.48%
2023-03-20 | ratio_3.5 | 90.52% | 90.52% | 90.52% | 99.38% | 99.38% | 99.38%
2023-03-13 | Seal Detect | 88.32% | 88.32% | 88.32% | 99.48% | 99.48% | 99.48%
2023-03-07 | detect_test | 85.96% | 85.96% | 85.96% | 99.00% | 99.00% | 99.00%
2023-03-20 | Seal Detect | 82.28% | 82.28% | 82.28% | 99.50% | 99.50% | 99.50%
2023-03-17 | Seal Detect | 80.84% | 80.84% | 80.84% | 99.62% | 99.62% | 99.62%
2023-03-07 | Seal Detect | 61.40% | 61.40% | 61.40% | 97.28% | 97.28% | 97.28%
2023-03-20 | ratio_3.0 | 56.74% | 56.74% | 56.74% | 99.20% | 99.20% | 99.20%
2023-03-20 | ratio_2.5 | 32.06% | 32.06% | 32.06% | 97.98% | 97.98% | 97.98%
2023-03-20 | ratio_2.0 | 7.60% | 7.60% | 7.60% | 68.66% | 68.66% | 68.66%
2023-03-21 | PAN_ReST | 3.72% | 3.72% | 3.72% | 6.48% | 6.48% | 6.48%
2023-03-20 | ratio_1.5 | 1.60% | 1.60% | 1.60% | 30.22% | 30.22% | 30.22%
2023-03-20 | Mask way | 1.28% | 1.28% | 1.28% | 4.30% | 4.30% | 4.30%
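
The Hmean values in the table are consistent with the harmonic mean of the precision and recall in the same row. The short sketch below reproduces one entry; the function name `hmean` is chosen for illustration and is not taken from the competition's evaluation code.

```python
def hmean(precision, recall):
    """Harmonic mean of precision and recall, both given in percent."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both scores are zero
    return 2 * precision * recall / (precision + recall)

# 2023-03-21 "DB with SegFormer" row: Precision-0.7 = 98.11, Recall-0.7 = 95.42
print(round(hmean(98.11, 95.42), 2))  # → 96.75
```

When precision and recall are equal, as in most rows, the harmonic mean equals that shared value, which is why the three columns in each group often match.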