method: SenseTime2016-01-27
Authors: wei wu, junjie yan
Description: A combination of FCN and RPN.
method: TextFuseNet2020-07-31
Authors: Jian Ye, Zhe Chen, Juhua Liu and Bo Du
Affiliation: Wuhan University, The University of Sydney
Email: liujuhua@whu.edu.cn
Description: Arbitrary shape text detection in natural scenes is an extremely challenging task. Unlike existing text detection approaches that only perceive texts based on limited feature representations, we propose a novel framework, namely TextFuseNet, to exploit the use of richer features fused for text detection. More specifically, we propose to perceive texts from three levels of feature representations, i.e., character-, word- and global-level, and then introduce a novel text representation fusion technique to help achieve robust arbitrary text detection. The multi-level feature representation can adequately describe texts by dissecting them into individual characters while still maintaining their general semantics. TextFuseNet then collects and merges the texts’ features from different levels using a multi-path fusion architecture which can effectively align and fuse different representations. In practice, our proposed TextFuseNet can learn a more adequate description of arbitrary shapes texts, suppressing false positives and producing more accurate detection results. Our proposed framework can also be trained with weak supervision for those datasets that lack character-level annotations. Experiments on several datasets show that the proposed TextFuseNet achieves state-of-the-art performance. Specifically, we achieve an F-measure of 94.3% on ICDAR2013, 92.1% on ICDAR2015,87.1% on Total-Text and 86.6% on CTW-1500, respectively.
method: TencentAILab2017-11-21
Authors: Jingchao Zhou, Yitong Wang, Xing Ji, Zhifeng Li
Description: An FCN architecture with U-Net as the backbone for extracting multi-scale low level feature maps.
Date | Method | Recall | Precision | Hmean | |||
---|---|---|---|---|---|---|---|
2016-01-27 | SenseTime | 91.87% | 95.45% | 93.62% | |||
2020-07-31 | TextFuseNet | 90.78% | 95.58% | 93.11% | |||
2017-11-21 | TencentAILab | 94.79% | 91.37% | 93.05% | |||
2020-12-15 | VARCO | 89.86% | 93.63% | 91.71% | |||
2020-05-13 | HIT | 89.22% | 93.85% | 91.48% | |||
2018-11-07 | CRAFT | 89.04% | 93.93% | 91.42% | |||
2020-01-20 | VARCO | 90.50% | 92.01% | 91.25% | |||
2021-01-21 | NCU_MSP | 88.86% | 93.02% | 90.89% | |||
2017-06-25 | FEN | 88.49% | 93.35% | 90.86% | |||
2018-01-22 | FOTS | 89.68% | 91.43% | 90.55% | |||
2019-05-29 | crpn.v2 | 88.58% | 92.56% | 90.53% | |||
2017-08-15 | MultDet | 89.68% | 91.09% | 90.38% | |||
2019-05-03 | Mask Textspotter | 87.40% | 93.55% | 90.37% | |||
2018-12-03 | SPCNet_TongJi & UESTC (single scale) | 88.68% | 91.86% | 90.24% | |||
2019-07-23 | CM-CV&AR | 89.32% | 90.72% | 90.01% | |||
2018-03-05 | HappyCCL | 88.40% | 91.67% | 90.00% | |||
2019-07-12 | stela | 88.13% | 91.38% | 89.73% | |||
2017-02-17 | NLPR-CASIA | 86.76% | 92.23% | 89.41% | |||
2018-04-17 | Ali-Amap-xlab-v4 | 87.49% | 90.98% | 89.20% | |||
2017-08-10 | SRC-B-MachineLearningLab-v4 | 87.49% | 90.81% | 89.12% | |||
2017-12-15 | EPTN-SJTU | 87.31% | 90.62% | 88.93% | |||
2020-05-19 | Craft++ | 86.67% | 91.07% | 88.82% | |||
2020-01-21 | ChinaUnicom-AI | 86.58% | 88.68% | 87.62% | |||
2021-01-09 | FENET | 79.27% | 97.31% | 87.37% | |||
2020-11-10 | Hancom Vision | 81.74% | 92.94% | 86.98% | |||
2022-08-26 | HBLAB-OCR | 83.29% | 90.12% | 86.57% | |||
2020-01-06 | NCU_MSP | 85.75% | 87.35% | 86.54% | |||
2017-03-22 | MCLAB_TextBoxes_v2 | 83.29% | 89.94% | 86.49% | |||
2017-04-04 | xmu403 | 83.11% | 90.10% | 86.46% | |||
2016-12-16 | RRPN-4 | 83.56% | 89.53% | 86.44% | |||
2017-07-04 | FPTD | 86.03% | 86.74% | 86.38% | |||
2018-05-06 | cvmt_allStepNoLkTree | 85.30% | 86.96% | 86.12% | |||
2018-01-04 | crpn | 82.28% | 89.65% | 85.81% | |||
2019-02-12 | NCSOFT VISION AI LAB | 88.04% | 83.46% | 85.69% | |||
2017-05-27 | BUPT_NIRC_multi | 80.55% | 90.93% | 85.42% | |||
2018-12-08 | Unicamp-SRBR-v2 | 80.82% | 90.49% | 85.38% | |||
2018-05-06 | CVMT_frm62 | 84.38% | 86.03% | 85.20% | |||
2016-08-31 | MCLAB_TextBoxes | 82.28% | 87.82% | 84.96% | |||
2017-05-27 | BUPT_NIRC_multi | 79.09% | 91.74% | 84.94% | |||
2019-04-19 | CenterText(Single-scale) | 82.47% | 86.74% | 84.55% | |||
2018-07-03 | ACI | 76.89% | 92.32% | 83.91% | |||
2015-03-26 | VGGMaxNet_cmb | 78.08% | 90.09% | 83.66% | |||
2015-04-02 | VGGMaxNet_025 | 79.82% | 87.58% | 83.52% | |||
2018-12-08 | Unicamp-SRBR-v3 | 75.62% | 92.62% | 83.26% | |||
2015-03-23 | VGGMaxNet_013 | 76.53% | 90.50% | 82.93% | |||
2015-03-23 | VGGMaxNet_1.6 | 75.89% | 91.32% | 82.89% | |||
2019-03-23 | SSP-RPNs with Pytorch | 83.65% | 80.70% | 82.15% | |||
2017-06-09 | MPT+Jar | 71.96% | 93.92% | 81.49% | |||
2017-05-29 | ssd pretrain on synthtext and scut | 80.64% | 81.53% | 81.08% | |||
2017-09-05 | P-SSD.v1 | 81.64% | 80.40% | 81.01% | |||
2017-05-02 | TsinghuaOCR | 77.63% | 84.33% | 80.84% | |||
2018-05-06 | cvmt_frm59 | 84.66% | 77.06% | 80.68% | |||
2016-04-10 | SCUT-HCII | 78.63% | 82.31% | 80.43% | |||
2016-07-18 | SCUT-HCII | 75.71% | 84.51% | 79.87% | |||
2020-03-26 | RRPN R-50 model_final 20200327 | 76.07% | 83.80% | 79.75% | |||
2019-01-31 | ssprpns | 72.51% | 87.64% | 79.36% | |||
2019-06-26 | std(single-scale) | 76.99% | 80.98% | 78.93% | |||
2016-11-13 | RRPN-3 | 70.50% | 88.23% | 78.38% | |||
2015-01-01 | BUCT_YST | 72.15% | 83.60% | 77.45% | |||
2016-03-16 | TextConv+WordGraph | 68.22% | 89.35% | 77.37% | |||
2016-03-16 | TextConv+WordGraph | 67.67% | 89.49% | 77.07% | |||
2015-10-21 | MCLAB_FCN | 70.59% | 83.93% | 76.69% | |||
2016-06-23 | Baidu IDL | 70.23% | 82.07% | 75.69% | |||
2014-11-12 | HUST_MCLAB | 68.49% | 83.33% | 75.19% | |||
2014-05-13 | SWT | 72.69% | 76.69% | 74.64% | |||
2018-12-08 | Unicamp-SRBR-v1 | 63.20% | 88.04% | 73.58% | |||
2015-04-03 | StradVision | 66.03% | 80.87% | 72.70% | |||
2017-08-13 | P-SSD.v1 | 68.86% | 76.24% | 72.36% | |||
2017-03-23 | RTN | 65.21% | 80.77% | 72.16% | |||
2022-05-08 | 11111 | 77.72% | 66.59% | 71.72% | |||
2013-04-09 | I2R_NUS_FAR | 68.95% | 74.46% | 71.60% | |||
2013-04-07 | USTB_TexStar | 61.46% | 84.76% | 71.25% | |||
2017-03-16 | Ali-Amap-xlab-v2 | 66.03% | 76.35% | 70.81% | |||
2016-06-23 | SRC-B-TextProcessingLab | 64.02% | 79.03% | 70.74% | |||
2017-10-12 | TextFCN V2 | 74.52% | 66.61% | 70.34% | |||
2013-04-08 | CASIA_NLPR | 66.12% | 74.64% | 70.12% | |||
2017-03-24 | (๑•̀ㅂ•́)و✧ | 66.67% | 73.74% | 70.02% | |||
2016-12-04 | Ali-Amap-xlab | 64.93% | 75.96% | 70.01% | |||
2013-04-05 | TextSpotter | 61.19% | 81.61% | 69.94% | |||
2015-03-23 | VGGMaxNet_10 | 54.70% | 96.61% | 69.85% | |||
2015-07-22 | ZText | 60.00% | 82.95% | 69.63% | |||
2013-12-23 | BayesText | 60.00% | 82.54% | 69.49% | |||
2015-11-04 | MSER_Binary_CNN | 63.29% | 76.49% | 69.27% | |||
2017-11-15 | Sensetime line-level detection | 59.54% | 81.81% | 68.92% | |||
2016-01-18 | Text-CNN | 59.09% | 81.59% | 68.54% | |||
2013-04-08 | I2R_NUS | 65.30% | 72.08% | 68.52% | |||
2017-01-22 | SRC-B-MachineLearningLab | 60.46% | 79.00% | 68.49% | |||
2018-12-29 | fast_ret_sh_02 | 58.81% | 76.76% | 66.60% | |||
2014-08-18 | DetectText | 59.82% | 71.58% | 65.17% | |||
2013-04-08 | Text_detector_CASIA | 54.70% | 80.19% | 65.04% | |||
2017-04-26 | Cascade Filtering and Grouping | 56.71% | 75.27% | 64.69% | |||
2013-08-29 | UMD_IntegratedDisrimination | 52.69% | 81.61% | 64.04% | |||
2017-02-28 | Tencent Youtu | 53.79% | 76.30% | 63.10% | |||
2017-02-21 | STDN-2 | 51.51% | 75.60% | 61.27% | |||
2017-02-07 | CAS_HotEye | 51.51% | 75.10% | 61.11% | |||
2017-02-27 | CTDN-3-PN | 50.87% | 76.09% | 60.97% | |||
2017-02-25 | CTDN2 | 50.14% | 76.46% | 60.56% | |||
2017-04-02 | CNN_Text | 61.19% | 56.40% | 58.69% | |||
2017-03-24 | CNN | 62.01% | 54.85% | 58.21% | |||
2013-04-08 | TH-TextLoc | 50.78% | 59.66% | 54.86% | |||
2015-08-18 | MSER with LocalSWT | 38.90% | 60.43% | 47.33% | |||
2016-12-21 | MSRA_v1 | 35.71% | 61.09% | 45.07% | |||
2013-04-06 | Text Detection | 34.25% | 60.29% | 43.68% | |||
2017-03-01 | STDN | 36.07% | 54.86% | 43.53% | |||
2013-04-25 | Baseline | 34.52% | 57.62% | 43.18% | |||
2017-05-29 | FCN based network for Text Detection | 32.60% | 60.51% | 42.37% | |||
2019-07-17 | AFCTPN | 31.96% | 60.55% | 41.84% | |||
2014-06-10 | IWRR2014 | 32.24% | 56.12% | 40.95% | |||
2017-03-05 | WeText | 31.23% | 56.16% | 40.14% | |||
2015-11-28 | CASIA_USTB-Cascaded | 31.51% | 55.20% | 40.12% | |||
2017-04-06 | ConnLink_pre | 30.05% | 58.02% | 39.59% | |||
2017-04-01 | ConnLink | 30.32% | 56.18% | 39.38% | |||
2017-08-25 | bupt | 29.59% | 55.48% | 38.59% | |||
2017-07-24 | Jack's TD | 29.77% | 54.61% | 38.53% | |||
2016-11-08 | CTPN | 28.40% | 54.85% | 37.42% | |||
2015-12-17 | TEST17 | 26.39% | 58.27% | 36.33% | |||
2018-07-17 | 123_0.927_1280_960 | 28.22% | 50.08% | 36.10% | |||
2018-07-17 | 0.956_1280_960 | 26.03% | 52.78% | 34.86% | |||
2019-02-16 | [BKU K15] | 27.40% | 43.48% | 33.61% | |||
2018-12-18 | test | 25.94% | 47.73% | 33.61% | |||
2018-07-17 | 0.956_600_1000 | 24.47% | 46.53% | 32.08% | |||
2013-04-10 | Inkam | 28.04% | 29.10% | 28.56% | |||
2015-08-18 | MSER | 20.18% | 34.97% | 25.59% | |||
2014-09-16 | MSERs | 19.63% | 34.57% | 25.04% |