method: GoMatching2024-01-24
Authors: HeHaibin, YeMaoyuan, ZhangJing, LiuJuhua, TaoDacheng
Affiliation: Wuhan University
Description: We extend off-the-shelf image text spotter DeepSolo to video text spotter via long-short term matching module.
method: LOGO2024-05-30
Authors: Hongen Liu, Di Sun, Jiahao Wang, Yi Liu, Gang Pan
Affiliation: College of Intelligence and Computing, Tianjin University;Tianjin University of Science and Technology; Baidu Inc.
Description: We propose a Language Collaboration and Glyph Perception Model, termed LOGO to enhance the performance of conventional text spotters through the integration of a synergy module. To achieve this goal, a language synergy classifier (LSC) is designed to explicitly discern text instances from background noise in the recognition stage. Besides, the glyph supervision and visual position mixture module are proposed to enhance the recognition accuracy of noisy text regions, and acquire more discriminative tracking features, respectively.
method: SVRepV2(Kuaishou-MMU)2022-05-08
Authors: lizhuang05
Affiliation: Zhejiang University(weijiaWu)&Kuaishou-MMU(zhuangLi)
Description: Based on our paper currently under review.
Date | Method | MOTA | MOTP | IDF1 | Mostly Matched | Partially Matched | Mostly Lost | |||
---|---|---|---|---|---|---|---|---|---|---|
2024-01-24 | GoMatching | 60.02% | 77.85% | 70.85% | 1095 | 345 | 476 | |||
2024-05-30 | LOGO | 55.92% | 71.89% | 68.27% | 953 | 455 | 508 | |||
2022-05-08 | SVRepV2(Kuaishou-MMU) | 54.89% | 75.14% | 69.17% | 983 | 527 | 406 | |||
2021-12-22 | GOCR Offline | 54.54% | 73.61% | 71.21% | 1121 | 299 | 496 | |||
2021-08-02 | h&h_lab | 53.74% | 76.92% | 65.51% | 874 | 536 | 506 | |||
2021-12-22 | GOCR | 52.01% | 73.40% | 69.98% | 1194 | 355 | 367 | |||
2022-01-16 | TA-VTT | 51.10% | 76.02% | 65.57% | 812 | 497 | 607 | |||
2022-03-20 | TransDETR | 48.86% | 74.01% | 65.33% | 813 | 466 | 637 | |||
2022-01-26 | Semantic-Aware Video Text Detection | 48.41% | 76.31% | 63.65% | 693 | 500 | 723 | |||
2021-08-25 | TransVTSpotter | 45.75% | 73.58% | 57.56% | 658 | 611 | 647 | |||
2022-06-09 | VideoTextSCM | 44.07% | 75.19% | 58.23% | 858 | 502 | 556 | |||
2020-02-26 | HIK_OCR | 43.16% | 76.78% | 57.92% | 702 | 364 | 850 | |||
2016-04-14 | SRC-B-TextProcessingLab | 23.09% | 68.51% | 39.40% | 274 | 481 | 1161 | |||
2015-04-01 | AJOU | 16.44% | 72.71% | 36.07% | 271 | 458 | 1187 | |||
2015-04-02 | USTB_TexVideo II-2 | 12.29% | 71.78% | 21.93% | 92 | 439 | 1385 | |||
2016-04-13 | Megvii-Image++ | 11.02% | 66.75% | 37.30% | 185 | 612 | 1119 | |||
2015-04-17 | StradVision-1 | 7.92% | 70.17% | 25.87% | 124 | 436 | 1356 | |||
2015-04-02 | USTB_TexVideo | 7.42% | 70.75% | 25.92% | 142 | 507 | 1267 | |||
2015-04-02 | USTB_TexVideo II-1 | -59.62% | 69.06% | 17.59% | 129 | 368 | 1419 | |||
2015-03-28 | RTST Lucas-Kanade-2 (RealTimeSceneText_LucasKanade_v2) | -106.48% | 64.64% | 3.45% | 47 | 308 | 1561 |