method: GoMatching++2024-12-12

Authors: HeHaibin

Affiliation: Wuhan University

Description: An extension version of GoMatching with a more concise framework

method: GoMatching2024-01-24

Authors: HeHaibin, YeMaoyuan, ZhangJing, LiuJuhua, TaoDacheng

Affiliation: Wuhan University

Description: We extend off-the-shelf image text spotter DeepSolo to video text spotter via long-short term matching module.

method: LOGO2024-05-30

Authors: Hongen Liu, Di Sun, Jiahao Wang, Yi Liu, Gang Pan

Affiliation: College of Intelligence and Computing, Tianjin University;Tianjin University of Science and Technology; Baidu Inc.

Description: We propose a Language Collaboration and Glyph Perception Model, termed LOGO to enhance the performance of conventional text spotters through the integration of a synergy module. To achieve this goal, a language synergy classifier (LSC) is designed to explicitly discern text instances from background noise in the recognition stage. Besides, the glyph supervision and visual position mixture module are proposed to enhance the recognition accuracy of noisy text regions, and acquire more discriminative tracking features, respectively.

Ranking Table

Description Paper Source Code
DateMethodMOTAMOTPIDF1Mostly MatchedPartially MatchedMostly Lost
2024-12-12GoMatching++62.41%77.94%72.87%1170320426
2024-01-24GoMatching60.02%77.85%70.85%1095345476
2024-05-30LOGO55.92%71.89%68.27%953455508
2022-05-08SVRepV2(Kuaishou-MMU)54.89%75.14%69.17%983527406
2021-12-22GOCR Offline54.54%73.61%71.21%1121299496
2021-08-02h&h_lab53.74%76.92%65.51%874536506
2021-12-22GOCR52.01%73.40%69.98%1194355367
2022-01-16TA-VTT51.10%76.02%65.57%812497607
2022-03-20TransDETR48.86%74.01%65.33%813466637
2022-01-26Semantic-Aware Video Text Detection48.41%76.31%63.65%693500723
2021-08-25TransVTSpotter45.75%73.58%57.56%658611647
2022-06-09VideoTextSCM44.07%75.19%58.23%858502556
2020-02-26HIK_OCR43.16%76.78%57.92%702364850
2016-04-14SRC-B-TextProcessingLab23.09%68.51%39.40%2744811161
2015-04-01AJOU16.44%72.71%36.07%2714581187
2015-04-02USTB_TexVideo II-212.29%71.78%21.93%924391385
2016-04-13Megvii-Image++11.02%66.75%37.30%1856121119
2015-04-17StradVision-17.92%70.17%25.87%1244361356
2015-04-02USTB_TexVideo7.42%70.75%25.92%1425071267
2015-04-02USTB_TexVideo II-1-59.62%69.06%17.59%1293681419
2015-03-28RTST Lucas-Kanade-2 (RealTimeSceneText_LucasKanade_v2)-106.48%64.64%3.45%473081561

Ranking Graphic