Results - Text in Videos - Robust Reading Competition

Evaluation

Default

method: GoMatching++2024-12-12

Authors: HeHaibin

Affiliation: Wuhan University

Description: An extension version of GoMatching with a more concise framework

Source code

method: GoMatching2024-01-19

Authors: HeHaibin, YeMaoyuan, ZhangJing, LiuJuhua, TaoDacheng

Affiliation: Wuhan University-AI.INST

Description: We extend off-the-shelf image text spotter DeepSolo to video text spotter via long-short term matching module.

GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching

Source code

method: LOGO2024-05-30

Authors: Hongen Liu, Di Sun, Jiahao Wang, Yi Liu, Gang Pan

Affiliation: College of Intelligence and Computing, Tianjin University；Tianjin University of Science and Technology; Baidu Inc.

Description: We propose a Language Collaboration and Glyph Perception Model, termed LOGO to enhance the performance of conventional text spotters through the integration of a synergy module. To achieve this goal, a language synergy classifier (LSC) is designed to explicitly discern text instances from background noise in the recognition stage. Besides, the glyph supervision and visual position mixture module are proposed to enhance the recognition accuracy of noisy text regions, and acquire more discriminative tracking features, respectively.

Hongen Liu, Di Sun, Jiahao Wang, Yi Liu and Gang Pan. "LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model", arXiv preprint arXiv:2405.19194 , 2024.

Ranking Table

Description Paper Source Code

Date	Method	MOTA	MOTP	IDF1	Mostly Matched	Partially Matched	Mostly Lost
2024-12-12	GoMatching++	72.20%	78.52%	80.11%	1004	204	159
2024-01-19	GoMatching	72.04%	78.53%	80.11%	1002	205	160
2024-05-30	LOGO	68.07%	73.00%	75.85%	795	294	278
2022-05-07	CoText(Kuaishou_MMU)	66.96%	76.55%	75.24%	837	323	207
2021-12-22	GOCR Offline	63.90%	74.46%	77.56%	903	212	252
2021-08-02	h&h_lab	63.76%	77.78%	71.08%	673	381	313
2021-12-22	GOCR	63.05%	74.31%	76.95%	945	260	162
2022-04-15	TransDETR	60.96%	74.61%	72.80%	644	323	400
2022-01-09	CoText	58.94%	74.53%	71.66%	719	296	352
2020-02-26	HIK_OCR	52.98%	74.88%	61.85%	618	253	487
2016-04-14	SRC-B-TextProcessingLab	29.51%	68.53%	48.10%	272	474	621
2016-04-13	Megvii-Image++	19.11%	67.28%	34.97%	134	397	836
2015-04-02	USTB_TexVideo	15.57%	68.47%	28.18%	122	382	778
2015-04-02	Deep2Text I (Video)	14.35%	68.75%	32.05%	200	296	786
2015-04-02	USTB_TexVideo II-2	13.24%	66.61%	21.25%	84	327	859
2015-04-17	Stradvision-1	8.98%	70.20%	31.95%	122	432	813
2015-04-02	USTB_TexVideo II-1	5.64%	58.76%	19.74%	111	263	872
2015-03-30	Baseline-TextSpotter	0.00%	0.00%	0.00%

Inactive evaluations

method: GoMatching++2024-12-12

method: GoMatching2024-01-19

method: LOGO2024-05-30

Ranking Table

Ranking Graphic