Results - Text in Videos - Robust Reading Competition

Evaluation

Default

method: GoMatching++2024-12-12

Authors: HeHaibin

Affiliation: Wuhan University

Description: An extension version of GoMatching with a more concise framework

Source code

method: GoMatching2024-01-24

Authors: HeHaibin, YeMaoyuan, ZhangJing, LiuJuhua, TaoDacheng

Affiliation: Wuhan University

Description: We extend off-the-shelf image text spotter DeepSolo to video text spotter via long-short term matching module.

GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching

Source code

method: LOGO2024-05-30

Authors: Hongen Liu, Di Sun, Jiahao Wang, Yi Liu, Gang Pan

Affiliation: College of Intelligence and Computing, Tianjin University；Tianjin University of Science and Technology; Baidu Inc.

Description: We propose a Language Collaboration and Glyph Perception Model, termed LOGO to enhance the performance of conventional text spotters through the integration of a synergy module. To achieve this goal, a language synergy classifier (LSC) is designed to explicitly discern text instances from background noise in the recognition stage. Besides, the glyph supervision and visual position mixture module are proposed to enhance the recognition accuracy of noisy text regions, and acquire more discriminative tracking features, respectively.

Hongen Liu, Di Sun, Jiahao Wang, Yi Liu and Gang Pan. "LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model", arXiv preprint arXiv:2405.19194 , 2024.

Ranking Table

Description Paper Source Code

Date	Method	MOTA	MOTP	IDF1	Mostly Matched	Partially Matched	Mostly Lost
2024-12-12	GoMatching++	62.41%	77.94%	72.87%	1170	320	426
2024-01-24	GoMatching	60.02%	77.85%	70.85%	1095	345	476
2024-05-30	LOGO	55.92%	71.89%	68.27%	953	455	508
2022-05-08	SVRepV2(Kuaishou-MMU)	54.89%	75.14%	69.17%	983	527	406
2021-12-22	GOCR Offline	54.54%	73.61%	71.21%	1121	299	496
2021-08-02	h&h_lab	53.74%	76.92%	65.51%	874	536	506
2021-12-22	GOCR	52.01%	73.40%	69.98%	1194	355	367
2022-01-16	TA-VTT	51.10%	76.02%	65.57%	812	497	607
2022-03-20	TransDETR	48.86%	74.01%	65.33%	813	466	637
2022-01-26	Semantic-Aware Video Text Detection	48.41%	76.31%	63.65%	693	500	723
2021-08-25	TransVTSpotter	45.75%	73.58%	57.56%	658	611	647
2022-06-09	VideoTextSCM	44.07%	75.19%	58.23%	858	502	556
2020-02-26	HIK_OCR	43.16%	76.78%	57.92%	702	364	850
2016-04-14	SRC-B-TextProcessingLab	23.09%	68.51%	39.40%	274	481	1161
2015-04-01	AJOU	16.44%	72.71%	36.07%	271	458	1187
2015-04-02	USTB_TexVideo II-2	12.29%	71.78%	21.93%	92	439	1385
2016-04-13	Megvii-Image++	11.02%	66.75%	37.30%	185	612	1119
2015-04-17	StradVision-1	7.92%	70.17%	25.87%	124	436	1356
2015-04-02	USTB_TexVideo	7.42%	70.75%	25.92%	142	507	1267
2015-04-02	USTB_TexVideo II-1	-59.62%	69.06%	17.59%	129	368	1419
2015-03-28	RTST Lucas-Kanade-2 (RealTimeSceneText_LucasKanade_v2)	-106.48%	64.64%	3.45%	47	308	1561

Inactive evaluations

method: GoMatching++2024-12-12

method: GoMatching2024-01-24

method: LOGO2024-05-30

Ranking Table

Ranking Graphic