method: GoMatching2024-01-24

Authors: HeHaibin, YeMaoyuan, ZhangJing, LiuJuhua, TaoDacheng

Affiliation: Wuhan University

Description: We extend off-the-shelf image text spotter DeepSolo to video text spotter via long-short term matching module.

method: h&h_lab2021-08-02

Authors: HUST_VLRGROUP(Dian Jin) & HUAWEI_CLOUD_EI(Jing Wang, Shenggao Zhu)

Affiliation: h&h_lab

Description: This method is a technical method specially designed for the competition. Specifically, we try to employ the textual transcription of the word in the video to distinguish different text objects in the video. The use of textual transcription of the word boosts the performance of video text tracking a lot but also requires the supervision of the recognition results of the text instances. It’s a rude but useful method. For task 4, we further apply the post-processing method to get a more accurate recognition result for each text object after the text trajectory being generated.

method: TransVTSpotter2021-08-25

Authors: weijia

Affiliation: Zhejiang University&Kuaishou(MMU)

Description: This is base on our paper `A Multilingual, Open World Video Text Dataset and End-to-end Video Text Spotter with Transformer'

Ranking Table

Description Paper Source Code
DateMethodMOTAMOTPIDF1Mostly MatchedPartially MatchedMostly Lost
2024-01-24GoMatching60.02%77.85%70.85%1095345476
2021-08-02h&h_lab53.74%76.92%65.51%874536506
2021-08-25TransVTSpotter45.75%73.58%57.56%658611647
2022-06-09VideoTextSCM44.07%75.19%58.23%858502556
2020-02-26HIK_OCR43.16%76.78%57.92%702364850
2015-04-01AJOU16.44%72.71%36.07%2714581187
2015-04-02USTB_TexVideo II-212.29%71.78%21.93%924391385
2016-04-13Megvii-Image++11.02%66.75%37.30%1856121119
2015-04-17StradVision-17.92%70.17%25.87%1244361356
2015-04-02USTB_TexVideo7.42%70.75%25.92%1425071267
2015-04-02USTB_TexVideo II-1-59.62%69.06%17.59%1293681419
2015-03-28RTST Lucas-Kanade-2 (RealTimeSceneText_LucasKanade_v2)-106.48%64.64%3.45%473081561

Ranking Graphic