method: GoMatching2024-01-19

Authors: HeHaibin, YeMaoyuan, ZhangJing, LiuJuhua, TaoDacheng

Affiliation: Wuhan University-AI.INST

Description: We extend off-the-shelf image text spotter DeepSolo to video text spotter via long-short term matching module.

method: TransDETR2022-04-15

Authors: weijia

Affiliation: Zhejiang University&Kuaishou(MMU)

Email: weijiawu@zju.edu.cn

Description: A simple, but effective end-to-end video text DEtection, Tracking, and Recognition framework (TransDETR). TransDETR mainly includes two advantages: 1) Different from the explicit match paradigm in the adjacent frame, TransDETR tracks and recognizes each text implicitly by the different query termed text query over long-range temporal sequence (more than 7 frames). 2) TransDETR is the first end-to-end trainable video text spotting framework, which simultaneously addresses the three sub-tasks (e.g., text detection, tracking, recognition).

Ranking Table

Description Paper Source Code
DateMethodMOTAMOTPIDF1Mostly MatchedPartially MatchedMostly Lost
2024-01-19GoMatching72.04%78.53%80.11%1002205160
2022-04-15TransDETR60.96%74.61%72.80%644323400

Ranking Graphic