method: AJOU2015-04-01

Authors: Hyung Il Koo and Yong Gyun Kim

Description: This text detection and tracking system was submitted by Hyung Il Koo and Yong Gyun Kim from Ajou University, Suwon, Korea. The system first extracts connected components (CCs) with the maximally stable extremal region (MSER) algorithm and finds word candidates by using the bottom-up grouping clustering method [1]. For each candidate region, we find corresponding regions in the other frames with the CSK trackers [2] and build chains. We also detect collisions between the chains and filter out non-promising ones.
[1] Hyung Il Koo and Duck Hoon Kim, "Scene Text Detection via Connected Component Clustering and Nontext Filtering," IEEE Transactions on Image Processing, vol.22, no.6, pp.2296,2305, June 2013
[2] J. F. Henriques, R. Caseiro, P. Martins, J. Batista, “Exploiting the Circulant Structure of Tracking-by-detection with Kernels,”, ECCV 2012

method: USTB_TexVideo II-22015-04-02

Authors: Xu-Cheng Yin, Shu Tian, Ze-Yu Zuo, Wei-Yi Pei, Chun Yang

Description: In USTB_TexVideo II-1, multi-orientation text detection is conducted with USTB_TexStar [1,2], and text tracking with empricial rules is performed for improving detection results. In addition, during tracking, SURF is used to handle miss detecionts. At last, a trained classifier is used to remove false detections and retrieve detections which are decided to be false detections by tracking.

[1] Xu-Cheng Yin, Xuwang Yin, Kaizhu Huang, and Hong-Wei Hao, “Robust text detection in natural scene images”, IEEE Trans. Pattern Analysis and Machine Intelligence, 36(5): 970-983, 2014.
[2] Xu-Cheng Yin, Wei-Yi Pei, Xuwang Yin, Jun Zhang, and Hong-Wei Hao, “Multi-orientation scene text detection with adaptive clustering,” IEEE Trans. Pattern Analysis and Machine Intelligence (TPAMI), preprint, 2015.

method: Megvii-Image++2016-04-13

Authors: Jia Yu, Xinyu Zhou, Cong Yao, Jianan Wu, Chi Zhang, Shuchang Zhou

Description: The detection part is accomplished by a FCN which directly extracts text regions from original images. The tracker is a net flow based association algorithm.

Ranking Table

Description Paper Source Code
DateMethodMOTAMOTPIDF1Mostly MatchedPartially MatchedMostly Lost
2015-04-01AJOU53.45%73.25%0.00%
2015-04-02USTB_TexVideo II-250.38%72.47%0.00%
2016-04-13Megvii-Image++49.68%67.41%0.00%
2015-04-02USTB_TexVideo49.33%71.33%0.00%
2015-04-17StradVision-147.58%70.82%0.00%
2015-04-02USTB_TexVideo II-119.69%69.51%0.00%
2015-03-28RTST Lucas-Kanade-2 (RealTimeSceneText_LucasKanade_v2)-20.28%64.44%0.00%
2016-04-14SRC-B-TextProcessingLab0.00%0.00%0.00%

Ranking Graphic