GoMatching++ | 83.29% | 82.44% | 87.16% | 22 | 11 | 5 | 1422 | 12 | 46 | 218 |
TransDETR | 79.84% | 76.97% | 84.82% | 21 | 7 | 10 | 1352 | 7 | 33 | 293 |
GoMatching | 78.33% | 81.01% | 80.82% | 20 | 11 | 7 | 1374 | 12 | 80 | 266 |
h&h_lab | 76.51% | 80.26% | 75.46% | 20 | 10 | 8 | 1334 | 16 | 70 | 302 |
LOGO | 75.54% | 71.82% | 79.58% | 18 | 11 | 9 | 1303 | 11 | 55 | 338 |
TransVTSpotter | 74.03% | 76.30% | 81.55% | 18 | 9 | 11 | 1248 | 13 | 25 | 391 |
GOCR Offline | 68.40% | 76.00% | 78.31% | 18 | 10 | 10 | 1266 | 21 | 136 | 365 |
SVRepV2(Kuaishou-MMU) | 67.86% | 78.95% | 75.25% | 19 | 15 | 4 | 1232 | 11 | 111 | 409 |
GOCR | 67.13% | 75.92% | 77.61% | 20 | 13 | 5 | 1276 | 31 | 167 | 345 |
VideoTextSCM | 64.95% | 78.24% | 73.78% | 15 | 14 | 9 | 1158 | 19 | 85 | 475 |
TA-VTT | 52.42% | 76.89% | 56.49% | 12 | 14 | 12 | 947 | 18 | 81 | 687 |
Semantic-Aware Video Text Detection | 51.09% | 76.66% | 57.75% | 10 | 16 | 12 | 903 | 18 | 59 | 731 |
HIK_OCR | 49.09% | 80.74% | 53.69% | 12 | 7 | 19 | 845 | 17 | 34 | 790 |
SRC-B-TextProcessingLab | 32.99% | 72.33% | 48.85% | 4 | 11 | 23 | 637 | 18 | 92 | 997 |
AJOU | 16.22% | 74.16% | 33.09% | 2 | 10 | 26 | 461 | 27 | 193 | 1164 |
StradVision-1 | 13.86% | 70.86% | 29.64% | 3 | 7 | 28 | 507 | 39 | 278 | 1106 |
USTB_TexVideo | 13.50% | 74.27% | 25.77% | 1 | 6 | 31 | 346 | 13 | 123 | 1293 |
Megvii-Image++ | 9.87% | 64.74% | 39.17% | 3 | 11 | 24 | 546 | 8 | 383 | 1098 |
USTB_TexVideo II-2 | 8.35% | 73.54% | 17.09% | 1 | 4 | 33 | 190 | 14 | 52 | 1448 |
USTB_TexVideo II-1 | 1.21% | 69.64% | 32.22% | 3 | 9 | 26 | 480 | 15 | 460 | 1157 |
RTST Lucas-Kanade-2 (RealTimeSceneText_LucasKanade_v2) | -115.68% | 65.57% | 5.92% | 1 | 8 | 29 | 139 | 214 | 2050 | 1299 |