SVRepV2(Kuaishou-MMU) | 43.46% | 77.94% | 61.10% | 31 | 19 | 10 | 1986 | 128 | 697 | 852 |
h&h_lab | 42.18% | 78.92% | 47.35% | 11 | 36 | 13 | 1472 | 68 | 221 | 1426 |
LOGO | 40.46% | 71.80% | 45.09% | 24 | 21 | 15 | 1797 | 87 | 597 | 1082 |
GoMatching++ | 40.42% | 81.25% | 53.42% | 35 | 15 | 10 | 2060 | 84 | 861 | 822 |
GoMatching | 38.64% | 80.60% | 52.27% | 34 | 14 | 12 | 1986 | 85 | 840 | 895 |
HIK_OCR | 33.85% | 75.95% | 44.89% | 20 | 25 | 15 | 1490 | 92 | 486 | 1384 |
VideoTextSCM | 33.11% | 76.76% | 45.90% | 16 | 24 | 20 | 1348 | 39 | 366 | 1579 |
TransDETR | 27.48% | 76.84% | 45.03% | 11 | 31 | 18 | 1479 | 48 | 664 | 1439 |
GOCR | 26.43% | 75.73% | 57.33% | 28 | 22 | 10 | 1951 | 66 | 1167 | 949 |
GOCR Offline | 26.40% | 75.96% | 57.60% | 27 | 20 | 13 | 1898 | 56 | 1115 | 1012 |
TA-VTT | 26.26% | 80.40% | 50.75% | 19 | 22 | 19 | 1598 | 79 | 819 | 1289 |
Semantic-Aware Video Text Detection | 25.46% | 80.83% | 45.69% | 13 | 26 | 21 | 1390 | 65 | 635 | 1511 |
TransVTSpotter | 21.21% | 76.56% | 28.97% | 8 | 39 | 13 | 1121 | 123 | 492 | 1722 |
AJOU | 13.99% | 72.91% | 42.11% | 15 | 21 | 24 | 1257 | 36 | 842 | 1673 |
SRC-B-TextProcessingLab | 8.19% | 64.25% | 24.67% | 3 | 23 | 34 | 666 | 55 | 423 | 2245 |
USTB_TexVideo II-2 | 7.79% | 70.68% | 14.44% | 0 | 19 | 41 | 380 | 50 | 149 | 2536 |
USTB_TexVideo | 2.56% | 68.20% | 17.49% | 0 | 22 | 38 | 541 | 54 | 465 | 2371 |
USTB_TexVideo II-1 | -20.33% | 67.01% | 10.12% | 2 | 12 | 46 | 272 | 17 | 875 | 2677 |
RTST Lucas-Kanade-2 (RealTimeSceneText_LucasKanade_v2) | -16.86% | 72.90% | 1.24% | 0 | 3 | 57 | 25 | 28 | 525 | 2913 |
Megvii-Image++ | -12.71% | 60.17% | 17.56% | 4 | 15 | 41 | 474 | 14 | 851 | 2478 |
StradVision-1 | -0.57% | 63.20% | 14.13% | 2 | 14 | 44 | 391 | 43 | 408 | 2532 |