masker | 1.00 |
SA-M4C | 1.00 |
TAP | 1.00 |
M4C-CVL | 1.00 |
m4c_demo | 1.00 |
DXM_DI_AI_CV_NLP | 1.00 |
TWA | 1.00 |
GIT, Single Model | 1.00 |
ROQ | 1.00 |
LTG | 1.00 |
PreSTU CC15M-SplitOCR B+B | 1.00 |
PaLI-3B | 1.00 |
PaLI-15B | 1.00 |
KgMr | 1.00 |
PaLI-X | 1.00 |
SMoLA-PaLI-X Generalist Model | 1.00 |
tap_visualbert | 1.00 |
PaliGemma-3B (finetune, 896px) | 1.00 |
PaliGemma-3B (finetune, 448px) | 1.00 |
PaliGemma-3B (finetune, 224px) | 1.00 |
VTA | 0.83 |
ssbaseline | 0.83 |
TAG | 0.71 |
M4C (single model) | 0.67 |
vm | 0.67 |
TIG | 0.67 |
Clova AI OCR | 0.55 |
USTB-TQA | 0.00 |
Focus: A bottom-up approach for Scene Text VQA | 0.00 |
USTB-TVQA | 0.00 |
QAQ | 0.00 |
MM-GNN | 0.00 |
RUArt | 0.00 |
SMA | 0.00 |
danet | 0.00 |
GIT2, Single Model | 0.00 |
PaLI-17B | 0.00 |
unitnt blip | 0.00 |
micro_60 | 0.00 |
a | 0.00 |