SA-M4C | 1.00 |
GIT2, Single Model | 1.00 |
PreSTU CC15M-SplitOCR B+B | 1.00 |
PaLI-3B | 1.00 |
PaLI-15B | 1.00 |
PaLI-17B | 1.00 |
micro_60 | 1.00 |
PaLI-X | 1.00 |
SMoLA-PaLI-X Generalist Model | 1.00 |
PaliGemma-3B (finetune, 896px) | 1.00 |
PaliGemma-3B (finetune, 448px) | 1.00 |
PaliGemma-3B (finetune, 224px) | 0.94 |
unitnt blip | 0.81 |
ROQ | 0.69 |
LTG | 0.69 |
M4C (single model) | 0.56 |
DXM_DI_AI_CV_NLP | 0.56 |
SMA | 0.50 |
vm | 0.50 |
ssbaseline | 0.50 |
m4c_demo | 0.50 |
KgMr | 0.50 |
Clova AI OCR | 0.00 |
USTB-TQA | 0.00 |
Focus: A bottom-up approach for Scene Text VQA | 0.00 |
USTB-TVQA | 0.00 |
VTA | 0.00 |
QAQ | 0.00 |
MM-GNN | 0.00 |
RUArt | 0.00 |
masker | 0.00 |
TIG | 0.00 |
TAP | 0.00 |
M4C-CVL | 0.00 |
TWA | 0.00 |
GIT, Single Model | 0.00 |
danet | 0.00 |
TAG | 0.00 |
a | 0.00 |
tap_visualbert | 0.00 |