VTA | 1.00 |
QAQ | 1.00 |
M4C (single model) | 1.00 |
RUArt | 1.00 |
SMA | 1.00 |
masker | 1.00 |
SA-M4C | 1.00 |
vm | 1.00 |
TIG | 1.00 |
ssbaseline | 1.00 |
TAP | 1.00 |
M4C-CVL | 1.00 |
m4c_demo | 1.00 |
DXM_DI_AI_CV_NLP | 1.00 |
GIT, Single Model | 1.00 |
danet | 1.00 |
GIT2, Single Model | 1.00 |
TAG | 1.00 |
ROQ | 1.00 |
LTG | 1.00 |
PreSTU CC15M-SplitOCR B+B | 1.00 |
PaLI-3B | 1.00 |
PaLI-15B | 1.00 |
PaLI-17B | 1.00 |
unitnt blip | 1.00 |
micro_60 | 1.00 |
a | 1.00 |
PaLI-X | 1.00 |
SMoLA-PaLI-X Generalist Model | 1.00 |
tap_visualbert | 1.00 |
PaliGemma-3B (finetune, 896px) | 1.00 |
PaliGemma-3B (finetune, 448px) | 1.00 |
PaliGemma-3B (finetune, 224px) | 0.83 |
Clova AI OCR | 0.00 |
USTB-TQA | 0.00 |
Focus: A bottom-up approach for Scene Text VQA | 0.00 |
USTB-TVQA | 0.00 |
MM-GNN | 0.00 |
TWA | 0.00 |
KgMr | 0.00 |