method: GRAM2024-01-16

Authors: Tsachi Blau, Sharon Fogel, Roi Ronen, Alona Golts, Roy Ganz, Shahar Tsiper, Elad Ben Avraham, Aviad Aberdam, Ron Litman

Affiliation: AWS AI Labs and Technion Israel

Description: GRAM model based on Docformerv2 trained on DUDE and Multi-Page DocVQA dataset.

Ranking Table

Description Paper Source Code
AnswerCalibrationOOD DetectionANLS per Answer type
DateMethodANLSECEAURCAUROCExtractiveAbstractiveList of answersUnanswerable
2024-08-30Snowflake Arctic-TILT 0.8B0.58090.07630.25290.52890.62710.56450.46690.6261
2024-05-31GPT-4 Vision Turbo + Azure OCR0.53920.55830.43170.50000.59730.52480.57850.5131
2024-01-16GRAM0.53360.44040.44040.50000.56830.52320.19960.6543
2024-01-16GRAM C-Former0.50970.46130.46130.50000.55150.50460.17260.6104
2023-04-20DocGptVQA0.50020.22400.42100.87440.51860.48320.28220.6204
2023-04-16DocBlipVQA0.47620.30650.48600.78290.50690.46310.30730.5522
2023-03-27model_03270.46590.19040.43980.88540.55210.46600.17860.4726
2023-03-16T5-concat0.38670.24890.43430.51130.37270.37500.16810.5289
2023-04-20Multi-Modal T5 VQA0.37900.59310.59310.50000.41550.40240.20210.3467
2023-04-19Multi-Modal T5 VQA0.37890.59310.59310.50000.41540.40220.20310.3467
2023-04-18Hi-VT5-beamsearch0.35740.61040.61040.50000.28310.32980.10600.6290
2023-04-21Hi-VT5-beamsearch with token type embeddings0.35590.28030.46030.48760.30950.35150.11760.5250
2023-04-26QAP0.11590.41680.90760.50140.00090.00070.00000.6199

Ranking Graphic

Ranking Graphic

Ranking Graphic

Ranking Graphic