method: GRAM (2024-01-16)

Authors: Tsachi Blau, Sharon Fogel, Roi Ronen, Alona Golts, Roy Ganz, Shahar Tsiper, Elad Ben Avraham, Aviad Aberdam, Ron Litman

Affiliation: AWS AI Labs and Technion Israel

Description: GRAM model based on DocFormerv2, trained on the DUDE and Multi-Page DocVQA datasets.

method: GRAM C-Former (2024-01-16)

Authors: Tsachi Blau, Sharon Fogel, Roi Ronen, Alona Golts, Roy Ganz, Shahar Tsiper, Elad Ben Avraham, Aviad Aberdam, Ron Litman

Affiliation: AWS AI Labs and Technion Israel

Description: GRAM model with C-Former, based on DocFormerv2 and trained on the DUDE and Multi-Page DocVQA datasets.

method: Hi-VT5-beamsearch (2023-04-18)

Authors: JiangLong He, Mamatha N, Shiv Vignesh, Deepak Kumar

Description: Hi-VT5 model pretrained on a private custom document collection using a span-masking objective. The pretrained model is then trained on the DUDE and Multi-Page DocVQA datasets.

Ranking Table

Column groups: Answer (ANLS); Calibration (ECE, AURC); OOD Detection (AUROC); ANLS per Answer type (Extractive, Abstractive, List of answers, Unanswerable).

Date        Method                                        ANLS    ECE     AURC    AUROC   Extractive  Abstractive  List of answers  Unanswerable
2024-01-16  GRAM                                          0.5336  0.4404  0.4404  0.5000  0.5683      0.5232       0.1996           0.6543
2024-01-16  GRAM C-Former                                 0.5097  0.4613  0.4613  0.5000  0.5515      0.5046       0.1726           0.6104
2023-04-18  Hi-VT5-beamsearch                             0.3574  0.6104  0.6104  0.5000  0.2831      0.3298       0.1060           0.6290
2023-04-21  Hi-VT5-beamsearch with token type embeddings  0.3559  0.2803  0.4603  0.4876  0.3095      0.3515       0.1176           0.5250
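
For readers unfamiliar with the ANLS column, the following is a minimal sketch of the commonly used definition of Average Normalized Levenshtein Similarity: for each question, take the best similarity against any reference answer, zero it out when the normalized edit distance reaches a threshold (0.5 is the usual choice), and average over questions. It is not the benchmark's evaluation script. The function names, the lowercase/strip normalization, and the handling of empty strings are assumptions made for illustration, and the benchmark may treat unanswerable and list-type questions differently.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insertions, deletions, substitutions)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def anls(predictions, references, threshold=0.5):
    """Average over questions of the best thresholded similarity.

    predictions: list of predicted answer strings.
    references:  list of lists of acceptable answer strings.
    Assumption: answers are compared after strip() and lower().
    """
    scores = []
    for pred, golds in zip(predictions, references):
        pred_norm = pred.strip().lower()
        best = 0.0
        for gold in golds:
            gold_norm = gold.strip().lower()
            longest = max(len(pred_norm), len(gold_norm))
            # Assumption: two empty strings count as a perfect match.
            nl = levenshtein(pred_norm, gold_norm) / longest if longest else 0.0
            similarity = 1.0 - nl if nl < threshold else 0.0
            best = max(best, similarity)
        scores.append(best)
    return sum(scores) / len(scores) if scores else 0.0
```

For example, `anls(["abcd"], [["abce"]])` scores a single one-character substitution in a four-character answer, so the normalized distance is 0.25 and the similarity is 0.75. A prediction with no characters in common with its reference falls past the threshold and contributes 0.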

Ranking Graphic