method: Human Performance

Date: 2020-06-13

Authors: Task1 Organizers

Affiliation: CVIT, IIIT Hyderabad, CVC-UAB, Amazon

Description: Human performance on the test set.
A small group of volunteers was asked to enter an answer for each question, given the question and its image.

method: Applica.ai TILT

Date: 2021-02-12

Authors: Applica.ai Research Team

Affiliation: Applica.ai

Description: The TILT neural network architecture simultaneously learns layout information, visual features, and textual semantics. Contrary to previous approaches, we rely on an encoder-decoder architecture. We submitted results from a single model.

method: LayoutLM 2.0 (single model)

Date: 2020-12-22

Authors: LayoutLM Team

Affiliation: LayoutLM Team

Description: Multi-modal Pre-training for Visually-Rich Document Understanding

Ranking Table

Date        Method                       Score   Figure/Diagram  Form    Table/List  Layout  Free_text  Image/Photo  Handwritten  Yes/No  Others
2020-06-13  Human Performance            0.9811  0.9756          0.9825  0.9780      0.9845  0.9839     0.9740       0.9717       0.9974  0.9828
2021-02-12  Applica.ai TILT              0.8705  0.6082          0.9459  0.8980      0.8592  0.8581     0.5508       0.8139       0.6897  0.7788
2020-12-22  LayoutLM 2.0 (single model)  0.8672  0.6574          0.8953  0.8769      0.8791  0.8707     0.7287       0.6729       0.5517  0.8103
2020-05-16  HyperDQA_V4                  0.6893  0.3874          0.7792  0.6309      0.7478  0.7187     0.4867       0.5630       0.4138  0.5685
2021-02-08  seq2seq                      0.1081  0.0758          0.1283  0.0829      0.1332  0.0822     0.0786       0.0779       0.4828  0.1052
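
The overall ranking does not always match the per-category ranking, which is easier to see programmatically. The following is a minimal Python sketch, not part of the competition tooling: the names `COLUMNS`, `ROWS`, `rank_by`, and `best_submission` are hypothetical, and the values are copied from the ranking table above.

```python
# Illustrative sketch only: names here are hypothetical, values are copied
# from the ranking table. Column order follows the table header.
COLUMNS = ["Score", "Figure/Diagram", "Form", "Table/List", "Layout",
           "Free_text", "Image/Photo", "Handwritten", "Yes/No", "Others"]

ROWS = {
    "Human Performance": [0.9811, 0.9756, 0.9825, 0.9780, 0.9845,
                          0.9839, 0.9740, 0.9717, 0.9974, 0.9828],
    "Applica.ai TILT": [0.8705, 0.6082, 0.9459, 0.8980, 0.8592,
                        0.8581, 0.5508, 0.8139, 0.6897, 0.7788],
    "LayoutLM 2.0 (single model)": [0.8672, 0.6574, 0.8953, 0.8769, 0.8791,
                                    0.8707, 0.7287, 0.6729, 0.5517, 0.8103],
    "HyperDQA_V4": [0.6893, 0.3874, 0.7792, 0.6309, 0.7478,
                    0.7187, 0.4867, 0.5630, 0.4138, 0.5685],
    "seq2seq": [0.1081, 0.0758, 0.1283, 0.0829, 0.1332,
                0.0822, 0.0786, 0.0779, 0.4828, 0.1052],
}


def rank_by(column, rows=ROWS):
    """Return method names sorted by the given column, highest value first."""
    idx = COLUMNS.index(column)
    return sorted(rows, key=lambda name: rows[name][idx], reverse=True)


def best_submission(column, rows=ROWS, exclude=("Human Performance",)):
    """Return the top-ranked method for a column, skipping the human baseline."""
    for name in rank_by(column, rows):
        if name not in exclude:
            return name
    raise ValueError("no submissions left after exclusion")


if __name__ == "__main__":
    print("Overall ranking:", rank_by("Score"))
    for column in COLUMNS:
        print(f"{column}: {best_submission(column)}")
```

For example, `best_submission("Score")` selects Applica.ai TILT, while `best_submission("Image/Photo")` selects LayoutLM 2.0 (single model), so the leading submission overall is not the strongest in every question category.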

Ranking Graphic
