method: Lambert 2.0 + Excluding OCR Mismatch2021-01-01

Authors: research team


Description: Upgraded Lambert model (based on RoBERTa-base) trained longer on bigger datasets with extra layout embeddings (sinusoidal + relative) for each subtoken (paper is under preparation).

Following the same evaluation rules as others, the OCR mismatch errors are excluded in the submission.

1. We submitted the best solution out of 100 fine-tuned models