News - ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

02/17/2023

Updated training/validation set

Dear participants, 

Update:
[17/02]: The training and validation dataset has been updated to also contain list and unanswerable questions. Question-answer pairs from the previous subset have been revised.

The links under Download have been updated to point to the newest annotations file, as well as to the binaries with some patched OCR files.

For the easiest access to the latest version of the dataset, we suggest you use DUDE_loader on HuggingFace Datasets.

 

Important Dates

Note: The time zone of all deadlines is UTC-12. The cut-off time for all dates is 11:59 PM.

December 30, 2022

Website ready

January 10-12, 2023

1) Task 1&2 training dataset available

2) Task 3&4 training dataset available

March 10, 2023

1) Test set of task 1 available, submission open

March 15, 2023

1) Task 1 submission deadline

2) Test set of task 2 available and submission open

March 20, 2023

1) Task 2 submission deadline

------------------------------------------------------

March 6, 2023

1) Test set of task 3 available

March 10, 2023

1) Task 3 submission open

March 17, 2023

1) Task 3 submission deadline

2) Test set of task 4 available and submission open

3) Few-shot training examples of task 4 available

March 24, 2023

1) Task 4 submission deadline

March 25, 2023

Submit reproducible script and short description of the method for Task 1-4. (The detailed instructions will be uploaded.)

March 27, 2023

The notification for reproducible script submission has been sent to top-5 participants via email.

Note: Task 1&2  submission data has been extended.

Note: Task 3&4  submission data has been extended.