method: iFLYTEK-DOCR2020-05-05

Authors: Chenyu Liu, Fengren Wang, Jiajia Wu, Jinshui Hu, Bing Yin, Cong Liu

Affiliation: iFLYTEK Research

Description: By treating this problem as a retrieval task, we presented DOCument OCR Retrieval (DOCR). Our method consists of three building blocks:
1) layout & document level analysis, within OCR and Key-Value Extracting included;
2) sequence labeling and parsing for each question;
3) a fuzzy search algorithm for retrieving and ranking.