Overview - ICDAR 2017 Robust Reading Challenge on Omnidirectional Video
This challenge focuses on scene text localization and recognition on the Downtown Osaka Scene Text (DOST) dataset . Five tasks will be opened within this Challenge: Text Localisation in Videos, Text Localisation in Still Images, Cropped Word Recognition, End-to-End Recognition in Videos, and End-
The DOST dataset preserves scene texts observed in the real environment as they were. The dataset contains videos (sequential images) captured in shopping streets in downtown Osaka with an omnidirectional camera. Use of the omnidirectional camera contributes to excluding user’s intention in capturing images. Sequential images contained in the dataset contribute to
Important Notice about Ground Truth Quality and Renovation
Before releasing the DOST dataset, we decided to improve (renovate) the ground truth (GT) quality of the dataset. Particularly, transcriptions and bounding boxes of GTs are improved and more non-readable text regions are covered as "Don't care" regions. However, the renovation took
We also notice the GTs are not perfect. As you can imagine, improvement of the ground truths takes a lot of effort. Though we will try best, please understand the limitation. We are
Anyway, enjoy the DOST dataset!
 Iwamura, M., Matsuda, T., Morimoto, N., Sato, H., Ikeda, Y., Kise, K..: Downtown Osaka Scene Text Dataset. ECCV 2016 International Workshop of Robust Reading, pp