Last update: Dec. 18, 2020
The following database / datasets have been developed at our laboratory and made open to the public for research purposes.
This is a Japanese scene character dataset consisting of Hiragana, Katakana, and Kanji scene character images taken in real scenes in and around Sendai, Japan.
This dataset contains 4-class Ground Truth data for the natural scene images with text provided at http://www.cs.osakafu-u.ac.jp/document/ . This dataset is intended to be used for evaluations of block-based text detection algorithms.
This dataset contains 4-class Ground Truth data of the natural scene images with text from the ICDAR 2003 Robust Reading Competition. The original image data can be found at http://algoval.essex.ac.uk/icdar/Datasets.html . This dataset is intended to be used for evaluations of block-based text detection algorithms.