cff-version: 1.2.0
abstract: "
This dataset is derived from the Ot & Sien Dataset (https://lab.kb.nl/dataset/ot-sien-dataset). We corrected mistakes in the original and made it ML-ready.
Its purpose is to support the development of automatic visual object detection in children's book illustrations. Its properties are:
- The dataset consists of illustrations rather than standard photos.
- It contains 1452 images annotated with 8241 objects (about 5.7 per image); each annotation gives the object's category and bounding box.
- All images are resized to 416 x 416 and padded with black edges to fit, to suit the training procedure.
- The category distribution is naturally long-tailed and therefore imbalanced, with some object categories being rare.
"
authors:
  - family-names: Wang
    given-names: Haoran
  - family-names: Khademi
    given-names: Seyran
title: "Ot & Sien, a dataset to help the development of object detection in children's book illustrations"
keywords:
version: 1
identifiers:
  - type: doi
    value: 10.4121/d1f3ca5c-f1e4-48f5-9a04-0564572d2b9c.v1
license: CC-BY-4.0
date-released: 2023-06-23