Skip to content

pynicolas/fairscan-dataset

Repository files navigation

FairScan Dataset

This repository contains a small, manually annotated dataset for training and evaluating the segmentation model used by FairScan to detect documents.

Overview

  • Contents: Images of documents (photos) and corresponding binary masks
  • Format: JPEG images + PNG masks (1 channel)
  • Annotations: Created manually using LabelMe
  • License: CC BY-NC-SA 4.0

Structure

The dataset is available via the Releases section of this repository. It is structured as follows:

.
├── train/
│   ├── images/
│   │   ├── img001.jpg
│   │   └── ...
│   └── masks/
│       ├── img001.png
│       └── ...
└──  val/
    ├── images/
    └── masks/

Example

Here's an example of an image and the associated mask:

image mask

License

The dataset is released under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license.

You are free to:

  • Share — copy and redistribute the material in any medium or format
  • Adapt — remix, transform, and build upon the material

Under the following terms:

  • Attribution — You must give appropriate credit.
  • NonCommercial — You may not use the material for commercial purposes.
  • ShareAlike — If you remix, transform, or build upon the material, you must distribute your contributions under the same license.

See the full license text here.


Feel free to open an issue if you notice errors in the annotations or would like to contribute improvements.

About

A small, manually annotated dataset for training and evaluating FairScan's segmentation model.

Topics

Resources

License

Stars

Watchers

Forks

Languages