en

Image-annotations-to-dataset

Start using DSS computer vision feature on your datasets by using Convert object detection annotations recipe !

Plugin information

Version 1.0.0
Author Dataiku (Claire BEHUE)
Released 2021-11
Last updated 2021-11
License Apache Software License
Source code Github repository
Reporting issues Github issues

With this plugin, you will be able to:

  • Convert image annotations into a dataset compatible with object detection in DSS

How to set up

Just install the plugin, no specific code environment is required.

How to use

Let’s assume that you have a Dataiku DSS project with a folder containing images & annotations in COCO or Pascal VOC format

To create your first recipe, navigate to the Flow, click on the + RECIPE button and access the recipe from there. If your folder is selected, you can directly find the plugin on the right panel.

DSS Action panel containing the plugin recipe

Convert object detection annotations recipe

Input

  • A folder with images and object detection annotations under COCO or Pascal VOC.

Output

  • A dataset with two columns : images_annotations & images_path, compatible with object detection in DSS.
Output dataset format

Settings

Recipe parameters

The settings modal lets you choose first the input format (see official documentations to know which format match your data : COCO or Pascal VOC).

Images Folder

For both formats you need to enter the path to the subfolder containing your images. The image folder path must be relative to the managed folder root and can be left empty if the images files are directly at the root of your managed folder.
Example: ‘B/’ if your managed folder contains a subfolder B which contains the images.

Annotations file path (COCO)

Coco annotations are all stored into a single JSON file, you must specify the relative path to this file from your managed folder root. This path must include the file name and be relative to the managed folder root.
Example: ‘annotations/_annotations.coco.json’ if your managed folder contains a subfolder ‘annotations’ which contains your annotations file.

Annotations folder (Pascal VOC)

For Pascal VOC, annotations are split into different files: one XML file for each image. You must specify the path of the subfolder containing these XML files. This annotation folder path must be relative to the managed folder root and can be left empty if your annotations files are directly at the root of your managed folder.
Example: ‘Annotations/’ if your managed folder contains a subfolder Annotations which contains the XML annotations files.
Note: from Pascal VOC documentation “The top-left pixel in the image has coordinates (1,1)”. Running this recipe will remove this offset to be compliant with DSS.

Get the Dataiku Data Sheet

Learn everything you ever wanted to know about Dataiku (but were afraid to ask), including detailed specifications on features and integrations.

Get the data sheet