Datasets

A dataset is a grouping of images that you use to train machine learning models. On the DATA tab in the Viam app, you can create and manage datasets.

Labels

You label the images in your dataset with bounding boxes or image tags, depending on the type of model you intend to train: bounding boxes for object detection models, and image tags for classification models.

API

To interact with datasets programmatically, use the data client API, which supports the following methods for working with datasets:

Method Name                    Description
CreateDataset                  Create a new dataset.
DeleteDataset                  Delete a dataset.
RenameDataset                  Rename a dataset specified by the dataset ID.
ListDatasetsByOrganizationID   Get the datasets in an organization.
ListDatasetsByIDs              Get a list of datasets using their IDs.
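
As a rough illustration of how these methods might be combined, the following Python sketch looks up a dataset by name and creates it only if it does not exist yet. It assumes a client object with async methods `list_datasets_by_organization_id(organization_id)` and `create_dataset(name, organization_id)`, and assumes listed datasets expose `id` and `name` attributes. These snake_case names and signatures are assumptions modeled on the method names in the table above, so check the SDK reference before relying on them. The helper `get_or_create_dataset`, the `InMemoryDataClient` stand-in, and the organization ID are hypothetical and exist only so the sketch runs offline.

```python
import asyncio
from types import SimpleNamespace


async def get_or_create_dataset(data_client, organization_id: str, name: str) -> str:
    """Return the ID of the dataset called `name`, creating it if it is missing.

    `data_client` is assumed to offer the two async methods used below.
    """
    datasets = await data_client.list_datasets_by_organization_id(organization_id)
    for dataset in datasets:
        if dataset.name == name:
            return dataset.id
    # No dataset with that name yet: create one and return its new ID.
    return await data_client.create_dataset(name=name, organization_id=organization_id)


class InMemoryDataClient:
    """Hypothetical stand-in for a real data client; keeps datasets in a list."""

    def __init__(self):
        self._datasets = []

    async def list_datasets_by_organization_id(self, organization_id):
        return [d for d in self._datasets if d.organization_id == organization_id]

    async def create_dataset(self, name, organization_id):
        dataset_id = f"dataset-{len(self._datasets) + 1}"
        self._datasets.append(
            SimpleNamespace(id=dataset_id, name=name, organization_id=organization_id)
        )
        return dataset_id


async def demo():
    client = InMemoryDataClient()
    # The second call should find the dataset created by the first call.
    first = await get_or_create_dataset(client, "my-org-id", "shapes")
    second = await get_or_create_dataset(client, "my-org-id", "shapes")
    return first, second


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

To try this against a real organization, you would pass an authenticated data client in place of `InMemoryDataClient`, provided its methods match the assumed signatures.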

Sample dataset

If you are testing, you can use one of two provided sample datasets, one of shapes and the other of a wooden figure:

[Images: the shapes dataset; the DATASETS subtab of the DATA tab in the Viam app, showing a custom 'viam-figure' dataset of 25 images, most containing the wooden Viam figure]

  1. Download the shapes dataset or download the wooden figure dataset.

  2. Unzip the download.

  3. Open a terminal and go to the dataset folder.

  4. In it you will find a Python script to upload the data to the Viam app.

  5. Open the script and fill in the constants at the top of the file.

  6. Run the script to upload the data into a dataset in the Viam app:

    python3 upload_data.py
    

Next steps

The following how-to guide contains instructions on creating datasets as well as on how to train a model on a dataset:
