- You are registered as a user on the tracebloc platform
- You have a client deployed using kubernetes either locally or in the cloud
- Your dataset is cleaned and preprocessed
- You are familiar with the supported data types and tasks
- Preparing and ingesting the dataset
- Defining the use case
- Setting evaluation metrics
- Evaluating models
Supported Data Types and Tasks
Image Data
Requirements for all image data tasks: Uniform image sizes and uniform file types. For example all images as 256x256 rgb .jpg files. Convert files if necessary and in case your images do not fit the supported size, crop or resize accordingly. All images are validated before ingestion by the data ingestor. The ingestion process only starts when every file meets the requirements. Fix or remove any invalid images, then retry. For object detection, images and annotations are automatically up- or downsized to 448x448 pixels.| Task | Input file type | Color mode | Supported image size | Label file type | Requirements | Links |
|---|---|---|---|---|---|---|
| Classification | PNG, JPG, JPEG | rgb (3 channels) or grayscale (1 channel), 8-bit per channel | Square (height = width) | CSV | Uniform image size and file type per dataset | Detailed structure Example |
| Keypoint Detection | PNG, JPG, JPEG | rgb (3 channels) or grayscale (1 channel), 8-bit per channel | Square (height = width) | CSV | Uniform image sizes Same number of keypoints per image and class | Detailed structure Example |
| Object Detection | PNG, JPG, JPEG | rgb (3 channels) or grayscale (1 channel), 8-bit per channel | Square (height = width). Images and annotations will be resized to 448x448 px automatically | Pascal VOC | Uniform image sizes, one xml per image | Detailed structure Example |
| Semantic Segmentation | PNG, JPG, JPEG | rgb (3 channels) or grayscale (1 channel), 8-bit per channel | Square (height = width) | PNG, JPG, JPEG | Uniform image and mask sizes | Detailed structure Example |
Image Classification
Image Keypoint Detection
The number of keypoints per class and per image must be fixed. For example, in a person/car keypoint detection project, both classes must define the same keypoints, and every image must contain the full set for its class. You cannot mix classes with different keypoint counts (e.g., 16 for person and 32 for car) or annotate some images with fewer keypoints for the same class.- X and Y determine the X-/Y-coordinates of a keypoint
- Visibility indicates whether a keypoint is visible in the image or not: 0 = not visible (point outside the image or point is in the image but occluded), 1 = visible
- Filename should not include the file extension.
Image Object Detection
The filename like “street01.png” specifies the link between images and annotations. XML-file annotations are in Pascal VOC format. The labels.csv contains a global list of all images and objects. Currently, the images and annotations are resized to 448x448 pixels.Image Semantic Segmentation
Each mask is an rgb image whose pixel values map to classes defined in labels.csv. The labels.csv contains a global list per image and class. All masks must exactly match their corresponding image sizes and file names. For binary segmentation (two classes), provide a single-channel grayscale mask where background pixels are black (0) and foreground pixels are white (255). For three or more classes, supply an RGB mask where each class is represented by a unique color (or pixel value). The filename should not include the file extension.Tabular Data
Requirements for all tabular data tasks: Each dataset must be provided as a single CSV file with a header row. Every column must contain uniform data types, for example numeric values for features and a categorical or alphanumeric column for labels. Use UTF-8 encoding with comma separators and validate that your schema matches the expected types. Invalid rows are skipped by the ingestor.| Task | Data file type | Requirements | Links |
|---|---|---|---|
| Classification | CSV (features and label in one single file) | Uniform data formats per column. Feature columns: Numeric Label columns: Alphanumeric | Detailed structure Example |
Tabular Classification
Include a header row with clear column names, using a dedicated column for the labels. Anid column is recommended but not required.
Regression
Regression tasks are supported on the platform. Evaluation uses Mean Absolute Error (MAE). For details on supported metrics, see Supported Metrics. Reach out to us at [email protected] for guidance on data format requirements for regression use cases.Text Data
| Task | Input files | Label file type | Requirements | Links |
|---|---|---|---|---|
| Classification | TXT | CSV | Text file may not be empty | Detailed structure Example |
Text Classification
The filename should not include the file extension.file example
Next Steps
- Prepare and ingest your dataset: Prepare Data
Need Help?
- Email us at [email protected]