Datasets

Open Datasets: gathered in the wild, tamed in the lab

DroneWaste

UAV imagery dataset for waste recognition in real landfill sites.

DroneWaste targets automated identification of waste materials from drone-derived orthomosaics, supporting environmental inspection workflows. Each visible waste instance is annotated with a segmentation mask, bounding box, and category mapped to the European Waste Code (EWC).

Modality: RGB aerial image tiles
Scale: 4,993 images • 5,135 annotations • 20 materials
Tasks: Instance segmentation, object detection
License: CC BY 4.0

BaleUAVision

High-resolution UAV dataset for hay bale detection, segmentation, and counting.

BaleUAVision provides large-scale, field-diverse aerial imagery of hay bale fields with human annotations and multiple annotation formats (COCO/YOLO/CSV/JSON/masks), plus orthomosaics to support mapping and simulation workflows.

Modality: High-res RGB (4056x3040), geo-referenced + orthophotos
Scale: 2,599 images • 16 fields • ~44 GB
Tasks: Segmentation, detection, counting
License: CC BY 4.0

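
Since BaleUAVision ships COCO-format annotations, a counting baseline can start from the annotation file alone. The sketch below tallies annotated instances per image using the standard COCO keys (`images`, `annotations`, `categories`). The file names and the category name `bale` in the example are assumptions for illustration, not values taken from the dataset.

```python
from collections import Counter


def bales_per_image(coco, category_name=None):
    """Count annotated instances per image in a COCO-style dict.

    If category_name is given, only annotations of that category are counted.
    Returns {file_name: count}, including images with zero annotations.
    """
    wanted = None
    if category_name is not None:
        wanted = {c["id"] for c in coco["categories"] if c["name"] == category_name}

    counts = Counter()
    for ann in coco["annotations"]:
        if wanted is None or ann["category_id"] in wanted:
            counts[ann["image_id"]] += 1

    # Counter returns 0 for image ids that never appeared in an annotation.
    return {img["file_name"]: counts[img["id"]] for img in coco["images"]}


# Hand-made example; file names and the "bale" category are hypothetical.
example = {
    "images": [
        {"id": 1, "file_name": "field_a_001.jpg"},
        {"id": 2, "file_name": "field_a_002.jpg"},
    ],
    "categories": [{"id": 1, "name": "bale"}],
    "annotations": [
        {"id": 10, "image_id": 1, "category_id": 1, "bbox": [100, 120, 40, 38]},
        {"id": 11, "image_id": 1, "category_id": 1, "bbox": [300, 90, 42, 40]},
        {"id": 12, "image_id": 1, "category_id": 1, "bbox": [520, 400, 39, 41]},
    ],
}

print(bales_per_image(example, category_name="bale"))
```

In practice the dict would come from `json.load` on the dataset's COCO file, and the per-image counts can serve as ground truth for a counting model.
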
Delivering Data

Real-world last-mile pharmaceutical delivery dataset for VRP research.

A benchmark dataset built from real 3PL operations. It provides privacy-preserving matrices (no raw addresses) and richly structured constraints to test VRP solvers and learning-based routing.

Problem: CVRPTW-like daily routing instances
Scale: 9 daily instances • ~60-85 stops/day
Data: Distance + time matrices (optimistic/most-likely/pessimistic) + order characteristics
License: CC BY 4.0

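
The three time matrices form a three-point estimate per stop pair. One common way to collapse such an estimate into a single expected value is the PERT weighted mean, (optimistic + 4 × most-likely + pessimistic) / 6. The dataset does not prescribe this formula; it is an assumption of this sketch, as are the function names and the toy 3-stop matrices below.

```python
def pert_matrix(optimistic, most_likely, pessimistic):
    """Element-wise PERT mean of three equally shaped travel-time matrices."""
    return [
        [(o + 4 * m + p) / 6 for o, m, p in zip(row_o, row_m, row_p)]
        for row_o, row_m, row_p in zip(optimistic, most_likely, pessimistic)
    ]


def route_duration(matrix, route):
    """Sum travel times along consecutive stops of a route (list of indices)."""
    return sum(matrix[a][b] for a, b in zip(route, route[1:]))


# Toy travel times in minutes for a depot (0) and two stops; values are made up.
optimistic = [[0, 8, 12], [8, 0, 5], [12, 5, 0]]
most_likely = [[0, 10, 15], [10, 0, 6], [15, 6, 0]]
pessimistic = [[0, 18, 24], [18, 0, 13], [24, 13, 0]]

expected = pert_matrix(optimistic, most_likely, pessimistic)
route = [0, 1, 2, 0]  # depot -> stop 1 -> stop 2 -> depot

print("expected duration:", route_duration(expected, route))
print("pessimistic duration:", route_duration(pessimistic, route))
```

Evaluating the same route against the pessimistic matrix gives a quick robustness check: a plan that meets its time windows on expected times but not on pessimistic ones has little slack.
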
CoFly-WeedDB

Labelled UAV dataset for weed detection and species identification in cotton fields.

CoFly-WeedDB includes expert agronomist annotations for three common weed species (Johnson grass, field bindweed, purslane) and is suitable for training segmentation models in precision agriculture.

Modality: RGB (1280x720)
Scale: 201 annotated images (LabelMe) • 3 weed species
Tasks: Semantic segmentation, species classification
License: CC BY 4.0
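
LabelMe stores annotations as polygons in JSON, while segmentation models usually train on per-pixel class masks. The following is a minimal, dependency-free sketch of that conversion using the standard LabelMe keys (`shapes`, `label`, `points`, `imageHeight`, `imageWidth`). The label strings and class ids are assumptions; the actual label names in the dataset's files may differ.

```python
import math


def point_in_polygon(px, py, polygon):
    """Even-odd ray casting test for a point against a list of (x, y) vertices."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        if (y1 > py) != (y2 > py):  # edge crosses the horizontal line through py
            x_cross = x1 + (py - y1) * (x2 - x1) / (y2 - y1)
            if px < x_cross:
                inside = not inside
    return inside


def labelme_to_mask(doc, class_ids):
    """Rasterize LabelMe polygons into a 2D list of class ids (0 = background)."""
    height, width = doc["imageHeight"], doc["imageWidth"]
    mask = [[0] * width for _ in range(height)]
    for shape in doc["shapes"]:
        if shape.get("shape_type", "polygon") != "polygon":
            continue  # this sketch handles polygons only
        class_id = class_ids[shape["label"]]
        points = shape["points"]
        xs = [p[0] for p in points]
        ys = [p[1] for p in points]
        # Restrict the scan to the polygon's bounding box, clipped to the image.
        x0, x1 = max(0, math.floor(min(xs))), min(width, math.ceil(max(xs)))
        y0, y1 = max(0, math.floor(min(ys))), min(height, math.ceil(max(ys)))
        for y in range(y0, y1):
            for x in range(x0, x1):
                if point_in_polygon(x + 0.5, y + 0.5, points):  # test pixel centre
                    mask[y][x] = class_id
    return mask


# Hypothetical label names and ids; a tiny 4x4 image keeps the example readable.
class_ids = {"johnson_grass": 1, "field_bindweed": 2, "purslane": 3}
doc = {
    "imageHeight": 4,
    "imageWidth": 4,
    "shapes": [
        {"label": "johnson_grass", "shape_type": "polygon",
         "points": [[1, 1], [3, 1], [3, 3], [1, 3]]},
    ],
}

for row in labelme_to_mask(doc, class_ids):
    print(row)
```

Pure-Python rasterization is slow on full 1280x720 frames; for real training pipelines an image library's polygon fill would be the practical choice. Later shapes overwrite earlier ones where polygons overlap.
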