Getting Started¶

Prerequisites¶

Required tools

Python 3.12+
uv (package manager)
Docker (for model runtime containers)

Installation¶

Local development Kubernetes dev stack As a library

Clone and set up the project

git clone https://github.com/hotosm/fAIr-models.git
cd fAIr-models
just setup

Clone and set up with k8s extras

git clone https://github.com/hotosm/fAIr-models.git
cd fAIr-models
just k8s
just setup

just k8s switches to k8s mode (sticky, persists across sessions). just setup then installs k8s extras and checks that kind, kubectl, helm, helmfile, and mc are on $PATH. Use just local to switch back. See Kubernetes Dev Stack.

Add to your project

uv add fair-py-ops

Running the Example Pipelines¶

Three example pipelines demonstrate the full workflow for each supported task type: register a base model, finetune on sample data, promote the best version, and run inference.

Example	Task	Model
`examples/segmentation/`	Semantic segmentation	UNet (torchgeo)
`examples/classification/`	Binary classification	ResNet18 (torchvision)
`examples/detection/`	Object detection	YOLOv11n (ultralytics)

Data Preparation¶

Classification and detection labels are derived from the segmentation labels. Run the conversion scripts before the first pipeline run:

Generate derived labels

python scripts/convert_segmentation_to_classification.py
python scripts/convert_segmentation_to_detection.py

Running All Pipelines¶

Run all three pipelines

just example  # converts labels, then runs segmentation -> classification -> detection

Running a single example

python examples/segmentation/run.py all      # segmentation only
python examples/classification/run.py all     # classification only
python examples/detection/run.py all          # detection only

Individual steps (segmentation)

python examples/segmentation/run.py init       # Initialize ZenML + STAC catalog
python examples/segmentation/run.py register   # Register base model + dataset
python examples/segmentation/run.py finetune   # Train (1 epoch on sample data)
python examples/segmentation/run.py promote    # Promote to production + publish STAC
python examples/segmentation/run.py predict    # Run inference

Verifying Results¶

After the pipeline completes

Artifact	Location
STAC catalog	`stac_catalog/` (3 collections)
Trained weights	`artifacts/`
Predictions	`data/sample/predict/predictions/`
ZenML dashboard	http://localhost:8080

Project Structure¶

fair/                  # Core library (pip-installable as fair-py-ops)
  stac/                # STAC catalog management, builders, validators
  utils/               # Data helpers, model validation
  zenml/               # ZenML config generation, promotion, steps
models/                # Base model contributions (one subdir per model)
examples/              # CLI runners for local development
infra/dev/             # Kind cluster Helm values for K8s dev stack
tests/                 # pytest suite

Development Commands¶

All targets adapt to the active mode (local by default). Switch with just k8s or just local.

Available recipes

just local             # switch to local mode (default)
just k8s               # switch to k8s mode (sticky)
just setup             # install deps (k8s mode adds extras + tool checks)
just lint              # ruff check + format + ty check
just test              # pytest
just validate          # validate STAC items + model pipelines
just example           # run example pipeline (k8s mode delegates to infra/dev)
just docs              # serve documentation locally
just clean             # remove ZenML state + artifacts (k8s mode also tears down cluster)

Next Steps¶

Tip

Read the Architecture overview to understand the system
Contribute a model to fAIr
Set up the Kubernetes dev stack for production-like testing
Learn Markdown authoring with Zensical for writing documentation