Tools for Every Lifecycle Stage
A curated, production-grade stack. Pick one tool per category, integrate them through standard interfaces, and evolve the stack as scale demands.
Data & Features
4 toolsApache Airflow
DAG-based workflow orchestration for batch pipelines.
dbt
SQL-first transformations with tests and lineage.
DVC
Git-style version control for datasets and models.
Feast
Open-source feature store for online + offline parity.
Training & Experimentation
4 toolsMLflow
Experiment tracking, model registry, and packaging.
Weights & Biases
Hosted experiment dashboards and sweeps.
Ray Train
Distributed training across heterogeneous clusters.
Optuna
Hyperparameter optimization with pruning.
Packaging & Registry
4 toolsDocker
Container runtime for reproducible model environments.
BentoML
Standardized model packaging with serving APIs.
MLflow Registry
Stage-gated model promotion (Staging → Production).
Cosign
Sign and verify container images for supply chain safety.
Deployment & Serving
4 toolsKServe
Kubernetes-native model serving with autoscale-to-zero.
Seldon Core
Advanced inference graphs, A/B, and shadow traffic.
NVIDIA Triton
High-throughput inference for GPU + CPU backends.
Istio
Service mesh for canary, mTLS, and traffic shaping.
Orchestration & CI/CD
4 toolsKubeflow Pipelines
End-to-end ML pipelines on Kubernetes.
Argo Workflows
Container-native workflow engine.
Flyte
Strongly-typed, reproducible workflow orchestration.
GitHub Actions
CI/CD for model code, configs, and infrastructure.
Monitoring & Observability
4 toolsPrometheus
Time-series metrics for serving infrastructure.
Grafana
Dashboards for latency, throughput, and drift.
Evidently AI
Open-source data and model drift reports.
WhyLabs
Hosted observability for ML data and predictions.