Operations

Any model.
Your choice.

Use cloud AI services for production, local models for privacy, or any combination. Switch providers without changing a line of code. Automatic routing picks the right model for each job.

Request Access

COMPLEXITY-BASED ROUTING

Job Queue
SIMPLE linting fix
MEDIUM api endpoint
COMPLEX refactor DAG
SIMPLE doc update
MEDIUM test suite
Router

COMPLEXITY
ROUTER

score ≤ 0.3 → Haiku

score ≤ 0.7 → Sonnet

score > 0.7 → Opus

Model Lanes

HAIKU

fast · low cost · high throughput

< $0.01 / job

SONNET

balanced · best coding · medium cost

~$0.10 / job

OPUS

powerful · deepest reasoning · max cost

~$0.49 / job

5 PROVIDERS ANTHROPIC AWS BEDROCK OPENAI COMPAT OLLAMA CLAUDE CLI
Jobs classified — each incoming job receives a complexity score (simple / medium / complex)
Router directs traffic — Haiku for fast low-cost tasks, Sonnet for code, Opus for deep reasoning
5 providers — Anthropic SDK, AWS Bedrock, OpenAI-Compatible (Groq, HuggingFace…), Ollama, Claude CLI
Production

Cloud Providers

Production-grade AI services with enterprise SLAs. EU data residency for GDPR compliance. Scales with your usage.

Offline

Local Models

Run AI models on your own hardware. Zero internet dependency. Perfect for air-gapped environments and maximum privacy.

Dev

Development

Fast iteration during development. Mounted credentials, zero configuration. Not recommended for production workloads.

Intelligent model routing

Not every job needs the most expensive model. The platform automatically classifies job complexity and routes to the right model — fast models for simple work, powerful models for complex reasoning. The result: significant cost savings without sacrificing quality where it matters.

Zero vendor lock-in

Switch model providers with a single configuration change. No code modifications required.

See model routing in action

Request access to see how automatic routing optimizes your AI spend.

Request Early Access