Test before
you commit.
A safe sandbox for experimenting with agents, comparing models side-by-side, validating guardrails, and previewing workflows — without touching production.
Agent Chat
Interactive conversations with any agent. Select your model, choose an agent type, and test how it responds to real prompts. Full session history with auto-save.
Prompt Lab
Compare multiple models side-by-side. See word-level diffs between responses. Compare token usage and cost. Find the best model for each use case.
Guardrail Tester
Validate your guardrail rules against real file changes before deploying. See exactly which rules would trigger and why.
Workflow Dry Run
Preview what an agent would plan for a given task — without executing. Review the strategy before committing resources.
Sessions that persist
Every conversation, comparison, and test is automatically saved. Pick up where you left off, review past experiments, or share sessions with your team. Auto-named from your first message for easy discovery.
Try the sandbox
Request access to experiment with agents in a safe environment.
Request Early Access