Simple Pricing

One API.
Every Model.
One Bill.

Intelligent routing picks the most efficient model for every request — fast inference for simple tasks, high-reasoning models for complex work. Start free, scale as you grow.

Get Free API Key →
How It Works

Pay-as-you-go

Add Credits

$10 minimum

Add credits as you need them. No subscription - pay only for what you use.

$10 minimum top-up
Add credits anytime, $10–$10,000
Credits never expire
Intelligent routing across 52+ models
OpenAI, Anthropic, Google, Meta, DeepSeek & more
Community support
Up to 5 API keys

Grizzly Pro

$99/month

Built for agents who need more power, caching, and efficiency.

10 million tokens included
Semantic caching
Guardrails & safety filtering
Advanced analytics dashboard
Cost optimizer recommendations
Virtual keys with budgets
Up to 100 API keys
Email support

Unlimited

Business Agent

Custom

For mid-market & enterprises needing custom integrations and SLAs.

Unlimited tokens (fair-use SLA)
Everything in Grizzly Pro, plus:
Custom industry APIs
Managed Hermes/OpenClaw agents
Long-term memory & self-improvement
Tool calling & orchestration
SOC2 compliance available
SLA guarantees
Dedicated account manager

Business Agent — What You Get

A managed AI team, scoped and priced.

We deploy and run production AI agents in your environment. Fixed scope, fixed phases, a real number to react to.

Custom Managed Agents

Setup fee

$5K+

Covers Discovery + Build + Integrate. Fixed bid after discovery call.

Monthly retainer

$2K+ monthly

Includes hosting, monitoring, model costs up to 50M tokens/mo, and a dedicated account manager.

What you get, in 4 phases

Discovery

Week 1–2

We map your workflows, identify high-ROI LLM use cases, and write a build-vs-buy memo. You own the output regardless of whether you proceed.

Build

Week 3–6

We deploy a managed Hermes/OpenClaw agent in your environment. Custom tools, guardrails, and routing rules configured for your exact prompt mix.

Integrate

Week 7–8

Wire the agent into your existing systems (Slack, Linear, Notion, internal APIs, your CRM). SSO, audit logs, and SOC2 controls enabled by default.

Manage

Ongoing

We monitor, retrain, and improve. Quarterly reviews. Slack Connect. Dedicated account manager. Sub-1hr response on Sev-1.

Example engagements

Three verticals, each with one Hermes and one OpenClaw use case. Real projects we've scoped.

Finance

• Hermes: 10-K / 10-Q deep-dive research that pulls citations from 500+ pages and writes the analyst memo (cuts review time from days to ~30 min)
• OpenClaw: Real-time earnings-call summarization pushed to Slack the moment a transcript drops (covers ~40 SEC filings/day, zero humans in the loop)

Coding

• Hermes: Multi-file code refactors with reasoning traces: Hermes plans the change across 20+ files, then ships a PR description that links each edit to the design doc
• OpenClaw: Automated PR review bot in Discord: every push gets a routing-aware code review in seconds, with cheap models for lint/typo passes and Claude/GPT-4o reserved for the hard parts

Creative

• Hermes: Long-form content research + drafting (blog posts, marketing briefs, product launches) where the agent chains 10+ web searches into a single grounded draft
• OpenClaw: Social copy + image-prompt generation as a Discord slash-command: give the agent a brand voice once, get 30 on-brand post variants per call, all routed through the cheapest model that meets the quality bar

Don't see your vertical? Most engagements we run are net-new — discovery is the place to find out.

Token Pricing

We add a small markup over provider costs. See live pricing across all models on our Models page.

Input Tokens

$0.04

per 1M tokens

vs OpenAI $2.50/1M

Output Tokens

$0.12

per 1M tokens

vs OpenAI $10/1M

Intelligent Routing

Every call

to the most efficient model

simple tasks → Llama 3B · reasoning → Claude
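
To make the comparison above concrete, here is a rough cost sketch in Python. It uses only the four rates listed on this page; the workload (8M input and 2M output tokens) is a made-up example, and the $0.04 / $0.12 figures are the lowest listed rates, so a real routed bill will depend on which models your requests end up on.

```python
# Rates in USD per 1M tokens, copied from the pricing cards on this page.
ROUTED_RATES = {"input": 0.04, "output": 0.12}
OPENAI_RATES = {"input": 2.50, "output": 10.00}


def estimate_cost(input_tokens: int, output_tokens: int, rates: dict) -> float:
    """Return the USD cost of a workload at the given per-1M-token rates."""
    total = input_tokens * rates["input"] + output_tokens * rates["output"]
    return total / 1_000_000


# Hypothetical monthly workload (an assumption, not a figure from this page).
INPUT_TOKENS = 8_000_000
OUTPUT_TOKENS = 2_000_000

routed = estimate_cost(INPUT_TOKENS, OUTPUT_TOKENS, ROUTED_RATES)
direct = estimate_cost(INPUT_TOKENS, OUTPUT_TOKENS, OPENAI_RATES)

print(f"Lowest listed routed rate: ${routed:.2f}")
print(f"OpenAI comparison rate:    ${direct:.2f}")
```

Treat the routed number as a floor rather than a quote: requests sent to high-reasoning models are billed at those models' rates.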

Feature Comparison

Compare every tier side-by-side. All plans include intelligent routing and our unified API.

Feature	Add Credits	Grizzly Pro	Business Agent
Monthly tokens	Pay-as-you-go	10M	Unlimited
API keys	5	100	Unlimited
Intelligent routing	✓	✓	✓
Semantic caching	—	✓	✓
Guardrails & safety	—	✓	✓
Analytics dashboard	—	Advanced	Custom
Virtual keys & budgets	—	✓	✓
Custom guardrails	—	—	✓
Budget alerts	—	—	✓
Custom industry APIs	—	—	✓
Managed agents	—	—	✓
SLA & SOC2	—	—	✓
Support	Community	Email	Dedicated

Frequently Asked Questions

How does intelligent routing work?

Our routing layer analyzes each request and picks the most efficient model for the job. Simple tasks like summarization go to fast, low-cost models (starting at $0.04 per 1M input tokens), while complex reasoning goes to high-reasoning models like Claude or Grizzly 1.0. The goal is the best quality-to-cost ratio for every request.
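
As a rough illustration of the idea, the sketch below routes a prompt to a cheap or a high-reasoning model using two toy signals: keywords and prompt length. Everything in it is an assumption made for illustration (the `pick_model` function, the hint list, the length threshold, and the model labels); it is not the actual routing layer, whose criteria are not described on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    model: str   # illustrative label, not a real model identifier
    reason: str  # why this route was chosen


# Hypothetical signals that a request needs multi-step reasoning.
REASONING_HINTS = ("prove", "refactor", "step by step", "analyze", "plan")


def pick_model(prompt: str, max_simple_chars: int = 2000) -> Route:
    """Toy router: cheap model by default, reasoning model when hinted."""
    text = prompt.lower()
    if any(hint in text for hint in REASONING_HINTS):
        return Route("claude", "reasoning keywords present")
    if len(prompt) > max_simple_chars:
        return Route("claude", "long prompt")
    return Route("llama-3b", "short, simple request")


print(pick_model("Summarize this paragraph in one sentence."))
print(pick_model("Refactor this module and plan the migration."))
```

A production router would likely weigh more than keywords and length, for example measured quality per model or per-request budgets; the point here is only the shape of the decision.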

What happens to my credits?

Credits never expire. Grizzly Pro includes a monthly allocation of 10 million tokens. Business Agent plans include unlimited tokens under a fair-use SLA.

Can I switch plans?

Yes, you can upgrade or downgrade anytime. Upgrades take effect immediately. Downgrades take effect at the start of your next billing cycle.
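
The timing rule above can be sketched as a small function. The plan identifiers and the `effective_date` helper are hypothetical names chosen for this example; only the rule itself (upgrades apply immediately, downgrades apply at the next billing cycle) comes from this page.

```python
from datetime import date

# Tiers ordered from lowest to highest; the keys are illustrative identifiers.
PLAN_RANK = {"add_credits": 0, "grizzly_pro": 1, "business_agent": 2}


def effective_date(current: str, target: str, today: date, next_cycle_start: date) -> date:
    """Return the date on which a plan change takes effect."""
    if PLAN_RANK[target] > PLAN_RANK[current]:
        return today  # upgrade: immediate
    if PLAN_RANK[target] < PLAN_RANK[current]:
        return next_cycle_start  # downgrade: start of next billing cycle
    raise ValueError("target plan is the same as the current plan")


today = date(2025, 1, 10)
next_cycle = date(2025, 2, 1)
print(effective_date("add_credits", "grizzly_pro", today, next_cycle))
print(effective_date("grizzly_pro", "add_credits", today, next_cycle))
```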

What models are available?

You get access to 52+ models across OpenAI, Anthropic, Google, Meta, DeepSeek, Mistral, Groq, Fireworks, Together.ai, and more. The full pricing list is on our Models page.

Route every LLM call through one API

Get a free API key. No credit card required. Start routing every LLM call automatically.

Get Free API Key →
Read the Docs
