SKILL PACK · FINOPS

Watch the bill. Cap the burn.

Cloud cost + LLM token spend in one console. Dynamic model routing keeps every agent call on the cheapest provider that meets the policy. Spike detection halts runaway agents before the invoice arrives.

Install FinOps Pack Back to Forge

WHAT'S INSIDE

Three sources of truth, one burn rate.

AWS, model providers, and your routing layer plug in as first-class MCP servers. Forge correlates every dollar back to the agent / pack / user that spent it.

AWS Cost Explorer

Daily run-rate + per-service drill-down

Provider billing

Anthropic / OpenAI / Vercel raw spend

OpenRouter

Live model price routing + failover

APPROVAL TEMPLATES

Spend changes pass through you.

Four templates for the budget-shaping actions agents actually request. Each one writes a Guardian audit row with the dollar delta.

Raise monthly budget capPRO+$0.05

Swap default modelPRO$0.02

Change rate-limit policyPRO+$0.02

Provider failoverAGENCY$0.05

ANOMALY DETECTORS

Two ledgers watching burn velocity.

Streaming anomaly detection on cloud + token spend with 14-day rolling baselines. A 3σ jump triggers a kill-switch offer before the daily total reaches yesterday's.

Cloud burn

AWS daily $ vs week-ago
GCP / Vercel cross-check
Per-service spike detector

Token burn

Anthropic / OpenAI $ / hour
Per-agent cost attribution
Per-skill-pack chargeback

BUDGET CONTEXT

“Month-to-date spend trending 38% over plan. Auto-route Sonnet workloads to Haiku for 72h?”

FinOps Pack reads Kairon's billing-anomaly index alongside your routing config. Most agents are happy on a cheaper model; the pack proposes the swap and you swipe yes or no.

GOVERNANCE PROOF

Cost watched. Risk scored. Per pack.

Every paid tier ships the live governance console — the same one running over this page right now.

GOVERNANCE SCORE

60

Guarded

Cost Visibility
Per-provider spend + live burn rate aggregated into the Cost Radar.
Agent Audit
Every agent action stamped with risk level and a 5-year retention log.
Guardrails
Risk Console + emergency kill-switch wired to a kairon-guardian binary.
Dynamic Routing
Workloads are routed across providers by cost / quality / latency / context.
Identity Provenance
MCP token issued and at least one client (Claude Desktop / AEGIS / Cursor) connected.

NEXT STEP

Install kairon-guardian and bind your kill-switch (KaironOS Pro+).

COST RADAR · LIVE

Aggregate cloud-LLM spend across the monitored workload.

WASTED THIS SESSION

$0.0000

OpenAI · GPT-4o$62,000/mo

Anthropic · Claude Sonnet$47,000/mo

Google · Gemini Pro$15,500/mo

Groq · Llama 3 70B$5,400/mo

UNLOCK

Pro+ is the baseline. Pick a tier.

Pro$19200 / moSingle dev, 1 device

Pro+RECOMMENDED$29600 / moAnomaly detectors + 5 devices

Agency$993 000 pooledTeam + shared memory

EnterprisecustomuncappedGuardian audit + SLA

Ready to swipe your next action?

Install FinOps Pack

Watch the bill. Cap the burn.

Three sources of truth, one burn rate.

AWS, model providers, and your routing layer plug in as first-class MCP servers. Forge correlates every dollar back to the agent / pack / user that spent it.

AWS Cost Explorer

Daily run-rate + per-service drill-down

Provider billing

Anthropic / OpenAI / Vercel raw spend

OpenRouter

Live model price routing + failover

Two ledgers watching burn velocity.

Streaming anomaly detection on cloud + token spend with 14-day rolling baselines. A 3σ jump triggers a kill-switch offer before the daily total reaches yesterday's.

Cloud burn

AWS daily $ vs week-ago
GCP / Vercel cross-check
Per-service spike detector

Token burn

Anthropic / OpenAI $ / hour
Per-agent cost attribution
Per-skill-pack chargeback

Cost watched. Risk scored. Per pack.

Every paid tier ships the live governance console — the same one running over this page right now.

GOVERNANCE SCORE

60

Guarded

Cost Visibility
Per-provider spend + live burn rate aggregated into the Cost Radar.
Agent Audit
Every agent action stamped with risk level and a 5-year retention log.
Guardrails
Risk Console + emergency kill-switch wired to a kairon-guardian binary.
Dynamic Routing
Workloads are routed across providers by cost / quality / latency / context.
Identity Provenance
MCP token issued and at least one client (Claude Desktop / AEGIS / Cursor) connected.

NEXT STEP

Install kairon-guardian and bind your kill-switch (KaironOS Pro+).

COST RADAR · LIVE

Aggregate cloud-LLM spend across the monitored workload.

WASTED THIS SESSION

$0.0000

OpenAI · GPT-4o$62,000/mo

Anthropic · Claude Sonnet$47,000/mo

Google · Gemini Pro$15,500/mo

Groq · Llama 3 70B$5,400/mo