✓ Verified 💻 Development ✓ Enhanced Data

Agent Cost Monitor

Real-time token usage and cost tracking across all your OpenClaw agents — alerts, budgets, and optim

Rating
3.9 (417 reviews)
Downloads
2,945 downloads
Version
1.0.0

Overview

Real-time token usage and cost tracking across all your OpenClaw agents — alerts, budgets, and optimization tips.

Complete Documentation

View Source →

Agent Cost Monitor — Know What Your Agents Cost

Track token usage, costs, and efficiency across all your OpenClaw agents in real-time. Get alerts before you blow your budget.

The Problem

Running multiple agents is powerful — but expensive if you're not watching:

  • Which agent is burning the most tokens?
  • Are heartbeats wasting money on expensive models?
  • Is caching actually saving you anything?
  • When will you hit your weekly rate limit?

What This Skill Does

When triggered (via cron or manually), the agent:

  • Checks session_status for each agent
  • Calculates per-agent and total costs
  • Compares against budget thresholds
  • Sends alerts if limits are approaching
  • Suggests optimization moves

Usage

Ask your monitoring agent (or any agent with this skill):

text
"Give me a cost report for all agents"
"Which agent used the most tokens today?"
"Am I going to hit my rate limit this week?"

Automated Daily Report (Cron)

json5
{
  "name": "Daily Cost Report",
  "schedule": { "kind": "cron", "expr": "0 20 * * *", "tz": "Europe/Berlin" },
  "payload": {
    "kind": "agentTurn",
    "message": "Run a cost report across all agents. Check session_status for each. Report: total tokens, cost per agent, top spender, budget warnings. Send summary to user."
  },
  "sessionTarget": "isolated",
  "delivery": { "mode": "announce" }
}

Cost Report Format

When generating a report, use this structure:

markdown
## 💰 Agent Cost Report — [Date]

### Per-Agent Breakdown
| Agent | Model | Tokens (24h) | Est. Cost | Status |
|-------|-------|-------------|-----------|--------|
| Central | Opus 4.6 | 125K | $1.87 | ⚠️ High |
| Techops | Opus 4.6 | 89K | $1.33 | ✅ Normal |
| Atlas | Sonnet 4.5 | 45K | $0.27 | ✅ Low |
| Closer | Haiku 4.5 | 23K | $0.02 | ✅ Minimal |
| Heartbeats | Ollama | 12K | $0.00 | ✅ Free |

### Summary
- **Total 24h:** 294K tokens (~$3.49)
- **Projected weekly:** ~$24.43
- **Budget:** $20/week → ⚠️ 122% projected

### Recommendations
1. Move Techops from Opus → Sonnet for routine tasks (-40% cost)
2. Increase heartbeat interval from 15m → 30m
3. Enable context pruning on Atlas (idle sessions burning cache)

Model Cost Reference

Use these rates for estimation (as of 2026):

Anthropic (Claude OAuth / API)

ModelInput/1MOutput/1MCache Read/1MCache Write/1M
Opus 4.6$5.00$25.00$0.50$6.25
Sonnet 4.5$3.00$15.00$0.30$3.75
Haiku 4.5$1.00$5.00$0.08$1.25

Free Options

ModelCostUse For
Ollama (local)$0Heartbeats, simple tasks
Gemini OAuth$0*Fallback (rate limited)
*Free tier with rate limits

Optimization Playbook

Quick Wins (Do These First)

  • Heartbeats on Ollama
json5
{ "heartbeat": { "model": "ollama/llama3.2:3b" } }
Saves: 100% of heartbeat costs (can be $5-10/week with Opus)
  • Haiku Cache Retention Off
json5
{ "anthropic/claude-haiku-4-5": { "params": { "cacheRetention": "none" } } }
Saves: Cache write costs on cheap model (not worth caching)
  • Context Pruning
json5
{ "contextPruning": { "mode": "cache-ttl", "ttl": "5m" } }
Saves: Stale context re-reads on every turn
  • Opus/Sonnet Cache Retention Long
json5
{ "anthropic/claude-opus-4-6": { "params": { "cacheRetention": "long" } } }
Saves: Re-sending system prompt every turn (biggest single saving)

Model Tiering (Biggest Impact)

Task TypeUse ThisNot ThisSaving
Coordination, complex reasoningOpusJustified
Finance, data analysisSonnetOpus-40%
Sales drafts, marketing copyHaikuSonnet-67%
Heartbeats, health checksOllamaAny paid-100%
Tweet draftsHaiku or GrokOpus-80%

Session Management

  • Daily reset: Sessions auto-clear at a set hour (reduces token accumulation)
json5
{ "session": { "reset": { "mode": "daily", "atHour": 4, "idleMinutes": 45 } } }
  • Memory flush: Save important context before compaction
json5
{ "compaction": { "memoryFlush": { "enabled": true } } }

Alert Thresholds

Configure in your monitoring agent's memory:

markdown
## Budget Alerts
- Daily budget: $5.00 (warn at 80% = $4.00)
- Weekly budget: $20.00 (warn at 70% = $14.00)
- Per-agent daily max: $2.00
- Alert channel: Telegram DM

Integration with DevOps Agent

If you have a DevOps/monitoring agent (e.g. your DevOps agent), add to its AGENTS.md:

markdown
## Cost Monitoring
- Run daily cost report at 20:00
- Alert if any agent exceeds $2/day
- Weekly summary every Monday 09:00
- Track trends: is usage going up or down?

FAQ

Q: Does this skill make API calls? A: No. It uses OpenClaw's built-in session_status tool. No external APIs, no additional costs.

Q: How accurate are cost estimates? A: Based on published model pricing. Actual costs may vary with caching hits. Estimates are conservative (slightly high).

Q: Can I track costs per conversation? A: Not directly. Costs are tracked per session. Use sessions_list to see per-session token counts.

Q: Works with non-Anthropic models? A: Yes. Token counts work for all providers. Cost estimation requires known pricing (add custom rates in the cost reference section).

Changelog

v1.1.0

  • Generalized all agent names in examples
  • No specific setup references

v1.0.0

  • Initial release

Installation

Terminal bash

openclaw install agent-cost-monitor
    
Copied!

💻Code Examples

}

.txt
## Cost Report Format

When generating a report, use this structure:

3. Enable context pruning on Atlas (idle sessions burning cache)

3-enable-context-pruning-on-atlas-idle-sessions-burning-cache.txt
## Model Cost Reference

Use these rates for estimation (as of 2026):

### Anthropic (Claude OAuth / API)
| Model | Input/1M | Output/1M | Cache Read/1M | Cache Write/1M |
|-------|----------|-----------|---------------|----------------|
| Opus 4.6 | $5.00 | $25.00 | $0.50 | $6.25 |
| Sonnet 4.5 | $3.00 | $15.00 | $0.30 | $3.75 |
| Haiku 4.5 | $1.00 | $5.00 | $0.08 | $1.25 |

### Free Options
| Model | Cost | Use For |
|-------|------|---------|
| Ollama (local) | $0 | Heartbeats, simple tasks |
| Gemini OAuth | $0* | Fallback (rate limited) |

*Free tier with rate limits

## Optimization Playbook

### Quick Wins (Do These First)

1. **Heartbeats on Ollama**

{ "heartbeat": { "model": "ollama/llama3.2:3b" } }

-heartbeat--model-ollamallama323b--.txt
Saves: 100% of heartbeat costs (can be $5-10/week with Opus)

2. **Haiku Cache Retention Off**

{ "anthropic/claude-haiku-4-5": { "params": { "cacheRetention": "none" } } }

-anthropicclaude-haiku-4-5--params--cacheretention-none---.txt
Saves: Cache write costs on cheap model (not worth caching)

3. **Context Pruning**

{ "contextPruning": { "mode": "cache-ttl", "ttl": "5m" } }

-contextpruning--mode-cache-ttl-ttl-5m--.txt
Saves: Stale context re-reads on every turn

4. **Opus/Sonnet Cache Retention Long**

{ "anthropic/claude-opus-4-6": { "params": { "cacheRetention": "long" } } }

-anthropicclaude-opus-4-6--params--cacheretention-long---.txt
Saves: Re-sending system prompt every turn (biggest single saving)

### Model Tiering (Biggest Impact)

| Task Type | Use This | Not This | Saving |
|-----------|----------|----------|--------|
| Coordination, complex reasoning | Opus | — | Justified |
| Finance, data analysis | Sonnet | Opus | -40% |
| Sales drafts, marketing copy | Haiku | Sonnet | -67% |
| Heartbeats, health checks | Ollama | Any paid | -100% |
| Tweet drafts | Haiku or Grok | Opus | -80% |

### Session Management

- **Daily reset**: Sessions auto-clear at a set hour (reduces token accumulation)

{ "compaction": { "memoryFlush": { "enabled": true } } }

-compaction--memoryflush--enabled-true---.txt
## Alert Thresholds

Configure in your monitoring agent's memory:

- Alert channel: Telegram DM

--alert-channel-telegram-dm.txt
## Integration with DevOps Agent

If you have a DevOps/monitoring agent (e.g. your DevOps agent), add to its AGENTS.md:
example.txt
"Give me a cost report for all agents"
"Which agent used the most tokens today?"
"Am I going to hit my rate limit this week?"
example.txt
{
  "name": "Daily Cost Report",
  "schedule": { "kind": "cron", "expr": "0 20 * * *", "tz": "Europe/Berlin" },
  "payload": {
    "kind": "agentTurn",
    "message": "Run a cost report across all agents. Check session_status for each. Report: total tokens, cost per agent, top spender, budget warnings. Send summary to user."
  },
  "sessionTarget": "isolated",
  "delivery": { "mode": "announce" }
}

Tags

#coding_agents-and-ides

Quick Info

Category Development
Model Claude 3.5
Complexity Multi-Agent
Author neal-collab
Last Updated 3/10/2026
🚀
Optimized for
Claude 3.5
🧠

Ready to Install?

Get started with this skill in seconds

openclaw install agent-cost-monitor