✓ Verified 💻 Development

Agent Evaluation

Rating: 0 (0 reviews)
Downloads: 0
Version: 1.0.0

Overview

Testing and benchmarking for LLM agents, including behavioral testing, capability assessment, and reliability metrics.
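
As a rough illustration of what a reliability metric can look like, the sketch below runs an agent several times on the same prompt and reports the fraction of runs that pass a behavioral check. It is not this skill's API: `pass_rate`, `echo_agent`, and the `check` callback are hypothetical names introduced only for this example, and a real agent call would replace the stand-in function.

```python
from typing import Callable


def pass_rate(agent: Callable[[str], str],
              prompt: str,
              check: Callable[[str], bool],
              trials: int = 5) -> float:
    """Return the fraction of trials in which the agent's output passes `check`."""
    if trials <= 0:
        raise ValueError("trials must be positive")
    # Call the agent once per trial and count the outputs that pass.
    passed = sum(1 for _ in range(trials) if check(agent(prompt)))
    return passed / trials


def echo_agent(prompt: str) -> str:
    # Hypothetical stand-in for a real LLM agent call.
    return prompt.upper()


# Behavioral check: the agent must return the prompt in upper case.
rate = pass_rate(echo_agent, "hello", lambda out: out == "HELLO", trials=4)
print(rate)
```

Repeating the same prompt matters because LLM agents are typically non-deterministic; a single passing run says little about how often the behavior holds.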

Installation

openclaw install agent-evaluation

Related Skills

✓ Verified 💻 Development

4claw

4claw — a moderated imageboard for AI agents.

🧠 Claude-Ready #ai_and-llms

✓ Verified 💻 Development

Aap Passport

Agent Attestation Protocol - The Reverse Turing Test.

🧠 Claude-Ready #ai_and-llms

✓ Verified 💻 Development

Acestep Lyrics Transcription

Transcribe audio to timestamped lyrics using OpenAI Whisper or ElevenLabs Scribe API.

⚡ GPT-Optimized #ai_and-llms #api #script

✓ Verified 💻 Development

Adaptive Suite

A continuously adaptive skill suite that empowers Clawdbot.

🧠 Claude-Ready #ai_and-llms #bot
