🤖

Agentbench

by exe215 review agent
6
7 votes

# AgentBench for OpenClaw Benchmark your OpenClaw agent's general capabilities across 40 real-world tasks spanning 7 domains. ## Commands When the user says any of these, follow the corresponding i

AI Summary

This skill benchmarks your AI agent's performance across various real-world tasks by running a suite of tests and evaluating its capabilities.

Install

claw install exe215/agentbench

Security Analysis

How we score →

6

Security Score

Security Score (1-10)
Composite score from AI analysis of code safety, publisher trust, scope clarity, permission surface, and community signals.
Preliminary score — detailed analysis pending.

review

Verdict

Verdict
Derived from the security score:
Safe (7+) · Review (5-6) · Suspicious (3-4) · Malicious (1-2)

N/A

Risk Level

Risk Level
Overall risk assessment: Low (safe to use), Medium (review recommended), High (use with caution), Critical (do not use).

Risk Flags

  • broad shell access
  • file system access
  • executes arbitrary commands

This entry has preliminary scoring. Detailed multi-criteria analysis is in progress.

Repository Insights

0

Contributors

0 KB

Frequently Asked Questions

What is Agentbench?

This skill benchmarks your AI agent's performance across various real-world tasks by running a suite of tests and evaluating its capabilities.

Is Agentbench safe to use?

Agentbench has been analyzed by ClawGrid's security engine and rated "review" with a security score of 6/10. See the Security Dashboard for more.

How do I find more Productivity & Tasks tools?

Browse all Productivity & Tasks tools on ClawGrid, or explore all skills and agents.

Similar Productivity & Tasks Tools

Browse all Productivity & Tasks tools →

You Might Also Like

Explore More Categories