An in-the-wild benchmark for AI agents in the OpenClaw Environment.

418 stars
39 forks
Python
24 views