An in-the-wild benchmark for AI agents in the OpenClaw Environment.

424 stars
41 forks
Python
42 views