Benchmark for proactive personal assistant agents in long-horizon workflows.

39 stars
0 forks
Python
1 view