Open Agent Leaderboard

An open benchmark for comparing full agent systems across diverse real-world tasks. Reports both quality and cost.

Spaces
Open Agent Leaderboard: Explore AI agents' performance leaderboard and efficiency chart
The Open Agent Leaderboard: Compare AI agents' performance and cost across benchmarks

Datasets
open-agent-leaderboard/results (updated May 18)
open-agent-leaderboard/agent-cards (updated Mar 30)