Ready nowResult archiveReview neededRun guide readyPublic data
Berkeley Function Calling Leaderboard
Strong public benchmark for function calling, multi-turn, live, and agentic tool categories.
- Category
- Tool use
- Owner
- UC Berkeley Gorilla
- Data path
- Use the latest dated result archive after matching it to the public leaderboard. Prefer category rows first.