tau2-bench

About tau2-bench

$\tau$-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains 🚀 τ³-bench is here! From text-only to multimodal, knowledge-aware agent evaluation. Voice full-duplex · Knowledge ret

ai benchmark conversational-agents language-model-agent llm

Quick Facts

Stars	1,489
Forks	377
Language	Python
Category	AI Tool
License	MIT
Quality Score	73.8759201914111/100
Open Issues	139
Last Updated	2026-07-02
Created	2025-06-09
Platforms	python
Est. Tokens	~15k

Compatible Skills

These tools work well together with tau2-bench for enhanced workflows:

mcpmark — semantic(0.42)+complementary+rare_topics+same_lang+similar_pop+shared_platform (64%)
MCPBench — semantic(0.27)+complementary+rare_topics+same_lang+similar_pop+shared_platform (59%)
MLLM-Tool — semantic(0.35)+complementary+same_lang+similar_pop+shared_platform (57%)
Toucan — semantic(0.23)+complementary+same_lang+similar_pop+shared_platform (53%)
mcp-bench — semantic(0.20)+complementary+same_lang+similar_pop+shared_platform (52%)

More AI Tool Tools

Explore other popular ai tool tools:

stagehand ⭐ 23.4k
promptfoo ⭐ 23.0k
gorilla ⭐ 12.8k
PocketFlow-Tutorial-Codebase-Knowledge ⭐ 12.4k
llm ⭐ 12.2k
llm-engineer-toolkit ⭐ 10.5k
phoenix ⭐ 10.5k
code2prompt ⭐ 7.4k
superagent ⭐ 6.5k
ai-cookbook ⭐ 4.1k

View all AI Tool tools →

Popular Python Agent Tools

hermes-agent ⭐ 211.2k · Codex Skill
AutoGPT ⭐ 185.4k · Agent Tool
skills ⭐ 157.3k · Claude Skill
langflow ⭐ 151.3k · Agent Tool
open-webui ⭐ 143.9k · MCP Server

Frequently Asked Questions

What is tau2-bench?

tau2-bench is τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains. It is categorized as a AI Tool with 1.5k GitHub stars.

What programming language is tau2-bench written in?

tau2-bench is primarily written in Python. It covers topics such as ai, benchmark, conversational-agents.

How do I install or use tau2-bench?

You can find installation instructions and usage details in the tau2-bench GitHub repository at github.com/sierra-research/tau2-bench. The project has 1.5k stars and 377 forks, indicating an active community.

What license does tau2-bench use?

tau2-bench is released under the MIT license, making it free to use and modify according to the license terms.

View on GitHub → Browse AI Tool tools