tau2-bench

by sierra-research · AI Tool · ★ 1.2k

About tau2-bench

τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains. 🚀 τ³-bench is here! From text-only to multimodal, knowledge-aware agent evaluation. Voice full-duplex · Knowledge ret

ai · benchmark · conversational-agents · language-model-agent · llm

Quick Facts

Stars: 1,213
Forks: 313
Language: Python
Category: AI Tool
License: MIT
Quality Score: 58.678/100
Open Issues: 118
Last Updated: 2026-05-21
Created: 2025-06-09
Platforms: python
Est. Tokens: ~22279k

Compatible Skills

These tools work well together with tau2-bench for enhanced workflows:

  • mcpmark (64% match): semantic similarity 0.42, complementary, rare topics, same language, similar popularity, shared platform
  • MCPBench (59% match): semantic similarity 0.27, complementary, rare topics, same language, similar popularity, shared platform
  • MLLM-Tool (57% match): semantic similarity 0.35, complementary, same language, similar popularity, shared platform
  • Toucan (53% match): semantic similarity 0.23, complementary, same language, similar popularity, shared platform
  • mcp-bench (52% match): semantic similarity 0.20, complementary, same language, similar popularity, shared platform

Frequently Asked Questions

What is tau2-bench?

tau2-bench is a benchmark for tool-agent-user interaction in real-world domains (τ-Bench). It is categorized as an AI Tool and has 1.2k GitHub stars.

What programming language is tau2-bench written in?

tau2-bench is primarily written in Python. Its topics include ai, benchmark, and conversational-agents.

How do I install or use tau2-bench?

You can find installation instructions and usage details in the tau2-bench GitHub repository at github.com/sierra-research/tau2-bench. The project has 1.2k stars and 313 forks, indicating an active community.

What license does tau2-bench use?

tau2-bench is released under the MIT license, making it free to use and modify according to the license terms.
