VisualAgentBench

by THUDM · Agent Tool · ★ 256

About VisualAgentBench

Towards Large Multimodal Models as Visual Foundation Agents

gptllm-agentmultimodal-large-language-models

Quick Facts

Stars256
Forks9
LanguagePython
CategoryAgent Tool
LicenseApache-2.0
Quality Score39.2/100
Open Issues16
Last Updated2025-04-24
Created2024-08-08
Platformspython
Est. Tokens~378k

Compatible Skills

These tools work well together with VisualAgentBench for enhanced workflows:

  • multimind-sdk — semantic(0.31)+complementary+rare_topics+same_lang+similar_pop+shared_platform (60%)
  • MLLM-Tool — semantic(0.34)+complementary+same_lang+similar_pop+shared_platform (57%)
  • OpenAdapt — semantic(0.34)+complementary+same_lang+similar_pop+shared_platform (57%)
  • SimplerLLM — semantic(0.30)+complementary+same_lang+similar_pop+shared_platform (56%)
  • multimodal-agents-course — semantic(0.27)+complementary+same_lang+similar_pop+shared_platform (54%)

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Popular Python Agent Tools

Frequently Asked Questions

What is VisualAgentBench?

VisualAgentBench is Towards Large Multimodal Models as Visual Foundation Agents. It is categorized as a Agent Tool with 256 GitHub stars.

What programming language is VisualAgentBench written in?

VisualAgentBench is primarily written in Python. It covers topics such as gpt, llm-agent, multimodal-large-language-models.

How do I install or use VisualAgentBench?

You can find installation instructions and usage details in the VisualAgentBench GitHub repository at github.com/THUDM/VisualAgentBench. The project has 256 stars and 9 forks, indicating an active community.

What license does VisualAgentBench use?

VisualAgentBench is released under the Apache-2.0 license, making it free to use and modify according to the license terms.

View on GitHub → Browse Agent Tool tools