II. Benchmark overview
Reference · livebenchmark:visualwebarena
VisualWebArena overview
Multimodal extension of WebArena adding visually grounded web tasks (image search, classifieds, shopping with screenshots) to test vision-language web agents.
Attributes
displayName
VisualWebArena
homepageUrl
kind
web-agent
targetsKind
AgentVersion
description
Multimodal extension of WebArena adding visually grounded web tasks
(image search, classifieds, shopping with screenshots) to test
vision-language web agents.
Outgoing edges
applies_to (1)
- domain:web-development · Domain · Web Development
covers (2)
- skill-area:browser-automation · SkillArea · Browser Automation
- skill-area:vision-extraction · SkillArea · Vision-Based Extraction
Incoming edges
None.