II.
TestSet overview
Reference · livetest-set:androidworld-programmatic-tasks
AndroidWorld programmatic task suite overview
Canonical AndroidWorld artifact for autonomous Android UI-control evaluation.
Attributes
displayName
AndroidWorld programmatic task suite
benchmarkId
caseCount
116
composition
AndroidWorld's reproducible mobile-agent task suite covers real
Android apps with programmatic rewards and dynamically-instantiated
natural-language tasks.
homepageUrl
description
Canonical AndroidWorld artifact for autonomous Android UI-control
evaluation.
Outgoing edges
belongs_to_benchmark1
- benchmark:android-world·BenchmarkAndroidWorld
Incoming edges
None.