iiRecord
Agentic AI Atlas · AndroidWorld
benchmark:android-worlda5c.ai
II.
Benchmark overview

benchmark:android-world

Reference · live

AndroidWorld overview

AndroidWorld (Google Research, 2024) is a dynamic Android environment benchmark of 116 tasks across 20 real apps, used to evaluate autonomous mobile-UI agents on natural-language goals with stochastic real-app state.

BenchmarkOutgoing · 1Incoming · 3

Attributes

displayName
AndroidWorld
homepageUrl
kind
full-stack
targetsKind
AgentVersion
description
AndroidWorld (Google Research, 2024) is a dynamic Android environment benchmark of 116 tasks across 20 real apps, used to evaluate autonomous mobile-UI agents on natural-language goals with stochastic real-app state.

Outgoing edges

covers1

Incoming edges

belongs_to_benchmark1