Agentic AI Atlasby a5c.ai
OverviewWikiGraphFor AgentsEdgesSearchWorkspace
/
GitHubDocsDiscord
ivEdge detail
Agentic AI Atlas · covers
61 pairsa5c.ai
Search edge kinds/
Atlas · edge detail

Current ledger and paging

IV.Current edge kindpp. 1 - 1
IV.
Edge detail

covers

Page 1 of 1

covers ledger

a Benchmark exercises a SkillArea, optionally weighted by how much of the benchmark targets it

Pairs · 61Cardinality · N:N
fromtoto kind
benchmark:os-worldskill-area:desktop-automationSkillArea
benchmark:android-worldskill-area:android-nativeSkillArea
benchmark:re-benchskill-area:autonomous-research-engineeringSkillArea
benchmark:re-benchskill-area:ml-fine-tuningSkillArea
benchmark:appworldskill-area:multi-app-orchestrationSkillArea
benchmark:appworldskill-area:multi-turn-tool-useSkillArea
benchmark:assistant-benchskill-area:deep-web-researchSkillArea
benchmark:assistant-benchskill-area:agentic-loopsSkillArea
benchmark:the-agent-companyskill-area:multi-app-orchestrationSkillArea
benchmark:the-agent-companyskill-area:bug-fixing-from-issuesSkillArea
benchmark:agentclinicskill-area:medical-agentSkillArea
benchmark:travelplannerskill-area:travel-itinerary-planningSkillArea
benchmark:browse-compskill-area:deep-web-researchSkillArea
benchmark:browse-compskill-area:browser-automationSkillArea
benchmark:mind2web-2skill-area:web-action-groundingSkillArea
benchmark:mind2web-2skill-area:browser-automationSkillArea
benchmark:workarenaskill-area:web-action-groundingSkillArea
benchmark:workarenaskill-area:browser-automationSkillArea
benchmark:webvoyagerskill-area:browser-automationSkillArea
benchmark:webvoyagerskill-area:web-action-groundingSkillArea
benchmark:visualwebarenaskill-area:browser-automationSkillArea
benchmark:visualwebarenaskill-area:vision-extractionSkillArea
benchmark:swe-lancerskill-area:autonomous-coding-engagementSkillArea
benchmark:swe-lancerskill-area:bug-fixing-from-issuesSkillArea
benchmark:aider-polyglotskill-area:python-implementationSkillArea
benchmark:aider-polyglotskill-area:bug-fixing-from-issuesSkillArea
benchmark:fin-benchskill-area:general-knowledge-reasoningSkillArea
benchmark:m-mmluskill-area:general-knowledge-reasoningSkillArea
benchmark:flores-200skill-area:general-knowledge-reasoningSkillArea
benchmark:xnliskill-area:general-knowledge-reasoningSkillArea
benchmark:olympiad-benchskill-area:mathematical-reasoningSkillArea
benchmark:promptbenchskill-area:prompt-engineeringSkillArea
benchmark:bias-benchskill-area:safety-redteamingSkillArea
benchmark:lmsys-arenaskill-area:general-knowledge-reasoningSkillArea
benchmark:gsm8kskill-area:mathematical-reasoningSkillArea
benchmark:gsm-symbolicskill-area:mathematical-reasoningSkillArea
benchmark:hleskill-area:closed-book-frontier-reasoningSkillArea
benchmark:hleskill-area:general-knowledge-reasoningSkillArea
benchmark:frontier-mathskill-area:mathematical-reasoningSkillArea
benchmark:frontier-mathskill-area:closed-book-frontier-reasoningSkillArea
benchmark:bbhskill-area:general-knowledge-reasoningSkillArea
benchmark:arc-agi-3skill-area:visual-pattern-inductionSkillArea
benchmark:arc-agi-3skill-area:agentic-loopsSkillArea
benchmark:mt-benchskill-area:general-knowledge-reasoningSkillArea
benchmark:legal-benchskill-area:closed-book-frontier-reasoningSkillArea
benchmark:medqaskill-area:medical-agentSkillArea
benchmark:harmbenchskill-area:safety-redteamingSkillArea
benchmark:jailbreakbenchskill-area:safety-redteamingSkillArea
benchmark:advbenchskill-area:safety-redteamingSkillArea
benchmark:toolbenchskill-area:tool-useSkillArea
benchmark:toolbenchskill-area:multi-turn-tool-useSkillArea
benchmark:berkeley-function-callingskill-area:tool-useSkillArea
benchmark:gaiaskill-area:agentic-loopsSkillArea
benchmark:human-evalskill-area:python-implementationSkillArea
benchmark:mmluskill-area:general-knowledge-reasoningSkillArea
benchmark:swe-bench-verifiedskill-area:bug-fixing-from-issuesSkillArea
benchmark:swe-benchskill-area:bug-fixing-from-issuesSkillArea
benchmark:tau-benchskill-area:multi-turn-tool-useSkillArea
benchmark:tau-benchskill-area:agentic-loopsSkillArea
benchmark:terminal-benchskill-area:cli-designSkillArea
benchmark:webarenaskill-area:browser-automationSkillArea

Definition

Source · Benchmark

Target · SkillArea

Cardinality · N:N

Navigate

Back to edge kinds
Open filtered graph