II.
Workflow overview
Reference · liveworkflow:llm-cost-optimization
LLM Cost Optimization overview
Reviews and optimises spend across LLM API providers — analysing per-model token consumption, identifying prompt-length bloat, evaluating cache-hit rates, testing cheaper model substitutions for low-criticality tasks, auditing retry and fallback policies that inflate costs, and projecting budget burn-rate against forecasts. Produces a cost breakdown dashboard and actionable savings plan. Excludes model fine-tuning work.
Attributes
displayName
LLM Cost Optimization
workflowKind
operational
triggerType
scheduled
typicalCadence
weekly
complexity
cross-team
description
Reviews and optimises spend across LLM API providers — analysing per-model
token consumption, identifying prompt-length bloat, evaluating cache-hit
rates, testing cheaper model substitutions for low-criticality tasks,
auditing retry and fallback policies that inflate costs, and projecting
budget burn-rate against forecasts. Produces a cost breakdown dashboard
and actionable savings plan. Excludes model fine-tuning work.
Outgoing edges
applies_to_domain2
- domain:ml-ops·DomainMLOps
- domain:platform-engineering·DomainPlatform Engineering
involves_role3
- role:ml-engineer·RoleMachine Learning Engineer
- role:cost-tracker·RoleCost Tracker (Bot)
- role:staff-engineer·RoleStaff Engineer
performed_by_org_unit2
- org-unit:ml-platform-team·OrgUnitML Platform Team
- org-unit:infra-engineering·OrgUnitInfrastructure Engineering
requires_skill_area2
- skill-area:prompt-engineering·SkillAreaPrompt Engineering
- skill-area:context-management·SkillAreaLLM Context Management
triggers_responsibility2
- responsibility:cost-optimization·Responsibility
- responsibility:capacity-planning·ResponsibilityCapacity Planning
Incoming edges
supports_work2
- tool:fireworks-ai·ToolFireworks AI
- tool-server:mcp-fireworks-ai-candidate·ToolServerFireworks AI MCP candidate