Cloud Cost Optimizer

FinOps / Infrastructure

Analyze cloud billing data and identify savings opportunities with ROI estimates

Analyzes AWS/GCP/Azure billing exports and identifies: oversized instances, idle resources, unused reserved capacity, storage orphans, NAT gateway optimization, spot instance candidates. Produces a prioritized savings report with estimated monthly impact per recommendation.

Time Saved

2-3 days of monthly FinOps audit reduced to minutes

Cost Reduction

Typically identifies 20-40% savings ($50K-$500K/year)

Risk Mitigation

Prevents budget overruns with automated threshold alerts

System Prompt

You are a FinOps analyst. Analyze cloud billing data and identify cost optimization opportunities. Rules: - Categorize savings: rightsizing | idle_resources | reserved_capacity | storage | networking | licensing - For each finding, provide: current cost, optimized cost, savings %, implementation effort (low/medium/high) - Prioritize by savings/effort ratio - Flag resources with < 10% average utilization - Identify RI/Savings Plan coverage gaps - Output JSON: { totalCurrentCost: number, totalOptimizedCost: number, savingsPercent: number, recommendations: [...] } Never recommend downsizing production databases without load analysis.

Skills

aws-pricing

<skill name="aws-pricing"> Common AWS savings patterns: - EC2: t3.medium rarely uses burst credits -> switch to t3a.small (30% savings) - RDS: Multi-AZ on dev/staging -> remove (50% savings) - EBS: gp2 volumes -> gp3 (20% savings, better performance) - NAT Gateway: $0.045/GB -> VPC endpoints for S3/DynamoDB ($0) - Data Transfer: cross-AZ at $0.01/GB -> use AZ-aware routing - S3: Intelligent Tiering saves 40-60% on infrequently accessed data - Lambda: ARM (Graviton) = 20% cheaper, 34% better perf </skill>

Tools

parse_billing_csv

Description: Parses cloud provider billing export CSV into structured data

Parameters:

{ "csvContent": { "type": "string" }, "provider": { "type": "string", "enum": ["aws", "gcp", "azure"] } }

get_utilization_metrics

Description: Fetches CPU/memory/network utilization for a resource

Parameters:

{ "resourceId": { "type": "string" }, "period": { "type": "string" } }

MCP Integration

Schedule monthly: export billing CSV, POST to /api/mcp. Agent returns prioritized savings report. Dashboard displays recommendations with one-click implementation tickets.

Grading Suite

Detect idle EC2 instance

Input:

Instance i-abc123: type m5.xlarge, avg CPU 3%, avg network 100KB/s, running 730 hours

Criteria:

- output_match: recommends downsizing or termination (weight: 0.4) - output_match: provides savings estimate (weight: 0.3) - schema_validation: valid JSON with recommendations array (weight: 0.2) - llm_judge: recommendation is safe and actionable (weight: 0.1)