Changelog
Release history for the AgentKavach SDK and platform.
v0.2.0 March 2026
Duration tracking, expanded budget types, and rate limiting improvements.
Duration Tracking
- New duration_ms field on ingest events, which tracks LLM call latency in milliseconds
- Dashboard overview and agent detail pages now display cumulative duration
- Duration budgets are fully evaluable — alerts fire when duration thresholds are crossed
- Custom date range picker supports duration aggregation
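As an illustration of how a caller might populate duration_ms, here is a minimal sketch that times a call with a monotonic clock and attaches the result to an event dictionary. Only the duration_ms field name comes from this changelog; the helper names (timed_call, build_event) and the rest of the event shape are hypothetical and not the SDK's actual API.

```python
import time


def timed_call(fn, *args, **kwargs):
    """Run fn and return (result, duration_ms) using a monotonic clock."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    # perf_counter returns seconds as a float; convert to whole milliseconds.
    duration_ms = int((time.perf_counter() - start) * 1000)
    return result, duration_ms


def build_event(agent_id, duration_ms):
    # Hypothetical event shape: only "duration_ms" is named in the changelog.
    return {"agent_id": agent_id, "duration_ms": duration_ms}


# Stand-in for an LLM call; any callable works here.
result, ms = timed_call(sum, [1, 2, 3])
event = build_event("demo-agent", ms)
```

A monotonic clock such as time.perf_counter is used here because wall-clock time can jump backwards, which would produce negative latencies.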
Budget Types
- 7 budget types now supported: cost, tokens_total, tokens_input, tokens_output, calls, duration, and cost/monthly
- All 6 non-period budget types are independently evaluable for alerts
- Alert dedup key includes budget_type — same threshold fires once per type
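The dedup behavior described above can be sketched as follows. This is an assumption-laden illustration, not the platform's implementation: the key layout, the other key components (org_id, budget_id, threshold_pct), and the in-memory set are all hypothetical. The point it shows is that once budget_type is part of the key, the same threshold can fire once for each type.

```python
def dedup_key(org_id, budget_id, budget_type, threshold_pct):
    # Hypothetical key layout; the changelog only states that
    # budget_type is part of the key.
    return f"{org_id}:{budget_id}:{budget_type}:{threshold_pct}"


class AlertDeduper:
    """Remembers which keys have already fired (in memory, for illustration)."""

    def __init__(self):
        self._seen = set()

    def should_fire(self, key):
        if key in self._seen:
            return False
        self._seen.add(key)
        return True


deduper = AlertDeduper()
first_cost = deduper.should_fire(dedup_key("org1", "b1", "cost", 80))
first_duration = deduper.should_fire(dedup_key("org1", "b1", "duration", 80))
repeat_cost = deduper.should_fire(dedup_key("org1", "b1", "cost", 80))
```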
Rate Limiting
- Simplified burst limits: Free=1K, Pro=5K, Max=10K events/second
- Removed dead RPM (requests per minute) code path
- Per-org rate limit overrides via admin API
- IP rate limiting simplified to flat 10K ceiling
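To make the tier limits and per-org overrides concrete, here is a small sketch under stated assumptions. The tier values come from the list above; the fixed one-second window, the class and function names, and the in-memory storage are hypothetical (the platform is described elsewhere in this changelog as using Redis, and its actual algorithm is not specified here).

```python
# Tier values taken from the changelog, in events per second.
BURST_LIMITS = {"free": 1_000, "pro": 5_000, "max": 10_000}


def resolve_burst_limit(plan, override=None):
    """A per-org override, when present, takes precedence over the tier default."""
    return override if override is not None else BURST_LIMITS[plan]


class BurstLimiter:
    """Fixed one-second window counter (an assumed algorithm, for illustration)."""

    def __init__(self):
        self._windows = {}  # org_id -> (window_second, count)

    def allow(self, org_id, limit, now):
        second = int(now)
        window, count = self._windows.get(org_id, (second, 0))
        if window != second:
            # A new second has started, so the counter resets.
            window, count = second, 0
        if count >= limit:
            self._windows[org_id] = (window, count)
            return False
        self._windows[org_id] = (window, count + 1)
        return True


limiter = BurstLimiter()
limit = resolve_burst_limit("free", override=2)
decisions = [limiter.allow("org1", limit, t) for t in (0.1, 0.5, 0.9, 1.0)]
```

Passing the timestamp in as an argument keeps the sketch deterministic; a real caller would likely supply time.time().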
v0.1.0 March 2026
Initial release of AgentKavach — hard budget limits for LLM APIs.
Python SDK
- Budget enforcement with daily, monthly, and total limits
- Shared budgets across multiple agents and services
- Guardrails: token limits per run, call count caps, runtime limits, and loop detection
- Multi-provider support: OpenAI, Anthropic, Google (Gemini), and Mistral
- Streaming support with GeneratorExit handling
- YAML configuration for team-wide budget management
- Fail-open design — internal errors never block LLM calls
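The streaming and fail-open items above can be pictured with a generic generator wrapper. This is a sketch of the general Python technique, not the SDK's code: tracked_stream, _report, and the on_done callback are hypothetical names. It reports usage when the consumer stops reading early (Python raises GeneratorExit inside the generator on close) and swallows reporting errors so the stream itself is never interrupted.

```python
def _report(on_done, count, completed):
    # Fail-open: a tracking error must not break the caller's stream.
    try:
        on_done(count, completed)
    except Exception:
        pass


def tracked_stream(chunks, on_done):
    """Yield chunks unchanged, then report how many were delivered."""
    count = 0
    try:
        for chunk in chunks:
            count += 1
            yield chunk
    except GeneratorExit:
        # The consumer closed the stream before it was exhausted.
        _report(on_done, count, False)
        raise
    _report(on_done, count, True)


log = []
stream = tracked_stream(iter(["a", "b", "c"]), lambda n, done: log.append((n, done)))
first = next(stream)
second = next(stream)
stream.close()  # triggers GeneratorExit inside tracked_stream
```

Re-raising GeneratorExit after reporting matters: a generator that swallows it and yields again raises RuntimeError on close.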
Alert Channels
- Email notifications via Resend
- Slack webhook integration
- PagerDuty incident creation
- Generic webhook with HMAC signing
- Kill switch callback for emergency shutdowns
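For readers receiving the generic webhook, HMAC signing typically works as sketched below using Python's standard hmac module. The changelog does not state the hash algorithm, signature encoding, or header name, so SHA-256 with a hex digest is an assumption here, and sign_payload and verify_signature are hypothetical helper names.

```python
import hashlib
import hmac
import json


def sign_payload(secret: bytes, body: bytes) -> str:
    # Assumed scheme: HMAC-SHA256 over the raw request body, hex-encoded.
    return hmac.new(secret, body, hashlib.sha256).hexdigest()


def verify_signature(secret: bytes, body: bytes, signature: str) -> bool:
    expected = sign_payload(secret, body)
    # compare_digest avoids leaking information through timing differences.
    return hmac.compare_digest(expected, signature)


secret = b"example-shared-secret"
body = json.dumps({"alert": "budget_exceeded"}).encode("utf-8")
signature = sign_payload(secret, body)

valid = verify_signature(secret, body, signature)
tampered = verify_signature(secret, body + b" ", signature)
```

Verification should run against the exact bytes received, since re-serializing the JSON can change whitespace or key order and invalidate the signature.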
Telemetry & API
- OpenTelemetry-based telemetry export
- REST API with JWT and API key authentication
- Redis-backed rate limiting (daily + burst)
- Kafka event pipeline for high-throughput ingestion
Dashboard
- Real-time spend tracking per agent
- Budget configuration and alert rule management
- Run-level detail with event timeline
- Organization-scoped multi-tenant architecture