Documentation Index
Fetch the complete documentation index at: https://docs.svantic.com/llms.txt
Use this file to discover all available pages before exploring further.
Datadog Plugin for Savant
A standalone A2A agent that bridges Datadog monitoring alerts and metrics into the Savant mesh and exposes Datadog operations as tools for automated incident response.What It Does
- Listens to Datadog monitor webhooks (
@webhook-savantin monitor notifications) and polls the Monitors and Events APIs as fallback - Transforms monitor state transitions into structured Savant cues with priority routing
- Exposes 9 Datadog tools (query metrics, get/mute/unmute monitors, get events, host metrics, create events, get SLOs) callable by any agent on the mesh
Architecture
Deployment
settings.json
Tools
| Tool | Description |
|---|---|
datadog_get_alerts | Fetch monitors in alert/warn state |
datadog_query_metrics | Query time-series metrics |
datadog_get_monitor | Get specific monitor details |
datadog_mute_monitor | Mute a monitor for a duration |
datadog_unmute_monitor | Unmute a previously muted monitor |
datadog_get_events | Fetch recent Datadog events |
datadog_get_host_metrics | Get CPU/memory/disk for a host |
datadog_create_event | Post an event to Datadog |
datadog_get_slo | Fetch SLO status and error budget |
End-to-End Example: Automated Incident Response
- Datadog monitor triggers: “API latency > 500ms for 5 minutes”
- Plugin receives webhook, creates critical cue (tags include
pager) - Planner generates incident response plan:
- Query recent metrics for the affected service
- Check for recent deployments (via GitHub tools)
- Gather host metrics
- Synthesize findings into incident summary
- Post to Slack #incidents channel
- Create Datadog event documenting the investigation
- If a recent deploy is correlated, suggest rollback (pending human approval)
