How issues get created
From alerts. When an alert fires in a connected platform (Datadog, Sentry, PagerDuty, CloudWatch, etc.), Antimetal creates an issue and begins investigating automatically. From search. Ask anything about your infrastructure, services, or systems and kick off an investigation from what you find. From chat. Describe a problem directly in Antimetal to create an issue and start an investigation. Multiple alerts can map to the same issue. Antimetal groups related signals together.Issue statuses
| Status | What it means |
|---|---|
| Investigating | Antimetal is actively gathering evidence and building a root cause analysis. Default state when an issue is created. |
| Ready to fix | Investigation complete. Root cause, findings, and remediation steps are available. |
| Resolved | Marked as fixed by your team. |
| Muted | Acknowledged but deprioritized. If a new alert fires for a muted issue, it will reopen. |
Managing issues
The issues board is your control plane for managing all issues in one place. Issues are organized by status in a kanban view, and you can search and filter by status or environment.The investigation
Antimetal runs an automated investigation:- Evidence collection. Queries your connected platforms for logs, metrics, traces, events, and code relevant to the incident.
- Causal analysis. Analyzes collected evidence, builds hypotheses, and tests them against the data.
- Root cause report. Produces a summary of what’s happening, why, and the supporting findings.
- Remediation. Generates immediate fixes to get your systems back up, long-term root cause fixes, and preventative recommendations.
- Causal tree. Builds a visual graph showing how issues propagate through your system.
Chat
Chat is always available on any issue. You can ask Antimetal questions about the issue, how it relates to other issues, provide additional context, or direct the investigation.Answers
The primary view once an investigation completes.- What’s happening. Plain-language summary of the incident.
- Root cause. The identified cause with an explanation of why it’s happening.
- Findings. Individual pieces of evidence discovered during the investigation. Each finding links to the underlying data in your provider. Findings are categorized as cause, symptom, or ruled out.
Fixes
Remediation actions grouped into three categories:- Immediate. Steps to stabilize the situation now.
- Root cause. Changes that fix the underlying problem.
- Preventative. Improvements to avoid recurrence.