Service Operations · Observability
Live signal feed
Real-time service health, SLO tracking, and AI-correlated anomalies across infrastructure.
Anomaly: payments-svc error rate climbing
FiringDetected 4m ago · 3σ above 14d baselineForge correlated this with deploy v2.184.0 at 09:14 UTC. Linked incident INC-2841.
Services up
184/186
2 degraded
Requests / sec
42.6K
+4.2% vs 1h
Active alerts
7
2 P1 · 3 P2
P99 latency
612ms
↑ from 380ms
Latency · payments-svc
p50 / p95 / p99 · last 30 minutes
p50p95p99
Error rate
5xx %, threshold 1%
Service health (SLOs)
12 SLOs across 7 critical services
Healthy 9Watch 2Burning 1
Payments API
99.91% / target 99.95%
Auth Service
99.99% / target 99.99%
Core ITSM
99.97% / target 99.95%
Ingestion Pipeline
99.62% / target 99.90%
Knowledge Graph
99.96% / target 99.95%
AI Inference
99.88% / target 99.90%
Service map · payments flow
AI-traced critical path · click any node to drill in
HealthyWatchDegraded