Observability

Live
Service Operations · Observability

Live signal feed

Real-time service health, SLO tracking, and AI-correlated anomalies across infrastructure.

Anomaly: payments-svc error rate climbing

FiringDetected 4m ago · 3σ above 14d baseline

Forge correlated this with deploy v2.184.0 at 09:14 UTC. Linked incident INC-2841.

Services up
184/186
2 degraded
Requests / sec
42.6K
+4.2% vs 1h
Active alerts
7
2 P1 · 3 P2
P99 latency
612ms
↑ from 380ms

Latency · payments-svc

p50 / p95 / p99 · last 30 minutes

p50p95p99

Error rate

5xx %, threshold 1%

Above SLO

Service health (SLOs)

12 SLOs across 7 critical services

Healthy 9Watch 2Burning 1

Payments API

99.91% / target 99.95%
1.4× burn

Auth Service

99.99% / target 99.99%
0.2× burn

Core ITSM

99.97% / target 99.95%
0.5× burn

Ingestion Pipeline

99.62% / target 99.90%
3.2× burn

Knowledge Graph

99.96% / target 99.95%
0.3× burn

AI Inference

99.88% / target 99.90%
1.1× burn

Service map · payments flow

AI-traced critical path · click any node to drill in

Auto-mapped
🌐42Kapi-gateway🔐18Kauth-svc🛒9.2Korders-svc38Kcache-svc💳6.4Kpayments-svc🗄️postgres
HealthyWatchDegraded