1 critical
Updated 2s ago · auto-refresh 30s
System overview
5 / 6 services healthy, 18.42M requests in the last 24h, 99.94% uptime.
Uptime 24h: 99.94% (+0.02%)
p50 latency: 128ms (+4ms)
Error rate: 0.80% (+0.2pp)
Deploys today: 24 (+8)
ml-inference · p95 latency · last 60 min: 2,240ms (↑ 4,660%)
Services
Production services
Alerts
Active alerts
p95 latency exceeded 2s
ml-inference · 18m ago
p95=2240ms · threshold=1500ms · sustained 12m
Error rate elevated to 3.4%
ml-inference · 18m ago
baseline 0.5% · affected endpoint /infer/chat
Rate limit approaching ceiling
api-gateway · 34m ago
82% of 10k rpm quota used
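The alert details above ("threshold=1500ms · sustained 12m", "82% of 10k rpm quota used") suggest a sustained-threshold check and a simple quota calculation. The following is a minimal sketch of both, assuming one p95 sample per minute; the function names and the sample series are hypothetical and do not come from the dashboard itself.

```python
from typing import Sequence


def sustained_breach(
    p95_ms_per_minute: Sequence[float],
    threshold_ms: float,
    sustain_minutes: int,
) -> bool:
    """Return True if every one of the most recent `sustain_minutes`
    samples is above `threshold_ms`.

    Assumes one sample per minute, oldest first (an assumption; the
    dashboard does not state its sampling interval).
    """
    if sustain_minutes <= 0 or len(p95_ms_per_minute) < sustain_minutes:
        return False
    recent = p95_ms_per_minute[-sustain_minutes:]
    return all(value > threshold_ms for value in recent)


def quota_used_rpm(percent_used: int, quota_rpm: int) -> int:
    """Convert a whole-number percentage of a requests-per-minute quota
    into an absolute rate, using integer arithmetic."""
    return percent_used * quota_rpm // 100


if __name__ == "__main__":
    # Illustrative series only: 48 quiet minutes, then 12 minutes at the
    # p95 value reported in the alert (2240ms).
    series = [1200.0] * 48 + [2240.0] * 12
    print(sustained_breach(series, threshold_ms=1500, sustain_minutes=12))
    print(quota_used_rpm(82, 10_000))
```

Under these assumptions, the ml-inference alert fires only once the breach has lasted the full 12 minutes, and the api-gateway figure corresponds to 8,200 of 10,000 requests per minute.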
Deployments