← Back to tasks
Metrics promqlqueuebacklogtriageincident

promql-retry-backlog-triage

View in GitHub

Instruction

We think the payment incident may have built up retries behind the scenes. Over roughly the last six hours, which service showed the highest retry/backlog depth, about how high did it get, and does the next-worst service look like a smaller spillover or a comparable primary problem?