← Back to tasks
View in GitHub
Metrics promqlqueuebacklogtriageincident
promql-retry-backlog-triage
Instruction
We think the payment incident may have built up retries behind the scenes. Over roughly the last six hours, which service showed the highest retry/backlog depth, about how high did it get, and does the next-worst service look like a smaller spillover or a comparable primary problem?