Browse all 63 observability tasks in the current public benchmark.
Can you audit the saved "Service Overview Audit" dashboard (`service-overview-audit`) for me? I just want a quick panel-...
Audit the saved "Service Overview Variable Audit" dashboard (`service-overview-variable-audit`) for me. I want to know w...
Before we call this a broad backend rollout issue, check the earlier user-service cache-refresh incident and tell me whe...
Earlier we had user-profile complaints that did not look like the payments incident. Using the telemetry around that ear...
I want a quick causal check on the earlier user-service cache incident: did the user-service v2.5.0 rollout actually loo...
Quick health check - but don't stop at scrape status. Tell me whether our monitored targets are up right now, and whethe...
On-call needs the saved "User Access Overview" dashboard (`user-access-overview`) to make the new cache-refresh incident...
Could you update the saved "Service Overview Annotate" dashboard (`svc-annotate`) so deployment rollouts are visible on ...
Could you update the saved "Service Overview" dashboard (`service-overview`) so on-call can see retry pile-up directly d...
Could you create a saved Grafana dashboard titled exactly "Cache Incident Review" so on-call can review the earlier user...
Could we add a Grafana dashboard titled exactly "Service Overview" - the saved dashboard resource that appears in dashbo...
The saved "User Access Cache Review" dashboard (`user-access-cache-review`) is supposed to help us review the earlier us...
We'd like a service dropdown on "Service Overview" (`service-overview`) so on-call can scope the whole board to one or m...
During order failures we're seeing 500s at the api-gateway. Is the gateway itself the problem, or is something behind it...
Before we blame code changes for the payment incident, check whether there were deployment rollout events for the key ba...
Over the last six hours, are our HTTP request logs showing a really slow outlier? If so, which request or route is it, a...
Anything in our traces pointing to unusually slow payment or checkout paths? Which services keep showing up in those slo...
How is our Prometheus datasource configured in Grafana - what URL is it hitting, and is it proxy or direct access?
We had a noisy incident window a few hours ago - error rates jumped across several services. Give me a tight triage grou...
Someone sent me our "Service Overview" dashboard (`svc-overview`). What is each panel actually querying in Grafana right...
What datasources are configured in Grafana? List their names and types.
Over roughly the last day of data in the environment, which services actually have tracing coverage - who shows up? Spli...
For the earlier user-service cache incident, I want the warning-log view rather than Prometheus. From the cache refresh ...
We think a v2.5.0 rollout landed within roughly the last day, shortly before things went sideways. Which services logged...
Over the last six hours, which API endpoints look slowest in our request logs? Call out the worst paths and give a few r...
Find HTTP 5xx request logs from the last 24 hours. Lines are JSON with structured fields such as `status` and `service`....
Over roughly the last six hours, how noisy were warning logs from payment-service? Give me an approximate warning count ...
Can you check whether the retry backlog warning logs are actually showing up? Tell me which queue name appears in those ...
Order flow has been flaky lately. Does that sound like mostly harmless retry chatter, or real backend failures underneat...
From Loki request logs (JSON lines with HTTP path and numeric status), which API path had the most true 5xx responses in...
What's the p95 latency (milliseconds) for POST /api/orders over the last six hours? Work from the order-service request ...
Over roughly the last six hours, did the payment incident mostly stay in payment-service or did it clearly spill into or...
Orders are failing, but I need to know whether this is really a payments-path incident or a broader backend mess. Using ...
payment-service threw a nasty 5xx spike a few hours ago. I need to know if it's still eating us or if it calmed down. Co...
I want the metric correlation for the earlier user-service cache incident. Over roughly the last 12 hours, tell me how h...
We had an earlier user-service cache-refresh issue and I want the metric summary, not a general RCA. From Prometheus, te...
Using Prometheus `process_cpu_seconds_total` and `process_resident_memory_bytes`, assess how our services are doing on C...
I'm writing a traffic-by-service chart for **user-service**, **order-service**, and **payment-service**. In this environ...
Over the last hour, what overall share of HTTP requests returned 5xx across user-service, order-service, and payment-ser...
Over the last 6 hours, which backend had the highest 5xx share? Give the backend and its share.
From Prometheus, which backend `job` labels matching `*-service` recorded any HTTP 5xx in the last 24 hours? List th...
Has traffic to order-service changed compared to an hour ago? Report the current and previous request rates and whether ...
Order-service feels slow. Looking at the last 1h only, is that mainly traffic volume or are requests just slower per req...
We think the payment incident may have built up retries behind the scenes. Over roughly the last six hours, which servic...
What was the worst HTTP error rate (5xx share) order-service hit over the last 6 hours? I'm trying to catch the spike in...
Over the last 24 hours, which backend service contributed the largest share of all backend 5xx responses? Report the ser...
Checkout felt sluggish around six hours ago. Compare average CPU over that same ~6h window across the monitored jobs: ra...
Same slowdown window: is memory pressure a convincing lead? Over roughly the last 6 hours, give the average overall resi...
During the payment-led incident, did retries actually pile up somewhere or was that a false lead? Using the last six hou...
What's on the "Service Overview" dashboard (`svc-overview`)? How many panels are there, what are they called, and what G...
Checkout felt sluggish about six hours ago - customers noticed. What slowed down, why, and which piece of the stack was ...
Some requests have been noticeably slow, and I need something tighter than "latency was up." Using roughly the last six ...
Over roughly the last day, are we seeing failed or errored traces anywhere? Which services look bad? If you find a concr...
Earlier user-profile requests slowed down, but I want the trace-level hotspot rather than a general service summary. Loo...
Checkout on POST /api/orders has been dragging. For those slow requests, which backend's own request-handling tail looks...
Slow checkout: I'm focused on POST /api/orders taking over a second end-to-end. When you dig into those slow requests, w...
Orders are failing on POST /api/orders. Walk me through one failed request end-to-end - what services show up, what fail...
Can you pull a failing POST /api/orders trace from roughly the last day? I'd love a trace id if you have one, how the re...
Where are we seeing error spans in distributed traces over roughly the last day? Which services show up and what operati...
What do recent order-service requests look like in tracing? A trace id would be great if you find one, plus whatever qui...
Over the last day, which service looks worst for errors on POST /api/payments? I want the name, a trace id if you find o...
When someone goes through checkout, what does order-service call underneath it? Name the downstream pieces you see, walk...
Order flow sometimes spikes into ugly tail latency. Show me a concrete slow example: where does the time go, and which d...