Task Library

Browse all 63 observability tasks used in the benchmark. The current public benchmark has 63 tasks.

Dashboards & Config

audit-service-overview-datasources

Can you audit the saved "Service Overview Audit" dashboard (`service-overview-audit`) for me? I just want a quick panel-...

Dashboards & Config

audit-service-overview-variable

Audit the saved "Service Overview Variable Audit" dashboard (`service-overview-variable-audit`) for me. I want to know w...

Investigation

cache-incident-blast-radius

Before we call this a broad backend rollout issue, check the earlier user-service cache-refresh incident and tell me whe...

Investigation

cache-refresh-lag-handoff

Earlier we had user-profile complaints that did not look like the payments incident. Using the telemetry around that ear...

Investigation

cache-rollout-trigger-check

I want a quick causal check on the earlier user-service cache incident: did the user-service v2.5.0 rollout actually loo...

Metrics

check-service-health

Quick health check - but don't stop at scrape status. Tell me whether our monitored targets are up right now, and whethe...

Dashboards

dashboard-add-cache-lag-panels

On-call needs the saved "User Access Overview" dashboard (`user-access-overview`) to make the new cache-refresh incident...

Dashboards

dashboard-add-deployment-annotation

Could you update the saved "Service Overview Annotate" dashboard (`svc-annotate`) so deployment rollouts are visible on ...

Dashboards

dashboard-add-retry-backlog-panels

Could you update the saved "Service Overview" dashboard (`service-overview`) so on-call can see retry pile-up directly d...

Dashboards

dashboard-create-cache-incident-review

Could you create a saved Grafana dashboard titled exactly "Cache Incident Review" so on-call can review the earlier user...

Dashboards

dashboard-create-service-overview

Could we add a Grafana dashboard titled exactly "Service Overview" - the saved dashboard resource that appears in dashbo...

Dashboards

dashboard-repair-cache-review

The saved "User Access Cache Review" dashboard (`user-access-cache-review`) is supposed to help us review the earlier us...

Dashboards

dashboard-update-add-service-variable

We'd like a service dropdown on "Service Overview" (`service-overview`) so on-call can scope the whole board to one or m...

Investigation

dependency-outage-false-lead

During order failures we're seeing 500s at the api-gateway. Is the gateway itself the problem, or is something behind it...

Investigation

deployment-blast-radius-check

Before we blame code changes for the payment incident, check whether there were deployment rollout events for the key ba...

Logs

find-slow-requests

Over the last six hours, are our HTTP request logs showing a really slow outlier? If so, which request or route is it, a...

Traces

find-slow-traces

Anything in our traces pointing to unusually slow payment or checkout paths? Which services keep showing up in those slo...

Dashboards & Config

get-datasource-details

How is our Prometheus datasource configured in Grafana - what URL is it hitting, and is it proxy or direct access?

Investigation

incident-triage

We had a noisy incident window a few hours ago - error rates jumped across several services. Give me a tight triage grou...

Dashboards & Config

inspect-dashboard-queries

Someone sent me our "Service Overview" dashboard (`svc-overview`). What is each panel actually querying in Grafana right...

Dashboards & Config

list-datasources

What datasources are configured in Grafana? List their names and types.

Traces

list-services-traces

Over roughly the last day of data in the environment, which services actually have tracing coverage - who shows up? Spli...

Logs

logql-cache-refresh-peak-lag

For the earlier user-service cache incident, I want the warning-log view rather than Prometheus. From the cache refresh ...

Logs

logql-deployment-rollout-events

We think a v2.5.0 rollout landed within roughly the last day, shortly before things went sideways. Which services logged...

Logs

logql-multi-stage-pipeline

Over the last six hours, which API endpoints look slowest in our request logs? Call out the worst paths and give a few r...

Logs

logql-parse-json-logs

Find HTTP 5xx request logs from the last 24 hours. Lines are JSON with structured fields such as `status` and `service`....

Logs

logql-payment-warning-volume

Over roughly the last six hours, how noisy were warning logs from payment-service? Give me an approximate warning count ...

Logs

logql-retry-backlog-warnings

Can you check whether the retry backlog warning logs are actually showing up? Tell me which queue name appears in those ...

Logs

logql-retry-vs-real-errors

Order flow has been flaky lately. Does that sound like mostly harmless retry chatter, or real backend failures underneat...

Logs

logql-top-5xx-endpoint

From Loki request logs (JSON lines with HTTP path and numeric status), which API path had the most true 5xx responses in...

Logs

logql-unwrap-orders-p95-latency

What's the p95 latency (milliseconds) for POST /api/orders over the last six hours? Work from the order-service request ...

Investigation

payment-error-blast-radius

Over roughly the last six hours, did the payment incident mostly stay in payment-service or did it clearly spill into or...

Investigation

payments-path-root-cause

Orders are failing, but I need to know whether this is really a payments-path incident or a broader backend mess. Using ...

Metrics

promql-burn-rate-assessment

payment-service threw a nasty 5xx spike a few hours ago. I need to know if it's still eating us or if it calmed down. Co...

Metrics

promql-cache-lag-vs-user-latency

I want the metric correlation for the earlier user-service cache incident. Over roughly the last 12 hours, tell me how h...

Metrics

promql-cache-refresh-lag-peak

We had an earlier user-service cache-refresh issue and I want the metric summary, not a general RCA. From Prometheus, te...

Metrics

promql-capacity-analysis

Using Prometheus `process_cpu_seconds_total` and `process_resident_memory_bytes`, assess how our services are doing on C...

Metrics

promql-discover-http-metric

I'm writing a traffic-by-service chart for **user-service**, **order-service**, and **payment-service**. In this environ...

Metrics

promql-error-rate

Over the last hour, what overall share of HTTP requests returned 5xx across user-service, order-service, and payment-ser...

Metrics

promql-highest-backend-error-ratio

Over the last 6 hours, which backend had the highest 5xx share? Give the backend and its share.

Metrics

promql-label-matchers-service-errors

From Prometheus, which backend ``job`` labels matching ``*-service`` recorded any HTTP 5xx in the last 24 hours? List th...

Metrics

promql-offset-traffic-compare

Has traffic to order-service changed compared to an hour ago? Report the current and previous request rates and whether ...

Metrics

promql-order-latency-vs-traffic

Order-service feels slow. Looking at the last 1h only, is that mainly traffic volume or are requests just slower per req...

Metrics

promql-retry-backlog-triage

We think the payment incident may have built up retries behind the scenes. Over roughly the last six hours, which servic...

Metrics

promql-subquery-peak-error-rate

What was the worst HTTP error rate (5xx share) order-service hit over the last 6 hours? I'm trying to catch the spike in...

Metrics

promql-topk-5xx-share

Over the last 24 hours, which backend service contributed the largest share of all backend 5xx responses? Report the ser...

Metrics

query-cpu-metrics

Checkout felt sluggish around six hours ago. Compare average CPU over that same ~6h window across the monitored jobs: ra...

Metrics

query-memory-usage

Same slowdown window: is memory pressure a convincing lead? Over roughly the last 6 hours, give the average overall resi...

Investigation

retry-backlog-incident

During the payment-led incident, did retries actually pile up somewhere or was that a false lead? Using the last six hou...

Dashboards & Config

search-dashboards

What's on the "Service Overview" dashboard (`svc-overview`)? How many panels are there, what are they called, and what G...

Investigation

service-degradation-rca

Checkout felt sluggish about six hours ago - customers noticed. What slowed down, why, and which piece of the stack was ...

Investigation

slow-path-hotspot-correlation

Some requests have been noticeably slow, and I need something tighter than "latency was up." Using roughly the last six ...

Traces

trace-error-analysis

Over roughly the last day, are we seeing failed or errored traces anywhere? Which services look bad? If you find a concr...

Traces

traceql-cache-refresh-hotspot

Earlier user-profile requests slowed down, but I want the trace-level hotspot rather than a general service summary. Loo...

Traces

traceql-checkout-p99-by-service

Checkout on POST /api/orders has been dragging. For those slow requests, which backend's own request-handling tail looks...

Traces

traceql-cross-service-analysis

Slow checkout: I'm focused on POST /api/orders taking over a second end-to-end. When you dig into those slow requests, w...

Traces

traceql-discover-orders-error-attributes

Orders are failing on POST /api/orders. Walk me through one failed request end-to-end - what services show up, what fail...

Traces

traceql-error-chain-orders

Can you pull a failing POST /api/orders trace from roughly the last day? I'd love a trace id if you have one, how the re...

Traces

traceql-error-span-analysis

Where are we seeing error spans in distributed traces over roughly the last day? Which services show up and what operati...

Traces

traceql-find-service-traces

What do recent order-service requests look like in tracing? A trace id would be great if you find one, plus whatever qui...

Traces

traceql-metrics-error-rate-by-service

Over the last day, which service looks worst for errors on POST /api/payments? I want the name, a trace id if you find o...

Traces

traceql-structural-query

When someone goes through checkout, what does order-service call underneath it? Name the downstream pieces you see, walk...

Traces

traceql-tail-latency-bottleneck

Order flow sometimes spikes into ugly tail latency. Show me a concrete slow example: where does the time go, and which d...