Skip to content

Session and Trace Debugging

Start from a conversation transcript, inspect execution traces, and use Agents analysis subtabs to decide whether a failure is isolated or systemic.

Use this recipe when you already have one bad conversation, one failing sample, or one suspicious run. The reliable order is session first, traces second, Agents third.

  • you need to answer “what actually happened in this conversation?”
  • you need to identify which tool call, span, or runtime action caused the outcome
  • you need to decide whether the failure is isolated or systemic
  • the session ID, evaluation sample, or run you care about
  • the correct workspace and project
  • a decision about whether you are debugging the current local TUI process or a remote runtime
Start hereUse it for
Ctrl+B or /sessionstranscript and narrative context
Ctrl+T or /tracesremote OTEL-backed execution detail
/spanslocal TUI JSONL span output for the current process
Agents Dataexact queries against otel_traces after the target run is known
Terminal window
dreadnode
# inside the TUI:
# 1. press Ctrl+B or run /sessions
# 2. reopen the thread you want to inspect
# 3. use /rename <title> or /compact [guidance] if the thread needs cleanup before continuing

This gives you the narrative record: what the user asked, what the assistant said, and whether the conversation itself became confused.

2. Move to remote traces for execution detail

Section titled “2. Move to remote traces for execution detail”

Use Ctrl+T or /traces when the question becomes execution behavior:

  • which tools ran
  • which spans were slow or failed
  • whether retries or branches explain the bad output
  • whether runtime state points to a missing secret, bad environment, or tool mismatch

3. Use /spans only for the current local TUI session

Section titled “3. Use /spans only for the current local TUI session”

Use /spans when the bug is in the TUI process itself and you want the raw local event stream before it reaches the remote trace store.

This is most useful for:

  • confirming spans are being emitted at all
  • checking local ordering of task and tool events
  • debugging exporter behavior

4. Widen into Agents after you know the target run

Section titled “4. Widen into Agents after you know the target run”

Move into Agents only after you know which run or span matters:

  • Charts for broad trend questions
  • Data for exact queries and exports from otel_traces
  • Notebook when you need traces, runs, and evaluation context together

Carry the same workspace and project context forward so you do not compare unrelated work.

Once the failure mode is clear:

  • continue the session after /compact [guidance]
  • reprovision or reset the runtime if the problem is environmental
  • update secrets, capability config, or task selection if configuration is wrong
  • promote the issue into a wider evaluation or optimization workflow if the pattern is systemic
  • the session ID
  • the trace or span IDs that explain the failure
  • the exact prompt or assistant turn that anchors the investigation
  • the saved Agents query or export if the issue widened beyond one run
  • if the failure starts from one conversation, use traces before Agents
  • if the issue only exists in the local TUI process, stay in /spans and local debugging longer
  • if the same pattern appears across multiple sessions or evaluations, switch from debugging to regression or ops workflow design