Evaluations

Press Ctrl+E to open evaluations.

Dreadnode TUI evaluations screen

The evaluations screen is built for monitoring runs without leaving the TUI.

  • left side - the job table with status, progress, pass rate, duration, and creation time
  • bottom left - a compact progress bar for the selected run
  • right side - detailed metadata for the highlighted evaluation

This is the best place to watch AI red team and security evaluation jobs move from queued to running to completed.

The detail panel shows the information you typically want during an evaluation run:

  • job status
  • model and agent type
  • concurrency and dataset size
  • item counts across passed, failed, timed out, and in-progress states
  • billed and estimated credits
  • timing metadata and run id

The screen responds to these keys:

  • Ctrl+E - open evaluations
  • r - refresh
  • c - cancel the selected evaluation
  • t - retry the selected evaluation
  • Esc - close the screen

You can also open the screen with /evaluations.
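
To make the relationship between the detail panel's item counts and the table's progress and pass rate columns concrete, here is a minimal Python sketch. It is an illustration only: the `ItemCounts` type, the `progress` and `pass_rate` functions, and both formulas are assumptions made for this example, not the definitions the TUI or the platform actually uses.

```python
from dataclasses import dataclass


@dataclass
class ItemCounts:
    """Hypothetical per-run item counts, mirroring the states in the detail panel."""
    passed: int
    failed: int
    timed_out: int
    in_progress: int


def finished_items(counts: ItemCounts) -> int:
    """Items that reached a terminal state (passed, failed, or timed out)."""
    return counts.passed + counts.failed + counts.timed_out


def progress(counts: ItemCounts, dataset_size: int) -> float:
    """Assumed definition: finished items as a fraction of the dataset size."""
    if dataset_size <= 0:
        return 0.0
    return min(finished_items(counts) / dataset_size, 1.0)


def pass_rate(counts: ItemCounts) -> float:
    """Assumed definition: passed items as a fraction of finished items."""
    finished = finished_items(counts)
    return counts.passed / finished if finished else 0.0


# Made-up numbers for a run that is still in progress.
counts = ItemCounts(passed=30, failed=6, timed_out=4, in_progress=10)
print(f"progress:  {progress(counts, dataset_size=100):.0%}")
print(f"pass rate: {pass_rate(counts):.0%}")
```

Under these assumptions, timed-out items count against the pass rate, and in-progress items affect neither figure until they finish. If the numbers on screen differ from this arithmetic, trust the screen.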