May 14, 2026
Free-text search in evaluation trajectories
← All updates
Free-text search in evaluation trajectories
4 new 8 improved 15 fixed
Free-text search lands in hosted evaluation trajectories this week, alongside session rewind, cross-org capability share links, and sandbox dependency auto-install.
New
- Free-text search in evaluation trajectories. Matches are highlighted inline in hosted evaluation trajectories and tool calls auto-expand when their content matches the search term. (dreadnode/dreadnode-tiger#1491)
- Session rewind. Roll back a conversation to any prior user message via
/rewindor double-Esc in the TUI — the full session history is preserved. (dreadnode/dreadnode-tiger#1436) - Cross-org capability share links. Capability hub links now work across orgs — recipients are automatically redirected to the right page regardless of their org slug. (dreadnode/dreadnode-tiger#1487)
- Sandbox dependency auto-install. Sandbox runtimes can now automatically install Python packages and system dependencies declared in capability manifests at load time. (dreadnode/dreadnode-tiger#1458)
Improvements
- AIRT auto-discovery with updated capability counts. The AIRT CLI now auto-discovers attacks, transforms, and scorers at runtime (61 attacks, 547 transforms, 141 scorers), and capability counts in the hub accurately reflect these results. (dreadnode/dreadnode-tiger#1459, dreadnode/capabilities#6)
- Simplified AIRT workspace layout. The AI red-teaming capability uses a cleaner
~/.dreadnode/airt/[org]/[workspace]/workspace structure with improved error messages and input validation. (dreadnode/capabilities#10) - AIRT docs cover all result-review surfaces. AIRT docs and CLI help text now document all result-review paths — overview dashboard, per-assessment view, trace view, and the custom report builder — not just CLI commands. (dreadnode/dreadnode-tiger#1445)
- Settings sidebar grouped into Personal vs Organization. Secrets and Chat Models are now clearly labeled as Personal, separate from Organization settings. (dreadnode/dreadnode-tiger#1474)
- ATIF trajectory exports include source lineage. Exported trajectories now carry an
extra.dreadnodeblock with org, workspace, project, and optional evaluation/item IDs for traceability. (dreadnode/dreadnode-tiger#1450) - AIRT assessments Details glossary consolidated. The Details section now shows a single glossary popover at the section header instead of three separate per-cell tooltips. (dreadnode/dreadnode-tiger#1493)
- Optimization job sidebar shows author email. The optimization job sidebar now displays the author’s email instead of a raw user UUID, consistent with Evaluation and Training job lists. (dreadnode/dreadnode-tiger#1454)
- Task name copy button. A copy-to-clipboard button next to the task name in the environment detail drawer makes it easier to grab long task names for CLI commands or evaluation setup. (dreadnode/dreadnode-tiger#1072)
Fixes
- AIRT CLI runs now appear in the platform UI. Assessment creation was silently succeeding without connecting to the platform; AIRT CLI runs now correctly appear in the UI and analytics. (dreadnode/dreadnode-tiger#1499)
ModuleNotFoundErrorin UV tool and managed Python environments resolved. AI red-teaming workflows no longer fail withModuleNotFoundError: No module named dreadnodein UV tool, container, and managed Python environments. (dreadnode/dreadnode-tiger#1476, dreadnode/dreadnode-tiger#1479, dreadnode/capabilities#7)- AIRT
--projectflag now correctly scopes assessments. The--projectflag no longer silently falls back todefault— assessments and trace exports are scoped to the specified project. (dreadnode/dreadnode-tiger#1452) - TUI context gauge shows last-generation tokens. The context gauge now shows last-generation input tokens (e.g. 800k/1M) instead of a cumulative sum that could exceed the model’s context limit. (dreadnode/dreadnode-tiger#1451)
- TUI tool output and XMLModel serialization fixed. TUI tool output no longer gets clipped after completion, and XMLModel serialization no longer crashes on NUL bytes or ANSI escape sequences. (dreadnode/dreadnode-tiger#1457)
- ASR values display consistently as percentages. Attack Success Rate values now render as percentages across all AIRT views — no more mixed decimal/percentage formatting. (dreadnode/dreadnode-tiger#1489)
- All 15 AIRT goal categories now surfaced. The AI red-teaming agent now offers all 15 goal categories (including
reasoning_exploitation,supply_chain, andresource_exhaustion) instead of only 9. (dreadnode/capabilities#14) - AIRT severity reference documents all five levels. Critical, High, Medium, Low, and Info are now all defined — previously only Critical and High were documented. (dreadnode/dreadnode-tiger#1444)
- Trace copy button restored on Safari. The trace copy button (rich and log views) now works correctly on Safari after breaking in v2.0.20. (dreadnode/dreadnode-tiger#1469)
- AIRT report generation no longer floods UI with toasts. Generating a report no longer triggers an infinite loop of ‘Report build failed’ toasts that locked the UI. (dreadnode/dreadnode-tiger#1470)
- Onboarding username now names the default workspace. Your chosen username during onboarding correctly names your default workspace instead of an autogenerated placeholder. (dreadnode/dreadnode-tiger#1473)
- Deep links to evaluation samples no longer show ‘sample not found’. Navigating directly to a specific evaluation sample via deep link now loads correctly. (dreadnode/dreadnode-tiger#1484)
projectscommand in TUI scoped to current workspace. TheprojectsTUI command now loads projects for the active workspace only. (dreadnode/dreadnode-tiger#1464)ProcessJudgeno longer crashes on rubrics mentioning.yaml. Plaintext rubrics that reference a.yamlfilename no longer triggerOSError: File name too long. (dreadnode/dreadnode-tiger#1465)- Docs code blocks render
--flags with correct spacing. Geist Mono ligatures are disabled in docs code blocks, sodn --capabilityno longer renders asdn--capability. (dreadnode/dreadnode-tiger#1500)