Runs

Early Access

Spyglass is currently in early access with trusted partners. Contact us for early access.

Spyglass Runs#

A Run in Spyglass is the process of evaluating an AI system by sending either a series of prompts or Datasets to specific Targets and measuring the results with Scorers. Runs are a core component that enable you to:

Simulate potential attacks and manipulations
Test model resilience under adversarial conditions
Generate actionable insights about vulnerabilities
Compare performance across multiple models
Download results for further analysis

Each Run consists of a Dataset, Target, and Scorer combination within a Project. The results help identify security risks and vulnerabilities in your AI applications during both development and deployment.

Following a run execution, datasets are available to download which includes inputs from the original dataset and model responses in a new column. Additionally, you can navigate through the run logs within the Spyglass UI by navigating to Spyglass > Projects or Spyglass > Runs in the navigation bar: