Experiments

Experiments are individual test executions that generate attack prompts, run conversations with your AI agent, and produce security verdicts.

List Experiments

hb experiments list

Show Experiment Details

hb experiments show <id>

Check Experiment Status

# Single status check
hb experiments status <id>

# Live updates (refreshes every 10 seconds)
hb experiments status <id> --watch

# Dashboard of all experiments (polls every 60 seconds until all complete)
hb experiments status --all

Wait for Completion

Block until an experiment completes (useful for CI/CD pipelines):

# Wait indefinitely
hb experiments wait <id>

# Wait with timeout (minutes)
hb experiments wait <id> --timeout 60
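
In a CI job, the wait step is often followed by saving the report as a build artifact. The sketch below combines the `wait` command with the `report` command described later on this page. `wait_and_report` is a hypothetical helper, not part of `hb`, and the script assumes that `hb experiments wait` exits with a non-zero status on timeout or error, which this page does not confirm.

```shell
# Hypothetical helper: wait for an experiment, then save its HTML report.
# ASSUMPTION: `hb experiments wait` exits non-zero on timeout or error.
# The body runs in a subshell, so its variables do not leak to the caller.
wait_and_report() (
  id=$1
  timeout_min=${2:-60}        # timeout in minutes, as taken by --timeout
  out=${3:-report.html}

  if ! hb experiments wait "$id" --timeout "$timeout_min"; then
    echo "wait failed or timed out for experiment $id" >&2
    return 1
  fi

  # Save the report to a file without opening a browser
  hb experiments report "$id" -o "$out" --no-open
)

# Usage: wait_and_report <id> 30 artifacts/report.html
```

If the exit-status assumption does not hold for your version of `hb`, check the result with `hb experiments status <id>` after the wait returns.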

View Experiment Logs

# View logs for a specific experiment
hb logs <id>

# Filter by verdict
hb logs <id> --verdict fail
hb logs <id> --verdict pass

# Export branded HTML report
hb logs <id> --format html -o report.html

# Export as JSON
hb logs <id> --format json --all -o results.json

# Project-wide logs with scope flags
hb logs --last 5                           # Logs from the last 5 experiments
hb logs --last 3 --verdict fail            # Failed logs from the last 3 experiments
hb logs --category owasp_agentic           # Filter by test category
hb logs --days 7 --format json -o week.json
hb logs --from 2026-01-01 --until 2026-02-01 --format html -o jan.html
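
The scope, filter, and export flags above can likely be combined in a single call, for example to archive recent failures for triage. The sketch below is a hypothetical wrapper (`export_recent_failures` is not an `hb` command). It assumes that `--last`, `--verdict`, `--format`, and `-o` are accepted together. Each flag appears in the examples above, but this exact combination does not.

```shell
# Hypothetical helper: export failed logs from the last N experiments as JSON.
# ASSUMPTION: --last, --verdict, --format and -o can be combined in one call.
export_recent_failures() (
  count=${1:-5}               # number of most recent experiments to include
  out=${2:-failures.json}     # output file

  hb logs --last "$count" --verdict fail --format json -o "$out"
)

# Usage: export_recent_failures 3 last3-failures.json
```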

Generate Experiment Report

# Generate HTML report (opens in browser by default)
hb experiments report <id>

# Save to file
hb experiments report <id> -o report.html

# Save without opening browser
hb experiments report <id> -o report.html --no-open

Terminate Running Experiment

hb experiments terminate <id>
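
One way to keep a stuck experiment from running on after a pipeline gives up is to pair `terminate` with a bounded `wait`. The sketch below is only an illustration: `wait_or_terminate` is a hypothetical helper, and it assumes that `hb experiments wait` exits non-zero when the timeout elapses, which this page does not state.

```shell
# Hypothetical helper: wait up to N minutes, terminate the experiment otherwise.
# ASSUMPTION: `hb experiments wait` exits non-zero on timeout or error.
wait_or_terminate() (
  id=$1
  timeout_min=${2:-60}

  if hb experiments wait "$id" --timeout "$timeout_min"; then
    return 0
  fi

  echo "wait failed or timed out for $id; terminating" >&2
  hb experiments terminate "$id"
  return 1
)

# Usage: wait_or_terminate <id> 45
```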

Delete Experiment

# Delete with confirmation
hb experiments delete <id>

# Skip confirmation
hb experiments delete <id> --force
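
For scripted cleanup, `--force` avoids an interactive prompt on each deletion. The sketch below loops over a list of experiment IDs. `delete_experiments` is a hypothetical helper, and the ID values in the usage line are placeholders. Because the confirmation is skipped, review the ID list before running anything like this.

```shell
# Hypothetical helper: delete several experiments without confirmation prompts.
# Returns 1 if any deletion reported a failure, 0 otherwise.
delete_experiments() (
  failed=0
  for id in "$@"; do
    if ! hb experiments delete "$id" --force; then
      echo "could not delete $id" >&2
      failed=1
    fi
  done
  return "$failed"
)

# Usage: delete_experiments <id-1> <id-2> <id-3>
```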