IntermediateAI SQL Functionstrendingfeatured

cortex-agent

Name: cortex-agent
Author: cortex-community

[REQUIRED] Use for ALL requests that mention agents: list, show, create, build, set up, edit, modify, update, delete, drop, remove, download, export, debug, fix, troubleshoot, optimize, improve, evaluate, or analyze a (Cortex) agent.

Jump to Installation

Compatible with

Cortex

About this skill

Main

When to Use

When a user wants to list, create, edit, delete, download, debug, evaluate, or optimize a (Cortex) agent. This is the entry point for all (Cortex) agent workflows.

Setup

Load agent-system-of-record/SKILL.md: Required first step for all sessions.
Load best-practices/SKILL.md: Required to help maintain best practices for agent development.

⚠️ CRITICAL SAFETY INSTRUCTION: Before modifying an agent check with a user if it is a production agent and offer to create a clone. Ask user for the fully qualified clone name (DATABASE.SCHEMA.CLONE_AGENT_NAME) where they want the clone created. Follow agent-system-of-record/SKILL.md for clone creation.

Intent Detection

When user makes a request, detect their intent and load the appropriate sub-skill:

CREATE Intent - User wants to create/build a new agent:

Trigger phrases: "create agent", "build agent", "set up agent", "new agent", "make an agent"
→ Load create-cortex-agent/SKILL.md

EDIT Intent - User wants to edit/modify an existing agent:

Trigger phrases: "edit agent", "modify agent", "update agent", "change agent instructions", "change agent", "update instructions"
→ Load edit-cortex-agent/SKILL.md

ADHOC_TESTING Intent - User wants to test questions interactively:

Trigger phrases: "test questions", "try queries", "test agent", "run some questions"
→ Load adhoc-testing-for-cortex-agent/SKILL.md

EVALUATE Intent - User wants to run formal evaluation or benchmark agent:

Trigger phrases: "evaluate agent", "run evaluation", "benchmark", "measure accuracy", "check metrics", "evaluation results"
→ Load evaluate-cortex-agent/SKILL.md

DATASET Intent - User wants to create or manage evaluation datasets:

Trigger phrases: "create dataset", "build dataset", "evaluation dataset", "add questions to dataset", "curate dataset"
→ Load dataset-curation/SKILL.md

DEBUG_SINGLE_QUERY Intent - User wants to debug specific query or agent request:

Trigger phrases: "debug query", "why did this fail", "analyze response", "investigate issue", "debug request ID", "Snowflake Intelligence error with request ID", "SI request ID"
→ Load debug-single-query-for-cortex-agent/SKILL.md

DEBUG_EVAL Intent - User wants to debug/investigate evaluation runs or results:

Trigger phrases: "debug evaluation", "investigate agent evaluations", "eval timed out", "evaluation error", "missing eval metrics", "analyze low scores", "evaluation traces"
→ Load investigate-cortex-agent-evals/SKILL.md

OPTIMIZE Intent - User wants to improve agent performance:

Trigger phrases: "optimize", "improve accuracy", "production ready", "make it better"
→ Load optimize-cortex-agent/SKILL.md

DELETE Intent - User wants to delete/drop/remove an agent:

Trigger phrases: "delete agent", "drop agent", "remove agent", "destroy agent", "clean up agent"
→ Load delete-cortex-agent/SKILL.md

ACCESS Intent - User wants to grant, revoke, or check access on an agent:

Trigger phrases: "grant access", "revoke access", "share agent", "who has access", "show grants", "agent permissions"
→ Load create-cortex-agent/ACCESS_MANAGEMENT.md

LIST Intent - User wants to list/show existing agents:

Trigger phrases: "list agents", "show agents", "what agents exist", "find agents", "which agents"
→ Load list-cortex-agents/SKILL.md

DOWNLOAD Intent - User wants to download/export an agent's configuration:

Trigger phrases: "download agent", "export agent", "save agent config", "get agent spec", "dump agent", "back up agent"
→ Run get_agent_config.py directly (no sub-skill needed):

uv run python scripts/get_agent_config.py --agent-name <AGENT_NAME> \
  --database <DATABASE> --schema <SCHEMA> --connection <CONNECTION> \
  --output <OUTPUT_PATH>

Ask the user for agent coordinates (database, schema, agent name) and where to save the file. Defaults to ./<AGENT_NAME>_spec.json in the current directory.

Core Capabilities

Primary Workflows

1. Create Cortex Agent Flow

Load create-cortex-agent/SKILL.md when user chooses CREATE mode.

2. Edit Cortex Agent Flow

Load edit-cortex-agent/SKILL.md when user chooses EDIT mode.

Edit existing agent configuration - update instructions, add/remove tools, modify settings.

3. Adhoc Testing Flow

Load adhoc-testing-for-cortex-agent/SKILL.md when user chooses ADHOC_TESTING mode.

Interactive testing of agent responses - explore behavior, debug issues, validate fixes.

4. Evaluate Cortex Agent Flow

Load evaluate-cortex-agent/SKILL.md when user chooses EVALUATE mode.

Run formal evaluations using Snowflake's native Agent Evaluations with metrics:

answer_correctness - Is the answer correct?
tool_selection_accuracy - Did agent select the right tool?
logical_consistency - Is response logically consistent?

5. Dataset Curation Flow

Load dataset-curation/SKILL.md when user chooses DATASET mode.

Create and manage evaluation datasets - from scratch, from production data, or add to existing.

6. Debug Single Query Flow

Load debug-single-query-for-cortex-agent/SKILL.md when user chooses DEBUG_SINGLE_QUERY mode.

7. Optimize Cortex Agent Flow

Load optimize-cortex-agent/SKILL.md when user chooses OPTIMIZE mode.

Full optimization workflow: benchmark → identify issues → improve → validate.

8. Delete Cortex Agent Flow

Load delete-cortex-agent/SKILL.md when user chooses DELETE mode.

Safely delete an agent: backup spec → production safety check → drop → verify.

9. Agent Access Management Flow

Load create-cortex-agent/ACCESS_MANAGEMENT.md when user chooses ACCESS mode.

Grant, revoke, or inspect access grants on an agent.

10. List Cortex Agents Flow

Load list-cortex-agents/SKILL.md when user chooses LIST mode.

List agents by scope: account, database, or schema.

11. Download Agent Config Flow

Run get_agent_config.py directly when user chooses DOWNLOAD mode.

Download/export an agent's full specification JSON to a local file.

12. Debug Evaluation Flow

Load investigate-cortex-agent-evals/SKILL.md when user chooses DEBUG_EVAL mode.

Debug evaluation failures, investigate task timeouts, analyze low scores, and troubleshoot AI Observability issues. Provides SQL queries using GET_AI_EVALUATION_DATA and GET_AI_RECORD_TRACE functions.

Workflow Decision Tree

Start Session
    ↓
Run setup (Load `agent-system-of-record/SKILL.md` and `best-practices/SKILL.md`)
    ↓
Detect User Intent
    ↓
    ├─→ CREATE/BUILD → Load `create-cortex-agent/SKILL.md`
    │   (Triggers: "create agent", "build agent", "set up agent", "new agent")
    │
    ├─→ EDIT/MODIFY → Load `edit-cortex-agent/SKILL.md`
    │   (Triggers: "edit agent", "modify agent", "update agent", "change agent")
    │
    ├─→ ADHOC_TESTING → Load `adhoc-testing-for-cortex-agent/SKILL.md`
    │   (Triggers: "test questions", "try queries", "test agent")
    │
    ├─→ EVALUATE → Load `evaluate-cortex-agent/SKILL.md`
    │   (Triggers: "evaluate agent", "run evaluation", "benchmark", "metrics")
    │
    ├─→ DATASET → Load `dataset-curation/SKILL.md`
    │   (Triggers: "create dataset", "build dataset", "evaluation dataset")
    │
    ├─→ DEBUG_SINGLE_QUERY → Load `debug-single-query-for-cortex-agent/SKILL.md`
    │   (Triggers: "debug query", "why did this fail", "analyze response")
    │
    ├─→ OPTIMIZE → Load `optimize-cortex-agent/SKILL.md`
    │   (Triggers: "optimize", "improve accuracy", "production ready")
    │
    ├─→ DELETE → Load `delete-cortex-agent/SKILL.md`
    │   (Triggers: "delete agent", "drop agent", "remove agent")
    │   ⚠️ Backs up spec, requires explicit confirmation
    │
    ├─→ ACCESS → Load `create-cortex-agent/ACCESS_MANAGEMENT.md`
    │   (Triggers: "grant access", "revoke access", "share agent", "agent permissions")
    │
    ├─→ LIST → Load `list-cortex-agents/SKILL.md`
    │   (Triggers: "list agents", "show agents", "what agents exist")
    │
    ├─→ DOWNLOAD → Run `get_agent_config.py`
    │   (Triggers: "download agent", "export agent", "save agent config")
    │
    └─→ DEBUG_EVAL → Load `investigate-cortex-agent-evals/SKILL.md`
        (Triggers: "debug evaluation", "eval failure", "eval timed out", "low scores")

Typical User Journeys

Journey 1: New Agent Development

CREATE → ADHOC_TESTING → DATASET → EVALUATE → OPTIMIZE

Journey 2: Production Agent Improvement

EVALUATE (baseline) → OPTIMIZE → EVALUATE (validate)

Journey 3: Quick Edit

EDIT → ADHOC_TESTING (verify changes)

Journey 4: Quick Testing

ADHOC_TESTING → DEBUG_SINGLE_QUERY (if issues found)

Journey 5: Formal Benchmarking

DATASET → EVALUATE → compare results

Journey 6: Agent Cleanup

DELETE (with production safety check + backup if prod)

Journey 7: Discovery

LIST (show all agents in account/database/schema)

Journey 8: Export

DOWNLOAD (save agent spec to local file)

Rules

Running Scripts

When running any scripts in any of the above skills, make sure to do all of the following:

Check if uv is installed by running uv --version. If it's not installed, prompt the user to install it using one of these methods:
- curl -LsSf https://astral.sh/uv/install.sh | sh (recommended)
- brew install uv (macOS)
- pip install uv
When running python scripts, use uv run --project <DIRECTORY THIS SKILL.md file is in> python <DIRECTORY THIS SKILL.md file is in>/scripts/script_name.py to run them.
Do not cd into another directory to run them, but run them from whatever directory you're already in. WHY: This maintains your current working context and prevents path confusion. When using uv run --project, you must provide absolute paths for BOTH the --project flag AND the script itself.
Just run the script the way the skill says. Do not question it by running --help or reading the script unless the script fails when run as intended.

Common Mistakes When Running Scripts

❌ WRONG: uv run --project <DIRECTORY THIS SKILL.md file is in> python scripts/test_agent.py ... (Relative path to script will fail)
❌ WRONG: cd <DIRECTORY THIS SKILL.md file is in> && uv run python scripts/test_agent.py ... (Violates the "don't cd" rule)
✅ CORRECT: uv run --project <DIRECTORY THIS SKILL.md file is in> python <DIRECTORY THIS SKILL.md file is in>/scripts/test_agent.py ... (Use the same base directory for both --project and the script path)

System of Record

Load agent-system-of-record/SKILL.md.

Installation

Install using the Cortex CLI:

bash

$cortex skill install cortex-agent

Requires Cortex CLI v2.0+. Install guide →

Repository Sources

Loading repository files...

SKILL.md Preview

SKILL.md

---
name: cortex-agent
description: "**[REQUIRED]** Use for ALL requests that mention agents: list, show, create, build, set up, edit, modify, update, delete, drop, remove, download, export, debug, fix, troubleshoot, optimize, improve, evaluate, or analyze a (Cortex) agent. Also use when debugging Snowflake Intelligence with a request ID (SI is powered by Cortex Agents). This is the REQUIRED entry point - even if the request seems simple. DO NOT attempt to debug (Cortex) agents manually - always invoke this skill first. This skill guides users through listing, creating, editing, deleting, downloading, auditing, evaluating, and debugging workflows for (Cortex) agents."
---

# Main

## When to Use

When a user wants to list, create, edit, delete, download, debug, evaluate, or optimize a (Cortex) agent. This is the entry point for all (Cortex) agent workflows.

## Setup

1. **Load** `agent-system-of-record/SKILL.md`: Required first step for all sessions.
2. **Load** `best-practices/SKILL.md`: Required to help maintain best practices for agent development.

⚠️ CRITICAL SAFETY INSTRUCTION: Before modifying an agent check with a user if it is a production agent and offer to create a clone. Ask user for the **fully qualified clone name** (`DATABASE.SCHEMA.CLONE_AGENT_NAME`) where they want the clone created. Follow `agent-system-of-record/SKILL.md` for clone creation. 

## Intent Detection

When user makes a request, detect their intent and load the appropriate sub-skill:

**CREATE Intent** - User wants to create/build a new agent:

- Trigger phrases: "create agent", "build agent", "set up agent", "new agent", "make an agent"
- **→ Load** `create-cortex-agent/SKILL.md`

**EDIT Intent** - User wants to edit/modify an existing agent:

- Trigger phrases: "edit agent", "modify agent", "update agent", "change agent instructions", "change agent", "update instructions"
- **→ Load** `edit-cortex-agent/SKILL.md`

**ADHOC_TESTING Intent** - User wants to test questions interactively:

- Trigger phrases: "test questions", "try queries", "test agent", "run some questions"
- **→ Load** `adhoc-testing-for-cortex-agent/SKILL.md`

**EVALUATE Intent** - User wants to run formal evaluation or benchmark agent:

- Trigger phrases: "evaluate agent", "run evaluation", "benchmark", "measure accuracy", "check metrics", "evaluation results"
- **→ Load** `evaluate-cortex-agent/SKILL.md`

**DATASET Intent** - User wants to create or manage evaluation datasets:

- Trigger phrases: "create dataset", "build dataset", "evaluation dataset", "add questions to dataset", "curate dataset"
- **→ Load** `dataset-curation/SKILL.md`

**DEBUG_SINGLE_QUERY Intent** - User wants to debug specific query or agent request:

- Trigger phrases: "debug query", "why did this fail", "analyze response", "investigate issue", "debug request ID", "Snowflake Intelligence error with request ID", "SI request ID"
- **→ Load** `debug-single-query-for-cortex-agent/SKILL.md`

**DEBUG_EVAL Intent** - User wants to debug/investigate evaluation runs or results:

- Trigger phrases: "debug evaluation", "investigate agent evaluations", "eval timed out", "evaluation error", "missing eval metrics", "analyze low scores", "evaluation traces"
- **→ Load** `investigate-cortex-agent-evals/SKILL.md`

**OPTIMIZE Intent** - User wants to improve agent performance:

- Trigger phrases: "optimize", "improve accuracy", "production ready", "make it better"
- **→ Load** `optimize-cortex-agent/SKILL.md`

**DELETE Intent** - User wants to delete/drop/remove an agent:

- Trigger phrases: "delete agent", "drop agent", "remove agent", "destroy agent", "clean up agent"
- **→ Load** `delete-cortex-agent/SKILL.md`

**ACCESS Intent** - User wants to grant, revoke, or check access on an agent:

- Trigger phrases: "grant access", "revoke access", "share agent", "who has access", "show grants", "agent permissions"
- **→ Load** `create-cortex-agent/ACCESS_MANAGEMENT.md`

**LIST Intent** - User wants to list/show existing agents:

- Trigger phrases: "list agents", "show agents", "what agents exist", "find agents", "which agents"
- **→ Load** `list-cortex-agents/SKILL.md`

**DOWNLOAD Intent** - User wants to download/export an agent's configuration:

- Trigger phrases: "download agent", "export agent", "save agent config", "get agent spec", "dump agent", "back up agent"
- **→ Run** `get_agent_config.py` directly (no sub-skill needed):

```bash
uv run python scripts/get_agent_config.py --agent-name <AGENT_NAME> \
  --database <DATABASE> --schema <SCHEMA> --connection <CONNECTION> \
  --output <OUTPUT_PATH>
```

Ask the user for agent coordinates (database, schema, agent name) and where to save the file. Defaults to `./<AGENT_NAME>_spec.json` in the current directory.

## Core Capabilities

### Primary Workflows

#### 1. Create Cortex Agent Flow

**Load** `create-cortex-agent/SKILL.md` when user chooses CREATE mode.

#### 2. Edit Cortex Agent Flow

**Load** `edit-cortex-agent/SKILL.md` when user chooses EDIT mode.

Edit existing agent configuration - update instructions, add/remove tools, modify settings.

#### 3. Adhoc Testing Flow

**Load** `adhoc-testing-for-cortex-agent/SKILL.md` when user chooses ADHOC_TESTING mode.

Interactive testing of agent responses - explore behavior, debug issues, validate fixes.

#### 4. Evaluate Cortex Agent Flow

**Load** `evaluate-cortex-agent/SKILL.md` when user chooses EVALUATE mode.

Run formal evaluations using Snowflake's native Agent Evaluations with metrics:
- `answer_correctness` - Is the answer correct?
- `tool_selection_accuracy` - Did agent select the right tool?
- `logical_consistency` - Is response logically consistent?

#### 5. Dataset Curation Flow

**Load** `dataset-curation/SKILL.md` when user chooses DATASET mode.

Create and manage evaluation datasets - from scratch, from production data, or add to existing.

#### 6. Debug Single Query Flow

**Load** `debug-single-query-for-cortex-agent/SKILL.md` when user chooses DEBUG_SINGLE_QUERY mode.

#### 7. Optimize Cortex Agent Flow

**Load** `optimize-cortex-agent/SKILL.md` when user chooses OPTIMIZE mode.

Full optimization workflow: benchmark → identify issues → improve → validate.

#### 8. Delete Cortex Agent Flow

**Load** `delete-cortex-agent/SKILL.md` when user chooses DELETE mode.

Safely delete an agent: backup spec → production safety check → drop → verify.

#### 9. Agent Access Management Flow

**Load** `create-cortex-agent/ACCESS_MANAGEMENT.md` when user chooses ACCESS mode.

Grant, revoke, or inspect access grants on an agent.

#### 10. List Cortex Agents Flow

**Load** `list-cortex-agents/SKILL.md` when user chooses LIST mode.

List agents by scope: account, database, or schema.

#### 11. Download Agent Config Flow

Run `get_agent_config.py` directly when user chooses DOWNLOAD mode.

Download/export an agent's full specification JSON to a local file.

#### 12. Debug Evaluation Flow

**Load** `investigate-cortex-agent-evals/SKILL.md` when user chooses DEBUG_EVAL mode.

Debug evaluation failures, investigate task timeouts, analyze low scores, and troubleshoot AI Observability issues. Provides SQL queries using `GET_AI_EVALUATION_DATA` and `GET_AI_RECORD_TRACE` functions.

## Workflow Decision Tree

```
Start Session
    ↓
Run setup (Load `agent-system-of-record/SKILL.md` and `best-practices/SKILL.md`)
    ↓
Detect User Intent
    ↓
    ├─→ CREATE/BUILD → Load `create-cortex-agent/SKILL.md`
    │   (Triggers: "create agent", "build agent", "set up agent", "new agent")
    │
    ├─→ EDIT/MODIFY → Load `edit-cortex-agent/SKILL.md`
    │   (Triggers: "edit agent", "modify agent", "update agent", "change agent")
    │
    ├─→ ADHOC_TESTING → Load `adhoc-testing-for-cortex-agent/SKILL.md`
    │   (Triggers: "test questions", "try queries", "test agent")
    │
    ├─→ EVALUATE → Load `evaluate-cortex-agent/SKILL.md`
    │   (Triggers: "evaluate agent", "run evaluation", "benchmark", "metrics")
    │
    ├─→ DATASET → Load `dataset-curation/SKILL.md`
    │   (Triggers: "create dataset", "build dataset", "evaluation dataset")
    │
    ├─→ DEBUG_SINGLE_QUERY → Load `debug-single-query-for-cortex-agent/SKILL.md`
    │   (Triggers: "debug query", "why did this fail", "analyze response")
    │
    ├─→ OPTIMIZE → Load `optimize-cortex-agent/SKILL.md`
    │   (Triggers: "optimize", "improve accuracy", "production ready")
    │
    ├─→ DELETE → Load `delete-cortex-agent/SKILL.md`
    │   (Triggers: "delete agent", "drop agent", "remove agent")
    │   ⚠️ Backs up spec, requires explicit confirmation
    │
    ├─→ ACCESS → Load `create-cortex-agent/ACCESS_MANAGEMENT.md`
    │   (Triggers: "grant access", "revoke access", "share agent", "agent permissions")
    │
    ├─→ LIST → Load `list-cortex-agents/SKILL.md`
    │   (Triggers: "list agents", "show agents", "what agents exist")
    │
    ├─→ DOWNLOAD → Run `get_agent_config.py`
    │   (Triggers: "download agent", "export agent", "save agent config")
    │
    └─→ DEBUG_EVAL → Load `investigate-cortex-agent-evals/SKILL.md`
        (Triggers: "debug evaluation", "eval failure", "eval timed out", "low scores")
```

## Typical User Journeys

### Journey 1: New Agent Development
```
CREATE → ADHOC_TESTING → DATASET → EVALUATE → OPTIMIZE
```

### Journey 2: Production Agent Improvement
```
EVALUATE (baseline) → OPTIMIZE → EVALUATE (validate)
```

### Journey 3: Quick Edit
```
EDIT → ADHOC_TESTING (verify changes)
```

### Journey 4: Quick Testing
```
ADHOC_TESTING → DEBUG_SINGLE_QUERY (if issues found)
```

### Journey 5: Formal Benchmarking
```
DATASET → EVALUATE → compare results
```

### Journey 6: Agent Cleanup
```
DELETE (with production safety check + backup if prod)
```

### Journey 7: Discovery
```
LIST (show all agents in account/database/schema)
```

### Journey 8: Export
```
DOWNLOAD (save agent spec to local file)
```

## Rules

### Running Scripts

When running any scripts in any of the above skills, make sure to do all of the following:

1. **Check if `uv` is installed** by running `uv --version`. If it's not installed, prompt the user to install it using one of these methods:
   - `curl -LsSf https://astral.sh/uv/install.sh | sh` (recommended)
   - `brew install uv` (macOS)
   - `pip install uv`
2. When running python scripts, use `uv run --project <DIRECTORY THIS SKILL.md file is in> python <DIRECTORY THIS SKILL.md file is in>/scripts/script_name.py` to run them.
3. Do not `cd` into another directory to run them, but run them from whatever directory you're already in.
   WHY: This maintains your current working context and prevents path confusion. When using `uv run --project`, you must provide absolute paths for BOTH the --project flag AND the script itself.
4. Just run the script the way the skill says. Do not question it by running `--help` or reading the script unless the script fails when run as intended.

#### Common Mistakes When Running Scripts

1. ❌ WRONG: `uv run --project <DIRECTORY THIS SKILL.md file is in> python scripts/test_agent.py ...`
   (Relative path to script will fail)
2. ❌ WRONG: `cd <DIRECTORY THIS SKILL.md file is in> && uv run python scripts/test_agent.py ...`
   (Violates the "don't cd" rule)
3. ✅ CORRECT: `uv run --project <DIRECTORY THIS SKILL.md file is in> python <DIRECTORY THIS SKILL.md file is in>/scripts/test_agent.py ...`
   (Use the same base directory for both --project and the script path)

### System of Record

**Load** `agent-system-of-record/SKILL.md`.

Related Skills

Browse all

v1.0.0

cortex-agent

Compatible with

About this skill

Main

When to Use

Setup

Intent Detection

Core Capabilities

Primary Workflows

1. Create Cortex Agent Flow

2. Edit Cortex Agent Flow

3. Adhoc Testing Flow

4. Evaluate Cortex Agent Flow

5. Dataset Curation Flow

6. Debug Single Query Flow

7. Optimize Cortex Agent Flow

8. Delete Cortex Agent Flow

9. Agent Access Management Flow

10. List Cortex Agents Flow

11. Download Agent Config Flow

12. Debug Evaluation Flow

Workflow Decision Tree

Typical User Journeys

Journey 1: New Agent Development

Journey 2: Production Agent Improvement

Journey 3: Quick Edit

Journey 4: Quick Testing

Journey 5: Formal Benchmarking

Journey 6: Agent Cleanup

Journey 7: Discovery

Journey 8: Export

Rules

Running Scripts

Common Mistakes When Running Scripts

System of Record

Installation

Repository Sources

SKILL.md Preview

Tags

Related Skills

data-governance

declarative-sharing

organization-management

snowflake-notebooks

cortex-agent

Compatible with

About this skill

Main

When to Use

Setup

Intent Detection

Core Capabilities

Primary Workflows

1. Create Cortex Agent Flow

2. Edit Cortex Agent Flow

3. Adhoc Testing Flow

4. Evaluate Cortex Agent Flow

5. Dataset Curation Flow

6. Debug Single Query Flow

7. Optimize Cortex Agent Flow

8. Delete Cortex Agent Flow

9. Agent Access Management Flow

10. List Cortex Agents Flow

11. Download Agent Config Flow

12. Debug Evaluation Flow

Workflow Decision Tree

Typical User Journeys

Journey 1: New Agent Development

Journey 2: Production Agent Improvement

Journey 3: Quick Edit

Journey 4: Quick Testing

Journey 5: Formal Benchmarking

Journey 6: Agent Cleanup

Journey 7: Discovery

Journey 8: Export

Rules

Running Scripts

Common Mistakes When Running Scripts

System of Record

Installation