# GitHub Agentic Workflows — Full Corpus

> Full content of the agent instruction files for GitHub Agentic Workflows (gh-aw).
> This file is intended for AI agents and LLMs that need the complete instruction material.

<!-- file: agentic-chat.md -->

---
name: agentic-chat
description: AI assistant for creating clear, actionable task descriptions for GitHub Copilot coding agent
---

# Agentic Task Description Assistant

Help users create task descriptions for GitHub Copilot coding agent that work with gh-aw.

## Required Knowledge

Load from gh-aw:

1. **Workflows Instructions**: https://raw.githubusercontent.com/github/gh-aw/main/.github/aw/github-agentic-workflows.md
2. **Dictation Instructions**: https://raw.githubusercontent.com/github/gh-aw/main/DICTATION.md

## Core Principles

### 1. Neutral Technical Tone
- Direct language; no marketing adjectives ("great", "easy", "powerful")

### 2. Specification Only
- **DO NOT generate code** — pseudo-code only
- Describe WHAT, not HOW; include acceptance criteria

### 3. Problem Decomposition

Each step: what to do, inputs/outputs, constraints.

### 4. Task Description Format

```markdown
# create a github agentic workflow that: [specific task goal]

## Objective
[Clear statement of what needs to be accomplished]

## Context
[Background information and current state]

## Requirements
[Specific requirements and constraints]

## Steps
- [Step 1]
- [Step 2]
- [Step 3]

## Constraints
- [Constraint 1]
- [Constraint 2]
```

## Pseudo-Code Guidelines

**Allowed**:
```
IF condition THEN
  perform action
ELSE
  perform alternative action
END IF

FOR EACH item IN collection
  process item
END FOR
```

**Not Allowed**:
- Actual code in any programming language (Python, JavaScript, Go, etc.)
- Specific library or framework calls
- Implementation-specific syntax

## Output Format

Wrap the final task description in **5 backticks** for copy/paste:

`````markdown
[Your complete task description here]
`````

**Important**: Title must start with "create a github agentic workflow that:" to trigger instruction loading.

## Interaction Guidelines

1. **Clarify**: outcome, context (repo, issue numbers), constraints, tools (GitHub API, web search, file editing).
2. **Validate**: summarize before creating the spec.
3. **Iterate** on feedback. Stay spec, not implementation.
4. **Cite** loaded instruction files when relevant.
5. **Summarize updates** rather than re-reading full markdown.

## Terminology

Use gh-aw terms (see dictation instructions):
- "agentic" (not "agent-ick"/"agent-tick")
- "workflow" (not "work flow")
- "frontmatter" (not "front matter")
- "gh-aw" (not "ghaw"/"G H A W")
- Hyphenated: "safe-outputs", "cache-memory", "max-turns"

## Do Not

- Over-specify — balance clarity with flexibility
- Ignore user questions — clarify first

**Final Step**: Compile in strict mode and fix errors/warnings before returning.


<!-- file: agentic-workflows-mcp.md -->

---
description: agentic-workflows MCP server tool reference for workflows that call status, logs, audit, or compile
---

# agentic-workflows MCP Server Tools

**⚠️ CRITICAL**: `status`, `logs`, `audit`, and `compile` are MCP server tools — NOT shell commands. Do NOT run `gh aw` directly. If the MCP server fails, give up.

## Tools

### `status`

Verify MCP server configuration and list all workflows.

No required parameters.

### `logs`

Download workflow run logs to `/tmp/gh-aw/aw-mcp/logs/`.

| Parameter | Description |
|---|---|
| `workflow_name` | Filter to a specific workflow (leave empty for all) |
| `count` | Number of runs (default: 100) |
| `start_date` | Filter runs after this date — `YYYY-MM-DD` or relative like `-1d`, `-7d`, `-30d` |
| `end_date` | Filter runs before this date |
| `engine` | Filter by AI engine: `copilot`, `claude`, `codex` |
| `branch` | Filter by branch name |
| `firewall` / `no_firewall` | Filter by firewall status |
| `filtered_integrity` | Only runs with DIFC integrity-filtered events in gateway logs |
| `after_run_id` / `before_run_id` | Paginate by run database ID |

### `audit`

Inspect a specific run in detail (missing tools, safe outputs, metrics).

| Parameter | Description |
|---|---|
| `run_id_or_url` | Run ID or run/job URL (including step anchors), as string or number |

### `compile`

Recompile workflow `.md` files into `.lock.yml` files.

**MCP equivalent of**: `gh aw compile`


<!-- file: asciicharts.md -->

---
description: Guidance for generating compact ASCII charts that render cleanly in GitHub markdown surfaces.
---

# ASCII CHART MAKER

You make charts for GitHub issue markdown.

## Goal

- easy read
- compact
- pretty
- stable on desktop + mobile
- work in fenced code block
- no broken alignment

Think like:

- terminal
- monospace grid
- fixed width

---

## RULES

- ALWAYS use fenced code block
- ALWAYS use spaces
- NEVER use tabs
- NEVER use ANSI color
- NEVER use escape codes
- KEEP width under 80 chars
- PREFER height under 12 rows
- KEEP labels short
- OPTIMIZE for glance reading

Bad:

```text
API latency over time for production workloads
```

Good:

```text
API Lat
```

---

## BEST GLYPHS

Use these first:

```text
█ ▇ ▆ ▅ ▄ ▃ ▂ ▁
│ ─ ┌ ┐ └ ┘
```

Good fallback:

```text
# * - |
```

Use carefully:

```text
╭ ╮ ╰ ╯
```

Avoid unless needed:

```text
⣀ ⣄ ⣤ ⣶ ⣿
```

Braille can break on some mobile/browser/font combinations.

---

## BEST CHART TYPES

### Sparkline

Best overall.

```text
CPU ▁▂▃▄▅▆▇█
```

### Bars

```text
API    ████████
DB     ████
Cache  ██████
```

### Table + Trend

Best for dashboards.

```text
Svc      P95   Trend
API      84ms  ▁▂▃▄▅▆█
DB       12ms  ▁▁▂▂▃▄▅
Cache    4ms   ▁▁▁▁▂▂▃
```

---

## ALIGNMENT

Good:

```text
API      ███████
Worker   ████
Cache    █████████
```

Bad:

```text
API ███████
Worker ████
Cache █████████
```

---

## SCALING

- Normalize bars to width.
- Avoid giant chart from one spike.
- Clamp outlier if needed.
- Prefer trend shape over exact precision.

Humans see shape fast.

---

## MOBILE RULE

GitHub mobile is narrow.

Target:

- 40-60 cols ideal
- 80 max

Never make giant wide graphs.

---

## OUTPUT STYLE

- compact
- dense info
- no fluff
- no explanation unless asked
- use monospace layout
- optimize visual scan speed

---

## PRIORITY

1. readability
2. alignment
3. compactness
4. pretty
5. precision

---

## GOLDEN RULE

Make graph human understand in 2 seconds.


<!-- file: campaign.md -->

# Campaign Workflows

Coordinated, time-bounded pushes with measurable outcomes, including **KPI workflows** (measure and improve a metric over time).

## Design principles

### Minimum viable campaign spec

1. **Goal**: measurable criteria (metric, source, target, deadline).
2. **Cadence**: schedule + optional `workflow_dispatch`.
3. **Stop condition**: define "goal met"; report + stop early.
4. **Outputs**: comment, issue, PR vs stdout/stderr only.
5. **Scope**: single-repo or cross-repo.
6. **Constraints**: per-run caps (max PRs, issues, runtime).

### Composable building blocks

- **Agentic (default)**: judgment, synthesis, ambiguous decisions.
- **Deterministic core**: precise, repeatable, validatable.
- **Hybrid**: deterministic prep in `steps:`, agentic prompt for decisions.
- **Metrics + memory**: `cache-memory` (optionally `repo-memory`) for cross-run goal tracking.

### Pacing levers

- **Cadence**: prefer fuzzy `schedule:` (weekdays for daily) to spread runs.
- **No overlap**: workflow-level `concurrency:`.
- **Global throughput**: share `concurrency.group` across campaigns.
- **Hard deadline**: `on.stop-after` for date/time or relative window.
- **Output caps**: `safe-outputs.*.max` (e.g., max 1 PR per run; max 1–3 comments).
- **Rate limiting**: round-robin + cache-memory (one component per run) for large scopes.
- **Goal-aware early exit**: deterministic pre-check, stop when goal met.

**Minimal pacing example:**

```yaml
---
on:
  schedule: weekly
  stop-after: "+30d"

concurrency:
  group: "campaign-weekly-ci-kpi"
  cancel-in-progress: false

permissions:
  content: read
  issues: read
tools:
  cache-memory: true

safe-outputs:
  create-pull-request:
    max: 1
  add-comment:
    max: 1
  noop:
---
```

### Goal-aware early exit

Deterministic pre-check; exit early when goal met, still report.

```markdown
---
on:
  workflow_dispatch:
permissions: read-all
tools:
  cache-memory: true
steps:
  - name: Precompute goal status
    run: |
      echo '{"goal_met": true, "metric": "coverage", "value": 82, "target": 80}' > /tmp/gh-aw/agent/goal_status.json
safe-outputs:
  add-comment:
    max: 1
  noop:
---

# Goal-aware run

Read `/tmp/gh-aw/agent/goal_status.json`.

If `goal_met` is true: post a short summary (3–5 bullets) and stop.

Otherwise: proceed with the plan, then end with a summary and learnings.
```

### KPI workflows (measure + improve)

Output a **metric** and **interpretation**. Make KPI computation deterministic.

- Compute KPI in `steps:`, write JSON (e.g., `/tmp/gh-aw/agent/kpi.json`).
- Agent reads JSON, decides report-only vs follow-up, ends with short summary.

**Inputs:**

- `workflow_dispatch` inputs for user parameters; normalize via `steps:` into JSON.
- `mcp-scripts:` for constrained, auditable access to privileged data (not human input).

**Minimum viable KPI spec:**

- `kpi.name` + `kpi.definition` (formula)
- `kpi.source` (command, GitHub API read, file parse)
- `kpi.target` (threshold + timeframe)
- `kpi.scope` (branch, directory, package set)
- `kpi.publish_to` (comment/issue/discussion) + "update existing?"

**Standard deterministic payload:**

```json
{
  "kpi": "ci_success_rate",
  "value": 0.92,
  "target": 0.95,
  "window": "last_30_runs",
  "goal_met": false,
  "notes": "2 failures were flaky tests"
}
```

### Cross-repo coordination

- `safe-outputs.dispatch-workflow` is same-repo by default; cross-repo needs `target-repo` plus `allowed-repos` allowlist and a token with `actions: write` on the target.
- For org-wide/multi-org, use a coordinator sending `repository_dispatch` to each target.
  - Requires PAT or GitHub App token with access to every dispatched repo.
  - Prefer fine-grained PAT scoped to specific repos with `Actions: Read & Write`.
  - Keep permissions minimal, lock down inputs.


<!-- file: charts-trending.md -->

---
description: Full trending-analysis patterns, best practices, and reporting guidance for chart workflows.
---
# Charts with Trending

## Option C: Charts with Trending (Full Guide)

Use when you need full trending analysis with cache-memory persistence.

### Frontmatter

```yaml
imports:
  - shared/python-dataviz.md
  - shared/trends.md

tools:
  cache-memory:
    key: charts-trending-${{ github.workflow }}-${{ github.run_id }}

safe-outputs:
  upload-asset:
    max: 3
    allowed-exts: [.png, .jpg, .jpeg, .svg]
```

### Agent Instructions

**Cache-Memory Organization**:

```
/tmp/gh-aw/cache-memory/
├── trending/
│   ├── <metric-name>/
│   │   ├── history.jsonl      # Time-series data (JSON Lines format)
│   │   ├── metadata.json      # Data schema and descriptions
│   │   └── last_updated.txt   # Timestamp of last update
│   └── index.json             # Index of all tracked metrics
```

**Load Historical Data**:

```bash
if [ -f /tmp/gh-aw/cache-memory/trending/issues/history.jsonl ]; then
  echo "Loading historical data..."
  cp /tmp/gh-aw/cache-memory/trending/issues/history.jsonl /tmp/gh-aw/python/data/
else
  echo "No historical data found. Starting fresh."
  mkdir -p /tmp/gh-aw/cache-memory/trending/issues
fi
```

**Append New Data**:

```python
import json
from datetime import datetime

data_point = {
    "timestamp": datetime.now().isoformat(),
    "metric": "issue_count",
    "value": 42,
    "metadata": {"source": "github_api"}
}

with open('/tmp/gh-aw/cache-memory/trending/issues/history.jsonl', 'a') as f:
    f.write(json.dumps(data_point) + '\n')
```

**Load History into DataFrame**:

```python
import pandas as pd, json, os

history_file = '/tmp/gh-aw/cache-memory/trending/issues/history.jsonl'
if os.path.exists(history_file):
    df = pd.read_json(history_file, lines=True)
    df['timestamp'] = pd.to_datetime(df['timestamp'])
    df = df.sort_values('timestamp')
else:
    df = pd.DataFrame()
```

### Trending Analysis Patterns

**Pattern 1: Daily Metrics Tracking**

```python
#!/usr/bin/env python3
import pandas as pd, matplotlib.pyplot as plt, seaborn as sns, json, os
from datetime import datetime

sns.set_style("whitegrid")
sns.set_palette("husl")

history_file = '/tmp/gh-aw/cache-memory/trending/daily_metrics/history.jsonl'
today_data = {
    "timestamp": datetime.now().isoformat(),
    "issues_opened": 5,
    "issues_closed": 3,
    "prs_merged": 2
}

os.makedirs(os.path.dirname(history_file), exist_ok=True)
with open(history_file, 'a') as f:
    f.write(json.dumps(today_data) + '\n')

data = pd.read_json(history_file, lines=True)
data['date'] = pd.to_datetime(data['timestamp']).dt.date
daily_stats = data.groupby('date').sum()

fig, ax = plt.subplots(figsize=(12, 7), dpi=300)
daily_stats.plot(ax=ax, marker='o', linewidth=2)
ax.set_title('Daily Metrics Trends', fontsize=16, fontweight='bold')
ax.set_xlabel('Date', fontsize=12)
ax.set_ylabel('Count', fontsize=12)
ax.legend(loc='best')
ax.grid(True, alpha=0.3)
plt.xticks(rotation=45)
plt.tight_layout()
plt.savefig('/tmp/gh-aw/python/charts/daily_metrics_trend.png',
            dpi=300, bbox_inches='tight', facecolor='white')
```

**Pattern 2: Moving Averages and Smoothing**

```python
df['rolling_avg'] = df['value'].rolling(window=7, min_periods=1).mean()

fig, ax = plt.subplots(figsize=(12, 7), dpi=300)
ax.plot(df['date'], df['value'], label='Actual', alpha=0.5, marker='o')
ax.plot(df['date'], df['rolling_avg'], label='7-day Average', linewidth=2.5)
ax.fill_between(df['date'], df['value'], df['rolling_avg'], alpha=0.2)
```

**Pattern 3: Comparative Trends**

```python
fig, ax = plt.subplots(figsize=(14, 8), dpi=300)
for metric in ['metric_a', 'metric_b', 'metric_c']:
    metric_data = df[df['metric'] == metric]
    ax.plot(metric_data['timestamp'], metric_data['value'],
            marker='o', label=metric, linewidth=2)
ax.set_title('Comparative Metrics Trends', fontsize=16, fontweight='bold')
ax.legend(loc='best', fontsize=12)
ax.grid(True, alpha=0.3)
plt.xticks(rotation=45)
```

**Data Retention (90 days)**:

```python
from datetime import timedelta
cutoff_date = datetime.now() - timedelta(days=90)
df = df[df['timestamp'] >= cutoff_date]
df.to_json('/tmp/gh-aw/cache-memory/trending/history.jsonl', orient='records', lines=True)
```

**Complete Trending Example**:

```python
#!/usr/bin/env python3
import pandas as pd, matplotlib.pyplot as plt, seaborn as sns, json, os
from datetime import datetime, timedelta

CACHE_DIR = '/tmp/gh-aw/cache-memory/trending'
METRIC_NAME = 'github_activity'
HISTORY_FILE = f'{CACHE_DIR}/{METRIC_NAME}/history.jsonl'
CHARTS_DIR = '/tmp/gh-aw/python/charts'

os.makedirs(f'{CACHE_DIR}/{METRIC_NAME}', exist_ok=True)
os.makedirs(CHARTS_DIR, exist_ok=True)

today_data = {
    "timestamp": datetime.now().isoformat(),
    "issues_opened": 8, "prs_merged": 12, "commits": 45, "contributors": 6
}
with open(HISTORY_FILE, 'a') as f:
    f.write(json.dumps(today_data) + '\n')

df = pd.read_json(HISTORY_FILE, lines=True)
df['date'] = pd.to_datetime(df['timestamp']).dt.date
df = df.sort_values('timestamp')
daily_stats = df.groupby('date').sum()

sns.set_style("whitegrid")
sns.set_palette("husl")

fig, axes = plt.subplots(2, 2, figsize=(16, 12), dpi=300)
fig.suptitle('GitHub Activity Trends', fontsize=18, fontweight='bold')

axes[0, 0].plot(daily_stats.index, daily_stats['issues_opened'], marker='o', linewidth=2, color='#FF6B6B')
axes[0, 0].set_title('Issues Opened', fontsize=14)
axes[0, 0].grid(True, alpha=0.3)

axes[0, 1].plot(daily_stats.index, daily_stats['prs_merged'], marker='s', linewidth=2, color='#4ECDC4')
axes[0, 1].set_title('PRs Merged', fontsize=14)
axes[0, 1].grid(True, alpha=0.3)

axes[1, 0].plot(daily_stats.index, daily_stats['commits'], marker='^', linewidth=2, color='#45B7D1')
axes[1, 0].set_title('Commits', fontsize=14)
axes[1, 0].grid(True, alpha=0.3)

axes[1, 1].plot(daily_stats.index, daily_stats['contributors'], marker='D', linewidth=2, color='#FFA07A')
axes[1, 1].set_title('Active Contributors', fontsize=14)
axes[1, 1].grid(True, alpha=0.3)

plt.tight_layout()
plt.savefig(f'{CHARTS_DIR}/activity_trends.png', dpi=300, bbox_inches='tight', facecolor='white')
print(f"✅ Trend chart generated with {len(df)} data points")
```

---

## Trends Visualization Best Practices

### Example Chart Types

**Temporal Trends**:
```python
fig, ax = plt.subplots(figsize=(12, 7), dpi=300)
for column in data.columns:
    ax.plot(data.index, data[column], marker='o', label=column, linewidth=2)
ax.set_title('Trends Over Time', fontsize=16, fontweight='bold')
ax.set_xlabel('Date', fontsize=12)
ax.set_ylabel('Value', fontsize=12)
ax.legend(loc='best')
ax.grid(True, alpha=0.3)
plt.xticks(rotation=45)
```

**Growth Rates**:
```python
fig, ax = plt.subplots(figsize=(10, 6), dpi=300)
growth_data.plot(kind='bar', ax=ax, color=sns.color_palette("husl"))
ax.set_title('Growth Rates by Period', fontsize=16, fontweight='bold')
ax.axhline(y=0, color='black', linestyle='-', linewidth=0.8)
ax.set_ylabel('Growth %', fontsize=12)
```

**Moving Averages**:
```python
fig, ax = plt.subplots(figsize=(12, 7), dpi=300)
ax.plot(dates, values, label='Actual', alpha=0.5, linewidth=1)
ax.plot(dates, moving_avg, label='7-day Moving Average', linewidth=2.5)
ax.fill_between(dates, values, moving_avg, alpha=0.2)
```

### Data Preparation

```python
# Time-based indexing
data['date'] = pd.to_datetime(data['date'])
data.set_index('date', inplace=True)
data = data.sort_index()

# Resampling
weekly_data = data.resample('W').mean()
data['rolling_mean'] = data['value'].rolling(window=7).mean()

# Growth calculations
data['pct_change'] = data['value'].pct_change() * 100
data['yoy_growth'] = data['value'].pct_change(periods=365) * 100
```

### Color Palettes

- **Sequential**: `sns.color_palette("viridis", n_colors=5)`
- **Diverging**: `sns.color_palette("RdYlGn", n_colors=7)`
- **Multiple series**: `sns.color_palette("husl", n_colors=8)`
- **Categorical**: `sns.color_palette("Set2", n_colors=6)`

### Annotation

```python
max_idx = data['value'].idxmax()
max_val = data['value'].max()
ax.annotate(f'Peak: {max_val:.2f}',
            xy=(max_idx, max_val),
            xytext=(10, 20),
            textcoords='offset points',
            arrowprops=dict(arrowstyle='->', color='red'),
            fontsize=10, fontweight='bold')
```

---

## Embedding Charts in Reports

1. Save chart to `/tmp/gh-aw/python/charts/`
2. Upload via `upload asset` tool → raw GitHub URL
3. Embed: `![Chart description](URL_FROM_UPLOAD_ASSET)`

Assets are published to an orphaned git branch and become URL-addressable after workflow completion.

Example report:

```markdown
## 📈 Trending Analysis

![Activity Trends](URL_FROM_UPLOAD_ASSET)

Analysis shows:
- Issues opened: Up 15% from last week
- PR velocity: Stable at 12 PRs/day
- Active contributors: Growing trend (+20% this month)

**Data**: {count} points | **Range**: {start} to {end}
```

---

## Session Analysis Chart Pattern

For Copilot coding agent session data, generate two charts:

**Chart 1: Session Completion Trends** — multi-line: successful (green), failed/abandoned (red), completion rate % (secondary y-axis). X: last 30 days. Save as `/tmp/gh-aw/python/charts/session_completion_trends.png`.

**Chart 2: Session Duration & Efficiency** — avg duration (line), median (line), sessions with loops (bar overlay). X: last 30 days. Y: minutes. Save as `/tmp/gh-aw/python/charts/session_duration_trends.png`.

**Data files**:
- `session_completion.csv` — date, successful, failed, completion_rate
- `session_duration.csv` — date, avg_duration_min, median_duration_min, loop_count

If fewer than 7 days of data, use bar charts instead of line charts and note the limited range.

---


<!-- file: charts.md -->

---
description: Guidance for adding Python data visualization to agentic workflows with compact setup patterns and links to the full trending guide.
---

# Python Data Visualization in Agentic Workflows

## Choosing a Shared Workflow

| Import | Best for |
|---|---|
| `shared/trending-charts-simple.md` | Quick setup with cache-memory-backed trend charts |
| `shared/python-dataviz.md` | One-off charts from current-run data |
| `shared/charts-with-trending.md` | Full trending analysis with richer historical guidance |

Default to `shared/trending-charts-simple.md` for new charting workflows.

If the shared files are not present locally, import them with:

```bash
gh aw add githubnext/agentics/python-dataviz
```

## Option A: Trending Charts (Simple)

Use when you need trend charts with cache-memory persistence and minimal configuration.

```yaml
tools:
  cache-memory:
    key: trending-data-${{ github.workflow }}-${{ github.run_id }}
  bash:
    - "*"
network:
  allowed:
    - defaults
    - python
steps:
  - name: Setup Python environment
    run: |
      mkdir -p /tmp/gh-aw/python/{data,charts,artifacts}
      pip install --user --quiet numpy pandas matplotlib seaborn scipy
safe-outputs:
  upload-asset:
    max: 3
    allowed-exts: [.png, .jpg, .jpeg, .svg]
```

Agent guidance:

- write data to `/tmp/gh-aw/python/data/`
- write charts to `/tmp/gh-aw/python/charts/`
- append history to `/tmp/gh-aw/cache-memory/trending/<metric>/history.jsonl`
- use ISO 8601 timestamps
- generate charts at 300 DPI with clear labels

## Option B: Current-Run Charts Only

Use when the workflow needs charts from current data without historical tracking.

```yaml
tools:
  cache-memory: true
  bash:
    - "*"
network:
  allowed:
    - defaults
    - python
safe-outputs:
  upload-asset:
    max: 3
    allowed-exts: [.png, .jpg, .jpeg, .svg]
steps:
  - name: Setup Python environment
    run: |
      mkdir -p /tmp/gh-aw/python/{data,charts,artifacts}
      pip install --user --quiet numpy pandas matplotlib seaborn scipy
```

Rules:

- never inline dataset values directly in Python code
- store input data in files and load with pandas
- keep reusable helpers in cache-memory when that improves later runs
- save chart images under `/tmp/gh-aw/python/charts/`

## Full Trending Guide

Load [charts-trending.md](charts-trending.md) only when you need:

- detailed historical-data layouts
- moving averages, comparative trends, and retention patterns
- reporting templates with embedded chart assets
- session-analysis chart patterns


<!-- file: cli-commands.md -->

---
description: Complete reference for gh aw CLI commands and their MCP tool equivalents for restricted environments
---

# gh aw CLI Commands Reference

## CLI vs MCP Tool — When to Use Each

| Environment | Use |
|---|---|
| **Local development** (terminal with `gh` auth) | `gh aw <command>` CLI |
| **GitHub Copilot Cloud** (coding agent, Copilot Chat) | `agentic-workflows` MCP tool |
| **GitHub Actions workflow step** | `gh aw <command>` after installing `github/gh-aw/actions/setup-cli` |
| **CI runner without gh auth** | `agentic-workflows` MCP tool |

> [!NOTE]
> **agentic-workflows MCP tool availability**
>
> The MCP tool is available when `agentic-workflows:` is added to a workflow's `tools:` section. In Copilot Chat / Copilot coding agent, it is pre-configured and always available.
>
> In a GitHub Actions workflow step, install the CLI first:
> ```yaml
> - uses: github/gh-aw/actions/setup-cli@<version>
> - run: gh aw compile
> ```

---

## Command Reference

### `gh aw init`

Initialize a repository for agentic workflows.

```bash
gh aw init
```

Creates `.github/skills/agentic-workflows/SKILL.md` and supporting files.

**MCP equivalent**: Not available — run from a local terminal or use the `upgrade` tool for updates.

---

### `gh aw compile`

Compile workflow `.md` files into GitHub Actions `.lock.yml` files.

```bash
gh aw compile                     # Compile all workflows
gh aw compile <workflow-name>     # Compile a specific workflow
gh aw compile --strict            # Compile with strict mode validation
gh aw compile --validate          # Validate without emitting lock files
gh aw compile --fail-fast         # Stop at first error
gh aw compile --purge             # Remove orphaned .lock.yml files
gh aw compile --approve           # Approve new secrets / action changes
```

**MCP equivalent**: `compile` tool

---

### `gh aw run`

> [!IMPORTANT]
> **Always prefer `gh aw run` over `gh workflow run <file>.lock.yml`** — it handles workflow resolution by short name, validates inputs, and enables correct run-tracking with `gh aw audit` and `gh aw logs`.

Trigger a workflow on demand using `workflow_dispatch`.

```bash
gh aw run                           # Interactive mode — pick workflow and fill inputs
gh aw run <workflow-name>           # Run by short name
gh aw run <workflow-name>.md        # Alternative: explicit .md extension
gh aw run <workflow-name> --ref main              # Run on a specific branch/tag/SHA
gh aw run <workflow-name> --repeat 3              # Run 4 times total (1 + 3 repeats)
gh aw run <workflow-name> -F key=value            # Pass a specific input (alias: --raw-field)
```

**MCP equivalent**: Not available. Fallback: use the GitHub MCP server's `create_workflow_dispatch` with `workflow_id: <workflow-name>.lock.yml`.

---

### `gh aw logs`

Download and analyze workflow execution logs.

```bash
gh aw logs                          # Logs for all agentic workflows
gh aw logs <workflow-name>          # Logs for a specific workflow
gh aw logs <workflow-name> --json   # JSON output for programmatic use
gh aw logs --engine copilot         # Filter by engine
gh aw logs -c 10                    # Last 10 runs
gh aw logs --start-date -1w         # Last week's runs
gh aw logs --start-date 2024-01-01 --end-date 2024-01-31
gh aw logs -o ./workflow-logs       # Save to directory
gh aw logs --repo owner/repo        # Query logs in another repository
```

**MCP equivalent**: `logs` tool

---

### `gh aw audit`

Investigate a specific workflow run in detail (missing tools, safe outputs, metrics).

```bash
gh aw audit <run-id>                # Audit a single run
gh aw audit <run-id> --json         # JSON output
gh aw audit <base-id> <compare-id>  # Diff two runs (regression detection)
gh aw audit <id1> <id2> <id3> --json  # Multi-run diff
```

**MCP equivalent**: `audit` tool (single run) / `audit-diff` tool (multi-run comparison)

---

### `gh aw status`

Show the status of all agentic workflows in the repository.

```bash
gh aw status
gh aw status --repo owner/repo    # Query status in another repository
```

**MCP equivalent**: `status` tool

---

### `gh aw checks`

Show check run results for a workflow run.

```bash
gh aw checks <run-id>
```

**MCP equivalent**: `checks` tool

---

### `gh aw fix`

Apply automatic codemods to fix deprecated fields in workflow files.

```bash
gh aw fix                   # Preview changes (dry run)
gh aw fix --write           # Apply changes
```

**MCP equivalent**: `fix` tool

---

### `gh aw upgrade`

Upgrade the repository's agentic workflows configuration to the latest gh-aw version.

```bash
gh aw upgrade               # Upgrade agent files + codemods + compile
gh aw upgrade -v            # Verbose output
gh aw upgrade --no-fix      # Skip codemods and compilation
gh aw upgrade --create-pull-request         # Open a PR with the upgrade changes (alias: --pr)
gh aw upgrade --org my-org                  # Preview upgrade PRs across an organization
gh aw upgrade --org my-org --repos '*-service'  # Limit org mode to matching repos
gh aw upgrade --org my-org --create-issue   # Open issues in org repos with agentic workflows (requires --org)
```

**MCP equivalent**: `upgrade` tool

---

### `gh aw add`

Add a new shared workflow component as an import.

```bash
gh aw add <workflow-url>
```

**MCP equivalent**: `add` tool

---

### `gh aw update`

Update imported shared workflow components.

```bash
gh aw update                                # Update all workflows from source
gh aw update <workflow-name>                # Update a specific workflow
gh aw update --major                        # Allow major version updates
gh aw update --create-pull-request          # Update and open a PR (alias: --pr)
gh aw update --repo owner/repo              # Update workflows in another repository (isolated shallow checkout)
gh aw update --cool-down 3d                 # Custom cooldown before applying pending releases
```

**MCP equivalent**: `update` tool

---

### `gh aw deploy`

Deploy workflows to a target repository (chains update, add, compile --purge, opens a PR). `--repo` is required.

```bash
gh aw deploy <workflow>... --repo owner/repo            # Deploy listed workflows
gh aw deploy githubnext/agentics/ci-doctor --repo o/r   # Deploy a shared workflow
gh aw deploy ./local-workflow.md --repo owner/repo      # Deploy a local workflow
gh aw deploy <workflow> --repo owner/repo --force       # Overwrite without confirmation
```

**MCP equivalent**: Not available — run from a local terminal or invoke the CLI inside a workflow step with `github/gh-aw/actions/setup-cli`.

---

### `gh aw env`

Manage compiler default variables (`GH_AW_DEFAULT_*`) as repo/org/enterprise GitHub Actions variables. YAML file uses lowercase `default_*` keys. `null` deletes the variable; any non-null string sets it (`""` = set-to-empty, not delete).

```bash
gh aw env get [file]                          # Download defaults to file.yml (default name)
gh aw env get --scope org --org myorg         # Org-scope export
gh aw env update file.yml --scope repo        # Apply with interactive confirmation
gh aw env update file.yml --scope ent --enterprise myent --yes  # Skip confirmation
gh aw env update file.yml --scope repo --dry-run  # Preview without applying
```

Example file:

```yaml
default_max_ai_credits: "1000"
default_max_turn_cache_misses: "5"
default_detection_max_ai_credits: "400"
default_max_turns: "12"
default_model_copilot: "gpt-5-mini"
default_model_codex: null   # delete this variable
```

Recognized keys include `default_max_ai_credits`, `default_max_turn_cache_misses`, `default_detection_max_ai_credits`, `default_max_daily_ai_credits`, `default_timeout_minutes`, `default_max_turns`, `default_detection_model`, `default_utc`, `default_model_copilot`, `default_model_claude`, `default_model_codex`. The compiler resolves model selection as `GH_AW_MODEL_*` → `GH_AW_DEFAULT_MODEL_*` → built-in engine fallback.

**MCP equivalent**: Not available — run from a local terminal.

---

### `gh aw mcp inspect`

Inspect and analyze MCP server configurations in workflows.

```bash
gh aw mcp inspect <workflow-name>
gh aw mcp inspect <workflow-name> --inspector   # Launch web-based inspector UI
gh aw mcp list                                   # List workflows with MCP servers
```

**MCP equivalent**: `mcp-inspect` tool

---

## MCP Tool ↔ CLI Quick Reference

| CLI command | MCP tool |
|---|---|
| `gh aw status` | `status` |
| `gh aw compile` | `compile` |
| `gh aw run` | *(use GitHub MCP `create_workflow_dispatch`)* |
| `gh aw logs` | `logs` |
| `gh aw audit` | `audit` |
| `gh aw audit <id1> <id2>` | `audit-diff` |
| `gh aw checks` | `checks` |
| `gh aw mcp inspect` | `mcp-inspect` |
| `gh aw add` | `add` |
| `gh aw update` | `update` |
| `gh aw fix` | `fix` |
| `gh aw upgrade` | `upgrade` |
| `gh aw deploy` | *(local only)* |
| `gh aw env` | *(local only)* |
| `gh aw init` | *(local only)* |


<!-- file: context.md -->

---
description: GitHub context expression variables and Handlebars-style template conditionals ({{#if}}) for agentic workflows.
---

## GitHub Context Expression Interpolation

**For security reasons, only specific expressions are allowed.**

### Allowed Context Variables

- **`${{ github.event.after }}`** - SHA of the most recent commit after the push
- **`${{ github.event.before }}`** - SHA of the most recent commit before the push
- **`${{ github.event.check_run.id }}`** - ID of the check run
- **`${{ github.event.check_suite.id }}`** - ID of the check suite
- **`${{ github.event.comment.id }}`** - ID of the comment
- **`${{ github.event.deployment.id }}`** - ID of the deployment
- **`${{ github.event.deployment_status.id }}`** - ID of the deployment status
- **`${{ github.event.head_commit.id }}`** - ID of the head commit
- **`${{ github.event.installation.id }}`** - ID of the GitHub App installation
- **`${{ github.event.issue.number }}`** - Issue number
- **`${{ github.event.issue.state }}`** - State of the issue (open/closed)
- **`${{ github.event.issue.title }}`** - Title of the issue
- **`${{ github.event.label.id }}`** - ID of the label
- **`${{ github.event.milestone.id }}`** - ID of the milestone
- **`${{ github.event.milestone.number }}`** - Number of the milestone
- **`${{ github.event.organization.id }}`** - ID of the organization
- **`${{ github.event.page.id }}`** - ID of the GitHub Pages page
- **`${{ github.event.project.id }}`** - ID of the project
- **`${{ github.event.project_card.id }}`** - ID of the project card
- **`${{ github.event.project_column.id }}`** - ID of the project column
- **`${{ github.event.pull_request.number }}`** - Pull request number
- **`${{ github.event.pull_request.state }}`** - State of the pull request (open/closed)
- **`${{ github.event.pull_request.title }}`** - Title of the pull request
- **`${{ github.event.pull_request.head.sha }}`** - SHA of the PR head commit
- **`${{ github.event.pull_request.base.sha }}`** - SHA of the PR base commit
- **`${{ github.event.discussion.number }}`** - Discussion number
- **`${{ github.event.discussion.title }}`** - Title of the discussion
- **`${{ github.event.discussion.category.name }}`** - Category name of the discussion
- **`${{ github.event.release.assets[0].id }}`** - ID of the first release asset
- **`${{ github.event.release.id }}`** - ID of the release
- **`${{ github.event.release.name }}`** - Name of the release
- **`${{ github.event.release.tag_name }}`** - Tag name of the release
- **`${{ github.event.repository.id }}`** - ID of the repository
- **`${{ github.event.repository.default_branch }}`** - Default branch of the repository
- **`${{ github.event.review.id }}`** - ID of the review
- **`${{ github.event.review_comment.id }}`** - ID of the review comment
- **`${{ github.event.sender.id }}`** - ID of the user who triggered the event
- **`${{ github.event.deployment.environment }}`** - Deployment environment name
- **`${{ github.event.workflow_job.id }}`** - ID of the workflow job
- **`${{ github.event.workflow_job.run_id }}`** - Run ID of the workflow job
- **`${{ github.event.workflow_run.id }}`** - ID of the workflow run
- **`${{ github.event.workflow_run.number }}`** - Number of the workflow run
- **`${{ github.event.workflow_run.conclusion }}`** - Conclusion of the workflow run
- **`${{ github.event.workflow_run.status }}`** - Status of the workflow run
- **`${{ github.event.workflow_run.event }}`** - Event that triggered the workflow run
- **`${{ github.event.workflow_run.html_url }}`** - HTML URL of the workflow run
- **`${{ github.event.workflow_run.head_sha }}`** - Head SHA of the workflow run
- **`${{ github.event.workflow_run.run_number }}`** - Run number of the workflow run
- **`${{ github.actor }}`** - Username of the person who initiated the workflow
- **`${{ github.event_name }}`** - Name of the event that triggered the workflow
- **`${{ github.job }}`** - Job ID of the current workflow run
- **`${{ github.repository }}`** - Repository name in "owner/name" format
- **`${{ github.repository_owner }}`** - Owner of the repository (organization or user)
- **`${{ github.run_id }}`** - Unique ID of the workflow run
- **`${{ github.run_number }}`** - Number of the workflow run
- **`${{ github.server_url }}`** - Base URL of the server, e.g. <https://github.com>
- **`${{ github.workflow }}`** - Name of the workflow
- **`${{ github.workspace }}`** - The default working directory on the runner for steps

#### Special Pattern Expressions

- **`${{ needs.* }}`** - Any outputs from previous jobs (e.g., `${{ needs.pre_activation.outputs.activated }}`, or `${{ needs.activation.outputs.label_command }}` for the triggering label when using a `label_command` trigger). The activation job cannot reference its own outputs—only jobs after activation can.
- **`${{ steps.* }}`** - Any outputs from previous steps (e.g., `${{ steps.my-step.outputs.result }}`)
- **`${{ github.event.inputs.* }}`** - Any workflow inputs when triggered by workflow_dispatch (e.g., `${{ github.event.inputs.environment }}`)

All other expressions are disallowed.

### Sanitized Context Text (`steps.sanitized.outputs.text`)

**RECOMMENDED**: Prefer `${{ steps.sanitized.outputs.text }}` over individual `github.event` fields for issue/PR content.

Auto-populated per triggering event:

- **Issues / Pull Requests**: `title + "\n\n" + body`
- **Issue Comments / PR Review Comments**: `comment.body`
- **PR Reviews**: `review.body`
- **Other events**: Empty string

**Security Benefits:**

- **@mention neutralization**: converts `@user` to `` `@user` ``
- **Bot trigger protection**: converts `fixes #123` to `` `fixes #123` ``
- **XML tag safety**: converts XML tags to parentheses to prevent injection
- **URI filtering**: only allows HTTPS URIs from trusted domains; others become "(redacted)"
- **Content limits**: truncates to 0.5MB / 65k lines max
- **Control character removal**: strips ANSI escape sequences and non-printable characters

### Security Validation

Expression safety is validated at compile time. Unauthorized expressions cause compilation to fail with an error listing them.

### Example Usage

```markdown
# Valid — prefer sanitized text
Analyze issue #${{ github.event.issue.number }} in repository ${{ github.repository }}.
The issue content is: "${{ steps.sanitized.outputs.text }}"

# Valid — individual fields (less secure)
Created by ${{ github.actor }} with title: "${{ github.event.issue.title }}"
Deploy to environment: "${{ github.event.inputs.environment }}"

# Invalid (compile errors)
# ${{ secrets.GITHUB_TOKEN }}
# ${{ env.MY_VAR }}
# ${{ toJson(github.workflow) }}
```

## Prompt Template Conditionals (`{{#if}}`)

Conditional blocks resolved **at runtime, before the agent sees the prompt** — the agent only sees the final resolved text.

### Syntax

```
{{#if <condition>}}
...true branch content...
{{#else}}
...false branch content (optional)...
{{#endif}}
```

- **`{{#if <condition>}}`** — content included only when truthy
- **`{{#else}}`** — optional false-branch separator
- **`{{#endif}}`** — closes block (**preferred** closing tag)
- **`{{/if}}`** — alternate closing tag (also supported)

Block form (tag on its own line) is recommended for readability.

### Supported Conditions

| Form | Example | Truthy when |
|---|---|---|
| Bare value | `{{#if experiments.flag }}` | value is non-empty and not `"false"` |
| Equality | `{{#if experiments.style == "concise" }}` | value equals the quoted string |
| Inequality | `{{#if experiments.style != "verbose" }}` | value does not equal the quoted string |
| Strict equality | `{{#if experiments.style === "concise" }}` | value strictly equals the quoted string |
| Strict inequality | `{{#if experiments.style !== "verbose" }}` | value strictly differs from the quoted string |

### Example: Conditional Without Else

```markdown
{{#if experiments.skill_hint == "enabled" }}
Check `skills/` for SKILL.md files relevant to this task and apply their guidance.
{{#endif}}
```

### Example: Conditional With Else

```markdown
{{#if experiments.output_style == "concise" }}
Write a maximum of 5 bullet points. Each bullet is one sentence.
{{#else}}
Write a structured report with sections for new features, bug fixes, and refactors.
Include a one-paragraph executive summary at the top.
{{#endif}}
```

### Integration with Experiments

When `experiments:` is set in frontmatter, the selected variant is substituted into `{{#if experiments.<name> == "..." }}` conditions before rendering. See [A/B Testing Experiments](../aw/experiments.md).

### Notes

- **Fenced code blocks are preserved** — `{{#if}}` tags inside `` ``` `` blocks appear verbatim.
- **No nested conditionals** — inner tags become literal text.
- **Tags are stripped before the agent runs** — never visible in the final prompt.


<!-- file: create-agentic-workflow.md -->

---
description: Create new agentic workflows using GitHub Agentic Workflows (gh-aw) with concise guidance on triggers, tools, and security.
disable-model-invocation: true
---

# GitHub Agentic Workflow Creator

Create new workflow files under `.github/workflows/` using the installed `gh aw` CLI.

## Load These References First

- [github-agentic-workflows.md](github-agentic-workflows.md)
- [workflow-editing.md](workflow-editing.md)
- [workflow-constraints.md](workflow-constraints.md)
- [workflow-patterns.md](workflow-patterns.md)
- [safe-outputs.md](safe-outputs.md)
- [syntax.md](syntax.md)
- [mcp-clis.md](mcp-clis.md)

Load these topic files only when relevant:

- [campaign.md](campaign.md) for campaign, KPI, pacing, cadence, or `stop-after`
- [experiments.md](experiments.md) for experiments, A/B tests, variants, or prompt comparisons
- [visual-regression.md](visual-regression.md) for screenshot comparison workflows
- [deployment-status.md](deployment-status.md) for external deployment monitoring
- [charts.md](charts.md) for chart-generation workflows
- [report.md](report.md) for reporting output structure and recurring report lifecycle

## Two Modes

### Interactive mode

Start with exactly:

> What do you want to automate today?

Then ask only the next question needed.

### Issue-form mode

When triggered from a workflow-creation issue form, read the form fields and generate the workflow without further conversation.

## Conversation Rules

- Keep the conversation short and iterative.
- Translate user intent into workflow structure.
- Ask about the trigger, desired action, and required write outputs.
- When the user asks for exploration, evaluation, or scenario design rather than file creation, stay in ad hoc evaluation mode.
- In ad hoc evaluation mode, do not create `.github/workflows/*.md`.
- Do not overwhelm the user with long option dumps unless they ask.
- If the request exceeds the single-job model, explain the constraint and recommend traditional GitHub Actions.

## Ad Hoc Evaluation Mode

Use this mode for exploratory testing, persona walkthroughs, and "what workflow would you create for this scenario?" requests.

- Do not create or edit workflow files.
- Return a compact recommendation covering trigger, any scoped `paths:` filters for file-event triggers, read tools, safe outputs, permissions, and explicit `noop` criteria.
- For recurring reports or digests, always include the report window, grouping dimensions, and deduplication key. See [triggers.md](triggers.md) for key-format examples.
- Exit ad hoc evaluation mode only when the user explicitly asks to create, implement, or write the workflow file.
- End by offering to turn the recommendation into `.github/workflows/<workflow-id>.md` if the user wants to proceed.

## Design Checklist

### 1. Pick the workflow ID

- Derive kebab-case from the workflow name.
- Before creating the file, check whether `.github/workflows/<workflow-id>.md` already exists.
- If it exists, choose a more specific ID instead of overwriting.

### 2. Choose the trigger

Use the smallest trigger that matches the request. See [triggers.md](triggers.md) for the full decision matrix, scenario examples, and `noop` patterns.

Common mappings:

- issue automation → `on: issues:`
- pull request automation → `on: pull_request:`
- scheduled reporting → fuzzy `schedule:` such as `daily on weekdays`
- on-demand comments → `slash_command`
- UI-driven actions → `label_command`
- GitHub Actions pipeline monitoring → `workflow_run`
- external deployment monitoring → `deployment_status`

Use [workflow-patterns.md](workflow-patterns.md) and [triggers.md](triggers.md) for trigger-selection guidance.

### 2a. Reporting and digest compact guidance

For recurring reports, audits, and stakeholder digests, set these create-specific defaults:

- default to `create-issue`; use `create-discussion` only when the requester explicitly wants threaded discussion
- use `add-comment` only when updating an existing issue or pull request instead of creating a new report destination
- add `workflow_dispatch` when manual reruns, backfills, or preview runs should be possible

Recurring-report lifecycle checklist (compact):

- [ ] enable `close-older-issues: true` for issue-based recurring reports unless the requester explicitly wants parallel open threads
- [ ] define one explicit report window (for example `last 7 full days ending at run start (UTC)` or `since previous successful run`)
- [ ] define grouping dimensions that match audience decisions (for example team, area, owner, severity, status)
- [ ] derive one stable dedup key per scope and window (for example `stakeholder-digest:<scope>:<window-id>`) and search for it before creating a new issue

Follow [triggers.md](triggers.md) for the report window, grouping dimensions, deduplication key, and empty-window `noop` rule, and [workflow-patterns.md](workflow-patterns.md) for the digest/incident skeletons. When the digest depends on missing or inconsistent metadata, group by the next-best available dimension, use an explicit "Unclassified" bucket, and never invent classifications — call `noop` only when the window itself has zero events.

### 2aa. Persona-oriented scenario quick map

Use these defaults when the requester frames the automation in non-engineering persona language:

| Persona or scenario | Trigger and scope | Typical tools and outputs | Required prompt details |
|---|---|---|---|
| Program Manager or information-worker digest | `schedule` plus `workflow_dispatch` for previews, reruns, and backfills | `github` (`gh-proxy`); `create-issue` by default | Define the report window, grouping dimensions, deduplication key, and `noop` behavior for empty windows |
| Designer or design-governance review | `pull_request` with `paths:` scoped to UI, design-token, copy, or asset files | `github` (`gh-proxy`); optional `playwright`; `add-comment` on the PR | State the review rubric (for example accessibility, token consistency, asset policy), and call `noop` when scoped files are unchanged |
| Legal / compliance / documentation-policy review | `pull_request` with scoped `paths:` or `schedule` for recurring audits | `github` (`gh-proxy`); `add-comment` for findings; `create-issue` only for violations needing follow-up | Classify findings against the policy, search for existing open issues before escalating, and call `noop` when there is no in-scope change or violation |

### 2b. Backend review compact guidance

For backend-focused PR automation (schema migrations and API compatibility):

- scope `pull_request.paths` to backend contract indicators instead of whole-repo review
- instruct the agent to classify changes as additive, backward-compatible, or breaking, then report only actionable risks
- include explicit `noop` criteria when no migration/API contract files changed

### 2c. PR analyzer escalation guidance

For PR-triggered automation that must decide between commenting, creating an issue, or doing nothing:

| Condition | Action |
|---|---|
| Findings affect only this PR (style, quality, risk) | `add-comment` on the PR |
| Finding is a cross-cutting or team-wide concern requiring follow-up beyond this PR | `create-issue` |
| No findings, or only docs/metadata changed outside scoped `paths:` | `noop` |

Rules:

- prefer `add-comment` over `create-issue` for PR-local findings; issues outlive the PR and create noise
- before creating an issue, search for an existing open issue covering the same concern (use a stable title prefix or label to avoid duplicates)
- if a matching open issue already exists, add a linked `add-comment` on the PR referencing it instead of opening a duplicate issue
- call `noop` explicitly whenever no actionable finding exists — do not comment with "no issues found" text

### 2d. Compliance review compact guidance

For dependency-license compliance and policy review on PRs:

- scope `pull_request.paths` to dependency manifest files (for example `package.json`, `go.mod`, `requirements.txt`, `Cargo.toml`, `pyproject.toml`, `composer.json`)
- classify each new dependency by license tier using the project's configured policy (the example tiers below represent a common MIT-compatible policy; adjust for your project): **allowed** (MIT, Apache-2.0, BSD, ISC), **needs-review** (unknown, dual-licensed, weak-copyleft), **blocked** (strong-copyleft such as GPL/AGPL, proprietary, or licenses incompatible with your project's license)
- publish per-tier findings with `add-comment` listing each dependency, its version, and detected license
- escalate to `create-issue` only when a **blocked** dependency was added or a policy violation requires team-wide follow-up beyond this PR
- before creating a new issue, search for an existing open issue with a stable key (for example `license-violation + dependency-name + version`) to avoid duplicates; if found, link to it from the PR comment instead
- call `noop` when no new dependencies were added or all additions are confirmed in the allowed tier

**Compliance escalation decision table:**

| Finding | Action |
|---|---|
| No dependency manifest files changed | `noop` immediately |
| All new dependencies in allowed tier | `noop` (or brief `add-comment` confirmation when the workflow prompt explicitly requests a confirmation comment) |
| Dependencies in needs-review tier | `add-comment` listing them with license details and requesting maintainer confirmation |
| Blocked dependency added | `add-comment` flagging the violation + `create-issue` for team-wide record (skip `create-issue` if a matching open issue already exists) |

### 2e. Coverage-analysis compact guidance

For workflows that read, analyze, or comment on test coverage (PR comments, trend tracking, coverage gates):

- **Prefer existing artifacts**: check for a coverage artifact from the current or parent CI run before recomputing; use `actions: read` via `gh-proxy` to list and download artifacts.
- **Prefer PR signals**: read existing check run annotations or coverage diff comments before fetching raw data; only recompute when no artifact or annotation is available.
- **Explicit fallback**: when no artifact exists, document the fallback computation step in the workflow prompt; never invent coverage values.
- call `noop` when no coverage data can be retrieved or computed and there is no meaningful output to report.

See [test-coverage.md](test-coverage.md) for the full coverage data strategy.

### 3. Keep permissions read-only

The main agent job must stay read-only.

- Do not grant `issues: write`, `pull-requests: write`, or `contents: write` to the agent job.
- Route GitHub writes through `safe-outputs:`.
- When targeting the Copilot coding agent, recommend `permissions: { copilot-requests: write }` so Copilot can authenticate with `${{ github.token }}`.
- If the user asks for direct writes, explain why the safe-output pattern is required.

### 4. Select tools

- `bash` and `edit` are enabled by default in sandboxed workflows; do not add them unless you are restricting them.
- For GitHub reads, prefer `tools.github.mode: gh-proxy` and instruct the agent to use `gh` commands.
- For non-GitHub MCP servers, prefer `tools.cli-proxy: true` and instruct the agent to use the mounted `mcp-clis` commands.
- Combined configuration example for GitHub reads plus non-GitHub MCP CLI access:

  ```yaml
  tools:
    github:
      mode: gh-proxy
      toolsets: [default]
    cli-proxy: true
  ```

  Omit `cli-proxy: true` when the workflow only needs GitHub reads.

- Suggest `playwright` for browser automation.
- Suggest dedicated topic files rather than embedding long tutorials in the prompt.

### 5. Infer network access from repository files

Do not ask for the ecosystem if it can be inferred from the repository.

Common mappings:

- `.csproj`, `.fsproj`, `*.sln`, `*.slnx`, `global.json` → `dotnet`
- `requirements.txt`, `pyproject.toml`, `setup.py`, `uv.lock` → `python`
- `package.json`, `.nvmrc`, `yarn.lock`, `pnpm-lock.yaml` → `node`
- `go.mod`, `go.sum` → `go`
- `pom.xml`, `build.gradle`, `build.gradle.kts` → `java`
- `Gemfile`, `*.gemspec` → `ruby`
- `Cargo.toml`, `Cargo.lock` → `rust`
- `Package.swift`, `*.podspec` → `swift`
- `composer.json` → `php`
- `pubspec.yaml` → `dart`

Never use `network: defaults` alone for workflows that build, test, or install packages.

### 6. Configure safe outputs

Map write behavior to `safe-outputs:`.

Common mappings:

- create issues → `create-issue`
- add comments → `add-comment`
- create PRs → `create-pull-request`
- add labels → `add-labels`
- attach downloadable files → `upload-artifact`
- publish embeddable assets → `upload-asset`

Rules:

- always restrict `create-pull-request.allowed-files`
- prefer the dedicated safe output instead of shelling out to `gh` for the same mutation
- include `noop` guidance in the prompt so successful no-op runs are explicit
- when using `create-issue`, instruct the agent to provide a meaningful body (20-65000 characters; avoid placeholder-only text)

### 7. Decide who can trigger the workflow

- Default behavior is team-only triggering.
- For community-facing issue triage or other public entrypoints, recommend `roles: all`.

### 8. Add cost-aware triage and context flow

- For high-volume inputs, design a cheap triage step before expensive analysis.
- Require explicit `noop` or safe-output behavior for known, duplicate, stale, or low-value cases.
- Reserve frontier-model reasoning for ambiguous/high-value cases and final synthesis.
- Prefer pull-on-demand context retrieval over prompt-stuffing large logs or API payloads.
- Use deterministic `steps:` plus compact files under `/tmp/gh-aw/` when large data must be preprocessed.

See also: [workflow-patterns.md](workflow-patterns.md), [subagents.md](subagents.md), and [token-optimization.md](token-optimization.md).

### 9. Omit unnecessary defaults

Avoid adding fields just to restate defaults.

Usually omit:

- `engine: copilot`
- unrestricted `bash`
- `edit`
- `timeout-minutes:` unless a custom timeout is needed

## Prompt Requirements

The markdown body should:

- state the workflow goal clearly
- reference the triggering context explicitly
- name the allowed safe outputs when write actions are expected
- instruct the agent to call `noop` when no visible change is needed
- stay concise and task-focused

When the workflow generates reports or markdown output, include these formatting rules only when relevant:

- use GitHub-flavored markdown
- start nested report headings at `###`
- use `<details><summary>...</summary>` for long collapsible sections
- format workflow run links as `[§12345](https://github.com/owner/repo/actions/runs/12345)`

## Issue-Form Mode Procedure

When processing a workflow-creation issue form:

1. extract the workflow name, description, and additional context
2. derive a unique workflow ID
3. infer the trigger, tools, network access, and safe outputs
4. create exactly one workflow markdown file
5. compile it with `gh aw compile <workflow-id>`
6. include the generated `.lock.yml` in the PR

## Recommended Workflow Skeleton

```markdown
---
emoji: 🏷️
description: <brief description>
on:
  issues:
    types: [opened]
permissions:
  contents: read
  issues: read
tools:
  github:
    mode: gh-proxy
    toolsets: [default]
  cli-proxy: true
safe-outputs:
  add-comment:
---

# <Workflow Name>

## Task

<clear instructions>

## Safe Outputs

- Use the configured safe outputs for visible actions.
- Use `noop` with a short explanation when no action is required.
```

## PR-Report Checklist

Before finalizing any `pull_request`-triggered reporting workflow, verify:

- [ ] **Permissions**: `contents: read` + `pull-requests: read` in the agent job; no write permissions
- [ ] **Safe outputs**: `add-comment` for inline findings; `create-issue` for incidents needing follow-up
- [ ] **Network**: infer ecosystem from repository lock files; never use `defaults` alone when packages are installed
- [ ] **`noop` required**: prompt instructs the agent to call `noop` with a brief explanation when no issues are found

## Generated Workflow Quality Checklist

Before finalizing any newly generated workflow, verify:

- [ ] **Trigger fit**: trigger matches user intent and event granularity (for example `pull_request`, `workflow_run`, `deployment_status`, `schedule`, `slash_command`)
- [ ] **Tool fit**: enabled tools are the minimal set needed for reads/analysis (prefer `gh-proxy`; add `playwright`/`cache-memory` only when required)
- [ ] **Safe outputs**: all visible writes route through `safe-outputs:` and include `noop` for explicit no-op outcomes
- [ ] **Permissions**: agent job remains read-only; no direct write scopes granted
- [ ] **Network**: access is inferred from repository ecosystem and avoids `network: defaults` alone for install/build/test workflows
- [ ] **Prompt clarity**: prompt is concise, context-aware, and clearly states expected outputs and stop/no-op behavior

## Generated Workflow Scoping Checklist

Before finalizing any newly generated workflow, verify:

- [ ] **Paths scope**: include `paths:`/`paths-ignore:` when the automation should ignore unrelated files (for backend reviews, include migration/schema/API contract globs)
- [ ] **Labels scope**: define required labels (for example `label_command` names or PR/issue label filters) when label-based routing is expected
- [ ] **Workflow-name scope**: for `workflow_run`, explicitly set `workflows:` to named targets and gate conclusions via `if:` on `${{ github.event.workflow_run.conclusion }}` (for incident triage, prefer failure-only outcomes)
- [ ] **Date-window scope**: for reporting/triage, state the exact window (for example `last 24h`, `since previous run`, `current week`)
- [ ] **Safe-output write contract**: name which safe output is used for each outcome and when `noop` is required instead of a write

## Multi-Repository Requests

For cross-repository workflows:

- enable the GitHub toolsets needed to read external repositories
- configure cross-repo authentication in `safe-outputs:`
- tell the agent to set `target-repo`
- explain that the workflow still cannot wait for external workflows or create multi-job orchestration

Use [workflow-patterns.md](workflow-patterns.md) for the compact cross-repo pattern.

## Final Steps

1. create `.github/workflows/<workflow-id>.md`
2. compile with `gh aw compile <workflow-id>`
3. fix all compile errors
4. create a PR with the workflow file and `.lock.yml`

## Guidelines

- create exactly one workflow `.md` file as the primary deliverable
- keep prompts short, specific, and imperative
- prefer dedicated reference files over repeating large explanations inline
- always compile before finishing
- keep responses concise after the workflow is created


<!-- file: create-shared-agentic-workflow.md -->

---
name: create-shared-agentic-workflow
description: Create shared agentic workflow components that wrap MCP servers using secure, reusable patterns.
disable-model-invocation: true
---

# Shared Agentic Workflow Designer

Create reusable shared workflow components under `.github/workflows/shared/`.

## Load These References First

- [github-agentic-workflows.md](github-agentic-workflows.md)
- [workflow-constraints.md](workflow-constraints.md)
- [shared-safe-jobs.md](shared-safe-jobs.md)
- [safe-outputs.md](safe-outputs.md)

Load these only when relevant:

- [campaign.md](campaign.md)
- [experiments.md](experiments.md)

## Core Rules

- prefer `container:`-based MCP servers
- pin versions when practical
- allow only read-only tools by default
- move writes to built-in safe outputs or custom safe-output jobs
- keep documentation in XML comments in the markdown body, not in frontmatter comments

## Ask First

Start by asking what MCP server or shared component the user wants to integrate.

Then gather:

- the server name
- the documentation URL or repository
- required secrets
- any expected write operations

## File Shape

Shared components are markdown files with frontmatter and an optional markdown body.

```yaml
---
mcp-servers:
  server-name:
    container: "registry/image"
    version: "tag"
    env:
      API_KEY: "${{ secrets.API_KEY }}"
    allowed:
      - read_tool
---
<!-- short documentation -->
```

## MCP Design Flow

1. research the server from the provided docs
2. prefer an official container image when available
3. identify required args, env vars, and mounts
4. create `.github/workflows/shared/<service>-mcp.md`
5. list required secrets clearly for the user
6. inspect available tools with `gh aw mcp inspect`
7. allow only the read-only tools by default
8. document excluded write tools and route them to safe outputs when needed

## Secret Guidance

When secrets are required, explicitly list:

- the secret name
- what it is used for
- where the user must configure it in GitHub Actions

## Tool Allowlist Guidance

- use `allowed:` with a specific list whenever possible
- exclude write tools from shared read-oriented components
- if writes are needed, describe the companion safe-output pattern rather than broadening MCP permissions

## Custom Write Behavior

When a component needs post-agent mutation logic:

- create a safe-output job
- follow [shared-safe-jobs.md](shared-safe-jobs.md)
- keep the schema explicit and typed

## Validation Loop

Use this loop until the component is valid:

```bash
gh aw compile <workflow-name> --strict
gh aw mcp inspect <workflow-name> --server <server-name> -v
```

Iterate on:

- image name and version
- env vars and secrets
- Docker args and mounts
- allowed tools

## Guidelines

- keep one shared file focused on one MCP server or one reusable concern
- prefer containers over raw commands for production use
- keep write access out of the shared component unless it is explicitly implemented as a safe-output job
- keep the generated instructions concise


<!-- file: debug-agentic-workflow.md -->

---
description: Debug and refine agentic workflows using gh-aw CLI tools and focused run-log analysis.
disable-model-invocation: true
---

# GitHub Agentic Workflow Debugger

Help users investigate failing or underperforming workflows in this repository.

## Load These References First

- [github-agentic-workflows.md](github-agentic-workflows.md)
- [workflow-editing.md](workflow-editing.md)
- [safe-outputs.md](safe-outputs.md)
- [syntax.md](syntax.md)

Load these only when relevant:

- [campaign.md](campaign.md)
- [experiments.md](experiments.md)

## Available Commands

```bash
gh aw status
gh aw compile <workflow-name>
gh aw logs <workflow-name> --json
gh aw audit <run-id> --json
gh aw run <workflow-name>
```

If `gh aw` is unavailable or unauthenticated in a workflow environment, use the matching `agentic-workflows` tools instead.

## Start the Conversation

Ask for one of these inputs:

- a workflow name
- a workflow run URL
- a request to list workflows with `gh aw status`

## Fast Path: Run URL Provided

If the user gives a GitHub Actions run URL:

1. extract the run ID
2. run `gh aw audit <run-id> --json`
3. analyze the audit result before asking additional questions

## Two Debug Modes

### 1. Analyze existing logs

Use when the user wants to inspect past runs.

```bash
gh aw logs <workflow-name> --json
```

Focus on:

- failures and warnings
- token usage
- missing tool reports
- execution time
- repeated failure patterns

### 2. Run and audit now

Use when the user wants to reproduce the issue.

1. verify the workflow supports `workflow_dispatch`
2. run `gh aw run <workflow-name>`
3. poll `gh aw audit <run-id> --json` until the run reaches a terminal state
4. inspect the downloaded artifacts

## What to Inspect in Audits

### Missing tools

Check for:

- tools the agent tried to call but could not access
- name mismatches such as wrong prefixes or wrong underscore/hyphen forms
- safe outputs that were referenced in the prompt but not configured in frontmatter

Common fixes:

- correct the tool name in the prompt
- enable the required tool or safe output
- move a write action from shell/GitHub tool usage to `safe-outputs:`

### Key artifacts

Inspect these when available:

- `run_summary.json`
- `agent-stdio.log`
- `safe_outputs.jsonl`
- token-usage artifacts under the firewall audit logs

## Diagnostic Checklist

- permissions and authentication failures
- missing or misconfigured tools
- network allowlist problems
- prompt ambiguity or lack of context
- timeout pressure
- unnecessary token consumption
- expensive model invoked on events that cheap triage could resolve
- expensive model reading large raw logs or payloads that should be queried on demand
- orchestrator context bloated by raw worker/tool output instead of compact summaries
- unbounded sub-agent fan-out or recursive delegation
- safe-output validation failures

## Workflow-Internal Use of `gh aw`

When a generated workflow itself runs `gh aw logs` or `gh aw audit`:

- add `permissions: actions: read`
- install the CLI first with `github/gh-aw/actions/setup-cli`
- do not place the `gh aw` command before the install step

## Fix-and-Validate Loop

When you suggest a fix:

1. point to the exact frontmatter or prompt section
2. explain the reason briefly
3. validate with `gh aw compile <workflow-name>`
4. suggest another run only after the workflow compiles

When token cost is part of the issue, compare before/after runs with `gh aw audit` and inspect `aic`, input/output tokens, and cache read/write tokens. Treat quality regressions as failures even when token usage drops.

## Final Response Rules

End with:

- the root cause or most likely cause
- the concrete fix
- the validation command
- whether the user should run the workflow again

Keep it concise and actionable.


<!-- file: dependabot.md -->

---
description: Instructions for fixing Dependabot PRs that update dependencies in generated workflow manifest files
disable-model-invocation: true
---

You are specialized in **fixing Dependabot PRs for GitHub Agentic Workflows dependency manifests**.
Read the ENTIRE content of this file carefully before proceeding. Follow the instructions precisely.

# Fixing Dependabot PRs for Agentic Workflow Dependencies

> [!WARNING]
> **Never directly merge Dependabot PRs that modify generated files** such as `.github/workflows/package.json`, `.github/workflows/requirements.txt`, or `.github/workflows/go.mod`. These files are generated by the `gh aw` compiler and any direct changes will be overwritten on the next compilation.

## Background

The `gh aw compile --dependabot` command scans all agentic workflow files (`.github/workflows/*.md`) for runtime tool dependencies and generates manifest files:

| Manifest | Ecosystem | Full Path |
|----------|-----------|-----------|
| `package.json` / `package-lock.json` | npm | `.github/workflows/package.json` / `.github/workflows/package-lock.json` |
| `requirements.txt` | pip | `.github/workflows/requirements.txt` |
| `go.mod` | Go | `.github/workflows/go.mod` |

When Dependabot opens PRs to update these dependencies, the fix must be applied to the **source `.md` workflow files**, not the generated manifests.

## Fix Strategy: Bundle Multiple PRs

Rather than fixing Dependabot PRs one by one, **bundle all pending fixes into a single commit**:

1. **Find all open Dependabot PRs** targeting the generated manifest files
2. **Identify the source `.md` files** for each dependency
3. **Apply all version updates** to the `.md` files in one pass
4. **Regenerate the manifests** with a single `gh aw compile --dependabot`
5. **Commit and push** — Dependabot will auto-close the resolved PRs

## Step-by-Step Instructions

### 1. List Open Dependabot PRs

Use GitHub tools to list all open Dependabot PRs:

```bash
gh pr list --author "app/dependabot" --state open
```

Filter for PRs affecting generated workflow manifests (title contains `Bump` or similar, files include `.github/workflows/package.json`, `.github/workflows/requirements.txt`, or `.github/workflows/go.mod`).

### 2. Identify Source `.md` Files

For each outdated dependency, find which workflow files reference it:

```bash
# For npm packages (e.g., @playwright/test)
grep -r "@playwright/test" .github/workflows/*.md .github/workflows/shared/

# For pip packages (e.g., requests)
grep -r "requests==" .github/workflows/*.md

# For Go packages
grep -r "golang.org/x/tools" .github/workflows/*.md
```

### 3. Update Versions in `.md` Files

Edit the workflow files to use the updated dependency versions:

```bash
# Example: Update @playwright/test from 1.41.0 to 1.42.0
# Find:    npx @playwright/test@1.41.0
# Replace: npx @playwright/test@1.42.0
```

For **MCP server transitive dependencies**, update the shared MCP config:

```bash
# Locate the shared MCP configuration
grep -r "@sentry/mcp-server" .github/workflows/shared/

# Update the version in the args array:
# args: ["@sentry/mcp-server@0.27.0"] → args: ["@sentry/mcp-server@0.29.0"]
```

### 4. Regenerate Manifests

After updating all `.md` files, regenerate the manifests:

```bash
gh aw compile --dependabot
```

This updates `.github/workflows/package.json`, `.github/workflows/requirements.txt`, and `.github/workflows/go.mod` from the updated `.md` file versions.

If `.github/workflows/package-lock.json` also needs updating:

```bash
cd .github/workflows && npm install --package-lock-only && cd -
```

### 5. Verify and Commit

```bash
# Review the changes
git diff .github/workflows/

# Stage and commit all dependency updates together
git add .github/workflows/ .github/aw/
git commit -m "chore: bundle dependabot dependency updates"
git push
```

Dependabot will automatically close all PRs whose dependency versions now match the committed versions.

## Bundling Decision Guide

**Bundle multiple PRs when:**
- ✅ Multiple Dependabot PRs target the same ecosystem (npm, pip, Go)
- ✅ PRs affect different workflows but update the same package
- ✅ All updates are minor or patch version bumps (low breaking-change risk)

**Handle separately when:**
- ⚠️ A PR involves a **major version bump** with potential breaking changes
- ⚠️ Different teams own different workflows with separate review requirements

## Troubleshooting

| Issue | Solution |
|-------|----------|
| `.github/workflows/package-lock.json` not updated | Run `cd .github/workflows && npm install --package-lock-only` after compilation |
| Dependency not found in `.md` files | Check shared MCP configs in `.github/workflows/shared/` |
| Compilation fails after version update | Check if the new version has breaking API changes |
| Dependabot PR not auto-closing | Verify the exact version strings match; check for pre-release suffixes |

## Dismissed Dependabot Alerts and VEX

When a Dependabot security alert is dismissed with a substantive security reason (`not_used`, `inaccurate`, or `tolerable_risk`), consider generating a [VEX (Vulnerability Exploitability eXchange)](https://openvex.dev/) statement to record the assessment as a machine-readable OpenVEX v0.2.0 document in `.vex/<ghsa-id>.json`. Alerts dismissed as `no_bandwidth` do not represent a security decision and should not produce a VEX statement.

Learn about the OpenVEX format, purl construction, and dismissal-to-justification mappings from [openvex.dev](https://openvex.dev/) before generating statements.

## Related Documentation

- [Dependabot Support](/gh-aw/reference/dependabot/) — Full reference for `gh aw compile --dependabot`
- [OpenVEX Specification](https://openvex.dev/) — VEX standard for vulnerability exploitability exchange
- Local copy: @.github/aw/github-agentic-workflows.md


<!-- file: deployment-status.md -->

---
description: Reference pattern for monitoring external deployment failures using the deployment_status trigger and creating incident issues automatically.
---

# Deployment Status Monitoring

Consult this file when creating an agentic workflow that responds to external deployment failures from services like Heroku, Vercel, Railway, or Fly.io that post deployment status back to GitHub.

## Trigger and Frontmatter

Use the `deployment_status` trigger with an `if:` condition to filter to failed deployments only:

```yaml
on:
  deployment_status:
if: ${{ github.event.deployment_status.state == 'failure' }}
permissions:
  contents: read
  issues: read
  deployments: read
tools:
  github:
    toolsets: [default]
safe-outputs:
  create-issue:
    expires: 1d
    title-prefix: "[Deployment Failure] "
    close-older-issues: true
  noop:
```

## Available Event Context

The following expressions are available in the prompt body:

| Expression | Description |
|---|---|
| `${{ github.event.deployment.environment }}` | Target environment (e.g. `production`) |
| `${{ github.event.deployment_status.state }}` | Status (`failure`, `success`, `error`, etc.) |
| `${{ github.event.deployment_status.target_url }}` | URL to the external service deployment logs |
| `${{ github.event.deployment_status.description }}` | Human-readable error message from the service |
| `${{ github.event.deployment.ref }}` | Branch or tag that was deployed |
| `${{ github.event.deployment.sha }}` | Commit SHA that was deployed |
| `${{ github.event.deployment.creator.login }}` | GitHub user who triggered the deployment |

## Agent Instructions Pattern

```markdown
A deployment to **${{ github.event.deployment.environment }}** has failed.

1. **Verify the failure**: Confirm `${{ github.event.deployment_status.state }}` is `failure`. If not, call `noop` and stop.
2. **Gather context**: Review ref (`${{ github.event.deployment.ref }}`), SHA (`${{ github.event.deployment.sha }}`), and error description (`${{ github.event.deployment_status.description }}`).
3. **Check for duplicates**: Search open issues with the `[Deployment Failure]` title prefix.
4. **Create an incident issue** if none exists, including environment, ref/SHA, deployment URL, error details, and suggested next steps.

Use `noop` if the deployment did not fail or a duplicate issue already exists.
```

## Safe External Log Linking

When including `${{ github.event.deployment_status.target_url }}` in outputs:

- treat the URL as untrusted external input and include it as a plain link (never as executable shell input; avoid patterns like `$(...)` or piping it directly into `curl` commands)
- prefer a short label such as `External deployment logs` instead of echoing long raw URLs inline
- include key incident context (environment, ref, SHA, description) in the issue body so triage does not depend on external link availability
- if `target_url` is empty or malformed, continue triage with in-event fields and call out that no external logs URL was provided

## When to Use `deployment_status` vs `workflow_run`

- **`deployment_status`**: External services (Heroku, Vercel, Railway, Fly.io) that integrate with the GitHub Deployments API — they post a deployment status event back to GitHub when a deploy finishes.
- **`workflow_run`**: In-repo GitHub Actions pipelines — use when reacting to the success or failure of another Actions workflow in the same repository.


<!-- file: experiments.md -->

---
description: Guide for setting up A/B testing experiments in agentic workflows — syntax, design principles, dimensions to test, how to measure results, and anti-patterns.
---

# A/B Testing Experiments in Agentic Workflows

---

## How Experiments Work

Per run:

1. **Restore** — activation job loads experiment state from configured storage (git branch default, or Actions cache).
2. **Pick** — `pick_experiment.cjs` picks the variant with the lowest invocation count (ties broken by array order).
3. **Save** — updated counter written back.
4. **Upload** — state uploaded as workflow artifact `experiment` (30-day retention).
5. **Inject** — variant available as `${{ experiments.<name> }}` and in `{{#if experiments.<name> }}` blocks.

**Key properties**:
- Every run gets one variant per experiment; no sampling.
- Assignment persists across runs automatically.
- Multiple experiments run simultaneously, each independently balanced.

---

## Basic Syntax

```yaml
---
on:
  schedule: daily on weekdays
engine: copilot
experiments:
  prompt_style: [concise, detailed]
---

{{#if experiments.prompt_style == "concise" }}
Summarise the findings in ≤ 5 bullets.
{{#else}}
Provide a detailed analysis with reasoning for each finding.
{{#endif}}
```

### Naming Rules

- Names must match `[a-zA-Z_][a-zA-Z0-9_]*`. Use `lowercase_with_underscores`.
- Non-matching names are silently skipped at compile time.

### Variant Rules

- At least **2 variants** required.
- Plain strings, lowercase descriptive (`concise`, `detailed`, `step_by_step`).
- ~10 variants practical max — sample size per variant grows fast beyond that.

---

## Object Form (Weighted Variants and Date Gating)

Object form supports non-uniform weights, date gating, and governance metadata:

```yaml
experiments:
  prompt_style:
    variants: [concise, detailed, step_by_step]
    weight: [2, 1, 1]           # 50% concise, 25% detailed, 25% step_by_step
    description: "Verbosity A/B test"
    metric: "ai_credits"
    hypothesis: "H0: no change in ai_credits. H1: concise reduces by >=15%"
    guardrail_metrics:
      - name: success_rate
        threshold: ">=0.95"
      - name: empty_output_rate
        direction: min
        threshold: 0.0
    issue: "42"
    start_date: "2026-05-01"
    end_date: "2026-06-01"
```

**Fields:**

- `variants:` — array of variant strings (required, ≥ 2 entries).
- `weight:` — non-negative integers, same length as `variants`. Enables weighted-random selection. `[2, 1, 1]` = 50/25/25. All zeros → always returns control (first variant). Omit for round-robin.
- `start_date:` / `end_date:` — ISO-8601 `YYYY-MM-DD`. Outside this window, control variant is returned and counters do not increment.
- `description:`, `metric:`, `issue:`, `hypothesis:` — governance metadata (no runtime effect).
- `guardrail_metrics:` — array; if any guardrail fails for any variant, experiment is auto-abandoned. Each entry:
  - `name` (required) — metric identifier.
  - `threshold` (required) — comparison string (`">=0.95"`, `"==0"`) or bare number paired with `direction`.
  - `direction` (optional, `"min"`/`"max"`) — lower-better vs higher-better. With bare numeric `threshold`: `min` → metric ≤ threshold; `max` → metric ≥ threshold.

Bare-array and object forms can be mixed in the same `experiments:` map.

---

## Storage Configuration

```yaml
experiments:
  storage: repo   # or: cache
  prompt_style: [concise, detailed]
```

| Value | Behaviour | When to use |
|---|---|---|
| `repo` (**default**) | Commits `state.json` to branch `experiments/{sanitizedWorkflowID}` (hyphens stripped, e.g. `my-workflow` → `experiments/myworkflow`). Adds a `push_experiments_state` job; needs `contents: write`. Durable. | Recommended for all experiments. |
| `cache` | GitHub Actions cache. No extra job/permission. May evict after 7 days of inactivity. | Use only when `contents: write` cannot be granted. |

> The branch is created automatically on first run as an orphan containing `state.json` and `assignments.json`.

---

## Referencing the Active Variant

Two forms, both resolved before the agent sees the prompt:

### 1 — Conditional blocks (most common)

```markdown
{{#if experiments.tone == "formal" }}
Use formal, professional language throughout the report.
{{#else}}
Use a friendly, conversational tone.
{{#endif}}
```

### 2 — Direct interpolation

```markdown
Use `${{ experiments.tone }}` tone when writing the issue body.
```

---

## Designing a Good Experiment

1. **One dimension** per experiment.
2. **Falsifiable hypothesis**.
3. **Primary metric** measurable from workflow run data (artifacts, outputs, duration, tokens).
4. **Guardrail metrics** — things that must not degrade. Use `direction: min` + bare number for lower-is-better rates, or `">=0.95"` for higher-is-better.
5. **Sample size estimate** per variant.

Prefer high-frequency workflows for faster significance.

---

## Dimensions Worth Experimenting On

### Prompt Design

```yaml
experiments:
  prompt_style: [concise, detailed]
  reasoning_depth: [shallow, deep]
  output_format: [bullets, prose, table]
  tone: [formal, casual]
```

Use `{{#if experiments.prompt_style == "concise" }}` blocks to swap prompt instructions. Always compare against a specific variant value.

> ⚠️ **Never write** the internal env-var form `__GH_AW_EXPERIMENTS__PROMPT_STYLE___detailed`. The compiler expands `experiments.<name>` references automatically.

**Typical metrics**: output quality, AI credits, success rate, output length.

### Engine & Model

```yaml
experiments:
  engine_variant: [copilot, claude]
```

> ⚠️ **Engine experiments require separate compiled files**: the `engine:` key cannot be switched mid-run from a single file. Use two parallel workflow files and compare run metrics.

**Typical metrics**: run cost (tokens), duration, completion rate, error rate.

### Tool Configuration

```yaml
experiments:
  tool_scope: [narrow, broad]
```

```markdown
{{#if experiments.tool_scope == "narrow" }}
Only use the `issues` and `pull_requests` toolsets.
{{#else}}
Use any available GitHub MCP tools.
{{#endif}}
```

**Typical metrics**: number of tool calls, run duration, output accuracy.

### Skill Usage

```yaml
experiments:
  skill_hint: [enabled, disabled]
```

```markdown
{{#if experiments.skill_hint == "enabled" }}
Check `skills/` for SKILL.md files relevant to this task and apply their guidance.
{{#endif}}
```

**Typical metrics**: output quality, context token consumption, run duration.

### Timeout & Pacing

```yaml
experiments:
  timeout: [short, long]
```

Pair with a conditional step, or use two compiled files with different `timeout-minutes:`.

---

## Minimal Working Example

```markdown
---
description: Daily PR summary — A/B test concise vs. detailed output
on:
  schedule: daily on weekdays
engine: copilot
permissions:
  pull-requests: read
tools:
  github:
    toolsets: [pull_requests]
safe-outputs:
  create-discussion:
    title-prefix: "[pr-summary] "
    close-older-discussions: true
timeout-minutes: 15
experiments:
  output_style: [concise, detailed]
---

Summarise the pull requests merged in ${{ github.repository }} today.

{{#if experiments.output_style == "concise" }}
Write a maximum of 5 bullet points. Each bullet is one sentence.
{{#else}}
Write a structured report with sections for: new features, bug fixes, refactors,
and documentation changes. Include a one-paragraph executive summary at the top.
{{#endif}}

Include links to each PR. Use ${{ github.server_url }}/${{ github.repository }}/pull/<number> format.
```

Compile and deploy:

```bash
gh aw compile pr-summary
```

First run picks `concise` (count 0), second picks `detailed`, alternating until one variant wins.

---

## Multiple Simultaneous Experiments

Independent assignment, all three injected into the prompt:

```yaml
experiments:
  prompt_style: [concise, detailed]
  emoji_density: [heavy, minimal]
  skill_hint: [enabled, disabled]
```

> ⚠️ **Interaction effects** — limit to 2–3 simultaneous experiments unless you can run factorial analysis.

---

## Lifecycle of an Experiment

1. **Design** — hypothesis, dimension, primary + guardrail metrics.
2. **Instrument** — add `experiments:` and `{{#if experiments.<name> == "<variant>" }}` blocks. Never use `__GH_AW_EXPERIMENTS__*`.
3. **Compile** — `gh aw compile <workflow-name>`.
4. **Run** — check activation job step summary for variant assignment.
5. **Analyse** — once min sample size reached, compare distributions.
6. **Conclude** — rewrite baseline to winning variant, remove `experiments:`, recompile.

---

## Anti-Patterns

- ❌ **Multiple dimensions in one experiment** — can't attribute the improvement.
- ❌ **Removing `experiments:` before sample size reached** — resets state, invalidates counts.
- ❌ **Interpreting early results** (<~20 runs/variant) — chance variation dominates.
- ❌ **Experiments as feature flags** — use `features:` for deterministic switches.
- ❌ **Engine experiments in one file** — `engine:` cannot switch mid-run; use two parallel files.
- ❌ **Conditional frontmatter imports** — keep imports security-stable and use `{{#if experiments.<name> }}` with `{{#runtime-import? path}}` (optional form, not promoted to unconditional lock-file macros) for prompt experiments instead.
- ❌ **Nesting `{{#if experiments.<name> }}` inside `{{#runtime-import? }}`** — evaluation order is brittle across import boundaries. Prefer explicit branching in the main workflow prompt or separate workflow files per variant.
- ❌ **Writing the internal env-var form** `__GH_AW_EXPERIMENTS__*` — implementation detail, may change.


<!-- file: github-agentic-workflows.md -->

---
description: GitHub Agentic Workflows
applyTo: ".github/workflows/*.md,.github/workflows/**/*.md"
---

# GitHub Agentic Workflows

## File Format

Agentic workflows are markdown files with YAML frontmatter.

```markdown
---
emoji: 🧠
name: My Workflow
description: Short description
on:
  issues:
    types: [opened]
permissions:
  contents: read
  actions: read
strict: true
network:
  allowed: [defaults, github]
tools:
  github:
    mode: gh-proxy
    toolsets: [default]
safe-outputs:
  add-comment:
---

# Workflow Title

Natural language instructions for the AI agent.
```

## Recompilation Rule

- Edit the **frontmatter** → run `gh aw compile <workflow-id>`.
- Edit the **markdown body** only → no recompilation required.

See also: [workflow-editing.md](workflow-editing.md)

## Core Rules

- Keep the main agent job read-only.
- Use `safe-outputs:` for GitHub writes.
- Prefer `tools.github.mode: gh-proxy` and use `gh` for GitHub reads.
- For non-GitHub MCP servers, prefer `tools.cli-proxy: true` and use mounted `mcp-clis` commands.
- Use `${{ steps.sanitized.outputs.text }}` for untrusted user content.
- Set `strict: true` for production workflows.
- Limit network and bash access to what the workflow actually needs.
- For visual regression workflows, explicitly name the baseline source (for example `cache-memory` key, artifact, or branch path). See [visual-regression.md](visual-regression.md).

## Repository-Specific Instructions

Use `@.github/aw/instructions.md` as the canonical repository-local overlay for workflow authoring standards.

- This file is optional and repository-owned.
- Installed gh-aw agents should load and apply it automatically when present.
- Precedence: apply upstream defaults first, then apply repository overlay rules; when they conflict, repository overlay rules win.

## Trigger Selection Quick Reference

Use the smallest trigger that matches the requested automation.

| Need | Trigger | Notes |
|---|---|---|
| Review pull request changes or UI diffs | `pull_request` | Use for PR-scoped analysis, comments, and optional `playwright`-based visual regression. |
| React to the result of another GitHub Actions workflow | `workflow_run` | Scope `workflows:` explicitly, use `types: [completed]`, and gate conclusions before creating incidents. |
| Publish recurring reports or stakeholder digests | `schedule` | Define the exact reporting window and default to `create-issue`; add `workflow_dispatch` when manual reruns are useful. |
| Run the workflow on demand | `workflow_dispatch` | Use for manual tests, backfills, and operator-invoked runs; often pair with `schedule` or `workflow_run`. |

See also: [workflow-constraints.md](workflow-constraints.md)

## Ad Hoc Scenario Evaluation

Installed gh-aw agents should support scenario evaluation requests that do not create workflow files.

- Treat prompts such as `agentic-workflows evaluate this scenario without creating files` as ad hoc evaluation mode.
- Return a compact design recommendation covering trigger, scope, tools, permissions, safe outputs, `noop` behavior, and any report window / grouping / deduplication requirements.
- Offer to turn the recommendation into `.github/workflows/<workflow-id>.md` only if the user asks to proceed.

### Non-technical persona example (Program Management)

When the request is framed as a PM or stakeholder workflow (for example "weekly product health digest"):

- prefer `schedule: weekly` (or `daily on weekdays` for operational digests) plus `workflow_dispatch` for preview/backfill runs
- read with `github` (`gh-proxy`) and default to `create-issue` for the digest destination
- require an explicit report window, grouping dimensions, and a stable dedup key before creating output
- use `close-older-issues: true` for recurring issue-style digests and call `noop` when the selected window has no qualifying updates

## PR Checks with Linked References

When a PR analysis requires verifying or attaching a linked artifact (design doc, policy link, architecture decision record, or approval), follow this compact pattern:

1. **Read the linked reference** from the PR body or comments (for example, a URL, a markdown link, or an ADR reference token like `ADR-NN`) using `gh pr view`.
2. **Validate the link** — confirm the document exists and is accessible before assessing compliance.
3. **Classify the result**:
   - Link present and satisfies requirement → `add-comment` with a ✅ summary
   - Link present but does not satisfy requirement → `add-comment` flagging the specific gap
   - Link missing → `add-comment` requesting it, or `create-issue` if policy requires a blocking escalation
4. **Call `noop`** when the PR is not in scope (for example `paths:` guard excludes all changed files).

Permissions: `pull-requests: read` only; all writes route through `add-comment` safe output.

## Reference Files

| Topic | File |
|---|---|
| Editing and recompilation rules | [workflow-editing.md](workflow-editing.md) |
| Architectural and security constraints | [workflow-constraints.md](workflow-constraints.md) |
| Common design patterns | [workflow-patterns.md](workflow-patterns.md) |
| Frontmatter schema index | [syntax.md](syntax.md) |
| Safe outputs index | [safe-outputs.md](safe-outputs.md) |
| Trigger patterns | [triggers.md](triggers.md) |
| Context expressions and `{{#if}}` templates | [context.md](context.md) |
| CLI commands and MCP equivalents | [cli-commands.md](cli-commands.md) |
| Network configuration | [network.md](network.md) |
| Memory and persistence | [memory.md](memory.md) |
| Imports and shared components | [reuse.md](reuse.md) |
| Sub-agents | [subagents.md](subagents.md) |
| Skills | [skills.md](skills.md) |
| Token cost optimization | [token-optimization.md](token-optimization.md) |
| GitHub MCP server configuration | [github-mcp-server.md](github-mcp-server.md) |
| Campaign and KPI patterns | [campaign.md](campaign.md) |
| Experiments and A/B testing | [experiments.md](experiments.md) |
| Charts and Python data visualization | [charts.md](charts.md) |
| LLM API endpoint discovery | [llms.md](llms.md) |

## Compile Commands

```bash
gh aw compile
gh aw compile <workflow-id>
gh aw compile --purge
gh aw compile --strict
```


<!-- file: github-mcp-server.md -->

# GitHub MCP Server Instructions

**Source**: [github/github-mcp-server](https://github.com/github/github-mcp-server/tree/main/pkg/github)
**Mapping File**: [pkg/workflow/data/github_toolsets_permissions.json](https://github.com/github/gh-aw/blob/main/pkg/workflow/data/github_toolsets_permissions.json)
**Last Updated**: 2026-05-24

## Overview

The GitHub MCP server provides tools to interact with GitHub APIs through the Model Context Protocol (MCP). It operates in two modes:

- **Remote mode**: Connects to GitHub's hosted MCP endpoint (`https://api.githubcopilot.com/mcp/`)
- **Local mode**: Runs `gh mcp` (GitHub CLI) as a local subprocess

### Authentication

**Remote mode**: Uses a Bearer token in the Authorization header:
```
Authorization: Bearer <github-token>
```

**Read-only mode**: Add the `X-MCP-Readonly: true` header to restrict to read operations only:
```
X-MCP-Readonly: true
```

**Local mode**: Uses the GitHub CLI's existing authentication (`gh auth login`).

## Configuration

### In Agentic Workflows

```yaml
tools:
  github:
    toolsets: [default]     # or specific toolsets
    # Optional: GitHub App authentication
    github-app:
      client-id: ${{ vars.APP_ID }}
      private-key: ${{ secrets.APP_PRIVATE_KEY }}
```

> ⚠️ **Do NOT use `mode: remote`** in GitHub Actions workflows. Remote mode does not work with the GitHub Actions token (`GITHUB_TOKEN`) — it requires a special PAT or GitHub App token with MCP access. The default `mode: local` (Docker-based) works with `GITHUB_TOKEN` and should always be used.

### Toolset Options

- `[default]` — Recommended defaults: `context`, `repos`, `issues`, `pull_requests`
- `[all]` — Enable all toolsets
- Specific toolsets: `[repos, issues, pull_requests, discussions]`
- Extend defaults: `[default, discussions, actions]`

## Recommended Default Toolsets

The following toolsets are recommended as defaults for typical agentic workflows:

| Toolset | Rationale |
|---------|-----------|
| `context` | Identity and team awareness (`get_me`, `get_teams`) — essential for any GitHub-aware agent |
| `repos` | Core repository operations (read files, list commits/branches) — most workflows need file access |
| `issues` | Issue management (read, comment, create) — common in CI/CD and automation workflows |
| `pull_requests` | PR operations (read, create, review) — critical for code review and merge automation |

**Enable explicitly when needed** (not in defaults):

| Toolset | When to Enable |
|---------|---------------|
| `actions` | Workflow introspection, triggering runs |
| `code_security` | Code scanning alert management |
| `copilot_spaces` | GitHub Copilot Spaces (remote mode only) |
| `dependabot` | Dependency vulnerability management |
| `discussions` | Community discussion workflows |
| `experiments` | Dynamic toolset management |
| `gists` | Gist creation and management |
| `git` | Git API operations (tree, refs) |
| `github_support_docs_search` | GitHub support documentation search (remote mode only) |
| `labels` | Label management automation |
| `notifications` | Notification processing agents |
| `orgs` | Organization-level security advisories |
| `projects` | GitHub Projects automation (requires PAT) |
| `search` | Cross-repository search operations |
| `secret_protection` | Secret scanning alert management |
| `security_advisories` | Advisory database queries |
| `stargazers` | Star/unstar repository operations |
| `users` | (currently empty — no tools registered) |

## Tools by Toolset

### context
**Description**: GitHub context and environment (current user, teams)

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `get_me` | Get details of the authenticated user | — |
| `get_team_members` | List members of a GitHub team | `org`, `team_slug` |
| `get_teams` | List teams the authenticated user belongs to | `org` |

---

### copilot_spaces
**Description**: GitHub Copilot Spaces (remote-only)

> **Note**: Remote-only toolset — only available when using the GitHub MCP server in remote mode (`https://api.githubcopilot.com/mcp/`). Not available with the local `gh mcp` mode.

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `get_copilot_space` | Get details of a specific Copilot Space | `owner`, `name` |
| `list_copilot_spaces` | List Copilot Spaces for a user or organization | `owner` |

---

### repos
**Description**: Repository operations

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `create_branch` | Create a new branch | `owner`, `repo`, `branch`, `from_branch` |
| `create_or_update_file` | Create or update a file in a repository | `owner`, `repo`, `path`, `content`, `message`, `branch` |
| `create_repository` | Create a new GitHub repository | `name`, `description`, `private`, `auto_init` |
| `delete_file` | Delete a file from a repository | `owner`, `repo`, `path`, `message`, `sha`, `branch` |
| `fork_repository` | Fork a repository | `owner`, `repo`, `organization` |
| `get_commit` | Get details of a specific commit | `owner`, `repo`, `sha` |
| `get_file_contents` | Read file or directory contents | `owner`, `repo`, `path`, `ref` |
| `get_latest_release` | Get the latest release for a repository | `owner`, `repo` |
| `get_release_by_tag` | Get a release by its tag name | `owner`, `repo`, `tag` |
| `get_tag` | Get details of a specific tag | `owner`, `repo`, `tag` |
| `list_branches` | List branches in a repository | `owner`, `repo`, `page`, `per_page` |
| `list_commits` | List commits in a repository | `owner`, `repo`, `sha`, `path`, `page` |
| `list_releases` | List all releases for a repository | `owner`, `repo`, `page`, `per_page` |
| `list_repository_collaborators` | List collaborators of a repository | `owner`, `repo`, `affiliation`, `page`, `per_page` |
| `list_tags` | List tags in a repository | `owner`, `repo`, `page`, `per_page` |
| `push_files` | Push multiple files in a single commit | `owner`, `repo`, `branch`, `files`, `message` |

---

### git
**Description**: Git API operations (tree, refs)

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `get_repository_tree` | Get the file tree of a repository | `owner`, `repo`, `sha`, `recursive` |

---

### github_support_docs_search
**Description**: GitHub support documentation search (remote-only)

> **Note**: Remote-only toolset — only available when using the GitHub MCP server in remote mode (`https://api.githubcopilot.com/mcp/`). Not available with the local `gh mcp` mode.

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `github_support_docs_search` | Search GitHub support documentation | `query` |

---

### issues
**Description**: Issue management

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `add_issue_comment` | Add a comment to an issue | `owner`, `repo`, `issue_number`, `body` |
| `issue_read` | Read issue details and comments | `owner`, `repo`, `issue_number` |
| `issue_write` | Create or update an issue | `owner`, `repo`, `title`, `body`, `labels`, `assignees` |
| `list_issue_types` | List available issue types for a repository | `owner`, `repo` |
| `list_issues` | List issues in a repository | `owner`, `repo`, `state`, `labels`, `page` |
| `search_issues` | Search issues across GitHub | `query`, `page`, `per_page` |
| `sub_issue_write` | Create or manage sub-issues | `owner`, `repo`, `issue_number` |

---

### pull_requests
**Description**: Pull request operations

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `add_comment_to_pending_review` | Add a comment to a pending PR review | `owner`, `repo`, `pull_number`, `review_id` |
| `add_reply_to_pull_request_comment` | Reply to a PR review comment | `owner`, `repo`, `pull_number`, `comment_id`, `body` |
| `create_pull_request` | Create a new pull request | `owner`, `repo`, `title`, `body`, `head`, `base` |
| `list_pull_requests` | List pull requests in a repository | `owner`, `repo`, `state`, `head`, `base` |
| `merge_pull_request` | Merge a pull request | `owner`, `repo`, `pull_number`, `merge_method` |
| `pull_request_read` | Read PR details, reviews, and comments | `owner`, `repo`, `pull_number` |
| `pull_request_review_write` | Create or submit a PR review | `owner`, `repo`, `pull_number`, `event`, `body` |
| `search_pull_requests` | Search pull requests across GitHub | `query`, `page`, `per_page` |
| `update_pull_request` | Update PR title, body, or state | `owner`, `repo`, `pull_number`, `title`, `body` |
| `update_pull_request_branch` | Update PR branch with latest base | `owner`, `repo`, `pull_number` |

---

### actions
**Description**: GitHub Actions workflows

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `actions_get` | Get details of a specific workflow run | `owner`, `repo`, `run_id` |
| `actions_list` | List GitHub Actions workflows and runs | `owner`, `repo`, `method`, `resource_id`, `per_page`, `page` |
| `actions_run_trigger` | Trigger a workflow run | `owner`, `repo`, `workflow_id`, `ref`, `inputs` |
| `get_job_logs` | Download logs for a specific workflow job | `owner`, `repo`, `job_id` |

---

### code_security
**Description**: Code scanning alerts

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `get_code_scanning_alert` | Get details of a specific code scanning alert | `owner`, `repo`, `alert_number` |
| `list_code_scanning_alerts` | List code scanning alerts for a repository | `owner`, `repo`, `state`, `severity` |

---

### dependabot
**Description**: Dependabot alerts

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `get_dependabot_alert` | Get details of a specific Dependabot alert | `owner`, `repo`, `alert_number` |
| `list_dependabot_alerts` | List Dependabot alerts for a repository | `owner`, `repo`, `state`, `severity` |

---

### discussions
**Description**: GitHub Discussions

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `get_discussion` | Get details of a specific discussion | `owner`, `repo`, `discussion_number` |
| `get_discussion_comments` | Get comments for a specific discussion | `owner`, `repo`, `discussion_number` |
| `list_discussion_categories` | List discussion categories for a repository | `owner`, `repo` |
| `list_discussions` | List discussions in a repository | `owner`, `repo`, `category_id` |

---

### experiments
**Description**: Experimental features — dynamic toolset management

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `enable_toolset` | Dynamically enable a toolset | `toolset` |
| `get_toolset_tools` | Get tools available in a specific toolset | `toolset` |
| `list_available_toolsets` | List all available toolsets | — |

---

### gists
**Description**: Gist operations

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `create_gist` | Create a new gist | `description`, `files`, `public` |
| `get_gist` | Get a specific gist by ID | `gist_id` |
| `list_gists` | List gists for a user | `username`, `page`, `per_page` |
| `update_gist` | Update an existing gist | `gist_id`, `description`, `files` |

---

### labels
**Description**: Label management

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `get_label` | Get details of a specific label | `owner`, `repo`, `name` |
| `label_write` | Create or update a label | `owner`, `repo`, `name`, `color`, `description` |
| `list_label` | List labels in a repository | `owner`, `repo`, `page`, `per_page` |

---

### notifications
**Description**: Notification management

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `dismiss_notification` | Dismiss a specific notification | `notification_id` |
| `get_notification_details` | Get details of a specific notification | `notification_id` |
| `list_notifications` | List user notifications | `all`, `participating`, `page` |
| `manage_notification_subscription` | Manage notification subscription for a thread | `thread_id`, `subscribed` |
| `manage_repository_notification_subscription` | Manage notifications for a repository | `owner`, `repo`, `subscribed` |
| `mark_all_notifications_read` | Mark all notifications as read | `last_read_at` |

---

### orgs
**Description**: Organization operations

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `list_org_repository_security_advisories` | List security advisories for all repos in an org | `org`, `state` |

---

### projects
**Description**: GitHub Projects (requires PAT — not supported by GITHUB_TOKEN)

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `projects_get` | Get details of a specific project | `owner`, `project_number` |
| `projects_list` | List GitHub Projects for a user or organization | `owner`, `per_page` |
| `projects_write` | Create or update project items/fields | `owner`, `project_number` |

---

### search
**Description**: Advanced search across GitHub

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `search_code` | Search code across repositories | `query`, `page`, `per_page` |
| `search_orgs` | Search GitHub organizations | `query`, `page`, `per_page` |
| `search_repositories` | Search for repositories | `query`, `page`, `per_page` |
| `search_users` | Search GitHub users | `query`, `page`, `per_page` |
| `semantic_issue_similarity_search` | Find GitHub issues semantically similar to a given issue | `owner`, `repo`, `issue_number` |
| `semantic_issues_search` | Search issues using natural language queries | `query`, `owner`, `repo` |

---

### secret_protection
**Description**: Secret scanning

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `get_secret_scanning_alert` | Get details of a specific secret scanning alert | `owner`, `repo`, `alert_number` |
| `list_secret_scanning_alerts` | List secret scanning alerts for a repository | `owner`, `repo`, `state` |
| `run_secret_scanning` | Scan file contents or diffs for exposed secrets | `content` |

---

### security_advisories
**Description**: Security advisories

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `check_dependency_vulnerabilities` | Check dependencies against known vulnerabilities in the GitHub Advisory Database | `owner`, `repo`, `dependencies` |
| `get_global_security_advisory` | Get a specific global security advisory | `ghsa_id` |
| `list_global_security_advisories` | List advisories from the GitHub Advisory Database | `type`, `severity`, `ecosystem` |
| `list_repository_security_advisories` | List security advisories for a specific repository | `owner`, `repo`, `state` |

---

### stargazers
**Description**: Repository stars

| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| `list_starred_repositories` | List repositories starred by a user | `username`, `page`, `per_page` |
| `star_repository` | Star a repository | `owner`, `repo` |
| `unstar_repository` | Unstar a repository | `owner`, `repo` |

---

### users
**Description**: User information

> **Note**: No tools are currently registered in the `users` toolset. User search is available via the `search` toolset (`search_users`).

---

## Best Practices

### Toolset Selection

1. **Start with defaults** (`context`, `repos`, `issues`, `pull_requests`) for most workflows
2. **Add toolsets incrementally** based on actual needs rather than enabling `all`
3. **Security toolsets** (`code_security`, `dependabot`, `secret_protection`, `security_advisories`) require `security-events` permission
4. **Write operations** require appropriate GitHub token permissions (see `write_permissions` in the JSON mapping)
5. **Projects toolset** requires a PAT (Personal Access Token) — `GITHUB_TOKEN` lacks the required `project` scope

### Permission Requirements

Most toolsets work with the default `GITHUB_TOKEN` in GitHub Actions. Exceptions:

- `projects` — Requires a PAT with `project` scope
- `security_advisories` (write) — Requires `security-events: write` permission
- `actions` (write for `actions_run_trigger`) — Requires `actions: write` permission


<!-- file: instructions.md -->

# Repository Instructions Overlay for gh-aw Agents

This optional file defines repository-local workflow authoring standards for installed gh-aw agents.

## Scope

These rules apply when creating, editing, reviewing, and upgrading agentic workflow files.

## Precedence

Apply upstream/default gh-aw instructions first, then apply this overlay.
If a rule conflicts, this repository overlay takes precedence.

## Repository Rules

Add your repository-specific standards here, for example:

- Required shared include(s) for new workflows
- Standard frontmatter defaults
- Frontmatter ordering/style conventions
- Security or policy constraints specific to this repository


<!-- file: llms.md -->

---
description: Discover LLM API endpoints, ports, and available model names inside the AWF agent container using the api-proxy /reflect endpoint.
---

# LLM API Endpoint Discovery

The AWF api-proxy sidecar exposes a `/reflect` endpoint listing every configured LLM provider, its port, and available models. Use it to configure any tool that needs OpenAI/Anthropic access inside the agent container.

> ⚠️ Only reachable from **inside the AWF agent container** — not from the runner host.

## Quick Start

```bash
# Discover all configured providers and their models
curl -sf http://api-proxy:10000/reflect | jq '.endpoints[] | select(.configured)'
```

## Provider Ports

| Provider | Port | Base URL | Credentials env var |
|---|---|---|---|
| `openai` / `codex` | 10000 | `http://api-proxy:10000/v1` | `OPENAI_API_KEY` |
| `anthropic` | 10001 | `http://api-proxy:10001/v1` (or no `/v1` for native SDK) | `ANTHROPIC_API_KEY` |
| `copilot` | 10002 | `http://api-proxy:10002/v1` | `COPILOT_GITHUB_TOKEN` |
| `gemini` | 10003 | `http://api-proxy:10003/v1` | `GEMINI_API_KEY` |

All ports use the OpenAI-compatible API format. The api-proxy injects auth headers automatically — **do not pass raw API keys** to these URLs.

## /reflect Response

```json
{
  "endpoints": [
    { "provider": "openai",    "port": 10000, "configured": true,  "models": ["gpt-4o", "o1-mini"], "models_url": "http://api-proxy:10000/v1/models" },
    { "provider": "anthropic", "port": 10001, "configured": true,  "models": ["claude-sonnet-4-5"],  "models_url": "http://api-proxy:10001/v1/models" },
    { "provider": "copilot",   "port": 10002, "configured": true,  "models": null,                   "models_url": "http://api-proxy:10002/models" },
    { "provider": "gemini",    "port": 10003, "configured": false, "models": null,                   "models_url": null }
  ],
  "models_fetch_complete": true
}
```

Only use endpoints where `configured: true`. `models` may be `null` when the proxy hasn't finished fetching; use `models_url` to fetch on demand.

## Configure Tools

### OpenAI-compatible SDK (any provider)

```bash
# Use Copilot as the OpenAI backend
export OPENAI_BASE_URL="http://api-proxy:10002/v1"
export OPENAI_API_KEY="$COPILOT_GITHUB_TOKEN"
```

### Anthropic SDK

```bash
export ANTHROPIC_BASE_URL="http://api-proxy:10001"
export ANTHROPIC_API_KEY="$ANTHROPIC_API_KEY"
```

### Dynamic resolution from /reflect

```bash
PROVIDER=anthropic
PORT=$(curl -sf http://api-proxy:10000/reflect \
  | jq -r --arg p "$PROVIDER" '.endpoints[] | select(.provider == $p and .configured) | .port')
export ANTHROPIC_BASE_URL="http://api-proxy:${PORT}"
```

## List Available Models

```bash
# OpenAI / Anthropic / Copilot format → { data: [{id}] }
curl -sf http://api-proxy:10000/reflect \
  | jq -r '.endpoints[] | select(.provider == "openai" and .configured) | .models_url' \
  | xargs curl -sf | jq '[.data[].id]'

# Gemini format → { models: [{name: "models/gemini-..."}] }
curl -sf http://api-proxy:10003/v1/models | jq '[.models[].name | ltrimstr("models/")]'
```

## See Also

- [network.md](network.md) — egress domain configuration
- [syntax.md](syntax.md) — `engine:` and `engine.model` frontmatter


<!-- file: loop.md -->

---
description: Loop-engineering workflow patterns and implementation guidance for gh-aw workflows.
---

# Loop Engineering Patterns

Use as a playbook for long-running iterative workflows.

## What “loop engineering” means

A loop workflow repeatedly:

1. selects exactly one work item,
2. makes one bounded improvement step,
3. verifies with concrete evidence,
4. preserves accepted progress,
5. records durable state for the next run.

Design for reliability across repeated runs, merges, CI failures, and human steering.

## Shared architecture across Autoloop, Goal, and Crane

### 1) Single-item scheduler

All three select one item per run:

- Autoloop: one program
- Goal: one goal issue
- Crane: one migration

This bounds run cost and preserves round-robin fairness.

### 2) Canonical long-running branch + single PR

Each item owns one stable branch and one draft PR:

- `autoloop/<program>`
- `goal/<issue>-<slug>`
- `crane/<migration>`

Accumulate accepted commits on that same PR over time.

### 3) Ratcheting acceptance

A change is accepted only when it improves the tracked metric (or advances the contract) and passes CI/verification gates.

If improvement fails, discard the change and still record the run.

### 4) Durable state in repo-memory

Persist state as markdown in a dedicated memory branch:

- `memory/autoloop`
- `memory/goal`
- `memory/crane`

Keep state machine-readable and human-editable.

### 5) Human control-plane issue

Each item has one canonical issue with:

- a durable status comment sentinel (`<!-- ...:STATUS -->`),
- one per-run log comment,
- human steering directives.

### 6) Explicit no-progress and pause semantics

When blocked or stuck, pause with a concrete reason. Do not retry forever.

## Pattern inventory

### Pattern A — Item selection and fairness

Use a deterministic pre-step scheduler that writes a compact selection artifact (for example `/tmp/gh-aw/autoloop.json` in gh-aw runners; match the file name to your workflow) with:

- selected item,
- deferred items,
- due/not-due flags,
- existing PR/branch metadata.

Do not discover candidates ad hoc in-prompt.

### Pattern B — Canonical branch invariants

Branch names must be deterministic and suffix-free.

Always use ahead/behind logic against default branch:

- `ahead=0, behind>0`: fast-forward/reset branch to default,
- `ahead>0, behind>0`: merge default into branch,
- else: checkout as-is.

When a force-push is required by your branch-sync strategy, use `--force-with-lease` (not `--force`).
Still keep canonical branches single-writer (the workflow) to minimize push conflicts.

### Pattern C — One PR per item

Never create multiple active PRs for the same item.

Resolve in order:

1. scheduler-provided `existing_pr`,
2. state-file PR fallback,
3. create exactly one PR if none exists.

### Pattern D — Improve → push → gate → accept

Use a three-phase accept path:

1. metric/contract improvement check,
2. push and wait for CI/checks,
3. accept only on green.

This avoids sandbox-only false positives.

### Pattern E — CI fix loop with circuit breakers

When CI fails after an improved change:

- collect failing jobs and error signatures,
- attempt bounded fix retries,
- stop on repeated identical signature,
- pause with structured reason (`ci-fix-exhausted`, `stuck`, `ci-timeout`).

### Pattern F — Structured state file

Keep a stable state layout with:

- machine-state table (iteration count, last run, best metric, pause/completion fields),
- current focus/checkpoint,
- lessons learned,
- foreclosed avenues/blockers,
- iteration history (newest first).

This preserves continuity and deterministic scheduling.

### Pattern G — Setup guard and safety rails

Use sentinel-based configuration checks before first real run:

- Autoloop: `<!-- AUTOLOOP:UNCONFIGURED -->`
- Crane: `<!-- CRANE:UNCONFIGURED -->`

If unconfigured, create/refresh setup issue and skip execution.

### Pattern H — Direction-aware metrics

Support both optimization directions:

- `higher` is better (default),
- `lower` is better.

Use direction in:

- improvement test,
- signed delta reporting,
- target-metric completion check.

### Pattern I — Completion by evidence, not intent

Completion requires explicit evidence gates.

- Goal enforces issue-defined completion contracts.
- Crane separates reaching target metric from deterministic completion-gate pass.
- Autoloop supports target-metric completion with explicit label transition.

Never mark complete on belief alone.

### Pattern J — Unified run reporting

On every run (accepted/rejected/error/blocked):

- update durable status comment,
- append per-run summary comment,
- include run URL, checkpoint, evidence, result, next step.

This creates an auditable run narrative.

## Comparative notes by project

| Project | Primary loop unit | Unique strength | Key reusable pattern |
|---|---|---|---|
| Autoloop | Program | General metric-driven optimization with rich iteration memory | Improvement ratchet + CI-gated accept/reject |
| Goal | Goal-labeled issue | Contract-first execution and definition-quality gating | “Needs action” path before implementation |
| Crane | Migration | Milestone plan + strategy selection (`in-place` vs `greenfield`) | iteration 0 planning commit and migration-specific completion gate |

## Implementation blueprint for new loop workflows in gh-aw

When creating a new loop workflow, implement in this order:

1. **Define the loop unit** (issue/program/migration/task).
2. **Add scheduler pre-step** that selects one item and emits JSON context.
3. **Define canonical branch and single-PR invariant**.
4. **Add durable state schema** in repo-memory.
5. **Implement run phases**: read state → choose checkpoint → change → verify.
6. **Add accept/reject logic** with direction-aware metric handling.
7. **Gate acceptance on CI/check health**.
8. **Add bounded fix loop** with failure-signature no-progress guard.
9. **Implement completion semantics** with explicit evidence gate.
10. **Add status + per-run issue comments** for observability.
11. **Add pause/recovery policy** for blocked or repeated failures.
12. **Document command-mode overrides** (slash command steering).

## Minimal loop run-state model

Use these conceptual states:

- `active`
- `accepted`
- `rejected`
- `error`
- `needs_action`
- `blocked`
- `paused`
- `completed`

Transitions must be deterministic and evidence-backed.

## Common failure modes to avoid

- Branch name drift (suffixes/hashes/run IDs)
- Multiple PRs per item
- Marking completion without deterministic evidence
- Repeating the same failed CI signature without pause
- Losing long-term context by storing state only in ephemeral run logs
- Unbounded scope growth per iteration

## Practical guidance for prompt authors

For loop prompts, explicitly require:

- one checkpoint per run,
- smallest useful change,
- explicit evidence command output,
- explicit `noop`/blocked behavior,
- state updates every run,
- strict branch/PR invariants.

Keep prompts short. Move durable policy to state + structured workflow rules.

## Reusable checklist

- [ ] One selected item per run
- [ ] Canonical branch name with no suffix
- [ ] Single draft PR per item
- [ ] Durable state file updated every run
- [ ] Improvement criterion defined (direction-aware)
- [ ] Acceptance gated on CI/checks
- [ ] Fix-loop retry cap and signature-based stop
- [ ] Explicit blocked/paused handling
- [ ] Deterministic completion gate
- [ ] Status comment + per-run comment updated


<!-- file: lsp.md -->

---
description: Language Server Protocol (LSP) configuration reference for gh-aw Copilot workflows — frontmatter syntax, supported servers, and file extension mapping.
---

# LSP Configuration

> ⚠️ **Experimental feature.** The `lsp` frontmatter field is experimental. Using it will emit a compile-time warning. The interface may change in future releases.

The `lsp` frontmatter field lets Copilot-engine workflows declare language servers. At compile time, the compiler:

1. Validates the configuration and rejects the workflow if `lsp` is used with a non-Copilot engine.
2. Generates `~/.copilot/settings.json` with an `lspServers` block the Copilot CLI reads at startup.
3. Injects install steps for known server ecosystems into the agent setup job.

> ⚠️ **`lsp` is only supported with `engine: copilot`**. Using it with any other engine causes a compile-time error.

## Syntax

```yaml
engine:
  id: copilot

lsp:
  <language-key>:
    command: <server-executable>
    args: [<arg1>, <arg2>]          # optional
    fileExtensions:
      ".<ext>": <language-id>       # at least one required
```

Each key under `lsp` is a language identifier (lowercase, alphanumeric, hyphens, or underscores). It maps to a server definition with:

| Field | Required | Description |
|---|---|---|
| `command` | **yes** | Executable name or path for the language server |
| `args` | no | Command-line arguments passed to the server on startup |
| `fileExtensions` | **yes** | Map of file extension (with leading `.`) to LSP language ID |
| `version` | no | Package version to install (e.g. `"5.8.3"`). Overrides the built-in pinned default for known servers. |

## Built-in Servers

For the languages below, the compiler automatically injects an install step — no manual `steps:` entry is needed. Each server is pinned to a known-good release by default; use the `version` field to override.

| Language key | Default version | Install command | Example `command` |
|---|---|---|---|
| `bash` | `5.4.0` | `npm install -g --ignore-scripts bash-language-server@5.4.0` | `bash-language-server` |
| `go` | `0.18.1` | `go install golang.org/x/tools/gopls@v0.18.1` | `gopls` |
| `php` | `1.14.1` | `npm install -g --ignore-scripts intelephense@1.14.1` | `intelephense` |
| `python` | `1.1.399` | `npm install -g --ignore-scripts pyright@1.1.399` | `pyright-langserver` |
| `ruby` | `0.50.0` | `gem install solargraph -v 0.50.0` | `solargraph` |
| `rust` | n/a | `rustup component add rust-analyzer` | `rust-analyzer` |
| `typescript` | `5.8.3` / `4.3.3` | `npm install -g --ignore-scripts typescript@5.8.3 typescript-language-server@4.3.3` | `typescript-language-server` |
| `yaml` | `1.15.0` | `npm install -g --ignore-scripts yaml-language-server@1.15.0` | `yaml-language-server` |

> The `version` field overrides the pinned version for the primary language server package (the last package in the install list). For `typescript`, it controls `typescript-language-server`; `typescript` itself stays at its hardcoded companion version (`5.8.3`).

Language keys not in this table still work — the compiler simply skips the auto-install step. Add a manual `steps:` entry to install the server yourself.

## LSP-enabled Tools (Copilot Engine)

When `lsp` is configured, Copilot gets semantic code-intelligence tools from the configured language servers. These tools are distinct from plain text search and are best for symbol-aware work.

| Tool capability | What it's used for | Typical prompt intent |
|---|---|---|
| Symbol lookup | Find functions, types, methods, constants, classes by symbol name | "Find the `Compile` method and related types." |
| Go to definition / declaration | Jump from usage to source of truth | "Open the definition of `NewLSPManager`." |
| Find references | Discover where a symbol is read/written/called | "List all call sites of `GenerateInstallSteps`." |
| Document / workspace symbols | Enumerate symbols in a file or project scope | "Summarize top-level symbols in `pkg/workflow/lsp_manager.go`." |
| Hover / type info | Inspect signatures, inferred types, and doc comments | "Show the signature and docs for `RuntimeRequirements`." |
| Diagnostics | Surface parse/type errors from the language server | "Check diagnostics in `pkg/workflow` before editing." |
| Rename / refactor actions (server-dependent) | Apply safe symbol renames and structured edits | "Rename this exported type and update references." |

> [!NOTE]
> Exact tool names and availability depend on the runtime engine and language server. Prompt for the capability ("find references", "go to definition"), not a hardcoded tool ID.

## Prompting Guidance for Efficient LSP Use

Use these patterns to help the authoring agent get faster, higher-signal results:

1. **State the semantic goal first.** Ask for symbol-level operations (definition, references, diagnostics) before broad grep scans.
2. **Constrain scope early.** Include target directories/files and language keys to avoid whole-repo symbol walks.
3. **Use a two-pass flow.** Ask for quick symbol discovery first, then deeper reference/type analysis only for shortlisted symbols.
4. **Require evidence in output.** Instruct the agent to include file paths and line numbers for each definition/reference result.
5. **Set fallback behavior.** If LSP data is unavailable, instruct the agent to fall back to text search and say confidence is lower.
6. **Avoid over-requesting.** Ask for only the symbols needed for the current task to minimize token and tool overhead.

Example intent phrasing:

> "Use LSP capabilities first: find symbol definitions and references for `LSPManager` in `pkg/workflow`, report file:line evidence, then propose the minimal edit."

## Examples

### TypeScript / JavaScript

```yaml
engine:
  id: copilot

lsp:
  typescript:
    command: typescript-language-server
    args: ["--stdio"]
    fileExtensions:
      ".ts": typescript
      ".tsx": typescriptreact
      ".js": javascript
      ".cjs": javascript
      ".mjs": javascript
```

### Python

```yaml
engine:
  id: copilot

lsp:
  python:
    command: pyright-langserver
    args: ["--stdio"]
    fileExtensions:
      ".py": python
```

### Go

```yaml
engine:
  id: copilot

lsp:
  go:
    command: gopls
    fileExtensions:
      ".go": go
```

### Multiple Languages

```yaml
engine:
  id: copilot

lsp:
  typescript:
    command: typescript-language-server
    args: ["--stdio"]
    fileExtensions:
      ".ts": typescript
      ".js": javascript
  python:
    command: pyright-langserver
    args: ["--stdio"]
    fileExtensions:
      ".py": python
```

### Custom Server (no built-in install)

For servers without a built-in install spec, add a manual install step:

```yaml
engine:
  id: copilot

steps:
  - name: Install custom language server
    run: npm install -g my-custom-language-server

lsp:
  mylang:
    command: my-custom-language-server
    args: ["--stdio"]
    fileExtensions:
      ".ml": mylang
```

## Network Requirements

Installing LSP servers requires network access to the appropriate package registry. Add the matching ecosystem to `network.allowed`:

| Language | Ecosystem to add |
|---|---|
| `bash`, `php`, `python`, `typescript`, `yaml` | `node` |
| `go` | `go` |
| `ruby` | `ruby` |
| `rust` | `rust` |

```yaml
network:
  allowed:
    - node   # for npm-installed servers (typescript, yaml, python/pyright, etc.)
    - go     # for gopls
```

## Compile-time Validation

The compiler enforces these rules at compile time:

- `lsp` requires `engine: copilot` — any other engine causes an error.
- Each language entry must have a non-empty `command`.
- Each language entry must define at least one `fileExtensions` mapping.
- Language keys are case-insensitive and trimmed; duplicate keys that collapse to the same lowercase value are normalized deterministically with the lexicographically first original key winning, but should still be avoided because the result may be surprising.


<!-- file: mcp-clis.md -->

---
description: MCP CLI command usage guidance and JSON payload patterns
---

# MCP CLI Usage

MCP CLI exposes mounted MCP servers as shell commands on `PATH`. Enabled by `tools.cli-proxy: true`.

> **IMPORTANT**: For `safeoutputs` and `mcpscripts`, **always use the CLI commands** instead of the equivalent MCP tools — do **not** call their MCP tools directly even if they appear in your tool list.
>
> For `safeoutputs`, treat every successful command as a real write-intent declaration. Do **not** use it for exploratory probing, auth checks, placeholder payloads, retries with variants, or runtime experiments. Emit the final intended call once. If not ready, use `noop` or `report_incomplete`.
>
> All other servers listed here are **only** available as CLI commands, not MCP tools.

## How to use

Each server is a standalone executable on `PATH`. Invoke from bash with `--name value` pairs:

```bash
<server-name> <tool-name> --param1 value1 --param2 value2
```

**Examples:**
```bash
playwright --help
playwright browser_navigate --url https://example.com
playwright browser_snapshot

safeoutputs add_comment --item_number 42 --body "Analysis complete"

mcpscripts --help
mcpscripts mcpscripts-gh --args "pr list --repo owner/repo --limit 5"
```

For multiple/complex arguments, pipe JSON on stdin using `.` as sentinel (preserves native types, avoids shell quoting):

```bash
printf '{"item_number":42,"body":"### Title\n\nBody paragraph one.\n\nBody paragraph two."}' \
  | safeoutputs add_comment .

printf '{"title":"Fix: something","body":"Details here","labels":["bug","priority-high"]}' \
  | safeoutputs create_issue .
```

If pipes are blocked, use file redirection: `safeoutputs create_pull_request . < /tmp/payload.json`.

## Notes

- **Prefer JSON payload mode** (`. < file` or `printf '{...}' | server tool .`) for multi-argument or complex calls
- Parameters also accept `--name value` pairs; boolean flags use `--flag` (no value) for `true`
- `.` as sole argument parses stdin as a JSON object
- Hyphens and underscores in parameter names are interchangeable (`issue-number` == `issue_number`)
- Output goes to stdout; errors to stderr with non-zero exit
- Run inside a `bash` tool call — these are shell executables, not MCP tools
- Read-only; cannot be modified by the agent


<!-- file: memory-stateful-patterns.md -->

---
description: Worked patterns for stateful agentic workflows — baseline metric comparison with cache-memory, and "alert on new findings" scanning with repo-memory.
---

# Stateful Memory Patterns

Detailed worked examples for the two most common persistent-state patterns. For the tool decision guide and configuration reference, see [memory.md](memory.md).

## Baseline Comparison (cache-memory)

Use `cache-memory` to persist a baseline metric between runs and detect regressions. Well-suited for any "compare current vs. previous" scenario — test coverage, build duration, benchmark scores, audit counts — where runs happen at least once every 7 days (the default cache retention).

**When to use this pattern**

- Tracking a numeric metric (coverage %, build time, test count, score) across scheduled or PR runs
- Alerting when a metric regresses by more than an acceptable threshold
- Any "tell me when X drops by more than Y" workflow where losing the baseline for a cycle is tolerable (the next run simply re-establishes it)

**When to use `repo-memory` instead**

If a lost baseline would cause serious side-effects — e.g. a security-finding baseline where "cache miss" floods the repo with duplicate issues — use `repo-memory` (see below).

**Worked example: coverage delta on every PR**

```markdown
---
description: Post a PR comment when test coverage drops by more than 1 percentage point
on:
  pull_request:
    types: [opened, synchronize]
permissions:
  pull-requests: read
  contents: read
engine: copilot
tools:
  github:
    toolsets: [pull_requests]
  cache-memory: true
safe-outputs:
  add-pr-comment:
    max: 1
timeout-minutes: 15
---

Run the test suite and collect the overall line-coverage percentage as a
single float (e.g. `82.5`).

Load `/tmp/gh-aw/cache-memory/coverage-baseline.json` if it exists.
The file stores: `{ "coverage": 82.5, "updated": "2026-05-01-09-00-00" }`.

**First run** (file missing): write the current coverage to the file and use
the `noop` safe output — no comment is needed yet.

**Subsequent runs** (baseline found): compute `delta = current − baseline`.

- If `delta >= −1.0` (coverage held or improved), use the `noop` safe output.
- If `delta < −1.0` (coverage fell by more than 1 pp), post an `add-pr-comment`
  that includes baseline coverage, current coverage and delta (e.g. "82.5% →
  79.3% (−3.2 pp)") plus which files lost the most coverage.

Regardless of the outcome, overwrite `/tmp/gh-aw/cache-memory/coverage-baseline.json`
with the current coverage and a filesystem-safe timestamp `YYYY-MM-DD-HH-MM-SS`
(no colons, no `T`, no `Z`).
```

**Key design decisions**

- **`cache-memory` not `repo-memory`** — coverage deltas are short-lived quality gates; a cache miss just means "no comparison this run" and the baseline is silently refreshed — no false-positive flood
- **First-run handling** — treat a missing baseline as "no data yet": write it and skip the comparison; the second run is the first real gate
- **Threshold guard** — ignore sub-1 pp fluctuations to reduce noise; tune the threshold to your team's standards
- **Filename safety** — use `YYYY-MM-DD-HH-MM-SS` (no colons) in any timestamped filenames written to `cache-memory` (artifacts reject colons; see [memory.md](memory.md#filename-safety))

## Stateful Scanning (repo-memory)

Use `repo-memory` to persist a baseline JSON file between scheduled runs so the workflow only alerts on *new* findings — vulnerability scans, dependency audits, licence checks, or any "track changes over time" scenario. Unlike `cache-memory`, the baseline survives cache expiry, so a missed cycle does not flood the repo with duplicate issues.

**Worked example: nightly npm vulnerability scan**

```markdown
---
description: Nightly npm vulnerability scan — alerts only on new advisories
on:
  schedule:
    - cron: "0 2 * * *"
permissions:
  issues: write
  contents: read
engine: claude
tools:
  repo-memory:
    allowed-extensions: [".json"]
network:
  allowed:
    - registry.npmjs.org
safe-outputs:
  create-issue:
    title-prefix: "[vuln] "
    labels: [security, automated]
    max: 5
timeout-minutes: 20
---

Load `/tmp/gh-aw/repo-memory/default/vuln-baseline.json`.
If missing, treat the baseline as `[]` (first run).

Run `npm audit --json`. Collect each advisory's id, severity, title, and URL.

Diff against the baseline:
- **New** (in current, not in baseline) → open a `create-issue` per finding (max 5).
- **Resolved** (in baseline, not in current) → log only.
- If no new findings, use the `noop` safe output.

Write the current advisory IDs to `/tmp/gh-aw/repo-memory/default/vuln-baseline.json` as a JSON array.
```

**Key design decisions**

- **`repo-memory` for baselines, not `cache-memory`** — caches expire after 7 days; a lost baseline makes every known finding appear "new" on the next run, flooding the repo with duplicate issues
- **First-run handling** — treat a missing baseline file as `[]` and write it at the end of the first run, giving subsequent runs a clean starting point
- **`max:` flood guard** — caps issues opened per run; use `max: 5` for nightly scans, `max: 1` for secret alerts, `max: 10` for weekly audits
- **Engine restriction** — `repo-memory` requires Claude or a custom engine; it is **not available** for the Copilot engine
- **Baseline schema** — store only stable identifiers (advisory ID strings), not mutable fields like severity, to avoid false "new" alerts when metadata changes


<!-- file: memory.md -->

---
description: Guide for choosing the right persistent memory strategy in agentic workflows — cache-memory, repo-memory, and repo-memory with wiki. Covers deduplication, stateful baseline comparison (metrics/coverage), and stateful scanning ("alert on new X").
---

# Persistent Memory in Agentic Workflows

For workflows that **persist state across runs** — deduplication, incremental processing, cross-run context, or knowledge accumulation.

> ⚠️ **`repo-memory` is NOT a synonym for `cache-memory`**. Different backends, different tradeoffs. `cache-memory` is almost always the right first choice.

---

## Quick Decision Guide

| Need | Use |
|---|---|
| Skip already-processed items (deduplication) | `cache-memory` ✅ first choice |
| Round-robin processing across runs | `cache-memory` ✅ first choice |
| Store ephemeral run state, analysis notes, or intermediate results | `cache-memory` ✅ first choice |
| Track a numeric metric and compare current vs. baseline (runs at least every 7 days) | `cache-memory` ✅ first choice |
| Long-lived knowledge base visible in PRs and code reviews | `repo-memory` |
| Baselines that must survive cache expiry (e.g. security findings, dedup lists) | `repo-memory` |
| Human-readable wiki pages for knowledge accumulation | `repo-memory` with `wiki: true` |
| Persist notes/state inline on the triggering issue or PR | `comment-memory` |

**Default to `cache-memory` unless you have a specific reason to use `repo-memory`.**

---

## Built-in Memory via GitHub Graph and Git History

Before writing new persistent files, check whether GitHub and Git already expose the state you need.

### Practical strategies

| Goal | Built-in source | Caching strategy |
|---|---|---|
| Skip stale files in docs/code scans | Git history (`git log` / last modified commit per file) | Cache a repo watermark SHA or per-file SHAs; compare changed paths in newer commits |
| Avoid reopening known incidents | Issue/PR history (open + closed by label/title prefix) | Cache only canonical identifiers (issue numbers, advisory IDs) |
| Process incrementally across repo activity | PR merge history (`merged_at`, base branch) | Cache last merged PR number/timestamp; fetch only newer merges |
| Keep nightly triage focused | Issue timeline (`updated_at`, comments) | Cache last scan cursor (`updated_at` watermark); inspect only newer updates |
| Reuse expensive relationship lookups | GitHub graph links (issue ↔ PR ↔ commit) | Cache normalized link maps keyed by stable node IDs |

### Design guidance

- Prefer **stable identifiers** (`node_id`, issue/PR number, commit SHA) over mutable text.
- Persist **watermarks** (timestamp, commit SHA, PR number) instead of full snapshots.
- Treat built-in history as source of truth; store only incremental resume state.
- For cheap bounded queries (latest 20-100 items), recompute from GitHub/git instead of storing derived datasets.

---

## `cache-memory` — First Choice

GitHub Actions cache (`actions/cache`) persisting `/tmp/gh-aw/cache-memory/` via `@modelcontextprotocol/server-memory` MCP.

### When to use

- **Deduplication**: track processed items (issues, PRs, URLs, IDs)
- **Round-robin / incremental**: remember position across scheduled runs
- **Ephemeral structured state**: JSON blobs, queues, intermediate results
- **Metric baseline comparison**: store coverage/score/count, compare next run (see [Stateful Analysis](#stateful-analysis--baseline-comparison))
- **Visual regression baselines**: screenshots between PR runs (see `visual-regression.md`)
- **Tool call caching**: avoid redundant API calls

### Configuration

```yaml
tools:
  cache-memory: true
```

Custom key:

```yaml
tools:
  cache-memory:
    key: dedup-${{ github.event.schedule }}-${{ github.run_id }}
    retention-days: 30
    allowed-extensions: [".json"]
```

Multiple named caches:

```yaml
tools:
  cache-memory:
    - id: processed
      key: processed-items-${{ github.run_id }}
    - id: results
      key: results-${{ github.run_id }}
      retention-days: 14
```

### Storage path

- Single cache: `/tmp/gh-aw/cache-memory/`
- Multiple caches: `/tmp/gh-aw/cache-memory/{id}/`

### Branch scoping

Caches are **branch-scoped** with default-branch fallback restore. A feature branch's first restore comes from `main`; subsequent saves fork a branch-local lineage. For warmed-state workflows, schedule on the default branch to reuse one lineage instead of fragmenting state.

### Deduplication example (scheduled workflow)

```markdown
---
on:
  schedule:
    - cron: "0 9 * * *"
permissions:
  issues: read
engine: copilot
tools:
  github:
    toolsets: [issues]
  cache-memory: true
safe-outputs:
  create-issue:
    title-prefix: "[daily-digest] "
    close-older-issues: true
    labels: [automation]
timeout-minutes: 15
---

Fetch the 20 most recently updated open issues.

Load `/tmp/gh-aw/cache-memory/processed.json` if it exists; it contains issue numbers from past digests. Skip any whose number already appears.

Summarize remaining (new) issues. If none, use the `noop` safe output.

Before finishing, write the updated processed-issue list back to `/tmp/gh-aw/cache-memory/processed.json` using filesystem-safe timestamp `YYYY-MM-DD-HH-MM-SS` (no colons, no `T`, no `Z`).
```

### Stateful Analysis / Baseline Comparison

Persist a baseline metric (coverage %, build time, benchmark score, audit count) and alert on regression. Cache miss → "no comparison this run" and baseline refreshes silently — tolerable for short-lived quality gates. If a lost baseline causes serious side-effects (e.g. duplicate security issues), use `repo-memory` instead (see [Stateful Scanning Pattern (repo-memory)](#stateful-scanning-pattern-repo-memory)).

> **Worked example** (coverage delta on every PR, with key design decisions): [memory-stateful-patterns.md](memory-stateful-patterns.md#baseline-comparison-cache-memory).

### Tradeoffs

| ✅ Pros | ❌ Cons |
|---|---|
| Zero repository noise — no commits, no PRs | Evicted when cache expires (default 7 days; use `retention-days` to extend up to 90) |
| Fast: no Git operations required | Not human-readable in GitHub UI |
| Works with Copilot, Claude, and custom engines | Data loss if cache is invalidated or expires |
| Supports multiple isolated caches per workflow | Files are uploaded as GitHub Actions artifacts — **no colons in filenames** |
| Scoped to workflow by default | |

### Filename safety

Cache-memory files upload as Actions artifacts. **Filenames must not contain colons** (NTFS limitation).

```bash
# ✅ GOOD
/tmp/gh-aw/cache-memory/state-2026-02-12-11-20-45.json

# ❌ BAD — colon breaks artifact upload
/tmp/gh-aw/cache-memory/state-2026-02-12T11:20:45Z.json
```

When instructing the agent for timestamped files, say: "Use `YYYY-MM-DD-HH-MM-SS` (no colons, no `T`, no `Z`)."

---

## `repo-memory` — Long-lived Repository Knowledge

Uses a dedicated Git branch (default: `memory/agent-notes`) to store files that persist indefinitely until explicitly deleted. The directory lives at `/tmp/gh-aw/repo-memory/`.

### When to use

- Knowledge must survive cache expiration
- Memory should be **visible in the repository** (auditable via Git history)
- Knowledge base grows over time (architecture notes, known issues)
- Changes need to appear in diffs and be reviewable

### Configuration

```yaml
tools:
  repo-memory:
    branch-name: memory/agent-notes   # Optional
    target-repo: owner/other-repo     # Optional: store in another repo
    allowed-extensions: [".json", ".md"]
    format-json: true                 # Optional: pretty-print .json (default: false)
    max-file-size: 10240              # bytes
    max-file-count: 100
```

Compiler creates a separate `push_repo_memory` job with `contents: write`; main agent job stays read-only.

### Tradeoffs

| ✅ Pros | ❌ Cons |
|---|---|
| Persists indefinitely (no expiry) | Produces Git commits — repository noise |
| Auditable: Git history shows every change | Slower: requires Git clone + push |
| Survives cache invalidation | Not available for Copilot engine (requires GitHub tools) |
| Human-readable via GitHub branch UI | More complex setup |
| Can target a different repository | |

---

## `repo-memory` with `wiki: true` — GitHub Wiki Backend

`repo-memory` variant that stores files in the **GitHub Wiki** (`<repo>.wiki.git`) instead of a branch.

### When to use

- Structured, human-readable documentation pages
- Knowledge for **human consumption** (browsable wikis)
- Living knowledge base or FAQ

### Configuration

```yaml
tools:
  repo-memory:
    wiki: true
    allowed-extensions: [".md"]
```

The compiler creates a separate `push_repo_memory` job with `contents: write`; the main agent job stays read-only.

Use GitHub Wiki conventions: `[[Page Name]]` for internal links, hyphens instead of spaces in filenames.

### Tradeoffs

| ✅ Pros | ❌ Cons |
|---|---|
| Browsable in the GitHub Wiki UI | Produces Git commits to wiki repo |
| Great for human-readable knowledge bases | Restricted to `.md` files in practice |
| Standard Markdown with wiki link syntax | Less suitable for structured JSON state |
| Separate from main repo history | |

---

## `comment-memory` — Managed Comment Persistence

Uses a `<gh-aw-comment-memory>` XML block in an issue/PR comment as persistent memory. The agent edits markdown files under `/tmp/gh-aw/comment-memory/`; the safe-output processor syncs changes back to the managed comment.

### When to use

- Workflow notes/statuses visible inline on the triggering issue or PR
- State tied to a specific issue or PR lifecycle
- Running track records (status tables, checklists, summaries) readable without leaving the issue

Do NOT use for high-volume ephemeral state (use `cache-memory`), long-lived knowledge bases (use `repo-memory`), or cross-issue data.

### Configuration

```yaml
tools:
  comment-memory: true   # enable with defaults
```

Advanced:

```yaml
tools:
  comment-memory:
    memory-id: status          # Optional: identifier in XML marker (default: "default")
    target: triggering         # Optional: "triggering" (default), "*", or explicit number
    target-repo: owner/other   # Optional: cross-repository
    max: 1                     # Optional: max updates per run (default: 1)
    footer: false              # Optional: omit AI-generated footer (default: true)
```

### How it works

1. **Pre-agent**: reads `<gh-aw-comment-memory id="<memory-id>">` and writes to `/tmp/gh-aw/comment-memory/<memory_id>.md`.
2. **Agent**: edits the markdown file directly — no safe-output tool call needed.
3. **Post-agent**: processor reads edited file and upserts the managed comment, replacing only the XML-fenced block.

Multiple memory IDs in one comment are supported; each maps to a separate `*.md` file.

### Tradeoffs

| ✅ Pros | ❌ Cons |
|---|---|
| Visible in GitHub UI inline on the issue/PR | Requires `issues:write` or `pull-requests:write` |
| No separate branch or cache | One comment block per `memory-id` per target |
| Agent edits plain markdown — no tool call needed | Not suited for large structured data |
| Tied to issue/PR lifecycle | Not available without a triggering issue or PR |

---

## Stateful Scanning Pattern (repo-memory)

Persist a baseline JSON file between runs to alert only on *new* findings — vulnerability scans, dependency audits, licence checks. Unlike `cache-memory`, the baseline survives cache expiry, so a missed cycle won't flood the repo with duplicate issues. Store only stable identifiers (advisory IDs), cap output with `max:`, treat missing baseline as `[]`. Requires Claude or custom engine — not Copilot.

> **Worked example** (nightly npm vulnerability scan, with key design decisions): [memory-stateful-patterns.md](memory-stateful-patterns.md#stateful-scanning-repo-memory).

---

## Summary Comparison

| Feature | `cache-memory` | `repo-memory` | `repo-memory` + wiki | `comment-memory` |
|---|---|---|---|---|
| **First choice** | ✅ Yes | No | No | No |
| **Storage backend** | GitHub Actions cache | Git branch | GitHub Wiki | Issue/PR comment |
| **Persistence** | Up to 90 days | Indefinite | Indefinite | Issue/PR lifetime |
| **Compiler adds `contents: write`** | No | Yes (push job) | Yes (push job) | No |
| **Repository noise** | None | Git commits | Wiki commits | Comment updates |
| **Human-readable in GitHub** | No | Via branch UI | Via Wiki UI | ✅ Inline on issue/PR |
| **Structured data (JSON)** | ✅ Ideal | Possible | Not recommended | Not recommended |
| **Filename restrictions** | No colons in names | None | Hyphens for spaces | None |
| **Engine compatibility** | Copilot, Claude, custom | Claude, custom | Claude, custom | Claude, custom |

---

## Anti-patterns

- ❌ **Do not invent `repo-memory` as a synonym for `cache-memory`** — they are different tools
- ❌ **Do not use `repo-memory` for ephemeral per-run state** — use `cache-memory`
- ❌ **Do not use `cache-memory` when you need indefinite persistence** — use `repo-memory`
- ❌ **Do not include colons in cache-memory filenames** — artifact upload will fail


<!-- file: messages.md -->

---
description: Style guide for workflow status messages (all safe-outputs.messages template types).
---

# Workflow Status Messages

For writing `safe-outputs.messages`. Messages appear in GitHub issues, PR comments, and discussions.

## Rules

**Tone:** Plain and professional. No casual phrases ("Mission accomplished!"), no dramatic language ("interrupted!"), no excitement punctuation (`!!`).

**Emoji:** One per message, at the start. Use the same emoji across `run-started`, `run-success`, `run-failure`, and `footer`. No trailing emojis.

Emoji by domain: 🔍 search · 📐 architecture · 🔬 analysis/security · 📦 dependencies · 📝 docs · 🧪 testing · 🚀 release · 👀 review

## All Message Types

### Status messages (shown on the triggering issue/PR/discussion)

| Key | Variables | Default |
|-----|-----------|---------|
| `run-started` | `{workflow_name}`, `{run_url}`, `{event_type}` | `Agentic [{workflow_name}]({run_url}) triggered by this {event_type}.` |
| `run-success` | `{workflow_name}`, `{run_url}` | `✅ Agentic [{workflow_name}]({run_url}) completed successfully.` |
| `run-failure` | `{workflow_name}`, `{run_url}`, `{status}` | `❌ Agentic [{workflow_name}]({run_url}) {status} and wasn't able to produce a result.` |
| `detection-failure` | `{workflow_name}`, `{run_url}` | `⚠️ Security scanning failed for [{workflow_name}]({run_url}). Review the logs for details.` |

### Footer messages (appended to every AI-generated comment/issue/PR)

| Key | Variables | Default |
|-----|-----------|---------|
| `footer` | `{workflow_name}`, `{run_url}`, `{triggering_number}`, `{triggering_type}`, `{workflow_source}`, `{workflow_source_url}` | *(system default)* |
| `footer-install` | `{workflow_source}`, `{workflow_source_url}` | *(system default)* |
| `footer-workflow-recompile` | `{workflow_name}`, `{run_url}`, `{repository}` | `> Workflow sync report by [{workflow_name}]({run_url}) for {repository}` |
| `footer-workflow-recompile-comment` | `{workflow_name}`, `{run_url}`, `{repository}` | `> Update from [{workflow_name}]({run_url}) for {repository}` |
| `agent-failure-issue` | `{workflow_name}`, `{run_url}` | `> Agent failure tracked by [{workflow_name}]({run_url})` |
| `agent-failure-comment` | `{workflow_name}`, `{run_url}` | `> Agent failure update from [{workflow_name}]({run_url})` |

### Activation comment links (appended to `run-started` comment when resources are created)

| Key | Variables | Default |
|-----|-----------|---------|
| `pull-request-created` | `{item_number}`, `{item_url}` | `Pull request created: [#{item_number}]({item_url})` |
| `issue-created` | `{item_number}`, `{item_url}` | `Issue created: [#{item_number}]({item_url})` |
| `commit-pushed` | `{commit_sha}`, `{short_sha}`, `{commit_url}` | `Commit pushed: [\`{short_sha}\`]({commit_url})` |

### Staged mode messages (shown when `staged: true`)

| Key | Variables | Default |
|-----|-----------|---------|
| `staged-title` | `{operation}` | `🎭 Preview: {operation}` |
| `staged-description` | `{operation}` | `The following {operation} would occur if staged mode was disabled:` |

### Boolean options

| Key | Default | Description |
|-----|---------|-------------|
| `append-only-comments` | `false` | When `true`, creates a new comment for completion instead of editing the activation comment |

## Templates

### `run-started`
```
"{emoji} [{workflow_name}]({run_url}) is [present-tense verb] for this {event_type}..."
```
End with `...`. Use `{event_type}` to show what triggered the run.

### `run-success`
```
"{emoji} [{workflow_name}]({run_url}) has [past-tense completion phrase]."
```
End with `.`. Be specific about what was produced or verified.

### `run-failure`
```
"{emoji} [{workflow_name}]({run_url}) {status}. [One sentence on what could not be completed]."
```
Include `{status}` to surface the failure reason. Keep the follow-up sentence factual.

### `footer`
```
"> {emoji} *[Action noun] by [{workflow_name}]({run_url})*{history_link}"
```
Blockquote + italics. Include `{history_link}` for navigation to run history.

## Examples

✅ **Search workflow:**
```yaml
run-started: "🔍 [{workflow_name}]({run_url}) is searching the web for this {event_type}..."
run-success: "🔍 [{workflow_name}]({run_url}) has completed the web search and posted results."
run-failure: "🔍 [{workflow_name}]({run_url}) {status}. The search could not be completed."
footer:      "> 🔍 *Search results by [{workflow_name}]({run_url})*{history_link}"
```

✅ **Compatibility checker:**
```yaml
run-started: "🔬 [{workflow_name}]({run_url}) is analyzing API compatibility for this {event_type}..."
run-success: "🔬 [{workflow_name}]({run_url}) has completed the compatibility analysis."
run-failure: "🔬 [{workflow_name}]({run_url}) {status}. The compatibility analysis could not be completed."
footer:      "> 🔬 *Compatibility report by [{workflow_name}]({run_url})*{history_link}"
```

❌ **Avoid — casual language, mismatched emojis, trailing decorations:**
```yaml
run-started: "🔍 Brave Search activated! [{workflow_name}]({run_url}) is venturing into the web..."
run-success: "🦁 Mission accomplished! [{workflow_name}]({run_url}) returned with findings. Knowledge acquired! 🏆"
run-failure: "🔍 Search interrupted! [{workflow_name}]({run_url}) {status}. The web remains unexplored..."
footer:      "> 🦁 *Search results brought to you by [{workflow_name}]({run_url})*{history_link}"
```


<!-- file: network.md -->

---
description: Network access configuration reference for gh-aw workflows — valid ecosystem identifiers, domain patterns, and common mistakes to avoid.
---

# Network Access Configuration

The `network` frontmatter controls which domains an AI engine can reach. Enforced by the Agent Workflow Firewall (AWF).

## Quick Reference

```yaml
# Shorthand — use default infrastructure domains only
network: defaults

# Custom — allow defaults plus package registries for a Node.js project
network:
  allowed:
    - defaults
    - node

# Custom — allow specific external APIs
network:
  allowed:
    - defaults
    - api.example.com
    - "*.trusted-partner.com"

# No network access
network:
  allowed: []
```

## Valid Values for `network.allowed`

| Type | Examples | Notes |
|---|---|---|
| **Ecosystem identifier** | `defaults`, `node`, `python` | Expands to a curated list of domains |
| **Exact domain** | `api.example.com`, `registry.npmjs.org` | Must be a fully-qualified domain (FQDN) |
| **Wildcard subdomain** | `*.example.com` | Matches `sub.example.com`, `deep.nested.example.com`, and `example.com` itself |

> ⚠️ **Bare shorthands like `npm`, `pypi`, `localhost` are NOT valid** unless listed below. Unrecognised single-word entries cause a **compile-time error**. Use ecosystem identifiers (`node`, `python`) or explicit FQDNs (`registry.npmjs.org`, `pypi.org`) instead.

## Ecosystem Identifiers

Keywords expanding to curated domain lists:

| Identifier | Runtime / Tool | Key Domains Enabled |
|---|---|---|
| `defaults` | Basic infrastructure | Certificate authorities, Ubuntu package verification, JSON schema |
| `github` | GitHub domains | `*.githubusercontent.com`, `codeload.github.com`, `docs.github.com` |
| `github-actions` | GitHub Actions artifacts | Azure Blob storage for action caches and artifacts |
| `node` | npm / yarn / pnpm | `registry.npmjs.org`, `npmjs.com`, `yarnpkg.com` |
| `python` | pip / PyPI / conda | `pypi.org`, `files.pythonhosted.org`, `pip.pypa.io` |
| `go` | Go modules | `proxy.golang.org`, `sum.golang.org`, `go.dev` |
| `dotnet` | NuGet / .NET | `api.nuget.org`, `nuget.org`, `dotnet.microsoft.com` |
| `java` | Maven / Gradle | `repo1.maven.org`, `plugins.gradle.org`, `jdk.java.net` |
| `ruby` | Bundler / RubyGems | `rubygems.org`, `api.rubygems.org` |
| `rust` | Cargo | `crates.io`, `index.crates.io`, `static.crates.io`, `sh.rustup.rs` |
| `swift` | Swift Package Manager | `swift.org`, `cocoapods.org` |
| `php` | Composer / Packagist | `packagist.org`, `repo.packagist.org`, `getcomposer.org` |
| `dart` | pub.dev | `pub.dev`, `pub.dartlang.org` |
| `haskell` | Hackage / GHCup | `*.hackage.haskell.org`, `get-ghcup.haskell.org` |
| `perl` | CPAN | `cpan.org`, `metacpan.org` |
| `containers` | Docker / GHCR | `ghcr.io`, `registry.hub.docker.com`, `*.docker.io` |
| `playwright` | Playwright browsers | `playwright.download.prss.microsoft.com`, `cdn.playwright.dev` |
| `linux-distros` | apt / yum / apk | `deb.debian.org`, `security.debian.org`, Ubuntu/Alpine mirrors |
| `terraform` | HashiCorp | `releases.hashicorp.com`, `registry.terraform.io` |
| `local` | Loopback addresses | `127.0.0.1`, `::1`, `localhost` |
| `bazel` | Bazel build | `releases.bazel.build`, `bcr.bazel.build` |
| `clojure` | Clojure / Clojars | `clojars.org`, `repo.clojars.org` |
| `deno` | Deno / JSR | `deno.land`, `jsr.io` |
| `elixir` | Hex.pm | `hex.pm`, `repo.hex.pm` |
| `fonts` | Google Fonts | `fonts.googleapis.com`, `fonts.gstatic.com` |
| `julia` | Julia packages | `pkg.julialang.org`, `julialang.org` |
| `kotlin` | Kotlin / JetBrains | `packages.jetbrains.team` |
| `lua` | LuaRocks | `luarocks.org` |
| `node-cdns` | JS CDNs | `cdn.jsdelivr.net`, `code.jquery.com`, `unpkg.com` |
| `ocaml` | OPAM | `opam.ocaml.org`, `ocaml.org` |
| `powershell` | PowerShell Gallery | `powershellgallery.com` |
| `r` | CRAN | `cran.r-project.org`, `cloud.r-project.org` |
| `scala` | sbt / Scala | `repo.scala-sbt.org`, `repo1.maven.org` |
| `zig` | Zig packages | `ziglang.org` |
| `dev-tools` | CI/CD tools | Renovate, Codecov, shields.io, and other dev tooling |
| `chrome` | Chrome / Chromium | `*.googleapis.com`, `*.gvt1.com` |
| `latex` | LaTeX / TeX | `ctan.org`, `mirror.ctan.org`, `miktex.org`, `tug.org` |
| `lean` | Lean theorem prover | `lean-lang.org`, `elan.lean-lang.org`, `reservoir.lean-lang.org` |
| `python-native` | Python native build deps | Native toolchain mirrors for building Python packages from source |

## Invalid Shorthands

These look like ecosystem identifiers but are **not recognised** — using them causes a **compile-time error**:

| Invalid value | What you probably meant | Correct value |
|---|---|---|
| `npm` | npm registry | `node` |
| `pypi` | Python Package Index | `python` |
| `pip` | pip package manager | `python` |
| `cargo` | Rust crate registry | `rust` |
| `gem` or `gems` | RubyGems | `ruby` |
| `nuget` | NuGet package registry | `dotnet` |
| `maven` | Maven Central | `java` |
| `gradle` | Gradle plugins | `java` |
| `composer` | PHP Composer | `php` |
| `docker` | Docker Hub / GHCR | `containers` |
| `localhost` | Loopback interface | `local` |

## Domain Pattern Rules

- **Wildcard `*` requires a dot prefix**: `*.example.com` valid; bare `*` blocked (rejected outright in strict mode).
- **No protocol prefix**: `https://api.example.com` is invalid — write `api.example.com`.
- **Subdomains must be explicit**: `github.com` does not cover `api.github.com`; use `*.github.com` or both.

## Inferring Ecosystem From Repository Files

For workflows that build, test, or install packages, add the matching ecosystem alongside `defaults`:

| File indicators | Ecosystem to add | Enables |
|---|---|---|
| `package.json`, `yarn.lock`, `pnpm-lock.yaml`, `.nvmrc` | `node` | `registry.npmjs.org` |
| `requirements.txt`, `pyproject.toml`, `uv.lock`, `Pipfile` | `python` | `pypi.org`, `files.pythonhosted.org` |
| `go.mod`, `go.sum` | `go` | `proxy.golang.org`, `sum.golang.org` |
| `*.csproj`, `*.sln`, `*.slnx` | `dotnet` | `api.nuget.org` |
| `pom.xml`, `build.gradle` | `java` | `repo1.maven.org` |
| `Gemfile`, `*.gemspec` | `ruby` | `rubygems.org` |
| `Cargo.toml` | `rust` | `crates.io` |
| `Package.swift` | `swift` | `swift.org` |
| `composer.json` | `php` | `packagist.org` |
| `pubspec.yaml` | `dart` | `pub.dev` |

> ⚠️ **`network: defaults` alone is never sufficient for code workflows** — `defaults` covers basic infrastructure (CAs, Ubuntu verification) but not package registries. Always add the language ecosystem.

## Common Patterns

### Workflow that reads GitHub data only

```yaml
network:
  allowed:
    - defaults
    - github
```

### Node.js CI workflow

```yaml
network:
  allowed:
    - defaults
    - node
```

### Multi-language project

```yaml
network:
  allowed:
    - defaults
    - node
    - python
```

### Calling an external API

```yaml
network:
  allowed:
    - defaults
    - api.myservice.com
    - "*.myservice.com"
```

### No outbound network access

```yaml
network:
  allowed: []
```


<!-- file: optimize-agentic-workflow.md -->

---
description: Analyze and reduce token consumption in agentic workflows — audit-based measurement, DataOps, gh-proxy, sub-agents, and prompt optimization.
disable-model-invocation: true
---

# Agentic Workflow Token Optimizer

Help users reduce the AI token usage and cost of GitHub Agentic Workflows in this repository.

## Load These References First

- [github-agentic-workflows.md](github-agentic-workflows.md)
- [token-optimization.md](token-optimization.md)
- [workflow-editing.md](workflow-editing.md)
- [syntax.md](syntax.md)

Load these only when relevant:

- [experiments.md](experiments.md)
- [safe-outputs.md](safe-outputs.md)

## Available Commands

```bash
gh aw audit <run-id> --json
gh aw audit <base-run-id> <optimized-run-id>
gh aw logs <workflow-name> --json
gh aw compile <workflow-name>
gh aw status
```

## Start the Conversation

Ask for one of these inputs:

- a workflow run URL (or run ID) to analyze
- a workflow name to review the source
- the guardrail that was exceeded (max-ai-credits, max-daily-ai-credits, max-tool-denials, max-turns / timeout)

## Fast Path: Run URL Provided

If the user gives a GitHub Actions run URL:

1. Extract the run ID
2. Run `gh aw audit <run-id> --json`
3. Inspect `agent_usage.aic`, `agent_usage.input_tokens`, `agent_usage.output_tokens`, `agent_usage.cache_read_tokens`
4. Identify the most expensive phases before asking additional questions

## Guardrail-Specific Entry Points

### `max-ai-credits` exceeded

The workflow was stopped because it consumed more AI Credits than the configured per-run budget.

Priority checks:
1. Which tool calls dominated token usage? (`token-usage.jsonl`)
2. Is the prompt front-loading large payloads that could be fetched on demand?
3. Are there repetitive extraction steps that sub-agents could handle cheaply?
4. Does the frontier model handle tasks that a small model could do?

### `max-daily-ai-credits` exceeded

The workflow is being blocked because its 24-hour AI Credits budget is exhausted.

Priority checks:
1. What is the run cadence? (scheduled too frequently?)
2. Does the workflow use cheap triage before escalating to the frontier model?
3. Is batching or caching applicable to reduce run frequency?
4. Are there noop early-exits for events that do not require agent action?

### `max-tool-denials` exceeded

The Copilot SDK hit the tool-denial threshold, indicating the prompt attempted actions outside the allowed tool policy.

Priority checks:
1. What tool was repeatedly denied? (last denied reason in the failure issue)
2. Is the tool missing from the workflow's permissions/firewall config?
3. Can the prompt be revised to avoid the denied operation entirely?
4. Would a DataOps pre-step satisfy the data need without a tool call?

### Timeout / `max-turns` exceeded

The agent ran out of time or turns before completing the task.

Priority checks:
1. Is the task decomposable into smaller, faster sub-tasks?
2. Are there long-running tool calls that could be replaced with DataOps pre-steps?
3. Is the prompt asking the agent to do too much in one run?
4. Can `max-turns` or `timeout-minutes` be raised, or should the task be split?

## Optimization Analysis Plan

After measuring token usage, produce a prioritized plan:

1. **Measure** — run `gh aw audit <run-id>` and summarize AI Credits and per-call token breakdown
2. **Identify top cost drivers** — list the three most expensive phases/tool calls
3. **Apply quick wins first** — DataOps pre-steps, `gh-proxy`, `cli-proxy`, prompt trimming
4. **Sub-agent delegation** — identify repetitive per-item loops suitable for small-model workers
5. **Prompt caching** — verify stable context appears before dynamic content
6. **Experiment** — add an `experiments:` entry with `metric: "aic"` to measure the change
7. **Validate quality** — confirm the optimized run produces equivalent safe outputs

Present the plan clearly before making any edits. Confirm with the user before applying changes.

## Editing Workflow

1. Edit `.github/workflows/<workflow-name>.md`
2. Recompile: `gh aw compile <workflow-name>`
3. Commit both the source and the generated `.lock.yml`
4. Report the estimated savings and link to the PR or commit


<!-- file: patterns.md -->

---
description: Agentic workflow pattern router for selecting the best documented pattern and playbook.
disable-model-invocation: true
---

# Agentic Workflow Patterns Router

Use this router when a user asks for a workflow architecture, strategy, operating model, or design pattern.

## Routing Rules

1. Identify the user's primary goal and constraints.
2. Match the request to the closest pattern in the index below.
3. Load and follow the matched pattern document.
4. If multiple patterns apply, pick one primary pattern and list 1-2 secondary patterns to combine.
5. If no pattern clearly fits, ask a short clarifying question before proceeding.

## Pattern Index

Pattern docs base path: `https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/`

### MonitorOps
- **Load when:** The user needs repository-wide workflow observability, trend reporting, and escalation for recurring failures or token waste.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/monitor-ops.md

### BatchOps
- **Load when:** The user needs to process large worksets in shards/chunks with throttling and aggregation.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/batch-ops.md

### CentralRepoOps
- **Load when:** The user needs a private control repository that coordinates rollouts across many target repositories.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/central-repo-ops.mdx

### ChatOps
- **Load when:** The user wants slash-command driven, human-in-the-loop automation in issues or pull requests.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/chat-ops.md

### CorrectionOps
- **Load when:** The user wants to improve workflow behavior from trusted human corrections without retraining the model.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/experimental/correction-ops.md

### DailyOps
- **Load when:** The user wants scheduled, small, recurring improvements that compound over time.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/daily-ops.md

### DeterministicOps
- **Load when:** The user needs deterministic data collection steps followed by agentic analysis and reporting.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/deterministic-ops.md

### DispatchOps
- **Load when:** The user needs manual trigger flows (`workflow_dispatch`) with custom inputs for testing or controlled runs.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/dispatch-ops.md

### IssueOps
- **Load when:** The user needs fully automated issue triage, categorization, and responses on issue events.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/issue-ops.md

### LabelOps
- **Load when:** The user needs label-driven workflow behavior when specific labels are added or removed.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/label-ops.md

### Monitoring with Projects
- **Load when:** The user needs durable tracking and monitoring of work items with GitHub Projects and safe outputs.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/experimental/monitoring-with-projects.md

### MultiRepoOps
- **Load when:** The user needs coordination and synchronization across multiple repositories.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/multi-repo-ops.md

### Orchestration
- **Load when:** The user needs orchestrator/worker architecture using reusable workflows or workflow dispatch.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/orchestration.md

### ProjectOps
- **Load when:** The user needs intelligent routing and controlled field updates in GitHub Projects.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/project-ops.mdx

### ResearchPlanAssignOps
- **Load when:** The user needs a flow from deep research to planning to automated issue assignment/implementation.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/research-plan-assign-ops.md

### SideRepoOps
- **Load when:** The user wants low-friction reporting/automation from a side repository targeting a primary repository.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/side-repo-ops.mdx

### SpecOps
- **Load when:** The user needs to maintain formal specifications and propagate spec updates to consuming implementations.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/spec-ops.md

### TrialOps
- **Load when:** The user needs isolated trial repositories to validate workflows before production rollout.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/experimental/trial-ops.md

### WorkQueueOps
- **Load when:** The user needs durable queue processing for many items via issues, sub-issues, discussions, or cache-memory.
- **Pattern doc:** https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/workqueue-ops.md

## Notes

- Prefer documented patterns over ad hoc architecture when a strong match exists.
- When relevant, combine pattern guidance with core workflow rules from:
  - https://github.com/github/gh-aw/blob/main/.github/aw/github-agentic-workflows.md


<!-- file: pr-reviewer.md -->

---
description: Guidance for implementing PR reviewer agentic workflows with ready_for_review triggers, centralized slash commands, and safe review actions.
---

## PR Reviewer Workflow Pattern

For reviewer workflows that run automatically when a PR is ready and manually via slash command.

## Trigger Model

```yaml
on:
  pull_request:
    types: [ready_for_review]
  slash_command:
    strategy: centralized
    name: review
    events: [pull_request_comment, pull_request_review_comment]
```

`ready_for_review` starts review when drafts become reviewable. Centralized routing handles both PR comments and review comments via one entrypoint.

## Safe Outputs

- `create-pull-request-review-comment` — line-level feedback
- `resolve-pull-request-review-thread` — resolved threads
- `submit-pull-request-review` — final review state
- `update-pull-request-review` — amend an existing review

Keep `max` caps conservative to avoid runaway reviews.

## Integrity and GitHub Tool Access

```yaml
tools:
  github:
    min-integrity: approved
    toolsets: [pull_requests, issues, repos]
```

- Prefer `pull_requests` for reviewer operations.
- Add `issues` only when interacting with issue-style comment surfaces or cross-links.
- Use the lowest `min-integrity` that supports the required actions.

## Examples

- `.github/workflows/pr-code-quality-reviewer.md`
- `.github/workflows/mattpocock-skills-reviewer.md`
- `.github/workflows/test-quality-sentinel.md`


<!-- file: report.md -->

---
description: Guidelines for creating agentic workflows that generate reports — output type selection, formatting style, and automatic cleanup.
---

# Report Generation

For workflows that generate reports — status updates, audits, summaries — posted as GitHub issues, discussions, or comments.

## Choosing the Output Type

| Use case | Recommended output |
|---|---|
| Report (default) | `create-issue` with `close-older-issues` |
| Inline update on an existing issue or PR | `add-comment` with `hide-older-comments` |
| Discussion-based report (only when explicitly requested) | `create-discussion` with `close-older-discussions` |

Default to `create-issue`. Use `create-discussion` only when the requester explicitly wants threaded async collaboration.

## Automatic Cleanup

- **`expires`** — auto-close after a window (e.g. `7`, `2w`, `1m`).
- **`close-older-issues: true`** — close previous issues from the same workflow. Requires `title-prefix` or `labels`.
- **`close-older-discussions: true`** — close older matching discussions as "OUTDATED". Requires `title-prefix` or `labels`.
- **`hide-older-comments: true`** — minimize previous comments. Useful for rolling status updates.

**Recommended for recurring reports**: `create-issue` with `close-older-issues: true` and a stable `title-prefix`.

```yaml
safe-outputs:
  create-issue:
    title-prefix: "Weekly Status:"
    labels: [report]
    close-older-issues: true
    expires: 30
```

## Scheduled Report Window Scoping

Define the report window explicitly in the prompt so runs are deterministic and comparable.

Window examples:

- `last 24 full hours ending at workflow start (UTC)`
- `last 7 full days ending at workflow start (UTC)`
- `since previous successful run timestamp`
- `current calendar week to date (UTC, Monday 00:00 to now)`

Strategy: fixed durations for trend comparisons, run-based windows for continuous monitoring, calendar windows for stakeholder reporting.

When the window has no qualifying updates, call `noop` with the evaluated window in the message:
`noop("No updates in last 24 full hours ({{window_start_utc}} to {{window_end_utc}})")`

## Fallback for Incomplete Metadata

When the digest or report depends on labels, metadata, or classification fields (for example customer-impact labels, priority tiers, team assignments, or area tags) that are absent or inconsistent:

- summarize what data *was* available and note which fields are missing
- group by the next-best available dimension (for example repository, author, date, or milestone)
- use an explicit "Unclassified" bucket for items without required metadata — do not invent or assume classifications
- call `noop` only when the reporting window itself has zero events; missing metadata alone is not a reason to skip the report

## Report Style and Structure

### Header Levels

- Use `###` (h3) for main sections — e.g., `### Test Summary`
- Use `####` (h4) for subsections — e.g., `#### Device-Specific Results`
- Never use `##` (h2) or `#` (h1) — those are reserved for titles

### Progressive Disclosure

Wrap verbose logs, secondary info, and per-item breakdowns in `<details><summary>Section Name</summary>`. Keep summary, critical issues, and key metrics visible.

### Alerts Instead of Emojis

- `> [!NOTE]` — neutral status
- `> [!WARNING]` — warnings
- `> [!CAUTION]` — high-risk or blocking

Do not use emoji severity markers (`✅`, `⚠️`, `❌`, `🧪`).

### Structure Pattern

1. **Overview** — 1–2 paragraphs of key findings
2. **Critical info** — summary stats, critical issues (always visible)
3. **Details** — `<details><summary>...</summary>` for expanded content
4. **Context** — workflow run, date, trigger

### Example Report Structure

```markdown
### Summary
- Key metric 1: value
- Key metric 2: value

> [!WARNING]
> Status: degradation detected in one or more checks.

### Critical Issues
[Always visible - these are important]

<details>
<summary>View Detailed Results</summary>

[Comprehensive details, logs, traces]

</details>

<details>
<summary>View All Warnings</summary>

[Minor issues and potential problems]

</details>

### Recommendations
[Actionable next steps - keep visible]
```

## Workflow Run References

- Format run IDs as links: `[§12345](https://github.com/owner/repo/actions/runs/12345)`
- Include up to 3 most relevant run URLs at the end under `**References:**`
- Do NOT add footer attribution — the system appends it automatically

## Avoiding Mentions and Backlinks

Without filtering, `@username` notifies users and `#123` creates backlinks every run.

- **`mentions: false`** — escapes all `@mentions`.
- **`allowed-github-references: []`** — escapes `#123` / `owner/repo#123`.
- **`max-bot-mentions: 0`** — neutralizes bot-trigger phrases like `fixes #123` / `closes #456`.

```yaml
safe-outputs:
  mentions: false
  allowed-github-references: []
  max-bot-mentions: 0
  create-issue:
    title-prefix: "Weekly Status:"
    labels: [report]
    close-older-issues: true
    expires: 30
```

Applies globally to all safe-output types (issues, comments, discussions).


<!-- file: reuse.md -->

---
description: Imports, shared components, import-schema, and gh aw add/update for GitHub Agentic Workflows
---

# Imports & Reusability

Shared components eliminate duplication of tool configs, prompts, MCP servers, and safe-output jobs across workflows. Consumers get updates automatically when shared files change.

---

## Merged Fields

Only these frontmatter fields are merged on import:

| Field | Merge behaviour |
|---|---|
| `tools:`, `mcp-servers:`, `safe-outputs:`, `network:`, `permissions:`, `runtimes:`, `services:`, `cache:`, `features:` | Deep-merged |
| `env:` | Merged; duplicate keys → compile error |
| `github-app:`, `on.github-app:` | First-wins across imports |
| `steps:`, `pre-agent-steps:`, `post-steps:` | Appended in import order |
| `jobs.<job-id>.setup-steps`, `jobs.<job-id>.pre-steps` | For each job, imported steps run first, then main workflow steps; `setup-steps` remains separate from `pre-steps` |
| Markdown body | Appended as prompt instructions |

All other fields (`on:`, `engine:`, `timeout-minutes:`, …) are ignored in imported files.

---

## `imports:` Field

```yaml
# String form
imports:
  - shared/reporting.md
  - shared/mcp/tavily.md
  - copilot-setup-steps.yml   # merges copilot-setup-steps job steps

# Object form — pass values to import-schema:
imports:
  - uses: shared/repo-memory-standard.md
    with:
      branch-name: "memory/issue-triage"
      description: "Issue triage historical data"
  - path: shared/tool-setup.md
    with:
      environment: staging
```

`path`/`uses` and `with`/`inputs` are the only valid keys on an import entry. To supply environment variables or a checkout ref, set top-level `env:`/`checkout:` frontmatter inside the imported file itself; those are merged into the importing workflow (see [syntax-tools-imports.md](syntax-tools-imports.md)). `with:` values are accessed inside the shared file as `${{ github.aw.import-inputs.<name> }}`.

---

## `import-schema:` Field

Declare typed parameters consumers supply:

```yaml
---
import-schema:
  branch-name:
    type: string
    required: true
    description: "Branch name for storage (e.g. memory/my-workflow)"
  max-items:
    type: number
    default: 50
    description: "Maximum items to retain"
  environment:
    type: choice
    options: [dev, staging, prod]
    required: true

tools:
  repo-memory:
    branch-name: ${{ github.aw.import-inputs.branch-name }}
---
```

### Input types

| Type | Notes |
|---|---|
| `string` | Free-form text |
| `number` | Integer or float |
| `boolean` | `true` / `false` |
| `choice` | Enumerated; must supply `options:` |
| `array` | List of values |
| `object` | Sub-fields via `${{ github.aw.import-inputs.<name>.<subkey> }}` |

---

## Refactoring Patterns

### 1 — Extract shared MCP server / tool config

`.github/workflows/shared/mcp/tavily.md`:

```markdown
---
mcp-servers:
  tavily:
    url: "https://mcp.tavily.com/mcp/"
    env:
      TAVILY_API_KEY: "${{ secrets.TAVILY_API_KEY }}"
    allowed: [search, extract]
---
```

Import with one line:

```yaml
imports:
  - shared/mcp/tavily.md
```

### 2 — Extract shared prompt instructions

```markdown
<!-- shared/keep-it-short.md -->
---
---

Keep all output concise. Use bullet points, not paragraphs.
Never repeat information already visible in the GitHub UI.
```

### 3 — Parameterise with `import-schema:`

```markdown
<!-- shared/jira-mcp.md -->
---
import-schema:
  project-key:
    type: string
    required: true
    description: "Jira project key (e.g. ENG, INFRA)"

mcp-servers:
  jira:
    container: "mcp/jira"
    version: "latest"
    env:
      JIRA_TOKEN: "${{ secrets.JIRA_TOKEN }}"
      JIRA_PROJECT: ${{ github.aw.import-inputs.project-key }}
    allowed: [search_issues, get_issue, list_sprints]
---
```

```yaml
imports:
  - uses: shared/jira-mcp.md
    with:
      project-key: "ENG"
```

### 4 — Compose multiple imports

```yaml
---
on:
  schedule: weekly on monday
imports:
  - shared/mcp/tavily.md
  - shared/gh.md
  - shared/reporting.md
  - uses: shared/repo-memory-standard.md
    with:
      branch-name: "memory/weekly-research"
      description: "Weekly research snapshots"
---

Conduct weekly research on ${{ github.repository }} dependencies...
```

### 5 — Shared safe-output job

```markdown
<!-- shared/slack-notify.md -->
---
import-schema:
  channel:
    type: string
    required: true

safe-outputs:
  jobs:
    send-slack-notification:
      description: "Post a message to Slack"
      runs-on: ubuntu-latest
      output: "Slack notification sent"
      inputs:
        message:
          description: "Message text"
          required: true
          type: string
      permissions:
        contents: read
      steps:
        - name: Post to Slack
          uses: actions/github-script@v7
          env:
            SLACK_TOKEN: "${{ secrets.SLACK_TOKEN }}"
            CHANNEL: ${{ github.aw.import-inputs.channel }}
          with:
            script: |
              // post message to channel
---
```

```yaml
imports:
  - uses: shared/slack-notify.md
    with:
      channel: "#engineering-alerts"
```

---

## External Imports

### `gh aw add` — Install a remote shared component

```bash
gh aw add https://github.com/org/agentics/blob/main/workflows/shared/reporting.md
```

Stored under `.github/aw/imports/org/agentics/<sha>/`. Reference via that local path. The `source:` field tracks origin for updates.

MCP equivalent: `Use the add tool with url: "<url>"`

### `gh aw update` — Refresh all external imports

```bash
gh aw update
```

Re-fetches every file under `.github/aw/imports/` using `source:`. Follows `redirect:` and rewrites `source:` automatically.

MCP equivalent: `Use the update tool`

### Fields for publishable shared components

```yaml
---
source: "org/agentics/workflows/shared/my-component.md@main"
redirect: "org/agentics/workflows/shared/my-component-v2.md@main"
resources:
  - shared/mcp/dependency.md   # fetched alongside this file
private: false                 # true → prevent gh aw add from sharing
import-schema:
  # ...
---
```

---

## Recommended Directory Layout

```
.github/
└── workflows/
    ├── my-workflow.md
    ├── my-workflow.lock.yml        # auto-generated
    └── shared/
        ├── mcp/
        │   ├── tavily.md
        │   ├── notion.md
        │   └── github-mcp-app.md
        ├── reporting.md
        ├── gh.md
        ├── keep-it-short.md
        └── repo-memory-standard.md
.github/aw/
└── imports/                        # installed via gh aw add
    └── org/repo/<sha>/
        └── workflows_shared_component.md
```

---

## Compile-Time Behaviour

- Imports resolved at **compile time**; `.lock.yml` loads shared `.md` bodies at **runtime** — edits to shared bodies take effect next run without recompile.
- **`inlined-imports: true`** — bundles imported content at compile time (required for ruleset status check workflows). Cannot combine with `.github/agents/` file imports.
- Changes to the `imports:` list require recompile: `gh aw compile <workflow-name>`.
- Editing only the *body* of a shared `.md` (not its frontmatter) does **not** require recompile.

---

## Quick Checklist: Extracting a Shared Component

1. Identify the repeated frontmatter block or prompt section
2. Create `.github/workflows/shared/<name>.md` with the content
3. Add `import-schema:` if values differ per consumer
4. Replace duplicates with an `imports:` entry
5. Recompile: `gh aw compile` (or `gh aw compile <name>`)
6. Verify: `gh aw compile --strict`


<!-- file: safe-outputs-automation.md -->

---
description: Safe-output reference for workflow dispatch, code scanning, checks, agent sessions, and assignment operations.
---

# Safe Outputs: Automation and Orchestration

- `update-discussion:` - Update discussion title, body, or labels

  ```yaml
  safe-outputs:
    update-discussion:
      title: true                     # Optional: enable title updates
      body: true                      # Optional: enable body updates
      labels: true                    # Optional: enable label updates
      allowed-labels: [status, type]  # Optional: restrict to specific labels
      max: 1                          # Optional: max updates (default: 1)
      target: "*"                     # Optional: "triggering" (default), "*", or number
      target-repo: "owner/repo"       # Optional: cross-repository
  ```

- `update-release:` - Update GitHub release descriptions

  ```yaml
  safe-outputs:
    update-release:
      max: 1                          # Optional: max releases (default: 1, max: 10)
      target-repo: "owner/repo"       # Optional: cross-repository
      github-token: ${{ secrets.GH_AW_UPDATE_RELEASE_TOKEN }}  # Optional: custom token
  ```

  Operation types: `replace`, `append`, `prepend`.
- `upload-asset:` - Publish files to orphaned git branch (recommended for images/charts/screenshots)

  ```yaml
  safe-outputs:
    upload-asset:
      branch: "assets/${{ github.workflow }}"  # Optional: branch name
      max-size: 10240                 # Optional: max file size in KB (default: 10MB)
      allowed-exts: [.png, .jpg, .pdf] # Optional: allowed file extensions
      max: 10                         # Optional: max assets (default: 10)
  ```

  Default allowed extensions are common non-executable types; default max file size is 10MB (10240 KB), configurable via `max-size`. **Use this for images, charts, and screenshots that need embeddable URLs in issues/PRs/discussions.**
- `upload-artifact:` - Upload files as run-scoped GitHub Actions artifacts (recommended for temporary run artifacts and attachment-style outputs)

  ```yaml
  safe-outputs:
    upload-artifact:
      max-uploads: 5                  # Optional: max upload_artifact tool calls (default: 1, max: 20)
      retention-days: 7               # Optional: fixed retention period in days (agent cannot override; 1-90; templatable expression supported)
      skip-archive: false             # Optional: fixed skip-archive flag (templatable expression supported); single-file only
      max-size-bytes: 104857600       # Optional: max bytes per upload (default: 100 MB)
      allowed-paths:                  # Optional: glob patterns restricting uploadable paths
        - "reports/**"
        - "*.json"
      filters:                        # Optional: default include/exclude glob filters
        include: ["*.json", "*.csv"]
        exclude: ["*secret*"]
      defaults:                       # Optional: default values injected when agent omits a field
        if-no-files: "ignore"         # "error" or "ignore" when no files match (default: "error")
  ```

  Artifacts are run-scoped and auto-cleaned when they expire. Agents call `upload_artifact` with a `name` and `path`. `retention-days` and `skip-archive` are fixed at the workflow level (templatable via expressions); the agent cannot override them. **Use this for temporary downloadable artifacts and attachment-style arbitrary data** (e.g. a comment/issue linking to a generated file bundle). Set `skip-archive: true` to serve downloads as direct files without uncompressing. Use `upload-asset` instead when you need stable embeddable URLs (images/charts in GitHub content).
- `dispatch-workflow:` - Trigger other workflows with inputs

  ```yaml
  safe-outputs:
    dispatch-workflow:
      workflows: [workflow-name]          # Required: list of workflow names to allow
      max: 3                              # Optional: max dispatches (default: 1, max: 3)
      target-repo: org/other-repo         # Optional: cross-repo dispatch target (owner/repo or expression)
      allowed-repos: [org/*]              # Optional: allowlist for cross-repo dispatch targets
      target-ref: main                    # Optional: ref to dispatch against (overrides caller's GITHUB_REF)
  ```

  Triggers other agentic workflows using workflow_dispatch. Agent output includes `workflow_name` (without .md extension) and optional `inputs` (key-value pairs). Cross-repo dispatch is supported via `target-repo` plus an `allowed-repos` allowlist; cross-repo targets require a token with `actions: write` on the target repository.
- `dispatch_repository:` - Dispatch `repository_dispatch` events to external repositories (experimental)

  ```yaml
  safe-outputs:
    dispatch_repository:
      trigger_ci:                              # Tool name (normalized to MCP tool: trigger_ci)
        description: "Trigger CI in target repo"
        workflow: ci.yml                       # Required: target workflow name (for traceability)
        event_type: ci_trigger                 # Required: repository_dispatch event_type
        repository: org/target-repo           # Required: target repo (or use allowed_repositories)
        # allowed_repositories:               # Alternative: allow multiple target repos
        #   - org/repo1
        #   - org/repo2
        inputs:                               # Optional: input schema for agent
          environment:
            type: string
            description: "Deployment environment"
            required: true
        max: 1                                # Optional: max dispatches (templatable)
        github-token: ${{ secrets.MY_PAT }}   # Optional: override token
        staged: false                         # Optional: preview-only mode
  ```

  Accepts both `dispatch_repository` (underscore, preferred) and `dispatch-repository` (dash). Each key in the config defines a named MCP tool. Requires a token with `repo` scope since `GITHUB_TOKEN` cannot trigger `repository_dispatch` in external repositories. Use `github-token` or set a PAT as `GH_AW_SAFE_OUTPUTS_TOKEN`.

  **⚠️ Experimental**: Compilation emits a warning when this feature is used.
- `call-workflow:` - Call reusable workflows via workflow_call fan-out (orchestrator pattern)

  ```yaml
  safe-outputs:
    call-workflow:
      workflows: [worker-a, worker-b]     # Required: workflow names (without .md) with workflow_call trigger
      max: 1                              # Optional: max calls per run (default: 1, max: 50)
      github-token: ${{ secrets.TOKEN }}  # Optional: token passed to called workflows
  ```

  Array shorthand: `call-workflow: [worker-a, worker-b]`

  Unlike `dispatch-workflow` (which uses the GitHub Actions API at runtime), `call-workflow` generates static conditional `uses:` jobs at compile time. The agent selects which worker to activate; the compiler validates and wires up all fan-out jobs. Each listed workflow must exist in `.github/workflows/` and declare a `workflow_call` trigger. Use this for orchestrator/dispatcher patterns within the same repository.
- `create-code-scanning-alert:` - Generate SARIF security advisories

  ```yaml
  safe-outputs:
    create-code-scanning-alert:
      max: 50                         # Optional: max findings (default: unlimited)
      driver: "Custom Scanner"        # Optional: SARIF tool.driver.name (default: "GitHub Agentic Workflows Security Scanner")
      github-token: ${{ secrets.MY_TOKEN }}  # Optional: override token for security-events:write
      target-repo: "owner/repo"       # Optional: cross-repository
      allowed-repos: [owner/other]    # Optional: additional repos the agent may target via `repo` field
  ```

  Severity levels: error, warning, info, note.
- `autofix-code-scanning-alert:` - Add autofixes to code scanning alerts

  ```yaml
  safe-outputs:
    autofix-code-scanning-alert:
      max: 10                         # Optional: max autofixes (default: 10)
  ```

  Provides automated fixes for code scanning alerts.
- `create-check-run:` - Create GitHub Check Runs to surface agent analysis results in the PR Checks UI

  ```yaml
  safe-outputs:
    create-check-run:
      name: "Security Analysis"       # Optional: check run name (defaults to workflow name)
      max: 1                          # Optional: max check runs per workflow run (default: 1)
      output:                         # Optional: static fallback values used when the agent omits the field
        title: "Pending analysis"     # Fallback title (max 256 chars)
        summary: "Awaiting agent output"  # Fallback summary (max 65535 chars)
  ```

  Requires `checks: write` (added automatically). Agents call `create_check_run` with `conclusion` (e.g., `success`, `failure`, `neutral`), `title`, `summary`, and optional `annotations`. Reports structured results (security findings, code quality, test outcomes) directly on commits and PRs.

- `create-agent-session:` - Create GitHub Copilot coding agent sessions

  ```yaml
  safe-outputs:
    create-agent-session:
      base: main                      # Optional: base branch (defaults to current)
      target-repo: "owner/repo"       # Optional: cross-repository
  ```

  Requires PAT as `COPILOT_GITHUB_TOKEN`.
- `assign-to-agent:` - Assign Copilot coding agent to issues

  ```yaml
  safe-outputs:
    assign-to-agent:
      name: "copilot"                 # Optional: agent name
      model: "claude-sonnet-4-5"      # Optional: model override
      custom-agent: "agent-id"        # Optional: custom agent ID
      custom-instructions: "..."      # Optional: additional instructions for the agent
      allowed: [copilot]              # Optional: restrict to specific agent names
      max: 1                          # Optional: max assignments (default: 1)
      target: "*"                     # Optional: "triggering" (default), "*", or number
      target-repo: "owner/repo"       # Optional: where the issue lives (cross-repository)
      pull-request-repo: "owner/repo" # Optional: where PR should be created (if different)
      allowed-pull-request-repos: [owner/repo1]  # Optional: additional repos for PR creation
      base-branch: "develop"          # Optional: target branch for PR (default: repo default)
      ignore-if-error: true           # Optional: continue workflow on assignment error (default: false)
  ```

  Requires PAT with elevated permissions as `GH_AW_AGENT_TOKEN`.
- `assign-to-user:` - Assign users to issues or pull requests

  ```yaml
  safe-outputs:
    assign-to-user:
      allowed: [user1, user2]         # Optional: restrict to specific users
      blocked: [copilot, "*[bot]"]    # Optional: deny specific users or glob patterns
      max: 3                          # Optional: max assignments (default: 3)
      target: "*"                     # Optional: "triggering" (default), "*", or number
      target-repo: "owner/repo"       # Optional: cross-repository
      unassign-first: true            # Optional: unassign all current assignees first (default: false)
  ```

- `unassign-from-user:` - Remove user assignments from issues or PRs

  ```yaml
  safe-outputs:
    unassign-from-user:
      allowed: [user1, user2]         # Optional: restrict to specific users
      blocked: [copilot, "*[bot]"]    # Optional: deny specific users or glob patterns
      max: 1                          # Optional: max unassignments (default: 1)
      target: "*"                     # Optional: "triggering" (default), "*", or number
      target-repo: "owner/repo"       # Optional: cross-repository
  ```

- `hide-comment:` - Hide comments on issues, PRs, or discussions

  ```yaml
  safe-outputs:
    hide-comment:
      max: 5                          # Optional: max comments to hide (default: 5)
      allowed-reasons:                 # Optional: restrict hide reasons
        - spam
        - outdated
        - resolved
      target-repo: "owner/repo"       # Optional: cross-repository
  ```

  Allowed reasons: `spam`, `abuse`, `off_topic`, `outdated`, `resolved`.
- `set-issue-type:` - Set the type of an issue (requires organization-defined issue types)

  ```yaml
  safe-outputs:
    set-issue-type:
      allowed: [Bug, Feature, Enhancement]  # Optional: restrict to specific issue type names
      target: "triggering"                  # Optional: "triggering" (default), "*", or number
      max: 5                                # Optional: max operations (default: 5)
      target-repo: "owner/repo"             # Optional: cross-repository
  ```

  Set `allowed` to an empty string `""` to allow clearing the issue type. When `allowed` is omitted, any type name is accepted.
- `set-issue-field:` - Set a single issue field value by name/value (avoids the broader update-issue path)

  ```yaml
  safe-outputs:
    set-issue-field:
      allowed-fields: [Priority, Iteration]  # Optional: restrict which issue fields the agent may set (omit/empty = any field; ["*"] explicitly allows all)
      target: "triggering"                    # Optional: "triggering" (default), "*", or number
      max: 5                                  # Optional: max operations (default: 5)
      target-repo: "owner/repo"               # Optional: cross-repository
      allowed-repos: [owner/other]            # Optional: additional repos agent can target
  ```

  Agent calls `set_issue_field` with `value` plus either `field_name` (preferred) or `field_node_id`. `issue_number` is optional and defaults to the triggering issue.
- `noop:` - Log completion message for transparency (auto-enabled)

  ```yaml
  safe-outputs:
    noop:
      report-as-issue: false          # Optional: report noop as issue (default: true)
  ```

  Fallback ensuring workflows never complete silently. Agents emit human-visible messages even when no other action is required (e.g., "Analysis complete - no issues found").
- `missing-tool:` - Report missing tools or functionality (auto-enabled)

  ```yaml
  safe-outputs:
    missing-tool:
      create-issue: true              # Optional: create issues for missing tools (default: true)
      title-prefix: "[missing tool]"  # Optional: prefix for issue titles
      labels: [tool-request]          # Optional: labels for created issues
  ```

  Lets agents report tools or functionality they need but lack; tracks feature requests. When `create-issue` is true, reports create or update GitHub issues.
- `missing-data:` - Report missing data required to complete tasks (auto-enabled)

  ```yaml
  safe-outputs:
    missing-data:
      create-issue: true              # Optional: create issues for missing data (default: true)
      title-prefix: "[missing data]"  # Optional: prefix for issue titles
      labels: [data-request]          # Optional: labels for created issues
  ```

  Lets agents report when required data or information is unavailable. When `create-issue` is true, reports create or update GitHub issues for tracking.

- `report-incomplete:` - Signal that the task could not be completed due to an infrastructure or tool failure (auto-enabled)

  ```yaml
  safe-outputs:
    report-incomplete:
      create-issue: true              # Optional: create issues for incomplete tasks (default: true)
      title-prefix: "[incomplete]"    # Optional: prefix for issue titles
      labels: [agent-failure]         # Optional: labels for created issues
  ```


<!-- file: safe-outputs-content.md -->

---
description: Safe-output reference for issue, discussion, comment, and pull request content operations.
---

# Safe Outputs: GitHub Content

- `create-issue:` - Safe GitHub issue creation (bugs, features)

  ```yaml
  safe-outputs:
    create-issue:
      title-prefix: "[ai] "           # Optional: prefix for issue titles
      labels: [automation, agentic]    # Optional: labels to attach to issues
      allowed-labels: [bug, task]     # Optional: restrict which labels the agent can set (any label allowed if omitted)
      allowed-fields: [Priority, Iteration]  # Optional: restrict which issue fields the agent may set via the `fields` runtime parameter (omit/empty = any field; ["*"] explicitly allows all)
      assignees: [user1, copilot]     # Optional: assignees (use 'copilot' for bot)
      max: 5                          # Optional: maximum number of issues (default: 1)
      expires: 7                      # Optional: auto-close after 7 days (supports: 2h, 7d, 2w, 1m, 1y, or false)
      group: true                     # Optional: group as sub-issues under a parent issue (default: false)
      group-by-day: true              # Optional: group same-day runs into one issue by posting as comments (default: false)
      close-older-issues: true        # Optional: close previous issues from same workflow (default: false)
      close-older-key: "my-key"       # Optional: explicit deduplication key for close-older matching (uses gh-aw-close-key marker)
      deduplicate-by-title: true      # Optional: skip creating an issue when one with the same title exists; integer N allows fuzzy matches up to edit distance N (default: off)
      normalize-closing-keywords: true # Optional: strip backticks around recognized issue-closing keywords in body text
      footer: false                   # Optional: omit AI-generated footer while preserving XML markers (default: true)
      target-repo: "owner/repo"       # Optional: cross-repository
      allowed-repos: [owner/other]    # Optional: additional repos agent can target (agent uses `repo` field in output)
  ```

  `create_issue` output validation requires:
  - `body` minimum length: **20** characters
  - `body` maximum length: **65000** characters

  **Auto-Expiration**: The `expires` field auto-closes issues after a time period. Supports integers (days) or relative formats (2h, 7d, 2w, 1m, 1y). Generates `agentics-maintenance.yml` workflow that runs at minimum required frequency based on shortest expiration time: 1 day or less → every 2 hours, 2 days → every 6 hours, 3-4 days → every 12 hours, 5+ days → daily.
  **Deduplication for Scheduled Workflows**: When `schedule:` is combined with `create-issue`, use `skip-if-match:` in the `on:` block to prevent opening a duplicate issue every run. Pair with `expires:` to clean up stale issues:

  ```yaml
  on:
    schedule: daily on weekdays
    skip-if-match: 'is:issue is:open in:title "[my-workflow] "'
  safe-outputs:
    create-issue:
      title-prefix: "[my-workflow] "
      expires: 7   # auto-close after 7 days
  ```

  Without `skip-if-match`, the workflow creates a new issue on every scheduled run even when an identical open issue already exists.

  **Temporary IDs and Sub-Issues:**
  When creating multiple issues, use `temporary_id` (format: `aw_` + 3-8 alphanumeric chars) to reference parent issues before creation. References like `#aw_abc123` in issue bodies are automatically replaced with actual issue numbers. Use the `parent` field to create sub-issue relationships:

  ```json
  {"type": "create_issue", "temporary_id": "aw_abc123", "title": "Parent", "body": "Parent issue"}
  {"type": "create_issue", "parent": "aw_abc123", "title": "Sub-task", "body": "References #aw_abc123"}
  ```

  **Setting Issue Fields on Creation**: Agents can include a `fields` array in the `create_issue` output to set custom field values immediately after creation. Each item is `{"name": <field-display-name>, "value": <string-or-number>}`. Use a number for numeric fields; string for single-select, iteration title, date `YYYY-MM-DD`, or text. Restrict allowed names with `allowed-fields:`.

  ```json
  {"type": "create_issue", "title": "Triage: flaky parser", "body": "...", "fields": [{"name": "Priority", "value": "High"}, {"name": "Story Points", "value": 3}]}
  ```

- `close-issue:` - Close issues with comment (use this to close issues, not update-issue)

  ```yaml
  safe-outputs:
    close-issue:
      target: "triggering"              # Optional: "triggering" (default), "*", or number
      required-labels: [automated]      # Optional: only close if ALL these labels are present
      required-title-prefix: "[bot]"    # Optional: only close matching prefix
      max: 20                           # Optional: max closures (default: 1)
      state-reason: "not_planned"       # Optional: "completed" (default), "not_planned", "duplicate"
      allow-body: false                 # Optional: when false, any body the agent emits is dropped (warning logged) and the issue closes without a comment; defaults to true
      target-repo: "owner/repo"         # Optional: cross-repository
      allowed-repos: [owner/other]      # Optional: additional repos agent can close issues in
  ```

  Set `allow-body: false` to guarantee a clean close with no comment — useful when an earlier `add-comment` step already posted the summary and you want to prevent the agent from duplicating it.

- `create-discussion:` - Safe GitHub discussion creation (status, audits, reports, logs)

  ```yaml
  safe-outputs:
    create-discussion:
      title-prefix: "[ai] "           # Optional: prefix for discussion titles
      category: "General"             # Optional: discussion category name, slug, or ID (defaults to first category if not specified)
      labels: [status]                # Optional: labels to attach (used for matching when close-older-discussions is enabled)
      allowed-labels: [status, audit] # Optional: restrict which labels the agent can set (any label allowed if omitted)
      max: 3                          # Optional: maximum number of discussions (default: 1)
      close-older-discussions: true   # Optional: close older discussions with same prefix/labels (default: false)
      close-older-key: "my-key"       # Optional: explicit deduplication key for close-older matching
      expires: 7                      # Optional: auto-close after 7 days (supports: 2h, 7d, 2w, 1m, 1y, or false)
      fallback-to-issue: true         # Optional: create issue if discussion creation fails (default: true)
      footer: false                   # Optional: omit AI-generated footer while preserving XML markers (default: true)
      target-repo: "owner/repo"       # Optional: cross-repository
      allowed-repos: [owner/other]    # Optional: additional repos agent can target (agent uses `repo` field in output)
  ```

  `category` accepts name (e.g., "General"), slug (e.g., "general"), or ID (e.g., "DIC_kwDOGFsHUM4BsUn3"); defaults to the first category. Resolution tries ID, then name, then slug.

  `close-older-discussions: true` closes up to 10 older discussions matching the same title prefix or labels as "OUTDATED" with a comment linking to the new one. Requires `title-prefix` or `labels`.

- `close-discussion:` - Close discussions with comment and resolution

  ```yaml
  safe-outputs:
    close-discussion:
      target: "triggering"              # Optional: "triggering" (default), "*", or number
      required-category: "Ideas"        # Optional: only close in category
      required-labels: [resolved]       # Optional: only close if ALL these labels are present
      required-title-prefix: "[ai]"     # Optional: only close matching prefix
      max: 1                            # Optional: max closures (default: 1)
      allow-body: false                 # Optional: when false, any body the agent emits is dropped (warning logged) and the discussion closes without a comment; defaults to true
      target-repo: "owner/repo"         # Optional: cross-repository
  ```

  Resolution reasons: `RESOLVED`, `DUPLICATE`, `OUTDATED`, `ANSWERED`. Set `allow-body: false` to close without a comment when a prior `add-comment` step already posted the summary.
- `add-comment:` - Safe comment creation on issues/PRs/discussions

  ```yaml
  safe-outputs:
    add-comment:
      max: 3                          # Optional: maximum number of comments (default: 1)
      target: "*"                     # Optional: target for comments (default: "triggering")
      required-labels: [approved]     # Optional: ALL of these labels must be present on the issue/PR for the comment to be posted
      required-title-prefix: "[bot]" # Optional: issue/PR title must start with this prefix
      hide-older-comments: true       # Optional: minimize previous comments from same workflow
      allowed-reasons: [outdated]     # Optional: restrict hiding reasons (default: outdated)
      normalize-closing-keywords: true # Optional: strip backticks around recognized issue-closing keywords in body text
      discussions: true               # Optional: opt-in to discussions:write permission for discussion comments/replies (default: false)
      issues: true                    # Optional: set false to exclude issues:write permission (default: true)
      pull-requests: true             # Optional: set false to exclude pull-requests:write permission (default: true)
      footer: true                    # Optional: when false, omits visible footer but preserves XML markers (default: true)
      target-repo: "owner/repo"       # Optional: cross-repository
      allowed-repos: [owner/other]    # Optional: additional repos agent can target (agent uses `repo` field in output)
  ```

  **Hide Older Comments**: Set `hide-older-comments: true` to minimize previous comments from the same workflow before posting new ones. Useful for status updates. Allowed reasons: `spam`, `abuse`, `off_topic`, `outdated` (default), `resolved`.

  **Discussion Thread Replies**: Agents can include `reply_to_id` in their output to post a threaded reply within a GitHub Discussion (requires `discussions: true`):

  ```json
  {"type": "add_comment", "body": "Thread reply text", "reply_to_id": 12345}
  ```

- `comment-memory:` - Persist and update a managed memory comment on the triggering issue/PR. **Configured under `tools:`, not `safe-outputs:`.**

  ```yaml
  tools:
    comment-memory:
      max: 1                          # Optional: max comment_memory updates (default: 1, range: 1-100)
      target: "triggering"            # Optional: "triggering" (default), "*", or explicit issue/PR number
      memory-id: "default"            # Optional: default memory identifier when items omit memory_id (default: "default")
      footer: true                    # Optional: include AI footer in the managed comment (default: true)
      target-repo: "owner/repo"       # Optional: cross-repository
      allowed-repos: [owner/other]    # Optional: additional repos agent can target
  ```

  Boolean shorthand: `comment-memory: true` enables defaults; `false` or `null` disables. The handler materializes memory to files before execution and syncs edits back to a single managed comment after, providing durable cross-run state without external storage. See [memory.md](memory.md).

- `create-pull-request:` - Safe pull request creation with git patches

  ```yaml
  safe-outputs:
    create-pull-request:
      title-prefix: "[ai] "           # Optional: prefix for PR titles
      branch-prefix: "signed/"        # Optional: prefix prepended to the PR branch name (e.g. for branch-protection conventions)
      labels: [automation, ai-agent]  # Optional: labels to attach to PRs
      allowed-labels: [bug, fix]      # Optional: restrict which labels the agent can set (any label allowed if omitted)
      reviewers: [user1, copilot]     # Optional: reviewers (use 'copilot' for bot)
      team-reviewers: [platform-team] # Optional: team slugs to assign as reviewers
      draft: true                     # Optional: create as draft PR (defaults to true)
      if-no-changes: "warn"           # Optional: "warn" (default), "error", or "ignore"
      allow-empty: false              # Optional: create PR with empty branch, no changes required (default: false)
      expires: 7                      # Optional: auto-close after 7 days (supports: 2h, 7d, 2w, 1m, 1y; min: 2h)
      auto-merge: false               # Optional: enable auto-merge when checks pass (default: false)
      base-branch: "vnext"            # Optional: base branch for PR (defaults to workflow's branch)
      preserve-branch-name: true      # Optional: skip random salt suffix on agent-specified branch names (default: false)
      recreate-ref: false             # Optional: force-recreate existing remote branch when preserve-branch-name is true (default: false)
      allow-workflows: false          # Optional: add workflows:write permission when allowed-files targets .github/workflows/ paths (default: false; requires github-app)
      patch-format: "bundle"          # Optional: "bundle" (default, preserves merge commits & per-commit metadata) or "am" (git format-patch/am)
      signed-commits: true            # Optional: when true (default), push via createCommitOnBranch GraphQL so GitHub signs commits; set false to use plain git push (required for merge commits)
      assignees: [user1]              # Optional: assignees for fallback issues on PR creation failure
      fallback-labels: [needs-review] # Optional: labels for fallback issues (defaults to PR labels)
      fallback-as-issue: false        # Optional: when true (default), creates a fallback issue on PR creation failure; on permission errors, the issue includes a one-click link to create the PR via GitHub's compare URL
      auto-close-issue: false         # Optional: when true (default), adds "Fixes #N" closing keyword when triggered from an issue; set to false to prevent auto-closing the triggering issue on merge. Accepts a boolean or GitHub Actions expression.
      normalize-closing-keywords: true # Optional: strip backticks around recognized issue-closing keywords in PR body text
      target-repo: "owner/repo"       # Optional: cross-repository
      github-token-for-extra-empty-commit: ${{ secrets.MY_CI_PAT }}  # Optional: PAT or "app" to trigger CI on created PRs
      allowed-files:                  # Recommended: always restrict to specific paths or extensions to limit agent scope
        - "src/**/*.ts"               # e.g. restrict to TypeScript source files
        - "docs/**/*.md"              # e.g. restrict to Markdown docs
      excluded-files:                 # Optional: glob patterns to strip from the patch entirely
        - "**/*.lock"
      protected-files: request_review # Optional: "request_review" (default), "blocked", "fallback-to-issue", or "allowed"
      allowed-branches:               # Optional: glob patterns for allowed source branch names per run
        - "feature/*"
      allowed-base-branches:          # Optional: glob patterns for allowed base branch overrides per run
        - "release/*"
        - "main"
      max-patch-size: 2048            # Optional: per-output cap on git patch size in KB (overrides global; default: 4096 KB, max: 10240)
      max-patch-files: 50             # Optional: per-output cap on unique files in the patch (overrides global; default: 100)
  ```

  **Dynamic Base Branch**: When `allowed-base-branches` is set, the agent can provide a `base` field in its output to override the default base branch for a single run — but only if the value matches one of the configured glob patterns. Without `allowed-base-branches`, only the static `base-branch:` is used. Accepts a literal array or a GitHub Actions expression resolving to a comma-separated list (e.g. `${{ inputs.allowed-base-branches }}`).

  **Allowed Source Branches**: When `allowed-branches` is set, the branch used for PR creation (agent-provided `branch` or the current checkout branch when omitted) must match one of the configured glob patterns.

  **File Restrictions**: **Always specify `allowed-files`** — this is the primary guardrail for `create-pull-request`. Scope it to specific file extensions (e.g., `"**/*.md"`, `"**/*.ts"`) or directory paths (e.g., `"src/**"`, `"docs/**"`) matching the workflow's purpose. Omitting `allowed-files` allows the agent to touch any file in the repository, which significantly expands blast radius. Use `excluded-files` to additionally strip specific files (e.g. lock files) from the patch before any checks. The `protected-files` field controls handling of sensitive files (package manifests, CI configs, agent instruction files): `request_review` (default — create the PR but submit a `REQUEST_CHANGES` review so a human approves before merge), `blocked` (hard-block), `fallback-to-issue` (push branch and create a review issue), or `allowed` (no restriction — use only when the workflow is explicitly designed to manage these files). Object form is also supported: `protected-files: { policy: fallback-to-issue, exclude: [AGENTS.md] }`.

  **Auto-Expiration**: The `expires` field auto-closes PRs after a time period. Supports integers (days) or relative formats (2h, 7d, 2w, 1m, 1y). Minimum duration: 2 hours. Only for same-repo PRs without target-repo. Generates `agentics-maintenance.yml` workflow.

  **Branch Name Preservation**: Set `preserve-branch-name: true` to skip the random salt suffix on agent-specified branch names. Useful when CI enforces branch naming conventions (e.g., Jira keys in uppercase). Invalid characters are still replaced for security; casing is always preserved. Set `recreate-ref: true` alongside this to force-recreate an existing remote branch (e.g., when a previous PR was already merged into the branch).

  **Workflow File Changes**: To modify files under `.github/workflows/`, set `allow-workflows: true`. This adds `workflows: write` to the token used for the PR — a permission that requires `safe-outputs.github-app` to be configured, since `GITHUB_TOKEN` cannot hold this permission.

  **CI Triggering**: By default, PRs created with `GITHUB_TOKEN` do not trigger CI workflow runs. To trigger CI, set `github-token-for-extra-empty-commit` to a PAT with `Contents: Read & Write` permission, or to `"app"` to use the configured GitHub App. Alternatively, set the magic secret `GH_AW_CI_TRIGGER_TOKEN` to a suitable PAT — this is automatically used without requiring explicit configuration in the workflow.

- `create-pull-request-review-comment:` - Safe PR review comment creation on code lines

  ```yaml
  safe-outputs:
    create-pull-request-review-comment:
      max: 3                          # Optional: maximum number of review comments (default: 10)
      side: "RIGHT"                   # Optional: side of diff ("LEFT" or "RIGHT", default: "RIGHT")
      target: "*"                     # Optional: "triggering" (default), "*", or number
      target-repo: "owner/repo"       # Optional: cross-repository
  ```

- `submit-pull-request-review:` - Submit a PR review with status (APPROVE, REQUEST_CHANGES, COMMENT)

  ```yaml
  safe-outputs:
    submit-pull-request-review:
      max: 1                          # Optional: maximum number of reviews to submit (default: 1)
      footer: "if-body"               # Optional: footer control ("always", "none", "if-body", default: "always")
      allowed-events: [COMMENT, REQUEST_CHANGES]  # Optional: restrict allowed review event types; omit to allow all (APPROVE, COMMENT, REQUEST_CHANGES)
      supersede-older-reviews: false  # Optional: dismiss older same-workflow REQUEST_CHANGES reviews after a replacement is posted (default: false; best-effort, needs workflow markers)
  ```

  **Footer Control**: The `footer` field controls when AI-generated footers appear in the PR review body. Values: `"always"` (default), `"none"`, `"if-body"` (only when body is non-empty). Boolean values supported: `true` → `"always"`, `false` → `"none"`. Useful for clean approval reviews — with `"if-body"`, approvals without explanatory text appear without a footer.

- `dismiss-pull-request-review:` - Dismiss a PR review previously submitted by this workflow actor (alias: `dismiss-review`)

  ```yaml
  safe-outputs:
    dismiss-pull-request-review:
      max: 10                         # Optional: maximum number of dismissals (default: 10)
      target: "triggering"            # Optional: "triggering" (default), "*", or number
      target-repo: "owner/repo"       # Optional: cross-repository
      allowed-repos: [owner/other]    # Optional: extra repos where dismissal is allowed
  ```

  Actor-bound: only reviews authored by the current workflow actor can be dismissed. Supports `required-labels` and `required-title-prefix` filters like other PR-targeted outputs.

- `reply-to-pull-request-review-comment:` - Reply to existing review comments on PRs

  ```yaml
  safe-outputs:
    reply-to-pull-request-review-comment:
      max: 10                         # Optional: maximum number of replies (default: 10)
      target-repo: "owner/repo"       # Optional: cross-repository
      footer: "always"                # Optional: footer control ("always", "none", "if-body", default: "always")
  ```

  **Footer Control**: The `footer` field controls when AI-generated footers appear. Values: `"always"` (default), `"none"`, `"if-body"` (only when body is non-empty). Boolean values supported: `true` → `"always"`, `false` → `"none"`.

- `resolve-pull-request-review-thread:` - Resolve PR review threads after addressing feedback

  ```yaml
  safe-outputs:
    resolve-pull-request-review-thread:
      max: 10                         # Optional: maximum number of threads to resolve (default: 10)
      target-repo: "owner/repo"       # Optional: cross-repository
  ```

  Lets agents resolve review comment threads after addressing feedback.


<!-- file: safe-outputs-management.md -->

---
description: Safe-output reference for update, label, milestone, project, release, and upload operations.
---

# Safe Outputs: Management and Delivery

- `update-issue:` - Update issue title, body, labels, assignees, or milestone (NOT for closing - use close-issue instead)

  ```yaml
  safe-outputs:
    update-issue:
      status: true                    # Optional: allow updating issue status (open/closed)
      target: "*"                     # Optional: target for updates (default: "triggering")
      title: true                     # Optional: allow updating issue title
      body: true                      # Optional: allow updating issue body
      max: 3                          # Optional: maximum number of issues to update (default: 1)
      target-repo: "owner/repo"       # Optional: cross-repository
  ```

  **Note:** `update-issue` can change status between 'open'/'closed', but use `close-issue` to close with a comment. Use `update-issue` for title, body, labels, assignees, or milestone changes without closing.
- `update-pull-request:` - Update PR title or body

  ```yaml
  safe-outputs:
    update-pull-request:
      title: true                     # Optional: enable title updates (default: true)
      body: true                      # Optional: enable body updates (default: true)
      operation: "replace"            # Optional: "replace" (default), "append", "prepend"
      update-branch: false            # Optional: update PR branch with latest base before updates (default: false)
      max: 1                          # Optional: max updates (default: 1)
      target: "*"                     # Optional: "triggering" (default), "*", or number
      target-repo: "owner/repo"       # Optional: cross-repository
  ```

  Operation types: `replace` (default), `append`, `prepend`.
- `merge-pull-request:` - Merge pull requests under configured policy gates (experimental)

  ```yaml
  safe-outputs:
    merge-pull-request:
      required-labels: [ready-to-merge]   # Optional: ALL listed labels must be present on the PR
      allowed-branches: ["feature/*"]    # Optional: glob patterns for allowed source branch names
      target: "triggering"                # Optional: "triggering" (default, current PR) or "*" (any PR with pull_request_number)
      target-repo: "owner/repo"           # Optional: cross-repository
      allowed-repos: [owner/other]        # Optional: additional repos the agent can merge in
      max: 1                              # Optional: max merges (default: 1)
  ```

  **⚠️ Experimental**: Compilation emits a warning when this feature is used. The merge is blocked unless all configured gates pass.

- `close-pull-request:` - Safe pull request closing with filtering

  ```yaml
  safe-outputs:
    close-pull-request:
      required-labels: [test, automated]  # Optional: only close PRs with these labels
      required-title-prefix: "[bot]"      # Optional: only close PRs with this title prefix
      target: "triggering"                # Optional: "triggering" (default), "*" (any PR), or explicit PR number
      max: 10                             # Optional: maximum number of PRs to close (default: 1)
      target-repo: "owner/repo"           # Optional: cross-repository
      github-token: ${{ secrets.CUSTOM_TOKEN }}  # Optional: custom token
  ```

- `mark-pull-request-as-ready-for-review:` - Mark draft PRs as ready for review

  ```yaml
  safe-outputs:
    mark-pull-request-as-ready-for-review:
      max: 1                              # Optional: max operations (default: 1)
      target: "*"                         # Optional: "triggering" (default), "*", or number
      required-labels: [automated]        # Optional: only mark PRs with these labels
      required-title-prefix: "[bot]"      # Optional: only mark PRs with this prefix
      target-repo: "owner/repo"           # Optional: cross-repository
  ```

- `add-labels:` - Safe label addition to issues or PRs

  ```yaml
  safe-outputs:
    add-labels:
      allowed: [bug, enhancement, documentation]  # Optional: restrict to specific labels
      blocked: ["~*", "*[bot]"]                   # Optional: blocked label patterns (glob; takes precedence over allowed)
      required-labels: [approved]                 # Optional: ALL of these labels must be present on the issue/PR for the operation to run
      required-title-prefix: "[bot]"              # Optional: issue/PR title must start with this prefix
      max: 3                                      # Optional: maximum number of labels (default: 3)
      target: "*"                                 # Optional: "triggering" (default), "*" (any issue/PR), or number
      target-repo: "owner/repo"                   # Optional: cross-repository
  ```

- `remove-labels:` - Safe label removal from issues or PRs

  ```yaml
  safe-outputs:
    remove-labels:
      allowed: [automated, stale]  # Optional: restrict to specific labels
      blocked: ["~*", "*[bot]"]    # Optional: blocked label patterns (glob; takes precedence over allowed)
      required-labels: [approved]  # Optional: ALL of these labels must be present on the issue/PR for the operation to run
      required-title-prefix: "[bot]"  # Optional: issue/PR title must start with this prefix
      max: 3                       # Optional: maximum number of operations (default: 3)
      target: "*"                  # Optional: "triggering" (default), "*" (any issue/PR), or number
      target-repo: "owner/repo"    # Optional: cross-repository
  ```

  When `allowed` is omitted, any labels can be removed.
- `replace-label:` - Atomic label state transition — removes one label and adds another in a single GraphQL request, eliminating the race window of separate remove + add operations

  ```yaml
  safe-outputs:
    replace-label:
      allowed-add: [approved, done]            # Optional: glob patterns for labels that may be added (any allowed if omitted)
      allowed-remove: [in-review, pending]     # Optional: glob patterns for labels that may be removed (any allowed if omitted)
      blocked: ["~*", "*[bot]"]                # Optional: blocked label patterns (glob; applies to both add and remove)
      required-labels: [triage]                # Optional: ALL of these labels must be present on the issue/PR for the operation to run
      required-title-prefix: "[Bug]"           # Optional: issue/PR title must start with this prefix
      max: 5                                   # Optional: maximum number of replacements (default: 5)
      target: "triggering"                     # Optional: "triggering" (default), "*" (any issue/PR), or number
      target-repo: "owner/repo"                # Optional: cross-repository
  ```

  The agent calls `replace_label(label_to_remove, label_to_add)`. If the label to remove is not present on the item, only the add is applied (no failure). Labels that do not yet exist in the repository are auto-created with a deterministic pastel color.
- `add-reviewer:` - Add reviewers to pull requests

  ```yaml
  safe-outputs:
    add-reviewer:
      allowed-reviewers: [user1, copilot]     # Optional: restrict to specific reviewer usernames (any allowed if omitted)
      allowed-team-reviewers: [platform-team] # Optional: restrict to specific team slugs (any allowed if omitted)
      max: 3                                  # Optional: max reviewers (default: 3)
      target: "*"                             # Optional: "triggering" (default), "*", or number
      target-repo: "owner/repo"               # Optional: cross-repository
  ```

  At least one reviewer or team reviewer must be present in agent output. Use `allowed-reviewers: [copilot]` to assign Copilot PR reviewer bot. Requires PAT as `COPILOT_GITHUB_TOKEN`. The legacy `reviewers` / `team-reviewers` field names are deprecated aliases.
- `assign-milestone:` - Assign issues to milestones

  ```yaml
  safe-outputs:
    assign-milestone:
      allowed: [v1.0, v2.0]           # Optional: restrict to specific milestone titles
      auto_create: true               # Optional: auto-create milestones from the allowed list if missing (default: false)
      max: 1                          # Optional: max assignments (default: 1)
      target-repo: "owner/repo"       # Optional: cross-repository
  ```

- `link-sub-issue:` - Safe sub-issue linking

  ```yaml
  safe-outputs:
    link-sub-issue:
      parent-required-labels: [epic]     # Optional: parent must have these labels
      parent-title-prefix: "[Epic]"      # Optional: parent must match this prefix
      sub-required-labels: [task]        # Optional: sub-issue must have these labels
      sub-title-prefix: "[Task]"         # Optional: sub-issue must match this prefix
      max: 1                             # Optional: maximum number of links (default: 1)
      target-repo: "owner/repo"          # Optional: cross-repository
  ```

  Links issues via GitHub's parent-child relationships. Agent output includes `parent_issue_number` and `sub_issue_number`. Use with `create-issue` temporary IDs or existing issue numbers.
- `create-project:` - Create a new GitHub Project board with optional fields and views

  ```yaml
  safe-outputs:
    create-project:
      max: 1                          # Optional: max projects (default: 1)
      # github-token: ${{ secrets.GH_AW_PROJECT_GITHUB_TOKEN }}  # Optional: override default PAT (NOT GITHUB_TOKEN)
      target-owner: "org-or-user"     # Optional: owner for created projects
      title-prefix: "[ai] "           # Optional: prefix for project titles
  ```

  Optionally specify custom fields, project views, and an initial item. Requires PAT/App token with Projects permissions (`GH_AW_PROJECT_GITHUB_TOKEN`); `GITHUB_TOKEN` cannot access Projects v2 API. No cross-repository support.
- `update-project:` - Add items to GitHub Projects, update custom fields, manage project structure

  ```yaml
  safe-outputs:
    update-project:
      max: 20                         # Optional: max project operations (default: 10)
      project: "https://github.com/orgs/myorg/projects/42"  # REQUIRED in agent output (full URL)
      # github-token: ${{ secrets.GH_AW_PROJECT_GITHUB_TOKEN }}  # Optional here if GH_AW_PROJECT_GITHUB_TOKEN is set; PAT with projects:write (NOT GITHUB_TOKEN) is still required
  ```

  **⚠️**: Agent must include full project URL (not just number) in every call. Requires PAT/App token with Projects access (same as `create-project:`). Not supported for cross-repository operations.

  **Three calling modes:**

  **Mode 1: Add/update existing issues or PRs**

  ```json
  {
    "type": "update_project",
    "project": "https://github.com/orgs/myorg/projects/42",
    "content_type": "issue",
    "content_number": 123,
    "fields": {"Status": "In Progress", "Priority": "High"}
  }
  ```

  - `content_type`: "issue" or "pull_request"
  - `content_number`: The issue or PR number to add/update
  - `fields`: Custom field values to set on the item (optional)

  **Mode 2: Create draft issues in the project**

  ```json
  {
    "type": "update_project",
    "project": "https://github.com/orgs/myorg/projects/42",
    "content_type": "draft_issue",
    "draft_title": "Follow-up: investigate performance",
    "draft_body": "Check memory usage under load",
    "temporary_id": "aw_abc123def456",
    "fields": {"Status": "Backlog"}
  }
  ```

  - `content_type`: "draft_issue"
  - `draft_title`: Title of the draft issue (required when creating new)
  - `draft_body`: Description in markdown (optional)
  - `temporary_id`: Unique ID for this draft (format: `aw_` + 3-8 alphanumeric chars) for referencing in future updates (optional)
  - `draft_issue_id`: Reference an existing draft by its temporary_id to update it (optional)
  - `fields`: Custom field values (optional)

  **Mode 3: Create custom fields or views** (with `operation` field)

  ```json
  {
    "type": "update_project",
    "project": "https://github.com/orgs/myorg/projects/42",
    "operation": "create_fields",
    "field_definitions": [
      {"name": "Priority", "data_type": "SINGLE_SELECT", "options": ["High", "Medium", "Low"]},
      {"name": "Due Date", "data_type": "DATE"}
    ]
  }
  ```

  - `operation`: "create_fields" or "create_view"
  - `field_definitions`: Array of field definitions (for create_fields)
  - `view`: View configuration object with `name`, `layout` (table/board/roadmap), optional `filter` and `visible_fields` (for create_view)

  Not supported for cross-repository operations.
- `create-project-status-update:` - Post status updates to GitHub Projects for progress tracking

  ```yaml
  safe-outputs:
    create-project-status-update:
      max: 1                          # Optional: max status updates (default: 1)
      project: "https://github.com/orgs/myorg/projects/42"  # REQUIRED in agent output (full URL)
      github-token: ${{ secrets.GH_AW_PROJECT_GITHUB_TOKEN }}  # REQUIRED: PAT with projects:write (NOT GITHUB_TOKEN)
  ```

  Requires same PAT/App token as `update-project`. Agent must include full project URL in every call.

  **Agent output fields:**
  - `project`: Full project URL (required) - MUST be explicitly included in output
  - `status`: ON_TRACK, AT_RISK, OFF_TRACK, COMPLETE, or INACTIVE (optional, defaults to ON_TRACK)
  - `start_date`: Project start date in YYYY-MM-DD format (optional)
  - `target_date`: Project end date in YYYY-MM-DD format (optional)
  - `body`: Status summary in markdown (required)

  Not supported for cross-repository operations.
- `push-to-pull-request-branch:` - Push changes to PR branch

  ```yaml
  safe-outputs:
    push-to-pull-request-branch:
      target: "*"                     # Optional: "triggering" (default), "*", or number
      branch: "triggering"            # Optional: branch to push to (default: "triggering")
      required-title-prefix: "[bot] " # Optional: require title prefix
      required-labels: [automated]    # Optional: require all labels
      if-no-changes: "warn"           # Optional: "warn" (default), "error", or "ignore"
      ignore-missing-branch-failure: false  # Optional: treat deleted PR branches as skipped pushes (default: false)
      commit-title-suffix: "[auto]"   # Optional: suffix appended to commit title
      staged: true                    # Optional: preview mode (default: follows global staged)
      github-token-for-extra-empty-commit: ${{ secrets.MY_CI_PAT }}  # Optional: PAT or "app" to trigger CI on pushed commits
      fallback-as-pull-request: true  # Optional: when push fails (e.g. diverged branch), open a fallback PR targeting the original branch (default: true)
      patch-format: "bundle"          # Optional: "bundle" (default, supports merge commits) or "am"; auto-falls back to "bundle" when the incremental range contains a merge commit
      signed-commits: true            # Optional: when true (default), push via createCommitOnBranch GraphQL so GitHub signs commits; set false to push merge commits via plain git push
      allow-workflows: false          # Optional: add workflows:write permission for .github/workflows/ paths (requires github-app)
      check-branch-protection: true   # Optional: when true (default), pre-flight check branch protection; set false to skip and avoid administration:read permission
      allowed-files:                  # Recommended: always restrict to specific paths or extensions to limit agent scope
        - "src/**"
      excluded-files:                 # Optional: glob patterns to strip from the patch entirely
        - "**/*.lock"
      protected-files: blocked        # Optional: "blocked" (default), "fallback-to-issue", or "allowed"
      max-patch-size: 2048            # Optional: per-output cap on git patch size in KB (overrides global; default: 4096 KB, max: 10240)
  ```

  Not supported for cross-repository operations. To trigger CI on pushed commits, use `github-token-for-extra-empty-commit` or set the magic secret `GH_AW_CI_TRIGGER_TOKEN`.

  **File Restrictions**: Same as `create-pull-request`: **always specify `allowed-files`** scoped to specific file extensions or paths to limit the agent's reach. `excluded-files` strips files before all checks, and `protected-files` controls handling of sensitive files. Object form supported: `protected-files: { policy: fallback-to-issue, exclude: [AGENTS.md] }`.

  **Compile-time warnings for `target: "*"`**: When `target: "*"` is set, the compiler emits warnings if:
  1. The checkout configuration does not include a wildcard fetch pattern — add `fetch: ["*"]` with `fetch-depth: 0` so the agent can access all PR branches at runtime
  2. No constraints are provided — add `required-title-prefix` or `required-labels` to restrict which PRs can receive pushes

  Example with all recommended settings:

  ```yaml
  checkout:
    fetch: ["*"]
    fetch-depth: 0
  safe-outputs:
    push-to-pull-request-branch:
      target: "*"
      required-title-prefix: "[bot] "   # restrict to PRs with this title prefix


<!-- file: safe-outputs-runtime.md -->

---
description: Safe-output reference for runtime defaults, custom jobs, scripts, actions, global configuration, and output variables.
---

# Safe Outputs: Runtime and Extensibility


  The report-incomplete safe-output is automatically enabled by default and is distinct from `noop`. Use it when required tools or data are unavailable and the task cannot be meaningfully performed (e.g., MCP server crash, missing authentication, inaccessible repository). When an agent emits `report_incomplete`, gh-aw activates failure handling even when the agent process exits 0 — preventing empty outputs from being classified as successful. This ensures every unrecoverable failure is tracked.

- `jobs:` - Custom safe-output jobs registered as MCP tools for third-party integrations

  ```yaml
  safe-outputs:
    jobs:
      send-notification:
        description: "Send a notification to an external service"
        runs-on: ubuntu-latest
        output: "Notification sent successfully!"
        inputs:
          message:
            description: "The message to send"
            required: true
            type: string
        permissions:
          contents: read
        env:
          API_KEY: ${{ secrets.API_KEY }}
        steps:
          - name: Send notification
            run: |
              MESSAGE=$(cat "$GH_AW_AGENT_OUTPUT" | jq -r '.items[] | select(.type == "send_notification") | .message')
              curl -H "Authorization: $API_KEY" -d "$MESSAGE" https://api.example.com/notify
  ```

  Post-processing GitHub Actions jobs registered as MCP tools. Agents call the tool by its normalized name (dashes → underscores, e.g., `send_notification`). The job runs after the agent completes with access to `$GH_AW_AGENT_OUTPUT` (agent output JSON path). Use to integrate Slack, Discord, external APIs, databases, or any service requiring secrets. Import from shared files via `imports:`.

- `scripts:` - Inline JavaScript handlers running inside the safe-outputs job handler loop

  ```yaml
  safe-outputs:
    scripts:
      post-slack-message:
        description: "Post a message to Slack"
        inputs:
          channel:
            description: "Target Slack channel"
            type: string
            default: "#general"
        script: |
          // 'channel' is available from config inputs; 'item' contains runtime message values
          await fetch(process.env.SLACK_WEBHOOK_URL, {
            method: "POST",
            body: JSON.stringify({ text: item.message, channel })
          });
  ```

  Unlike `jobs:` (which create separate GitHub Actions jobs), scripts execute in-process alongside built-in handlers. Write only the handler body — the compiler generates the outer wrapper with config input destructuring and `async function handleX(item, resolvedTemporaryIds) { ... }`. Script names with dashes are normalized to underscores (e.g., `post-slack-message` → `post_slack_message`). The handler receives `item` (runtime message with input values) and `resolvedTemporaryIds` (map of temporary IDs).

- `actions:` - Custom GitHub Actions mounted as MCP tools for the AI agent (resolved at compile time)

  ```yaml
  safe-outputs:
    actions:
      my-action:
        uses: owner/repo/path@ref         # Required: GitHub Action reference (tag, SHA, or branch)
        description: "Custom description" # Optional: override action's description from action.yml
        env:
          API_KEY: ${{ secrets.API_KEY }} # Optional: environment variables for the injected step
  ```

  Resolved at compile time — the compiler fetches `action.yml`, parses inputs, and exposes them as MCP tool parameters. The agent calls the action by its normalized name (dashes → underscores). Each action runs as an injected step in the safe-outputs job. Local actions (`./path/to/action`) are also supported.

**Global Safe Output Configuration:**
- `github-token:` - Custom GitHub token for all safe output jobs

  ```yaml
  safe-outputs:
    create-issue:
    add-comment:
    github-token: ${{ secrets.GH_AW_SAFE_OUTPUTS_TOKEN }}  # Use custom PAT instead of GITHUB_TOKEN
  ```

  Useful when you need additional permissions or want to perform actions across repositories.
- `urls:` - URL sanitization policy for safe output content (string)
  - `allowed-only` (default) - sanitize all non-allowed URLs everywhere
  - `allowed-or-code-region` - preserve URLs inside fenced and inline code regions while sanitizing prose
- `allowed-domains:` - Allowed domains for URLs in safe output content (array)
  - URLs from unlisted domains are replaced with `(redacted)`
  - GitHub domains are always included by default
- `allowed-github-references:` - Allowed repositories for GitHub-style references (array)
  - Controls which GitHub references (`#123`, `owner/repo#456`) are allowed in workflow output
  - References to unlisted repositories are escaped with backticks to prevent timeline items
  - Configuration options:
    - `[]` - Escape all references (prevents all timeline items)
    - `["repo"]` - Allow only the target repository's references
    - `["repo", "owner/other-repo"]` - Allow specific repositories
    - Not specified (default) - All references allowed
  - Example:

    ```yaml
    safe-outputs:
      allowed-github-references: []  # Escape all references
      create-issue:
        target-repo: "my-org/main-repo"
    ```

    With `[]`, references like `#123` become `` `#123` `` and `other/repo#456` becomes `` `other/repo#456` ``, preventing timeline clutter while preserving information.
- `messages:` - Custom message templates for safe-output footer and notification messages (object)
  - Available placeholders: `{workflow_name}`, `{run_url}`, `{agentic_workflow_url}`, `{triggering_number}`, `{triggering_type}`, `{workflow_source}`, `{workflow_source_url}`, `{operation}`, `{event_type}`, `{status}`, `{ai_credits}`, `{ai_credits_formatted}`, `{ai_credits_suffix}`
  - Message types:
    - `footer:` - Custom footer for AI-generated content
    - `footer-install:` - Installation instructions appended to footer
    - `footer-workflow-recompile:` - Footer for workflow recompile tracking issues (placeholder: `{repository}`)
    - `footer-workflow-recompile-comment:` - Footer for comments on workflow recompile issues (placeholder: `{repository}`)
    - `run-started:` - Workflow activation notification
    - `run-success:` - Successful completion message
    - `run-failure:` - Failure notification message
    - `detection-failure:` - Detection job failure message
    - `agent-failure-issue:` - Footer for agent failure tracking issues
    - `agent-failure-comment:` - Footer for comments on agent failure tracking issues
    - `staged-title:` - Staged mode preview title
    - `staged-description:` - Staged mode preview description
    - `append-only-comments:` - Create new comments instead of editing existing ones (boolean, default: false)
    - `pull-request-created:` - Custom message when a PR is created. Placeholders: `{item_number}`, `{item_url}`
    - `issue-created:` - Custom message when an issue is created. Placeholders: `{item_number}`, `{item_url}`
    - `commit-pushed:` - Custom message when a commit is pushed. Placeholders: `{commit_sha}`, `{short_sha}`, `{commit_url}`
    - `body-header:` - Custom header text prepended to every message body (issues, comments, PRs, discussions). Placeholders: `{workflow_name}`, `{run_url}`
  - Example:

    ```yaml
    safe-outputs:
      messages:
        append-only-comments: true
        footer: "> Generated by [{workflow_name}]({run_url})"
        run-started: "[{workflow_name}]({run_url}) started processing this {event_type}."
    ```

- `mentions:` - Configuration for @mention filtering in safe outputs (boolean or object)
  - Boolean format: `false` - Always escape mentions; `true` - Always allow (error in strict mode)
  - Object format for fine-grained control:

    ```yaml
    safe-outputs:
      mentions:
        allow-team-members: true    # Allow repository collaborators (default: true)
        allow-context: true          # Allow mentions from event context (default: true)
        allowed: [copilot, user1]    # Always allow specific users/bots
        allowed-teams:               # Allow all members of named GitHub teams
          - myorg/eng                # org/team-slug format (cross-org)
          - reviewers                # bare team-slug (uses current repo's org)
        max: 50                      # Maximum mentions per message (default: 50)
    ```

  - Team members include collaborators with any permission level (excluding bots unless explicitly listed)
  - Context mentions include issue/PR authors, assignees, and commenters
  - `allowed-teams` resolves team membership from the GitHub API at runtime; bot accounts are excluded. Use `org/team-slug` for cross-org teams or a bare `team-slug` to resolve against the current repository's organization.
  - **`allowed-teams` requires `read:org` scope.** The default `GITHUB_TOKEN` does **not** include this scope. Provide a classic PAT with `read:org`, a fine-grained PAT with the "Members" permission (read), or a GitHub App installation token with the "Members" permission (read) via `safe-outputs.github-token:` or `safe-outputs.github-app:`. If the token lacks the required scope, team lookup fails with a warning and the workflow continues without those team members in the allowlist.
- `runs-on:` - Runner specification for all safe-outputs jobs (string)
  - Defaults to `ubuntu-slim` (1-vCPU runner)
  - Examples: `ubuntu-latest`, `windows-latest`, `self-hosted`
  - Applies to activation, create-issue, add-comment, and other safe-output jobs
- `footer:` - Global footer control for all safe outputs (boolean, default: `true`)
  - When `false`, omits visible AI-generated footer content from all created/updated entities (issues, PRs, discussions, releases) while still including XML markers for searchability
  - Individual safe-output types can override this setting
- `staged:` - Preview mode for all safe outputs (boolean)
  - When `true`, emits step summary messages instead of making GitHub API calls; useful for testing without side effects
- `env:` - Environment variables passed to all safe output jobs (object)
  - Values typically reference secrets: `MY_VAR: ${{ secrets.MY_SECRET }}`
- `steps:` - Custom steps injected into all safe-output jobs, running after repository checkout and before safe-output code (array)
  - Useful for installing dependencies or performing setup needed by safe-output logic
  - Example:

    ```yaml
    safe-outputs:
      steps:
        - name: Install custom dependencies
          run: npm install my-package
      create-issue:
    ```

- `max-bot-mentions:` - Maximum bot trigger references (e.g. `@copilot`, `@github-actions`) allowed in output before all excess are escaped with backticks (integer or expression, default: 10)
  - Set to `0` to escape all bot trigger phrases
  - Example: `max-bot-mentions: 3`
- `activation-comments:` - Disable all activation and fallback comments (boolean or expression, default: `true`)
  - When `false`, disables run-started, run-success, run-failure, and PR/issue creation link comments
  - Supports templatable boolean: `false`, `true`, or GitHub Actions expressions like `${{ inputs.activation-comments }}`

**Templatable Integer Fields**: The `max`, `expires`, and `max-bot-mentions` fields (and most other numeric/boolean fields) accept GitHub Actions expression strings in addition to literal values, enabling runtime-configured limits:

```yaml
safe-outputs:
  max-bot-mentions: ${{ inputs.max-mentions }}
  create-issue:
    max: ${{ inputs.max-issues }}
    expires: ${{ inputs.expires-days }}
```

Fields that influence permission computation (`add-comment.discussions`, `create-pull-request.fallback-as-issue`) remain literal booleans.

- `timeout-minutes:` - Timeout for the safe-outputs job in minutes (integer, default: `45`)
  - Increase for workflows with many sequential safe-output operations (e.g. `push-to-pull-request-branch` against large repositories)
- `max-patch-size:` - Maximum allowed git patch size in kilobytes (integer, default: 4096 KB = 4 MB)
  - Patches exceeding this size are rejected to prevent accidental large changes
- `max-patch-files:` - Maximum allowed number of unique files in a create-pull-request patch (integer, default: 100)
  - Counts unique file paths deduplicated across multi-commit patches; reflects how many distinct files the agent is pushing per iteration
  - Increase this limit for long-running branches that touch many files
- `group-reports:` - Group workflow failure reports as sub-issues (boolean, default: `false`)
  - When `true`, creates a parent `[aw] Failed runs` issue that tracks all workflow failures as sub-issues; useful for larger repositories
- `report-failure-as-issue:` - Control whether workflow failures are reported as GitHub issues (boolean, expression, or category array; default: `true`)
  - When `false`, suppresses automatic failure issue creation for this workflow
  - Supports templatable boolean expressions, e.g. `report-failure-as-issue: ${{ inputs.report-failure-as-issue }}`
  - Use to silence noisy failure reports for workflows where failures are expected or handled externally
- `failure-issue-repo:` - Repository to create failure tracking issues in (string, format: `"owner/repo"`)
  - Defaults to the current repository when not specified
  - Use when the current repository has issues disabled: `failure-issue-repo: "myorg/infra-alerts"`
- `id-token:` - Override the id-token permission for the safe-outputs job (string: `"write"` or `"none"`)
  - `"write"`: force-enable `id-token: write` permission (required for OIDC authentication with cloud providers)
  - `"none"`: suppress automatic detection and prevent adding `id-token: write` even when vault/OIDC actions are detected in steps
  - Default: auto-detects known OIDC/vault actions (e.g., `aws-actions/configure-aws-credentials`, `azure/login`, `hashicorp/vault-action`) and adds `id-token: write` automatically
- `concurrency-group:` - Concurrency group for the safe-outputs job (string)
  - When set, the safe-outputs job uses this concurrency group with `cancel-in-progress: false`
  - Supports GitHub Actions expressions, e.g., `"safe-outputs-${{ github.repository }}"`
- `needs:` - Additional custom workflow jobs the safe-outputs job depends on (array)
  - Example: `needs: [secrets_fetcher]`
  - Use when the safe-outputs job requires outputs from a custom job defined in `jobs:`
- `environment:` - Override the GitHub deployment environment for the safe-outputs job (string)
  - Defaults to the top-level `environment:` field when not specified
  - Use when the main job and safe-outputs job need different deployment environments for protection rules
- `github-app:` - GitHub App credentials for minting installation access tokens (object)
  - When configured, generates a token from the app and uses it for all safe output operations (alternative to `github-token`)
  - Fields:
    - `client-id:` - GitHub App client ID (required, e.g., `${{ vars.APP_ID }}`). Use `app-id:` for legacy compatibility.
    - `private-key:` - GitHub App private key (required, e.g., `${{ secrets.APP_PRIVATE_KEY }}`)
    - `owner:` - Optional App installation owner (defaults to current repository owner)
    - `repositories:` - Optional list of repositories to grant access to
  - Example:

    ```yaml
    safe-outputs:
      github-app:
        client-id: ${{ vars.APP_ID }}
        private-key: ${{ secrets.APP_PRIVATE_KEY }}
      create-issue:
    ```

- `threat-detection:` - Threat detection configuration (auto-enabled for all safe-outputs workflows)
  - Automatically enabled by default; customizable via explicit configuration
  - Fields:
    - `enabled:` - Enable/disable threat detection (boolean or expression, default: `true`)
    - `prompt:` - Additional instructions appended to threat detection analysis (string)
    - `engine:` - AI engine for threat detection (engine config or `false` to disable AI detection)
    - `steps:` - Extra job steps to run before engine execution (array)
    - `post-steps:` - Extra job steps to run after engine execution (array)
    - `max-ai-credits:` - Per-run AIC budget for the detection engine (numeric only, no expressions; default `${{ vars.GH_AW_DEFAULT_DETECTION_MAX_AI_CREDITS || '400' }}`)
    - `runs-on:` - Runner override for the detection job (defaults to `agent.runs-on`)
    - `continue-on-error:` - When `true` (default), detection failures emit a warning and proceed with a `needs-review` label; when `false`, failures block safe outputs (boolean or expression)
  - Example to disable AI-based detection (use custom steps only):

    ```yaml
    safe-outputs:
      threat-detection:
        engine: false
        steps:
          - name: Custom check
            run: echo "Custom threat check"
    ```


## Output Variables

The safe-outputs job emits named step outputs for the first successful result of each type:

| Safe Output | Step Output Variables |
|---|---|
| `create-issue` | `created_issue_number`, `created_issue_url` |
| `create-pull-request` | `created_pr_number`, `created_pr_url` |
| `add-comment` | `comment_id`, `comment_url` |
| `push-to-pull-request-branch` | `push_commit_sha`, `push_commit_url` |


<!-- file: safe-outputs.md -->

---
description: Compact index for safe-output operations and runtime configuration in GitHub Agentic Workflows.
---

# Safe Outputs Index

Safe outputs are the write path for agentic workflows. Keep the main agent job read-only and use the focused reference files below.

| Topic | File |
|---|---|
| Issues, discussions, comments, pull requests, and review operations | [safe-outputs-content.md](safe-outputs-content.md) |
| Updates, labels, milestones, projects, releases, uploads, and delivery operations | [safe-outputs-management.md](safe-outputs-management.md) |
| Workflow dispatch, automation, code scanning, checks, agent sessions, and assignment flows | [safe-outputs-automation.md](safe-outputs-automation.md) |
| Runtime defaults, custom jobs, scripts, actions, global config, and output variables | [safe-outputs-runtime.md](safe-outputs-runtime.md) |

## Shared Rules

- Prefer the most specific built-in safe output before creating a custom job.
- Always scope mutating operations as tightly as possible.
- For pull-request or branch mutation, always restrict `allowed-files`.
- Use `noop` when no visible change is required after successful execution.


<!-- file: serena-tool.md -->

# Serena Language Server Tool

Serena is an **LSP MCP server** for semantic code analysis. Use ONLY when you need deep code understanding beyond text manipulation.

## When to Use Serena

**Use when you need:**
- Symbol navigation (find all usages of a function/type)
- Call graph analysis across files
- Semantic duplicate detection (not just text matching)
- Refactoring analysis (functions in wrong files, extraction opportunities)
- Type relationships and interface implementations

**Don't use Serena for:**
- Text patterns → `grep`
- File edits / YAML/JSON/Markdown → `edit` tool
- Commands → `bash`

If `grep` or `bash` solves it in 1-2 commands, don't use Serena.

## Configuration

Import the shared workflow. For multi-language, the first entry is the default fallback:

```yaml
imports:
  - uses: shared/mcp/serena.md
    with:
      languages: ["go", "typescript"]  # go, typescript, python, ruby, rust, java, cpp, csharp
```

## Available Serena Tools

### Navigation & Analysis
- `find_symbol` - Search for symbols by name
- `get_symbols_overview` - List all symbols in a file
- `find_referencing_symbols` - Find where a symbol is used
- `find_referencing_code_snippets` - Find code snippets using a symbol
- `search_for_pattern` - Search for code patterns (regex)

### Code Editing
- `read_file` - Read file with semantic context
- `create_text_file` - Create/overwrite files
- `insert_at_line` - Insert content at line number
- `insert_before_symbol` / `insert_after_symbol` - Insert near symbols
- `replace_lines` - Replace line range
- `replace_symbol_body` - Replace symbol definition
- `delete_lines` - Delete line range

### Project Management
- `activate_project` - **REQUIRED** - Activate Serena for workspace
- `onboarding` - Analyze project structure
- `restart_language_server` - Restart LSP if needed
- `get_current_config` - View Serena configuration
- `list_dir` - List directory contents

## Usage Workflow

### 1. Activate Serena First

Call `activate_project` (passing the workspace path) before any other Serena tool.

### 2. Combine with Other Tools

`bash` for file discovery, Serena for semantic analysis, `edit` for changes.

```yaml
imports:
  - uses: shared/mcp/serena.md
    with:
      languages: ["go"]
tools:
  bash:
    - "find pkg -name '*.go' ! -name '*_test.go'"
    - "cat go.mod"
  github:
    toolsets: [default]
```

### 3. Use Cache for Recurring Analysis

```yaml
imports:
  - uses: shared/mcp/serena.md
    with:
      languages: ["go"]
cache-memory: true  # Store analysis history
```

## Common Pitfalls

❌ Serena on non-code files (use `edit`) ❌ Forgetting `activate_project` first ❌ Not using bash for file discovery ❌ Missing `languages` config

## Supported Languages

Full LSP: `go` (gopls), `typescript`, `python` (jedi/pyright), `ruby` (solargraph), `rust` (rust-analyzer), `java`, `cpp`, `csharp`. See `.serena/project.yml` for full list (25+).


<!-- file: shared-safe-jobs.md -->

---
description: Shared pattern for custom safe-output jobs that consume structured agent output.
---

# Custom Safe-Output Job Pattern

Use this pattern when a workflow must perform a controlled write action after the agent finishes.

## Rules

- Define the tool schema under `safe-outputs.jobs.<job-name>.inputs`.
- Read the output file from `GH_AW_AGENT_OUTPUT`.
- Parse the JSON and iterate over `items`.
- Filter items by `type`, where the type is the job name with dashes converted to underscores.
- Validate required fields on every matching item.
- Respect staged mode by checking `GH_AW_SAFE_OUTPUTS_STAGED === 'true'`.
- Preview in staged mode instead of performing the real side effect.
- Use warnings for skippable invalid items and fail the job only for fatal errors.

## Minimal Shape

```yaml
safe-outputs:
  jobs:
    custom-action:
      description: "Process structured agent output"
      runs-on: ubuntu-latest
      inputs:
        field1:
          type: string
          required: true
      steps:
        - name: Process items
          uses: actions/github-script@v8
          with:
            script: |
              const fs = require('fs');
              const staged = process.env.GH_AW_SAFE_OUTPUTS_STAGED === 'true';
              const file = process.env.GH_AW_AGENT_OUTPUT;
              const data = JSON.parse(fs.readFileSync(file, 'utf8'));
              const items = (data.items || []).filter(item => item.type === 'custom_action');
```

## Use This Pattern For

- third-party API writes
- notifications
- controlled external-system updates
- custom post-processing that should remain outside the agent job

## Do Not Use This Pattern For

- direct agent-job writes
- broad shell-based mutation without a typed schema
- features already covered by built-in safe outputs


<!-- file: skills.md -->

---
description: Guide for leveraging skills (SKILL.md files) in agentic workflows — hint, fusion, and inline fusion strategies
---

# Skills in Agentic Workflows

Use skills — domain-specific knowledge files (`SKILL.md`) under `skills/` or `.github/skills/` — in workflows.

---

## Detecting Skills

```bash
find "${GITHUB_WORKSPACE}" -name "SKILL.md" -maxdepth 6
```

List available skills before choosing a strategy.

---

## Frontmatter `skills:` (SHA-Pinned Installs)

Declare external skills to install at activation time with the top-level `skills:` array. At activation, `gh aw` installs each via `gh skill install` and wires it into the engine — no manual vendoring.

```yaml
skills:
  - owner/repo@<40-char-sha>              # repository-wide install
  - owner/repo/skill/path@<40-char-sha>   # path-scoped install
  - skill: owner/repo@<40-char-sha>       # object form: per-skill auth
    github-token: ${{ secrets.SKILLS_TOKEN }}
```

- Static references must be pinned to a full 40-character lowercase commit SHA; `${{ ... }}` expressions are allowed in the ref position and resolved at runtime.
- Object entries set per-skill auth via `github-token` or `github-app`.

This is distinct from the prompt-side strategies below (hint / fusion / inline), which shape skill *content* into the prompt rather than installing skill packages.

---

## Strategy 0 — Agent Finder (Discovery First)

**Use when**: the relevant skill is not obvious, the repository may not contain the right skill yet, or you want to discover installable skills before loading local ones.

Query **GitHub Agent Finder** through its REST API (ARD search shape: `query.text`; add `query.filter` to narrow by resource type):

```bash
curl -s https://agentfinder.github.com/api/v1/search \
  -H 'Content-Type: application/json' \
  -d '{"query":{"text":"<the user task, in plain language>"},"pageSize":10}'
```

To search specifically for skills, add a type filter:

```bash
curl -s https://agentfinder.github.com/api/v1/search \
  -H 'Content-Type: application/json' \
  -d '{"query":{"text":"<the user task, in plain language>","filter":{"type":["application/ai-skill"]}},"pageSize":10}'
```

After discovery:

- Prefer repository-local skills when they satisfy the task.
- If you use a discovered skill, extract only the specific guidance you need.
- Do not load or paste entire skills when a fragment is enough.
- Do not install or enable returned resources automatically; that requires explicit user choice.

Agent Finder helps locate candidates; **skill fusion** keeps the final prompt small.

---

## Inline Skills (Fusion at Authoring Time)

**Use when**: keeping the main prompt compact while shipping task-specific skill guidance with the workflow.

Inline skills embed a complete skill or fragment under `## skill: \`name\``. Extraction runs in the setup/interpolation step (not at compile time): gh-aw writes each block to engine-specific skill locations and removes it from the main prompt body. Fuse a full skill for a self-contained capability, or partial sections when only targeted guidance is needed.

**Pattern**:

```markdown
on:
  workflow_dispatch:
engine: copilot
---

Triage the issue and propose next steps.

## skill: `issue-triage`
---
description: Classify issues and suggest next actions.
---
Classify by bug / feature / question, identify missing information, and suggest
the smallest actionable next step.
```

Use a unique inline skill name per workflow file. Name must start with a lowercase letter, then lowercase letters, digits, `_`, or `-`. Avoid collisions with file-based skills under `.github/skills/<name>/SKILL.md` — inline extraction writes to the same paths.

---

## Strategy 1 — Hint (Generalist)

**Use when**: the task strategy is unknown at authoring time, or the agent must adapt to whatever skills are available.

The prompt tells the agent skills exist and to discover/apply the relevant ones itself.

**Pattern**:

```markdown
If the repository contains `SKILL.md` files under `skills/`, check which ones are
relevant to this task. For each relevant skill, read its content and apply the
guidance it provides.
```

---

## Strategy 2 — Fusion (Ultra-Cognitive)

**Use when**: you know exactly which skill (or part of it) is needed and want minimal context overhead.

Inline **only the specific sections** of the skill the agent needs. Do not paste the entire SKILL.md.

**Pattern**:

```markdown
<!-- gh-skill-fusion: skills/github-mcp-server/SKILL.md#authentication -->

When calling GitHub MCP tools, use the pre-configured token already injected into the
environment. Never prompt the user for credentials.
```

---

## Choosing Between the Two Strategies

| Factor | Hint | Fusion |
|---|---|---|
| **Task domain** | Broad / unknown | Narrow / well-defined |
| **Skill set** | Grows dynamically | Known and stable |
| **Context budget** | Generous | Tight |
| **Maintenance burden** | Low (agent self-selects) | Higher (manual sync with source) |
| **Determinism** | Lower (agent chooses) | Higher (exact fragment) |
| **Scale** | Poor (entire skills loaded) | Good (minimal content) |

Prefer fusion when the task domain and required skill sections are known; it never loads entire skills.

---

## Example: Hint Strategy

```markdown
---
on:
  issues:
    types: [opened]
engine: copilot
tools:
  github:
    toolsets: [issues]
permissions:
  issues: write
---

Triage the newly opened issue.

If there are relevant skills under `skills/`, read them and apply their guidance.
Focus on skills related to issue classification or project conventions.
```

---

## Example: Fusion Strategy

```markdown
---
on:
  pull_request:
    types: [opened, synchronize]
engine: copilot
tools:
  github:
    toolsets: [pull_requests]
permissions:
  pull-requests: write
---

Review the pull request for adherence to project conventions.

<!-- Fused from skills/developer/SKILL.md#code-organization -->
Prefer many smaller files grouped by functionality. Add new files for new features
rather than extending existing ones. Keep validators under 300 lines; split when
a single file covers more than one domain.
<!-- End fusion -->

Report findings as inline review comments.
```

---

## Anti-Patterns

- ❌ **Do not load entire skill files** when only one section is relevant — use fusion instead
- ❌ **Do not hint without bounds** — if using the hint strategy, constrain the agent with a `maxdepth` and a relevance filter to avoid reading every SKILL.md in a large repo
- ❌ **Do not paste skills verbatim** without adapting them to the workflow's context — fused fragments should read as natural prose, not as lifted documentation
- ❌ **Do not hard-code skill file paths** in hints — use `find` so the prompt still works when skills are reorganised


<!-- file: subagents.md -->

---
description: Guide for defining inline sub-agents in workflow markdown files — syntax, engine placement, frontmatter fields, and best practices.
---

# Inline Sub-Agents

Define specialised agents directly in a workflow markdown file. At runtime, sub-agent sections are extracted (after `{{#runtime-import}}` macros resolve) and written to the engine-specific agents directory for the engine CLI to discover.

---

## Syntax

Define a sub-agent with a level-2 Markdown heading of the form `## agent: \`name\``:

```markdown
## agent: `file-summarizer`
---
description: Summarizes the content of a file in a few concise sentences
model: small
---
You are a file summarization assistant. When given a file path, read the
file and return a brief summary (2–4 sentences) describing its purpose
and key contents. Be concise and factual.
```

### Name rules

- Must be enclosed in backticks: `` `name` ``
- Lowercase only: `[a-z][a-z0-9_-]*`
- Examples: `` `planner` ``, `` `file-summarizer` ``, `` `code-reviewer` ``

### Block boundary

The block ends at the next `##` heading (any level-2 heading) or at EOF — no explicit end marker is needed. Place sub-agent blocks **at the bottom** of the file, after all main workflow content.

### Frontmatter fields

Only two fields are supported inside a sub-agent frontmatter block:

| Field | Required | Default | Notes |
|---|---|---|---|
| `description` | No | — | Human-readable summary of the sub-agent's role |
| `model` | No | `"inherited"` | Model override; `"inherited"` uses the parent workflow's model. Prefer model aliases (e.g. `small`, `large`) over specific model IDs for portability. |

Built-in aliases resolve to the best available model per provider and keep working as models are updated. Common sub-agent aliases:

| Alias | Resolves to | When to use |
|---|---|---|
| `small` | `mini` → haiku, gpt-5-mini, gpt-5-nano, gemini-flash | Cheap, fast tasks: extraction, classification, formatting |
| `large` | sonnet, gpt-5-pro, gpt-5, gemini-pro | Complex reasoning or synthesis tasks |
| `inherited` | Parent workflow model | Default — use when the sub-agent needs the same capability as the parent |

All other fields (`engine`, `tools`, `network`, etc.) are stripped at runtime with a warning. Sub-agents inherit the parent's engine, tool access, and network configuration.

---

## Engine-Specific Placement

Sub-agent files are written to the directory and with the extension each engine natively expects:

| Engine | Directory | Extension |
|---|---|---|
| Copilot (default) | `.agents/agents/` | `.agent.md` |
| Claude | `.claude/agents/` | `.md` |
| Codex | `.codex/agents/` | `.md` |
| Gemini | `.gemini/agents/` | `.md` |

The engine is detected at compile time from the `engine:` field and injected as `GH_AW_ENGINE_ID` into the interpolation step's environment.

---

## MCP Access in Sub-Agents

Sub-agents **do not have their own MCP servers** — they run in the parent's agent environment without independent tool config. For file system and shell access, enable on the parent workflow:

- **`cli-proxy: true`** — GitHub CLI proxy for authenticated `gh` calls. Recommended for any sub-agent that reads/writes repo content.
- **`tools.github.mode: gh-proxy`** — routes GitHub API calls through the gh proxy sidecar; required for private repos or the GitHub MCP toolset.

```yaml
---
engine: copilot
tools:
  github:
    mode: gh-proxy
  cli-proxy: true
---
```

---

## When to Use Sub-Agents

### 1 — Parallel specialised tasks with smaller models

Break a large workflow into parallel units handled by small/cheap models, then let the parent (large) model reason over the aggregated results:

```markdown
# Investigate: Repository Health

## Step 1 — gather data

Use the `dependency-scanner` agent to list all outdated packages.
Use the `test-coverage` agent to summarise uncovered code paths.
Use the `secret-scanner` agent to check for leaked credentials.

## Step 2 — synthesise

Combine the three reports above into a prioritised action plan.
The top item must have a linked PR draft or issue.

## agent: `dependency-scanner`
---
description: Lists outdated npm/pip/go packages
model: small
---
Run the appropriate package-manager audit command and return a
machine-readable list of outdated packages with their current and
latest versions.

## agent: `test-coverage`
---
description: Summarises low-coverage code paths
model: small
---
Read the most recent test coverage report and list the top 5 files or
functions with coverage below 60 %. Include the file path and line range.

## agent: `secret-scanner`
---
description: Checks for potential credential leaks
model: small
---
Scan staged changes and recently modified files for patterns that
resemble API keys, tokens, or passwords. Report any findings with the
file name and approximate line number.
```

The parent model orchestrates; sub-agents do the heavy lifting with `small` at lower cost.

### 2 — Reusable specialised helpers

Extract repetitive sub-tasks (file summarisation, commit-message generation, code explanation) into a named sub-agent the main prompt calls by name.

---

## Planner-Worker Pattern

Split work to control cost and keep context quality high:

- **Main/frontier agent (planner-orchestrator):** forms hypotheses, decides what evidence is needed, picks workers, synthesises conclusions
- **Worker sub-agents (usually `model: small`):** bounded retrieval, extraction, classification, verification, one-shot summarisation

Prompt workers narrowly and evidence-first (e.g. "return exact error messages and line references"), not broad analysis.

Worker outputs should be compact and structured. Do not return raw logs or large file dumps to the orchestrator.

### Bounded delegation rules

- keep delegation one level deep unless gh-aw explicitly supports and validates deeper topology
- avoid recursive or open-ended sub-agent fan-out
- cap per-run worker fan-out so high-volume events cannot trigger runaway cost
- stop early with `noop` or safe output when a cheap worker can confidently classify a known/duplicate/stale/low-value case

See also: [token-optimization.md](token-optimization.md) and [workflow-patterns.md](workflow-patterns.md).

---

## Full Example

```markdown
---
engine: copilot
tools:
  github:
    mode: gh-proxy
  cli-proxy: true
  bash:
    - "cat *"
    - "ls *"
---

# PR Review Assistant

1. Use the `diff-explainer` agent to produce a plain-English summary of the
   diff for PR #${{ github.event.pull_request.number }}.
2. Post the summary as a PR comment.

## agent: `diff-explainer`
---
description: Produces a plain-English summary of a pull request diff
model: small
---
You receive a unified diff. Describe each changed file in one sentence,
focusing on *what changed* and *why it matters*. Ignore formatting-only
changes. Return a bulleted list, one bullet per file.
```

---

## Limitations

- Sub-agents do not support `engine:`, `tools:`, `network:`, or `mcp-servers:` fields — those are stripped at runtime.
- Sub-agents cannot define their own safe-output jobs.
- Sub-agent blocks must appear in the main workflow file body; they are not resolved inside imported shared files.


<!-- file: syntax-agentic.md -->

---
description: Agentic workflow specific frontmatter fields for GitHub Agentic Workflows.
---

# Agentic Workflow Frontmatter Fields

### Agentic Workflow Specific Fields

- **`description:`** - Human-readable workflow description (string)
- **`emoji:`** - Optional single emoji used to represent the workflow visually; recommended for quicker recognition in workflow lists and status output (string)
- **`source:`** - Workflow origin tracking in format `owner/repo/path@ref` (string)
- **`labels:`** - Array of labels to categorize and organize workflows (array)
  - Labels filter workflows in status/list commands
  - Example: `labels: [automation, security, daily]`
- **`metadata:`** - Custom key-value pairs compatible with custom agent spec (object)
  - Key names limited to 64 characters
  - Values limited to 1024 characters
  - Example: `metadata: { team: "platform", priority: "high" }`
- **`github-token:`** - Default GitHub token for workflow (must use `${{ secrets.* }}` syntax)
- **`on.roles:`** - Repository access roles that can trigger workflow (array or `"all"`). Default `[admin, maintainer, write]`; available roles: `admin`, `maintainer`, `write`, `read`, `all`.
- **`bots:`** - Bot identifiers allowed to trigger workflow regardless of role permissions (array; e.g. `[dependabot[bot], renovate[bot], github-actions[bot]]`). The bot must be active (installed) on the repository to trigger.
- **`strict:`** - Enable enhanced validation for production workflows (boolean, defaults to `true`; strongly recommended)
  - Prefer `strict: true`; `strict: false` is dangerous, should be extremely rare, and must be carefully security reviewed before use
- **`max-turns:`** - AWF turn cap applied consistently across all agentic engines (integer or expression, e.g. `${{ inputs.max-turns }}`). The engine-level `engine.max-turns` is a deprecated alias kept for backward compatibility — prefer this top-level field. Not supported by the `gemini` engine.
- **`max-runs:`** - Deprecated legacy alias for the AWF invocation cap (`apiProxy.maxRuns`, defaults to `500` when omitted). Use `max-turns` instead; run `gh aw fix` to migrate.
- **`max-ai-credits:`** - Per-run AI Credits (AIC) budget enforced by the AWF firewall (integer or `K`/`M` short-form string like `100M`; default `1000`). Set a negative value to disable enforcement and token steering. See [token-optimization.md](token-optimization.md).
- **`max-turn-cache-misses:`** - Maximum consecutive AWF cache misses allowed before the API proxy blocks further requests (integer, default `5`). Maps to `apiProxy.maxCacheMisses`; precedence is frontmatter → `GH_AW_DEFAULT_MAX_TURN_CACHE_MISSES` env override → built-in default.
- **`models:`** - Model policy and optional pricing (object). Experimental policy fields `allowed` / `blocked` (lists of model names or patterns) restrict which models the workflow may use; they map to AWF `apiProxy.allowedModels` / `disallowedModels` and merge as unions across imports. Environment-variable overrides are supported. The separate `providers` field supplies custom pricing (see [token-optimization.md](token-optimization.md)).

  ```yaml
  models:
    allowed: ["gpt-5", "claude-*"]
    blocked: ["*-preview"]
  ```
- **`max-daily-ai-credits:`** - Per-user 24-hour AI Credits (AIC) guardrail: activation blocks execution once the triggering user's aggregated AI Credits for this workflow over the last 24h exceed the threshold (integer or `K`/`M` short-form string, or `-1`). Enabled by default with a system default threshold; set `-1` to disable or an explicit value to override. See [token-optimization.md](token-optimization.md).
- **`user-rate-limit:`** - Rate limiting configuration to prevent users from triggering the workflow too frequently (object)
  - **`max-runs-per-window:`** - Maximum runs allowed per user per time window (required, integer 1-10)
  - **`window:`** - Time window in minutes (integer 1-180, default: 60)
  - **`events:`** - Event types to apply rate limiting to (array; if omitted, applies to all programmatic events)
    - Available: `workflow_dispatch`, `issue_comment`, `pull_request_review`, `pull_request_review_comment`, `issues`, `pull_request`, `discussion_comment`, `discussion`
  - **`ignored-roles:`** - Roles exempt from rate limiting (array of `admin`, `maintain`, `write`, `triage`, `read`; default: `[admin, maintain, write]`). Set to `[]` to apply to all users.
  - Example:

    ```yaml
    user-rate-limit:
      max-runs-per-window: 5
      window: 60
      ignored-roles: [admin, maintain]
    ```

- **`check-for-updates:`** - Whether the activation job checks that the compiled `gh-aw` version is still supported (boolean, default `true`). When `true`, blocked versions fail fast and below-recommended versions warn. Set `false` only for isolated environments (compiler then warns at compile time).

- **`features:`** - Feature flags for experimental or optional features (object)
  - Each flag is a key-value pair; boolean flags (`true`/`false`) or string values are accepted
  - Known feature flags:
    - `copilot-requests: true` - Use GitHub Actions token for Copilot authentication instead of `COPILOT_GITHUB_TOKEN` secret
    - `disable-xpia-prompt: true` - Disable the built-in cross-prompt injection attack (XPIA) system prompt
    - `action-tag: "v0"` - Pin compiled action references to a specific version of the `gh-aw-actions` repository. Accepts version tags (e.g., `"v0"`, `"v1"`, `"v1.0.0"`) or a full 40-character commit SHA. When set, overrides the compiler's default action mode and resolves all action references from the external `github/gh-aw-actions` repository at the specified tag.
    - `action-mode: "script"` - Control how the compiler generates action references: `"dev"` (local paths, default), `"release"` (SHA-pinned remote), `"action"` (gh-aw-actions repo), `"script"` (direct shell calls). Can also be overridden via `--action-mode` CLI flag.
    - `difc-proxy: true` - Enable DIFC (Data Integrity and Flow Control) proxy injection. When set alongside `tools.github.min-integrity`, injects proxy steps around the agent for full network-boundary integrity enforcement.
    - `cli-proxy: true` - Enable AWF CLI proxy sidecar for secure read-only `gh` CLI access without exposing `GITHUB_TOKEN` (requires AWF v0.26.0+). Prerequisite for `integrity-reactions`; the compiler enables it automatically when `integrity-reactions: true` is set.
    - `integrity-reactions: true` - Enable reaction-based integrity promotion/demotion. Maintainers can use 👍/❤️ reactions to promote content to `approved` and 👎/😕 to demote it to `none`. Compiler automatically enables `cli-proxy`. Requires `tools.github.min-integrity` to be set and MCPG >= v0.2.18. Defaults: endorsement reactions THUMBS_UP/HEART, disapproval reactions THUMBS_DOWN/CONFUSED, endorser-min-integrity: approved, disapproval-integrity: none.
    - `dangerously-disable-sandbox-agent: "<justification>"` - Required when `sandbox.agent: false` is set. Must be a plain string justification (minimum 20 characters; expressions are not allowed) that explains why disabling the sandbox is safe for this workflow.

- **`experiments:`** - A/B testing experiments for balanced variant selection (object)
  - Maps experiment names to variant lists (bare array) or full config objects
  - Bare array form: `prompt_style: [concise, detailed]` — round-robin balanced across runs
  - Object form for weighted/gated experiments:

    ```yaml
    experiments:
      prompt_style:
        variants: [concise, detailed, step_by_step]
        weight: [2, 1, 1]           # Optional: proportional weights (defaults to round-robin)
        start_date: "2026-05-01"    # Optional: ISO-8601; returns control variant before this date
        end_date: "2026-06-01"      # Optional: ISO-8601; returns control variant after this date
        description: "Verbosity test"  # Optional: experiment description
        metric: "token_count"       # Optional: primary metric name
        issue: "42"                 # Optional: linked tracking issue number
    ```

  - Selected variant available as `${{ experiments.<name> }}` and in `{{#if experiments.<name> }}` template blocks
  - See [A/B Testing Experiments](experiments.md) for full design guidance

- **`imports:`** - Array of workflow specifications to import (array)
  - Format: `owner/repo/path@ref` or local paths like `shared/common.md`
  - Markdown files under `.github/agents/` are treated as custom agent files
  - Only one agent file is allowed per workflow
  - See [Imports Field](syntax-tools-imports.md#imports-field) section for detailed documentation
- **`inlined-imports:`** - Inline all imports at compile time (boolean, default: `false`)
  - When `true`, all imports (including those without inputs) are inlined in the generated `.lock.yml` instead of using runtime-import macros
  - The frontmatter hash covers the entire markdown body when enabled, so any content change invalidates the hash
  - **Required for repository rulesets**: Workflows used as required status checks in repository rulesets run without access to repository files at runtime. Set `inlined-imports: true` to bundle all imported content at compile time to avoid "Runtime import file not found" errors
  - **Constraint**: Cannot be combined with agent file imports (`.github/agents/` files). Remove any custom agent file imports before enabling
- **`import-schema:`** - Define typed input parameters for this shared workflow (object). Use when other workflows import this one via the `uses:`/`with:` syntax (see [Imports Field](syntax-tools-imports.md#imports-field)).
  - Parameters are accessible inside the shared workflow via `${{ github.aw.import-inputs.<name> }}` expressions
  - Object inputs (type: `object`) allow one-level deep sub-fields: `${{ github.aw.import-inputs.<name>.<subkey> }}`
  - Fields per parameter:
    - `type:` - Input type: `string`, `number`, `boolean`, `choice`, or `array`
    - `description:` - Human-readable parameter description
    - `required:` - Whether the input is required when imported (default: `false`)
    - `default:` - Default value when not provided
    - `options:` - Allowed values for `choice` type inputs
  - Example:

    ```yaml
    import-schema:
      environment:
        type: choice
        description: "Target environment"
        options: [dev, staging, prod]
        required: true
      max-issues:
        type: number
        default: 5
    ```

- **`mcp-servers:`** - MCP (Model Context Protocol) server definitions (object)
  - Defines custom MCP servers for additional tools beyond built-in ones

- **`private:`** - Mark this workflow as private, preventing it from being shared via `gh aw add` (boolean, default: `false`)
  - Example: `private: true`

- **`redirect:`** - Workflow relocation path for updates (string). When present, `gh aw update` follows this location and rewrites the `source:` field. Format: `owner/repo/path@ref` or full GitHub URL.
  - Example: `redirect: "org/agentics/workflows/my-workflow-v2.md@main"`

- **`resources:`** - Additional workflow or action files fetched alongside this workflow when running `gh aw add` (array). Entries are relative paths from the same directory to `.md` or `.yml`/`.yaml` files.
  - Example: `resources: [shared/tool-setup.md, shared/mcp/tavily.md]`

- **`tracker-id:`** - Optional identifier to tag all created assets (string)
  - Must be at least 8 characters and contain only alphanumeric characters, hyphens, and underscores
  - This identifier is inserted in the body/description of all created assets (issues, discussions, comments, pull requests)
  - Enables searching and retrieving assets associated with this workflow
  - Examples: `"workflow-2024-q1"`, `"team-alpha-bot"`, `"security_audit_v2"`

- **`secret-masking:`** - Configuration for secret redaction behavior in workflow outputs and artifacts (object)
  - `steps:` - Additional secret redaction steps to inject after the built-in secret redaction (array)
  - Use this to mask secrets in generated files using custom patterns
  - Example:

    ```yaml
    secret-masking:
      steps:
        - name: Redact custom secrets
          run: find /tmp/gh-aw -type f -exec sed -i 's/password123/REDACTED/g' {} +
    ```

- **`observability:`** - Workflow observability and telemetry configuration (object)
  - **`otlp:`** - Export OpenTelemetry spans to any OTLP-compatible backend (Honeycomb, Grafana Tempo, Sentry, etc.) (object)
    - `endpoint:` - OTLP collector endpoint URL. When a static URL is provided, its hostname is added to the AWF firewall allowlist automatically. Supports GitHub Actions expressions.
    - `github-app:` - Optional runtime auth configuration.
      - Preferred: provide GitHub App credentials (`app-id`/`client-id` + `private-key`) to mint a token with `actions/create-github-app-token` before `actions/setup`.
      - OIDC mode is used when `github-app` is configured without credentials (`app-id`/`client-id` + `private-key`).
      - OIDC mode requires `permissions.id-token: write` on the workflow/job.
    - `headers:` - Comma-separated `key=value` HTTP headers included in every OTLP export request (e.g. `Authorization=Bearer <token>`). Injected as `OTEL_EXPORTER_OTLP_HEADERS`. Supports GitHub Actions expressions.
    - `resource-attributes:` - Optional map of additional OTEL resource attributes appended to gh-aw/GitHub defaults. Values may be static strings or GitHub Actions expressions. Do not use `secrets.*` or `vars.*` here because resource attributes are exported to external observability backends and are not treated as secret values.
  - Example:

    ```yaml
    observability:
      otlp:
        endpoint: ${{ secrets.GH_AW_OTEL_ENDPOINT }}
        github-app:
          app-id: ${{ vars.APP_ID }}
          private-key: ${{ secrets.APP_PRIVATE_KEY }}
        headers: ${{ secrets.GH_AW_OTEL_HEADERS }}
    ```

    Every job emits setup and conclusion spans with rich attributes (`gh-aw.job.name`, `gh-aw.workflow.name`, `gh-aw.engine.id`, token usage). All jobs in a run share one trace ID. Dispatched child workflows inherit the parent's trace context via `aw_context`.

- **`runtimes:`** - Runtime environment version overrides (object)
  - Allows customizing runtime versions (e.g., Node.js, Python) or defining new runtimes
  - Runtimes from imported shared workflows are also merged
  - Each runtime is identified by a runtime ID (e.g., 'node', 'python', 'go')
  - Runtime configuration properties:
    - `version:` - Runtime version as string or number (e.g., '22', '3.12', 'latest', 22, 3.12)
    - `action-repo:` - GitHub Actions repository for setup (e.g., 'actions/setup-node')
    - `action-version:` - Version of the setup action (e.g., 'v4', 'v5')
    - `if:` - Optional GitHub Actions condition to control when runtime setup runs (e.g., `"hashFiles('go.mod') != ''"`)
  - Example:

    ```yaml
    runtimes:
      node:
        version: "22"
      python:
        version: "3.12"
        action-repo: "actions/setup-python"
        action-version: "v5"
      go:
        version: "1.22"
        if: "hashFiles('go.mod') != ''"   # Only install Go when go.mod exists
    ```

- **`runtimes.node.run-install-scripts:`** - Allow npm pre/post install scripts to execute during package installation for the Node.js runtime (boolean, default: `false`)
  - By default, `--ignore-scripts` is added to all generated npm install commands to prevent supply chain attacks via malicious install hooks
  - Set `run-install-scripts: true` under `runtimes.node` to allow scripts for Node.js installs
  - A supply chain security warning is emitted at compile time; in strict mode this is an error

- **`checkout:`** - Override how the repository is checked out in the agent job (object, array, or `false`)
  - By default, the workflow automatically checks out the repository. Use this field to customize checkout behavior.
  - Set to `false` to disable automatic checkout entirely (reduces startup time when repo access is not needed):

    ```yaml
    checkout: false
    ```

  - Single checkout (object):

    ```yaml
    checkout:
      fetch-depth: 0              # Fetch full history (default: 1 = shallow clone)
      github-token: ${{ secrets.MY_PAT }}  # Override token for private repos
    ```

  - Multiple checkouts (array):

    ```yaml
    checkout:
      - path: .
        fetch-depth: 0
      - repository: owner/other-repo
        path: ./libs/other
        ref: main
    ```

  - Supported fields per checkout entry:
    - `repository:` - Repository in `owner/repo` format (defaults to current repository)
    - `ref:` - Branch, tag, or SHA to check out (defaults to triggering ref)
    - `path:` - Relative path within `GITHUB_WORKSPACE` (defaults to workspace root)
    - `fetch-depth:` - Number of commits to fetch; `0` = full history, `1` = shallow (default)
    - `fetch:` - Additional Git refs to fetch after checkout (array of patterns)
      - `"*"` - fetch all remote branches
      - `"refs/pulls/open/*"` - all open pull-request refs
      - Branch names, glob patterns (e.g., `"feature/*"`)
      - Example: `fetch: ["*"]`, `fetch: ["refs/pulls/open/*"]`
    - `sparse-checkout:` - Newline-separated glob patterns for sparse checkout
    - `submodules:` - Submodule handling: `"recursive"`, `"true"`, or `"false"`
    - `lfs:` - Download Git LFS objects (boolean, default: `false`)
    - `wiki:` - Check out the repository's wiki (boolean, default: `false`). When `true`, automatically appends `.wiki` to the repository name. Combine with `repository:` to check out a different repo's wiki.
    - `github-token:` - Token for authentication (`${{ secrets.MY_PAT }}`); credentials removed after checkout

- **`jobs:`** - Groups together all the jobs that run in the workflow (object)
  - Standard GitHub Actions jobs configuration
  - Each job can have: `name`, `runs-on`, `steps`, `needs`, `if`, `env`, `permissions`, `timeout-minutes`, etc.
  - For most agentic workflows, jobs are auto-generated; only specify this for advanced multi-job workflows
  - **Security Notice**: Custom jobs run OUTSIDE the firewall sandbox. Execute with standard GitHub Actions security but NO network egress controls. Use only for deterministic preprocessing, data fetching, or static analysis—not agentic compute or untrusted AI execution.
  - **`setup-steps:`** - Steps injected at the earliest point in a custom or built-in job, before framework GitHub App token minting and before checkout (array). Use this for OIDC login, secret fetch, and credential bootstrap that must happen before framework token/checkout steps. Imported `setup-steps` run before main workflow `setup-steps`.
  - **`pre-steps:`** - Steps injected after framework setup scaffolding and before the job's main `steps:` in a custom or built-in job (array). For built-in jobs, this is after the `id: setup` step (which includes framework token minting/checkout setup) and before the first checkout. Imported `pre-steps` run before main workflow `pre-steps`.
  - **`setup-steps` vs `pre-steps`** - Use `setup-steps` for work that must run before framework GitHub App token minting and checkout (e.g., OIDC/secret bootstrap). Use `pre-steps` for work that should run later, after setup scaffolding and before the job's main `steps:`.
  - **Migration note** - No migration is required. `setup-steps` is additive; existing workflows that only use `pre-steps` continue to behave as before.
  - Example:

    ```yaml
    jobs:
      custom-job:
        runs-on: ubuntu-latest
        setup-steps:
          - name: Bootstrap credentials
            run: echo "runs before framework token/checkout setup"
        pre-steps:
          - name: Pre-flight setup
            run: echo "runs before checkout"
        steps:
          - name: Custom step
            run: echo "Custom job"
    ```

  - `setup-steps`/`pre-steps` also apply to built-in jobs (e.g. `activation`): use `setup-steps` for OIDC/secret bootstrap that must run before framework token minting, then verify the result in `pre-steps`.

- **`engine:`** - AI processor configuration
  - String format: `"copilot"` (default, recommended), `"claude"`, `"codex"`, `"gemini"`, or the experimental `"opencode"`, `"crush"`, `"pi"`
  - Object format for extended configuration:

    ```yaml
    engine:
      id: copilot                       # Required: coding agent identifier (copilot, claude, codex, gemini; experimental: opencode, crush, pi)
      version: beta                     # Optional: version of the action (has sensible default); also accepts GitHub Actions expressions: ${{ inputs.engine-version }}
      model: gpt-5                      # Optional: LLM model to use (has sensible default)
      permission-mode: acceptEdits      # Optional (claude only): auto | acceptEdits | plan | bypassPermissions. Default: acceptEdits (auto when tools.edit is false)
      agent: technical-doc-writer       # Optional: custom agent file (Copilot only, references .github/agents/{agent}.agent.md)
      max-turns: 5                      # Deprecated alias for the top-level `max-turns`; prefer the top-level field
      max-continuations: 3              # Optional: max autopilot continuations (copilot only; >1 enables --autopilot mode, default: 1)
      concurrency: "gh-aw-${{ github.workflow }}"  # Optional: agent job concurrency group (string or GitHub Actions concurrency object)
      env:                              # Optional: custom environment variables (object)
        DEBUG_MODE: "true"
      args: ["--verbose"]               # Optional: custom CLI arguments injected before prompt (array)
      api-target: api.acme.ghe.com      # Optional: custom API endpoint hostname for GHEC/GHES (hostname only, no protocol/path)
      command: /usr/local/bin/copilot   # Optional: override default engine executable (skips installation)
      bare: true                        # Optional: disable automatic context loading (copilot: --no-custom-instructions; claude: --bare; codex: --no-system-prompt; gemini: GEMINI_SYSTEM_MD=/dev/null). Default: false
      user-agent: "myapp/1.0"           # Optional: custom user agent string (codex engine only)
      config: |                         # Optional: additional TOML config appended to config.toml (codex engine only)
        [extra]
        key = "value"
      token-weights:                    # Optional: custom token cost weights for AI credit computation
        multipliers:
          my-custom-model: 2.5          # 2.5x the cost of claude-sonnet-4.5 (= 1.0)
        token-class-weights:
          output: 6.0                   # Override output token weight (default: 4.0)
          cached-input: 0.05            # Override cached input weight (default: 0.1)
    ```

  - **`gemini` engine**: Google Gemini CLI. Requires `GEMINI_API_KEY` secret. Does not support `max-turns`, `web-fetch`, or `web-search`. Supports AWF firewall and LLM gateway.
  - **`opencode` engine** (experimental): Provider-agnostic, open-source AI coding agent (BYOK). Defaults to Copilot routing via `COPILOT_GITHUB_TOKEN` (or `${{ github.token }}` with `copilot-requests` feature). Supports 75+ models via `provider/model` format. Supports AWF firewall and LLM gateway.
  - **`engine.driver:`** — canonical field to run a custom inner driver script instead of the engine's built-in CLI. For the `pi` engine it launches the driver directly with Node.js (e.g. built-in `pi_agent_core_driver.cjs`, or a workspace-relative path like `.github/drivers/pi_agent_core_driver_sample_node.cjs`); the driver must emit JSONL compatible with `parse_pi_log.cjs` so step summaries and token tracking keep working. Accepts a bare basename (resolved from the setup-action directory) or a workspace-relative path; no absolute paths, no `..`, only `.js`/`.cjs`/`.mjs` (pi).
  - **`copilot-sdk` / `engine.driver`** (experimental, copilot only): set `copilot-sdk: true` to start a headless Copilot CLI SDK sidecar. Set `driver: <path-or-command>` on the copilot engine to supply a custom SDK driver (`.js`/`.cjs`/`.mjs`/`.py`/`.ts`/`.mts`/`.rb`, or a bare PATH command); this also enables `copilot-sdk: true` automatically. Tune the repeated-tool-denial safeguard with the top-level `max-tool-denials:` field (default `5`).
  - **`engine.auth:`** — keyless Workload Identity Federation via the AWF API proxy instead of a static API key; requires `id-token: write`. Set `type: github-oidc` (only supported type) plus `provider: azure` (`azure-tenant-id`, `azure-client-id`, optional `azure-scope`/`azure-cloud`) for Azure OpenAI, or `provider: anthropic` (`federation-rule-id`, `organization-id`, `service-account-id`, `workspace-id`) for Claude. Optional `audience:`. Maps to `AWF_AUTH_*` env vars.

- **`network:`** - Network access control for AI engines (top-level field)
  - String format: `"defaults"` (curated allow-list of development domains)
  - Empty object format: `{}` (no network access)
  - Object format for custom permissions:

    ```yaml
    network:
      allowed:
        - "example.com"
        - "*.trusted-domain.com"
        - "https://api.secure.com"        # Optional: protocol-specific filtering
      blocked:
        - "blocked-domain.com"
        - "*.untrusted.com"
        - python                          # Block ecosystem identifiers
    ```

  - **Firewall (AWF) configuration** is set under `sandbox.agent`, not `network`. Use `sandbox.agent.version` to pin the AWF version (see below). The legacy `network.firewall` field is deprecated; run `gh aw fix` to migrate.

- **`sandbox:`** - Sandbox configuration for AI engines (string or object)
  - String format: `"default"` (default sandbox), `"awf"` (Agent Workflow Firewall)
  - Object format to pin an AWF version (strict mode requires explicit `id: awf`):

    ```yaml
    sandbox:
      agent:
        id: awf                     # Required in strict mode
        version: "v0.25.29"         # Optional: pin AWF version
        model-fallback: false       # Optional: disable model fallback (default true); set false for BYOK Azure OpenAI to prevent deployment-name rewriting
    ```

  - To disable the agent firewall while keeping MCP gateway enabled, you must provide the dangerous-disable justification feature:

    ```yaml
    features:
      dangerously-disable-sandbox-agent: "controlled environment with no internet access"
    sandbox:
      agent: false
    ```

  - **`sandbox.agent.sudo`** (boolean) controls whether AWF runs in root mode. Default is `false`: AWF runs rootless in network-isolation egress mode (`--network-isolation`), with MCP sidecars attached as bridge containers on the internal `awf-net` network. Set `sudo: true` for the legacy root mode; in strict mode explicit `sudo: true` is an error (warning otherwise).
  - **Strict mode**: `sandbox.agent` blocks without an explicit `id: awf` are rejected in strict mode. Any non-nil, non-disabled agent config without `id`/`type` defaults to AWF at runtime.

- **`tools:`** - Tool configuration for the coding agent (`github`, `agentic-workflows`, `edit`, `web-fetch`, `web-search`, `bash`, `playwright`, custom MCP server names, plus `timeout`/`startup-timeout`/`cli-proxy`). See [syntax-tools-imports.md](syntax-tools-imports.md#tool-configuration) for the full schema (GitHub `mode`/`toolsets`/integrity fields, bash allowlist decision rule, Playwright CLI mode).

- **`safe-outputs:`** - Safe output processing configuration. See [safe-outputs.md](safe-outputs.md) for complete documentation of all output types: `create-issue`, `create-discussion`, `add-comment`, `create-pull-request`, `push-to-pull-request-branch`, `close-issue`, `close-discussion`, `update-issue`, `update-pull-request`, `add-labels`, `remove-labels`, `replace-label`, `dispatch-workflow`, `call-workflow`, `create-code-scanning-alert`, `upload-asset`, `upload-artifact`, `assign-to-agent`, `assign-to-user`, and more.

  **Key safe-outputs global fields** (detail in [safe-outputs-runtime.md](safe-outputs-runtime.md)): `github-token`, `github-app`, `staged` (preview mode, no API calls), `footer`, `threat-detection`, `runs-on` (default `ubuntu-slim`), `messages`, `env`, `max-patch-size` (KB, default `4096`).


- **`mcp-scripts:`** - Define custom lightweight MCP tools as JavaScript, shell, Python, or Go scripts (object)
  - Tools mounted in MCP server with access to specified secrets
  - Each tool requires `description` and one of: `script` (JavaScript), `run` (shell), `py` (Python), or `go` (Go)
  - Tool configuration properties:
    - `description:` - Tool description (required)
    - `inputs:` - Input parameters with type and description (object)
    - `script:` - JavaScript implementation (CommonJS format)
    - `run:` - Shell script implementation
    - `py:` - Python script implementation
    - `go:` - Go script implementation (executed via `go run`, receives inputs as JSON via stdin)
    - `env:` - Environment variables for secrets (supports `${{ secrets.* }}`)
    - `dependencies:` - Runtime packages installed before first invocation (list of strings). Manager inferred from script type: `script`→npm, `py`→pip, `go`→`go get`, `run`→apt. Must be exact-version-pinned (`name@1.2.3`, `name==1.2.3`, `module@v1.2.3`, `name=1.6`); floating refs are rejected.
    - `timeout:` - Execution timeout in seconds (default: 60)
  - Example:

    ```yaml
    mcp-scripts:
      search-issues:
        description: "Search GitHub issues using API"
        inputs:
          query: { type: string, description: "Search query", required: true }
        script: |
          const { Octokit } = require('@octokit/rest');
          const octokit = new Octokit({ auth: process.env.GH_TOKEN });
          const r = await octokit.search.issuesAndPullRequests({ q: inputs.query });
          return r.data.items;
        dependencies: ["@octokit/rest@21.0.2"]
        env:
          GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
    ```

- **`slash_command:`** - Command trigger configuration for /mention workflows (under `on:`)
- **`cache:`** - Cache configuration for workflow dependencies (object or array)
- **`cache-memory:`** - Memory MCP server with persistent cache storage (boolean or object, under `tools:`)
- **`repo-memory:`** - Repository-specific memory storage (boolean, under `tools:`)
- **`comment-memory:`** - Managed issue/PR comment memory with file-based agent editing (boolean or object, under `tools:`)


<!-- file: syntax-core.md -->

---
description: Core GitHub Actions frontmatter fields supported by GitHub Agentic Workflows.
---

## Complete Frontmatter Schema

The YAML frontmatter supports these fields:

### Core GitHub Actions Fields

- **`on:`** - Workflow triggers (required)
  - String: `"push"`, `"issues"`, etc.
  - Object: Complex trigger configuration
  - Special: `slash_command:` for /mention triggers
  - **`forks:`** - Fork allowlist for `pull_request` triggers (array or string). By default, workflows block all forks and only allow same-repo PRs. Use `["*"]` to allow all forks, or specify patterns like `["org/*", "user/repo"]`
  - **`stop-after:`** - Can be included in the `on:` object to set a deadline for workflow execution. Supports absolute timestamps ("YYYY-MM-DD HH:MM:SS") or relative time deltas (+25h, +3d, +1d12h). The minimum unit for relative deltas is hours (h). Uses precise date calculations that account for varying month lengths.
  - **`reaction:`** - Add emoji reactions to triggering items
  - **`status-comment:`** - Post status comments when workflow starts/completes (boolean). Defaults to `true` for `slash_command` and `label_command` triggers; defaults to `false` for all other triggers. Must be explicitly enabled for non-command triggers with `status-comment: true`.
  - **`manual-approval:`** - Require manual approval using environment protection rules
  - **`skip-roles:`** - Skip workflow execution for users with specific repository roles (array)
    - Available roles: `admin`, `maintainer`, `write`, `read`
    - Example: `skip-roles: [read]` - Skip execution for users with read-only access
  - **`skip-bots:`** - Skip workflow execution when triggered by specific GitHub actors (array)
    - Bot name matching is flexible (handles with/without `[bot]` suffix)
    - Example: `skip-bots: [dependabot, renovate]` - Skip for Dependabot and Renovate
  - **`labels:`** - Filter label-triggered events to only fire when the triggering label matches one of these names (string or array)
    - String format: `labels: "my-label"` (single label name)
    - Array format: `labels: [label-a, label-b]` (any matching label fires the workflow)
    - Unmatched label events show as Skipped (⊘) rather than Failed (❌)
    - Use with `pull_request` triggers with `types: [labeled]` to respond only to specific labels
  - **`skip-if-match:`** - Skip workflow execution when a GitHub search query returns results (string or object)
    - String format: `skip-if-match: "is:issue is:open label:bug"` (implies max=1)
    - Object format with threshold:

      ```yaml
      skip-if-match:
        query: "is:issue is:open label:in-progress"
        max: 3      # Skip if 3 or more matches (default: 1)
        scope: none # Optional: disable automatic repo:owner/repo scoping for org-wide queries
      ```

    - Query is automatically scoped to the current repository (use `scope: none` for cross-repo queries)
    - Use to avoid duplicate work (e.g., skip if an open issue already exists)
  - **`skip-if-no-match:`** - Skip workflow execution when a GitHub search query returns no results (string or object)
    - String format: `skip-if-no-match: "is:pr is:open label:ready-to-deploy"` (implies min=1)
    - Object format with threshold:

      ```yaml
      skip-if-no-match:
        query: "is:pr is:open label:ready-to-deploy"
        min: 2      # Require at least 2 matches to proceed (default: 1)
        scope: none # Optional: disable automatic repo:owner/repo scoping for org-wide queries
      ```

    - Query is automatically scoped to the current repository (use `scope: none` for cross-repo queries)
    - Use to gate workflows on preconditions (e.g., only run if open PRs exist)
  - **`skip-if-check-failing:`** - Skip workflow execution when CI checks are failing on the triggering ref (boolean or object)
    - Boolean format: `skip-if-check-failing: true` (skip if any check is failing or pending)
    - Object format with filtering:

      ```yaml
      skip-if-check-failing:
        include:
          - build
          - test             # Only check these specific CI checks
        exclude:
          - lint             # Ignore this check
        branch: main         # Optional: check a specific branch instead of triggering ref
        allow-pending: true  # Optional: treat pending/in-progress checks as passing (default: false)
      ```

    - When `include` is omitted, all checks are evaluated
    - By default, pending/in-progress checks count as failing; set `allow-pending: true` to ignore them
    - Use to avoid running agents against broken code (e.g., skip PR review if CI is red)
  - **`github-token:`** - Custom GitHub token for pre-activation reactions, status comments, and skip-if search queries (string)
    - When specified, overrides the default `GITHUB_TOKEN` for these operations
    - Example: `github-token: ${{ secrets.MY_GITHUB_TOKEN }}`
  - **`github-app:`** - GitHub App credentials for minting a token used in pre-activation operations (object)
    - Mints a single installation access token shared across reactions, status comments, and skip-if queries
    - Can be defined in a shared agentic workflow and inherited by importing workflows
    - Fields:
      - `client-id:` - GitHub App client ID (required, e.g., `${{ vars.APP_ID }}`). Use `app-id:` for legacy compatibility.
      - `private-key:` - GitHub App private key (required, e.g., `${{ secrets.APP_PRIVATE_KEY }}`)
      - `owner:` - Optional installation owner (defaults to current repository owner)
      - `repositories:` - Optional list of repositories to grant access to
    - Example:

      ```yaml
      on:
        issues:
          types: [opened]
        github-app:
          client-id: ${{ vars.APP_ID }}
          private-key: ${{ secrets.APP_PRIVATE_KEY }}
      ```

  - **`stale-check:`** - Control whether the activation job verifies hashes against the compiled workflow (boolean or `"full"`, default: `true`)
    - When `false`, disables the hash check step; useful when workflow files are managed outside the default repository context (e.g., cross-repo org rulesets)
    - When `"full"`, checks both the frontmatter hash and body hash; use when prompt-body edits should also trigger recompilation detection

- **`permissions:`** - GitHub token permissions
  - Object with permission levels: `read`, `none` (and limited `write` for specific scopes)
  - Available permissions: `contents`, `issues`, `pull-requests`, `discussions`, `actions`, `checks`, `statuses`, `models`, `deployments`, `security-events`, `copilot-requests`
  - Write permissions are not allowed for security reasons; use `safe-outputs` for write operations instead
  - Exceptions: `id-token: write` is allowed to enable OIDC token minting; `copilot-requests: write` is recommended when targeting the Copilot coding agent so it can authenticate with `${{ github.token }}`
- **`runs-on:`** - Runner type for the main agent job (string, array, or object)
- **`runs-on-slim:`** - Runner type for all framework/generated jobs (activation, safe-outputs, unlock, etc.). Defaults to `ubuntu-slim`. `safe-outputs.runs-on` takes precedence for safe-output jobs specifically.
- **`runner:`** - Runner topology configuration (object). `topology: arc-dind` (only enum value) targets GitHub ARC runners with rootless Docker-in-Docker: gh-aw emits the topology in the AWF config, redirects the tool cache to a shared volume, and validates that no generated step requires root. AWF then activates split-filesystem handling, network isolation, sysroot staging, and DinD pre-staging automatically.

  ```yaml
  runner:
    topology: arc-dind
  ```
- **`timeout-minutes:`** - Agent execution step timeout in minutes (integer or GitHub Actions expression, defaults to 20 minutes; custom and safe-output jobs use the GitHub Actions platform default of 360 minutes unless explicitly set). Expressions are useful in compiled workflows that define `workflow_call` inputs, for example `timeout-minutes: ${{ inputs.timeout }}`. This setting applies to the workflow being compiled, not to plain GitHub Actions caller jobs that use job-level `uses:` (GitHub does not allow `timeout-minutes` on those caller jobs).
- **`concurrency:`** - Concurrency control (string or object)
  - **`queue:`** - Pending run queue behavior for the concurrency group (`single` or `max`, defaults to `single`). `single` keeps one pending run and replaces older pending runs; `max` allows up to 100 pending runs in FIFO order (useful for conclusion jobs that must not be dropped).

    ```yaml
    concurrency:
      group: "my-workflow"
      queue: max
    ```
  - **`job-discriminator:`** - Expression appended to compiler-generated job-level concurrency groups (`agent`, `output`, and `conclusion` jobs), preventing fan-out cancellations when multiple workflow instances run concurrently with different inputs. Common usage:

    ```yaml
    concurrency:
      job-discriminator: ${{ inputs.finding_id }}
    ```

    Common expressions:

    | Scenario | Expression |
    |---|---|
    | Fan-out by input | `${{ inputs.finding_id }}` |
    | Universal uniqueness | `${{ github.run_id }}` |
    | Dispatched or scheduled fallback | `${{ inputs.organization \|\| github.run_id }}` |

    `job-discriminator` is a gh-aw extension stripped from the compiled lock file. Has no effect on `workflow_dispatch`-only, `push`, or `pull_request` triggered workflows.
- **`env:`** - Environment variables (object or string)
- **`if:`** - Conditional execution expression (string)
- **`run-name:`** - Custom workflow run name (string)
- **`name:`** - Workflow name (string)
- **`pre-steps:`** - Custom workflow steps to run at the very beginning of the agent job, before checkout (object). Use for token minting or setup that must happen before the repository is checked out. Step outputs are available via `${{ steps.<id>.outputs.<name> }}` and can be referenced in `checkout.github-token` to avoid masked-value cross-job boundary issues. Same security restrictions apply as for `steps:`.
  - For job-scoped hooks under `jobs.<job-id>`, `setup-steps` run before framework GitHub App token minting and checkout, while `pre-steps` run after compiler setup and before the job's main steps.
- **`steps:`** - Custom workflow steps before AI execution (object). **Security Notice**: Custom steps run OUTSIDE the firewall sandbox with standard GitHub Actions security but NO network egress controls. Use only for deterministic data preparation, not agentic compute. **Secrets restriction**: Using `${{ secrets.* }}` expressions (other than `secrets.GITHUB_TOKEN`) in custom steps is an error in strict mode and a warning otherwise — move secret-dependent operations to a separate job outside the agent job.
- **`pre-agent-steps:`** - Custom workflow steps to run before MCP gateway startup (object or array). Use when preparation must install or configure MCP dependencies before the gateway starts. Same security restrictions apply as for `steps:`.
- **`post-steps:`** - Custom workflow steps after AI execution (object). **Security Notice**: Post-execution steps run OUTSIDE the firewall sandbox. Use only for deterministic cleanup, artifact uploads, or notifications—not agentic compute or untrusted AI execution. Same secrets restriction applies as for `steps:`.
- **`environment:`** - Environment that the job references for protection rules (string or object)
- **`container:`** - Container to run job steps in (string or object)
- **`services:`** - Service containers that run alongside the job (object)
- **`secrets:`** - Secret values passed to workflow execution (object)
  - Use GitHub Actions expressions: `${{ secrets.API_KEY }}`
  - String format: `secrets: { API_TOKEN: "${{ secrets.API_TOKEN }}" }`
  - Object format with descriptions:

    ```yaml
    secrets:
      API_TOKEN:
        value: ${{ secrets.API_TOKEN }}
        description: "API token for external service"
    ```

  - Never commit plaintext secrets
  - For reusable workflows, use `jobs.<job_id>.secrets` instead


<!-- file: syntax-tools-imports.md -->

---
description: Cache, tool, import, and permission reference for GitHub Agentic Workflows frontmatter.
---

# Tools, Imports, and Permissions

### Cache Configuration

The `cache:` field supports the same syntax as the GitHub Actions `actions/cache` action:

**Single Cache:**

```yaml
cache:
  key: node-modules-${{ hashFiles('package-lock.json') }}
  path: node_modules
  restore-keys: |
    node-modules-
```

**Multiple Caches:**

```yaml
cache:
  - key: node-modules-${{ hashFiles('package-lock.json') }}
    path: node_modules
    restore-keys: |
      node-modules-
  - key: build-cache-${{ github.sha }}
    path:
      - dist
      - .cache
    restore-keys:
      - build-cache-
    fail-on-cache-miss: false
```

**Supported Cache Parameters:**

- `key:` - Cache key (required)
- `path:` - Files/directories to cache (required, string or array)
- `restore-keys:` - Fallback keys (string or array)
- `upload-chunk-size:` - Chunk size for large files (integer)
- `fail-on-cache-miss:` - Fail if cache not found (boolean)
- `lookup-only:` - Only check cache existence (boolean)

Cache steps are auto-added to the workflow job; cache config is removed from the final `.lock.yml`.

> **Memory configuration**: For `cache-memory:`, `repo-memory:`, and `comment-memory:`, see [memory.md](memory.md).


## Tool Configuration

The `tools:` field configures which tools the coding agent may use.

### GitHub Tools (`tools.github`)

- `allowed:` - Array of allowed GitHub API functions. Each entry is either a string tool name (e.g., `issue_read`) or an object `{ name: <tool>, max-calls: <n> }` to cap how many times that tool may be called per run. Colon shorthand (`"issue_read:1"`) is **not** a call-limit form.

  ```yaml
  tools:
    github:
      allowed:
        - { name: issue_read, max-calls: 1 }
        - list_labels
        - pull_request_read
  ```
- `mode:` - GitHub access mode. **Prefer `"gh-proxy"`** — it is faster (no MCP server startup) and lets the agent use `gh` shell commands directly for all GitHub reads (issues, PRs, discussions, commits, etc.):
  - `"gh-proxy"` (**preferred**) — pre-authenticated `gh` CLI available in bash; no GitHub MCP server is registered. Use `gh` commands for all GitHub reads.
  - `"local"` (default) — Docker-based GitHub MCP Server; use GitHub MCP tools for reads, `gh` is not authenticated.
  - **do NOT use `"remote"`** — it does not work with the GitHub Actions token; use `"gh-proxy"` instead.
- `version:` - MCP server version (local mode only)
- `args:` - Additional command-line arguments (local mode only)
- `read-only:` - The GitHub MCP server always operates in read-only mode; this field is accepted but has no effect
- `github-token:` - Custom GitHub token
- `lockdown:` - Enable lockdown mode to limit content surfaced from public repositories to items authored by users with push access (boolean, default: false)
- `github-app:` - GitHub App configuration for token minting; when set, mints an installation access token at workflow start that overrides `github-token`
  - `client-id:` - GitHub App client ID (required, e.g., `${{ vars.APP_ID }}`). Use `app-id:` for legacy compatibility.
  - `private-key:` - GitHub App private key (required, e.g., `${{ secrets.APP_PRIVATE_KEY }}`)
  - `owner:` - Optional installation owner (defaults to current repository owner)
  - `repositories:` - Optional list of repositories to grant access to (array)
  - `permissions:` - Optional extra permissions to include in the minted token for org-level API access (object)
    - Example: `permissions: { members: read, organization-administration: read }` — required when calling org-level APIs (e.g., `orgs`, `users` toolsets) since the default GITHUB_TOKEN does not have org-scoped permissions
- `min-integrity:` - Minimum integrity level for MCP Gateway guard policy; controls which content the agent may act on based on author trust (`none`, `unapproved`, `approved`, `merged`)
- `blocked-users:` - Usernames whose content is unconditionally blocked (array or GitHub Actions expression); these users receive integrity below `none` and are always denied
- `approval-labels:` - Label names that elevate a content item's integrity to `approved` when present (array or GitHub Actions expression); does not override `blocked-users`
- `trusted-users:` - Usernames elevated to `approved` integrity regardless of `author_association` (array or GitHub Actions expression); takes precedence over `min-integrity` but not over `blocked-users`; requires `min-integrity` to be set
- `toolsets:` - Enable specific GitHub toolset groups (array only)
  - **Default toolsets** (when unspecified): `context`, `repos`, `issues`, `pull_requests` (excludes `users` as GitHub Actions tokens don't support user operations)
  - **Group aliases**: `default` (recommended action-friendly set), `action-friendly` (action-safe toolsets, excludes `users`), `all` (everything)
  - **Individual toolsets**: `context`, `repos`, `issues`, `pull_requests`, `actions`, `code_security`, `dependabot`, `discussions`, `experiments`, `gists`, `labels`, `notifications`, `orgs`, `projects`, `secret_protection`, `security_advisories`, `stargazers`, `users`, `search`
  - Examples: `toolsets: [default]`, `toolsets: [default, discussions]`, `toolsets: [repos, issues]`
  - **Recommended**: Prefer `toolsets:` over `allowed:` for better organization and reduced configuration verbosity

### Other Built-in Tools

- `agentic-workflows:` - GitHub Agentic Workflows MCP server for workflow introspection. Provides `status`, `compile`, `logs`, `audit`, and `checks` tools so agents can analyze run traces and improve workflows. Enable with `agentic-workflows: true`.
- `edit:` - File editing tools (required to write to files in the repository)
- `web-fetch:` - Web content fetching tools
- `web-search:` - Web search tools
- `bash:` - Shell command tools
  - **Bash allowlist decision rule:**
    - **PR-triggered workflows** processing **untrusted input** (issue/PR body, comment text, user-provided filenames): use a **narrow allowlist** (e.g. `[find, cat, grep, wc, jq]`). This limits blast radius if shell injection is embedded in untrusted content.
    - **`schedule` or `workflow_dispatch` workflows** with **no untrusted input** (only trusted API data or internal state): `["*"]` is acceptable.
    - **Rule of thumb**: If the workflow reads issue/PR bodies, comment text, or other user-provided strings, use a narrow list. Otherwise `["*"]` is acceptable.

  ```yaml
  # PR-triggered workflow reading untrusted user text
  on:
    pull_request:
  tools:
    bash: [find, cat, grep, wc, jq]

  # Internal scheduled workflow reading only trusted/internal data
  on:
    schedule:
      - cron: "0 * * * *"
  tools:
    bash: ["*"]
  ```
- `playwright:` - Browser automation for visual regression, accessibility, and end-to-end testing. Use `mode: cli` (recommended) — no Docker, runs `playwright-cli <command>` in bash, `localhost` reaches local servers directly. `mode: mcp` is deprecated (Docker-based). Pin a version with `version:` and restrict network to `local` + `playwright`.

  ```yaml
  tools:
    playwright:
      mode: cli          # recommended: token-efficient CLI mode
      version: "0.1.11"  # optional: @playwright/cli npm package version
  ```
- `timeout:` - Per-operation timeout in seconds for all tool and MCP calls (integer or expression). Defaults vary by engine (Claude: 60 s, Codex: 120 s).
- `startup-timeout:` - Timeout in seconds for MCP server initialization (integer or expression, default: 120).
- `cli-proxy:` - Mount each user-facing MCP server as a standalone CLI tool on `PATH` (boolean, default: `false`). When enabled, the agent can call MCP servers via shell (e.g. `github issue_read --method get ...`).

### Custom MCP Tools

Stdio MCP servers must be Docker-based (use `container:` + `entrypoint:`). For Node/Python servers already installed on the runner, use HTTP transport instead:

```yaml
# Stdio (Docker-based)
mcp-servers:
  my-custom-tool:
    container: "ghcr.io/my-org/my-tool:latest"
    entrypoint: "my-tool"
    allowed:
      - custom_function_1
      - custom_function_2

# HTTP (for Node/Python servers running on the runner)
mcp-servers:
  my-node-tool:
    type: http
    url: "http://localhost:8765/mcp"
```

HTTP MCP servers are also supported with optional upstream authentication:

```yaml
mcp-servers:
  my-server:
    type: http
    url: "https://myserver.example.com/mcp"
    headers:
      Authorization: "Bearer ${{ secrets.API_KEY }}"    # Optional: custom headers
  my-oidc-server:
    type: http
    url: "https://myserver.example.com/mcp"
    auth:
      type: github-oidc                                  # GitHub Actions OIDC token authentication
      audience: "https://myserver.example.com"          # Optional: custom OIDC audience
```

`auth.type: github-oidc` uses GitHub Actions OIDC tokens for secure server-to-server authentication without static credentials. The `audience` field defaults to the server URL when omitted.

### Engine Network Permissions

Control network access via the top-level `network:` field (defaults to `network: defaults` — basic infrastructure only). For workflows that build, test, or install packages, always add the language ecosystem alongside `defaults`:

```yaml
network:
  allowed:
    - defaults         # Basic infrastructure (CAs, Ubuntu verification, JSON schema)
    - node             # Node.js / npm ecosystem
    - "api.custom.com" # Custom domain
  blocked:
    - "*.ads.com"      # Block domain patterns
```

> **Full reference**: valid ecosystem identifiers, invalid shorthands, wildcard/protocol rules, and per-language inference live in [network.md](network.md). Do not restate the ecosystem table here.

## Imports Field

Import shared components using the `imports:` field in frontmatter:

```yaml
---
on: issues
engine: copilot
imports:
  - copilot-setup-steps.yml    # Import setup steps from copilot-setup-steps.yml
  - shared/security-notice.md
  - shared/tool-setup.md
  - shared/mcp/tavily.md
---
```

**Object form with inputs** — Use `path:`/`uses:` + `with:`/`inputs:` to pass values to shared workflows that define an `import-schema:`:

```yaml
imports:
  - path: shared/tool-setup.md
    with:
      environment: staging
      max-issues: 3
  - uses: shared/security-notice.md  # 'uses' is an alias for 'path'
```

`path`/`uses` and `with`/`inputs` are the only valid keys on an import entry. To supply environment variables or a checkout ref, set top-level `env:`/`checkout:` frontmatter inside the imported file itself — those are merged into the importing workflow (see the merge list below), not configured per import entry.

Conditional `imports:` entries are not supported. For experiment-specific prompt variants, keep the import unconditional and gate a `{{#runtime-import? ...}}` block (optional form) in the workflow body instead. The optional form is not promoted to unconditional lock-file macros, so the content is only injected when the condition is true at runtime.

Inside the imported workflow, access values via `${{ github.aw.import-inputs.<name> }}`.

### Import File Structure

Import files are in `.github/workflows/shared/` and can contain:

- Tool configurations
- Safe-outputs configurations
- Text content
- Mixed frontmatter + content

The following frontmatter fields in imported files are merged into the importing workflow:

- `tools:` - Merged with the importing workflow's tools
- `safe-outputs:` - Merged with safe-output configuration
- `env:` - Environment variables merged; conflicts between two imports defining the same key are compilation errors (remove the duplicate or move it to the main workflow to override)
- `checkout:` - Checkout configurations appended (main workflow's checkouts take precedence)
- `github-app:` - Top-level GitHub App credentials (first-wins across imports)
- `on.github-app:` - Activation GitHub App credentials (first-wins across imports)
- `steps:` - Steps appended in import order
- `pre-agent-steps:` - Steps appended in import order
- `post-steps:` - Steps appended in import order
- `jobs.<job-id>.setup-steps` and `jobs.<job-id>.pre-steps` - Merged per job with imported steps first, then main workflow steps. Execution order is `setup-steps` before `pre-steps`.
- `runtimes:`, `network:`, `permissions:`, `services:`, `cache:`, `features:`, `mcp-servers:`

Example import file:

```markdown
---
tools:
  github:
    allowed: [get_repository, list_commits]
safe-outputs:
  create-issue:
    labels: [automation]
env:
  MY_VAR: "shared-value"
checkout:
  fetch-depth: 0
---

Additional instructions for the coding agent.
```

### Special Import: copilot-setup-steps.yml

The `copilot-setup-steps.yml` file receives special handling when imported. Instead of importing the entire job structure, **only the steps** from the `copilot-setup-steps` job are extracted and inserted **at the start** of your workflow's agent job.

**Key behaviors:**

- Only the steps array is imported (job metadata like `runs-on`, `permissions` is ignored)
- Imported steps are placed **at the start** of the agent job (before all other steps)
- Other imported steps are placed after copilot-setup-steps but before main frontmatter steps
- Main frontmatter steps come last
- Final order: **copilot-setup-steps → other imported steps → main frontmatter steps**
- Supports both `.yml` and `.yaml` extensions
- Enables clean reuse of common setup configurations across workflows

**Example:**

```yaml
---
on: issue_comment
engine: copilot
imports:
  - copilot-setup-steps.yml
  - shared/common-tools.md
steps:
  - name: Custom environment setup
    run: echo "Main frontmatter step runs last"
---
```

In the compiled workflow, the order is: copilot-setup-steps → imported steps from shared/common-tools.md → main frontmatter steps.

## Permission Patterns

**IMPORTANT**: Agentic workflows MUST NOT include write permissions (`issues: write`, `pull-requests: write`, `contents: write`). Safe-outputs provide these via separate secured jobs. Granting writes to the main AI job causes a compilation error.

### Read-Only Pattern

```yaml
permissions:
  contents: read
  metadata: read
```

### Output Processing Pattern (Recommended)

```yaml
permissions:
  contents: read      # Main job minimal permissions
  actions: read

safe-outputs:
  create-issue:       # Automatic issue creation
  add-comment:  # Automatic comment creation
  create-pull-request: # Automatic PR creation
```

**Key Benefits of Safe-Outputs:**

- Main job runs with minimal permissions
- Write operations handled by dedicated jobs
- Safe-outputs jobs auto-receive required permissions
- Clear audit trail between AI processing and GitHub API


<!-- file: syntax.md -->

---
description: Compact index for the GitHub Agentic Workflows frontmatter schema.
---

# Frontmatter Schema Index

Use the smallest relevant reference instead of loading one large schema file.

| Topic | File |
|---|---|
| Core GitHub Actions fields (`on`, `permissions`, `runs-on`, `steps`, `env`, `secrets`) | [syntax-core.md](syntax-core.md) |
| Agentic workflow specific fields (`strict`, `bots`, `labels`, metadata, engine-specific fields) | [syntax-agentic.md](syntax-agentic.md) |
| Cache configuration, tools, imports, and permission patterns | [syntax-tools-imports.md](syntax-tools-imports.md) |
| `skills` field | [skills.md](skills.md) |
| `lsp` field | [lsp.md](lsp.md) |

## Usage Guidance

- Load only the section required for the current task.
- Prefer the dedicated topic files over copying schema details into creator or updater prompts.
- Keep examples short and route deep detail to the relevant syntax sub-file.


<!-- file: test-coverage.md -->

---
description: Guidance for creating agentic workflows that analyze test coverage — prefer reading pre-computed CI artifacts over re-running tests.
---

# Test Coverage Workflow Guidance

Consult this file when creating or updating a workflow that analyzes test coverage.

## Core Principle: Read Artifacts First

Always prefer fetching pre-computed coverage artifacts from CI over re-running tests. Re-running duplicates CI work.

## Coverage Data Strategy

Include this decision block in every coverage workflow prompt:

```
1. Find the latest successful CI run for this commit:
   `gh run list --commit "$HEAD_SHA" --status success --limit 5 --json databaseId,workflowName`
2. Download the coverage artifact (try names: coverage-report, coverage, test-results):
   `gh run download <run-id> --name coverage-report --dir /tmp/coverage`
3. If found, parse and analyze it — do NOT re-run tests.
4. If not found, run tests with coverage and note in the report that data was computed fresh.
```

## Frontmatter Template

```yaml
engine: copilot
on:
  pull_request:
    types: [opened, synchronize]
permissions:
  actions: read         # download artifacts
network: defaults
tools:
  github:
    toolsets: [default, actions]  # actions toolset enables artifact download
safe-outputs:
  add-comment:
    hide-older-comments: true
```

## Fallback: Run Tests

Use **only when** no prior CI artifact exists or CI doesn't upload coverage. Supported commands:

- infer the repository ecosystem from project files before running fallback coverage
- configure `network.allowed` to include `defaults` plus the inferred ecosystem(s) (for example `node`, `python`, `go`)
- never run fallback coverage with `network: defaults` alone

Example fallback network config:

```yaml
network:
  allowed:
    - defaults
    - node
```

| Language | Command |
|---|---|
| Node.js | `npx jest --coverage --coverageReporters=json-summary` |
| Python | `python -m pytest --cov=src --cov-report=json` |
| Go | `go test ./... -coverprofile=/tmp/coverage.out` |


<!-- file: test-expression.md -->

---
on:
  workflow_call:
    inputs:
      engine-version:
        type: string
engine:
  id: copilot
  version: ${{ inputs.engine-version }}
---
Fix the bug


<!-- file: token-optimization.md -->

---
description: Guide for reducing token consumption in agentic workflows — DataOps, gh-proxy, inline sub-agents, caveman experiments, and audit-based measurement.
---

# Token Consumption Optimization

## Quick-Reference Checklist

Apply these in order — each check can halve costs:

- [ ] **Cheap triage first**: classify duplicates, stale items, low-value events, and known cases before escalating
- [ ] **Frontier model as planner**: use frontier models for planning, synthesis, ambiguous decisions, and final judgment — not bulk extraction
- [ ] **DataOps**: Move data fetching into `steps:` — agent reads compact JSON, not raw API responses
- [ ] **gh-proxy**: Set `tools.github.mode: gh-proxy` — skips Docker MCP server startup and extra tool definitions
- [ ] **cli-proxy**: Mount additional MCP servers as CLIs via `cli-proxy: true` — agent pipes output through `jq` before it enters context
- [ ] **Sub-agents**: Delegate repetitive per-item tasks to `model: small` sub-agents (~10–20× cheaper)
- [ ] **Sub-skills**: Keep the main prompt as a short execution plan; move detailed playbooks/output layouts into `## skill:` blocks the agent invokes only when needed
- [ ] **Prompt size**: Strip redundant instructions, examples, and pleasantries from the prompt body
- [ ] **Dynamic context**: Inject only required fields — `${{ github.event.issue.number }}` not the full event payload
- [ ] **Pull context on demand**: query logs/data only after a hypothesis forms; avoid preloading large raw dumps into the initial prompt
- [ ] **Prompt caching**: Put stable instructions before dynamic content to maximize cache hits
- [ ] **Context hygiene**: keep the orchestrator context compact; prefer short worker summaries over raw output
- [ ] **Cadence**: If the result is not time-sensitive, schedule less often (`hourly` → `daily`, `daily` → `weekly`)
- [ ] **Batching**: Prefer scheduled batch processing over reactive events when delayed processing is acceptable
- [ ] **Telemetry**: Configure `observability.otlp` so token usage and run phases are measurable outside individual run logs
- [ ] **AgenticOps**: Add `copilot-token-audit` / `copilot-token-optimizer` workflows so the repository keeps finding waste automatically
- [ ] **Measure first**: Back every change with an `experiments:` field and `metric: "aic"` before promoting

---

## Frontier-Model Cost Pattern

A frontier model can reduce **total** cost when architecture prevents unnecessary invocations and keeps expensive context narrow.

- use frontier model for planning, hypothesis selection, synthesis, ambiguous decisions, final judgment
- do not spend frontier turns on repetitive extraction, duplicate detection, or broad first-pass scanning
- add a cheap triage stage for known/duplicate/stale/low-value events; stop with `noop` when escalation is unnecessary
- escalate to frontier model only when triage is uncertain or the case is genuinely new/high-value
- cap sub-agent fan-out so escalations cannot recurse without bound

Cost wins come from architecture and selective execution, not model tier alone.

---

## Pull Context, Do Not Push Context

Avoid front-loading large raw context when data can be fetched on demand. Prefer deterministic pre-steps that materialize compact files under `/tmp/gh-aw/`, `gh` + filtering (`jq`, `grep`) before context reaches the model, pre-aggregated summaries over full API payloads, and directed tool calls issued only after the agent forms a hypothesis. Anchoring warning: preselecting raw logs too early can make the model over-focus and miss the actual cause.

---

## How to Measure Token Usage

`gh aw audit` reports per-run cost. See [cli-commands.md](cli-commands.md#gh-aw-audit) for full command syntax (single run, `--json`, multi-run diff) and the MCP `audit` equivalent.

Token-specific fields in `gh aw audit <run-id> --json`:

- `agent_usage.aic` — AI Credits (AIC), the normalized cost metric (1 AIC = $0.01; accounts for model price differences and cache discounts)
- `agent_usage.input_tokens` / `agent_usage.output_tokens` — raw token counts
- `agent_usage.cache_read_tokens` / `agent_usage.cache_write_tokens` — tokens served from the prompt cache

For per-call detail, `gh aw audit <run-id>` downloads artifacts into `logs/run-<run-id>/`; read `firewall-audit-logs/api-proxy-logs/token-usage.jsonl` (one API call per line, with `model` and token counts) to find the most expensive calls. Diff two runs with `gh aw audit <base-id> <optimized-id>` to detect AI-credit regressions.

Treat optimization as successful only when quality remains acceptable. A quality regression is a failure even if AI Credits decrease.

---

## Technique 1 — DataOps: Move Compute to Steps

The single biggest optimization. Replace agentic data fetching with deterministic shell commands in `steps:`. Shell steps run outside the AI sandbox (no tokens) and produce structured output the agent reads directly.

### Before (agent does all the work)

```markdown
---
engine: copilot
tools:
  github:
    mode: gh-proxy
    toolsets: [default, pull_requests]
---

Fetch all open PRs in ${{ github.repository }}, compute the merge rate, identify authors with the most contributions, and create a weekly summary discussion.
```

### After (DataOps pattern)

```markdown
---
engine: copilot
tools:
  github:
    mode: gh-proxy
  bash: ["*"]

steps:
  - name: Fetch and aggregate PR data
    env:
      GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
    run: |
      mkdir -p /tmp/gh-aw/data
      gh pr list --repo "${{ github.repository }}" \
        --state all --limit 100 \
        --json number,title,state,author,createdAt,mergedAt,additions,deletions \
        > /tmp/gh-aw/data/prs.json

      jq '{
        total: length,
        merged: [.[] | select(.state=="MERGED")] | length,
        open: [.[] | select(.state=="OPEN")] | length,
        top_authors: ([.[].author.login] | group_by(.) | map({author:.[0], count:length}) | sort_by(-.count) | .[0:5])
      }' /tmp/gh-aw/data/prs.json > /tmp/gh-aw/data/stats.json

safe-outputs:
  create-discussion:
    title-prefix: "[weekly-pr] "
    category: "General"
    close-older-discussions: true
---

Read the pre-computed stats at `/tmp/gh-aw/data/stats.json` and `/tmp/gh-aw/data/prs.json`.
Create a concise weekly PR summary discussion.
```

**Best practices:**

- One JSON file per data source; `jq` to pre-aggregate
- Store files under `/tmp/gh-aw/`
- Document file locations and schema in the prompt body so the agent doesn't need to explore

See also: [DataOps pattern docs](https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/data-ops.md)

---

## Technique 2 — Use `gh-proxy` and `cli-proxy` Instead of the MCP Server

### `mode: gh-proxy` (GitHub reads)

```yaml
tools:
  github:
    mode: gh-proxy      # ✅ preferred — pre-authenticated gh CLI, no MCP server startup
    toolsets: [default]
```

Agent reads GitHub via `gh issue list`, `gh pr view`, etc. and pipes through `jq` before data enters context. `mode: local` starts a Docker-based MCP server with startup latency and verbose tool results.

### `cli-proxy: true` (other MCP servers as CLIs)

When a workflow uses additional MCP servers (e.g., a custom Notion or Slack MCP), `cli-proxy: true` mounts each server as a standalone CLI tool on `PATH`:

```yaml
tools:
  cli-proxy: true
  github:
    mode: gh-proxy
  my-custom-mcp:
    ...
```

With `cli-proxy`, the agent calls `my-custom-mcp <tool> <args>` from bash and pipes output through `jq`/`grep` to extract only needed fields — instead of receiving the full MCP tool response in context.

**Summary:**

| Mode | Docker startup | Extra tool definitions | Agent output processing |
|---|---|---|---|
| `mode: local` + MCP tools | Yes | Yes | Tool result (full JSON) |
| `mode: gh-proxy` + bash | No | No | Agent pipes through jq |
| `cli-proxy: true` + bash | Yes (once) | Reduced | Agent pipes through jq |

---

## Technique 3 — Inline Sub-Agents with Smaller Models

Sub-agents with `model: small` cost 10–20× less than the parent model. Use them for classification, one-sentence summarization, structured extraction, and scoring; reserve the large model for synthesis.

### Pattern

```
steps:        → deterministic shell (zero AI tokens)
sub-agents:   → small model per item (cheap, parallelizable)
main agent:   → synthesizes compact sub-agent results (one high-quality pass)
```

### Example

```markdown
---
engine: copilot
tools:
  cli-proxy: true
  github:
    mode: gh-proxy
  bash: ["*"]

steps:
  - name: Split issues into per-item files
    run: |
      mkdir -p /tmp/gh-aw/issues
      gh issue list --repo "${{ github.repository }}" \
        --state open --limit 50 \
        --json number,title,body,labels \
        | jq -c '.[]' \
        | while IFS= read -r issue; do
            num=$(echo "$issue" | jq -r '.number')
            echo "$issue" > /tmp/gh-aw/issues/issue-${num}.json
          done
---

## Step 1 — classify each issue

For every `/tmp/gh-aw/issues/issue-*.json`, use the `classifier` agent.
Write output to `/tmp/gh-aw/issues/cat-<number>.json`.

## Step 2 — synthesize

Read all `cat-*.json` files and create a triage report grouped by category.

## agent: `classifier`
---
description: Classifies a GitHub issue into a single category
model: small
---
Read the JSON file provided. Return only:
`{"number": <n>, "category": "bug|feature|question|docs|security|other"}`
Nothing else.
```

**Why this saves tokens:** sub-agents run the cheap `small` model; main agent reads only compact `{"number":…, "category":…}` JSON; dispatches run in parallel.

### Pair sub-agents with sub-skills (progressive disclosure)

- Keep the main prompt short and plan-like (what to do, in what order).
- Put verbose instructions (report layout, rubric details, formatting constraints) into `## skill:` blocks.
- Invoke skills only when needed (e.g., producing final output), so early turns stay lean.

This delays expensive instruction payloads until the final phase, lowering ambient context.

**Sub-agent model aliases:**

| Alias | Use when |
|---|---|
| `small` | Classification, extraction, one-sentence summaries, scoring |
| `large` | Complex reasoning, multi-step synthesis, code generation |
| `inherited` | Sub-agent needs same capability as the parent (default) |

Always use aliases, not model IDs — aliases resolve to the best available model per provider.

See also: [Inline Sub-Agents](subagents.md)

---

## Technique 4 — Apply the Caveman Technique

A/B compare a verbose prompt against a minimal one. Adopt minimal if quality holds.

```yaml
experiments:
  prompt_style: [verbose, minimal]
```

```markdown
{{#if experiments.prompt_style == "verbose" }}
Please analyze all of the open issues in this repository and provide a comprehensive, detailed report covering: the number of open issues, any significant trends or patterns you observe, the most frequently occurring labels, the oldest unresolved issues, a prioritized list of the most critical items, and any recommendations for the team.
{{#else}}
List open issues by priority. Top 5 critical items. Be brief.
{{/if}}
```

Measure AIC via run summary or `gh aw audit`. If `minimal` wins on cost at acceptable quality, promote as baseline.

---

## Technique 5 — Use Experiments to Measure Impact

Declare an experiment before making any prompt or configuration change, and compare before/after cost and quality. Run ≥ 20 cycles per variant for statistical significance on high-frequency workflows.

```yaml
experiments:
  optimization_v1:
    variants: [control, optimized]
    description: "DataOps refactor — move issue fetching to steps:"
    metric: "aic"
    issue: "123"
```

Reference the active variant in the prompt:

```markdown
{{#if experiments.optimization_v1 == "optimized" }}
Read the pre-fetched data from `/tmp/gh-aw/data/`.
{{#else}}
Fetch open issues from ${{ github.repository }} using the GitHub tools.
{{/if}}
```

**After enough runs:**

1. Compare variants using `gh aw audit <control-run-id> <optimized-run-id>`
2. Inspect `aic`, `input_tokens`, `output_tokens`, `cache_read_tokens`, and `cache_write_tokens`
3. Validate output quality and decision accuracy against the control run
4. If the optimized variant wins on cost **and** quality, rewrite the baseline prompt and remove the `experiments:` field

**Key experiment dimensions for token optimization:**

| Dimension | Example variants |
|---|---|
| Prompt verbosity | `verbose` / `concise` / `minimal` |
| Data source | `agentic-fetch` / `dataops-steps` |
| Model tier | Run separate workflows for each engine |
| Sub-agent usage | `single-agent` / `with-subagents` |
| Tool mode | `mcp-local` / `gh-proxy` |

See also: [A/B Testing Experiments](experiments.md)

---

## Technique 6 — Reduce Trigger Frequency and Batch Work

The cheapest run is the one you don't execute. If a workflow doesn't need near-real-time feedback, run it less often and batch.

### Prefer slower schedules when latency is acceptable

- `hourly` → `daily on weekdays` for team-facing summaries or audits
- `daily` → `weekly` for trend reports, optimization reviews, backlog hygiene
- `every N hours` → daily/weekly batch when the workflow only produces guidance

### Prefer scheduled batches over reactive triggers

Reactive triggers (`issues:`, `pull_request:`, comment commands) suit immediate feedback. Otherwise prefer `schedule: daily on weekdays` and batch work. Typical batch-friendly tasks: triage summaries, stale backlog review, token audits, security digests. Combine with `cache-memory` or `repo-memory` to track processed items.

---

## Technique 7 — Measure Continuously with OpenTelemetry and AgenticOps

Export telemetry automatically and add workflows that keep finding token waste over time.

### Enable OTLP export

Add workflow-level OpenTelemetry export so each run emits token and phase data to your observability backend:

```yaml
observability:
  otlp:
    endpoint: ${{ secrets.GH_AW_OTEL_ENDPOINT }}
    headers: ${{ secrets.GH_AW_OTEL_HEADERS }}
```

Setup, agent, and conclusion spans carry token usage attributes. See [Frontmatter syntax](syntax-agentic.md#agentic-workflow-specific-fields).

### Add AgenticOps token workflows

- `copilot-token-audit` — scheduled audit of token usage across workflows
- `copilot-token-optimizer` — scheduled follow-up that identifies one expensive workflow and proposes concrete savings

Loop: export OTEL → summarize usage → open optimization issues → re-measure. See `.github/workflows/` for examples.

---

## Technique 8 — Enable Prompt Caching

Prompt caching is automatic via the AWF gateway. Cached input tokens are weighted at `0.1` versus `1.0` for uncached input — repeated context (system prompt, shared preamble) costs ~10× less when cached.

To maximize cache hits:

- **Keep stable content at the top of the prompt** — instructions that don't change between runs (role, output format, schema) before dynamic content (issue body, event context).
- **Use `cache-memory`** for workflows that re-read the same large knowledge base across runs; avoids duplicate context every turn.
- **Minimize dynamic context** — inject only the fields the agent needs: `${{ github.event.issue.number }}` instead of the full event payload.

---

## Technique 9 — Cap Spend with AI-Credit Guardrails

Two top-level frontmatter fields enforce AI Credit budgets directly, independent of the techniques above. Both accept an integer or a `K`/`M` short-form string (e.g. `100M`, `500K`). Typical workflow range: `100` to `2500`.

- **`max-ai-credits:`** — Per-run AI credit budget enforced by the AWF firewall/API proxy (default `1000`). The agent is steered to stay within budget; set a negative value to disable enforcement and steering.
- **`max-daily-ai-credits:`** — Per-user 24-hour guardrail. At activation, gh-aw sums the triggering user's AI credits across their runs of this workflow over the last 24 hours and blocks execution once the total exceeds the threshold. Enabled by default with a system default threshold; set `-1` to disable, or an explicit value to override the default.

```yaml
max-ai-credits: 100M        # per-run cap (short-form string)
max-daily-ai-credits: 500M  # per-user 24h cap; -1 disables
```

For custom or private models, the top-level **`models:`** frontmatter field supplies pricing in the same structure as `models.json` (keyed `providers.<provider>.models.<model>.cost` with `input`/`output`/`cache_read`/`cache_write` per-token costs). Entries are merged with the built-in `models.json` at runtime — they override matching models and fill gaps for unknown ones — so AI Credit accounting stays accurate for models gh-aw does not price by default.

---

## Additional Resources

| Topic | File |
|---|---|
| Inline sub-agents syntax | [subagents.md](subagents.md) |
| A/B experiments | [experiments.md](experiments.md) |
| Persistent memory | [memory.md](memory.md) |
| DataOps pattern | [DataOps guide](https://github.com/github/gh-aw/blob/main/docs/src/content/docs/patterns/data-ops.md) |
| Audit command reference | [cli-commands.md](cli-commands.md) |
| Frontmatter syntax | [syntax.md](syntax.md) |


<!-- file: triggers.md -->

---
description: Trigger patterns for GitHub Agentic Workflows — events, fuzzy scheduling, fork security, slash commands, and label commands.
---

## Trigger Selection

Use the smallest trigger that matches the request.

### Decision Matrix

| User intent | Trigger | Typical read tools | Typical safe output |
|---|---|---|---|
| Review PR changes, comment on quality, suggest fixes | `pull_request` | `github` (`gh-proxy`), optional `playwright` for UI diffs | `add-comment` |
| Investigate failed CI/Actions runs and summarize incident | `workflow_run` | `github` (`gh-proxy`) with `actions: read` | `create-issue` |
| Monitor external service deployment failures (Heroku, Vercel, Fly.io) | `deployment_status` | `github` (`gh-proxy`) with `deployments: read` | `create-issue` |
| Run visual regression checks on PR UI changes | `pull_request` | `playwright` + `cache-memory` | `add-comment` |
| Publish weekly stakeholder/product digest | `schedule` | `github` (`gh-proxy`) | `create-issue` (default), `create-discussion` only if explicitly requested |
| Review dependency licenses or design-token governance on PRs | `pull_request` with `paths:` | `github` (`gh-proxy`) | `add-comment`; `create-issue` only for blocked/policy-violating findings |
| Govern documentation content (stale pages, broken links, outdated ownership) | `schedule` or `pull_request` with `paths:` | `github` (`gh-proxy`) | `add-comment` on PR; `create-issue` for stale content |
| PM / product health digest (release velocity, open issues by area) | `schedule` | `github` (`gh-proxy`) | `create-issue` with `close-older-issues: true` |
| Compliance or regulatory review | `pull_request` with `paths:` or `schedule` | `github` (`gh-proxy`) | `add-comment` for findings; `create-issue` for violations |

> **`workflow_run` vs `deployment_status`**: Use `workflow_run` when monitoring another GitHub Actions workflow in the same repository. Use `deployment_status` when an external service (Heroku, Vercel, Fly.io) reports deployment results back to GitHub via the Deployments API. See [deployment-status.md](deployment-status.md) for the full pattern.
>
> For `workflow_run`, always scope explicitly: set `workflows:` to named upstream workflow(s), use `types: [completed]`, and gate outcomes with an `if:` guard on `${{ github.event.workflow_run.conclusion }}` (for incident triage, usually `failure`, `timed_out`, `cancelled`, `action_required`) unless the user asked for success reporting.

### Scenario Examples

Engineering-focused:

- **Schema/API review on PRs**: trigger `pull_request` with `paths:` scoped to backend contract files (for example `db/migrate/**`, `migrations/**`, `schema/**`, `openapi/**`, `api/**`), read via `github` (`gh-proxy`), publish findings with `add-comment`, call `noop` when contracts are unchanged.
- **Visual regression on UI changes**: trigger `pull_request`, use only `playwright` + `cache-memory` (no extra tools), keep network minimal (allowlist only target preview/app hosts if required), state the exact baseline source (`cache-memory` key, artifact, or branch path), publish via `add-comment`, call `noop` when UI paths are unchanged.
- **Deployment incident triage**: use `deployment_status` for external provider failures and `workflow_run` for GitHub Actions failures, publish incident reports via `create-issue`, derive a stable failure key (for example workflow + job + failing step or error signature), and call `noop` when a failure self-recovers or matches an existing open incident.
- **Dependency-license compliance review on PRs**: trigger `pull_request` with `paths:` scoped to dependency manifests (for example `package.json`, `go.mod`, `requirements.txt`, `Cargo.toml`), read new dependency additions via `github` (`gh-proxy`), classify each addition by license tier (allowed / needs-review / blocked), publish findings with `add-comment`, escalate blocked additions with `create-issue` for team-wide follow-up, call `noop` when no new dependencies were added or all additions are pre-approved.

Non-engineering personas:

- **Documentation governance**: trigger `schedule` (weekly) or `pull_request` with `paths:` scoped to docs directories, check for stale ownership, broken links, or missing metadata using `github` (`gh-proxy`), publish findings with `create-issue` for pages needing owner action, call `noop` when all docs pass checks.
- **PM / roadmap health digest**:
  - trigger `schedule` (weekly on weekdays)
  - aggregate open issues by label, milestone, or area using `github` (`gh-proxy`)
  - publish a structured summary with `create-issue` and `close-older-issues: true`
  - use an explicit window such as `last 7 full days ending at run start (UTC)`
  - derive a stable key such as `pm-digest:<scope>:2026-W27`
  - call `noop` when the window has zero qualifying updates
- **Product/stakeholder digest**:
  - trigger `schedule` plus optional `workflow_dispatch`
  - define an explicit window such as `last 7 full days ending at run start (UTC)` or `since previous successful run`
  - choose grouping dimensions up front (for example team, service, owner, severity, or status)
  - publish with `create-issue` by default
  - reuse a stable deduplication key for the same scope and window
  - call `noop` when there are no updates in that window
- **Compliance audit (material drift example)**:
  - trigger `schedule: daily on weekdays` as the preferred baseline cadence, and add `workflow_dispatch` for ad hoc reruns
  - define **material drift** as any control-state change that affects compliance posture (for example: required policy file removed, control owner missing, control status downgraded from pass to fail, or required approval evidence link missing)
  - use `github` (`gh-proxy`) to compare current control evidence with the previous successful run window
  - publish violations with `create-issue` using `close-older-issues: true` and a stable key such as `compliance-drift:<framework>:<window-id>`
  - call `noop` when no material drift is detected in the selected window
- **Compliance review (regulatory/policy)**: trigger `schedule` (monthly) or `pull_request` with `paths:` scoped to policy files, read current policy state via `github` (`gh-proxy`), produce a structured compliance report per control or requirement, publish with `create-issue` and `close-older-issues: true`, call `noop` when all controls pass.

### Program Manager and Information-Worker Digest Defaults

For recurring PM, stakeholder, and other information-worker digests, specify all three of these elements up front:

| Element | Default guidance | Examples |
|---|---|---|
| Report window | Use a closed, explicit UTC window or `since previous successful run` | `last 7 full days ending at run start (UTC)`, `previous calendar month (UTC)` |
| Grouping dimensions | Group by the dimensions the audience already uses to make decisions | team, area, milestone, owner, severity, status, repository |
| Deduplication key | Derive one stable key per scope and window before creating output | `pm-digest:platform:2026-W27`, `stakeholder-digest:mobile:2026-07-02` |

Use week-based keys for weekly digests and calendar-date keys for daily or monthly reports.

Duplicate-suppression rule:

- Search for an existing open issue with the same key before creating a new digest.
- Search by a stable title prefix or dedicated label that includes the key before creating a new digest.
- If one exists, update it with `add-comment` instead of opening a duplicate issue.
- If the selected window has zero qualifying updates, call `noop`.

### Pattern-specific `noop` examples

- **PR reviewer (`pull_request`)**: `noop` when only docs/metadata changed outside scoped `paths:`.
- **Failure triage (`workflow_run`)**: `noop` when rerun succeeds, signal is flake-only, or an open incident already exists for the same failure key.
- **Scheduled digest (`schedule`)**: `noop` when the exact reporting window (for example `since previous successful run`) has zero qualifying updates.
- **Deployment monitor (`deployment_status`)**: `noop` when non-terminal statuses (`queued`, `in_progress`) arrive without a terminal failure.

## Trigger Patterns

### Standard GitHub Events

```yaml
on:
  issues:
    types: [opened, edited, closed]
  pull_request:
    types: [opened, edited, closed]
    forks: ["*"]              # Allow from all forks (default: same-repo only)
  push:
    branches: [main]
  schedule:
    - cron: "0 9 * * 1"  # Monday 9AM UTC
  workflow_dispatch:    # Manual trigger
```

### `workflow_run` Failure-Triage Pattern

Use this when reacting to failures from another workflow in the same repository:

```yaml
on:
  workflow_run:
    workflows: ["CI", "Deploy"]
    types: [completed]
  workflow_dispatch:
```

Then gate analysis to failure outcomes:

```yaml
if: contains(fromJson('["failure","timed_out","cancelled","action_required"]'), github.event.workflow_run.conclusion)
```

These are "non-success outcomes requiring triage"; keep the list explicit so readers can tighten (e.g., only `failure`) or broaden it.

No-op expectations for this pattern:

- `noop` when the monitored run concludes `success`.
- `noop` when the same failure already has an open incident issue (duplicate suppression).

#### Fuzzy Scheduling

Use fuzzy scheduling instead of exact cron to distribute execution times. Avoids load spikes and the "Monday wall of work" from weekend accumulation.

**Basic Fuzzy Schedules:**

```yaml
on:
  schedule: daily on weekdays    # Monday-Friday only (recommended for daily workflows)
  schedule: daily                # All 7 days
  schedule: weekly               # Once per week
  schedule: hourly               # Every hour
```

**Examples with Intervals:**

```yaml
on:
  schedule: every 2 hours on weekdays    # Every 2 hours, Monday-Friday
  schedule: every 6 hours                # Every 6 hours, all days
```

**Why Prefer Weekday Schedules:**

- Avoids Monday backlog from weekend accumulation
- Aligns with team business hours
- Notifications fire when team members are active

The compiler converts fuzzy schedules to deterministic cron (e.g., `daily on weekdays` → `43 5 * * 1-5`), scatters execution to avoid load spikes, and adds `workflow_dispatch:` for manual runs.

**Recommended Pattern:**

```yaml
# ✅ GOOD - Weekday schedule avoids Monday wall of work
on:
  schedule: daily on weekdays

# ⚠️ ACCEPTABLE - But may create Monday backlog
on:
  schedule: daily
```

#### Fork Security for Pull Requests

By default, `pull_request` triggers **block all forks** and only allow PRs from the same repository. Use the `forks:` field to explicitly allow forks:

```yaml
# Default: same-repo PRs only (forks blocked)
on:
  pull_request:
    types: [opened]

# Allow all forks
on:
  pull_request:
    types: [opened]
    forks: ["*"]

# Allow specific fork patterns
on:
  pull_request:
    types: [opened]
    forks: ["trusted-org/*", "trusted-user/repo"]
```

### Command Triggers (/mentions)

```yaml
on:
  slash_command:
    name: my-bot  # Responds to /my-bot in issues/comments
```

This automatically creates conditions to match `/my-bot` mentions in issue bodies and comments.

You can restrict where commands are active using the `events:` field:

```yaml
on:
  slash_command:
    name: my-bot
    events: [issues, issue_comment]  # Only in issue bodies and issue comments
```

**Supported event identifiers:**

- `issues` - Issue bodies (opened, edited, reopened)
- `issue_comment` - Comments on issues only (excludes PR comments)
- `pull_request_comment` - Comments on pull requests only (excludes issue comments)
- `pull_request` - Pull request bodies (opened, edited, reopened)
- `pull_request_review_comment` - Pull request review comments
- `*` - All comment-related events (default)

**Note**: `issue_comment` and `pull_request_comment` both map to GitHub Actions' `issue_comment` event with filtering to distinguish them.

### Label Command Triggers

Trigger workflows when specific labels are added to issues, PRs, or discussions:

```yaml
# Shorthand: trigger on any labeled event
on: label-command my-label

# Or with explicit configuration
on:
  label_command:
    name: ai-review        # Single label name (or use names: [...] for multiple)
    events: [pull_request] # Optional: restrict to issues, pull_request, discussion (default: all three)
    strategy: decentralized # Optional: route labeled events via generated agentic_commands.yml
    remove_label: false    # Optional: remove triggering label after activation (default: true)
```

Use `names:` for multiple labels that activate the same workflow:

```yaml
on:
  label_command:
    names: [ai-review, copilot-review]
    events: [pull_request]
```

By default, the triggering label is automatically removed after the workflow activates (`remove_label: true`). Set `remove_label: false` to keep the label.

The activated label name is exposed to downstream jobs as `${{ needs.activation.outputs.label_command }}`.

### Semi-Active Agent Pattern

```yaml
on:
  schedule:
    - cron: "0/10 * * * *"  # Every 10 minutes
  issues:
    types: [opened, edited, closed]
  issue_comment:
    types: [created, edited]
  pull_request:
    types: [opened, edited, closed]
  push:
    branches: [main]
  workflow_dispatch:
```


<!-- file: update-agentic-workflow.md -->

---
description: Update existing agentic workflows using GitHub Agentic Workflows (gh-aw) with concise guidance on minimal changes and validation.
disable-model-invocation: true
---

# GitHub Agentic Workflow Updater

Update existing workflow files in `.github/workflows/`.

## Load These References First

- [github-agentic-workflows.md](github-agentic-workflows.md)
- [workflow-editing.md](workflow-editing.md)
- [workflow-constraints.md](workflow-constraints.md)
- [safe-outputs.md](safe-outputs.md)
- [syntax.md](syntax.md)

Load these additional files only when relevant:

- [campaign.md](campaign.md)
- [experiments.md](experiments.md)
- [visual-regression.md](visual-regression.md)
- [serena-tool.md](serena-tool.md)

## Scope

This prompt is for **updating existing workflows only**. For new workflows, use the creator prompt.

## Start the Conversation

1. Ask which workflow to update.
2. Ask what change is needed.
3. Then inspect the existing file before proposing edits.

## First Decision: Frontmatter or Prompt Body?

Use [workflow-editing.md](workflow-editing.md) as the source of truth.

- frontmatter change → recompilation required
- markdown-body-only change → no recompilation required

## Update Rules

- make the smallest possible change
- preserve existing style and structure unless reorganization is required
- do not rewrite unrelated frontmatter sections
- keep the agent job read-only
- when targeting the Copilot coding agent, recommend `permissions: { copilot-requests: write }` for Copilot authentication
- use `safe-outputs:` for writes
- prefer `toolsets:` for GitHub tools

## Common Update Categories

See [workflow-editing.md](workflow-editing.md) for the full frontmatter-vs-body recompilation taxonomy.

- **Prompt-only updates** (clarifying instructions, tightening wording, adding or removing examples, adding guardrails or output-format guidance): do not recompile; the change applies on the next run.
- **Frontmatter updates** (triggers, permissions, tools and MCP servers, network, safe outputs, imports, timeouts or engine configuration): run `gh aw compile <workflow-id>`, fix every error, then review the `.lock.yml`.

## Cost-Oriented Update Checks

When refining existing workflows, preserve minimal edits while verifying:

- cheap triage runs before escalation for high-volume inputs
- known/duplicate/stale/low-value cases stop with explicit `noop` or safe output
- expensive/frontier reasoning is limited to ambiguous or high-value cases and final synthesis
- large raw logs/payloads are pulled on demand instead of pushed into initial prompts
- sub-agent fan-out stays bounded and worker returns stay compact
- changes are measured with `gh aw audit` (`aic`, input/output/cache token fields) and quality regressions are treated as failures

See also: [token-optimization.md](token-optimization.md), [subagents.md](subagents.md), and [workflow-patterns.md](workflow-patterns.md).

## Security Rules

- never suggest GitHub mutation through raw GitHub tools when a safe output exists
- do not recommend `mode: remote` for GitHub tools unless explicitly required and properly configured
- do not replace `pull_request` with `pull_request_target` unless the user explicitly needs a `pull_request_target` design
- do not use `post-steps:` for agent-driven write behavior that belongs in a safe-output job

## Safer-Alternatives Pattern

Follow the "Safer Alternatives First" pattern in [workflow-constraints.md](workflow-constraints.md) when a requested change raises risk.

## Minimal Examples

### Add a GitHub toolset

```yaml
tools:
  github:
    toolsets: [default]
```

### Add a safe output

```yaml
safe-outputs:
  add-comment:
    max: 1
```

### Add network access

```yaml
network:
  allowed:
    - defaults
    - node
```

## Validation Flow

- always inspect the workflow before editing
- compile after frontmatter changes
- keep the workflow valid at every step
- summarize what changed and whether recompilation was needed

## Final Message Rules

At the end, tell the user:

- what changed
- whether the change touched frontmatter or prompt body
- whether recompilation was required
- any next step they should take

Keep the summary short.


<!-- file: upgrade-agentic-workflows.md -->

---
description: Upgrade agentic workflows to the latest version of gh-aw with automated compilation and error fixing
disable-model-invocation: true
---

You are specialized in **upgrading GitHub Agentic Workflows (gh-aw)** to the latest version.
Your job is to upgrade workflows in a repository to work with the latest gh-aw version, handling breaking changes and compilation errors.

Read the ENTIRE content of this file carefully before proceeding. Follow the instructions precisely.

## Capabilities & Responsibilities

**Prerequisites**

- The `gh aw` CLI may be available in this environment.
- Always consult the **instructions file** for schema and features:
  - Local copy: @.github/aw/github-agentic-workflows.md
  - Canonical upstream: https://raw.githubusercontent.com/github/gh-aw/main/.github/aw/github-agentic-workflows.md
- If the user says “campaign”, “KPI”, “pacing”, “cadence”, or “stop-after”, consult @.github/aw/campaign.md (campaign/KPI workflows are still just agentic workflows; this is a design pattern playbook).
- If the user says "experiment", "A/B test", "variants", "prompt comparison", or "measure the impact", consult @.github/aw/experiments.md (A/B experiments are configured via the `experiments:` frontmatter field).

**Key Commands Available**

- `upgrade` → upgrade repository to latest version (combines all steps below)
- `fix` → apply automatic codemods to fix deprecated fields
- `compile` → compile all workflows
- `compile <workflow-name>` → compile a specific workflow

> [!NOTE]
> **Command Execution**
>
> When running in GitHub Copilot Cloud, you don't have direct access to `gh aw` CLI commands. Instead, use the **agentic-workflows** MCP tool:
> - `upgrade` tool → upgrade repository to latest version (recommended)
> - `fix` tool → apply automatic codemods to fix deprecated fields
> - `compile` tool → compile workflows
>
> When running in other environments with `gh aw` CLI access, prefix commands with `gh aw` (e.g., `gh aw upgrade`, `gh aw compile`).
>
> These tools provide the same functionality through the MCP server without requiring GitHub CLI authentication.

## Instructions

### 1. Fetch Latest gh-aw Changes

Before upgrading, always review what's new:

1. **Fetch Latest Release Information**
   - Use GitHub tools to fetch the CHANGELOG.md from the `github/gh-aw` repository
   - Review and understand:
     - Breaking changes
     - New features
     - Deprecations
     - Migration guides or upgrade instructions
   - Summarize key changes with clear indicators:
     - 🚨 Breaking changes (requires action)
     - ✨ New features (optional enhancements)
     - ⚠️ Deprecations (plan to update)
     - 📖 Migration guides (follow instructions)

### 2. Run the Upgrade Command

**The primary and recommended way to upgrade is to use the `gh aw upgrade` command**, which automates all the upgrade steps in one command:

1. **Run the Upgrade Command**

   ```bash
   gh aw upgrade
   ```

   This single command will automatically:
   - Update all agent and prompt files to the latest templates (like `gh aw init`)
   - Apply automatic codemods to fix deprecated fields in all workflows (like `gh aw fix --write`)
   - Update GitHub Actions versions in `.github/aw/actions-lock.json`
   - Compile all workflows to generate lock files (like `gh aw compile`)

2. **Optional Flags**

   - `gh aw upgrade --create-pull-request` - Open a pull request with the upgrade changes (alias: `--pr`)
   - `gh aw upgrade --no-fix` - Update agent files only (skip codemods, actions, and compilation)
   - `gh aw upgrade --no-actions` - Skip updating GitHub Actions versions
   - `gh aw upgrade --dir custom/workflows` - Upgrade workflows in custom directory

3. **Review the Results**
   - The command will display progress for each step
   - Note any warnings or errors that occur
   - All changes will be applied automatically

> [!TIP]
> **Use `gh aw upgrade` for most upgrade scenarios.** It combines all necessary steps and ensures consistency. Only use the manual steps below if you need fine-grained control or if the upgrade command fails.

### 3. Manual Upgrade Steps (Fallback)

If the `gh aw upgrade` command is not available or you need more control, follow these manual steps:

#### 3.1. Apply Automatic Fixes with Codemods

Before attempting to compile, apply automatic codemods:

1. **Run Automatic Fixes**
   
   Use the `fix` tool with the `--write` flag to apply automatic fixes.
   
   This will automatically update workflow files with changes like:
   - Replacing 'timeout_minutes' with 'timeout-minutes'
   - Replacing `network.firewall: false` with:
     ```yaml
     features:
       dangerously-disable-sandbox-agent: "controlled environment with no internet access"
     sandbox:
       agent: false
     ```
   - Removing deprecated 'mcp-scripts.mode' field

2. **Review the Changes**
   - Note which workflows were updated by the codemods
   - These automatic fixes handle common deprecations

#### 3.2. Attempt Recompilation

Try to compile all workflows:

1. **Run Compilation**
   
   Use the `compile` tool to compile all workflows.

2. **Analyze Results**
   - Note any compilation errors or warnings
   - Group errors by type (schema validation, breaking changes, missing features)
   - Identify patterns in the errors

### 4. Fix Compilation Errors

If compilation fails, work through errors systematically:

1. **Analyze Each Error**
   - Read the error message carefully
   - Reference the changelog for breaking changes
   - Check the gh-aw instructions for correct syntax

2. **Common Error Patterns**
   
   **Schema Changes:**
   - Old field names that have been renamed
   - New required fields
   - Changed field types or formats
   
   **Breaking Changes:**
   - Deprecated features that have been removed
   - Changed default behaviors
   - Updated tool configurations
   
   **Example Fixes:**
   
   ```yaml
   # Old format (deprecated)
   mcp-servers:
     github:
       mode: remote
   
   # New format (do NOT include mode: remote - it does not work with GitHub Actions token)
   tools:
     github:
       toolsets: [default]
   ```

3. **Apply Fixes Incrementally**
   - Fix one workflow or one error type at a time
   - After each fix, use the `compile` tool with `<workflow-name>` to verify
   - Verify the fix works before moving to the next error

4. **Document Changes**
   - Keep track of all changes made
   - Note which breaking changes affected which workflows
   - Document any manual migration steps taken

### 5. Verify All Workflows

After fixing all errors:

1. **Final Compilation Check**
   
   Use the `compile` tool to ensure all workflows compile successfully.

2. **Review Generated Lock Files**
   - Ensure all workflows have corresponding `.lock.yml` files
   - Check that lock files are valid GitHub Actions YAML

> [!NOTE]
> If you used the `gh aw upgrade` command in step 2, agent files and instructions have already been updated. The manual refresh step below is only needed if you followed the manual upgrade process.

## Creating Outputs

After completing the upgrade:

### If All Workflows Compile Successfully

Create a **pull request** with:

**Title:** `Upgrade workflows to latest gh-aw version`

**Description:**
```markdown
## Summary

Upgraded all agentic workflows to gh-aw version [VERSION].

## Changes

### gh-aw Version Update
- Previous version: [OLD_VERSION]
- New version: [NEW_VERSION]

### Key Changes from Changelog
- [List relevant changes from the changelog]
- [Highlight any breaking changes that affected this repository]

### Workflows Updated
- [List all workflow files that were modified]

### Upgrade Method
- Used `gh aw upgrade` command to automatically apply all changes

### Automatic Fixes Applied
- [List changes made by the upgrade command]
- [Reference which deprecated fields were updated by codemods]

### Manual Fixes Applied (if any)
- [Describe any manual changes made to fix compilation errors after upgrade]
- [Reference specific breaking changes that required manual fixes]

### Testing
- ✅ All workflows compile successfully
- ✅ All `.lock.yml` files generated
- ✅ No compilation errors or warnings

### Post-Upgrade Steps
- ✅ Ran `gh aw upgrade` to update all components
- ✅ All agent files and instructions updated automatically

## Files Changed
- Updated `.md` workflow files: [LIST]
- Generated `.lock.yml` files: [LIST]
- Updated agent files: [LIST]
```

### If Compilation Errors Cannot Be Fixed

Create an **issue** with:

**Title:** `Failed to upgrade workflows to latest gh-aw version`

**Description:**
```markdown
## Summary

Attempted to upgrade workflows to gh-aw version [VERSION] but encountered compilation errors that could not be automatically resolved.

## Version Information
- Current gh-aw version: [VERSION]
- Target version: [NEW_VERSION]

## Compilation Errors

### Error 1: [Error Type]
```
[Full error message]
```

**Affected Workflows:**
- [List workflows with this error]

**Attempted Fixes:**
- [Describe what was tried]
- [Explain why it didn't work]

**Relevant Changelog Reference:**
- [Link to changelog section]
- [Excerpt of relevant documentation]

### Error 2: [Error Type]
[Repeat for each distinct error]

## Investigation Steps Taken
1. [Step 1]
2. [Step 2]
3. [Step 3]

## Recommendations
- [Suggest next steps]
- [Identify if this is a bug in gh-aw or requires repository changes]
- [Link to relevant documentation or issues]

## Additional Context
- Changelog review: [Link to CHANGELOG.md]
- Migration guide: [Link if available]
```

## Best Practices

1. **Always Review Changelog First**
   - Understanding breaking changes upfront saves time
   - Look for migration guides or specific upgrade instructions
   - Pay attention to deprecation warnings

2. **Fix Errors Incrementally**
   - Don't try to fix everything at once
   - Validate each fix before moving to the next
   - Group similar errors and fix them together

3. **Test Thoroughly**
   - Compile workflows to verify fixes
   - Check that all lock files are generated
   - Review the generated YAML for correctness

4. **Document Everything**
   - Keep track of all changes made
   - Explain why changes were necessary
   - Reference specific changelog entries

5. **Clear Communication**
   - Use emojis to make output engaging
   - Summarize complex changes clearly
   - Provide actionable next steps

## Important Notes

- When running in GitHub Copilot Cloud, use the **agentic-workflows** MCP tool for all commands
- When running in environments with `gh aw` CLI access, prefix commands with `gh aw` 
- Breaking changes are inevitable - expect to make manual fixes
- If stuck, create an issue with detailed information for the maintainers


<!-- file: visual-regression.md -->

---
name: visual-regression
description: Reference prompt for visual regression testing using playwright + cache-memory for baseline screenshot storage across pull requests
---

# Visual Regression Testing

Use `playwright` for screenshots and `cache-memory` to persist baselines between PR runs.

## Example Workflow

```markdown
---
description: Capture screenshots on every PR and compare against cached baselines to detect visual regressions
on:
  pull_request:
    types: [opened, synchronize, reopened]
permissions:
  contents: read
  pull-requests: read
engine: copilot
tools:
  playwright:
    allowed_domains:
      - localhost
      - 127.0.0.1
  cache-memory:
    key: visual-regression-baselines-${{ github.event.pull_request.base.ref }}
    retention-days: 30
    allowed-extensions: [".png", ".json"]
  bash:
    - "mkdir *"
    - "cp *"
    - "diff *"
    - "date *"
safe-outputs:
  add-comment:
    max: 1
timeout-minutes: 30
---

Build and serve the app locally, then use Playwright to capture full-page screenshots of key pages into `/tmp/visual-regression/current/`.

Use filesystem-safe timestamps (no colons — colons break artifact uploads):
`date -u "+%Y-%m-%d-%H-%M-%S"`

If `/tmp/gh-aw/cache-memory/baselines/manifest.json` does not exist, copy screenshots there as new baselines and post: "Baselines initialized — N pages captured."

Otherwise compare each screenshot to its baseline. Post a comment summarizing: pages unchanged / pages with diffs. If nothing changed, use the `noop` safe-output.
```

## Key Design Decisions

- **`cache-memory` key per base branch** — scopes baselines to `main`, `develop`, etc.
- **Explicit baseline source** — state whether baselines come from `cache-memory`, a generated artifact, or a branch directory; do not leave baseline origin implicit.
- **`allowed_domains: [localhost, 127.0.0.1]`** — prevents SSRF; serve app locally
- **`retention-days: 30`** — beyond the default 7-day cache expiry
- **Filesystem-safe timestamps** — `YYYY-MM-DD-HH-MM-SS`; colons break artifact filenames
- **Minimal permissions** — all PR writes go through `safe-outputs`

## Network-Minimization Reminders

- Prefer local preview (`localhost`/`127.0.0.1`) over external preview environments.
- If external previews are required, allowlist exact domains (no broad wildcards).
- Enable `network.node` only when installing/building Node deps; scope to registries and preview hosts.
- Keep Playwright navigation limited to app-under-test URLs.


<!-- file: workflow-constraints.md -->

---
description: Shared architectural and security constraints for designing or updating agentic workflows.
---

# Agentic Workflow Constraints

## Execution Model

Agentic workflows run as a **single GitHub Actions job** with one agent execution.

## Can Do

- read GitHub data, APIs, web pages, and local repository files
- run tools inside the single job
- use MCP servers and safe outputs
- create GitHub resources through `safe-outputs:`
- persist lightweight state with `cache-memory` or other approved mechanisms

## Cannot Do

- pause and resume for external events
- orchestrate multi-stage pipelines with job dependencies
- pass state between multiple AI jobs in one workflow run
- implement built-in rollback across external systems
- wait for another workflow or deployment to finish inside the same agent run

## Recommend Traditional GitHub Actions When

- multi-stage deployment pipelines
- fan-out/fan-in job orchestration
- long waits for approvals or external systems
- rollback logic across several steps or systems
- cross-job state transfer

Suggested response:

> This requires capabilities the single-job agentic model does not support. Use traditional GitHub Actions for orchestration and agentic workflows for the AI-specific step.

## Security Posture

- Keep the main agent job read-only.
- Do not add GitHub write permissions to the agent job.
- Route GitHub writes through `safe-outputs:`.
- Prefer `tools.github.mode: gh-proxy` with `gh` for GitHub reads.
- Prefer `tools.cli-proxy: true` with mounted `mcp-clis` commands for non-GitHub MCP tools.
- Constrain `network.allowed:` to the minimum required ecosystems or domains.
- Use `${{ steps.sanitized.outputs.text }}` for untrusted user content.

## Safer Alternatives First

When a requested feature increases risk:

1. explain the risk
2. propose the safer pattern first
3. require explicit confirmation before relaxing safeguards

## Common Risk Areas

- direct write permissions instead of safe outputs
- auto-merge or bypassing review
- overly broad network access
- unbounded bash allowlists for untrusted input
- shell injection: interpolating `${{ github.event.* }}` or other untrusted expressions directly into `run:` scripts; pass untrusted values through environment variables instead
- placing OIDC/secret bootstrap in `pre-steps` instead of earlier `setup-steps`
- using `post-steps:` for agent-driven write actions

## Self-Hosted Runner Compatibility

When `runs-on` is any value other than GitHub-hosted labels (`ubuntu-latest`, `ubuntu-slim`, `windows-latest`, `macos-latest`):

- Set `runs-on` explicitly (not inherited from imports); accepts string, array, or runner-group object. Framework jobs (activation, safe-outputs, unlock, etc.) default to hosted `ubuntu-slim`, so also set `runs-on-slim` (same forms) to route them to the self-hosted runner.
- Write transient state, tool downloads, and outputs under `$RUNNER_TEMP`, not `/tmp` (which can persist across jobs on shared runners).
- Agent steps run as the runner user, not root — don't install to system-wide paths. The egress firewall needs sudo; if unavailable, it can be disabled (removing egress filtering) — surface the trade-off to the user rather than encoding it.
- Declare every outbound domain in `network.allowed` (keep `defaults` for core GitHub/Copilot/registry endpoints). Non-allow-listed domains are blocked when the firewall is enabled.
- Do not install to `/usr/local` or the toolcache (may be read-only/shared); use job-scoped writable paths.
- Do not hardcode `/home/runner` or any literal home path — read `$HOME`; use `$RUNNER_TEMP` for transient state.
- For GitHub Enterprise Server, enable GHES compatibility (GHES-compatible artifact action versions, enterprise API endpoint).

For the full set of requirements (Docker socket, ARC / Docker-in-Docker, network egress, GHES specifics), follow the [Self-Hosted Runners](/gh-aw/reference/self-hosted-runners/) reference page.

## Shared Reminder

Reference this file from creator, updater, and debugger prompts instead of repeating the architectural explanation.


<!-- file: workflow-editing.md -->

---
description: Shared guidance for editing, recompiling, and validating GitHub Agentic Workflow files.
---

# Workflow Editing Basics

Agentic workflows are single markdown files at `.github/workflows/<workflow-id>.md`.

## File Structure

1. **YAML frontmatter** between `---` markers: triggers, permissions, tools, network, imports, safe outputs.
2. **Markdown body**: the agent prompt.

## Recompile When Changing Frontmatter Fields

Run `gh aw compile <workflow-id>` after changing:

- `on:`
- `permissions:`
- `tools:`
- `network:`
- `imports:`
- `safe-outputs:`
- `mcp-servers:`
- engine, timeout, concurrency, or other YAML configuration

## No Recompile Needed

Edit the markdown body directly for:

- agent instructions
- task descriptions
- examples
- formatting guidance
- clarifications and guardrails

Body changes take effect on the next run.

## Validation Commands

```bash
gh aw compile <workflow-id>
gh aw compile <workflow-id> --strict
gh aw compile --purge
```

Use `--strict` for production-quality validation.

## Editing Rules

- Smallest change that satisfies the request.
- Preserve structure unless reorganization is the task.
- Never leave a workflow broken.
- If compile fails, fix all errors before stopping.
- After frontmatter changes, review the generated `.lock.yml`.

## Prompt-Authoring Rules

- Specific and imperative.
- Short examples over long tutorials.
- Reference dedicated instruction files instead of duplicating.
- Tell agents to use `noop` when no visible action is needed.


<!-- file: workflow-patterns.md -->

---
description: Shared design patterns for command workflows, monitoring workflows, large-repository workflows, database migration reviews, and cross-repository operations.
---

# Workflow Patterns

## Command Workflows

### Prefer `slash_command` when

- the action is conversational
- the user may pass arguments in the comment body
- the workflow should work across issues, pull requests, and discussions

### Prefer `label_command` when

- the action is one-shot and argument-free
- discoverability in the GitHub UI matters
- the workflow fits a label-driven process

### Combine both when

- the action is common enough to justify both invocation styles
- you want UI discoverability plus comment-based flexibility

See also: [triggers.md](triggers.md)

## Monitoring Workflows

### Use `workflow_run` when

- monitoring another GitHub Actions workflow in the **same repository**
- reacting to workflow completion/conclusion

Incident-triage pattern:

- trigger: `on.workflow_run` for the named deployment/CI workflow
- permissions: include `actions: read`; main job read-only
- reads: failed job logs/artifacts via GitHub tools
- output: `create-issue` with impact/root cause; `noop` when no action needed

Compact `workflow_run` examples:

- **Deploy workflow failure triage**: trigger on `workflow_run` for `Deploy`, read failed jobs/logs/artifacts, create one incident issue, `noop` when rerun succeeds.
- **CI regression watcher**: trigger on `workflow_run` for `CI`, compare current failure against recent runs, create issue only for new regressions, `noop` for known flakes.

Incident duplicate-suppression pattern:

- derive a stable incident key from the monitored workflow and failure signal (for example `<workflow-name>#<head-sha>#<failed-job-name>`)
- search open issues by title-prefix/label/key before creating a new issue
- create via `safe-outputs.create-issue` only when no matching open incident exists
- use `noop` for duplicates and include the matching issue number in the explanation

### Use `deployment_status` when

- monitoring an external deployment service reporting back to GitHub

Rule of thumb:

- `workflow_run` → GitHub Actions outcomes in this repo
- `deployment_status` → external platform outcomes via Deployments API

Single-job limits apply:

- triage, evidence collection, and summary in one agent job
- no multi-job fan-out/fan-in
- no cross-workflow waits or chaining

See also: [deployment-status.md](deployment-status.md)

## High-Volume Triage and Escalation Pattern

For workflows receiving many similar events (issues, PR comments, CI failures, security alerts, dependency events):

- start with a cheap triage/classification pass
- detect known/duplicate/stale/low-value cases first
- emit `noop` or a safe output when triage is confident
- escalate to the main agent only when uncertain or genuinely new/high-value

Decision flow:

```text
IF cheap triage is confident (known/duplicate/stale/low-value) THEN
  emit safe output or noop
ELSE
  escalate to the main agent
END IF
```

Use with pull-context workflows: fetch targeted evidence on demand instead of pushing raw logs into the initial prompt.

## Large-Repository Improvement Pattern

For recurring maintenance in large repos:

- use `cache-memory`
- process one package/module/directory per run
- store last-processed item; round-robin
- prefer small focused PRs over wide sweeps

See also: [memory.md](memory.md)

## Step Authoring Guidance

When writing `steps:`, `pre-steps:`, and `post-steps:`, choose the implementation type in this order of preference:

### 1. Preferred: `actions/github-script`

Use `actions/github-script` for GitHub API interactions and general scripting. The workflow compiler handles action pinning automatically; specify a recent major version tag (`@v7`) without a SHA.

- Provides typed access to the GitHub REST API via `github.rest.*`
- Exposes `context`, `core`, `github`, `io`, and `exec` helpers
- Eliminates shell injection risks for untrusted input
- Example:

  ```yaml
  steps:
    - name: Fetch issue data
      uses: actions/github-script@v7
      with:
        script: |
          const issue = await github.rest.issues.get({
            owner: context.repo.owner,
            repo: context.repo.repo,
            issue_number: context.issue.number,
          });
          core.setOutput('title', issue.data.title);
  ```

### 2. Shell scripts

Use `run:` steps when `actions/github-script` is not suitable. To prevent shell injection, never interpolate untrusted values directly into the script body. Any value that originates from user input — including `github.event.issue.title`, `github.event.issue.body`, `github.event.comment.body`, `github.event.pull_request.title`, `github.event.pull_request.body`, and `github.head_ref` — must be passed through environment variables:

```yaml
# ❌ Unsafe: direct expression interpolation into the shell script
- name: Unsafe comment
  run: gh issue comment ${{ github.event.issue.number }} --body "${{ github.event.issue.title }}"

# ✅ Safe: pass untrusted values through env vars and reference them as $VAR_NAME
- name: Safe comment
  env:
    ISSUE_NUMBER: ${{ github.event.issue.number }}
    TITLE: ${{ github.event.issue.title }}
  run: gh issue comment "$ISSUE_NUMBER" --body "$TITLE"
```

### 3. Python (last resort)

Use Python only when the task genuinely requires data science or numeric libraries (for example `pandas`, `numpy`, `matplotlib`). Prefer `actions/github-script` or a shell step for everything else.

## Pre-Step Data Fetching Pattern

Use deterministic `steps:` when the workflow needs large external data before the agent runs.

Rules:

- write prepared files to `/tmp/gh-aw/agent/`
- trim large outputs before handing to the agent
- set `GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}` on every `gh` step
- add `permissions: actions: read` for downloading workflow logs/artifacts
- use `jq` to reduce JSON payload size

Compact reporting/incident prefetch example:

```yaml
steps:
  - name: Prefetch compact failure context
    env:
      GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
      REPO: ${{ github.repository }}
      RUN_ID: ${{ github.event.workflow_run.id }}
    run: |
      gh api "repos/$REPO/actions/runs/$RUN_ID/jobs" \
        --jq '[.jobs[] | select(.conclusion != "success") | {name, conclusion, started_at, completed_at}]' \
        > /tmp/gh-aw/agent/failed_jobs.json
```

## PR Visual Regression Pattern

For PR UI validation and screenshot diffs:

- trigger: `pull_request`
- tools: `playwright` plus `cache-memory` for baseline metadata
- permissions: read-only repo/PR access
- output: `add-comment` with pass/fail summary and artifact links
- fallback: `noop` when no UI changes detected

## Design Token / CSS Governance Pattern

For PRs touching design tokens or CSS files that require a linked design reference (Figma link, design doc URL, or ADR token):

- trigger: `pull_request` with `paths:` scoped to token/style files (for example `tokens/**`, `**/*.tokens.json`, `**/*.css`, `src/styles/**`, `design-system/**`)
- permissions: `pull-requests: read`, `contents: read`; agent job read-only
- reads: PR body and comments via `gh pr view` to locate the linked design reference; then validate that the link target is reachable and matches the changed components
- output:
  - Link present and valid → `add-comment` with ✅ summary
  - Link present but incomplete (for example wrong component, outdated version) → `add-comment` describing the specific gap
  - Required link missing → `add-comment` requesting it; escalate to `create-issue` only when the workflow prompt explicitly requires a blocking review gate (for example a CODEOWNERS or policy rule) and no open issue already covers the same scope
- fallback: `noop` when `paths:` guard excludes all changed files

See also: the PR Checks with Linked References pattern in [github-agentic-workflows.md](github-agentic-workflows.md).

## QA Coverage Report Pattern

For PR QA coverage summaries (gaps, risks, suggested test focus):

- trigger: `pull_request` (optionally scoped with `paths:`)
- tools: `github` (`gh-proxy`) for changed files, PR metadata, labels, checks
- permissions: `contents: read`, `pull-requests: read`; agent job read-only
- output: `add-comment` with coverage matrix and untested/high-risk areas
- fallback: `noop` for non-testable changes (e.g. docs-only)

## PM Stakeholder Digest Pattern

For recurring product/stakeholder digests:

- trigger: fuzzy `schedule` (e.g. `weekly on mondays`)
- tools: `github` (`gh-proxy`), optional `cache-memory` for period-over-period continuity
- permissions: read-only
- output: `create-issue` by default; `create-discussion` only when requested
- prompt: audience-aware language (summary first, details second)

## Database Migration Safety Pattern

For PRs adding/modifying migration files:

- trigger: `pull_request` with `paths:` scoped to migration dirs (e.g. `db/migrate/**`, `migrations/**`, `*.sql`)
- permissions: `contents: read`, `pull-requests: read`; agent job read-only
- reads: changed migration content via GitHub tools
- output: `add-comment` flagging risky operations; `noop` when clean
- prompt: include migration best practices

## Cross-Repository Pattern

For cross-repo reads and writes:

- enable GitHub toolsets needed for external repos
- configure PAT or GitHub App auth in `safe-outputs:` for cross-repo writes
- tell the agent to set `target-repo` explicitly
- document required token scopes in the prompt or instructions

Cross-repo workflows inherit single-job constraints from [workflow-constraints.md](workflow-constraints.md).