[copilot-cli-research] Copilot CLI Deep Research - 2026-04-29 #29194
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it expired on 2026-04-30T21:29:56.522Z.
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Analysis Date: 2026-04-29
Repository: github/gh-aw
Scope: 205 total workflows, 110 using Copilot engine (54%), 47 Claude, 10 Codex
📊 Executive Summary
Research Topic: Copilot CLI Feature Usage & Optimization Opportunities
Key Findings:
startup-timeoutis NEVER used (0%, 11th consecutive run),max-continuationsremains near-zero (2 workflows), web-search underutilized (2 workflows), sandbox AWF coverage is low (15%), and toolsets scoping is inconsistent.Primary Recommendation: Adopt
startup-timeoutin long-running and read-heavy workflows — it costs nothing and protects against hung sessions.This is the 6th consecutive deep-research run on this repository. The workflow count continues to grow (203 → 205), with Copilot engine adoption stable at ~54%. Most persistent gaps from prior runs remain unchanged. On the positive side, cache-memory adoption is healthy (79 workflows, 72%), and repo-memory usage has grown to 23 workflows (21%).
The most actionable short-term win remains
startup-timeout— it has been at 0% for 11 consecutive analysis runs, yet it protects every single workflow from silently hanging during Copilot CLI initialization.Critical Findings
🔴 High Priority Issues
startup-timeoutunused across ALL 110 Copilot workflows (0%, 11th consecutive run)Copilot CLI can silently hang during startup. Without a startup timeout, workflows consume runner minutes until the step-level
timeout-minutesfires — potentially 60+ minutes later. Addingstartup-timeout: 120(2 min) is a free safety net.Sandbox (AWF) enabled in only 17/110 Copilot workflows (15%)
The AWF network firewall prevents data exfiltration and prompt-injection network calls. Workflows with
edit:or broadbash:access that lacksandbox: agent: awfare running without this protection.max-continuationsused in only 2/110 workflows despite autopilot being supportedMost multi-step workflows (daily reports, code fixers, refactoring agents) would benefit from
max-continuations: 3-10to allow the agent to self-correct and continue complex tasks.🟡 Medium Priority Opportunities
web-searchused in only 2 workflows — available as a builtin tool but almost unused. Useful for research-heavy workflows.mcp-scriptsused in only 6 workflows — powerful for custom tooling but low adoption.toolsets: [default](over-permissioned), others use no scoping.View Full Analysis
1️⃣ Copilot CLI Capabilities Inventory
View Available Features
Core Flags (always applied by compiler)
--add-dir /tmp/gh-aw/and--add-dir "${GITHUB_WORKSPACE}"(workspace access)--log-level all,--log-dir <path>(full logging)--disable-builtin-mcps(always applied; MCP servers go through gateway)--no-ask-user(non-interactive mode, v1.0.19+)Configurable via Frontmatter
engine.agent.github/copilot-agents/*.yamlagentengine.modelgpt-5-mini,claude-haiku-4.5engine.versionengine.bare/bare: true.github/copilot-instructions.mdmax-continuations--autopilot --max-autopilot-continues Nstartup-timeouttool-timeoutnetwork.allowedsandbox.agent: awfsandbox.agent: srttools.cache-memorytools.repo-memorymcp-scriptstools.web-fetchtools.web-searchtools.githubView Usage Statistics
Feature Adoption Across 110 Copilot Workflows
2️⃣ Missed Opportunities
View High Priority Opportunities
🔴 Opportunity 1: Add
startup-timeoutto all long-running workflowsWhat: The
startup-timeoutconfig controls how long gh-aw waits for Copilot CLI to start accepting prompts. Currently 0/110 workflows set this.Why It Matters: If Copilot CLI hangs during initialization (network timeout, token issue, rate limit), the workflow burns runner minutes until
timeout-minutesfires — potentially 60+ minutes later.How to Implement:
Affected workflows: All 110 Copilot workflows — but especially daily/weekly reports, code fixers, and analysis workflows.
🔴 Opportunity 2: Enable AWF sandbox on write-capable workflows
What: 17/110 Copilot workflows use
sandbox: agent: awf. The remaining 93 run without a network firewall.Why It Matters: Workflows with
edit:or broadbash:access can be manipulated via prompt injection to exfiltrate data. AWF blocks outbound network to non-allowlisted domains.How to Implement:
View Medium Priority Opportunities
🟡 Opportunity 3: Use
max-continuationsfor complex iterative agentsWhat: Only 2 workflows use
max-continuations(values: 2 and 40). This enables--autopilot --max-autopilot-continues N, allowing the agent to continue automatically.Why It Matters: Workflows like
dead-code-remover.md,refactoring-cadence.md,daily-workflow-updater.mdperform iterative multi-step tasks but run as single-continuation sessions.🟡 Opportunity 4: Adopt
web-searchfor research-oriented workflowsWhat: Only 2 workflows use the
web-searchbuiltin. Workflows likeresearch.md,weekly-blog-post-writer.md,daily-news.mdare research-heavy but don't use web search.🟡 Opportunity 5: Scope GitHub toolsets more precisely
Some workflows use
toolsets: [default](broad access). Recommendations:toolsets: [issues]toolsets: [pull_requests]toolsets: [code_security]toolsets: [default]is appropriate🟡 Opportunity 6: Activate unused custom agent files
These
.github/copilot-agents/files are defined but no workflow uses them:grumpy-reviewer— Critical PR review personaw3c-specification-writer— Spec writing personacreate-safe-output-type— Safe output toolingcustom-engine-implementation— Engine developmentinteractive-agent-designer— Agent design toolEach can be activated with:
View Low Priority Opportunities
🟢 Opportunity 7: Version pinning for critical workflows
0 workflows pin
engine.version. For critical automated workflows likerelease.mdordaily-secrets-analysis.md, pinning can prevent breakage from CLI updates.🟢 Opportunity 8: Use
tool-timeoutfor bash-heavy workflows0 workflows use
tool-timeout. Bash tools insuper-linter.mdordead-code-remover.mdcan run arbitrarily long. A 300s tool timeout prevents one slow tool call from blocking the entire session.3️⃣ Trends vs Previous Analysis (2026-04-28)
4️⃣ Best Practice Guidelines
startup-timeout: 120— Free protection against hung sessions.edit:orbash: ["*"]should havesandbox: agent: awf.[issues],[pull_requests],[repos]over always[default].max-continuations: 3-10for iterative tasks — Code fixers, refactoring agents, and report generators all benefit.repo-memoryfor trend analysis — 23 workflows already use it; great for recurring reports.web-searchfor research workflows — Low friction, high value for news/research/competitive analysis.7️⃣ Action Items
Immediate Actions (this week):
startup-timeout: 120to the top 10 highest-impact Copilot workflowssandbox: agent: awfon write-capable workflows missing itShort-term (this month):
grumpy-reviewer,w3c-specification-writer)max-continuations: 5to iterative code-improvement workflowstoolsets: [default]workflows and scope down where possibleLong-term (this quarter):
tool-timeout: 300to bash-heavy workflowsView Research Methodology
Analysis conducted by:
pkg/workflow/copilot_engine*.go,copilot_mcp.go,copilot_engine_tools.go.github/workflows/grep -rlacross the workflow directoryData Sources:
.github/workflows/*.md(205 files), Copilot engine Go source, repo-memory/tmp/gh-aw/repo-memory/default/References:
Generated by Copilot CLI Deep Research (Run: 25134300030)
Beta Was this translation helpful? Give feedback.
All reactions