🤖 Copilot PR Prompt Pattern Analysis - 2026-02-03
Summary
Analysis Period: Last 30 days (Jan 4 - Feb 3, 2026)
Total PRs: 1,000 | Merged: 684 (68.4%) | Closed: 315 (31.5%) | Open: 1 (0.1%)
Key Finding: Merged prompts are 32% shorter
Merged PRs average 114 words while closed PRs average 167 words, which suggests that brevity and focus improve the odds of a merge.
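The 32% figure can be reproduced as the relative gap between the two average prompt lengths. A minimal check of that arithmetic, using only the two averages reported here (the function name is illustrative and not taken from the analysis script):

```javascript
// Average prompt lengths as reported in this analysis.
const mergedAvgWords = 114;
const closedAvgWords = 167;

// How much shorter `shorter` is than `longer`, as a percentage of `longer`.
function percentShorter(shorter, longer) {
  return ((longer - shorter) / longer) * 100;
}

const pct = percentShorter(mergedAvgWords, closedAvgWords);
console.log(`${pct.toFixed(1)}% shorter`); // 31.7% shorter, reported as 32%
```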
Prompt Categories and Success Rates
Prompt Analysis
✅ Successful Prompt Patterns
Common characteristics in merged PRs:
workflow, github, update, agentic, safe
fix, update, add, resolve, create
Keywords unique to merged PRs:
agentic (118×): suggests familiarity with gh-aw workflows
safe (117×): mentions safe outputs/inputs correctly
create (99×): constructive action orientation
test (94×): testing mentioned proactively
Example successful prompts:
PR #13457 (bug_fix) → Merged
Why it worked: Direct, actionable, links to specific failure
PR #13456 (bug_fix) → Merged
Why it worked: Includes verification context, specific file mentioned
❌ Unsuccessful Prompt Patterns
Common characteristics in closed PRs:
issue_title, issue, resolve, campaign, code
Keywords unique to closed PRs:
campaign (66×): bulk/automated PR attempts often fail
plan (47×): planning without execution
security (45×): complex security changes harder to merge
description (44×): meta-information rather than action
Example unsuccessful prompts:
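PR #13416 (bug_fix) → Closed
Review the mcp-server command and the logic to resolve the location of the gh-aw binary. The project location changed from githubnext/gh-aw to github/aw
Why it failed: Vague "review" directive without clear fix
PR #13380 (bug_fix) → Closed
Use AWF --enable-chroot mode and remove unnecessary --mount and --env flags
Why it failed: Complex security/sandbox changes require more context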
Key Insights
🎯 Pattern 1: Conciseness wins — Merged PRs are 32% shorter on average (114 vs 167 words). Copilot performs better with focused, actionable prompts rather than lengthy explanations.
📉 Pattern 2: Bug fix paradox — Bug fixes are the MOST common prompt type (53% of all PRs) but have the LOWEST success rate (62%). Documentation changes have 95% success. This suggests bug fixes require more context or are inherently harder to automate.
🚫 Pattern 3: Campaign PRs struggle — The keyword "campaign" appears 66× in closed PRs but rarely in merged ones. Bulk/automated PR attempts have lower quality than targeted individual fixes.
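Keyword counts like the ones behind Pattern 3 could be produced by a tally along these lines. This is only a sketch, not the actual analysis script: the record shape (`prompt`, `state`), the function name, and the sample data are all hypothetical, and it assumes every occurrence is counted (the report does not say whether 66× counts occurrences or PRs).

```javascript
// Count whole-word, case-insensitive occurrences of a keyword,
// split by PR outcome. Assumes `keyword` is a plain word with no
// regex metacharacters.
function countKeyword(prs, keyword) {
  const counts = { merged: 0, closed: 0 };
  const pattern = new RegExp(`\\b${keyword}\\b`, "gi");
  for (const pr of prs) {
    // Ignore any state other than merged/closed (for example, open PRs).
    if (pr.state !== "merged" && pr.state !== "closed") continue;
    const matches = pr.prompt.match(pattern);
    if (matches) counts[pr.state] += matches.length;
  }
  return counts;
}

// Made-up sample records, for illustration only.
const samplePrs = [
  { prompt: "Fix failing test in the agentic workflow", state: "merged" },
  { prompt: "Campaign: update all workflows. Campaign plan attached.", state: "closed" },
  { prompt: "Add safe output validation", state: "merged" },
  { prompt: "Campaign cleanup", state: "open" },
];

console.log(countKeyword(samplePrs, "campaign")); // { merged: 0, closed: 2 }
```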
Recommendations
Based on today's analysis:
✅ DO:
Be concise and specific: aim for 100-150 words max. Include reference URLs to failures/issues.
✅ GOOD: "Fix failing test in dispatch_workflow.test.cjs (ref: run #12345)"
❌ BAD: "The test suite has been failing intermittently and we need to investigate why..."
Start with documentation/test tasks: these have 95%/78% success rates vs 62% for bug fixes. Build confidence with easier wins.
Use workflow-specific keywords: prompts mentioning agentic, safe, workflow have higher merge rates. This signals domain knowledge.
⚠️ AVOID:
Campaign PRs: creating multiple PRs at once reduces quality. Focus on one targeted fix at a time.
Vague action verbs: avoid "review", "investigate", "plan". Use "fix", "add", "update", "refactor".
Over-explaining context: Copilot has repository access. Link to issues/runs instead of copying full descriptions.
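Some of these recommendations could be turned into a rough pre-flight check on a draft prompt. The sketch below is an assumption-laden illustration, not part of the analysis: the function name, the 150-word limit, the "leading verb" heuristic, and the reference pattern (a URL or a `#123`-style number) are all choices made for the example.

```javascript
// Verbs the recommendations flag as vague.
const VAGUE_VERBS = ["review", "investigate", "plan"];
// Upper end of the suggested 100-150 word range.
const MAX_WORDS = 150;

// Returns a list of warnings; an empty list means no check fired.
function checkPrompt(prompt) {
  const warnings = [];
  const words = prompt.trim().split(/\s+/).filter(Boolean);

  if (words.length > MAX_WORDS) {
    warnings.push(`too long: ${words.length} words (max ${MAX_WORDS})`);
  }

  // Heuristic: only the first word is treated as the action verb.
  const firstWord = (words[0] || "").toLowerCase().replace(/[^a-z]/g, "");
  if (VAGUE_VERBS.includes(firstWord)) {
    warnings.push(`vague leading verb: "${firstWord}"`);
  }

  // Heuristic: a URL or a "#123"-style number counts as a reference.
  if (!/https?:\/\/\S+|#\d+/.test(prompt)) {
    warnings.push("no reference to an issue, run, or URL");
  }
  return warnings;
}

const good = checkPrompt(
  "Fix failing test in dispatch_workflow.test.cjs (ref: run #12345)"
);
const vague = checkPrompt(
  "Review the mcp-server command and the logic to resolve the location of the gh-aw binary."
);
console.log(good);  // []
console.log(vague); // two warnings: vague leading verb, no reference
```

A check like this only catches surface features; it cannot tell whether a prompt gives enough context for a complex change.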
No Historical Trends Available
This is the first day of data collection. Historical trend comparison will be available after 7 days of daily analysis.
Workflow Run: §21629671720
Analysis Script: /tmp/gh-aw/agent/analyze-prompts.js
Data: 1,000 Copilot PRs from last 30 days