fix: imported safe-output fragments no longer override explicit `threat-detection: false` by Copilot · Pull Request #18231 · github/gh-aw

Copilot · 2026-02-24T23:11:30Z

When a workflow sets threat-detection: false (or enabled: false), imported fragments that lack a threat-detection key would silently re-enable it via auto-defaulting in parseThreatDetectionConfig, triggering a spurious validation error when sandbox.agent: false was also set.

Root cause: result.ThreatDetection == nil was ambiguous — it could mean either "not configured" (no safe-outputs in main) or "explicitly disabled", so mergeSafeOutputConfig couldn't distinguish them.

Changes

imports.go — ThreatDetection is only merged from an imported fragment if the fragment's raw config map explicitly contains a threat-detection key. Auto-enabled defaults (fragments that have safe-outputs but no explicit threat-detection key) are ignored during merge. This makes threat-detection a true workflow-level concern.

Original prompt

This section details on the original issue you should resolve

<issue_title>fix: imported safe-output fragments override explicit threat-detection: false</issue_title>
<issue_description>## Bug

When a workflow explicitly disables threat detection (threat-detection: false or threat-detection: { enabled: false }), imported safe-output fragments re-enable it during merge, causing a compilation error when sandbox.agent: false is also set.

Reproduction

Workflow frontmatter:
sandbox:
  agent: false
safe-outputs:
  activation-comments: false
  threat-detection: false
With any imported fragment that has its own safe-outputs: (e.g., safe-output-add-comment.md with safe-outputs: { add-comment: { max: 1 } }).

Error:
error: threat detection requires sandbox.agent to be enabled. Threat detection runs inside the agent sandbox (AWF) with fully blocked network. Either enable sandbox.agent or remove the threat-detection configuration from safe-outputs.
Root Cause

mergeSafeOutputConfig() in imports.go:512-634 calls extractSafeOutputsConfig() on each imported fragment. The auto-enablement logic at safe_outputs_config.go:479-491 fires because the fragment's outputMap has no threat-detection key, so it defaults to &ThreatDetectionConfig{}.

Then at imports.go:633-635:
if result.ThreatDetection == nil && importedConfig.ThreatDetection != nil {
    result.ThreatDetection = importedConfig.ThreatDetection
}
The main workflow's explicit disable (nil) gets overwritten by the fragment's auto-enabled default.

Suggested Fix

Skip merging ThreatDetection from imported fragments — it's a workflow-level concern, not a per-fragment setting. Alternatively, suppress auto-enablement of threat detection inside extractSafeOutputsConfig when called from the import merge path.</issue_description>

Comments on the Issue (you are @copilot in this section)

Fixes fix: imported safe-output fragments override explicit threat-detection: false #18226

🔒 GitHub Advanced Security automatically protects Copilot coding agent pull requests. You can protect all pull requests by enabling Advanced Security for your repositories. Learn more about Advanced Security.

✨ PR Review Safe Output Test - Run 22377081592

💥 [THE END] — Illustrated by Smoke Claude

…t-detection: false When a workflow sets threat-detection: false (or enabled: false), add a ThreatDetectionExplicitlyDisabled flag to SafeOutputsConfig. The flag is set in extractSafeOutputsConfig when the threat-detection key is present but resolves to nil. mergeSafeOutputConfig now guards the ThreatDetection merge with !result.ThreatDetectionExplicitlyDisabled so imported fragments cannot re-enable threat detection that was explicitly disabled. Adds three tests: - TestMergeSafeOutputsThreatDetectionExplicitDisableNotOverridden (unit) - TestMergeSafeOutputsThreatDetectionImportedWhenMainHasNone (regression) - TestSafeOutputsImportDoesNotReenableThreatDetection (integration) Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan · 2026-02-24T23:58:18Z

@copilot

pelikhan · 2026-02-24T23:58:36Z

@copilot Review the code and make it as clean and concise as possible. Simplify the implementation to improve clarity and maintainability.

…instead of sentinel Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2026-02-25T00:15:45Z

@copilot Review the code and make it as clean and concise as possible. Simplify the implementation to improve clarity and maintainability.

Simplified in eb45e44. Replaced the ThreatDetectionExplicitlyDisabled boolean sentinel (touching 3 files) with a direct raw-map check in imports.go:

if result.ThreatDetection == nil {
    if _, hasTD := config["threat-detection"]; hasTD && importedConfig.ThreatDetection != nil {
        result.ThreatDetection = importedConfig.ThreatDetection
    }
}

This means ThreatDetection is only propagated from an import that explicitly carries a threat-detection key — auto-enabled defaults in fragments are ignored. The ThreatDetectionExplicitlyDisabled struct field and the corresponding else if in safe_outputs_config.go are both gone.

github-actions · 2026-02-25T01:05:49Z

Commit pushed: 7c9871c

💥 [THE END] — Illustrated by Smoke Claude

Copilot

Pull request overview

Fixes a merge-time regression where imported safe-outputs fragments could unintentionally re-enable threat detection (via defaulting) even when the main workflow explicitly disabled it.

Changes:

Gate ThreatDetection merge from imports on the imported fragment explicitly containing a threat-detection key.
Add unit/integration coverage to ensure explicit threat-detection: false in the main workflow is preserved across imports.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

File	Description
pkg/workflow/imports.go	Prevents imported fragments without an explicit `threat-detection` key from setting `ThreatDetection` during merge.
pkg/workflow/safe_outputs_import_test.go	Adds tests covering explicit disable preservation and explicit import enable behavior.
tmp-smoke-test-22377081592.txt	Adds a temporary smoke-test artifact file (likely unintended).

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-25T01:53:29Z

pkg/workflow/safe_outputs_import_test.go

+	// Simulate main workflow that explicitly disabled threat-detection:
+	// threat-detection: false → parseThreatDetectionConfig returns nil.
+	topConfig := &SafeOutputsConfig{
+		ThreatDetection: nil,
+		AddComments:     &AddCommentsConfig{},
+	}
+
+	// Import fragment with safe-outputs but no threat-detection key.
+	importedJSON := []string{
+		`{"add-comment":{"max":1}}`,
+	}


This test doesn’t exercise the regression path because topConfig pre-defines AddComments, so MergeSafeOutputs will delete the imported add-comment entry as an override conflict before mergeSafeOutputConfig runs. That means the test would pass even without the threat-detection merge guard. Consider using a topConfig that represents a workflow with safe-outputs present but without add-comment (e.g., only meta/messages fields) and assert that AddComments is merged while ThreatDetection stays nil.

Copilot · 2026-02-25T01:53:30Z

tmp-smoke-test-22377081592.txt

@@ -0,0 +1 @@
+Test file for PR push - smoke test run 22377081592


This looks like a temporary CI/smoke-test artifact that shouldn’t be committed to the repository. Please remove this file from the PR (and add it to .gitignore if it’s generated by local/CI workflows).

Suggested change

Test file for PR push - smoke test run 22377081592

…cedence - Add `validate` command section to setup/cli.md (Building section, after compile). The command was added in #18191 but was missing from the main CLI reference page; only the compilation-process.md reference table had been updated. - Add NOTE callout to reference/threat-detection.md clarifying that an explicit `threat-detection: false` in a workflow takes precedence over imported fragments, documenting the behaviour fixed in #18231. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Initial plan

25a1727

Copilot AI assigned Copilot and pelikhan Feb 24, 2026

Copilot started work on behalf of pelikhan February 24, 2026 23:11 View session

Copilot AI changed the title ~~[WIP] Fix imported safe-output fragments overriding threat detection settings~~ fix: imported safe-output fragments no longer override explicit threat-detection: false Feb 24, 2026

Copilot finished work on behalf of pelikhan February 24, 2026 23:35

Copilot started work on behalf of pelikhan February 24, 2026 23:58 View session

Copilot finished work on behalf of pelikhan February 25, 2026 00:03

Copilot started work on behalf of pelikhan February 25, 2026 00:03 View session

refactor: simplify threat-detection merge by checking raw config key …

eb45e44

…instead of sentinel Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot finished work on behalf of pelikhan February 25, 2026 00:16

This comment has been minimized.

Sign in to view

test: Add smoke test file for run 22377081592

7c9871c

github-actions bot mentioned this pull request Feb 25, 2026

Smoke Test: Claude - 22377081592 #18243

Closed

github-actions bot added smoke-claude spam labels Feb 25, 2026

pelikhan marked this pull request as ready for review February 25, 2026 01:49

Copilot AI review requested due to automatic review settings February 25, 2026 01:49

pelikhan merged commit da64eba into main Feb 25, 2026

pelikhan deleted the copilot/fix-imported-safe-outputs branch February 25, 2026 01:50

Copilot started reviewing on behalf of pelikhan February 25, 2026 01:50 View session

Copilot AI reviewed Feb 25, 2026

View reviewed changes

github-actions bot mentioned this pull request Feb 25, 2026

[docs] Update documentation for features from 2026-02-25 #18254

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: imported safe-output fragments no longer override explicit `threat-detection: false`#18231

fix: imported safe-output fragments no longer override explicit `threat-detection: false`#18231
pelikhan merged 4 commits intomainfrom
copilot/fix-imported-safe-outputs

Copilot AI commented Feb 24, 2026 •

edited by github-actions bot

Loading

Uh oh!

pelikhan commented Feb 24, 2026

Uh oh!

pelikhan commented Feb 24, 2026

Uh oh!

Copilot AI commented Feb 25, 2026

Uh oh!

This comment has been minimized.

github-actions bot commented Feb 25, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 25, 2026

Uh oh!

Copilot AI Feb 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		@@ -0,0 +1 @@
		Test file for PR push - smoke test run 22377081592

Conversation

Copilot AI commented Feb 24, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Reproduction

Root Cause

Suggested Fix

Comments on the Issue (you are @copilot in this section)

Uh oh!

pelikhan commented Feb 24, 2026

Uh oh!

pelikhan commented Feb 24, 2026

Uh oh!

Copilot AI commented Feb 25, 2026

Uh oh!

This comment has been minimized.

github-actions bot commented Feb 25, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Copilot AI commented Feb 24, 2026 •

edited by github-actions bot

Loading