[Contrib] Agent-OS Governance: Kernel-Level Policy Enforcement for Crews by imran-siddique · Pull Request #4384 · crewAIInc/crewAI

imran-siddique · 2026-02-05T22:19:15Z

Summary

Adds kernel-level governance for CrewAI workflows using Agent-OS.

Why This Matters

CrewAI enables powerful multi-agent crews, but lacks built-in policy enforcement. This module provides:

- Content Filtering: Block dangerous patterns (SQL injection, shell commands)
- Tool Control: Limit which tools agents can use
- Rate Limiting: Cap iterations and tool calls
- Audit Trail: Full logging for compliance and debugging

Changes

Added `src/crewai/governance/`
- `_kernel.py` - GovernedAgent, GovernedCrew, GovernancePolicy classes
- `__init__.py` - Public exports
- `README.md` - Documentation and examples

Example Usage

```python
from crewai import Agent, Crew, Task
from crewai.governance import GovernedCrew, GovernancePolicy

# Define policy
policy = GovernancePolicy(
    max_tool_calls=20,
    max_iterations=15,
    blocked_patterns=["DROP TABLE", "rm -rf"],
    blocked_tools=["shell_tool"],
)

# Govern the crew (`crew` is an existing Crew instance)
governed_crew = GovernedCrew(crew, policy)
result = governed_crew.kickoff()

# Check audit
print(f"Violations: {len(governed_crew.violations)}")
```

Value for CrewAI Users

| Feature | Without Module | With Agent-OS |
| --- | --- | --- |
| Content Filtering | Manual | Automatic |
| Tool Limits | None | Configurable |
| Audit Trail | DIY | Built-in |
| Violation Handling | Runtime errors | Controlled callbacks |

Integration Path

This module works standalone, but can also integrate with the full Agent-OS kernel for:

- GDPR/HIPAA compliance policies
- Cost control limits
- Human-in-the-loop approval flows
- Cross-framework governance

Related Work

- Similar integration accepted in microsoft/autogen#7212
- Agent-Lightning integration: microsoft/agent-lightning#478

References

Agent-OS: https://github.com/imran-siddique/agent-os

Note

Medium Risk
Introduces new runtime wrappers that monkey-patch Agent.execute_task and filter/sanitize tool usage and outputs, which could subtly change agent behavior and require validation across CrewAI versions.
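The risk note above concerns wrapping `execute_task` at runtime. The sketch below shows the general shape of such a wrapper, not the PR's actual code. `FakeAgent` and `govern` are hypothetical names, and whether attribute assignment works the same way on a real CrewAI `Agent` is an assumption that would need checking per version.

```python
import functools


class FakeAgent:
    """Hypothetical stand-in for a CrewAI Agent (not the real class)."""

    role = "researcher"

    def execute_task(self, task):
        return f"done: {task}"


def govern(agent, audit_log):
    """Replace agent.execute_task with a wrapper that logs around the call."""
    original = getattr(agent, "execute_task", None)
    if original is None:
        # Nothing to wrap: record a warning instead of failing.
        audit_log.append(("warning", "agent has no execute_task; not governed"))
        return agent

    @functools.wraps(original)
    def governed_execute(*args, **kwargs):
        role = getattr(agent, "role", "unknown")
        audit_log.append(("start", role))
        result = original(*args, **kwargs)
        audit_log.append(("end", role))
        return result

    # The monkey-patch: the instance attribute shadows the class method.
    agent.execute_task = governed_execute
    return agent
```

Because the patch replaces an attribute on a live object, any change to how the upstream class dispatches `execute_task` can silently bypass the wrapper, which is why validation across versions matters.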

Overview
Adds a new crewai.governance module that provides GovernancePolicy, GovernedAgent, and GovernedCrew wrappers to apply policy checks during crew execution.

GovernedAgent wraps Agent.execute_task to filter tools (allow/deny lists), track tool-call and iteration limits (recording violations), and sanitize outputs by truncating and replacing blocked regex patterns; all violations and actions can be recorded to an audit trail with an optional on_violation callback.
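A minimal sketch of the tool filtering and output sanitization described above, under stated assumptions: `SketchPolicy`, its field names, the `[BLOCKED]` placeholder, and the truncate-then-replace order are all illustrative and are not taken from `_kernel.py`.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class SketchPolicy:
    """Hypothetical stand-in for GovernancePolicy; field names are assumptions."""

    blocked_patterns: List[str] = field(default_factory=list)
    blocked_tools: List[str] = field(default_factory=list)
    allowed_tools: Optional[List[str]] = None  # None means "no allow list"
    max_output_length: int = 1000


def filter_tools(tool_names, policy):
    """Keep tools that pass the allow list (if any) and are not on the deny list."""
    kept = []
    for name in tool_names:
        if policy.allowed_tools is not None and name not in policy.allowed_tools:
            continue
        if name in policy.blocked_tools:
            continue
        kept.append(name)
    return kept


def sanitize_output(text, policy):
    """Truncate, then replace each blocked regex; return (text, matched patterns)."""
    matched = []
    text = text[: policy.max_output_length]
    for pattern in policy.blocked_patterns:
        text, count = re.subn(pattern, "[BLOCKED]", text, flags=re.IGNORECASE)
        if count:
            matched.append(pattern)
    return text, matched
```

One design caveat of this ordering: truncating first means a blocked pattern that straddles the cutoff is only partially present and will not match, so replacing before truncating may be the safer choice.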

GovernedCrew wraps a Crew to automatically govern all agents, aggregate violations, log crew-level lifecycle events, and record an execution-time violation (post-run) when max_execution_time is exceeded; documentation and examples are added in governance/README.md.
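The post-run execution-time check mentioned above could look roughly like the following. This is a sketch only: `run_with_time_audit` and the injectable `clock` parameter are hypothetical, and the point is that the work always runs to completion before the limit is compared.

```python
import time


def run_with_time_audit(fn, max_execution_time, clock=time.monotonic):
    """Run fn() to completion, then record a violation if it overran the budget.

    This is audit-only: nothing is interrupted, the overrun is just reported.
    """
    violations = []
    start = clock()
    result = fn()
    elapsed = clock() - start
    if max_execution_time is not None and elapsed > max_execution_time:
        violations.append(
            f"Execution time {elapsed:.2f}s exceeded limit {max_execution_time}s"
        )
    return result, violations
```

Passing the clock as a parameter keeps the check testable without real delays.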

Written by Cursor Bugbot for commit 4279378. This will update automatically on new commits.

Adds kernel-level governance for CrewAI workflows.

Features:
- GovernancePolicy: Define rules for crew behavior
- GovernedAgent: Wrap individual agents with policy enforcement
- GovernedCrew: Govern entire crews with shared policy
- Content filtering with blocked patterns
- Tool filtering (blocked/allowed lists)
- Full audit trail

Integration with Agent-OS kernel for enterprise governance.
See: https://github.com/imran-siddique/agent-os

src/crewai/governance/_kernel.py

- Fix content filtering bypass for non-string outputs
- Fix double-counting of agent violations in crew totals
- Remove unimplemented human approval features from docs
- Add warning when agent lacks execute_task method
- Use getattr consistently for agent.role access
- Clarify rate limiting requires CrewAI callback integration

src/crewai/governance/_kernel.py

1. Rate limiting now enforced:
   - _tool_calls incremented when tools pass filter
   - _iterations incremented after each execution
   - TOOL_LIMIT_EXCEEDED and ITERATION_LIMIT_EXCEEDED violations raised
2. Content filter return type: Already fixed (returns sanitized string)
3. Timeout enforcement: Clarified in docstring that it's audit-only post-execution check. Real-time enforcement needs asyncio.timeout.
4. Audit event task_name: Fixed key mismatch - now extracts from both 'task_name' and 'task' object (via .description or .name)
5. Crew violations in audit log: _record_violation now calls _log_event to ensure all violations appear in audit trail
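Item 3 notes that real-time timeout enforcement would need asyncio. As a hedged illustration of what that could involve, the sketch below uses `asyncio.wait_for`, swapped in for `asyncio.timeout` because the latter only exists on Python 3.11 and later. `run_with_deadline`, `slow_task`, and `fast_task` are hypothetical names, and nothing here is taken from the PR.

```python
import asyncio


async def run_with_deadline(coro_fn, seconds):
    """Cancel coro_fn() if it runs longer than `seconds`; return (result, timed_out)."""
    try:
        result = await asyncio.wait_for(coro_fn(), timeout=seconds)
        return result, False
    except asyncio.TimeoutError:
        # The awaited coroutine has been cancelled at this point.
        return None, True


async def slow_task():
    await asyncio.sleep(0.2)
    return "finished"


async def fast_task():
    return "finished"
```

This only interrupts work that yields to the event loop. A synchronous, blocking call cannot be cancelled this way, which may be why the audit-only check was kept.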

imran-siddique · 2026-02-07T03:04:23Z

Cursor Bugbot has reviewed your changes and found 4 potential issues.


Please review again

The _check_output method was returning a string for non-string outputs when violations were detected, which broke downstream consumers expecting specific types (e.g., TaskOutput). Now returns the original object while logging a warning. Violations are still recorded and can be retrieved via get_violations(). Fixes review comment about return type breaking downstream consumers.
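The behavior described in this commit message can be sketched as follows. This is an illustration under assumptions, not the code in `_kernel.py`: `check_output`, `FakeTaskOutput`, and the `[BLOCKED]` placeholder are hypothetical.

```python
import logging
import re

logger = logging.getLogger(__name__)


class FakeTaskOutput:
    """Hypothetical stand-in for a structured output type such as TaskOutput."""

    def __init__(self, raw):
        self.raw = raw

    def __str__(self):
        return self.raw


def check_output(output, blocked_patterns, violations):
    """Record violations for any output; sanitize only when the output is a str."""
    text = output if isinstance(output, str) else str(output)
    hits = [p for p in blocked_patterns if re.search(p, text, re.IGNORECASE)]
    violations.extend(hits)
    if not hits:
        return output
    if isinstance(output, str):
        for pattern in hits:
            text = re.sub(pattern, "[BLOCKED]", text, flags=re.IGNORECASE)
        return text
    # Non-string output: keep the caller's type contract, but leave a trace.
    logger.warning(
        "Blocked content in %s output; returning original object. Violations: %d",
        type(output).__name__,
        len(violations),
    )
    return output
```

The trade-off is that a non-string output containing blocked content passes through unsanitized; only the recorded violation signals the problem.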

imran-siddique · 2026-02-07T04:27:06Z

All Review Comments Addressed ✅

Fixed the Bugbot-identified issue:

✅ Content filter return type fix: _check_output now preserves original object type for non-string outputs instead of converting to string (which broke downstream consumers expecting TaskOutput, etc.)

The violation is still recorded and can be retrieved via get_violations() - we just don't break the type contract anymore.

Could you please provide a final review so we can move forward? Thank you! 🙏

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.


cursor · 2026-02-07T04:28:00Z

src/crewai/governance/_kernel.py

+                "Returning original object to preserve type. Violations: %d",
+                type(output).__name__,
+                len(self._violations)
+            )


logger undefined in _check_output causes NameError

High Severity

logger is defined as a local variable inside _wrap_execution (line 145) but referenced in _check_output (line 279), which is a separate method with no access to that local scope. When a non-string output triggers a content violation, this raises a NameError, crashing the governed execution instead of gracefully returning the original object.

Additional Locations (1)

src/crewai/governance/_kernel.py#L143-L145
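The scoping problem reported here can be reproduced in a few lines. The sketch below is not the PR's code: `Broken`, `Fixed`, and `local_log` are hypothetical, and the fix shown (a module-level logger) is one common option rather than the change the author necessarily made.

```python
import logging

# Module-level logger: visible to every function and method in this module.
logger = logging.getLogger(__name__)


class Broken:
    def wrap(self):
        # This name exists only inside wrap().
        local_log = logging.getLogger("broken")
        local_log.debug("wrapping")

    def check(self, output):
        # local_log is not in scope here, so this line raises NameError.
        local_log.warning("violation in %s", type(output).__name__)
        return output


class Fixed:
    def check(self, output):
        # Resolves to the module-level logger defined above.
        logger.warning("violation in %s", type(output).__name__)
        return output
```

Because the failing line only runs when a violation is detected on a non-string output, the error would not surface in tests that cover only clean or string outputs.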

cursor · 2026-02-07T04:28:00Z

src/crewai/governance/_kernel.py

+                self._record_violation(
+                    ViolationType.TOOL_LIMIT_EXCEEDED,
+                    f"Tool calls ({self._tool_calls}) exceeded limit ({self.policy.max_tool_calls})",
+                )


Tool filtering incorrectly counted as tool invocations

Medium Severity

_filter_tools increments self._tool_calls for each tool that passes the allow/block filter (line 226), treating the count of available tools as actual tool invocations. If an agent has 10 allowed tools and max_tool_calls is 5, a TOOL_LIMIT_EXCEEDED violation fires before any tool is actually called, making rate limiting ineffective and producing false violations.

Additional Locations (1)

src/crewai/governance/_kernel.py#L166-L172
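One way to count invocations rather than available tools is to wrap each tool callable and increment the counter when it is actually called. The sketch below assumes tools are plain callables, and `ToolCallCounter` is a hypothetical helper. It records a violation on overrun without blocking the call, which is also an assumption.

```python
import functools


class ToolCallCounter:
    """Counts real tool invocations and records a violation past the limit."""

    def __init__(self, max_tool_calls):
        self.max_tool_calls = max_tool_calls
        self.calls = 0
        self.violations = []

    def wrap(self, fn):
        @functools.wraps(fn)
        def counted(*args, **kwargs):
            # Incremented per call, not per tool made available.
            self.calls += 1
            if self.calls > self.max_tool_calls:
                self.violations.append(
                    f"Tool calls ({self.calls}) exceeded limit ({self.max_tool_calls})"
                )
            return fn(*args, **kwargs)

        return counted
```

With this shape, exposing ten tools under a limit of five produces no violation until the sixth actual call.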

imran-siddique mentioned this pull request Feb 5, 2026

[Contrib] Agent-OS Governance Guardrails: Kernel-Level Policy Enforcement openai/openai-agents-python#2422

Closed

cursor bot reviewed Feb 5, 2026

src/crewai/governance/_kernel.py (resolved)

src/crewai/governance/_kernel.py (resolved)

src/crewai/governance/_kernel.py (outdated, resolved)

src/crewai/governance/_kernel.py (resolved)

imran-siddique mentioned this pull request Feb 6, 2026

Track PR: CrewAI #4384 - Governance integration imran-siddique/agent-os#114

Open

3 tasks

cursor bot reviewed Feb 7, 2026

imran-siddique wants to merge 4 commits into crewAIInc:main from imran-siddique:contrib/agent-os
