Python: Bug: In Handoff orchestration - when handling a request message from an agent in the handoff group, the persona adoption message is sent by the User which triggers openAI jailbreak guardrails

**Describe the bug**
when handling a request message from an agent in the handoff group, the persona adoption message is sent by the User which triggers openAI jailbreak guardrails

![Image](https://github.com/user-attachments/assets/f780f3ed-ae69-4dc2-804e-72b825079e4f)

if self._agent_thread is None:
            self._chat_history.add_message(
                ChatMessageContent(
                    role=AuthorRole.USER,
                    content=f"Transferred to {self._agent.name}, adopt the persona immediately.",
                )
            )
            response_item = await self._agent.get_response(
                messages=self._chat_history.messages,  # type: ignore[arg-type]
                kernel=self._kernel,
            )
        else:
            response_item = await self._agent.get_response(
                messages=ChatMessageContent(
                    role=AuthorRole.USER,
                    content=f"Transferred to {self._agent.name}, adopt the persona immediately.",
                ),
                thread=self._agent_thread,
                kernel=self._kernel,
            )

Changing the role to Assistant fixes the issue.

**To Reproduce**
Steps to reproduce the behavior:
1. Create a handoff orchestration
2. And make sure the open ai guardrails are deployed.
3. Then run the orchestration.
4. See error

**Expected behavior**
It should handoff to other agents on runtime without Jailbreak issues

**Screenshots**
If applicable, add screenshots to help explain your problem.
![Image](https://github.com/user-attachments/assets/f780f3ed-ae69-4dc2-804e-72b825079e4f)

**Platform**
 - Language: Python
 - Source: pip package semantic-kernel 1.31.0
 - AI model: OpenAI:GPT-4o
 - IDE: VS Code
 - OS: Windows

**Additional context**
         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "d:\repo\CG_GenAI_Accelerator_repos\sk-agentic-demo\.venv\Lib\site-packages\openai\_base_client.py", line 1549, in request
    raise self._make_status_error_from_response(err.response) from None
openai.BadRequestError: Error code: 400 - {'error': {'message': "The response was filtered due to the prompt triggering Azure OpenAI's content management policy. Please modify your prompt and retry. To learn more about our content filtering policies please read our documentation: https://go.microsoft.com/fwlink/?linkid=2198766", 'type': None, 'param': 'prompt', 'code': 'content_filter', 'status': 400, 'innererror': {'code': 'ResponsibleAIPolicyViolation', 'content_filter_result': {'hate': {'filtered': False, 'severity': 'safe'}, 'jailbreak': {'filtered': True, 'detected': True}, 'self_harm': {'filtered': False, 'severity': 'safe'}, 'sexual': {'filtered': False, 'severity': 'safe'}, 'violence': {'filtered': False, 'severity': 'safe'}}}}}

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "d:\repo\CG_GenAI_Accelerator_repos\sk-agentic-demo\.venv\Lib\site-packages\semantic_kernel\agents\runtime\in_process\in_process_runtime.py", line 470, in _on_message
    return await agent.on_message(
           ^^^^^^^^^^^^^^^^^^^^^^^
    ...<2 lines>...
    )
    ^
  File "d:\repo\CG_GenAI_Accelerator_repos\sk-agentic-demo\.venv\Lib\site-packages\semantic_kernel\agents\runtime\core\base_agent.py", line 129, in on_message
    return await self.on_message_impl(message, ctx)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "d:\repo\CG_GenAI_Accelerator_repos\sk-agentic-demo\.venv\Lib\site-packages\semantic_kernel\agents\orchestration\agent_actor_base.py", line 35, in on_message_impl
    return await super().on_message_impl(message, ctx)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "d:\repo\CG_GenAI_Accelerator_repos\sk-agentic-demo\.venv\Lib\site-packages\semantic_kernel\agents\runtime\core\routed_agent.py", line 488, in on_message_impl
    return await h(self, message, ctx)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "d:\repo\CG_GenAI_Accelerator_repos\sk-agentic-demo\.venv\Lib\site-packages\semantic_kernel\agents\runtime\core\routed_agent.py", line 156, in wrapper
    return_value = await func(self, message, ctx)
                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "d:\repo\CG_GenAI_Accelerator_repos\sk-agentic-demo\.venv\Lib\site-packages\semantic_kernel\agents\orchestration\handoffs.py", line 285, in _handle_request_message
    response_item = await self._agent.get_response(
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    ...<2 lines>...
    )
    ^


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Python: Bug: In Handoff orchestration - when handling a request message from an agent in the handoff group, the persona adoption message is sent by the User which triggers openAI jailbreak guardrails #12294

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Python: Bug: In Handoff orchestration - when handling a request message from an agent in the handoff group, the persona adoption message is sent by the User which triggers openAI jailbreak guardrails #12294

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions