prevent tool cancellation when AgentTask is called inside it #4586

longcw · 2026-01-22T09:40:42Z

when there is a speech generated alongside a tool call, the interruption to the speech shouldn't cancel the tool execution if it's await for an AgentTask.

Summary by CodeRabbit

Bug Fixes
- Enhanced tool execution reliability by preventing premature cancellation when speech generation is active.
- Improved speech pause handling with better state tracking and proper recovery after cancellation.
- Enhanced task logging for better debugging of cancellation events.
Chores
- Updated email example to use OpenAI GPT-4.1 Mini as the default LLM model.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

coderabbitai · 2026-01-22T09:41:03Z

📝 Walkthrough

Walkthrough

This PR adds a SpeechHandle tool_cancelable flag and uses it to prevent mid-execution cancellations, renames/refactors paused-speech interruption APIs to _cancel_speech_pause (with an interrupt parameter), assigns names to tool-execution tasks, changes SegmentSynchronizerImpl.resume to a no-op after close, and switches the email example LLM to openai/gpt-4.1-mini.

Changes

Cohort / File(s)	Summary
Tool execution safety `livekit-agents/livekit/agents/voice/speech_handle.py`, `livekit-agents/livekit/agents/voice/agent.py`	Add `_tool_cancelable` + public `tool_cancelable` property on `SpeechHandle`. Agent code temporarily sets `speech_handle.tool_cancelable = False` while awaiting tool execution and restores the previous value in finally blocks to avoid race-condition cancellations.
Speech pause handling refactor `livekit-agents/livekit/agents/voice/agent_activity.py`	Rename `_interrupt_paused_speech_task` → `_cancel_speech_pause_task`, replace `_interrupt_paused_speech(...)` calls with `_cancel_speech_pause(..., interrupt=...)`, add `interrupt` parameter, and unify forwarded_text/speech scheduling and interruption-reset logic across flows.
Task naming for tooling `livekit-agents/livekit/agents/voice/generation.py`	Give the asyncio task created for tool execution a `name` (function call name) so cancellations/logging can reference the task name.
Transcription resume behavior `livekit-agents/livekit/agents/voice/transcription/synchronizer.py`	`SegmentSynchronizerImpl.resume` now silently returns when called after close (removed runtime warning).
Example LLM change `examples/voice_agents/email_example.py`	Change default LLM backend from `google/gemini-2.5-flash` to `openai/gpt-4.1-mini`.

Sequence Diagram(s)

sequenceDiagram
    participant Agent as Agent
    participant Speech as SpeechHandle
    participant Tool as Tool Task
    participant Finally as Finally

    Agent->>Speech: read tool_cancelable (old_state)
    Agent->>Speech: set tool_cancelable = False
    Note over Speech: prevents mid-execution cancellation

    Agent->>Tool: create named asyncio task (tool execution)
    Tool->>Tool: run tool logic
    Note over Tool: task runs without being cancelled by speech pause

    Tool-->>Agent: tool completes / returns result
    Finally->>Speech: restore tool_cancelable = old_state
    Note over Speech: original cancelability restored

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

interrupt the same speech handle #4536: Addresses race conditions around interrupting the active SpeechHandle, closely related to this PR's tool-cancelable and pause-cancellation changes.

Poem

🐰 A rabbit hops where code flags gleam,
I tuck cancels away behind a seam,
Tasks run named, safe from sudden stops,
Pauses canceled with gentler plops,
I nibble bugs and dance on logs—hip, hop! 🥕

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 8.70% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'prevent tool cancellation when AgentTask is called inside it' is specific and directly describes the main change, which involves preventing tool cancellation during AgentTask execution to fix a deadlock issue.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

chenghao-mou

LGTM. I tested with the email example, and it worked.

livekit-agents/livekit/agents/voice/transcription/synchronizer.py

livekit-agents/livekit/agents/voice/agent_activity.py

livekit-agents/livekit/agents/voice/speech_handle.py

…interruption

devin-ai-integration

Devin Review found 1 potential issue.

View issue and 5 additional flags in Devin Review.

livekit-agents/livekit/agents/voice/agent.py

devin-ai-integration

Devin Review found 1 new potential issue.

View issue and 8 additional flags in Devin Review.

devin-ai-integration · 2026-02-03T02:46:29Z

livekit-agents/livekit/agents/voice/agent_activity.py

        self._paused_speech = None

        if self._session.options.resume_false_interruption and self._session.output.audio:


🟡 Paused speech state cleared prematurely when allow_interruptions is False

When _cancel_speech_pause is called while an AgentTask has temporarily disabled interruptions (by setting speech_handle.allow_interruptions = False), the method still clears _paused_speech = None and calls resume() even though it skipped the interrupt logic.

Click to expand

Scenario

Speech is playing with allow_interruptions=True

User speaks, triggering _interrupt_by_audio_activity which pauses the audio and sets _paused_speech = self._current_speech

Tool execution calls await AgentTask(), which sets speech_handle.allow_interruptions = False (line 769 in agent.py)

User's final transcript triggers on_final_transcript which creates a task to call _cancel_speech_pause

In _cancel_speech_pause, the condition at line 2789-2792 evaluates to False because allow_interruptions is now False

The interrupt block is skipped, but _paused_speech = None is still executed (line 2797)

Audio is resumed if resume_false_interruption option is set (line 2799-2800)

Impact

The paused speech reference is cleared prematurely while an AgentTask is running. When the AgentTask completes and restores allow_interruptions, the false interruption handling state has already been cleared. This could cause:

Inconsistent state tracking where _paused_speech is None but the speech wasn't properly interrupted

The false interruption detection logic won't work correctly after AgentTask completes

Audio might resume unexpectedly during AgentTask execution

(Refers to lines 2797-2800)

Recommendation: Consider not clearing _paused_speech and not calling resume() when the speech's allow_interruptions is False due to an AgentTask lock. The cleanup should happen either when the speech is successfully interrupted or when the AgentTask completes and the original interruption handling can proceed.

Was this helpful? React with 👍 or 👎 to provide feedback.

theomonnom · 2026-02-03T02:50:04Z

livekit-agents/livekit/agents/voice/agent_activity.py

+        if speech_handle.interrupted:
+            await utils.aio.cancel_and_wait(exe_task)
+            return


I think this should be removed?

there is a guard for cancellation https://github.com/livekit/agents/blob/livekit-agents@1.3.12/livekit-agents/livekit/agents/voice/generation.py#L648-L658, we will cancel the tool execution task but not the user's function

theomonnom · 2026-02-03T02:51:06Z

livekit-agents/livekit/agents/voice/agent_activity.py

-                msg = chat_ctx.add_message(
-                    role="assistant",
-                    content=forwarded_text,
-                    id=llm_gen_data.id,
-                    interrupted=True,
-                    created_at=reply_started_at,
-                    metrics=assistant_metrics,
-                )
-                self._agent._chat_ctx.insert(msg)
-                self._session._conversation_item_added(msg)
-                speech_handle._item_added([msg])
-                current_span.set_attribute(trace_types.ATTR_RESPONSE_TEXT, forwarded_text)
-
-            if self._session.agent_state == "speaking":
-                self._session._update_agent_state("listening")
-
-            speech_handle._mark_generation_done()
-            await utils.aio.cancel_and_wait(exe_task)
-            return


Was this some duplicated logic?

yes, we have some duplicated code for interrupted and not interrupted. I merged them in this pr.

theomonnom · 2026-02-03T02:54:49Z

Otherwise it lgtm, but I'm not sure to follow the logic inside _close_session where we put interrupt=False for the paused speech.

longcw · 2026-02-03T03:05:32Z

_pause_scheduling_task will make sure all the speeches are done or should be ignored if it's in _drain_blocked_tasks, so in _close_session we only cancel the timer related to pause.

commit c46013d Author: Long Chen <longch1024@gmail.com> Date: Tue Feb 3 20:02:57 2026 +0800 add exclude_config_update to ChatContext copy (livekit#4700) commit 7849a8c Author: Chenghao Mou <chenghao.mou@livekit.io> Date: Tue Feb 3 09:51:07 2026 +0000 fix: commit user turn with STT and realtime (livekit#4663) commit edfa391 Author: Chenghao Mou <chenghao.mou@livekit.io> Date: Tue Feb 3 09:48:36 2026 +0000 add STT usage for google (livekit#4599) commit 34d0d62 Author: Long Chen <longch1024@gmail.com> Date: Tue Feb 3 15:53:42 2026 +0800 fix gemini live tool execution interrupted by generation_complete event (livekit#4699) commit 1725929 Author: Long Chen <longch1024@gmail.com> Date: Tue Feb 3 11:08:27 2026 +0800 prevent tool cancellation when AgentTask is called inside it (livekit#4586)

fix deadlock when interrupting a tool that awaiting for AgentTask

9c7d397

chenghao-mou requested a review from a team January 22, 2026 09:40

fix _previous_user_metrics

f8ba889

longcw changed the title ~~fix deadlock when interrupting a tool that awaiting for AgentTask~~ prervent tool cancellation when a AgentTask is called Jan 22, 2026

longcw changed the title ~~prervent tool cancellation when a AgentTask is called~~ prevent tool cancellation when AgentTask is called inside it Jan 22, 2026

chenghao-mou approved these changes Jan 22, 2026

View reviewed changes

livekit-agents/livekit/agents/voice/transcription/synchronizer.py Outdated Show resolved Hide resolved

livekit-agents/livekit/agents/voice/agent_activity.py Outdated Show resolved Hide resolved

livekit-agents/livekit/agents/voice/speech_handle.py Outdated Show resolved Hide resolved

longcw mentioned this pull request Feb 2, 2026

Awaiting AgentTask in tool deadlocks #4661

Open

longcw added 2 commits February 3, 2026 09:29

Merge remote-tracking branch 'origin/main' into longc/fix-agent-task-…

8f08b37

…interruption

disallow interruption

6511a91

longcw requested a review from a team February 3, 2026 02:32

devin-ai-integration bot reviewed Feb 3, 2026

View reviewed changes

livekit-agents/livekit/agents/voice/agent.py Show resolved Hide resolved

longcw added 2 commits February 3, 2026 10:38

clean

fc28e63

add check for speech handle interrupted

e0bcd6a

devin-ai-integration bot reviewed Feb 3, 2026

View reviewed changes

theomonnom reviewed Feb 3, 2026

View reviewed changes

longcw merged commit 1725929 into main Feb 3, 2026
18 checks passed

longcw deleted the longc/fix-agent-task-interruption branch February 3, 2026 03:08

		self._paused_speech = None

		if self._session.options.resume_false_interruption and self._session.output.audio:

prevent tool cancellation when AgentTask is called inside it #4586

prevent tool cancellation when AgentTask is called inside it #4586

Conversation

longcw commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

chenghao-mou left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Feb 3, 2026

Choose a reason for hiding this comment

Scenario

Impact

Uh oh!

theomonnom Feb 3, 2026

Choose a reason for hiding this comment

Uh oh!

longcw Feb 3, 2026

Choose a reason for hiding this comment

Uh oh!

theomonnom Feb 3, 2026

Choose a reason for hiding this comment

Uh oh!

longcw Feb 3, 2026

Choose a reason for hiding this comment

Uh oh!

theomonnom commented Feb 3, 2026

Uh oh!

longcw commented Feb 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

longcw commented Jan 22, 2026 •

edited

Loading

coderabbitai bot commented Jan 22, 2026 •

edited

Loading