Dynamic sample rate handling in speaker stream #1423

yujonglee · 2025-08-30T00:45:34Z

No description provided.

coderabbitai · 2025-08-30T00:45:41Z

📝 Walkthrough

Walkthrough

Adds a new async resampler wrapper and exposes it from the audio crate. Tracks macOS speaker sample rate via a shared atomic. Replaces ad-hoc resampling in the listener plugin with the wrapper, adds a resampling helper script and new data paths, and switches notification spawn to Tauri’s runtime.

Changes

Cohort / File(s)	Summary of changes
Audio crate — resampler API `crates/audio/src/lib.rs`, `crates/audio/src/resampler.rs`	Adds a new `resampler` module and `pub use resampler::*;` in the crate root. Introduces `ResampledAsyncSource<S: AsyncSource>` implementing `Stream<Item = f32>` and `AsyncSource`, performing linear-interpolation resampling and tracking source rate. Includes tests for dynamic-rate inputs.
macOS speaker — runtime sample rate tracking `crates/audio/src/speaker/macos.rs`	Replaces per-stream ASBD storage with a shared `Arc<AtomicU32>` `current_sample_rate` in `SpeakerStream` and `Ctx`; audio callback updates the atomic and `sample_rate()` reads from it. Removes `stream_desc` field.
Data assets and scripts `crates/data/scripts/resamples.sh`, `crates/data/src/english_1/mod.rs`	Adds an executable `resamples.sh` to split/resample an input into six parts at target sample rates. Adds six new public path constants pointing to the generated resampled WAV parts.
Listener plugin — resampling and AEC handling `plugins/listener/src/fsm.rs`	Replaces manual `.resample(...)` calls with `ResampledAsyncSource::new(..., SAMPLE_RATE).chunks(...)` for mic and speaker streams. Removes a 65 ms sleep previously used for AirPods transitions. Adds error logging for AEC failures and falls back to raw mic chunks on error.
Notification plugin — async runtime `plugins/notification/src/handler.rs`	Changes spawned task from `tokio::spawn` to `tauri::async_runtime::spawn` when handling `MicStopped` pause logic.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant C as Consumer
  participant R as ResampledAsyncSource
  participant S as Inner AsyncSource
  participant L as LinearResampler

  C->>R: poll_next()
  alt Need source samples
    R->>S: poll_next()
    S-->>R: Item(f32)/Pending/None
    opt On Item
      R->>L: push source sample(s)
    end
  end
  alt Source ended
    R-->>C: None
  else Source pending
    R-->>C: Pending
  else Have data
    R->>L: interpolate(sample_position)
    L-->>R: f32 sample
    R-->>C: Item(f32)
    R->>R: advance sample_position by (src_rate/target_rate)
    R->>R: update last_source_rate if changed
  end

sequenceDiagram
  autonumber
  participant AU as AudioUnit Callback
  participant Ctx as Ctx
  participant A as Atomic current_sample_rate
  participant SS as SpeakerStream

  AU->>Ctx: process(buffer)
  AU->>A: store(device.actual_sample_rate or ctx.format.sample_rate) (Relaxed)
  Note over A,SS: Cross-thread visibility of current output rate
  SS->>A: load() (Relaxed)
  A-->>SS: u32 sample_rate
  SS-->>SS: sample_rate() returns loaded value

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

implement microphone selector #1207 — Adds/exports ResampledAsyncSource and updates listener plugin to use it; strong overlap with resampler integration here.
feat(notification): Spawn event notification in a separate task #871 — Switches notification spawn to tauri::async_runtime::spawn; matches the spawn change in this PR.
event worker and meeting detector #1410 — Also modifies notification handler spawn behavior; related at the code-level for task runtime changes.

✨ Finishing Touches

📝 Generate Docstrings

🧪 Generate unit tests

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch dynamic-sample-rate

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

‼️ IMPORTANT
Auto-reply has been disabled for this repository in the CodeRabbit settings. The CodeRabbit bot will not respond to your replies unless it is explicitly tagged.

Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

CodeRabbit Commands (Invoked using PR/Issue comments)

Type @coderabbitai help to get the list of available commands.

Other keywords and placeholders

Add @coderabbitai ignore or @coderabbit ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

Status, Documentation and Community

Visit our Status Page to check the current availability of CodeRabbit.
Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 3

🧹 Nitpick comments (7)

crates/data/scripts/resamples.sh (3)
11-12: Don’t assume .wav extension when deriving BASENAME

If a non-wav is passed, BASENAME includes the extension. Make this robust.
- B A S E N A M E=$(basename "$INPUT_FILE" .wav)
+ EXT="${INPUT_FILE##*.}"
+ BASENAME=$(basename "$INPUT_FILE" ."$EXT")
31-31: Seek faster by moving -ss before -i (optional)

Placing -ss before -i speeds up splitting on large files.
- ffmpeg -i "$INPUT_FILE" -ss ${START} -t ${PART_DURATION} -ar ${RATE} "$OUTPUT_FILE" -y -loglevel error
+ ffmpeg -ss "$START" -t "$PART_DURATION" -i "$INPUT_FILE" -ar "$RATE" "$OUTPUT_FILE" -y -loglevel error
20-31: Edge-case: last segment duration drift

Floating math can make the last cut a tad short/long. Consider letting the last part run to EOF by omitting -t on the final iteration.
-for i in "${!SAMPLE_RATES[@]}"; do
+for i in "${!SAMPLE_RATES[@]}"; do
   RATE=${SAMPLE_RATES[$i]}
   PART_NUM=$((i + 1))
   START=$(echo "$i * $PART_DURATION" | bc -l)
   OUTPUT_FILE="${DIR}/${BASENAME}_part${PART_NUM}_${RATE}hz.wav"
-  echo "Creating part ${PART_NUM}: ${START}s-$(echo "$START + $PART_DURATION" | bc -l)s at ${RATE}Hz"
-  ffmpeg -i "$INPUT_FILE" -ss ${START} -t ${PART_DURATION} -ar ${RATE} "$OUTPUT_FILE" -y -loglevel error
+  if [ "$PART_NUM" -eq "${NUM_PARTS}" ]; then
+    echo "Creating part ${PART_NUM}: ${START}s-EOF at ${RATE}Hz"
+    ffmpeg -ss "$START" -i "$INPUT_FILE" -ar "$RATE" "$OUTPUT_FILE" -y -loglevel error
+  else
+    echo "Creating part ${PART_NUM}: ${START}s-$(echo "$START + $PART_DURATION" | bc -l)s at ${RATE}Hz"
+    ffmpeg -ss "$START" -t "$PART_DURATION" -i "$INPUT_FILE" -ar "$RATE" "$OUTPUT_FILE" -y -loglevel error
+  fi
crates/data/src/english_1/mod.rs (1)
4-27: Optional: gate these paths behind cfg(test) if only used by tests

If production code doesn’t use these constants, guard them to avoid bundling dev-only paths into the public API.
- pub const AUDIO_PART1_8000HZ_PATH: &str = concat!(/* … */);
+#[cfg(test)]
+pub const AUDIO_PART1_8000HZ_PATH: &str = concat!(/* … */);
# …repeat for the other five
crates/audio/src/resampler.rs (2)
15-22: Initial seeding may output leading zeros for up/down-sampling

With sample_position set to rate ratio, the first 1–2 outputs can be zeros because the interpolator isn’t primed with two frames. Consider priming two frames on first poll or tracking a primed flag.
 pub fn new(source: S, target_sample_rate: u32) -> Self {
     let initial_rate = source.sample_rate();
     Self {
         source,
         target_sample_rate,
-        sample_position: initial_rate as f64 / target_sample_rate as f64,
+        // Start at >= 2.0 so we fetch at least two frames before first output.
+        sample_position: 2.0,
         resampler: dasp::interpolate::linear::Linear::new(0.0, 0.0),
         last_source_rate: initial_rate,
     }
 }
Alternative: add a primed: bool field and seed two frames once. I can provide that patch if you prefer.

35-38: last_source_rate is tracked but unused

Either use this to handle dynamic rate transitions (e.g., flush/prime on change) or remove it.
-    last_source_rate: u32,
+    // last_source_rate can be leveraged to re-prime when the source rate changes.
+    last_source_rate: u32,
If not needed, consider dropping the field and the update block.
plugins/listener/src/fsm.rs (1)
312-315: AEC failure fallback is safe; consider downgrading log level.

Falling back to mic_chunk_raw maintains continuity. To avoid log spam during transient AEC hiccups, consider tracing::warn! here.
- tracing::error!("aec_error: {:?}", e);
+ tracing::warn!("aec_error: {:?}", e);

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between a7e8313 and 8a66de5.

⛔ Files ignored due to path filters (6)

crates/data/src/english_1/audio_part1_8000hz.wav is excluded by !**/*.wav
crates/data/src/english_1/audio_part2_16000hz.wav is excluded by !**/*.wav
crates/data/src/english_1/audio_part3_22050hz.wav is excluded by !**/*.wav
crates/data/src/english_1/audio_part4_32000hz.wav is excluded by !**/*.wav
crates/data/src/english_1/audio_part5_44100hz.wav is excluded by !**/*.wav
crates/data/src/english_1/audio_part6_48000hz.wav is excluded by !**/*.wav

📒 Files selected for processing (7)

crates/audio/src/lib.rs (1 hunks)
crates/audio/src/resampler.rs (1 hunks)
crates/audio/src/speaker/macos.rs (4 hunks)
crates/data/scripts/resamples.sh (1 hunks)
crates/data/src/english_1/mod.rs (1 hunks)
plugins/listener/src/fsm.rs (3 hunks)
plugins/notification/src/handler.rs (1 hunks)

🧰 Additional context used

📓 Path-based instructions (1)

**/*.{js,ts,tsx,rs}

⚙️ CodeRabbit configuration file

**/*.{js,ts,tsx,rs}: 1. Do not add any error handling. Keep the existing one.
2. No unused imports, variables, or functions.
3. For comments, keep it minimal. It should be about "Why", not "What".

Files:

plugins/notification/src/handler.rs
crates/data/src/english_1/mod.rs
crates/audio/src/lib.rs
crates/audio/src/resampler.rs
plugins/listener/src/fsm.rs
crates/audio/src/speaker/macos.rs

🧬 Code graph analysis (3)

crates/audio/src/resampler.rs (1)

crates/audio/src/lib.rs (3)

poll_next (159-183)

as_stream (187-189)

sample_rate (191-197)

plugins/listener/src/fsm.rs (3)

crates/audio/src/resampler.rs (2)

new (14-23)

new (101-107)

crates/audio/src/mic.rs (1)

new (34-66)

crates/audio/src/lib.rs (1)

from_speaker (108-115)

crates/audio/src/speaker/macos.rs (3)

crates/audio/src/lib.rs (1)

sample_rate (191-197)

crates/audio/src/mic.rs (1)

sample_rate (188-190)

crates/audio/src/speaker/mod.rs (2)

sample_rate (89-91)

sample_rate (94-96)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: ci (macos, macos-latest)
GitHub Check: ci (windows, windows-latest)

🔇 Additional comments (9)

plugins/notification/src/handler.rs (1)

81-83: Good switch to Tauri runtime spawn

Using tauri::async_runtime::spawn inside a std::thread avoids “no reactor running” issues that tokio::spawn would hit off-runtime. The clone + await usage is correct.

crates/data/src/english_1/mod.rs (1)

4-27: Audio assets verified
All six referenced WAV files are present in crates/data/src/english_1, so no changes are needed.

crates/audio/src/resampler.rs (1)

151-205: Include data crate WAV assets in CI
Tests in crates/audio/src/resampler.rs (lines 151–205) depend on six WAV files in crates/data/src/english_1; confirm your CI setup checks out the data crate and that audio_part1_8000hz.wav through audio_part6_48000hz.wav are committed or generated so the tests won’t panic.

crates/audio/src/lib.rs (2)

5-5: Module wiring looks good

Adding mod resampler integrates the new file cleanly.

12-12: Public re-export is appropriate

pub use resampler::*; exposes the new wrapper at the crate root for downstream use.

plugins/listener/src/fsm.rs (2)

12-12: Good move to the shared resampler wrapper.

Importing ResampledAsyncSource centralizes rate handling and removes bespoke resampling paths.

252-257: Migrate remaining .resample() and confirm chunk sizes

.resample() still in

owhisper/owhisper-server/src/commands/run/realtime.rs:137

crates/audio/src/resampler.rs:173

crates/audio-utils/src/lib.rs:21
ensure these aren’t on the AEC path or replace with ResampledAsyncSource

Only fsm.rs uses .chunks(hypr_aec::BLOCK_SIZE); verify chunk_size in crates/audio-utils matches hypr_aec::BLOCK_SIZE where it feeds AEC

crates/audio/src/speaker/macos.rs (2)

1-1: Atomic sample-rate plumbing is clean.

Using Arc<AtomicU32> with relaxed ops for read-mostly SR propagation is appropriate and avoids mutex contention.

Also applies to: 33-39, 46-47

165-172: SR initialization and wiring look correct.

Seeding from asbd.sample_rate and sharing into both Ctx and SpeakerStream ensures sane defaults before the first callback update.

Also applies to: 176-183

crates/audio/src/resampler.rs

crates/audio/src/speaker/macos.rs

coderabbitai

Actionable comments posted: 0

♻️ Duplicate comments (1)

crates/audio/src/speaker/macos.rs (1)

115-121: Resolved prior issue: SR now read from the device each callback.

This addresses the earlier bug about writing a static rate; nice fix.

🧹 Nitpick comments (2)

crates/audio/src/speaker/macos.rs (2)

115-121: Reduce churn: round and only store on change.

Prevents noisy updates (float fuzz) and unnecessary atomic writes.

Apply:

-            ctx.current_sample_rate.store(
-                device
-                    .actual_sample_rate()
-                    .unwrap_or(ctx.format.absd().sample_rate) as u32,
-                Ordering::Relaxed,
-            );
+            let sr = device
+                .actual_sample_rate()
+                .unwrap_or(ctx.format.absd().sample_rate);
+            let sr_u32 = sr.round() as u32; // avoid truncation drift
+            if sr_u32 != ctx.current_sample_rate.load(Ordering::Relaxed) {
+                ctx.current_sample_rate.store(sr_u32, Ordering::Relaxed);
+            }

169-169: Init with rounded SR to avoid off-by-one truncation.

Avoids 44099/47999 from float imprecision.

-        let current_sample_rate = Arc::new(AtomicU32::new(asbd.sample_rate as u32));
+        let current_sample_rate = Arc::new(AtomicU32::new(asbd.sample_rate.round() as u32));

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 8a66de5 and 164c2de.

📒 Files selected for processing (2)

crates/audio/src/resampler.rs (1 hunks)
crates/audio/src/speaker/macos.rs (5 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

crates/audio/src/resampler.rs

🧰 Additional context used

📓 Path-based instructions (1)

**/*.{js,ts,tsx,rs}

⚙️ CodeRabbit configuration file

**/*.{js,ts,tsx,rs}: 1. Do not add any error handling. Keep the existing one.
2. No unused imports, variables, or functions.
3. For comments, keep it minimal. It should be about "Why", not "What".

Files:

crates/audio/src/speaker/macos.rs

🧬 Code graph analysis (1)

crates/audio/src/speaker/macos.rs (4)

crates/audio/src/resampler.rs (2)

sample_rate (69-71)

sample_rate (115-121)

crates/audio/src/lib.rs (1)

sample_rate (191-197)

crates/audio/src/mic.rs (1)

sample_rate (188-190)

crates/audio/src/speaker/mod.rs (2)

sample_rate (89-91)

sample_rate (94-96)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: ci (macos, macos-latest)
GitHub Check: ci (windows, windows-latest)

🔇 Additional comments (7)

crates/audio/src/speaker/macos.rs (7)

1-1: Atomic for cross-thread SR tracking: good call.

Appropriate choice; Ordering::Relaxed is sufficient for a standalone config value.

33-34: Stream now holds shared SR atomic.

Plumbs the state where it’s needed. Looks good.

37-39: Dynamic SR exposure via atomic load is correct.

Cheap read path; matches intended cross-thread visibility.

46-47: Ctx carries SR atomic for callback updates.

Right ownership and sharing semantics.

105-105: Use device in IO proc.

Renaming from _device to device removes the unused param and enables live SR reads.

175-175: Ctx wiring looks correct.

Shared SR is propagated properly.

186-186: Stream exposes SR atomic correctly.

Keeps the read path simple for callers.

yujonglee added 2 commits August 29, 2025 16:13

up-to-date samplerate for speaker stream

6613b4a

done

8a66de5

coderabbitai bot reviewed Aug 30, 2025

View reviewed changes

crates/audio/src/resampler.rs Show resolved Hide resolved

crates/audio/src/resampler.rs Show resolved Hide resolved

crates/audio/src/speaker/macos.rs Outdated Show resolved Hide resolved

fix problems

164c2de

coderabbitai bot reviewed Aug 30, 2025

View reviewed changes

yujonglee merged commit f624cf5 into main Aug 30, 2025
7 of 8 checks passed

yujonglee deleted the dynamic-sample-rate branch August 30, 2025 03:11

This was referenced Sep 9, 2025

Migrate to actor for audio processing #1457

Merged

Mixed audio #1471

Merged

This was referenced Nov 12, 2025

feat: Linux Support #1659

Open

Fix audio pipeline #1662

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Dynamic sample rate handling in speaker stream #1423

Dynamic sample rate handling in speaker stream #1423

Uh oh!

yujonglee commented Aug 30, 2025

Uh oh!

coderabbitai bot commented Aug 30, 2025 •

edited

Loading

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Chat

Support

CodeRabbit Commands (Invoked using PR/Issue comments)

Other keywords and placeholders

Status, Documentation and Community

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Dynamic sample rate handling in speaker stream #1423

Dynamic sample rate handling in speaker stream #1423

Uh oh!

Conversation

yujonglee commented Aug 30, 2025

Uh oh!

coderabbitai bot commented Aug 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Chat

Support

CodeRabbit Commands (Invoked using PR/Issue comments)

Other keywords and placeholders

Status, Documentation and Community

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

coderabbitai bot commented Aug 30, 2025 •

edited

Loading