Skip to content

Conversation

roertbb
Copy link

@roertbb roertbb commented Jul 6, 2025

Following Google STT implementation has been heavily inspired by:

The Google SpeechClient v2 SDK for Node.js has been poorly documented and I've been following some trial and error approach (see _streamingRecognize instead of streamingRecognize, shape of initial streaming recognize config request) in which I ended up with the following working implementation.

Also, during the tests I've noticed that Google closes the connection with timeout error after it does not receive audio data. It can be observed during longer agent responses during which the audio is not transformed. To alleviate the problem I've implemented sending empty audio frames periodically, when user does not speak - see keepalive interval implementation. It was inspired on Deepgram implementation.

If there is some additional work required to integrate new plugin I'd be grateful for guidance as I'm first time contributor here.

Copy link

changeset-bot bot commented Jul 6, 2025

⚠️ No Changeset found

Latest commit: 8fa356b

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

@CLAassistant
Copy link

CLAassistant commented Jul 6, 2025

CLA assistant check
All committers have signed the CLA.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants