Add voice receive support #2

davidffa · 2022-06-02T18:34:40Z

Combines the audio received from all users on the voice channel into an mp3 file.

Uses my koe audio receive implementation (davidffa/koe#2)

WARNING: If the NAS (native audio sending) is enabled, the audio receive system only works simultaneously with the audio sending if using Epoll.

Record payload struct:

{
  op: 'record',
  guildId: 'id',
  id: 'some random id you want',
  selfAudio: record self audio or not (boolean), (optional, default=false)
  users: array of user ids to record, (optional, if not passed, all users will be recorded)
  bitrate: bitrate value, (optional, default = 64000)
  channels: 1 or 2 (int), (optional, default = 2)
  format: 'MP3' | 'PCM' The output audio file format (currently the available formats are PCM and MP3), default is MP3
}

The id is used to the identify the recorded audio file when downloading it
The mp3 output file sample rate is 48khz
To finish recording, simply send record payload only with the guildId
When the lavalink finishes processing the audio, it emits a recordFinished event, so you know when you can download the audio file.
- RecordFinished event struct: { op: 'recordFinished', guildId: <guildid>, id: <the id of the recording> }
The mp3 encoding is done by native code, using the C library libmp3lame, so it currently works on darwin-aarch64, linux-x86-64, linux-aarch64 and win-x86-64.

Added events:

You have to add `'Speaking-Events': 'true'` on WebSocket headers in order to receive this events

speakingStart and speakingStop only work while recording audio.

SpeakingStart (emitted when a user starts speaking in the voice channel)

{
  op: 'speakingEvent',
  event: 'start',
  guildId: 'guild id',
  userId: 'user id'
}

SpeakingStop (emitted when a user stops speaking in the voice channel (100ms threshold))

{
  op: 'speakingEvent',
  event: 'stop',
  guildId: 'guild id',
  userId: 'user id'
}

Disconnected (emitted when a user leaves the voice channel)

{
  op: 'speakingEvent',
  event: 'disconnected',
  guildId: 'guild id',
  userId: 'user id'
}

REST Endpoints:

Method	Endpoint	Description
GET	/records/:guildId	Returns a list with the ids of all recordings from the guild.
GET	/records/:guildId/:id	Downloads the mp3 audio file.
DELETE	/records/:guildId	Deletes all records from the guild.
DELETE	/records/:guildId/:id	Deletes one specific audio file.

TODO:

Add record op
Decode the opus frames provided by koe, using the lavaplayer libopus bindings (OpusDecoder)
Mix the pcm samples received from all users in the voice channel
Reduce heap memory allocations (were caused by foreach lol)
Encode the pcm frames in mp3 (using native C library libmp3lame)
Create REST endpoints to download and delete the recorded files
Handle AudioReceiver struct cleaning on voice channel disconnects
Mix the bot's audio with the other users

- Add darwin-aarch64 - Add linux-aarch64 & linux-x86-64 - Add win-x86-64

# Conflicts: # build.gradle.kts

5antos · 2022-06-26T13:10:52Z

It would be nice to have an option to filter certain users' audios, since some people may want to record only themselves or the bot without receiving the audio from other users connected to the voice channel

davidffa added 21 commits June 2, 2022 19:22

Update koe

ff8b75d

Implement voice receive (op: 'record')

2d2392b

Add LavalinkNatives module & refactor build script

ee090e1

Use native library (libmp3lame) to encode the pcm frames

e63bfb9

Add compiled natives for libmp3lame

971f144

- Add darwin-aarch64 - Add linux-aarch64 & linux-x86-64 - Add win-x86-64

Update koe

500e4a7

Ignore audio packets older than 100ms

515d9ff

Fix high heap allocations

9274d7e

Add records/ to gitignore

a74ae1b

Update koe

8d7869e

Fix NPE

d9f8bfc

Handle AudioReceiver cleanup

e38ab71

Allow sending record payload before voiceUpdate

b863122

Use id instead of channelid

9df9bb0

Start AudioReceiver if already connected

fa9bb95

Implement REST endpoints for audio receive system

0d1a2de

Fix response type

2de4559

Update koe & add Epoll support for linux aarch64

145ec30

Use NAS configurable

c8ca4e4

Add option to rec self audio & optimizations

82ae874

Merge branch 'dev' into feat/voice-receive

ef54adf

# Conflicts: # build.gradle.kts

davidffa marked this pull request as ready for review June 22, 2022 19:24

davidffa added 8 commits June 23, 2022 21:36

Fix race condition

6905f59

Only require id when start recording

f848c8f

Update koe

56567e2

Rewind the mp3 silence buf after write

52fec9e

Fix concurrency bug

e0cb838

Update koe

147d496

Emit event when finishing recording

e74c202

Update koe

e1b5203

Update koe

a25d063

davidffa added 7 commits June 26, 2022 18:37

Allow to select certain users to record

a60cc83

Support encoding mono audio on Mp3Encoder

ceea2ea

Initialize right channel array

7a55c93

Rebuilt natives

b6059ff

Support mono audio & pcm output

d3c13bf

Send speaking events

e0d5096

Refactor NAS disabled log

c14ab3f

davidffa merged commit 03cbe36 into dev Jul 8, 2022

davidffa deleted the feat/voice-receive branch July 8, 2022 10:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add voice receive support #2

Add voice receive support #2

Uh oh!

davidffa commented Jun 2, 2022 •

edited

Loading

Uh oh!

5antos commented Jun 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add voice receive support #2

Add voice receive support #2

Uh oh!

Conversation

davidffa commented Jun 2, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Record payload struct:

Added events:

You have to add 'Speaking-Events': 'true' on WebSocket headers in order to receive this events

REST Endpoints:

Uh oh!

5antos commented Jun 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

davidffa commented Jun 2, 2022 •

edited

Loading

You have to add `'Speaking-Events': 'true'` on WebSocket headers in order to receive this events