Streaming Voice Activity Detection over WebSocket using FireRedVAD ONNX models. Includes optional Audio Event Detection (AED) to classify speech segments as speech, music, or noise.
- Python 3.10+
- ONNX model files in `onnx_models/`:
  - `fireredvad_stream_vad_with_cache.onnx` — streaming VAD model
  - `cmvn.ark` — CMVN normalization stats
  - `fireredvad_aed.onnx` — audio event detection model (optional)
```
uv sync
```

The server accepts streaming 16kHz 16-bit mono PCM audio over WebSocket, runs VAD to detect speech segments, and optionally classifies each segment using the AED model.
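Binary audio must arrive as raw little-endian int16 samples. As an illustration (the helper name is hypothetical, not part of this repo), packing float samples into that wire format looks like:

```python
import struct

def float_to_pcm16(samples):
    """Pack float samples in [-1.0, 1.0] into 16-bit little-endian PCM bytes,
    the wire format the server expects for binary messages."""
    ints = [int(max(-1.0, min(1.0, s)) * 32767) for s in samples]
    return struct.pack(f"<{len(ints)}h", *ints)

# Example: one second of 16 kHz silence is 32000 bytes on the wire
payload = float_to_pcm16([0.0] * 16000)
```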
```
python server.py
```

| Flag | Default | Description |
|---|---|---|
| `--host` | `0.0.0.0` | Bind address |
| `--port` | `8765` | WebSocket port |
| `--model` | `onnx_models/fireredvad_stream_vad_with_cache.onnx` | VAD model path |
| `--cmvn` | `onnx_models/cmvn.ark` | CMVN stats path |
| `--aed-model` | `onnx_models/fireredvad_aed.onnx` | AED model path (skipped if not found) |
| `--output-dir` | `vad_output` | Directory for saved audio segments |
Client sends:
- Binary messages: raw int16 little-endian PCM audio at 16kHz
- JSON messages: `{"action": "reset"}` to reset VAD state
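The sending side can be sketched as follows (helper names and the 100 ms chunk size are assumptions, not mandated by the server): slice a 16 kHz mono WAV into fixed-size PCM frames for the binary messages, with the reset action as a JSON text message.

```python
import io
import json
import wave

CHUNK_SAMPLES = 1600  # 100 ms at 16 kHz; an assumed frame size, not required by the protocol

def pcm_chunks(wav_bytes, chunk_samples=CHUNK_SAMPLES):
    """Yield raw int16 little-endian PCM chunks from an in-memory WAV file."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wf:
        if (wf.getframerate(), wf.getsampwidth(), wf.getnchannels()) != (16000, 2, 1):
            raise ValueError("expected 16 kHz 16-bit mono WAV")
        while True:
            frames = wf.readframes(chunk_samples)
            if not frames:
                break
            yield frames  # each chunk goes out as one binary WebSocket message

# The reset control message is sent as a JSON text message:
RESET_MESSAGE = json.dumps({"action": "reset"})
```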
Server sends:
Speech start:

```json
{"event": "speech_start", "time": 1.234}
```

Speech end (with AED when enabled):

```json
{
  "event": "speech_end",
  "start": 1.234,
  "end": 3.456,
  "file": "vad_output/session_.../segment_0001_1.23s_3.46s.wav",
  "aed_label": "speech",
  "aed_probs": {"speech": 0.95, "music": 0.03, "noise": 0.02}
}
```

The client streams audio to the server and prints VAD events.
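Handling these events on the receiving side can be sketched like this (the summary strings are illustrative; the real client may print differently):

```python
import json

def summarize_event(message):
    """Turn a JSON event message from the server into a one-line summary."""
    evt = json.loads(message)
    if evt["event"] == "speech_start":
        return f"speech started at {evt['time']:.2f}s"
    if evt["event"] == "speech_end":
        # aed_label is present only when the AED model is loaded
        label = evt.get("aed_label", "n/a")
        return f"speech {evt['start']:.2f}s-{evt['end']:.2f}s ({label})"
    return f"unhandled event: {evt.get('event')}"
```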
```
python client.py --file audio.wav
```

The file must be 16kHz 16-bit mono WAV. Convert with ffmpeg if needed:
```
ffmpeg -i input.wav -ar 16000 -ac 1 -acodec pcm_s16le audio.wav
```

```
python client.py --mic
```

Press Ctrl+C to stop.
| Flag | Default | Description |
|---|---|---|
| `--uri` | `ws://localhost:8765` | WebSocket server URI |
| `--file` | | WAV file to stream |
| `--mic` | | Stream from microphone |