Python_WakeWordDetection

Python Wake Words Detection / Keywords Detection by Davoice

Welcome to Davoice WakeWord / Keywords Detection – Wake words and keyword detection solution designed by DaVoice.io.

🔵 🟢 🟡 🔴

About this project

This is a "wake word" package for Python.

A "wake word" is a keyword or a phrase that activates your device or commands your application, like "Hey Siri" or "OK Google". "Wake Word" is also known as "keyword detection", "Phrase Recognition", "Phrase Spotting", “Voice triggered”, “hotword”, “trigger word”

Except for "Python wake word" It also provide "Python Speech to Intent". Speech to Intent refers to the ability to recognize a spoken word or phrase and directly associate it with a specific action or operation within an application. Unlike a "wake word", which typically serves to activate or wake up the application, Speech to Intent goes further by enabling complex interactions and functionalities based on the recognized intent behind the speech.

For example, a wake word like "Hey App" might activate the application, while Speech to Intent could process a phrase like "Play my favorite song" or "Order a coffee" to execute corresponding tasks within the app. Speech to Intent is often triggered after a wake word activates the app, making it a key component of more advanced voice-controlled applications. This layered approach allows for seamless and intuitive voice-driven user experiences.

To be up-to-date We are now updating integration instructions to our website - https://davoice.io/integration-guides-wake-word/python

Features

Easy to use and deploy with Python: Check out our example code and install scripts.
Cross-Platform Support: Integrate Davoice "Python wake word" into most known HW architectures and OS.
Low Latency: Experience near-instantaneous keyword detection.
High Accuracy: We have successfully reached over 99% accuracy for all our models.
Real-World Benchmarks: At DaVoice, we believe in real benchmarks done by customers on actual use cases rather than static tests. We actively encourage our customers to share their real-world experiences and results.

🟢🟢 Customer Benchmarks 🟢🟢

Provided by: Tyler Troy, CTO & Co-Founder, LookDeep Health Context: Tyler .

🔵 Criterion I — False Positives (hospital relevance)

Customer Benchmark Ⅰ — LookDeep Health (Customer-reported):

Provided by Tyler Troy, Co-Founder at LookDeep Health

Tyler Troy at LookDeep Health reported benchmark below as part of selecting a "phrase detection" vendor.

RESULTS BELOW:

🔵 Criteria Ⅰ - False Positives

In hospital settings, false alerts waste valuable time and can compromise patient care.
✅ DaVoice: "ZERO FALSE POSITIVES" within a month duration of testing.
Porcupine (Picovoice): Several false alerts triggered daily observed under a similar setup.
OpenWakeWord was not tested for false positives because its true positive rate was too low.

Definition used by the customer: a “false positive” is a wake event when no wake phrase was spoken, counted over the monitored period.

🔵 Criteria II - True Positive

Table 1: A comparison of model performance on custom keywords

MODEL         DETECTION RATE
===========================
DaVoice                    0.992481 ✅
Porcupine (Picovoice)      0.924812
OpenWakeWords              0.686567

Source: Customer-reported results received on Dec 20, 2024.
OS: [Linux Python].
Models/versions: [hey_look_deep_model_28_08122024.py2].
Thresholds/params: [0.99].
Note: Results reflect this customer’s setup. Your results may vary.

Customer Benchmark II - customer preferred to remain anonymous

Benchmark on "Python wake word", vs top competitors:

Benmark used recordings with 1326 TP files.
Second best was on of the industry top players who detected 1160 TP
Third detected TP 831 out of 1326

Table 1: A comparison of model performance on custom keywords

MODEL         DETECTION RATE
===========================
DaVoice        0.992458
Top Player     0.874811
Third          0.626697

Platforms and Supported Languages

"Python wake word " on linux.x86_64
"Python wake word " on linux.aarch64
"Python wake word " on linux.armv7
"Python wake word " on linux.ppc64
"Python wake word " on linux.ppc64le
"Python wake word " on linux.s390x
"Python wake word " on darwin.x86_64
"Python wake word" on darwin.arm64
"Python wake word" on win32
"Python wake word" on win_amd64
"Python wake word" on win.arm64

Python Wake word generator

Create your "custom wake word" for Python

In order to generate your "custom wake word" you will need to:

Create Python wake word model: Contact us at info@davoice.io with a list of your desired "custom wake words".

We will send you corresponding models typically your wake word phrase .onnx for example:

A wake word *"hey sky" will correspond to hey_sky.onnx.
Add wake words to Python example: Simply copy your model onnx files to: example/models/

In example.py change the "need_help_now.onnx" to your model.onnx keyword_detection_models = ["models/need_help_now.onnx"] run python example.py

Contact

For any questions, requirements, or more support for other platforms, please contact us at info@davoice.io.

Installation and Usage

Clone this repo

Important!

Please edit the installation files (install.sh or first_time_install.sh) and change PYTHON_VERSION=3.12 to your python version!!!

First time installation without venv environment:

source first_time_installation.sh

If you already have venv environment:

source install.sh

Important!

Please edit the installation files and change PYTHON_VERSION=3.12 to your python version!!!

Demo Instructions

$ cd example $ python example.py

Screenshots from the demo App

Usage Example

See example

API Reference

Initialization

`KeywordDetection(keyword_models=keyword_detection_models)`

Creates a new keyword detection instance. keyword_detection_models is a list of model configuration dictionaries:

from keyword_detection import KeywordDetection

keyword_detection_models = [
    {
        "model_path": "models/your_wake_word.onnx",  # Path to the ONNX model file
        "callback_function": detection_callback,       # Function called on detection
        "threshold": 0.9,                              # Detection sensitivity (0.0 - 1.0)
        "buffer_cnt": 4,                               # | `buffer_cnt` | `int` Number of sub models to predict on the buffer -> more equals less false positives |

        "wait_time": 50                                # Wait time in ms between inferences
    }
]

keyword_model = KeywordDetection(keyword_models=keyword_detection_models)

You can add multiple models to the list to detect several wake words simultaneously.

License

`set_keyword_detection_license(license_key)`

Sets the license key for the library. The license key can be read from a file:

with open("licensekey.txt", "r") as file:
    license_key = file.read().strip()

keyword_model.set_keyword_detection_license(license_key)

Callbacks

Detection Callback

The callback function receives a params dictionary with the following keys:

Key	Type	Description
`phrase`	`str`	The detected wake word / phrase
`threshold_scores`	`list[float]`	Array of detection scores
`version`	`str` (optional)	Model version

def detection_callback(params):
    phrase = params["phrase"]
    threshold_scores = params["threshold_scores"]
    version = params.get("version", "N/A")
    print(f"Detected: {phrase} scores={threshold_scores} version={version}")

`set_secondary_callback(keyword_model_name, callback, secondary_threshold)`

Sets a secondary callback that fires when audio scores are higher than usual but below the primary detection threshold. Useful for logging near-detections and improving models:

def lower_threshold_callback(params):
    print(f"Near-detection for: {params['phrase']} scores: {params['threshold_scores']}")

for name in keyword_model.keyword_models_names:
    keyword_model.set_secondary_callback(
        keyword_model_name=name,
        callback=lower_threshold_callback,
        secondary_threshold=0.9
    )

Detection Modes

Mode 1: Internal Audio (Built-in Microphone Capture)

Use start_keyword_detection() when you want the library to handle microphone audio capture internally. This is the simplest approach.

Example (example/example.py for Linux/macOS, example_windows/example.py for Windows):

import threading

thread = threading.Thread(
    target=keyword_model.start_keyword_detection,
    kwargs={"enable_vad": False, "buffer_ms": 100}
)
thread.start()
thread.join()

Parameter	Type	Description
`enable_vad`	`bool`	Enable Voice Activity Detection
`buffer_ms`	`int`	Audio buffer size in milliseconds

Mode 2: External Audio (You Provide Audio Frames)

Use the external audio API when you need to capture and control audio yourself (e.g., from a custom source, network stream, or shared microphone). Audio frames must be 16-bit PCM, mono, 16 kHz.

Example (example/example_external_audio.py for Linux/macOS, example_windows/example_external_audio.py for Windows):

import pyaudio
import numpy as np

# 1. Start external audio detection (non-blocking)
keyword_model.start_keyword_detection_external_audio(enable_vad=False, buffer_ms=100)

# 2. Optionally start standalone VAD
keyword_model.start_vad_external_audio()

# 3. Feed audio frames in a loop
FORMAT = pyaudio.paInt16
CHANNELS = 1
RATE = 16000
CHUNK = 1280

p = pyaudio.PyAudio()
stream = p.open(format=FORMAT, channels=CHANNELS, rate=RATE,
                input=True, frames_per_buffer=CHUNK)

while True:
    data = stream.read(CHUNK, exception_on_overflow=False)
    audio_frame = np.frombuffer(data, dtype=np.int16)

    # Feed audio for wake word detection
    if keyword_model.is_listening:
        keyword_model.feed_audio_frame(audio_frame)

    # Feed audio for standalone VAD
    if keyword_model.is_listening_vad_stand_alone:
        speech_probability = keyword_model.feed_audio_frame_vad(audio_frame)

    # Noise level detection
    dbfs, actual_sound = keyword_model.feed_audio_frame_noise_detection(
        audio_frame, low_noise_margin_db=20, high_noise_margin_db=40
    )

External Audio API Methods

Method	Description
`start_keyword_detection_external_audio(enable_vad, buffer_ms)`	Initialize wake word detection for external audio
`start_vad_external_audio()`	Initialize standalone Voice Activity Detection
`feed_audio_frame(audio_frame)`	Feed a `numpy.int16` audio frame for wake word detection
`feed_audio_frame_vad(audio_frame)`	Feed audio for VAD; returns `float` speech probability (0.0 - 1.0)
`feed_audio_frame_noise_detection(audio_frame, low_noise_margin_db, high_noise_margin_db)`	Returns `(dbfs, sound_type)` where `sound_type` is e.g. `'silence'`, or an indication of detected sound

Properties

Property	Type	Description
`is_listening`	`bool`	`True` when wake word detection is active and ready for audio
`is_listening_vad_stand_alone`	`bool`	`True` when standalone VAD is active and ready for audio
`keyword_models_names`	`list[str]`	List of loaded model names

File-Based Detection

`start_keyword_detection_from_file(file_path)`

Run wake word detection on a .wav file. Returns a dictionary with detection results per model:

output = keyword_model.start_keyword_detection_from_file("path/to/audio.wav")
# output: { model_name: { "detections": <int>, ... }, ... }

Documentation

"Python Wake Word" API Reference
frymanofer.github.io

Benchmark.

Our customers have benchmarked our technology against leading solutions, including Picovoice Porcupine, Snowboy, Pocketsphinx, Sensory, and others. In several tests, our performance was comparable to Picovoice Porcupine, occasionally surpassing it, however both technologies consistently outperformed all others in specific benchmarks. For detailed references or specific benchmark results, please contact us at ofer@davoice.io.

Key words

DaVoice.io Voice commands / Wake words / Voice to Intent / keyword detection npm for Android and IOS. "Python Wake word detection github" "Python Wake word detection", "Python Wake word", "Python Phrase Recognition", "Python Phrase Spotting", “Python Voice triggered”, “Python hotword”, “Python trigger word”, "Wake word detection Python" "react-native wake word", "Wake word detection github", "Wake word generator", "Custom wake word", "voice commands", "wake word", "wakeword", "wake words", "keyword detection", "keyword spotting", "speech to intent", "voice to intent", "phrase spotting", "react native wake word", "Davoice.io wake word", "Davoice wake word", "Davoice react native wake word", "Davoice react-native wake word", "wake", "word", "Voice Commands Recognition", "lightweight Voice Commands Recognition", "customized lightweight Voice Commands Recognition", "rn wake word"

Links

Here are wakeword detection GitHub links per platform:

Web / JS / Angular / React: https://github.com/frymanofer/Web_WakeWordDetection/tree/main
For React Native: ReactNative_WakeWordDetection
For Android: KeywordsDetectionAndroidLibrary
For iOS framework:
- With React Native bridge: KeyWordDetectionIOSFramework
- Sole Framework: KeyWordDetection

Name		Name	Last commit message	Last commit date
Latest commit History 115 Commits
dist		dist
dist_new		dist_new
docs		docs
example		example
example_windows		example_windows
python_wake_word		python_wake_word
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
create_venv.sh		create_venv.sh
first_time_installation.sh		first_time_installation.sh
install.py		install.py
install.sh		install.sh
recordFromMicrophone.py		recordFromMicrophone.py
resampleTo16kHZ.sh		resampleTo16kHZ.sh

License

frymanofer/Python_WakeWordDetection

Folders and files

Latest commit

History

Repository files navigation

Python_WakeWordDetection

Python Wake Words Detection / Keywords Detection by Davoice

About this project

Features

🟢🟢 Customer Benchmarks 🟢🟢

Customer Benchmark Ⅰ — LookDeep Health (Customer-reported):

Provided by Tyler Troy, Co-Founder at LookDeep Health

RESULTS BELOW:

** 🔵 Criteria Ⅰ - False Positives**

🔵 Criteria II - True Positive

Customer Benchmark II - customer preferred to remain anonymous

Table 1: A comparison of model performance on custom keywords

Platforms and Supported Languages

Python Wake word generator

Create your "custom wake word" for Python

Contact

Installation and Usage

Important!

First time installation without venv environment:

If you already have venv environment:

Important!

Demo Instructions

Screenshots from the demo App

Usage Example

API Reference

Initialization

KeywordDetection(keyword_models=keyword_detection_models)

License

set_keyword_detection_license(license_key)

Callbacks

Detection Callback

set_secondary_callback(keyword_model_name, callback, secondary_threshold)

Detection Modes

Mode 1: Internal Audio (Built-in Microphone Capture)

Mode 2: External Audio (You Provide Audio Frames)

External Audio API Methods

Properties

File-Based Detection

start_keyword_detection_from_file(file_path)

Documentation

Benchmark.

Key words

Links

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

🔵 Criteria Ⅰ - False Positives

`KeywordDetection(keyword_models=keyword_detection_models)`

`set_keyword_detection_license(license_key)`

`set_secondary_callback(keyword_model_name, callback, secondary_threshold)`

`start_keyword_detection_from_file(file_path)`

Packages