Feature Request
Omi is an impressive AI wearable that captures and processes audio. Suggesting SenseVoice as an ASR option — its speed and compact size make it ideal for wearable AI.
Why SenseVoice for Omi?
- Non-autoregressive — instant results, no waiting for sequential decoding
- 5x faster than Whisper — critical for always-on wearable processing
- 234M params — much lighter than Whisper large-v3 (1.5B), suitable for edge/mobile
- 50+ languages — single model for any language the user speaks
- Emotion detection — detects speaker emotions, valuable for context-aware AI
- Audio events — identifies laughter, music, applause (contextual awareness)
- ONNX available — deployable via Sherpa-ONNX on mobile (iOS/Android)
Mobile/Edge deployment options
- Sherpa-ONNX — native iOS/Android SDKs available, runs SenseVoice on-device
- Server-side — OpenAI-compatible API via funasr-server for cloud processing
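For the server-side option, here is a rough sketch of how a client could upload one recorded chunk. It assumes the server follows the OpenAI transcription convention (a multipart POST to `/v1/audio/transcriptions` with `model` and `file` fields, returning JSON with a `text` key). The model name `sensevoice`, the file name, and the base URL are placeholders, not values confirmed by funasr-server.

```python
import json
import urllib.request
import uuid


def build_transcription_request(base_url, wav_bytes, model="sensevoice", filename="chunk.wav"):
    """Build a multipart/form-data POST for an OpenAI-style transcription endpoint.

    The model name and filename defaults are hypothetical placeholders.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f"Content-Type: audio/wav\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/audio/transcriptions",  # assumed OpenAI-style path
        data=head + wav_bytes + tail,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )


def transcribe(base_url, wav_bytes, timeout=30.0):
    """Send one audio chunk and return the transcript text (assumes a JSON 'text' field)."""
    request = build_transcription_request(base_url, wav_bytes)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))["text"]
```

Usage would look like `transcribe("http://<server-host>:<port>", open("chunk.wav", "rb").read())`, with the host and port taken from however the server is actually deployed.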
Battery/Performance benefits
Non-autoregressive inference means:
- Predictable, minimal processing time per audio chunk
- Less CPU/GPU active time = better battery life
- No risk of slow transcription for long utterances
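To make the reasoning concrete, here is a toy cost model, not a benchmark. It assumes an autoregressive decoder pays one sequential step per output token, while a non-autoregressive model pays a single output pass for a fixed-length chunk. All function names and timing values are hypothetical and chosen only for illustration.

```python
def autoregressive_latency_ms(num_tokens, encoder_ms, step_ms):
    """One encoder pass, then one sequential decoder step per output token."""
    return encoder_ms + num_tokens * step_ms


def non_autoregressive_latency_ms(encoder_ms, output_ms):
    """One encoder pass plus one parallel output pass; no dependence on token count."""
    return encoder_ms + output_ms


def active_seconds_per_hour(latency_ms, chunk_seconds):
    """Compute time spent per hour of continuous audio, processed in fixed-length chunks."""
    chunks_per_hour = 3600 / chunk_seconds
    return chunks_per_hour * latency_ms / 1000


if __name__ == "__main__":
    # Hypothetical timings for a 10-second chunk; real values depend on model and hardware.
    for tokens in (5, 50):
        ar = autoregressive_latency_ms(tokens, encoder_ms=40, step_ms=10)
        nar = non_autoregressive_latency_ms(encoder_ms=40, output_ms=5)
        print(tokens, ar, nar, active_seconds_per_hour(nar, chunk_seconds=10))
```

In this model the autoregressive cost grows with how much the user says inside a chunk, while the non-autoregressive cost stays flat for a given chunk length. That flat cost is what makes processing time predictable and keeps the processor idle for more of each hour.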
FunASR: https://github.com/modelscope/FunASR (16.6K stars)
SenseVoice: https://github.com/FunAudioLLM/SenseVoice (8.3K stars)
Sherpa-ONNX (iOS/Android): https://github.com/k2-fsa/sherpa-onnx