Feature Request: Add FunASR as self-hosted STT provider

Hi! Omi is an impressive AI second brain, and I like its extensible STT provider system.

I'd like to suggest adding [FunASR](https://github.com/modelscope/FunASR) (16K+ GitHub stars) as a self-hosted STT provider and an alternative to Deepgram:

**Why FunASR for Omi:**
- **Self-hosted & private**: All processing runs locally, so no audio leaves the device/server
- **170x real-time GPU speed**: At that rate an hour of audio takes roughly 21 seconds to process, so real-time conversation transcription is easily handled
- **Built-in speaker diarization**: The cam++ model (7.2M params) identifies different speakers, which is key for Omi's multi-speaker scenarios
- **Emotion detection**: SenseVoice detects emotions and audio events alongside transcription
- **50+ languages**: Automatic language detection via SenseVoice model
- **OpenAI-compatible API**: Serves `/v1/audio/transcriptions`, so it can act as a drop-in replacement for cloud APIs that follow the same convention

```bash
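# Install FunASR plus the packages used to serve it over HTTP
pip install funasr vllm fastapi uvicorn python-multipart
# Start the server on a CUDA GPU
funasr-server --device cuda
# Transcription endpoint:
# POST http://localhost:8000/v1/audio/transcriptions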
```
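
For anyone who wants to try the endpoint from Python, here is a minimal client sketch using only the standard library. The URL comes from the snippet above. Everything else is an assumption on my part: the `file` and `model` form fields follow OpenAI's transcription API convention, `"sensevoice"` is a placeholder model name rather than a confirmed FunASR identifier, and I'm assuming the response is JSON with a `text` field. The helper names are hypothetical and not part of Omi or FunASR.

```python
import json
import uuid
import urllib.request

# Endpoint taken from the snippet above; the rest of this file is an assumption.
FUNASR_URL = "http://localhost:8000/v1/audio/transcriptions"


def build_transcription_request(audio: bytes,
                                filename: str = "audio.wav",
                                model: str = "sensevoice",
                                url: str = FUNASR_URL) -> urllib.request.Request:
    """Build a multipart/form-data POST in the OpenAI transcription style.

    "file" and "model" are the field names OpenAI's API uses; "sensevoice"
    is a placeholder model name, not a confirmed FunASR identifier.
    """
    boundary = uuid.uuid4().hex
    # Text part ("model") followed by the headers of the binary part ("file").
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f"Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    # Closing boundary that terminates the multipart body.
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        url,
        data=head + audio + tail,
        method="POST",
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )


def transcribe(audio: bytes, **kwargs) -> str:
    """Send the audio to the server and return the transcript text.

    Assumes an OpenAI-style JSON response with a "text" field.
    """
    req = build_transcription_request(audio, **kwargs)
    with urllib.request.urlopen(req, timeout=60) as resp:
        payload = json.load(resp)
    return payload.get("text", "")
```

With the server running, `transcribe(open("meeting.wav", "rb").read())` should return the transcript if those assumptions hold. Keeping request construction separate from the network call would also make a provider adapter easier to unit-test without a GPU.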

This would give privacy-conscious Omi users a fully self-hosted alternative to Deepgram with comparable features (diarization + emotion).

Feature Request: Add FunASR as self-hosted STT provider #7551

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Feature Request: Add FunASR as self-hosted STT provider #7551

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions