Voice-to-Text, Instantly
An extremely lightweight Wispr Flow alternative. Hold a hotkey, speak, release, and your words get pasted.
One binary. Zero bloat. Speech that actually works for you.
Website • Quick Start • GitHub
Golos (Russian for "voice") is a free, open-source macOS CLI that turns your voice into text wherever your cursor is. Hold a hotkey to record, release to transcribe, and the result is instantly pasted into the focused application.
How it works:
- Hold your hotkey (default:
Right OptionorLeft Option) to start recording - Speak — audio streams to Deepgram Nova-3 in real time with a live transcript in your terminal
- Release — the final transcription is pasted into whatever app is focused via
Cmd+V
Key features:
- Push-to-talk — no always-on microphone, only records while the hotkey is held
- Single binary — no Electron, no GUI, no background service required
- Foreground or background mode — run interactively or daemonize with
golos -d - Live feedback — VU meter and interim transcript displayed in real time while speaking
- Dictionary replacements — map spoken words to text (e.g. say "period" →
., "new line" →\n) - Configurable hotkey —
right_option,right_command,fn,f18, orf19 - Two output modes — paste into focused app (
clipboard) or print tostdoutfor piping - Config layering — defaults → config file → environment variables → CLI flags
curl -fsSL https://raw.githubusercontent.com/basilysf1709/golos/main/install.sh | bashGolos needs Accessibility permission to listen for your hotkey and paste transcriptions. After running Golos for the first time, macOS will prompt you to grant access:
- Open System Settings → Privacy & Security → Accessibility
- Enable the toggle for your terminal app (e.g. Terminal, iTerm2, Alacritty)
If you skip this step, Golos won't be able to detect the hotkey or paste text.
golos setup # configure API key
golos # run in foreground
golos -d # run in background
golos --output stdout # output to stdout instead of clipboard
golos --hotkey cmd # override hotkey
golos stop # stop background process| Command | Description |
|---|---|
golos setup |
Configure Deepgram API key |
golos |
Run speech-to-text (foreground) |
golos -d |
Run speech-to-text (background) |
golos stop |
Stop the background process |
golos add <phrase> <replacement> |
Add a dictionary replacement |
golos delete <phrase> |
Delete a dictionary entry |
golos list |
List all dictionary entries |
golos import <file.toml> |
Import dictionary from a TOML file |
| Flag | Description |
|---|---|
-d, --detach |
Run in background |
--output <mode> |
Override output mode (clipboard or stdout) |
--hotkey <key> |
Override hotkey |
Manage word/phrase replacements that are applied to transcriptions:
golos add "period" "."
golos add "new line" "\n"
golos delete "period"
golos list
golos import dictionary.example.tomlConfig file: ~/.config/golos/config.toml
deepgram_api_key = "your-key"
hotkey = "right_option"
output_mode = "clipboard"
sample_rate = 16000
language = "en-US"Environment variables DEEPGRAM_API_KEY, GOLOS_OUTPUT, and GOLOS_HOTKEY override config values.
macOS, a Deepgram API key, and Accessibility permission for your terminal.
