
Voice to text.
On your Mac.
Local first.
macOS menu bar app for voice dictation, translation, and hands-free input. Powered by on-device AI on Apple Silicon.
30-day free trial · Then $49.99 one-time · macOS 14+ · Apple Silicon (M1+)

Five ways to use your voice
Each mode is designed for a different workflow. Use them independently or layer translation on top.
Dictation
Push-to-Talk
Hold your hotkey, speak, release. Text appears instantly wherever your cursor is.
Handsfree
Voice-Activated
Say the wake word and start dictating. No keyboard needed. Voice commands like "end" flush your text.
Subtitles
System Audio Capture
Capture system audio and display real-time subtitles. Perfect for meetings, lectures, and videos.
Translation
Toggle Modifier
Layer translation on top of any mode. Speak in one language, output in another. 10 language pairs.
Select & Translate
Hotkey Powered
Select any text on screen, press a hotkey, and get an instant translation right at your cursor.
Turn audio files into text
Drop any audio file and get accurate transcriptions or subtitle files. All processed locally on your Mac.
Drag & Drop
Drop audio files directly onto Sotto. Supports MP3, WAV, M4A, and more.
Text & Subtitles
Export as plain text (.txt) or subtitle files (.srt) with timestamps.
Batch Processing
Queue multiple files at once. Sotto processes them in sequence on-device.
Local by default.
Your voice data never leaves your Mac. No accounts, no data collection. Ever.
No Internet Required
Every model runs on-device using Apple Silicon GPU. Works in airplane mode.
No API Keys Needed
Works fully offline. Optional cloud API available for translation only.
No Data Collection
Your voice never leaves your Mac. No analytics, no telemetry, no accounts.
Up and running in minutes
One download, one setup wizard, zero configuration headaches.
Install
Download the .dmg, drag to Applications. One binary, no dependencies.
Setup
7-step wizard handles permissions, model download, and mic calibration.
Speak
Press your hotkey and talk. Text appears at your cursor. That's it.
Always know what Sotto is doing
A floating pill widget follows your cursor, showing real-time status with animated equalizer bars.
99 languages, one model
Whisper handles multilingual speech recognition with automatic language detection.
English
Korean
Japanese
Chinese
Cantonese
Arabic
Czech
Danish
Dutch
Filipino
Finnish
French
German
Greek
Hindi
Hungarian
Indonesian
Italian
Macedonian
Malay
Persian
Polish
Portuguese
Romanian
Russian
Spanish
Swedish
Thai
Turkish
Vietnamese
Auto Detect
How Sotto stacks up
Pay once, own forever. Fully local. No compromises.
| SottoLIFETIME | Cloud-based Tools | Subscription Apps | |
|---|---|---|---|
| 100% Local* | |||
| No Subscription | |||
| Handsfree Mode | |||
| Live Subtitles | |||
| Translation | |||
| File Transcription | |||
| Price | $49.99 once | $8–15/mo | $8–15/mo |
*Local by default. Optional API mode available for cloud-based models.
Try everything free for 30 days.
No subscriptions. No recurring fees. Pay once, use forever.
Pro
30-DAY FREE TRIALOne-time payment · Lifetime license
Download Free TrialAll Pro features for 30 days · No credit card required
Buy Now — $49.99Everything included
- All 5 modes + file transcription
- Unlimited recording
- Lifetime updates
- 99 languages
- 100% local — optional cloud API for translation
- No subscription ever
After trial without purchase
- Dictation mode only
- 1-minute recording limit
Competitors charge $8–15/mo. Sotto Pro pays for itself in 5 months.