Transcription
How Shout converts speech to text — models, modes, and system audio capture.
How it works
Shout runs OpenAI's Whisper models locally through WhisperKit, Argmax's implementation optimized for Apple Silicon. Transcription happens entirely on your Mac, so your audio is never sent to a server.
Whisper models
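Choose a model in Settings to balance speed and accuracy for your hardware: larger models are more accurate but slower and use more memory. English-only (.en) variants are generally more accurate on English content than the multilingual model of the same size, especially at the smaller sizes.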
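As a rough illustration of how the size and .en naming fits together, here is a minimal Python sketch. The size names follow the open-source Whisper release; which of them Shout actually offers in Settings is not stated here, and `model_name` is a hypothetical helper, not part of Shout or WhisperKit.

```python
# Size names from the open-source Whisper release (an assumption about
# what a settings screen might list, not Shout's actual menu).
SIZES = ["tiny", "base", "small", "medium", "large"]

# English-only (.en) variants exist for every size except large.
ENGLISH_ONLY_SIZES = {"tiny", "base", "small", "medium"}


def model_name(size: str, english_only: bool = False) -> str:
    """Return a Whisper model identifier such as 'base.en' or 'large'."""
    if size not in SIZES:
        raise ValueError(f"unknown model size: {size!r}")
    if english_only and size in ENGLISH_ONLY_SIZES:
        return f"{size}.en"
    # No English-only variant for this size: use the multilingual model.
    return size
```

For example, `model_name("base", english_only=True)` gives `"base.en"`, while asking for an English-only `large` falls back to the multilingual `"large"`.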
Transcription modes
Real-time transcription
See your words appear as you speak. Shout streams partial results in real time and refines them when you stop recording.
Retroactive transcription
Go back in time using the timeline editor and transcribe any segment from the always-on buffer — even audio you didn't explicitly record.
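An always-on buffer of this kind is commonly built as a fixed-size rolling buffer that discards the oldest audio as new audio arrives. The Python sketch below shows the idea only; Shout's internal design, buffer length, and sample format are not documented here, so `RollingAudioBuffer`, its parameters, and the 16 kHz default (the rate Whisper models expect as input) are assumptions for illustration.

```python
from collections import deque


class RollingAudioBuffer:
    """Keeps only the most recent `seconds` of mono audio samples."""

    def __init__(self, seconds: float, sample_rate: int = 16_000):
        self.sample_rate = sample_rate
        # A deque with maxlen silently drops the oldest samples when full.
        self._samples = deque(maxlen=int(seconds * sample_rate))
        self._total_written = 0  # samples ever appended, including dropped ones

    def append(self, chunk):
        """Add a list of samples captured from the input device."""
        self._samples.extend(chunk)
        self._total_written += len(chunk)

    def segment(self, start_s: float, end_s: float):
        """Return samples between start_s and end_s, in seconds since
        capture began. Audio older than the buffer is no longer available."""
        # Absolute index of the oldest sample still held in the buffer.
        oldest = self._total_written - len(self._samples)
        start = max(int(start_s * self.sample_rate), oldest)
        end = min(int(end_s * self.sample_rate), self._total_written)
        if end <= start:
            return []
        return list(self._samples)[start - oldest : end - oldest]
```

A timeline selection then maps to a `segment(start_s, end_s)` call, and the returned samples are what would be handed to the transcription model. A selection that reaches further back than the buffer holds simply comes back shorter, or empty.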
System audio capture
Capture from other apps
Record audio from meetings, calls, podcasts, and videos alongside your microphone input. Enable system audio in Settings.
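Recording system audio alongside the microphone means two streams have to be combined or kept in step at some point. How Shout does this is not described here; the Python sketch below shows one simple approach, summing two mono streams with a hard clip, and assumes both streams share a sample rate and use float samples in the range -1.0 to 1.0. The `mix` function and its gain parameters are hypothetical.

```python
def mix(mic, system, mic_gain=1.0, system_gain=1.0):
    """Mix two equal-rate mono streams of float samples in [-1.0, 1.0]."""
    length = max(len(mic), len(system))
    mixed = []
    for i in range(length):
        # Treat the shorter stream as silence once it runs out.
        m = mic[i] if i < len(mic) else 0.0
        s = system[i] if i < len(system) else 0.0
        value = m * mic_gain + s * system_gain
        # Hard-clip so the sum stays within the valid sample range.
        mixed.append(max(-1.0, min(1.0, value)))
    return mixed
```

Lowering `system_gain` is one way to keep a loud call or video from drowning out the speaker's own voice before transcription.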