Transcription models

Dictato runs two model families, both entirely on-device. Pick one in Settings → Model. Unselected models download on demand. Once you've picked a model, see Languages to narrow down what Dictato listens for.

Model settings

Parakeet (recommended)

NVIDIA's Parakeet family, running through Apple Neural Engine. Fastest on modern Apple Silicon.

Model	Size	Best for
Parakeet V3	~600 MB	Multilingual (25 languages), ~210× real-time on M4
Parakeet V2	~400 MB	English only, highest recall

WhisperKit

OpenAI's Whisper models compiled for CoreML. Slower than Parakeet but more widely supported.

Model	Size	Best for
Base	~150 MB	Lightweight, good for most languages
Distil Large V3	~600 MB	6× faster than Large V3, within 1% accuracy
Medium	~1.5 GB	Higher accuracy across languages
Medium (English)	~1.5 GB	Higher accuracy for English only

Which should I pick?

English only, fast machine: Parakeet V2
Multilingual: Parakeet V3
Older Mac or wide language support: WhisperKit Base or Medium
Best accuracy, don't mind size: WhisperKit Medium or Distil Large V3

Info

Models download the first time you select them. Parakeet downloads lazily on first use; WhisperKit downloads up front so you can cancel and retry.