VoiceInk Docs
AI Models

Local Models

Download Parakeet, Whisper, and Apple Speech models for on-device transcription.

Parakeet Models

VoiceInk includes NVIDIA Parakeet models through the Local tab:

ModelNotes
Parakeet V3Realtime, multilingual, used by onboarding.
Parakeet V2Realtime, English-only.

Click Download on the model card. Once downloaded, the model is available in Mode transcription settings.

Downloaded Parakeet models can be deleted or revealed in Finder from the model card menu.

Apple Speech

Apple Speech uses the native macOS Speech framework. It is:

  • Built in.
  • On-device.
  • Multilingual.
  • Available only on macOS 26 or later.

For Apple Speech language selection, VoiceInk can show whether a language asset is downloaded. If macOS allows it, VoiceInk can start downloading the selected language asset. Apple Speech can reserve a limited number of languages, so you may need to remove one before adding another.

Whisper Models

VoiceInk includes these predefined Whisper models:

  • Tiny
  • Tiny (English)
  • Base
  • Base (English)
  • Large v2
  • Large v3
  • Large v3 Turbo
  • Large v3 Turbo (Quantized)

Download a Whisper model from its card. If you no longer need it, delete it from the card menu.

Import Local Whisper Models

Use Import Local Model... to add a custom whisper.cpp-compatible .bin file.

Requirements:

  • The file must have a .bin extension.
  • VoiceInk treats imported models as Whisper models.
  • Imported models are multilingual by default in VoiceInk.

Steps:

  1. Open AI Models -> Local.
  2. Click Import Local Model....
  3. Select your .bin file.
  4. Open Modes and choose the imported model in a Mode's Transcription -> Model picker.

Local Enhancement Providers

The bottom of the Local tab also includes:

ProviderWhat it does
OllamaConnects to a local Ollama server, defaulting to http://localhost:11434.
Local CLISends enhancement prompts to a command template such as Claude, Codex, a script, or any CLI command.

Local CLI exposes VOICEINK_SYSTEM_PROMPT, VOICEINK_USER_PROMPT, and VOICEINK_FULL_PROMPT environment variables. It also writes the full prompt to stdin.