Custom Local Whisper Models

Use your own fine-tuned Whisper models for improved domain-specific transcription accuracy.

Why use a custom local model?

Domain accuracy: Fine‑tuned models (e.g., medical, legal, meetings, accents) can outperform general models on your data.
Latency and cost: Local inference avoids network latency and API costs, though it can be slightly slower than Core ML optimized models if you're only using the GGML whisper.cpp compatible format.

A local Whisper model that you are trying to add should be whisper.cpp compatible format with the .bin extension (e.g., ggml-large-v3-turbo.bin).
Optional: For Core ML support, the Core ML model should have exactly the same name as the GGML model, with the format [model-name]-encoder.mlmodelc (e.g., ggml-large-v3-turbo-encoder.mlmodelc).

Language-Specific Models (Ready to Use):

Models Requiring Conversion (PyTorch format):

For more background, see VoiceInk docs: Custom Local Whisper Models.

From the model card menu, choose Delete Model to remove the file from VoiceInk. The card disappears immediately.
Show in Finder reveals the actual .bin file in the models directory.