Custom Local Whisper Models
Use your own fine-tuned Whisper models for improved domain-specific transcription accuracy.
Why use a custom local model?
- Domain accuracy: Fine‑tuned models (e.g., medical, legal, meetings, accents) can outperform general models on your data.
- Latency and cost: Local inference avoids network latency and API costs, though it can be slightly slower than Core ML optimized models if you're only using the GGML whisper.cpp compatible format.
Requirements
- A local Whisper model that you are trying to add should be whisper.cpp compatible format with the
.binextension (e.g.,ggml-large-v3-turbo.bin). - Optional: For Core ML support, the Core ML model should have exactly the same name as the GGML model, with the format
[model-name]-encoder.mlmodelc(e.g.,ggml-large-v3-turbo-encoder.mlmodelc).
Available Fine-Tuned Models
Language-Specific Models (Ready to Use):
- Finnish/Swedish (
Large-v3) - Nepali (
Large-v3) - German (
Large-v3-Turbo) - Hebrew (
Large-v3) - Korean (
Medium) - Swedish (
Large) - Hindi (
Base)
Models Requiring Conversion (PyTorch format):
Conversion Resources
- GGML whisper.cpp compatible conversion: whisper.cpp GGML format
- Core ML conversion: whisper.cpp Core ML support
For more background, see VoiceInk docs: Custom Local Whisper Models.
How to import in VoiceInk
- Open VoiceInk →
AI Models. - Go to the
Localtab. - Scroll to the bottom and click
Import Local Model…. - Select your
.binfile. You will be able to use it now with VoiceInk. - Click
Set as Defaulton the imported model to use it for transcription.
Deleting and managing models
- From the model card menu, choose
Delete Modelto remove the file from VoiceInk. The card disappears immediately. Show in Finderreveals the actual.binfile in the models directory.