Speech, understood.
In any language.
Transcribe and translate live audio with a developer-grade API. Pay only for what you use.
Everything you need to ship voice
A streaming pipeline, honest pricing and the kind of API you actually want to integrate.
Real-time
Streaming WebSocket API. Partial results within 200ms.
200+ languages
Fine-tuned acoustic model paired with NLLB-200 neural translation.
Flexible billing
Monthly plans for predictable cost, plus credit packs when you need to scale beyond your quota.
Developer-first
Bearer auth, a clean dashboard and usage metrics.
Capture, transcribe, translate
Three stages between a microphone and translated text — each one streaming.
Capture
Stream audio over a WebSocket from any client — browser, mobile or server.
Transcribe
Varmir Engine returns partial results in ~200ms as you speak.
Translate
NLLB-200 renders each phrase into your chosen target language.
Get started for free
- 1,000 min / month — full trial
- All 200+ languages
- Real-time WebSocket + REST
- Email notifications when quota runs low
- 1 active API key
- 8,000 min / month
- Real-time translation
- Usage dashboard
- Email support
- 5 active API keys
- 80,000 min / month
- Priority inference queue
- Unlimited team API keys
- Priority support
- SSO & audit logs
- Unlimited min / month
- Dedicated GPU + SLA
- Self-hosted option
- Dedicated account manager
- Custom model fine-tuning
Credit packs available in your dashboard when you exceed the monthly quota
Hear it work in real time
Open the live console, turn on your microphone and watch speech become text and translation.