Triple-engine speech recognition for real-time subtitle generation—online and offline, with up to 99 languages supported.
Sub!t integrates three speech recognition engines, each optimized for a different scenario. Switch between them instantly, depending on whether you need cloud-powered speed, fully offline privacy, or a lightweight streaming solution.
Deepgram: Online real-time streaming with ~200 ms latency. Keywords Boosting for specialized terminology. Best for live events with internet access.
Whisper: Offline recognition with Metal GPU acceleration. Supports 99 languages. Best accuracy for pre-recorded or high-fidelity scenarios.
Sherpa-onnx: Offline streaming (Zipformer, Chinese-English) and non-streaming (SenseVoice, 5 languages). Lightweight and privacy-first.
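As a rough illustration of how the three engines named on this page (Deepgram, Whisper, Sherpa-onnx) map onto the scenarios above, the choice can be sketched as a small decision function. This is not Sub!t's actual logic or API: the `Scenario` type, the `pick_engine` function, the engine identifier strings, and the language codes are all hypothetical names introduced for this example only.

```python
from dataclasses import dataclass

# Hypothetical identifiers that mirror the engines described on this page;
# they are not real Sub!t settings.
DEEPGRAM = "deepgram"
WHISPER = "whisper"
SHERPA_ONNX = "sherpa-onnx"

# Assumption: the offline streaming model (Zipformer) covers Chinese and English.
SHERPA_STREAMING_LANGUAGES = {"zh", "en"}


@dataclass
class Scenario:
    has_internet: bool     # is a reliable connection available at the venue?
    needs_streaming: bool  # must captions appear while the speaker is talking?
    language: str          # assumed short code such as "en", "zh", "ja"


def pick_engine(s: Scenario) -> str:
    """Suggest an engine for a scenario, following the descriptions above."""
    # Live captions with connectivity: the online streaming engine.
    if s.has_internet and s.needs_streaming:
        return DEEPGRAM
    # Live captions without connectivity: offline streaming, if the language fits.
    if s.needs_streaming and s.language in SHERPA_STREAMING_LANGUAGES:
        return SHERPA_ONNX
    # Everything else (pre-recorded audio, or a language the offline streaming
    # model does not cover): the offline engine with the widest language support.
    return WHISPER


if __name__ == "__main__":
    print(pick_engine(Scenario(has_internet=False, needs_streaming=True, language="zh")))
```

In practice the switch is made by the user in the app; the sketch only summarizes which engine fits which situation.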
Built-in OpenCC engine automatically converts between Traditional and Simplified Chinese for offline recognition results, including Taiwan-specific vocabulary mapping.
Real-time speech-to-subtitle with Deepgram for multilingual audiences. Keywords Boosting ensures proper nouns and technical terms are recognized correctly.
Offline recognition with Sherpa-onnx or Whisper—no internet required. Perfect for venues without reliable connectivity.
Generate real-time captions and pipe them through NDI output for live broadcast overlay.