speech-to-text
English and French speech recognition running in your browser.
Rust โ WebAssembly + WebGPU. No server, no API key.
Your audio never leaves your machine.
Quantized Kyutai STT model (Mimi audio codec + text decoder), custom Rust inference compiled to WebAssembly, GPU acceleration via WebGPU.