llm
A small language model running entirely in your browser via WebGPU.
No server, no API key. Your prompt never leaves your machine.
SmolLM2-1.7B, single-turn chat.
SmolLM2-1.7B-Instruct, single-turn chat via WebGPU with WebLLM.