Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Fully private LLM chatbot that runs entirely with a browser
Lightning-fast, on-device TTS, running natively via ONNX
Universal LLM Deployment Engine with ML Compilation
Tool that provides interactive visualizations for large embeddings
Bringing large-language models and chat to web browsers
Cross-Platform, GPU Accelerated Whisper
Chat with LLM like Vicuna totally in your browser with WebGPU
Easy-to-use headless React Hooks to run LLMs in the browser with WebGP