Clean and efficient FP8 GEMM kernels with fine-grained scaling
Extension index for stable-diffusion-webui
Example Discord bot written in Python that uses the completions API
Fast uncensored Gemma model optimized for local chat and coding
Lightweight multimodal translation model for 55 languages
Large-scale xAI model for local inference with SGLang, Grok-2.5
High-efficiency reasoning and agentic intelligence model