Get up and running with Llama 2 and other large language models
ClawdBot one-click deployment tool
Unified web UI for training and running open models locally
AI video generator optimized for low VRAM and older GPUs
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Official inference repo for FLUX.1 models
Port of Facebook's LLaMA model in C/C++
Optimize interaction with AI coding assistants
LLM Frontend for Power Users
Local AI WebUI for running and managing large language models offline
A lightweight, standalone C++ inference engine for Google's Gemma models
Agent Orchestration Command Center
Efficient Triton Kernels for LLM Training
Code for running inference with the SAM 3D Body Model 3DB
Optimize your code automatically with AI
Run Local LLMs on Any Device. Open-source
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Local CLI Copilot, powered by Ollama
AI edge infrastructure for macOS. Run local or cloud models
CLI tool for multi-agent workflows and automated code generation
Package and deploy machine learning models using Docker containers
Continue is the leading open-source AI code assistant
Build and run AI agents like microservices
Chat with your documents using local AI
Chat experience in your terminal