Run models like Kimi-K2.5, GLM-5, DeepSeek, gpt-oss, Gemma, Qwen etc.
Port of Facebook's LLaMA model in C/C++
Port of OpenAI's Whisper model in C/C++
TEN, a voice agent framework to create conversational AI.
Open-source vector similarity search for Postgres
Run OpenClaw on a $5 chip
kaldi-asr/kaldi is the official location of the Kaldi project
The Operator Splitting QP Solver
Flux 2 image generation model pure C inference
A library for audio and music analysis, feature extraction
Open-source framework for conversational voice AI agents
Your personal AI assistant at all-in 888KiB
AI video generator optimized for low VRAM and older GPUs use
Next-gen AI+IoT framework for T2/T3/T5AI/ESP32/and more
Inference Llama 2 in one file of pure C
A fast image processing library with low memory needs
C++ and Python Examples
Provides CTP stock options and Zhongtai Securities XTP
Run a 1-billion parameter LLM on a $10 board with 256MB RAM
FAIR Sequence Modeling Toolkit 2
Foundational Models for State-of-the-Art Speech and Text Translation
C++ inference library for multiple SVC/TTS
llama and other large language models on iOS and MacOS offline
Android inline hook library which supports thumb, arm32 and arm64
Low-latency AI inference engine optimized for mobile devices