Powerful AI language model (MoE) optimized for efficiency/performance
A Simple and Universal Swarm Intelligence Engine
Run Local LLMs on Any Device. Open-source
Python bindings for llama.cpp
Advanced language and coding AI model
An LLM-powered knowledge curation system that researches topics
Universal LLM Deployment Engine with ML Compilation
Open-source, high-performance AI model with advanced reasoning
A high-throughput and memory-efficient inference and serving engine
Access large language models from the command-line
High-performance inference framework for large language models
CNCF Sandbox Project
Qwen3 is the large language model series developed by Qwen team
LLM based data scientist, AI native data application
Chat with your documents using local AI
The official repo of Qwen chat & pretrained large language model
Operating LLMs in production
Phi-3.5 for Mac: Locally-run Vision and Language Models
I Agent designed to interact with ROS1- and ROS2-based robotics system
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A guidance language for controlling large language models
High-performance Inference and Deployment Toolkit for LLMs and VLMs
Adding guardrails to large language models
Tools like web browser, computer access and code runner for LLMs
Uncertainty Quantification for Language Models, is a Python package