Structured outputs for LLMs
Python bindings for llama.cpp
Build AI WhatsApp Bots with Pure Python
A Simple and Universal Swarm Intelligence Engine
Run Local LLMs on Any Device. Open-source
Run models like Kimi-K2.5, GLM-5, DeepSeek, gpt-oss, Gemma, Qwen etc.
Port of Facebook's LLaMA model in C/C++
A high-throughput and memory-efficient inference and serving engine
Low-code app builder for RAG and multi-agent AI applications
Interact with your documents using the power of GPT
Powerful AI language model (MoE) optimized for efficiency/performance
Agentic, Reasoning, and Coding (ARC) foundation models
Operating LLMs in production
Lightweight package to simplify LLM API calls
A Gym environment for web task automation
Language-model investigation agent with a terminal UI
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Universal LLM Deployment Engine with ML Compilation
Advanced language and coding AI model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
PandasAI is a Python library that integrates generative AI
Qwen3 is the large language model series developed by the Qwen team
A modular graph-based Retrieval-Augmented Generation (RAG) system
Access large language models from the command-line
Uncertainty Quantification for Language Models is a Python package