Browse free open source Python LLM Inference Tools and projects below. Use the toggles on the left to filter open source Python LLM Inference Tools by OS, license, language, programming language, and project status.
Superduper: Integrate AI models and machine learning workflows
A set of Docker images for training and serving models in TensorFlow
Uplift modeling and causal inference with machine learning algorithms
PyTorch library of curated Transformer models and their components
Deep learning optimization library: makes distributed training easy
Low-latency REST API for serving text-embeddings
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Neural Network Compression Framework for enhanced OpenVINO
Trainable models and NN optimization tools
Efficient few-shot learning with Sentence Transformers
Optimizing inference proxy for LLMs
MII makes low-latency and high-throughput inference possible
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
LLMFlows - Simple, Explicit and Transparent LLM Apps
A high-performance ML model serving framework, offers dynamic batching
Run 100B+ language models at home, BitTorrent-style
Phi-3.5 for Mac: Locally-run Vision and Language Models
Training and deploying machine learning models on Amazon SageMaker
Single-cell analysis in Python
Integrate, train and manage any AI models and APIs with your database
Replace OpenAI GPT with another LLM in your app
LLM training code for MosaicML foundation models
Visual Instruction Tuning: Large Language-and-Vision Assistant
A Pythonic framework to simplify AI service building
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs