Port of OpenAI's Whisper model in C/C++
Fast inference engine for Transformer models
GPU environment management and cluster orchestration
A Pythonic framework to simplify AI service building
A general-purpose probabilistic programming system
Protect and discover secrets using Gitleaks
Swift async text-to-image generation for SwiftUI apps using OpenAI
LLM training code for MosaicML foundation models
Operating LLMs in production
AIMET is a library that provides advanced quantization and compression
Open-Source AI Camera. Empower any camera/CCTV
A set of Docker images for training and serving models in TensorFlow
A unified framework for scalable computing
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Replace OpenAI GPT with another LLM in your app
Official inference library for Mistral models
A library for accelerating Transformer models on NVIDIA GPUs
MII makes low-latency and high-throughput inference possible
MNN is a blazing-fast, lightweight deep learning framework
PyTorch library of curated Transformer models and their components
C#/.NET binding of llama.cpp, including LLaMA/GPT model inference
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Library for serving Transformers models on Amazon SageMaker
State-of-the-art diffusion models for image and audio generation
Superduper: Integrate AI models and machine learning workflows