Bring the notion of Model-as-a-Service to life
A set of Docker images for training and serving models in TensorFlow
Replace OpenAI GPT with another LLM in your app
Easiest and laziest way for building multi-agent LLMs applications
Pytorch domain library for recommendation systems
PyTorch extensions for fast R&D prototyping and Kaggle farming
Library for OCR-related tasks powered by Deep Learning
Lightweight Python library for adding real-time multi-object tracking
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Multilingual Automatic Speech Recognition with word-level timestamps
Superduper: Integrate AI models and machine learning workflows
A high-performance ML model serving framework, offers dynamic batching
Libraries for applying sparsification recipes to neural networks
A library for accelerating Transformer models on NVIDIA GPUs
Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Trainable models and NN optimization tools
Probabilistic reasoning and statistical analysis in TensorFlow
PyTorch library of curated Transformer models and their components
Simplifies the local serving of AI models from any source
Build your chatbot within minutes on your favorite device
Efficient few-shot learning with Sentence Transformers
Official inference library for Mistral models