Replace OpenAI GPT with another LLM in your app
Lightweight, standalone C++ inference engine for Google's Gemma models
Low-latency REST API for serving text-embeddings
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
A RWKV management and startup tool, full automation, only 8MB
A unified framework for scalable computing
Single-cell analysis in Python
Tensor search for humans
Uncover insights, surface problems, monitor, and fine-tune your LLM
Unified Model Serving Framework
High-level Deep Learning Framework written in Kotlin
State-of-the-art Parameter-Efficient Fine-Tuning
Data manipulation and transformation for audio signal processing
On-device Speech Recognition for Apple Silicon
The AI-native (edge and LLM) proxy for agents
Images to inference with no labeling
State-of-the-art diffusion models for image and audio generation
Turn your existing data infrastructure into a feature store
A general-purpose probabilistic programming system
Visual Instruction Tuning: Large Language-and-Vision Assistant
Easy-to-use deep learning framework with 3 key features
Create HTML profiling reports from pandas DataFrame objects
PyTorch domain library for recommendation systems
Powering Amazon custom machine learning chips
An Open-Source Programming Framework for Agentic AI