A lightweight vision library for performing large object detection
Inference script for Oasis 500M
Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models
Multilingual Automatic Speech Recognition with word-level timestamps
Fast stable diffusion on CPU and AI PC
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Lightweight Python library for adding real-time multi-object tracking
Redundancy-aware KV Cache Compression for Reasoning Models
Efficient few-shot learning with Sentence Transformers
Integrate, train and manage any AI models and APIs with your database
Spark-TTS Inference Code
The repository provides code for running inference with SAM 2
PyTorch library of curated Transformer models and their components
Technical principles related to large models
RGBD video generation model conditioned on camera input
Offline inference engine for art, real-time voice conversations
Library for OCR-related tasks powered by Deep Learning
Low-latency AI inference engine optimized for mobile devices
Unified KV Cache Compression Methods for Auto-Regressive Models
Accessible large language models via k-bit quantization for PyTorch
Probabilistic reasoning and statistical analysis in TensorFlow
Code for running inference with the SAM 3D Body Model 3DB
A Powerful Native Multimodal Model for Image Generation
Simplifies the local serving of AI models from any source