AIMET is a library that provides advanced quantization and compression
Libraries for applying sparsification recipes to neural networks
Neural Network Compression Framework for enhanced OpenVINO
Pytorch domain library for recommendation systems
MII makes low-latency and high-throughput inference possible
Tensor search for humans
Superduper: Integrate AI models and machine learning workflows
CPU/GPU inference server for Hugging Face transformer models