C++ library for high performance inference on NVIDIA GPUs
FlashMLA: Efficient Multi-head Latent Attention Kernels
OneFlow is a deep learning framework designed to be user-friendly
Real-Time Event Frameworks based on active objects & state machines
Deep learning inference framework optimized for mobile platforms
A template for modern C++ projects using CMake, Clang-Format
Uniform deep learning inference framework for mobile
Caffe, a fast open framework for deep learning
Hashing and spatial concurrency library.