Efficient Triton Kernels for LLM Training
IPython Kernel for Jupyter
FlashInfer: Kernel Library for LLM Serving
A book-in-progress about the Linux kernel and its insides
A NumPy-compatible array library accelerated by CUDA
A set of utilities for monitoring and customizing GPU performance
A Python framework for accelerated simulation, data generation
Shredos Disk Eraser 64 bit for all Intel 64 bit processors
Development repository for the Triton language and compiler
Performance meets Productivity
Pytest in IPython notebooks
Jupyter notebook integration with Spyder
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Pytest plugin for testing notebooks
A python parametric CAD scripting framework based on OCCT
An experimental version of DeepSeek model
Voilà turns Jupyter notebooks into standalone web applications
Qiling Advanced Binary Emulation Framework
Deep and Machine Learning for Microscopy
Low-latency AI inference engine optimized for mobile devices
A book about how to write OS kernels in Rust easily
Library for efficiently connecting and optimizing teams of AI agents
A Powerful Native Multimodal Model for Image Generation
How to optimize some algorithm in cuda
Automate native Android apps with AI using accessibility APIs