Implementation of TurboQuant (ICLR 2026)
A Collection of Cheatsheets, Books, Questions, and Portfolio
Fast and memory-efficient exact attention
A collaboration-friendly studio for NeRFs
AI video generator optimized for low-VRAM and older GPUs
Python chatbot framework with Natural Language Understanding
Z80-μLM is a 2-bit quantized language model
Query MCP enables end-to-end management of Supabase via chat interface
Create UIs for your machine learning model in Python in 3 minutes
ChatGPT extension for scientific research work
OCR software, free and offline
Optimizing inference proxy for LLMs
AI bridge enabling assistants to control and automate Unity Editor
Get your documents ready for gen AI
Qwen3-TTS is an open-source series of TTS models
Offline inference engine for art, real-time voice conversations
The Multi-Agent Framework
Why use many token when few token do trick
Minimal CLI coding agent by Mistral
Run PyTorch LLMs locally on servers, desktop and mobile
AI Toolkit for Healthcare Imaging
Open-Sora: Democratizing Efficient Video Production for All
Open source NLP guide with models, methods, and real use cases
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
Agent harness to make your slop code well-engineered and beautiful