Repository containing notebooks of my posts on Medium
HunyuanVideo: A Systematic Framework For Large Video Generation Model
LLM Frontend for Power Users
Multi-lingual large voice generation model, providing inference
Designed for text embedding and ranking tasks
Create prompt-friendly codebase digests from any Git repository URL
Agent harness to make your slop code well-engineered and beautiful
Collection of Gemma 3 variants that are trained for performance
Long-form streaming TTS system for multi-speaker dialogue generation
Moonshot's most powerful AI model
Image generation model with single-stream diffusion transformer
Instant voice cloning by MIT and MyShell. Audio foundation model
A python library that makes AMR parsing, generation and visualization
Open Source Document Management System for Digital Archives
VS Code extension for LLM-assisted code/text completion
Annotate and review coding agent plans visually, share with your team
OCR expert VLM powered by Hunyuan's native multimodal architecture
tiktoken is a fast BPE tokeniser for use with OpenAI's models
21 Lessons, Get Started Building with Generative AI
Code and models for ICML 2024 paper, NExT-GPT
AI suite powered by state-of-the-art models and providing advanced AI
"Big Model" trains a visual multimodal VLM with 26M parameters
An open source implementation of CLIP
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Matter AI is open-source AI Code Reviewer Agent