Comprehensive Gradio WebUI for audio processing
Gracefully face hCaptcha challenge with multimodal llms
A sound cloning tool with a web interface, using your voice
Deep learning library
Open Source Differentiable Computer Vision Library
End-to-end speech processing toolkit
This project is a common knowledge point and code implementation
Machine Learning Journal for Intermediate to Advanced Topics
Optax is a gradient processing and optimization library for JAX
PyTorch3D is FAIR's library of reusable components for deep learning
Controllable & emotion-expressive zero-shot TTS
Fast image augmentation library and an easy-to-use wrapper
Generate audiobooks from e-books
Numerical differential equation solvers in JAX
Data science interview questions and answers
NeurIPS2025 Spotlight] Quantized Attention
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
CLIP, Predict the most relevant text snippet given an image
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code
Software that uses AI to perform real-time voice conversion
Implementation of Video Diffusion Models
Decomposable Multiscale Mixing for Time Series Forecasting
Quickly get started with AI theory and practical applications
Implementation for MatMul-free LM
A Production-ready Reinforcement Learning AI Agent Library