OCR expert VLM powered by Hunyuan's native multimodal architecture
Quick illustration of how one can easily read books together with LLMs
LLM Council works together to answer your hardest questions
Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning
Modular quant framework
DeepMind's software stack for physics-based simulation
Create UIs for your machine learning model in Python in 3 minutes
High-Quality Voice Cloning TTS for 600+ Languages
An event-driven framework designed to build multi-agent AI systems
Implementation of AudioLM audio generation model in PyTorch
PPTAgent: Generating and Evaluating Presentations
Genome modeling and design across all domains of life
One API call, pull Claude agent, completely sandboxed
Audio foundation model excelling in audio understanding
Build GenAI applications quickly and easily
The best ChatGPT that $100 can buy
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Hackable and optimized Transformers building blocks
A Repo For Document AI
Your open-source LLM evaluation toolkit
Open-source choice to scale, assess and maintain natural language data
Convert TensorFlow, Keras, TensorFlow.js and TFLite models to ONNX
Advancing Open-source World Models
kaldi-asr/kaldi is the official location of the Kaldi project
ComfyUI wrapper nodes for HunyuanVideo