Implementation of the Surya Foundation Model for Heliophysics
Open source codebase for Scale Agentex
Bringing BERT into modernity via both architecture changes and scaling
A curated collection of skills for AI coding agents
Stable Diffusion web UI
Omnilingual ASR: Open-Source Multilingual Speech Recognition
Diffusion Transformer with Fine-Grained Chinese Understanding
Graph Neural Network Library for PyTorch
Models and examples built with TensorFlow
Large Multimodal Models for Video Understanding and Editing
OCR expert VLM powered by Hunyuan's native multimodal architecture
Generate high-definition story short videos with one click using AI
Handwritten Text Recognition (HTR) system implemented with TensorFlow
From nobody to large language model (LLM) hero
Large Language Model Principles and Practice Tutorial from Scratch
Definitions for AI/ML tasks like dataset creation
Collection of reference environments for offline reinforcement learning
LLM training in simple, raw C/CUDA
Implementation of "MobileCLIP" (CVPR 2024)
Tools for merging pretrained large language models
Developer AI Persona Search Agent
LLM-based Reinforcement Learning audio edit model
MCP integration platforms for AI agents to use tools at any scale
Taming Stable Diffusion for Lip Sync
An opinionated CLI to transcribe Audio files w/ Whisper on-device