A Powerful Native Multimodal Model for Image Generation
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Automatic Speech Recognition with Word-level Timestamps
Agentic, Reasoning, and Coding (ARC) foundation models
RGBD video generation model conditioned on camera input
Spark-TTS Inference Code
A natural language interface for computers
Probabilistic reasoning and statistical analysis in TensorFlow
Fast and memory-efficient exact attention
Trainable models and NN optimization tools
Universal LLM Deployment Engine with ML Compilation
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Code for running inference with the SAM 3D Body Model 3DB
Audiocraft is a library for audio processing and generation
DeepSeek Coder: Let the Code Write Itself
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Synthetic data curation for post-training and data extraction
A high-performance ML model serving framework, offers dynamic batching
Visual Causal Flow
Multi-lingual large voice generation model, providing inference
Pytorch domain library for recommendation systems
A Unified Library for Parameter-Efficient Learning
State-of-the-art TTS model under 25MB
The official Meta Llama 3 GitHub site
A Tree Search Library with Flexible API for LLM Inference-Time Scaling