Structured RAG: ingest, index, query
Open source AI Agents hosted on the oTTomator Live Agent Studio
Sample code and notebooks for Generative AI on Google Cloud
Industrial-level controllable zero-shot text-to-speech system
Generative AI reference workflows
Flexible Photo Recrafting While Preserving Your Identity
Pre-trained Deep Learning models and demos
Documentation for Google's Gen AI site - including Gemini API & Gemma
Towards Efficient Self-Evolving Agent System
Interface for OuteTTS models
One-click deployment (including offline integration package)
Official implementation of DreamCraft3D
Real-time voice interactive digital human
A TTS model capable of generating ultra-realistic dialogue
Towards Human-Level Text-to-Speech through Style Diffusion
Official DeiT repository
Open Source Computer Vision Library
Generate 3D objects conditioned on text or images
Official repo for consistency models
800,000 step-level correctness labels on LLM solutions to MATH problem
Official PyTorch Implementation of "Scalable Diffusion Models"
Point cloud diffusion for 3D model synthesis
Cloud ML Engine repo
kNN, decision tree, Bayesian, logistic regression, SVM
Generative Adversarial Networks for Efficient and High Fidelity Speech