A text-to-speech, speech-to-text and speech-to-speech library
Easily turn large sets of image urls to an image dataset
Open-source abilities for OpenHome agents
1 min voice data can also be used to train a good TTS model
AI video generator optimized for low VRAM and older GPUs use
Generate short videos with one click using AI LLM
Talk to Your AI Agents from Anywhere
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
AGiXT is a dynamic AI Automation Platform
Universal LLM Deployment Engine with ML Compilation
Lightweight demo to build a conversational AI search engine quickly
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Unofficial Python API and agentic skill for Google NotebookLM
A simple native web interface that uses ChatTTS to synthesize text
Framework for building AI agents that automate complex web tasks
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
The SOTA Open-Source Browser Agent
Unified web UI for training and running open models locally
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Synchronized Translation for Videos
Google Flights MCP and Python Library
Create UIs for your machine learning model in Python in 3 minutes
Chat with your SQL database
gpt-oss-120b and gpt-oss-20b are two open-weight language models
A TTS that fits in your CPU (and pocket)