Controllable & emotion-expressive zero-shot TTS
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Framework for building real-time voice and multimodal AI agents
A python parametric CAD scripting framework based on OCCT
Rich is a Python library for rich text and beautiful formatting
Python binding to the Apache Tika™ REST services
A high-quality rapid TTS voice cloning model
GLM-4-Voice | End-to-End Chinese-English Conversational Model
ASCII art library for Python
Official Python inference and LoRA trainer package
Generate blog articles from video or audio
Tools to ease the creation of snippets, syntax definitions, etc.
A community-supported supercharged version of paperless
Python & command-line tool to gather text on the Web
State-of-the-art (SoTA) text-to-video pre-trained model
Faster Whisper transcription with CTranslate2
Translate the video from one language to another and embed dubbing
Management of Yandex Station and other smart home devices
Controllable and fast Text-to-Speech for over 7000 languages
SoTA open-source TTS
Stable Diffusion web UI
Full git and GitHub integration with Sublime Text
Capable of understanding text, audio, vision, video
A lightweight text-to-speech model with zero-shot voice cloning
Open Source Document Management System for Digital Archives