GPT-SoVITS is a voice conversion and text-to-speech (TTS) system that supports zero-shot and few-shot synthesis from a short voice sample (for example, 5 seconds). It supports cross-lingual speech synthesis across English, Chinese, Japanese, Korean, Cantonese, and more. It builds on the VITS architecture, adapted for few-sample training and real-time use.

Features

  • Zero‑shot TTS: generate speech from a 5‑second voice sample
  • Few-shot fine-tuning: 1 minute of training data for improved voice likeness
  • Cross-lingual support across multiple languages
  • Web UI for inference and batch generation
  • Open-source with pretrained model weights
  • Active community and publication‑grade performance
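
The 5-second and 1-minute figures above suggest a simple pre-check on a reference recording before choosing a mode. The following is a minimal sketch using only Python's standard `wave` module. It does not call GPT-SoVITS itself: the function names, the constants, and the choice to treat both figures as minimum durations are assumptions made for illustration.

```python
import io
import wave

# Thresholds taken from the feature list; treating them as minimums is an
# assumption. All names here are hypothetical and not part of GPT-SoVITS.
ZERO_SHOT_MIN_SECONDS = 5.0
FEW_SHOT_MIN_SECONDS = 60.0


def wav_duration_seconds(source) -> float:
    """Return the duration of a PCM WAV file (path or binary file object)."""
    with wave.open(source, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def suggest_mode(duration: float) -> str:
    """Map a clip duration to the mode the feature list suggests."""
    if duration >= FEW_SHOT_MIN_SECONDS:
        return "few-shot"
    if duration >= ZERO_SHOT_MIN_SECONDS:
        return "zero-shot"
    return "too-short"


def make_silent_wav(seconds: float, rate: int = 16000) -> io.BytesIO:
    """Build an in-memory mono 16-bit silent WAV, used only for the demo."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    clip = make_silent_wav(6.0)
    print(suggest_mode(wav_duration_seconds(clip)))
```

In practice `wav_duration_seconds` would be given the path to a real recording; the silent clip only keeps the sketch self-contained.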

Categories

Voice Cloning

License

MIT License

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Voice Cloning Software

Registered

2025-07-29