Parallel WaveGAN is an unofficial PyTorch implementation of several state-of-the-art non-autoregressive neural vocoders, centered on Parallel WaveGAN but also including MelGAN, Multiband-MelGAN, HiFi-GAN, and StyleMelGAN. Its main goal is to provide a real-time neural vocoder that can turn mel spectrograms into high-quality speech audio efficiently. The repository is designed to work hand-in-hand with ESPnet-TTS and NVIDIA Tacotron2-style front ends, so you can build complete TTS or singing voice synthesis pipelines. It includes a large collection of “Kaldi-style” recipes for many datasets such as LJSpeech, LibriTTS, VCTK, JSUT, CMU Arctic, and multiple singing voice corpora in Japanese, Mandarin, Korean, and more. The project provides pre-trained models, Colab demos, and example configurations, allowing researchers to quickly evaluate vocoder quality or adapt models to new datasets.

Features

  • PyTorch implementations of Parallel WaveGAN, MelGAN, Multiband-MelGAN, HiFi-GAN, and StyleMelGAN
  • Real-time neural vocoder compatible with ESPnet-TTS and Tacotron2 front ends
  • Extensive set of Kaldi-style recipes for speech and singing datasets in multiple languages
  • Pretrained models and Colab demos for quick listening tests and prototyping
  • Flexible training pipeline with support for multi-GPU and distributed setups
  • Very low real-time factor for fast mel-to-waveform conversion suitable for deployment

Project Samples

Project Activity

See All Activity >

Categories

Text to Speech

License

MIT License

Follow Parallel WaveGAN

Parallel WaveGAN Web Site

Other Useful Business Software
Skillfully - The future of skills based hiring Icon
Skillfully - The future of skills based hiring

Realistic Workplace Simulations that Show Applicant Skills in Action

Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Parallel WaveGAN!

Additional Project Details

Programming Language

Python

Related Categories

Python Text to Speech Software

Registered

2025-11-28