AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.

Features

  • MusicGen for text-to-music with optional melody conditioning
  • AudioGen for text-to-sound effects and ambient audio
  • EnCodec neural audio codec for discrete tokenization and efficient modeling
  • Ready-to-use checkpoints and straightforward Python/CLI inference
  • Training recipes and scripts for fine-tuning on custom datasets
  • Example notebooks and utilities for prompting, conditioning, and post-processing

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow AudioCraft

AudioCraft Web Site

Other Useful Business Software
Skillfully - The future of skills based hiring Icon
Skillfully - The future of skills based hiring

Realistic Workplace Simulations that Show Applicant Skills in Action

Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of AudioCraft!

Additional Project Details

Programming Language

Python

Related Categories

Python Sound Audio, Python Libraries, Python Deep Learning Frameworks

Registered

2025-10-06