Meta releases open-source AI tool for text-to-audio conversion

August 4, 2023

Meta has released an open-source AI tool called AudioCraft that can create sound from text-based prompts. The tool is bundled with three models namely; AudioGen, EnCodec, and MusicGen. AudioGen is designed for creating sound effects based on a written description, EnCodec is a decoding engine, and MusicGen is designed for creating music from text.

Meta is making the code and model weights for AudioCraft available on GitHub. This will allow developers and researchers to experiment with the tool and contribute to its development.

AudioCraft is regarded as a significant advancement in generative AI. Previous advancements in generative AI have focused on text and image generation. AudioCraft, on the other hand, tackles the complex task of text-to-audio conversion. By training language models over their proprietary EnCodec neural audio codec, Meta has enabled AudioCraft to understand the associations between audio and text.

AudioCraft could be used to create realistic sound effects for video games, generate music for digital worlds, or even create new forms of art.

Meta is making AudioCraft available for research use, but it is yet announced any commercial applications for the tool.

The sources for this piece include an article in Axios.

Top Stories

Related Articles

December 23, 2025 Editor's Notes: This is the first of two articles reflecting on the year but Yogi Schulz. Schulz' more...

December 23, 2025 Google parent company Alphabet said Monday that it will acquire Intersect Power for $4.75 billion in cash more...

December 22, 2025 Artificial intelligence dominated global search behaviour in 2025, with Google’s own AI assistant, Gemini, emerging as the more...

December 22, 2025 OpenAI has hired the former head of Shopify’s core product organization to lead its next phase of more...

Jim Love

Jim is an author and podcast host with over 40 years in technology.

Share:
Facebook
Twitter
LinkedIn