An open-source, sophisticated multi-model AI audio generation platform
Updated Jan 1, 2026 - Python
ComfyUI custom nodes for the Dia2 TTS model — generate speech, timestamps, and captions directly inside ComfyUI.
A curated list of AI audio generation APIs, SDKs, and tools including text-to-speech, speech synthesis, music generation, voice cloning, sound design, and generative AI platforms. Covers commercial services, open source models with APIs, and production-ready infrastructure for developers building audio applications.
🤖 AI Agent API for Podcatalk - Intelligent podcast content generation using OpenAI Agents SDK. Built with FastAPI and Google Cloud TTS. Features multi-agent orchestration, real-time SSE streaming, SSML audio synthesis, and dynamic voice selection for automated podcast production.
A Farcaster bot built in n8n that uses Neynar and the Music LLM API to generate songs from casts.
Streamlining Text-to-Speech Tasks Using Google Colab