Experimental / Research Projects in Audio AI

Followers

May 9, 20251 yr

These platforms represent the frontier of audio AI research and innovation. They explore areas like AI-sung music, ultra-realistic voice cloning, and multi-modal generative models. Often not yet mainstream, these tools highlight emerging capabilities and are used by researchers, developers, and advanced creators. They push boundaries in creativity, cross-lingual audio, and expressive speech synthesis. Many are open-source or in beta, allowing early access to cutting-edge developments.

Tools:

OpenAI Jukebox – A neural network that generates music with singing, trained on a large dataset of genres and artists.
Google Gemini Audio – Developer-facing API that supports audio understanding and generative use cases via Google's Gemini models.
MyShell OpenVoice – Open-source project enabling instant voice cloning and speech generation.
Supertone.ai – Delivers hyper-realistic singing and speaking voices, used in K-pop and video game soundtracks.

Create an account or sign in to comment

Followers

Go to topic listing

Experimental / Research Projects in Audio AI

Featured Replies

Tools:

Create an account or sign in to comment

Who's Online (See full list)

Lead AI Transformation without coding

Most Solved

Forum Statistics

Member Statistics

Account

Navigation

Search

Configure browser push notifications

Chrome (Android)

Chrome (Desktop)

Safari (iOS 16.4+)

Safari (macOS)

Edge (Android)

Edge (Desktop)

Firefox (Android)

Firefox (Desktop)