Alibaba's New Voice Tech Lets You Command Sounds Like Magic

Alibaba's Voice Revolution: Speak It Into Existence

Imagine telling your computer "Make this voice sound like a nervous teenager" or "Add café chatter in the background" - and having it happen instantly. That's the promise of Alibaba Tongyi Lab's new voice technology duo unveiled today.

Your Personal Voice Director

Fun-CosyVoice3.5 isn't your average text-to-speech tool. Want your audiobook narrator to sound more dramatic? Just say "Add some Shakespearean flair." Need customer service training audio? Tell it to "sound patient but slightly exasperated." The multilingual model now also supports Thai, Indonesian, Portuguese and Vietnamese, and errors on rare characters have been cut by nearly 70%.

Meanwhile, Fun-AudioGen-VD acts like a Hollywood sound studio in your browser. Picture this:

  • "Create a deep-voiced villain with a slight lisp standing in a cathedral"
  • "Make a children's storyteller with background forest sounds"
  • "Simulate an underwater conversation between two robots"

The system handles everything from subtle vocal quirks to complex environmental acoustics.

Why This Changes Everything

For podcasters, these tools remove the need to hire voice actors for placeholder tracks. Game developers can prototype character voices before recording sessions. Filmmakers can quickly generate temporary dialogue during editing.

"We're removing the technical barriers," explains a Tongyi Lab spokesperson. "Now creative vision directly translates to audio reality."

The models aren't perfect yet: extremely specific requests might still need tweaking. But for most users, speaking their audio needs into existence just became reality.

Key Points:

  • Natural language control: Adjust voices and scenes using everyday phrases
  • Multilingual mastery: Supports 13 languages with improved accuracy
  • Lightning fast: 35% reduction in processing delays
  • Creative playground: Combine characters, emotions and environments freely
