Skip to main content

Zhipu AI Unveils Smarter Voice Typing with Open-Sourced Speech Tech

Zhipu AI Raises the Bar for Voice Recognition

Chinese AI firm Zhipu has just dropped a major upgrade that could change how we interact with our computers. Their new GLM-ASR speech recognition models aren't just smarter - they're being shared with the world through open-source licensing.

Image

The star of the show is the cloud-based GLM-ASR-2512, which boasts industry-leading accuracy with a character error rate below 0.072%. That means it gets words right more than 99.9% of the time, even when dealing with different accents or noisy environments.

"We wanted to create something that works as well in a busy café as it does in a quiet office," explains Zhipu's technical lead. The model handles multiple languages seamlessly, making it ideal for global users.

Power in a Small Package

For those concerned about privacy or needing offline access, Zhipu offers GLM-ASR-Nano-2512 - a compact version packing surprising punch despite its modest 1.5 billion parameters. Tests show it outperforms some proprietary systems while running directly on your device.

This local processing means your voice data stays private rather than being sent to distant servers. It also cuts down on lag - your words appear almost instantly as you speak them.

Your Computer Just Got More Conversational

The technology powers Zhipu's refreshed AI Input Method, transforming PCs into responsive voice assistants. Beyond simple dictation, it can translate spoken words between languages or rephrase text on command - think of it like having a secretary living in your keyboard.

Early adopters get 2,000 free points (about four weeks of typical use) to explore features including:

  • Real-time speech-to-text conversion
  • Multi-language translation
  • Smart text rewriting
  • Cross-platform synchronization

The desktop app currently supports Windows and macOS, with mobile versions reportedly in development.

Why This Matters

By open-sourcing their technology, Zhipu invites developers worldwide to build upon their work rather than keeping innovations locked away. This approach could accelerate progress across everything from accessibility tools to smart home devices.

The new input method also hints at where computing interfaces might be heading - toward systems that understand natural speech as effortlessly as they process mouse clicks.

Key Points:

  • 🎙️ Two new speech models: cloud-based powerhouse + privacy-focused local version
  • 💻 Revamped input method adds translation and text editing by voice
  • 🆓 Generous free trial lets users test premium features
  • 🔓 Open-source approach encourages wider innovation

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

WeChat Work Gets Smarter: Scan, Connect, and Automate Documents
News

WeChat Work Gets Smarter: Scan, Connect, and Automate Documents

Enterprise WeChat's latest upgrade integrates OpenClaw technology, bringing two game-changing features to businesses. Now administrators can set up AI assistants in seconds with QR code scanning, while employees enjoy automated document creation with simple text commands. The update transforms WeChat from a messaging tool into a powerful collaborative platform that blends AI efficiency with human oversight.

March 16, 2026
EnterpriseWeChatAIAutomationProductivityTools
News

MiniMax Surpasses Baidu: China's AI Landscape Gets a Shake-Up

In a stunning market reversal, AI unicorn MiniMax has overtaken tech giant Baidu with a HK$382.6 billion valuation. The company's stock surged 22% amid strong financials showing 158.9% revenue growth, with 70% coming from international markets. This milestone signals shifting priorities in China's AI sector - from technical benchmarks to real-world profitability and global competitiveness.

March 11, 2026
AITechStocksMarketTrends
Xie Saining's Team Unveils Solaris: A Breakthrough in Multi-User Video AI
News

Xie Saining's Team Unveils Solaris: A Breakthrough in Multi-User Video AI

Xie Saining's research team has launched Solaris, the world's first multi-user video world model, powered by Kunlun Wanzhi's Matrix-Game2.0. This innovative technology enhances player interaction in environments like Minecraft, outperforming previous solutions. The release coincides with a major funding milestone for Xie's AI company, AMI, highlighting the growing importance of world models in advancing artificial general intelligence.

March 11, 2026
AIMachine LearningVirtual Worlds
ChatGPT Now Recognizes Songs Like Shazam - Here's How It Works
News

ChatGPT Now Recognizes Songs Like Shazam - Here's How It Works

OpenAI has teamed up with Shazam to bring music recognition directly into ChatGPT. No more switching apps when you hear that catchy tune - just ask ChatGPT what's playing and get instant results. The integration lets users identify songs through simple voice or text commands, complete with artist info and preview clips. It's like having a music-savvy friend in your chat.

March 10, 2026
OpenAIChatGPTShazam
GPT-5.4 Arrives With Mind-Reading AI and Million-Token Memory
News

GPT-5.4 Arrives With Mind-Reading AI and Million-Token Memory

OpenAI's latest model, GPT-5.4, introduces revolutionary features that bring us closer to truly intelligent digital assistants. The new Thinking mode lets users peer into the AI's reasoning process, while million-token memory enables handling massive documents. Perhaps most impressive are its native computer operation abilities - this AI doesn't just talk, it can actually work across your applications.

March 6, 2026
AIOpenAIGPT
AI Agents Get Smarter on the Fly with New Training Framework
News

AI Agents Get Smarter on the Fly with New Training Framework

Ant Group and Tsinghua University have unveiled AReaL v1.0, a breakthrough reinforcement learning framework that lets AI agents improve themselves during actual use. Unlike traditional systems that require extensive coding, this innovative solution allows existing agents to connect seamlessly - imagine your digital assistant getting better at its job every time you use it. The system's secret weapon? An AI-powered development assistant that helped build its complex architecture in record time.

March 4, 2026
AIMachineLearningTechInnovation