New Benchmark Aims to Make AI Phone Calls Feel More Human

AI Phone Calls Get Their First Reality Check

For years, companies using AI for customer calls have operated without clear standards to measure performance. That changed recently when Agora partnered with Meituan to launch VoiceAgentEval, the industry's first comprehensive evaluation system for AI-powered outbound calls.

Moving Beyond Lab Conditions

The new benchmark stands out by focusing on real-world business scenarios rather than artificial lab tests. "We wanted to create something that actually reflects what happens when these systems interact with real customers," explains one project lead.

Key features include:

  • 30 specific scenarios across six major business areas
  • Authentic conversation data instead of scripted interactions
  • Dual evaluation of both text logic and vocal delivery

Putting AI Through Its Paces

The system puts AI models through rigorous testing using 150 carefully designed dialogue simulations. Think of it as a series of pop quizzes for the technology: does it maintain the conversation flow when customers throw curveballs? Can it adapt to different personalities and speaking styles?

Early testing has already identified three top-performing models, though the team hasn't yet released specific rankings. These results provide valuable guidance for businesses considering AI call solutions, from tech startups to established firms like Beijing San Kuai Technology.

Why This Matters Now

As more companies adopt AI calling technology, having reliable performance standards becomes crucial. Customers frustrated by robotic interactions may hang up, while smooth conversations can build trust and satisfaction. VoiceAgentEval aims to push the entire industry toward more natural, effective communication.

The benchmark's creators hope it will accelerate development of AI that doesn't just follow scripts but actually understands and responds to human needs, making those automated calls feel less like talking to a machine and more like chatting with a helpful assistant.

Key Points:

  • First industry standard for evaluating AI outbound calls
  • Tests real business scenarios rather than lab conditions
  • Evaluates both text logic and voice quality
  • Includes 150 simulated dialogue situations
  • Already identified top-performing models in initial testing
