AI Trading Showdown: DeepSeek Outperforms Gemini in Market Test

AI Models Face Off in Real-Market Trading Challenge

Financial research lab nof1 has conducted a groundbreaking experiment called Alpha Arena, pitting six major AI models against each other in live trading scenarios on decentralized exchange Hyperliquid. Each model received $10,000 in real funds and operated under identical conditions to test their financial decision-making capabilities.

The Competitors and Results

The participating models included:

  • GPT-5
  • Gemini 2.5 Pro
  • Grok-4
  • Claude Sonnet 4.5
  • DeepSeek V3.1
  • Qwen3 Max

The results revealed stark differences in performance:

  • DeepSeek V3.1 and Grok-4 tied for top position with returns exceeding 14%
  • Gemini 2.5 Pro suffered catastrophic losses of 42.57%, the worst performance recorded

The other models delivered mixed results, with none matching the top performers' success.
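To put those percentages in dollar terms, the reported returns can be applied to the $10,000 starting stake. This is a minimal sketch, not nof1's own accounting: the function name `ending_balance` is hypothetical, fees and funding costs are ignored, and because the article only says the top performers exceeded 14%, that figure is treated as a lower bound rather than an exact result.

```python
STARTING_CAPITAL = 10_000.00  # USD per model, as stated in the article


def ending_balance(starting_capital: float, return_pct: float) -> float:
    """Account value after applying a percentage return (hypothetical helper)."""
    return round(starting_capital * (1 + return_pct / 100), 2)


# 14% is only a floor for DeepSeek V3.1 and Grok-4; exact returns are not given.
top_floor = ending_balance(STARTING_CAPITAL, 14.0)

# Gemini 2.5 Pro's reported loss of 42.57%.
gemini_balance = ending_balance(STARTING_CAPITAL, -42.57)

print(f"Top performers: more than ${top_floor:,.2f}")
print(f"Gemini 2.5 Pro: about ${gemini_balance:,.2f}")
```

Under these assumptions, the top two accounts ended above roughly $11,400, while Gemini 2.5 Pro's account fell to roughly $5,743.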

Beyond Simple Competition

The Alpha Arena project aims to evaluate more than just raw profitability. According to nof1 researchers, the primary objectives include:

  1. Assessing strategy stability under market volatility
  2. Testing risk response mechanisms across different model architectures
  3. Establishing benchmarks for AI-driven quantitative trading systems

The experiment demonstrates how large language models are evolving beyond text processing into complex financial applications.

Implications for Financial AI

The successful performance of certain models suggests promising applications for:

  • Automated portfolio management
  • Real-time trading algorithms
  • Risk assessment systems

The dramatic failure of Gemini 2.5 Pro also underscores the importance of robust testing before deploying AI systems with real capital.

The financial sector continues to show strong interest in AI solutions that can process market data faster and more comprehensively than human traders.

Key Points:

  • DeepSeek V3.1 and Grok-4 achieved over 14% returns in live trading test
  • Gemini 2.5 Pro lost 42.57% of its allocated capital
  • Experiment conducted with $10,000 in real funds per model on the Hyperliquid exchange
  • The study highlights both the potential and risks of AI-driven financial systems
