Skip to main content

AI Trading Showdown: DeepSeek Outperforms Gemini in Market Test

AI Models Face Off in Real-Market Trading Challenge

Financial research lab nof1 has conducted a groundbreaking experiment called Alpha Arena, pitting six major AI models against each other in live trading scenarios on decentralized exchange Hyperliquid. Each model received $10,000 in real funds and operated under identical conditions to test their financial decision-making capabilities.

The Competitors and Results

The participating models included:

  • GPT-5
  • Gemini 2.5 Pro
  • Grok-4
  • Claude Sonet 4.5
  • DeepSeek V3.1
  • Qwen3Max

Image

The results revealed stark differences in performance:

  • DeepSeek V3.1 and Grok-4 tied for top position with returns exceeding 14%
  • Gemini 2.5 Pro suffered catastrophic losses of 42.57%, the worst performance recorded

The other models delivered mixed results, with none matching the top performers' success.

Beyond Simple Competition

The Alpha Arena project aims to evaluate more than just raw profitability. According to nof1 researchers, the primary objectives include:

  1. Assessing strategy stability under market volatility
  2. Testing risk response mechanisms across different model architectures
  3. Establishing benchmarks for AI-driven quantitative trading systems

The experiment demonstrates how large language models are evolving beyond text processing into complex financial applications.

Implications for Financial AI

The successful performance of certain models suggests promising applications for:

  • Automated portfolio management
  • Real-time trading algorithms
  • Risk assessment systems The dramatic failure of Gemini 2.5 Pro also underscores the importance of robust testing before deploying AI systems with real capital.

The financial sector continues to show strong interest in AI solutions that can process market data faster and more comprehensively than human traders.

Key Points:

  • DeepSeek V3.1 and Grok-4 achieved over 14% returns in live trading test
  • Gemini 2.5 Pro lost nearly half its allocated capital
  • Experiment conducted with $10,000 real funds per model on Hyperliquid exchange The study highlights both the potential and risks of AI-driven financial systems

Enjoyed this article?

Subscribe to our newsletter for the latest AI news, product reviews, and project recommendations delivered to your inbox weekly.

Weekly digestFree foreverUnsubscribe anytime

Related Articles

DeepSeek-V4 Set to Revolutionize Code Generation This February
News

DeepSeek-V4 Set to Revolutionize Code Generation This February

DeepSeek is gearing up to launch its powerful new AI model, DeepSeek-V4, around Chinese New Year. The update promises major leaps in code generation and handling complex programming tasks, potentially outperforming competitors like Claude and GPT series. Developers can expect more organized responses and better reasoning capabilities from this innovative tool.

January 12, 2026
AI DevelopmentProgramming ToolsMachine Learning
News

DeepSeek Finds Smarter AI Doesn't Need Bigger Brains

DeepSeek's latest research reveals a breakthrough in AI development - optimizing neural network architecture can boost reasoning abilities more effectively than simply scaling up model size. Their innovative 'Manifold-Constrained Hyper-Connections' approach improved complex reasoning accuracy by over 7% while adding minimal training costs, challenging the industry's obsession with ever-larger models.

January 4, 2026
AI ResearchMachine LearningNeural Networks
Chinese AI Model Stuns Tech World with Consumer GPU Performance
News

Chinese AI Model Stuns Tech World with Consumer GPU Performance

Jiukun Investment's new IQuest-Coder-V1 series is turning heads in the AI community. This powerful code-generation model, running on a single consumer-grade GPU, outperforms industry giants like Claude and GPT-5.2 in coding tasks. Its unique 'code flow' training approach mimics real-world development processes, offering developers unprecedented creative possibilities while keeping hardware requirements surprisingly accessible.

January 4, 2026
AI DevelopmentMachine LearningCode Generation
AI Set to Reshape European Banking Workforce by 2030
News

AI Set to Reshape European Banking Workforce by 2030

A Morgan Stanley report predicts artificial intelligence could impact up to 200,000 banking jobs across Europe within this decade. While fintech innovations promise greater efficiency, they also threaten traditional roles - particularly in back-office operations handling routine tasks. Banks now face dual challenges: implementing cutting-edge technology while supporting employees through inevitable workforce transitions.

December 31, 2025
Banking AutomationAI Workforce ImpactFinancial Technology
News

Tencent Cloud's AI Revolution in Finance: 100+ Real-World Applications Now Live

Tencent Cloud has taken financial AI from theory to practice, deploying over 100 large model applications with major Chinese institutions. From detecting fraud in seconds to generating investment insights, these real-world implementations are transforming how banks and exchanges operate. The company emphasizes security and compliance as key differentiators in this sensitive sector.

December 31, 2025
Financial TechnologyAI ApplicationsTencent Cloud
NVIDIA's NitroGen learns to game like humans by watching YouTube
News

NVIDIA's NitroGen learns to game like humans by watching YouTube

NVIDIA has unveiled NitroGen, an AI model that learns to play video games simply by watching gameplay videos. Trained on 40,000 hours of footage spanning over 1,000 titles, this breakthrough can understand controller inputs from screen recordings alone. The system shows remarkable adaptability, improving performance by up to 52% when transferring skills to new games.

December 29, 2025
AI GamingNVIDIAMachine Learning