Alibaba's FantasyWorld Takes Top Spot in Global AI Model Rankings
Alibaba Makes Waves With New 3D World Model
AutoNavi, Alibaba's mapping subsidiary, has officially launched its ambitious "FantasyWorld" project - and it's already turning heads in the AI community. Within days of release, the model secured top honors on Stanford University's prestigious WorldScore Leaderboard, outperforming international competitors across multiple metrics.

Technical Innovation Behind the Success
What sets FantasyWorld apart is its clever fusion of video processing and 3D modeling. The team added a trainable geometric component to existing video-based models, creating what they call "joint modeling of video latent variables and implicit 3D fields." In simpler terms? It generates remarkably realistic 3D environments from flat videos with impressive efficiency.
The results speak for themselves. Compared to other methods, FantasyWorld maintains exceptional consistency across different viewing angles - even handling extreme perspectives like complete 180-degree rotations without losing detail or coherence.
Real-World Applications Take Flight
The technology isn't just theoretical. AutoNavi has already integrated FantasyWorld into its "Flying Street View" feature, revolutionizing how businesses create virtual tours. Restaurant owners can now generate photorealistic 3D walkthroughs by simply uploading smartphone videos - no expensive equipment or technical expertise required.
This democratization of spatial modeling aligns with what AutoNavi calls "technological equity," lowering barriers for small businesses while giving customers richer preview experiences.
Industry Implications: A New Era Dawns
The timing couldn't be better. As autonomous vehicles shift toward visual-based navigation and embodied AI systems grow more sophisticated, demand for accurate world models has skyrocketed. FantasyWorld positions Alibaba at the forefront of this transformation.
The company isn't stopping here. An internal embodied business division is already exploring applications ranging from service robots to robotic dogs, signaling Alibaba's broader ambitions in physical AI systems.
Key Points:
- Top-ranked performance: Scores 78.55 (static scenes) and 66.89 (dynamic scenes) on WorldScore benchmarks
- Technical breakthrough: Combines video processing with geometric modeling in single computation pass
- Commercial deployment: Powers AutoNavi's Flying Street View feature for businesses
- Academic recognition: Research papers accepted by ICLR 2025 and NeurIPS 2025 conferences



