MINT-4B Multimodal VLA Model Launches in Collaboration with Cai Panpan's Team

MINT-4B Multimodal VLA Model Launches in Collaboration with Cai Panpan's Team

2026-06-04 13:36View Original

According to the official announcement, Guangdong Zhidong Weilai, in collaboration with Professor Cai Panpan's team from Shanghai Chuangzhi Academy, has unveiled the MINT-4B multimodal VLA (Vision-Language-Action) large model. The model ranks among the top three in global benchmark evaluations of mainstream general-purpose robotics models, outperforming industry-standard models such as OpenVLA, GR00T, π, and UniVLA in technical metrics.

The core innovation of the MINT series lies in "replicating task intent rather than mechanically replicating trajectories." It pioneers the SDAT multiscale frequency-domain tokenization technique, enabling hierarchical decoding through cross-scale autoregressive inference. This technology is already integrated into the Xiao Zhi S2 humanoid robot, deployed across scenarios including scientific research, education, and commercial exhibitions, with the model having been commercially rolled out in multiple regions nationwide.

Disclaimer: Contains third-party opinions, does not constitute financial advice

Recommended Reading

Google announces Gemini 3.5 Pro will launch in June, with over 900 million monthly active users

1 day ago
Google announces Gemini 3.5 Pro will launch in June, with over 900 million monthly active users

MiniMax launches M3 model, with API context length support up to 1M tokens

3 days ago
MiniMax launches M3 model, with API context length support up to 1M tokens

Alibaba Qwen3.7-Max Tops Global Code Arena Programming Ranking at Second Place

9 days ago
Alibaba Qwen3.7-Max Tops Global Code Arena Programming Ranking at Second Place

DeepSeek V4 Flash Tops OpenRouter's Monthly API Call Rankings, Chinese Large Models Dominate Top Ten

9 days ago
DeepSeek V4 Flash Tops OpenRouter's Monthly API Call Rankings, Chinese Large Models Dominate Top Ten

Nosh One Kitchen Robot Launches Crowdfunding, Powered by NoshOS System with Support for 500 Dishes

10 days ago
Nosh One Kitchen Robot Launches Crowdfunding, Powered by NoshOS System with Support for 500 Dishes

Google I/O 2026 Dialogues Stage Focuses on AI, Quantum Computing, and the Future of Robotics

12 days ago
Google I/O 2026 Dialogues Stage Focuses on AI, Quantum Computing, and the Future of Robotics

Kawasaki Heavy Industries Joins Forces with NVIDIA, Microsoft, and Others to Establish a Specialized AI Center Focused on Healthcare and Robotics

13 days ago
Kawasaki Heavy Industries Joins Forces with NVIDIA, Microsoft, and Others to Establish a Specialized AI Center Focused on Healthcare and Robotics