MiniMax, a Chinese startup has released three low-cost open-source AI models, which it has pitched to compete with top chatbots like OpenAI’s ChatGPT and other US-made models.
According to an SCMP report, this release of multimodal models comes weeks after rival DeepSeek also set a new standard with its own open-source models, intensifying competition in mainland technology firms.
The Alibaba and Tencent-backed startup released its models on Tuesday. These include the MiniMax-01 large language model (MML) basket that includes a general-purpose foundational model, the MiniMax-Text-01, and the multimodal MiniMax-VL-01. The MiniMax-VL-01 has visual capabilities.
According to the firm, the foundational language model exhibited capabilities that are at par with the world’s leading AI models in evaluations that include solving math problems. In benchmark tests that the firm posted on its WeChat account, MiniMax revealed the model’s other capabilities, including domain knowledge, the ability to follow instructions, and avoiding hallucinations or factual errors.
The benchmarks also revealed that the performance of its new models matches closed-source models, which are typically considered to have the most advanced capabilities in the industry. Models that support products such as Google’s Gemini, Amazon-backed Anthropic’s Claude, and OpenAI’s ChatGPT are closed-source. These typically lead rankings from Chatbot Arena, an AI benchmarking project by researchers at UC Berkeley.
The MiniMax-Text-01 in particular is 465 billion parameters in size and performs better than Google’s recently released Gemini 2.0 Flash on benchmarks like MMLU and SimpleQA, which measures a model’s ability to answer math problems.
According to the firm, MiniMax-VL-01 rivals Anthropic’s Claude 3.5 Sonnet on evaluations that require multimodal understanding.
The development comes weeks after Hangzhou-based competitor DeepSeek took the world’s tech industry by storm with its open-source V3 model in December. Growing competition in the country’s crowded and fast-evolving AI market has pushed industry giants and startups alike into a fierce race for AI supremacy, outdoing each other every few months with new releases.
MiniMax released its models in an environment characterized by intense competition. The release of its new models also comes after Hong Kong-listed SenseTime released a new “unified large model” on the same day.
According to SCMP, benchmark tests by SuperCLUE, which specializes in evaluating Chinese models, ranked the new SenseTime product as a top contender among multimodal models.
Despite the advancements in AI technology, Chinese startups face monetization challenges. Big tech firms like TikTok owner ByteDance, whose Doubao was the most popular Chinese chatbot in December have pockets deep enough to push their AI products to millions of users for free.
On the other hand, startups need to balance ambitious expansion with monetization efforts to sustain operations.
Tough times may still be ahead for MiniMax and its products. Its companion app Talkie, a main contributor to its revenue, according to a Financial Times report – has been delisted from Apple’s App Store in the US since late last year over unspecified “technical reasons.” The Android app remains accessible on Google Play.
MiniMax was founded in 2021 by former employees of one of China’s largest AI firms, SenseTime. According to Tech Crunch, the firm’s products have been embroiled in some minor controversies. For example, Talkie features avatars of public figures including Donald Trump, Elon Musk, and LeBron James, none of whom appear to have consented to being featured on the app.
Broadcast magazine reported in December that MiniMax’s video generators could produce British television channels’ logos. This, according to the magazine, suggested that the firm’s models were trained on content from those channels.
On top of that, the Shanghai-based startup is also reportedly being sued by iQiyi, a Chinese video streaming service alleging MiniMax trained on iQiyi’s copyrighted recordings without authorization.
From Zero to Web3 Pro: Your 90-Day Career Launch Plan