本周,Mistral AI 的 Le Chat 也宣布了更新,允许免费访问新功能。这些产品的推出似乎在生成人工智能市场上引发了激烈的竞争,克服了其他产品的缺点。
DeepSeek表示,其人工智能可以显示分步实时推理,使其思维过程更加透明。与此同时,这家人工智能公司还表示,他们将在未来几天发布开源模型和API开发工具。
根据人工智能和技术评论员 Andrew Curran 引用的对比图表,DeepSeek-R1-Lite-Preview 在 AIME (52.5) 和 Codeforces (1450) 等参数上取得了最高分,优于 OpenAI o1-preview 和 Claude 3.5 Sonnet 等竞争对手。
它还在 MATH-500 (91.6) 中领先,表明在高级问题解决任务中表现出色。然而,与 OpenAI o1-preview(分别为 73.3 和 71.4)等模型相比,它在 GPQA Diamond(58.5)和 Zebra Logic(56.6)方面落后。这些数字意味着常识和逻辑推理部分还有改进的空间。
在 o1-preview 发布两个月后,其思想链推理已被复制。鲸鱼现在可以推理了。 DeepSeek表示,DeepSeek-R1正式版将完全开源。 https://t.co/Ya9mVyLvDP pic.twitter.com/6wZ8xoAyyz
—安德鲁·柯兰 (@AndrewCurran_) 2024 年 11 月 20 日
Cryptopolitan tried the features of the launch for an unbiased review. Firstly, DeepSeek’s chat requires a user login. The chat under the ‘Deep Think’ feature limits conversations to up to 50 messages per day. We can say that Deepseek thinks loudly while also estimating its time of response. It also solved the math problem we presented in a logical order. In comparison, ChatGPT 4o took less time for the solution but did not present a step-by-step reasoning for the same.
Influencer Bilawal Sidhu took a jibe at o1 and stated, “Ironic that OpenAI’s o1 model hides its chain-of-thought reasoning, while the Chinese DeepSeek-R1 makes it transparent to users. Shouldn’t it be the other way around?”
That said, China has a comprehensive framework around AI. On July 13, 2023, multiple Chinese authorities, including the Cyberspace Administration of China (CAC) and the Ministry of Education, introduced new regulations for generative AI technologies. These rules, called the Generative AI Regulation, officially came into effect last year on August 15.
The scope of the regulation reportedly covers the use of algorithms, deep synthesis technologies, the use of all generative AI technologies, and several other tech activities. And with its transparent reasoning approach, strong performance on competitive benchmarks, and plans to release open-source tools, DeepSeek is pushing the boundaries of generative AI in China and among its competitors globally.
Land a High-Paying Web3 Job in 90 Days: The Ultimate Roadmap