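This week, Mistral AI's Le Chat also announced an update that gives users free access to new features. These launches appear to be fueling intense competition in the generative AI market, with each product aiming to overcome the shortcomings of the others.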
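DeepSeek says its AI can display step-by-step reasoning in real time, making its thought process more transparent. The AI company also said it will release open-source models and API development tools in the coming days.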
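According to a comparison chart cited by AI and technology commentator Andrew Curran, DeepSeek-R1-Lite-Preview achieved the highest scores on benchmarks such as AIME (52.5) and Codeforces (1450), outperforming competitors such as OpenAI o1-preview and Claude 3.5 Sonnet.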
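It also leads on MATH-500 (91.6), indicating strong performance on advanced problem-solving tasks. However, it trails models such as OpenAI o1-preview on GPQA Diamond (58.5) and Zebra Logic (56.6), where o1-preview scored 73.3 and 71.4, respectively. These figures suggest there is still room for improvement in general knowledge and logical reasoning.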
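Two months after the release of o1-preview, its chain-of-thought reasoning has been replicated. The whale can now reason. DeepSeek says the official version of DeepSeek-R1 will be completely open source. https://t.co/Ya9mVyLvDP pic.twitter.com/6wZ8xoAyyz
Andrew Curran (@AndrewCurran_), November 20, 2024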
Cryptopolitan tried the newly launched features for an unbiased review. First, DeepSeek's chat requires a user login. Under the 'Deep Think' feature, conversations are limited to 50 messages per day. DeepSeek effectively thinks out loud while also showing an estimate of its response time. It solved the math problem we presented in a logical order. In comparison, ChatGPT 4o took less time to reach the solution but did not present step-by-step reasoning for it.
Influencer Bilawal Sidhu took a jibe at o1 and stated, “Ironic that OpenAI’s o1 model hides its chain-of-thought reasoning, while the Chinese DeepSeek-R1 makes it transparent to users. Shouldn’t it be the other way around?”
That said, China has a comprehensive framework around AI. On July 13, 2023, multiple Chinese authorities, including the Cyberspace Administration of China (CAC) and the Ministry of Education, introduced new regulations for generative AI technologies. These rules, called the Generative AI Regulation, officially came into effect on August 15, 2023.
The scope of the regulation reportedly covers the use of algorithms, deep synthesis technologies, all generative AI technologies, and several other tech activities. With its transparent reasoning approach, strong performance on competitive benchmarks, and plans to release open-source tools, DeepSeek is pushing the boundaries of generative AI both in China and against its global competitors.