DeepSeek challenges OpenAI with transparent AI breakthrough, beating OpenAI on 3 benchmarks

Source Cryptopolitan

DeepSeek, a China-based AI company, has launched DeepSeek-R1-Lite-Preview for better reasoning and problem-solving capabilities. Announced in a post on X, the system is positioned as a competitor to industry leaders like OpenAI.

Commentators have called it ironic that a Chinese company is making its model's reasoning transparent while companies in the West, such as OpenAI, keep theirs hidden.

DeepSeek AI’s new launch can do better math

DeepSeek, an AI company based in China, introduced a new version of its AI system called DeepSeek-R1-Lite-Preview. In a post on X, it said that the new AI system has improved reasoning and problem-solving abilities.

According to DeepSeek, the preview performs well on benchmarks like AIME (American Invitational Mathematics Examination) and MATH, which measure problem-solving and reasoning abilities. As the AI seems skilled at handling complex mathematical and logical problems, it might be ready to compete with OpenAI’s ChatGPT and specifically with OpenAI o1.

This week, Mistral AI’s Le Chat also announced updates to allow free access to new features. The launches appear to be intensifying competition in the generative AI market, with each product aiming to address the shortcomings of its rivals.

DeepSeek thinks out loud unlike ChatGPT

DeepSeek says that its AI can show its step-by-step reasoning in real time, making its thought process more transparent. The company has also said that it will release an open-source model and API developer tools in the coming days.

According to a comparison chart cited by AI and tech commentator Andrew Curran, DeepSeek-R1-Lite-Preview achieves the highest score on benchmarks such as AIME (52.5) and Codeforces (1450), outperforming competitors such as OpenAI o1-preview and Claude 3.5 Sonnet.

It also leads on MATH-500 (91.6), indicating high performance in advanced problem-solving tasks. However, it lags on GPQA Diamond (58.5) and Zebra Logic (56.6) compared to models like OpenAI o1-preview (73.3 and 71.4, respectively). The figures suggest that there is room for improvement in the general knowledge and logical reasoning segments.

Cryptopolitan tried the launch's features for an unbiased review. First, DeepSeek’s chat requires a user login. Chat under the ‘Deep Think’ feature is limited to 50 messages per day. DeepSeek thinks out loud while also estimating its response time. It also solved the math problem we presented in a logical order. In comparison, ChatGPT-4o took less time to reach the solution but did not present step-by-step reasoning.

How DeepSeek responds to math problems
How ChatGPT-4o responds to math problems

Influencer Bilawal Sidhu took a jab at o1, stating, “Ironic that OpenAI’s o1 model hides its chain-of-thought reasoning, while the Chinese DeepSeek-R1 makes it transparent to users. Shouldn’t it be the other way around?”

That said, China has a comprehensive framework around AI. On July 13, 2023, multiple Chinese authorities, including the Cyberspace Administration of China (CAC) and the Ministry of Education, introduced new regulations for generative AI technologies. These rules, called the Generative AI Regulation, officially came into effect on August 15, 2023.

The scope of the regulation reportedly covers the use of algorithms, deep synthesis technologies, all generative AI technologies, and several other tech activities. With its transparent reasoning approach, strong performance on competitive benchmarks, and plans to release open-source tools, DeepSeek is pushing the boundaries of generative AI both in China and against its competitors globally.

