Stanford and UW researchers build $50 open-source ChatGPT o1 rival

Source Cryptopolitan

AI researchers at Stanford and the University of Washington have allegedly pulled off what no one thought possible—they built an AI model called s1 for under $50 in cloud computing credits, according to a research report released Monday.

The s1 model reportedly delivers performance on par with top-tier reasoning models, including OpenAI’s o1 and DeepSeek’s R1, according to benchmarks on math and coding tests. Even better, the entire project is available on GitHub—code, data, and all.

The team started by grabbing an off-the-shelf model instead of building something new. From there, they fine-tuned it through a process called distillation. According to their research, they distilled the reasoning power from Google’s Gemini 2.0 Flash Thinking Experimental model.

They trained s1 to mimic Gemini’s problem-solving techniques by learning from its answers. “We aimed for simplicity,” said Niklas Muennighoff, one of the Stanford researchers involved.

Small dataset, big results

Distillation is quickly becoming a weapon of choice for AI teams looking to cut costs. A similar AI model developed last month by Berkeley researchers cost $450 to train—still cheap by industry standards. But Stanford and UW took it a step further, reducing costs to about $50 by distilling their model with a smaller dataset and streamlined techniques.

The researchers handpicked just 1,000 questions, each paired with detailed answers and step-by-step reasoning from Google’s Gemini 2.0. This dataset provided all the knowledge s1 needed to perform at a high level. According to the team’s paper, the model learned how to think through problems.

Supervised fine-tuning, or SFT, was key to making this work. Instead of using reinforcement learning, which requires huge datasets and tons of computing power, the researchers opted for SFT, a method where the AI is directly trained to mimic behaviors found in the dataset. It’s cheaper and faster, and in this case, it worked.

The base model came from Qwen, the AI lab owned by Alibaba. Their off-the-shelf model was free to download, making the project even more budget-friendly. In less than 30 minutes and using just 16 Nvidia H100 GPUs, the team finished training s1. “We could rent the necessary compute for about $20 today,” Muennighoff said.

“Wait” command boosted s1’s accuracy

The researchers reportedly included a clever trick to improve s1’s reasoning: they added the word “wait” during problem-solving. By telling the model to pause and double-check its work, they increased accuracy. The research paper details how this tweak gave s1 enough time to consider better answers before submitting them.

OpenAI is watching closely. They’ve already accused DeepSeek of improperly harvesting data from their API to build AI competitors. With s1 showing how cheap replication can be, big players like OpenAI aren’t thrilled. The commoditization of AI models could hurt their bottom line.

But the tech company has other problems. On Tuesday, Elon Musk’s legal team faced off against OpenAI in an Oakland federal courtroom. Elon, who was an early investor in OpenAI, claims the company abandoned its nonprofit mission and betrayed its original goals.

He’s suing to block their transition into a fully for-profit business. Elon’s lawyer, Marc Toberoff, told the court on Friday that Elon invested $45 million into OpenAI between its founding and 2018 cause he believed it would always prioritize public benefit over profits, just like it said it would in the beginning.

Judge Yvonne Gonzalez Rogers called it a “stretch” and questioned why Elon would even invest that much money without a written contract. “That is just a lot of money on a handshake,” she mocked. Attorney Toberoff then explained that Elon and OpenAI’s CEO, Sam Altman, had a close relationship at the time, and everything was built on trust.

But the case isn’t dead yet. The judge is concerned about OpenAI’s close relationship with Microsoft. Two key players—LinkedIn co-founder Reid Hoffman and Microsoft executive Deanna Templeton—were involved with OpenAI’s board. Templeton was added as a non-voting member after Altman’s brief ouster in 2023 but was later removed amid antitrust concerns.

Elon wants to stop their plans and potentially gain more control over the company through his own AI venture, xAI, which he added as a plaintiff in the case. OpenAI argues that Elon’s request would cripple its business and is based on “far-fetched” claims.

Cryptopolitan Academy: Are You Making These Web3 Resume Mistakes? - Find Out Here

Disclaimer: For information purposes only. Past performance is not indicative of future results.
placeholder
Australian Dollar appreciates as US Dollar loses ground amid US growth concernsThe Australian Dollar (AUD) rebounded on Monday, recovering losses from the previous two sessions against the US Dollar (USD).
Author  FXStreet
Mar 10, Mon
The Australian Dollar (AUD) rebounded on Monday, recovering losses from the previous two sessions against the US Dollar (USD).
placeholder
Here’s What to Watch With February 2025’s CPI Inflation ReportTradingKey - It’s been a rough few weeks for stock markets at technology stocks, in particular, have been hard hit by the uncertainty surrounding the impact of tariffs on the US economy.Of course, the
Author  TradingKey
Yesterday 03: 08
TradingKey - It’s been a rough few weeks for stock markets at technology stocks, in particular, have been hard hit by the uncertainty surrounding the impact of tariffs on the US economy.Of course, the
placeholder
U.S. February CPI Preview: Is the Era of Stagflation Approaching?On 12 March 2025, the United States will release the Consumer Price Index (CPI) data for February. Market consensus predicts that the year-over-year growth rates for Headline CPI and Core CPI will rea
Author  TradingKey
19 hours ago
On 12 March 2025, the United States will release the Consumer Price Index (CPI) data for February. Market consensus predicts that the year-over-year growth rates for Headline CPI and Core CPI will rea
placeholder
EUR/USD rises despite trade war risks as markets pivot to US CPI inflation dataEUR/USD bidders found the gas pedal on Tuesday, bolstering Fiber further and sending the pair back into the 1.0950 level, albeit briefly.
Author  FXStreet
3 hours ago
EUR/USD bidders found the gas pedal on Tuesday, bolstering Fiber further and sending the pair back into the 1.0950 level, albeit briefly.
placeholder
XRP Bears Dominate as Price Continues to Fall Further from January’s $3.40 PeakXRP continues its decline, falling 10% over the past week as bearish momentum strengthens.The fourth-largest cryptocurrency by market capitalization remains under pressure, with waning buying interest
Author  Beincrypto
2 hours ago
XRP continues its decline, falling 10% over the past week as bearish momentum strengthens.The fourth-largest cryptocurrency by market capitalization remains under pressure, with waning buying interest
goTop
quote