Open-Source AI Achievements: NovaSky's Budget-Friendly Sky-T1 Model

Jimmy Jing

13 Jan 2025 — 2 min read

On Friday, the NovaSky research team at UC Berkeley released a new reasoning model, Sky-T1-32B-Preview, that performs comparably to OpenAI's o1-preview. Significantly, it is open-source and was built in just 19 hours for under $450 using eight Nvidia H100 GPUs.

The team developed Sky-T1 by fine-tuning Alibaba's Qwen2.5-32-Instruct and trained it on data generated with QwQ-32B-Preview, another open-source model comparable to o1-preview. Utilizing synthetic training data helps lower the costs.

"We curate the data mixture to cover diverse domains that require reasoning, and a reject sampling procedure to improve the data quality. We then rewrite QwQ traces with GPT-4o-mini into a well-formatted version, inspired by Still-2, to improve data quality and ease parsing," the team says of their data preparation process in the blog.

Outperforming OpenAI's o1-preview

The model performed at or above o1-preview's level on math and coding benchmarks but did not surpass o1 on the graduate-level benchmark GPQA-Diamond, which includes more advanced physics-related questions. NovaSky open-sourced all parts of the model, including weights, data, infrastructure, and technical details.

o1 is now out of preview and is therefore more capable than its initial release. Moreover, OpenAI is already preparing to launch o3, which the company says can outperform o1. However, as the NovaSky team highlights, the ability to build Sky-T1 so quickly "demonstrate[s] that it is possible to replicate high-level reasoning capabilities affordably and efficiently."

A More Affordable Reasoning Model

The relatively short 19-hour training time means Sky-T1 cost just $450 to build, according to Lambda Cloud pricing, the team clarifies in the blog post. Considering GPT-4 used a suspected $78 million in compute, it is no small feat to present an example of a more affordable reasoning model that can be replicated by academic and open-source groups that lack OpenAI's funding.

Almost half of those adopting generative AI want it to be open-source, citing cost and trust concerns. Continued breakthroughs in open-source AI could create a more even playing field for smaller labs, nonprofits, and other entities to develop competitive models — a refreshing turn for a new field already dominated by tech giants.

Will RedNote get banned in the US?

There’s a certain irony in the recent wave of TikTok users transitioning to RedNote. Originally, the clause to either divest or ban TikTok was aimed at curbing the influence of foreign-owned social networks potentially susceptible to the Chinese government's control. However, the move has unintentionally led users

Apple Temporarily Suspends Notification Summaries in iOS 18.3 Beta

Apple has announced a temporary suspension of the notification summaries feature for news and entertainment applications in its latest iOS 18.3 developer beta. This decision was confirmed following reports highlighting inaccuracies in content summarizations, which stemmed from the Apple Intelligence platform. In response to criticisms over these inaccuracies, particularly

Biden punts the TikTok ban to Trump

The Biden administration has declared that it will defer dealing with the controversy surrounding the TikTok ban to President Donald Trump, who will be stepping into office shortly. A White House official clarified, “Our position on this has been clear: TikTok should continue to operate under American ownership. Given the

Sony Launches Black PlayStation 5 Accessories for Preorder

In conjunction with the recent CES event, Sony has unveiled a selection of black PlayStation 5 accessories, which are now open for preorder before they hit the shelves on February 20th. The new lineup features a variety of items, including the DualSense Edge controller priced at $199.99, the Pulse