Abu Dhabi launches low-cost AI reasoning model in challenge to OpenAI, DeepSeek

4 months ago 59

Omer Taha Cetin | Anadolu | Getty Images

A caller challenger successful the planetary artificial quality contention has entered the ring.

The Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), an AI-focused probe assemblage established by the United Arab Emirates, announced connected Tuesday the merchandise of a new, low-cost reasoning exemplary to rival OpenAI and DeepSeek.

It comes aft DeepSeek, a Chinese AI lab, earlier this twelvemonth shocked the world with the merchandise of a reasoning exemplary called R1 which it said could outperform OpenAI but with acold little grooming costs.

At conscionable 32 cardinal parameters, MBZUAI's model, dubbed K2 Think, is overmuch smaller than competing systems from OpenAI and DeepSeek. It was built connected apical of Alibaba's open-source Qwen 2.5 exemplary and is tally and tested connected hardware provided by AI chipmaker Cerebas.

For context, DeepSeek's R1 has a full of 671 cardinal parameters, which is fundamentally different word for the variables that an AI connection exemplary learns to recognize and make language. OpenAI doesn't disclose the parameter counts of its AI models.

K2 Think was developed successful concern with G42, the buzzy UAE-based AI firm backed by U.S. tech elephantine Microsoft. The researchers down it accidental it delivers show connected par with the flagship reasoning models of OpenAI and DeepSeek — contempt being a fraction of the size.

They cited the benchmarks AIME24, AIME25, HMMT25 and OMNI-Math-HARD, which subordinate to math, coding benchmark LiveCodeBenchv5 and subject benchmark GPQA-Diamond.

How did they bash it?

Hector Liu, manager of MBZUAI's Institute of Foundation Models, told CNBC the squad down K2 Think were capable to execute specified precocious levels of show by utilizing a fig of methods.

They see agelong chain-of-thought (CoT) supervised fine-tuning — a method of step-by-step reasoning — arsenic good arsenic alleged test-time scaling, which is simply a method for improving show by allocating other computing resources during "inferencing" — or, applying learned cognition to information it's ne'er seen before.

"What was peculiar astir our exemplary is we dainty it much similar a strategy than conscionable a model," Liu told CNBC. "So, dissimilar a regular open-source exemplary wherever we tin conscionable merchandise the model, we really deploy the exemplary and spot however we tin amended the exemplary implicit time."

"If you inquire maine which 1 of the azygous steps is the astir important, it's precise hard to say. It's much similar a strategy method enactment wherever each these methods combined delivered the last result," helium added.

Why does it matter?

There are 2 countries connected the satellite signifier that basal retired arsenic the forerunners successful the AI race: the U.S. and China.

America's tech giants and startups similar OpenAI led the aboriginal momentum with alleged instauration models, which purpose to fulfill a wide scope of tasks by relying connected immense amounts of grooming data. However, DeepSeek's breakthrough with R1 earlier this twelvemonth reinforced China's presumption arsenic a formidable AI subordinate successful its ain right.

More recently, the UAE has sought to presumption itself arsenic a planetary person successful AI successful a bid to heighten its geopolitical power and diversify its system beyond crude lipid dependency.

The portion tin constituent to its AI improvement steadfast G42 arsenic an illustration of however it's gaining crushed successful the space. However, it faces fierce contention from neighboring Saudi Arabia, which is looking to make full-stack AI capabilities via Humain, a institution launched nether the Public Investment Fund successful May.

Beyond that, determination are besides geopolitical complexities that shroud the UAE's AI ambitions. Microsoft's concern and concern with G42 past twelvemonth attracted a great woody of scrutiny successful the U.S. related to the company's narration with China.

More broadly, the UAE's AI manufacture inactive has a agelong mode to spell to scope the standard of its U.S. and Chinese counterparts. OpenAI and the Big Tech players person enjoyed a bully caput commencement with their respective instauration AI models, portion Beijing has agelong considered AI a strategical priority.

Focus connected technological breakthroughs

While K2 Think demonstrates show connected par with OpenAI, the system's developers accidental the purpose is not to physique a chatbot similar ChatGPT. Richard Morton, managing manager for MBZUAI's Institute of Foundation Models, explains the exemplary is intended to service circumstantial uses successful fields similar mathematics and science.

"The information is that the cardinal reasoning of the quality encephalon is the cornerstone of each the reasoning process," Morton told CNBC.

"With this peculiar application, alternatively of taking 1,000, 2,000 quality beings 5 years to deliberation done a peculiar question, oregon spell done a peculiar acceptable of objective trials oregon thing similar that, this vastly condenses that period."

It could besides grow the scope of precocious AI technologies successful regions that don't person entree to the benignant of superior and infrastructure U.S. firms possess.

"What we're discovering is that you tin bash a batch much with less," Morton said.

Read Entire Article