Alkyne O3 - Search News

News

OpenAI’s o3 model might be costlier to run than originally estimated

When OpenAI unveiled its o3 “reasoning” AI model in December, the company partnered with the creators of ARC-AGI, a benchmark designed to test highly capable AI, to showcase o3’s capabilities.

TechCrunch4d

OpenAI’s o3 AI model scores lower on a benchmark than the company initially implied

A discrepancy between first- and third-party benchmark results for OpenAI’s o3 AI model is raising questions about the company’s transparency and model testing practices. When OpenAI unveiled ...

ZDNet8d

OpenAI just dropped new o3 and o4-mini reasoning AI models - and a surprise agent

As a result, like older models, o3 and o4-mini show strong and even improved performance in coding, math, and science tasks. However, they also have an important new addition: visual understanding.

TechSpot4d

OpenAI's newest o3 and o4-mini models excel at coding and math – but hallucinate more often

A hot potato: OpenAI's latest artificial intelligence models, o3 and o4-mini, have set new benchmarks in coding, math, and multimodal reasoning. Yet, despite these advancements, the models are ...

Bleeping Computer15d

ChatGPT's o4-mini, o4-mini-high and o3 spotted ahead of release

OpenAI is preparing to launch as many as three new AI models, possibly called "o4-mini", "o4-mini-high" and "o3". Right now, ChatGPT has as many as five models, including GPT 4o (the non-reasoning ...

Yahoo Finance4d

OpenAI's o3 AI model scores lower on a benchmark than the company initially implied

Epoch found that o3 scored around 10%, well below OpenAI's highest claimed score. That doesn't mean OpenAI lied, per se. The benchmark results the company published in December show a lower-bound ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results