OpenAI has announced the release of new AI reasoning models, 'o3' and 'o4-mini'. OpenAI calls o3 its 'most advanced reasoning model ever' and claims that it outperforms previous models in ...
Deploying AI inference at the edge—on smartphones, appliances, industrial devices, and vehicles—promises faster, more private, and more energy-efficient intelligence. Expedera’s packet-based NPU architecture ...
Testing LLM performance can be very complicated and time-consuming. There are also many variables, such as quantization, model conversion, and variation in input tokens, that can reduce a test’s ...
Inception, the company behind the first commercial diffusion large language models (dLLMs), today announced the launch of ...
The AI hardware landscape is evolving at breakneck speed, and memory technology is at the heart of this transformation. NVIDIA’s recent announcement of Rubin CPX, a new class of GPU purpose-built for ...
Yesterday, Microsoft made the software for its Maia 200 chip – its second-generation inference processor – available to developers. Microsoft AI chief Scott Guthrie called the Maia 200 “the most efficient ...
Roman Chernin is the CBO and cofounder of AI infrastructure company Nebius. His career spans over 20 years in the tech industry. Every major advance in AI begins with model training, but the ...
GIBO Holdings Ltd. (NASDAQ: GIBO) today announced a significant technological breakthrough in its proprietary AIGC (AI-Generated Content) multimodal engine, marking the transition into a ...
Positron AI, the leader in energy-efficient AI inference hardware, today announced an oversubscribed $230 million Series B financing at a post-money valuation exceeding $1 billion. This press release ...