Cerebras CEO confirms OpenAI GPT 5.4 runs on Cerebras, plans GPT 5.5 rollout

1 hour ago 1



Cerebras Systems CEO Andrew Feldman has confirmed that OpenAI’s GPT-5.4 model is now running on Cerebras hardware, with plans to bring GPT-5.5 onto the platform in the near term. The announcement deepens what has already become one of the most consequential infrastructure partnerships in AI.

The OpenAI-Cerebras relationship was formalized on January 14, 2026, when the two companies announced a multibillion-dollar agreement reportedly exceeding $10 billion. The deal covers the provision of up to 750 MW of compute capacity over a three-year window stretching through 2028 or 2029, focused on ultra-low-latency inference for OpenAI’s flagship products, including ChatGPT.

The first major milestone came on February 12, 2026, when the GPT-5.3-Codex-Spark model launched on Cerebras hardware. That model clocked in at over 1,000 tokens per second. Now, with GPT-5.4 confirmed live and GPT-5.5 in the pipeline, Cerebras is becoming the preferred inference engine for OpenAI’s entire next-generation lineup.

Every single time someone asks ChatGPT a question, that’s an inference call. Every API request from a developer building on OpenAI’s models triggers inference. The over-1,000-tokens-per-second benchmark set by GPT-5.3-Codex-Spark on Cerebras hardware signals that real-time AI interactions are becoming table stakes for applications like live coding assistants, real-time translation, and conversational agents.

Cerebras achieves this through its wafer-scale engine, a chip architecture that is the size of an entire silicon wafer. While Nvidia and AMD design individual chips that get networked together, Cerebras builds one enormous processor, eliminating much of the communication overhead that slows down traditional multi-chip setups.

Cerebras completed its IPO in May 2026, entering the public markets with a market capitalization reported between $63 billion and $95 billion. The OpenAI partnership, worth more than $10 billion, is a significant factor in that valuation.

Community insights and company reports indicate that internal trials of GPT-5.4 and GPT-5.5 are already underway, with public release anticipated shortly. The 750 MW compute capacity commitment through 2028/2029 represents a multi-year revenue floor for Cerebras. The risk worth monitoring is customer concentration: a deal exceeding $10 billion with a single customer means Cerebras’ fortunes are heavily tied to OpenAI’s trajectory and willingness to continue the relationship.

Disclosure: This article was edited by Editorial Team. For more information on how we create and review content, see our Editorial Policy.

Read Entire Article