What Jalapeño establishes institutionally is that the economics of frontier AI inference are no longer fixed by Nvidia's pricing. OpenAI's chip, produced with Broadcom in under a year, is designed for the specific workload profile of LLM inference, not general GPU compute. That specificity is the point: a purpose-built chip for a known workload can undercut general-purpose hardware on cost without needing to match it on flexibility. Either the enterprises and developers building on OpenAI's API will receive those savings, or OpenAI will pocket them as margin; both outcomes improve OpenAI's competitive position relative to labs still paying full Nvidia rates.