The single biggest update to the Signals Desk view since the original thesis is the shape of the inference capex wave. Inference compute is the demand for AI processing power that supports running already-trained models at production scale. Every consumer-facing AI application, every enterprise AI deployment, and every embedded AI feature consumes inference compute. The demand scales with active usage rather than with training cycles.
The data over the past nine months shows that inference compute demand is growing at 80-100 percent year-over-year. The growth rate is higher than training compute demand because the deployment of trained models is happening faster than the training of new models. Every major hyperscaler has shifted capex allocation toward inference-oriented hardware in their forward guidance.
Nvidia's inference-oriented chip portfolio includes the H200, the L40S for inference workloads, the Grace-Blackwell platform, and the recently announced Rubin family. The competitive position in inference is slightly less unassailable than in training, with AMD's MI300 and MI400 platforms achieving some traction in inference workloads and the hyperscaler-designed custom silicon (Trainium, Inferentia, Maia, MTIA) capturing meaningful internal workloads.
The market dynamics in inference are therefore: total demand growing at 80-100 percent annually, Nvidia share approximately 75-85 percent (compared to 90 percent plus in training), and competitive intensity gradually increasing as alternative silicon scales. The net effect for Nvidia is still extraordinary growth, but with slightly more competitive pressure on price per chip than in the training-dominated phase.
The Signals Desk model now assumes Nvidia data centre revenue scales to approximately $280-320 billion by FY28. The FY26 actual of $185 billion is the launching point. The growth path implies continued 40-60 percent revenue growth in FY27 and 25-40 percent in FY28.