3. Compute Platform Evolution: From Scarcity to Heterogeneity

The infrastructure layer itself is evolving rapidly:

2025–2026: The Blackwell Era

Blackwell-class systems deliver roughly 2× per-GPU training efficiency and 3–5× per-GPU inference efficiency versus Hopper, with substantially larger gains at rack and cluster scale, where the 72-GPU NVLink domain of systems such as GB200 NVL72 reduces cross-node communication overhead.

2027–2028: The Vera Rubin Transition

As data centers run up against fixed power envelopes, performance per watt, rather than peak FLOPs, becomes the binding constraint: a site's deliverable compute is capped by its power budget multiplied by accelerator efficiency, as the arithmetic sketch below illustrates.
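A back-of-the-envelope sketch of that constraint in Python; the power budget and efficiency figures are hypothetical, chosen only to show that at fixed power, efficiency alone sets the throughput ceiling:

```python
def sustained_exaflops(site_power_mw: float, gflops_per_watt: float) -> float:
    """Deliverable compute at a fixed site power budget."""
    watts = site_power_mw * 1e6
    flops_per_sec = watts * gflops_per_watt * 1e9
    return flops_per_sec / 1e18

SITE_POWER_MW = 100.0  # hypothetical data-center power envelope

# Two hypothetical accelerator generations with the same peak FLOPs per
# chip, but generation B is twice as efficient. At fixed site power,
# the efficiency gain alone doubles sustained throughput.
print(sustained_exaflops(SITE_POWER_MW, 40.0))  # gen A -> 4.0 EFLOP/s
print(sustained_exaflops(SITE_POWER_MW, 80.0))  # gen B -> 8.0 EFLOP/s
```

Once the power envelope is fixed, buying more chips of the less efficient generation cannot close the gap; only perf/watt moves the ceiling.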

Toward 2030: The Rise of Non-NVIDIA Accelerators

By 2030, 25–35% of inference workloads are likely to run on non-NVIDIA silicon, led by AMD and by hyperscalers' in-house ASICs such as Google's TPUs and AWS's Inferentia. Training, however, is likely to remain NVIDIA-dominated well into the decade.

The long-term implication is a heterogeneous compute landscape, one that favors platforms and clouds that abstract hardware differences behind a common scheduling and programming layer rather than locking customers into a single vendor's architecture; a minimal placement sketch follows.
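A minimal sketch of what such an abstraction layer might look like: a scheduler routes each workload to the most efficient backend that can run it. All backend names and efficiency numbers here are hypothetical, not vendor specifications.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Backend:
    name: str
    supports_training: bool
    tokens_per_joule: float  # illustrative inference efficiency

# Hypothetical capability profiles for a mixed fleet.
BACKENDS = [
    Backend("gpu_vendor_a", supports_training=True, tokens_per_joule=3.0),
    Backend("gpu_vendor_b", supports_training=True, tokens_per_joule=2.5),
    Backend("inference_asic", supports_training=False, tokens_per_joule=6.0),
]

def place(workload: str) -> Backend:
    """Route a workload to the most energy-efficient backend that can run it.

    Training is restricted to general-purpose GPUs; inference may fall
    through to a fixed-function ASIC when one is more efficient.
    """
    candidates = [
        b for b in BACKENDS
        if workload == "inference" or b.supports_training
    ]
    return max(candidates, key=lambda b: b.tokens_per_joule)

if __name__ == "__main__":
    print(place("training").name)   # gpu_vendor_a: ASIC is excluded
    print(place("inference").name)  # inference_asic: best tokens/joule
```

The design point is that the customer expresses the workload, not the silicon; the platform owns the mapping, which is exactly the leverage single-architecture lock-in takes away.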