The conversation opens by arguing that AI is changing memory from a background component into a strategic data-center asset, especially as inference becomes more important than training. The guest explains how Micron has been preparing across HBM, LPDDR5, SSDs, and related products, then introduces KV cache and longer context windows as the key reason memory demand rises sharply during decoding. The central idea is that inference requires systems to remember prior state, and if they cannot, recomputation drives much higher compute usage.
The discussion repeatedly argues that inference, longer context windows, and agentic workloads create persistent demand for high-performance memory and storage.
The episode stresses that AI scaling is increasingly limited by memory bandwidth and data-center power budgets, not just flops or storage size.
The guest argues that AI-generated content and broader retrieval patterns increase how often data is accessed, lifting demand for SSDs and related infrastructure.
Micron says the industry is already behind demand and that new fabs take a long time to come online, limiting near-term supply relief.
The company describes using AI to speed engineering, detect issues, improve yields, and accelerate fab ramps, which could compound future supply advantages.