The AI industry is hitting a hardware wall. From the compute costs of OpenAI's Sora to Google's struggle to run Gemini Nano on Pixel devices, the sector is shifting from showcasing capability to coping with scarcity. This deep dive explores the economics of inference, rate limits, and the desperate search for compute.