Production AI systems rarely use a single model. The compound AI pattern -- routing requests to different models based on complexity, cost, and latency -- is how companies actually deploy AI at scale.