What Is AI Inference? Running AI Models in Production