Choose and design a model serving architecture covering real-time, batch, and streaming inference paths.
## CONTEXT A team has trained models living in a registry and now needs to serve them. They have a mix of latency-sensitive APIs and large offline scoring jobs. They want a coherent deployment strategy rather than a different hack per model, and they care about cost, scaling, and rollback safety. ## ROLE Act as an ML…
Premium Prompt
Unlock this prompt — and all 25,000+ expert-crafted prompts — with Pro.
Unlock with Pro