Machine learning model serving architectures
Xebia
APRIL 2, 2024
In this blog we will discuss the most common serving architectures 1 ; batch predicting, on-demand synchronous serving and streaming serving. A prediction is as stale as the input data and model used to compute it. A higher ingestion frequency means more time sensitive use cases can be addressed.
Let's personalize your content