Why would one want to pull a model several times from the server?

MelT · November 21, 2022, 11:14am

I’m thinking of storing the Rasa model on our own servers and fetching it for deployment from there. The Model Storage documentation describes an option to specify how often the model should be pulled. What I do not understand is why one would want to pull it every 10 seconds instead of pulling it once, since it does not change during that time, right? I would like to understand why someone would consider this, and it would be great to get some insights into it. Maybe @stephens has some ideas?

tomp · November 21, 2022, 4:09pm

Good question. I have no official knowledge from Rasa Corp, but I assume it should be efficient about pulling only when there have been updates. Checking often would ensure it grabs a model as soon as it is updated. If it’s not careful, this could be a waste of (network) resources.

MelT · November 22, 2022, 9:03am

Thanks @tomp, I agree. And if the model is not retrained often, you would probably also set the pull interval to “null”?

stephens · November 23, 2022, 1:09am

10 seconds seems aggressive to me. The source code shows that the model is downloaded at each interval and it checks the fingerprint to see if it is newer than the current model.
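To make the download-then-compare behaviour concrete, here is a minimal sketch. It is not Rasa's actual implementation: the `fingerprint` and `poll_once` helpers and the SHA-256 digest are assumptions for illustration, and the "model server" is just an in-memory list. The sketch only checks whether the fingerprint differs, not whether it is newer.

```python
import hashlib
from typing import Callable, Optional, Tuple


def fingerprint(model_bytes: bytes) -> str:
    """Hypothetical fingerprint: a SHA-256 digest of the model archive."""
    return hashlib.sha256(model_bytes).hexdigest()


def poll_once(
    download: Callable[[], bytes], current: Optional[str]
) -> Tuple[Optional[str], bool]:
    """Download the model and report whether it differs from the loaded one."""
    new = fingerprint(download())
    if new == current:
        # Unchanged: the download at this interval was wasted bandwidth.
        return current, False
    # Changed: a real client would load the new model at this point.
    return new, True


# Simulate three polling intervals against a fake in-memory model server.
served = [b"model-v1", b"model-v1", b"model-v2"]
loaded: Optional[str] = None
reloads = []
for archive in served:
    loaded, changed = poll_once(lambda a=archive: a, loaded)
    reloads.append(changed)

print(reloads)  # [True, False, True]
```

The second interval shows the cost of an aggressive setting: the archive is fetched in full even though nothing is reloaded.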

There are different approaches to updating the model on a production system, and this could be one (particularly for Kubernetes, where you have any number of production pods). I prefer k8s rolling updates triggered by a CD pipeline. You could also use the HTTP API to load a new model, but you then have to connect to each production instance.
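A rough sketch of the HTTP API route, to show why it means touching every instance. It assumes the server was started with the API enabled and that the endpoint is `PUT /model` with a `model_file` field in the JSON body; please verify both against the HTTP API reference for your Rasa version. The host names, port, and model path are hypothetical, and the actual network call is left commented out.

```python
import json
import urllib.request
from typing import List


def build_reload_request(base_url: str, model_file: str) -> urllib.request.Request:
    """Build (but do not send) a request asking one instance to load a model."""
    payload = json.dumps({"model_file": model_file}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/model",  # assumed endpoint path
        data=payload,
        headers={"Content-Type": "application/json"},
        method="PUT",
    )


# Hypothetical production instances; each one needs its own request.
instances: List[str] = ["http://rasa-0:5005", "http://rasa-1:5005"]
requests_to_send = [
    build_reload_request(url, "/app/models/example-model.tar.gz")
    for url in instances
]

for req in requests_to_send:
    print(req.get_method(), req.full_url)
    # To actually send it, uncomment the next line:
    # urllib.request.urlopen(req, timeout=30)
```

With pods that come and go, keeping the `instances` list accurate is the hard part, which is one reason I lean towards rolling updates.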

tanusharma · September 4, 2023, 11:14am

Hello, Pulling a model from a server at regular intervals, even if the core model remains static, can be beneficial for several reasons. Real-time updates ensure that dynamic components or external data sources are constantly integrated, maintaining accuracy. Frequent pulls enhance fault tolerance, reducing downtime during server or network disruptions. Load balancing becomes more efficient, distributing computational work evenly. Developers can easily switch between model versions for experimentation. Resource optimization is possible by loading the model only when needed, conserving resources. Finally, it’s a security measure, as periodic pulls enable prompt application of security patches and updates, ensuring a secure deployment.

I hope this will help you.
