Run_in_thread for model-loading

tejabhat · May 22, 2023, 4:51am

Hi,

We have an issue: our Rasa server stops responding while it loads a new model.

We found that adding the @run_in_thread decorator to the /model API in server.py (as given below) makes the call non-blocking, which serves our purpose.

How can I override this API gracefully, without touching the Rasa code? Please give us some hints if possible.

@app.put("/model")
@run_in_thread  # the decorator we added
@requires_auth(app, auth_token)
async def load_model(request: Request) -> HTTPResponse:
    ...  # body unchanged
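
To illustrate why moving the load into a thread helps, here is a minimal standard-library sketch of the general pattern. It is not Rasa's implementation of run_in_thread; `run_blocking_in_thread`, `slow_load` and `demo` are hypothetical names, and the `time.sleep` call only stands in for a slow model load. The idea is that the blocking work runs in a worker thread while the event loop stays free to answer other requests.

```python
import asyncio
import functools
import time


def run_blocking_in_thread(func):
    """Make a blocking function awaitable without stalling the event loop.

    Hypothetical helper for illustration only; not Rasa's run_in_thread.
    """
    @functools.wraps(func)
    async def wrapper(*args, **kwargs):
        loop = asyncio.get_running_loop()
        # None selects the loop's default thread pool executor.
        call = functools.partial(func, *args, **kwargs)
        return await loop.run_in_executor(None, call)
    return wrapper


@run_blocking_in_thread
def slow_load(path):
    time.sleep(0.2)  # stands in for a slow, blocking model load
    return f"loaded {path}"


async def demo():
    ticks = 0

    async def heartbeat():
        # Simulates other requests being served during the load.
        nonlocal ticks
        while True:
            await asyncio.sleep(0.01)
            ticks += 1

    beat = asyncio.create_task(heartbeat())
    result = await slow_load("model.tar.gz")
    beat.cancel()
    return result, ticks


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

If the heartbeat counter is greater than zero after the load finishes, the loop kept running while the load was in progress, which is the behaviour we are after.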

Best Regards, tejaswini

stephens · May 23, 2023, 7:56pm

I don’t know that you can resolve the issue with that approach. I normally see users bringing up multiple instances and using a load balancer as a proxy to switch from an instance with the old model to another instance with the new model.
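
A rough sketch of that switch-over logic, under stated assumptions: `Router` and `switch_when_ready` are hypothetical names, the instance URLs are placeholders, and the readiness check is passed in as a function because how you detect a loaded model depends on your setup. A real deployment would do this in the load balancer's configuration rather than in Python.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Router:
    """Tracks which instance receives traffic (stand-in for a load balancer)."""
    active: str

    def switch_when_ready(self, candidate: str, is_ready: Callable[[str], bool]) -> bool:
        # Flip traffic only after the candidate reports its model is loaded;
        # until then the old instance keeps answering requests.
        if is_ready(candidate):
            self.active = candidate
            return True
        return False


if __name__ == "__main__":
    # Placeholder URLs; the lambdas stand in for a real readiness check.
    router = Router(active="http://rasa-old:5005")
    router.switch_when_ready("http://rasa-new:5005", lambda url: False)
    print(router.active)  # still the old instance
    router.switch_when_ready("http://rasa-new:5005", lambda url: True)
    print(router.active)  # now the new instance
```

The point is that the old instance stays in service for the whole time the new one is loading, so no request ever hits a server that is busy with a model load.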
