[Deployment] Rasa with High Availability?

fun · February 7, 2019, 5:55pm

Based on the docs and installation (e.g. Docker) the model files are being saved on the local disks.

How do you handle the case when the machine/disk dies?

Has anyone set up Rasa with HA in AWS with Ansible/Docker and could share your setup?

Thanks!

mauricedoepke · February 19, 2019, 11:35am

EDIT: Redis seems unsuitable for that usecase.

I don’t have experience with such scenarios. But I guess you could deploy multiple rasa instances that have all the same model files (maybe from an aws bucket, or just plain copies) connect them to a high available tracker store (mongo cluster) and put these rasa instances behind a load balancer.

According to the mongodb faq:

MongoDB is consistent by default: reads and writes are issued to the primary member of a replica set. Applications can optionally read from secondary replicas, where data is eventually consistent by default. Reads from secondaries can be useful in scenarios where it is acceptable for data to be slightly out of date, such as some reporting applications. Applications can also read from the closest copy of the data (as measured by ping distance) when latency is more important than consistency.

This means rasa will always use the newest conversation state to predict the next action. Redis does not guarantee this which might lead to a corrupted tracker state or wrong predictions.

Topic		Replies	Views
Rasa Deployment in AWS cluster mode Rasa Open Source	1	457	August 24, 2020
Tracker Store Selection Rasa Open Source	2	1681	December 9, 2020
Rasa loads latest model only, ignores -m model/{model.tar.gz} Rasa Open Source	1	302	March 25, 2022
Creating a Scalable Rasa Cluster Without Kubernetes or Docker Rasa Open Source	0	290	February 24, 2021
Add conversation persistence with load balancer (nginx, redis, found possible solution) Rasa Open Source	1	616	July 29, 2022

[Deployment] Rasa with High Availability?

Related topics