I am new to Rasa and have just done a POC, We are very excited to take it forward and build full application around it. Before we do that, we have some very fundamental questions around Scalability and Recovery. Answer to these will help us architect a solution better:
- How does rasa bot service scale across 100s or 1000s of parallel conversations: * Can we run many instances Rasa behind a load balancer to scale? * Should we build a gateway service that will play the role of creating a sticky session between user and one of many bot instances?
- In the event the bot service holding conversation with the user goes down, Is there a way to recover from this scenario, ie Is there a way to continue conversation on another bot service?
- Is there a way to replicate/send conversation state and slot for a conversation id to continue on a different bot instance to continue conversation with user?