I am building a bot using RASA. The performance of
NLU is really great! So thank you, RASA, for that. But as I move away from developing the bot to deploying the bot to production, I can’t help but wonder that “raw deployment of RASA”(as is) might just be an excruciating slow server/service. Keeping in mind that I expect the traffic to 500,000 upwards per month, how do I effectively deploy RASA at such a scale?
Furthermore, what are some gotchas or “scale issues” I need to keep in mind when doing this?
TLDR: deploy rasa at scale, how to do it, what to look out for?
Thanks, let the community benefit from the answers!!