Hi,
I think I’ve seen this answered before, but I was wondering if rasa was multithreaded. If there were many many requests per second, how well would it perform? And what’s the best way for scaling? Is the recommended method to dockerize and scale that way?
The nginx container of the platform is setup to handle that amount of concurrent users. The actual system requirements of the platform shouldn’t differ too much from what we’ve specified in our documentation.