How to scale Rasa X to handle more concurrent users

andrewck · September 29, 2021, 10:45am

Rasa X: 0.39.2 Rasa: 2.6.0-full

Server:

4 cores, 16 GiB RAM, 150 GB SSD

During testing, we saw one of the Rasa X components become CPU-bound, which limits performance. I do not recall exactly which component it was.

Our main question for the Rasa X team is: how do we scale Rasa X to handle more concurrent users?

We used Helm to install Rasa X on Kubernetes. Which deployments or StatefulSets need to be scaled up? And does this require us to reconfigure anything else besides scaling out to more pods?

Thanks in advance!

ChrisRahme · September 29, 2021, 4:40pm

You can increase the number of replicas for your pods. In my opinion, these are the most useful pods to replicate:

| Service | Role |
| --- | --- |
| `rasa-x` | Running the HTTP API |
| `rasa-production` | Running a trained model, parsing intents, predicting actions |
| `rasa-worker` | Training and evaluating models |
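
As a rough sketch of what increasing replicas could look like, the Python snippet below builds `kubectl scale` commands for two of the services listed above. The deployment names (`rasa-x-rasa-production`, `rasa-x-rasa-worker`), the namespace `rasa`, and the replica counts are assumptions for illustration only; the real names depend on your Helm release, so check them with `kubectl get deployments` first. The snippet only prints the commands and does not run them.

```python
import shlex


def scale_command(deployment, replicas, namespace="rasa"):
    """Build a `kubectl scale` command (as an argument list) for one deployment."""
    if replicas < 1:
        raise ValueError("replicas must be at least 1")
    return [
        "kubectl", "scale", f"deployment/{deployment}",
        f"--replicas={replicas}",
        "--namespace", namespace,
    ]


# Hypothetical deployment names and replica counts; adjust to your release.
desired_replicas = {
    "rasa-x-rasa-production": 3,
    "rasa-x-rasa-worker": 2,
}

for name, count in desired_replicas.items():
    # Print the command so it can be reviewed before running it by hand.
    print(shlex.join(scale_command(name, count)))
```

Replica counts changed this way are likely to be reset by the next `helm upgrade`, so for a lasting change it is probably better to set them in the chart's values file. It is also worth confirming in the chart documentation that each service tolerates running with more than one replica.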

You can learn more in the Rasa Advanced Deployment Workshop.
