I’m currently working on a project with a Rasa custom action that sends user questions to a FastAPI backend. The backend forwards each question to the OpenAI API to generate an answer in natural language (using the gpt-3.5-turbo model), which is then sent back to the Rasa server.
I’ve noticed that the OpenAI API has an option to send responses as a stream (using `stream=True`), which results in the model’s response arriving incrementally, piece by piece, similar to the experience with ChatGPT. I’m interested in implementing this streaming response in my chatbot to create a more dynamic and interactive user experience.
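For context, here is roughly what consuming that stream looks like on my backend. This is a simplified sketch: the `fake_chunks` list stands in for the iterator that `openai.ChatCompletion.create(..., stream=True)` returns, using the documented delta-chunk shape.

```python
# Sketch of consuming an OpenAI streaming response.
# `fake_chunks` simulates the iterator returned by
# openai.ChatCompletion.create(model="gpt-3.5-turbo", ..., stream=True);
# each chunk carries a partial "delta" of the assistant message.
fake_chunks = [
    {"choices": [{"delta": {"role": "assistant"}}]},
    {"choices": [{"delta": {"content": "Hello"}}]},
    {"choices": [{"delta": {"content": ", world"}}]},
    {"choices": [{"delta": {}}]},  # final chunk carries an empty delta
]

def collect_stream(chunks):
    """Yield only the text pieces from a stream of delta chunks."""
    for chunk in chunks:
        piece = chunk["choices"][0]["delta"].get("content")
        if piece:
            yield piece

answer = "".join(collect_stream(fake_chunks))
print(answer)  # -> Hello, world
```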
My goal is to have this stream sent to the Rasa server and then from Rasa to my chatbot UI (built with React), so it will print the answer character by character as it is received.
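One way I could relay the stream from FastAPI toward the UI is Server-Sent Events (just an assumption on my part, not something from the Rasa docs). A sketch of the formatting half, with the chunk source simulated — in a real app I would wrap `sse_events(...)` in FastAPI's `StreamingResponse` with `media_type="text/event-stream"` and read it in React with the `EventSource` API:

```python
# Sketch: format streamed text pieces as Server-Sent Events (SSE).
# In FastAPI this generator would be returned as
#   StreamingResponse(sse_events(pieces), media_type="text/event-stream")
# The `pieces` iterable here simulates text deltas coming from OpenAI.
def sse_events(pieces):
    for piece in pieces:
        # Each SSE event is a "data: <payload>" line followed by a blank line.
        yield f"data: {piece}\n\n"
    yield "data: [DONE]\n\n"  # sentinel so the client knows to stop reading

events = list(sse_events(["Hel", "lo"]))
print("".join(events))
```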
I’ve seen the `stream_response` method in the Rasa documentation, but I’m not sure whether it applies to my use case or how I would go about implementing it.
Does anyone have any ideas or suggestions on how to implement this kind of streaming response in Rasa? Is it even possible to do this with the current capabilities of Rasa?
Any guidance or advice would be greatly appreciated. Thank you!
You can receive the streaming response from OpenAI, convert it into chunks, and send those chunks to Rasa step by step using `stream_response`. Handle the backend processing efficiently so you don’t overwhelm your Rasa server.
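To keep the load manageable, one approach is to batch the small pieces into larger chunks before forwarding them. A minimal sketch of that idea — all names here are illustrative, and `dispatch` is just a placeholder for however you actually forward a chunk (for example `dispatcher.utter_message` inside a Rasa custom action):

```python
# Sketch: batch a token/character stream into larger chunks before
# forwarding, so each hop (backend -> Rasa -> UI) handles a handful of
# messages instead of one per token. `dispatch` is a placeholder for
# whatever forwarding call you use on the Rasa side.
def batch_chunks(stream, min_size=20):
    """Group small stream pieces into chunks of at least min_size chars."""
    buffer = ""
    for piece in stream:
        buffer += piece
        if len(buffer) >= min_size:
            yield buffer
            buffer = ""
    if buffer:  # flush whatever remains when the stream ends
        yield buffer

def relay(stream, dispatch, min_size=20):
    """Forward a stream chunk by chunk via the given dispatch callable."""
    for chunk in batch_chunks(stream, min_size):
        dispatch(chunk)

sent = []
relay(["Hello ", "there, ", "how ", "are ", "you?"], sent.append, min_size=12)
print(sent)  # -> ['Hello there, ', 'how are you?']
```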
Any updates on this? I can’t find any good resources on implementing streaming with Rasa.
I’m not exactly sure what the `stream_response` option is or how to set it to true.