Rasa 3.9 launched 🚀

Rasa Summer Release - Boosting Performance for Voice Assistants

Hey everyone, hope you’re all doing great! We’re excited to share some updates in Rasa Pro 3.9, aimed at making CALM voice assistants more efficient and high-performing. Here’s a rundown of what’s new:

  • We have optimized Rasa Pro for voice interactions with key runtime performance improvements:
    • Added a gRPC action server that can use gRPC protocol to efficiently handle double the amount of requests in the amount of time
    • Optimized model loading of bot config files
  • We’ve added multi-step prompting. This breaks the dialogue understanding module into multiple shorter steps and enables smaller and cheaper LLMs (GPT-3.5) to perform almost at the same functional performance as larger, stronger, but more expensive models (e.g. GPT-4)
  • You can now create custom information retrievers to improve search accuracy. This allows you to connect to any information retrieval source for more precise results.
  • We improved slot extraction by allowing CALM assistants to use either an LLM or a lightweight NLU extractor, such as Duckling. This reduces the number of LLM calls and lowers the operational costs and latency per user interaction.

See the full change log here.

Let us know what you think and how these new features work for you. Happy coding!