What is the standard practice for Rasa Testing?

student99 · September 16, 2020, 3:00am

Hi All,

By convention how much stories should be tested using ‘Rasa Test’ command, all the data or just a small set. I have about 40 intents. Should I create stories that should fail in the end to end format? can I test fallback policy?

I can make all my stories test pass without changing nlu/core, with careful wording in my text in the end to end format is enough to pass, but surely there are other ways to give a realistic test? My chatbot is far from perfect but passes test with high accuracy, so perhaps i am not using good testing practice?

edit: I have also noticed if we are using a end2end format similar to this: ‘show me ‘[‘chinese’]’(cuisine)’ restaurants’ we already helping the chatbot extract the correct entity, sometime giving the correct entity is enough to predict a good NLU confidence, since most my entities are not reused for diff stories.

koaning · September 18, 2020, 2:10pm

You can also run rasa test nlu with a cross validation command. That way you can have each data point be represented as a test case once. Have you ever done this?

You might appreciate this benchmarking guide that I wrote for rasa nlu examples.

student99 · September 19, 2020, 12:04am

Thanks I have ran the benchmark tests for my NLU. the latter test (cross validation) I get less accuracy compared to Rasa test nlu where it outputs 100%. What does the output result of cross validation suggest.

koaning · September 22, 2020, 11:28am

Could you share the commands that you ran? There’s settings that might cause such changes.

student99 · September 24, 2020, 1:36am

I ran this following command: rasa test nlu --config basic-config.yml –cross-validation --runs 1 --folds 2 –out gridresults/basic-config

Topic		Replies	Views
Rasa test vs. rasa test core Rasa Open Source testing	3	1242	November 7, 2021
How to only test stories? Rasa Open Source	2	378	May 20, 2020
User input text in test-stories Rasa Open Source testing	3	233	November 21, 2023
Anyone written code to generate end-to-end stories from "regular" stories? Rasa Open Source	4	356	July 15, 2020
End to end testing runs nlu test on training data Rasa Open Source	4	512	August 18, 2020

What is the standard practice for Rasa Testing?

Related topics