Comparing NLU Pipelines with data augmentation

rosodelu · October 25, 2021, 1:28pm

Hello,

I am writing to ask if anyone can help me, I want to test different NLU pipelines, using different training/test sets already defined, instead of passing a random division with different percentages: rasa test nlu --nlu data / nlu.yml --config config_1.yml config_2.yml --runs 4 --percentages 0 25 50 70 90

What happens is that I have divided my original data into 5 folds and created for each fold augmented data and concatenated it to the original data. I do not want to mix the data to test the different pipelines, since I want the splits to be independent and for this reason, I prefer to pass already established files.

If anyone can help me with this I would appreciate it!

koaning · November 3, 2021, 12:42pm

A while ago I wrote a data augmentation tool for Rasa which also provides a benchmarking guide. The use-case differs slightly from yours, but the guide may still be useful.

rosodelu · November 3, 2021, 1:07pm

I will take a look at it!!! Thank you!

Topic		Replies	Views
A way to compare different NLU Pipelines with new test data Rasa Open Source	4	714	September 10, 2021
Comparing pipeline Performance Getting Started with Rasa	2	162	January 20, 2021
Comparison of different pipelines with rasa_nlu version 0.15.1 on jupyter notebook Rasa Open Source	0	507	December 3, 2019
Graphics' overlap Rasa Open Source	0	202	June 30, 2021
Comparison between 2 models on RASA 2.0 Rasa Open Source	2	1171	October 27, 2020

Comparing NLU Pipelines with data augmentation

Related topics