Reproduce the results of rasa-nlu

mac_71128 · September 23, 2018, 1:48pm

I want to reproduce the results of rasa-nlu so that I get the same set of misclassified messages in each execution of my model. A couple of lines in the “intent_classifier_tensorflow_embedding” component introduce randomness. I set a seed for them, but I still get a different result each time I run my model. Do you have any solution for this problem? I would appreciate it if you could help me figure this out.

akelad · September 24, 2018, 8:31am

I’m not sure what you mean, could you post an example?

mac_71128 · September 25, 2018, 12:36pm

My data includes 2500 messages. I train my model on fixed training data, but I get a different number of misclassifications on “intent” and “entities” in each run. For example, the number of misclassifications in one run is 52, and in a second run with the same data and the same everything else, I got 63. Why is the number of misclassifications not fixed?

As a first attempt to find the cause, I tried to find and seed every random call. The “intent_classifier_tensorflow_embedding” component has a couple of such lines (“permutation” and “choice”). I set a seed for them, but I still get a different result each time I run my model. It would be great if you could mention a possible reason for this problem.

My final goal is to add a new component and check whether it reduces the number of misclassifications. In the meantime, since I get different results for the same data, I can’t rely on my results.

akelad · September 26, 2018, 9:54am

The reason it’s not fixed is that the ML model’s predictions will differ slightly every time you train it. But a difference of 52 vs. 63 out of 2500 isn’t a huge inconsistency. I’d suggest looking at the misclassifications and seeing whether you can improve your training data.
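As a rough illustration of why seeding only the “permutation” and “choice” calls may not be enough: a training run usually draws from more than one random source, and seeding one of them leaves the others untouched. The sketch below is a toy in plain Python, not Rasa code. `train_once`, `data_rng` and `init_rng` are hypothetical names, with `data_rng` standing in for the data-side calls and `init_rng` standing in for whatever generator the framework uses to initialise weights. Whether that is the actual cause in this component is an assumption.

```python
import random


def train_once(data_seed, init_seed):
    """Toy stand-in for one training run with two independent random sources."""
    data_rng = random.Random(data_seed)  # stands in for permutation/choice on the data
    init_rng = random.Random(init_seed)  # stands in for the weight initialiser

    order = list(range(10))
    data_rng.shuffle(order)              # order in which examples are visited
    negative = data_rng.choice(order)    # one sampled negative example
    weights = [init_rng.gauss(0.0, 1.0) for _ in range(3)]
    return order, negative, weights


# Data-side seed fixed, initialiser left unseeded (None seeds from OS entropy).
order_a, neg_a, w_a = train_once(data_seed=42, init_seed=None)
order_b, neg_b, w_b = train_once(data_seed=42, init_seed=None)
print("same data order:     ", order_a == order_b and neg_a == neg_b)
print("same initial weights:", w_a == w_b)

# Both sources seeded: the two runs start from an identical state.
run_c = train_once(data_seed=42, init_seed=7)
run_d = train_once(data_seed=42, init_seed=7)
print("fully seeded runs match:", run_c == run_d)
```

With only the data-side seed fixed, the visiting order matches across runs but the starting weights do not, so the trained models can still differ.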

ankeshp · July 3, 2019, 7:41am

@akelad I am facing the same problem in rasa core. The issue is not about the number of misclassifications but about being able to reproduce the results. The Rasa docs mention that “In order to get reproducible training results for the same inputs you can set the random_seed attribute of the KerasPolicy to any integer.” But it does not work, because the training data is shuffled internally on every training run. Also, your point that “ML models predictions will differ slightly every time you train it” is not correct. Given the same training data, the same initial weights, and the same training config, the final weights will always be the same. In my case of training a rasa core model, the training config is the same and the initial weights are the same (by setting random_seed), but the training data (shuffled_X, shuffled_Y) changes due to the preprocessing steps. I am investigating this further. If you have any insights, kindly share. Thanks!
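To make the point about data order concrete, here is a minimal sketch in plain Python, not rasa core or Keras code. It fits a one-variable linear model with SGD. The function `train` and its parameters `init_seed` and `shuffle_seed` are hypothetical names chosen for this example. It only covers seed-controlled randomness on CPU and does not model any hardware-level non-determinism.

```python
import random


def train(data, init_seed, shuffle_seed, epochs=5, lr=0.05):
    """Fit y = w*x + b with plain SGD; all randomness comes from the two seeds."""
    init_rng = random.Random(init_seed)        # controls the initial weights
    shuffle_rng = random.Random(shuffle_seed)  # controls the per-epoch data order
    w, b = init_rng.uniform(-1, 1), init_rng.uniform(-1, 1)
    for _ in range(epochs):
        batch = list(data)
        shuffle_rng.shuffle(batch)
        for x, y in batch:
            err = (w * x + b) - y
            w -= lr * err * x
            b -= lr * err
    return w, b


# Ten points on the line y = 2x + 1.
data = [(i / 10.0, 2.0 * (i / 10.0) + 1.0) for i in range(10)]

same_1 = train(data, init_seed=1, shuffle_seed=1)
same_2 = train(data, init_seed=1, shuffle_seed=1)
other_order = train(data, init_seed=1, shuffle_seed=2)

print("same weights, same order -> identical result:", same_1 == same_2)
print("same weights, new order  -> identical result:", same_1 == other_order)
```

The first two runs share both seeds and end with bit-identical weights. The third run starts from the same initial weights but sees the examples in a different order, and its final weights differ, which matches the behaviour described above for shuffled_X and shuffled_Y.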

akelad · July 5, 2019, 2:01pm

Hey @ankeshp, welcome to the community!

Well, it is correct if you use a different random seed each time. I think you submitted a PR for rasa core to solve the issues you’re talking about, right? The original poster here was talking about NLU, though, so this would really belong in a new post in the forum.
