Different DIET intent confidence scores after each training run

Hi,

I am using the DIETClassifier with about 15 intents. Nothing heavy at all. I have an 8 GB RAM virtual machine with 4 cores, and training takes about 4 minutes.

However, every time I retrain I get different confidence scores for my intents, even though my nlu.yml remains unchanged. Can somebody explain why this might be happening?

Thanks, Mark

@pomegran I am not sure, but please try this approach:


# config.yml, under the pipeline: section
- name: DIETClassifier
  epochs: 100
  model_confidence: linear_norm
  constrain_similarities: true
  random_seed: 1

Set random_seed to any fixed integer.

@pomegran make sure to delete older trained models.

Will do and will report back.

Thanks @nik202 !

Nik gave you the answer: just set random_seed.

It can also be set in the ResponseSelector and TEDPolicy (basically any component that needs training).


But my message is going to focus on the why, because I’ve been seeing this question a lot on the forum:

It’s because machine learning involves randomness, mainly when splitting the data into training and testing/validation sets (and also when initializing the model’s weights). Without a fixed seed, a different chunk of your data is selected for training and testing on each run.

To “make that randomness not random”, you use a random seed. I used quotes because setting a seed doesn’t remove the randomness, it just controls it.

Example:

  • Generating numbers between 1 and 10 without seed:
Try 1: 1 4 2 5 6
Try 2: 8 7 2 4 1
  • Generating numbers between 1 and 10 with seed:
Try 1: 3 1 6 9 4
Try 2: 3 1 6 9 4

The sequence is still random, but it’s the same randomness on each run.
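
If you want to see this concretely, here is a tiny Python sketch using the standard random module (purely an illustration of seeding, not the actual DIET code; the helper draw_five is made up for the example):

import random

def draw_five(seed=None):
    # Draw five numbers between 1 and 10; passing a seed makes the draw repeatable.
    if seed is not None:
        random.seed(seed)
    return [random.randint(1, 10) for _ in range(5)]

print(draw_five())        # different numbers on every run
print(draw_five())
print(draw_five(seed=1))  # identical numbers on every run
print(draw_five(seed=1))

The last two calls print the same list because the generator is reset to the same starting point before each draw.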

Fun fact: When we talk about a Minecraft World Seed, this is exactly what it’s about. Using the same seed will create the same randomly generated world.

Thank you both, this is very interesting.

I understand the concept of seeds, but I was unaware there was randomness in classifier training. I’m surprised the seeds aren’t fixed by default.

If the seed is fixed, does the order of the intent training data impact the scores? Apologies for my lack of understanding here, but what is the benefit of using a different seed for each training run?

This does explain my weird results though!

That’s a nice question… :thinking:

No need to apologize! Those are nice genuine questions!

As for the benefit of a different seed each time: it can help you avoid settling on a very low confidence that was just a coincidence due to an unlucky seed.

No, I guess the order of the training data will not be an issue. On the other hand, if we want to understand how the training data impacts confidence scores, please see this explanation:

There are two main pitfalls that affect model accuracy and confidence scores, both due to training data. The first is when there isn’t much training data available. The second is when there is an abundance of training data, but it is skewed or not representative of the data to be predicted.

If either is the case, there is a significant risk that the model will overfit and produce high confidence scores for inaccurate predictions. This is because the scoring population is inconsistent with the training population. It’s like building a model to identify oranges and then using it to predict apples.

We use a random seed value when creating the training and test data sets. The goal is to make sure we get the same training and validation sets while trying different hyperparameters or machine learning algorithms, so that the performance of different models can be compared fairly.
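
As a generic illustration of that idea (this uses scikit-learn’s train_test_split, not Rasa’s internal splitting code), fixing random_state gives you the same split on every run:

from sklearn.model_selection import train_test_split

# Toy data made up for the example.
examples = [f"example {i}" for i in range(100)]
labels = [i % 3 for i in range(100)]

# The same random_state always produces the same split, so models trained
# with different hyperparameters are compared on identical data.
train_x, test_x, train_y, test_y = train_test_split(
    examples, labels, test_size=0.2, random_state=1
)

Change random_state and you get a different, but equally reproducible, split.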

I hope this gives you a clearer idea. And don’t worry about good or bad questions; it’s good that you asked! Good luck!


Thanks both. Very interesting input!

@pomegran No worries, Mark. Please mark a solution and close this thread, for your reference and for others.