Why i get different confidence scores for the same question when i only change the training phrase order position

sfurao · May 5, 2021, 3:22pm

First training data order:

intent: welcome

hello
hi
hey
how are you

Second training data order: intent: welcome

how are you
hello
hey
hi

When i do the parse for the same question “Hello” i got different confidences

koaning · May 6, 2021, 8:53am

Once trained, neural networks are deterministic. Training neural networks, typically, is a stochastic process though. The weights of layers are often initialised randomly but the batches of data that create the gradient signal are usually also stochastically sorted. This could explain part of what you’re experiencing but I also want to double check: did you also add examples in any of the other intents? That would certainly also influence the confidence scores.

sfurao · May 6, 2021, 4:01pm

@koaning Thank you so much for replying. I didn’t change the others intents, only this one. And as i said i didn’t add new data, i only changed the order of the training phrases of the intent:welcome like the example.

I always have random_seed: 1 in the pipeline:

language: pt pipeline:

name: tokenizer_whitespace
name: ner_crf features: [[“low”],[“bias”, “low”, “prefix5”, “prefix2”, “suffix5”,“suffix3”, “suffix2”, “digit”,“pattern”],[“low”]]
name: ner_synonyms
name: intent_featurizer_count_vectors lowercase: true OOV_token: None
name: intent_classifier_tensorflow_embedding random_seed: 1
name: “ner_duckling_http” url: “http://rasa_duckling:8000” locale: “pt_PT” timezone: “UTC” dimensions: [“amount-of-money”,“distance”,“duration”,“email”,“phone-number”,“quantity”,“temperature”,“time”,“url”,“volume”,“number”]

Your explanation seems to make sense and it could explain the pipeline behaviour. Thank you so much!

koaning · May 6, 2021, 4:35pm

What version of Rasa are you using here? You can confirm via;

rasa --version

sfurao · May 6, 2021, 4:41pm

Im working with a legacy code base that works with version 0.14.6

Topic		Replies	Views
Same training data in different projects give different confidence scores Rasa Open Source	3	558	February 26, 2019
Rasa shell command giving intent confidence Rasa Open Source	11	4793	January 14, 2021
Difference in intent prediction confidence values across rasa1.x and rasa2.x Rasa Open Source	3	565	June 9, 2021
Confidence score Different between AWS instance and Local system Rasa Open Source	6	539	October 24, 2019
Environment Confidence Discrepancies Rasa Open Source	3	240	November 20, 2020

Why i get different confidence scores for the same question when i only change the training phrase order position

Related topics