Rasa NLU crossvalidation result

rohitharitash · October 24, 2018, 11:30am

Hi guys,

I am sharing my Rasa NLU model's cross-validation evaluation results. I think my model is overfitting. Could you please have a look and suggest how to avoid this?

2018-10-24 16:48:52 INFO rasa_nlu.classifiers.embedding_intent_classifier - Finished training embedding policy, loss=0.009, train accuracy=1.000
2018-10-24 16:48:52 INFO rasa_nlu.model - Finished training component.
2018-10-24 16:48:56 INFO main - CV evaluation (n=10)
2018-10-24 16:48:56 INFO main - Intent evaluation results
2018-10-24 16:48:56 INFO main - train Accuracy: 1.000 (0.000)
2018-10-24 16:48:56 INFO main - train Precision: 1.000 (0.000)
2018-10-24 16:48:56 INFO main - train F1-score: 1.000 (0.000)
2018-10-24 16:48:56 INFO main - test Accuracy: 0.940 (0.020)
2018-10-24 16:48:56 INFO main - test Precision: 0.978 (0.012)
2018-10-24 16:48:56 INFO main - test F1-score: 0.952 (0.016)
2018-10-24 16:48:56 INFO main - Entity evaluation results
2018-10-24 16:48:56 INFO main - Entity extractor: ner_crf
2018-10-24 16:48:56 INFO main - train Accuracy: 1.000 (0.000)
2018-10-24 16:48:56 INFO main - train Precision: 1.000 (0.000)
2018-10-24 16:48:56 INFO main - train F1-score: 1.000 (0.000)
2018-10-24 16:48:56 INFO main - Entity extractor: ner_crf
2018-10-24 16:48:56 INFO main - test Accuracy: 0.996 (0.004)
2018-10-24 16:48:56 INFO main - test Precision: 0.996 (0.004)
2018-10-24 16:48:56 INFO main - test F1-score: 0.996 (0.004)
2018-10-24 16:48:56 INFO main - Finished evaluation
And my input details:
INFO:rasa_nlu.training_data.training_data:Training data stats:

- intent examples: 796 (8 distinct intents)

- Found intents: 'affirm', 'greet', 'enter_data', 'what_is_your_name', 'goodbye', 'order', 'are_you_a_robot', 'ask_howdoing'

- entity examples: 644 (3 distinct entities)

- found entities: 'phoneNumber', 'email', 'product'

Thanks

souvikg10 · October 24, 2018, 12:09pm

Can you format it with ```? It is difficult to read.

rohitharitash · October 25, 2018, 11:03am

Hi,

I have formatted the logs. Please have a look and suggest.

Thanks

souvikg10 · October 25, 2018, 12:06pm

It doesn’t seem to overfit, because the gap between train and test F1 is not large (1.000 vs 0.952 for intents). However, a train accuracy of exactly 1 does seem strange. Do you have your confusion matrix? Do you see any confusion between intents?
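If you don't have a confusion matrix handy, here is a minimal sketch of how you could count which intents get mixed up. It is plain Python and does not use any Rasa API; the two label lists are hypothetical placeholders, so you would have to fill them with the true and predicted intents from your own evaluation.

```python
from collections import Counter


def confusion_counts(true_intents, predicted_intents):
    """Count how often each (true, predicted) intent pair occurs."""
    return Counter(zip(true_intents, predicted_intents))


def most_confused(counts, top_n=5):
    """Return the most frequent misclassified (true, predicted) pairs."""
    errors = {pair: n for pair, n in counts.items() if pair[0] != pair[1]}
    # Sort by count (descending), then by pair so ties have a stable order.
    return sorted(errors.items(), key=lambda item: (-item[1], item[0]))[:top_n]


# Hypothetical labels for illustration only; not taken from the logs above.
true_intents = ["greet", "greet", "goodbye", "order", "affirm", "order"]
predicted_intents = ["greet", "affirm", "goodbye", "order", "affirm", "enter_data"]

counts = confusion_counts(true_intents, predicted_intents)
for (true, pred), n in most_confused(counts):
    print(f"{true} -> {pred}: {n}")
```

If one pair dominates the list, those two intents probably have overlapping training examples.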

Is this the tensorflow pipeline?

It could also be that the train/test split during cross-validation introduces a bias because the dataset is imbalanced. Do you have a validation dataset that the bot has never seen? If so, try evaluating on that without cross-validation.
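If you don't have such a dataset yet, one way to get one is to set aside part of your examples before training, keeping the share of each intent the same in both parts. A rough sketch in plain Python follows; `stratified_holdout` is a helper I made up for illustration, it is not part of Rasa, and the example data is hypothetical.

```python
import random
from collections import defaultdict


def stratified_holdout(examples, test_fraction=0.2, seed=42):
    """Split (text, intent) pairs so each intent keeps roughly the same share in both sets."""
    by_intent = defaultdict(list)
    for text, intent in examples:
        by_intent[intent].append((text, intent))

    rng = random.Random(seed)
    train, test = [], []
    for intent in sorted(by_intent):
        items = by_intent[intent]
        rng.shuffle(items)
        # Hold out at least one example per intent when it has two or more.
        n_test = max(1, round(len(items) * test_fraction)) if len(items) > 1 else 0
        test.extend(items[:n_test])
        train.extend(items[n_test:])
    return train, test


# Hypothetical, deliberately imbalanced data for illustration only.
examples = [(f"greet example {i}", "greet") for i in range(10)]
examples += [(f"order example {i}", "order") for i in range(40)]

train, test = stratified_holdout(examples)
print(len(train), len(test))
```

You would then train only on `train` and evaluate the trained model once on `test`, which the model has never seen.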
