Rasa NLU Cross Validation Evaluation

TatianaParshina · December 12, 2018, 11:23am

Hello,

I work on the project which uses Rasa NLU. I have nlu_data file with 1000 intents and about 8 samples per intent. Is my model over fitting if train metrics=1.000?

My cross validation evaluation results for folds=10:

CV evaluation (n=10)
Intent evaluation results
train Accuracy: 1.000 (0.000)
train F1-score: 1.000 (0.000)
train Precision: 1.000 (0.000)
test Accuracy: 0.905 (0.027)
test F1-score: 0.883 (0.033)
test Precision: 0.874 (0.037)

My cross validation evaluation results for folds=5:

CV evaluation (n=5)
Intent evaluation results
train Accuracy: 1.000 (0.000)
train F1-score: 1.000 (0.000)
train Precision: 1.000 (0.000)
test Accuracy: 0.886 (0.017)
test F1-score: 0.871 (0.017)
test Precision: 0.885 (0.016)

Nlu_config pipeline:

pipeline:

name: “tokenizer_whitespace”
name: “intent_featurizer_count_vectors”
name: “intent_classifier_tensorflow_embedding” intent_tokenization_flag: true

akelad · December 20, 2018, 11:40am

I would maybe reduce the amount of epochs you train it for a bit, but this kind of behaviour is expected for the tensorflow pipeline

Topic		Replies	Views
Rasa NLU crossvalidation result Rasa Open Source	3	2438	October 25, 2018
Is there any way to check incorrect prediction while cross validation? Rasa Open Source	0	436	May 8, 2019
NLU Performance Rasa Open Source	4	601	June 18, 2020
Cross validation results explanations Rasa Open Source	0	757	April 16, 2020
Training the nlu Rasa Open Source	1	434	May 9, 2022

Rasa NLU Cross Validation Evaluation

Related topics