The response intent is wrong

Givan · April 28, 2022, 3:22am

Something like that the response of intent classifier is wrong. There are some training data of intent “ask weather”, e.g.

After training, If I type in “what is the weather now”, it returns the right intent “ask_weather”. But if I type in “Open the door”, “Turn on the light” casually, the response is still “ask_weather”. How can I fix this problem?

artemsnegirev · April 28, 2022, 4:35am

Hello!

Could you give more details:

do you have other intents?
can you show pipeline in config?

Givan · April 28, 2022, 8:35am

Hello, @artemsnegirev

I just set up a demo and there are only three intents similar to asking weather. The following picture is my config, I use the spacyNLP with chinese model. And my training data are also chinese. I just translate them to English convenient for posting the question.

ChrisRahme · April 28, 2022, 5:34pm

You have too little training data.

It’s recommended to have at least 10 examples, while you only have 2. Train-test split usually removes 20% of your training data for testing, and since you have only 2 examples it basically takes 50% instead, so you are left with one example only.

And quality is better than quantity of course, examples should be as dissimilar as possible (that does not mean having similar examples is bad, it just doesn’t help much).

Take a look at this example. You don’t have entities so you don’t need as much data.

artemsnegirev · April 29, 2022, 7:06am

I would also add after @ChrisRahme that you dont really need CountVectorsFeaturizer as it may lead to overfitting, Spacy embeddings is enough for your purpose.

You could also try:

use only bpe embeddings for Chinese: BytePairFeaturizer - Rasa NLU Examples
explore this repo: GitHub - howl-anderson/rasa_chinese: rasa_chinese 专门针对中文语言的 rasa 组件扩展包，提供了许多针对中文语言的组件

Givan · May 5, 2022, 2:27am

@ChrisRahme Thanks for your very nice and detailed response. The keypoint is that if the input are other casual words, its output is also ‘ask_weather’. Of course, it works well if the input is really about weather.

ChrisRahme · May 5, 2022, 4:13am

It’s all about training data again.

Consider adding an out-scope intent if the inputs should not be understood by the bot.

Givan · May 6, 2022, 2:43am

@ChrisRahme Yeah. I know that this is due to the amount of training data. Because I have just set up a demo, so there are only several intents. If adding an out-scope intent here, the training examples of this intent are innumerable, since the users can say unexpected anything in the real application.

artemsnegirev · May 6, 2022, 4:32am

Actually you don’t need to add all of out_of_scope examples, DIET with good regularization will try to learn common patterns as long as you use contextual encoders (T5, BERT, …). Training examples just should be diverse as much as you can provide, 50 examples is a good start point. You can access them from rasa demo project. I usually translate this phrases into target language (e.g. russian).

Givan · May 10, 2022, 6:51am

Great suggestions! Is there any method which can assign the text out of all my specified intents to the “out-of-scope” intent?

Givan · May 10, 2022, 12:02pm

@artemsnegirev Any idea for preventing over-fitting, except for adding more data?

artemsnegirev · May 11, 2022, 4:36am

Sure! You can play with learning_rate, drop_rate and regularization_constant as described in DIET hyperparams section. You can use cross validation to fairly check pipeline performance. Also checkout this repo to automate hyperparams search.

llotus_eater · June 9, 2023, 4:47pm

I wrote a blog post discussing basic principles to improve your intent detection model. I know this thread is old and I thought maybe someone will still find this tips useful: You might be training your chatbot wrong | Everything Chatbots Blog

Topic		Replies	Views
Intent classification poor even with exact matches Rasa Open Source	9	1158	June 5, 2020
Rasa bot is not predicting the correct intents Rasa Open Source	11	854	September 23, 2020
Failing at intent classification Rasa Open Source	4	781	August 5, 2019
Model does not return new intents Rasa Open Source	3	441	December 17, 2020
Handling misclassification of intents Rasa Open Source	11	1221	August 18, 2021

The response intent is wrong

Related topics