Handling misclassification of intents

Honestly, this could seem like a vague question, but it’s because I am exhausted from trying to fix this and I don’t know what else to try. I can explain further if I get a response and if it’s required.

What causes a case where adding (one) more intent completely destroys the performance of the entire bot? To the point where previous intents that were correctly classified suddenly start being predicted as nlu_fallback.

What causes this, and what are some suggested ways to fix it?


After sending /restart, will the conversation be correct?

@Dustyposa . I’m not sure I understand your response. Are you sure you were trying to respond to my post? I don’t see a relation between your response and my question.

Thank you.

@Dustyposa . No, you’ve missed the point.

This is not a conversation issue; rather, it’s a misclassification of intents during training. Let’s say I have 3 intents and I train; the bot works fine when I interact with it. But after I add one more intent (to make it 4) and train again, the model/bot performance becomes rubbish when I interact with it. It even starts missing the 3 intents that were previously correct.

I hope you understand now?

@Dustyposa OK, I think I have not been able to communicate my message well enough.

So I’m afraid none of your responses answer my question, but I really appreciate your help.

Can you please delete all your responses so that future readers don’t get confused?

Thank you for your help once again.

The same thing happened to me. When you provide more examples for an intent, there is a chance that your bot gets confused, because the new examples can be similar to the examples under other intents…

You can create a confusion graph; that will also give you an errors JSON, from which you can check and correct the examples…
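If it helps, this is roughly how I generate them (a sketch, assuming a trained model and the default project layout; your data path may differ):

rasa test nlu --nlu data/nlu.yml
# writes evaluation reports into results/, including the intent
# confusion matrix image and an intent_errors.json that lists the
# misclassified examples with their predicted intents and confidences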

You can also try changing the classifier. I changed from DIETClassifier to another classifier, and the results were good with that.

Thank you @ermarkar . Your suggestions make sense.

Couple of questions:

You can create a confusion graph; that will also give you an errors JSON, from which you can check and correct the examples…

Can you please elaborate on this, or share a link to a resource that would be helpful? Also, using this graph, how do you “correct the examples”?

You can also try changing the classifier. I changed from DIETClassifier to another classifier, and the results were good with that.

This is also smart, but how did you handle entity extraction, since DIET also extracts entities?

Based on your requirements, you can choose the classifier and configuration from this link.
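As a rough sketch of what a non-DIET pipeline can look like (not my exact config; it assumes spaCy and its English model are installed, and the component names are from the Rasa docs):

language: en
pipeline:
  - name: SpacyNLP
  - name: SpacyTokenizer
  - name: SpacyFeaturizer
  - name: RegexFeaturizer
  - name: SklearnIntentClassifier   # replaces DIETClassifier for intents
  - name: CRFEntityExtractor        # takes over entity extraction from DIET
  - name: FallbackClassifier
    threshold: 0.3

This also touches on the entity question above: since DIET is gone, you add a dedicated extractor such as CRFEntityExtractor.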

And to test and generate the confusion graph and JSON:

Split the data using the split command (it splits 80/20), then train the model on the new split data and test using

rasa test
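In full, roughly like this (a sketch; the exact file names written by the split can vary between Rasa versions):

rasa data split nlu
# writes an 80/20 split into train_test_split/

rasa train nlu --nlu train_test_split/training_data.yml
# trains the NLU model on the 80% training portion

rasa test nlu --nlu train_test_split/test_data.yml
# evaluates on the held-out 20% and writes the confusion
# matrix image and intent_errors.json into results/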

In one of my projects, Rasa was not working properly even after spending many days refactoring. In that case, for the new intents, I switched to an ELMo embedding model to predict the extra intents.

@ermarkar Thank you. I actually knew about the confusion graph. All my values are along the diagonal, which means my model is not confusing one intent with another; what is happening is that it is predicting intents as nlu_fallback.

In this case, I wouldn’t know if the confusion graph is still useful. However, I will try changing the classifier and see.
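For reference, my understanding is that nlu_fallback comes from the FallbackClassifier in my pipeline, so either the confidences dropped after adding the new intent or my threshold is set too high. Something like this (a sketch, not my exact config):

pipeline:
  # ... tokenizer / featurizers / intent classifier ...
  - name: FallbackClassifier
    threshold: 0.7            # below this top-intent confidence,
                              # nlu_fallback is predicted instead
    ambiguity_threshold: 0.1  # also falls back when the top two
                              # intents are too close in confidence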

@anca can you please look at this when you’ve got the time?

Thanks.

Hi @laboratory . Firstly, can you please post the config of your NLU pipeline?

Secondly, have you created a train/test split of your complete data? If yes, then you can check the approximate confidence values of the predictions by running rasa test nlu -u <path to test data>. The confidence values are visible in intent_confidence_histogram.png. You are particularly looking for bars that are green but predicted with low confidence.

As others have pointed out, when you add a new intent there is a possibility that you add examples under it which are semantically very similar to the examples under some other intent. That will cause confusion in the model’s predictions, and confidence values will start getting low. If you can post some examples of the new and old intents, we could try to spot some overlapping ones.
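For example, the overlap often looks something like this (hypothetical intents, just to illustrate what to look for):

nlu:
- intent: check_balance
  examples: |
    - how much money do I have
    - what is my account balance
- intent: check_transactions    # newly added; phrasing overlaps with check_balance
  examples: |
    - how much money did I spend
    - what has happened in my account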

@dakshvar22 … sorry for the late reply, I just saw this. Please see the NLU pipeline in the attached config.yml.

config.yml (1.6 KB)