Intent classification poor even with exact matches

mikehuber · November 12, 2019, 9:54am

I am using Rasa with the default configuration and the pretrained_embeddings_spacy pipeline for intent classification. For one intent I defined 30 training sentences such as “Give me an example”, “Can I have an example”, “One example please”, etc.

After running the pipeline and training the SVM classifier, the intent recognition results are very poor. Even when I use an exact match from the training data, “Give me an example”, the probability of the intent is only 0.08, which is below my threshold (0.2). Note that every training sentence for this intent contains the word “example” and no other intent does, so I would expect a much higher probability.

Any ideas how the intent classification can be improved?

Ghostvv · November 12, 2019, 5:36pm

Is it still the correct intent? How many intents do you have?

mikehuber · November 12, 2019, 5:48pm

Yes, the intent is the right one, but the confidence is too low. There are 12 intents.

mikehuber · November 13, 2019, 8:05am

Any ideas what the problem could be, or is it normal to have such a low confidence?

mikehuber · November 13, 2019, 9:08am

Is there any stop-word removal in spaCy by default?

Ghostvv · November 14, 2019, 12:45pm

This is the confidence of the SVM classifier. It could be low due to the lack of training data.
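
A quick sanity check on that number, as a sketch only. It assumes the classifier returns a normalized probability distribution over all intents, which the thread does not state explicitly. With 12 intents the probabilities sum to 1, so the top intent can never score below 1/12. A reported 0.08 (presumably rounded) is then essentially at that floor: the ranking is correct, but the classifier is close to undecided. The variable names are illustrative.

```python
# Numbers taken from the thread; variable names are illustrative.
n_intents = 12
observed_confidence = 0.08   # reported confidence of the (correct) top intent
threshold = 0.2              # the poster's own fallback threshold

# If the confidences form a probability distribution over all intents,
# the largest one is at least the uniform value 1 / n_intents.
uniform_confidence = 1.0 / n_intents

print(f"uniform (chance-level) confidence: {uniform_confidence:.3f}")
print(f"observed confidence:               {observed_confidence:.3f}")
print(f"below threshold: {observed_confidence < threshold}")
```

Under that assumption, a threshold of 0.2 would reject the prediction even though it is the highest-ranked intent.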

jonathanpwheat · November 15, 2019, 5:37pm

As a test, what if you change your pipeline to supervised_embeddings and retrain? Do you get better responses?

mikehuber · November 16, 2019, 7:34am

The confidence was a bit better with supervised_embeddings, but not by much. How easy is it to add stop-word removal or TF-IDF weighting to the word vectors? And can I output the word vectors of my sentences for debugging?
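
For the TF-IDF idea, here is a minimal plain-Python sketch of what weighting word vectors before averaging could look like. The toy vectors, the toy corpus and the function names are made up for illustration. This is not Rasa's or spaCy's API, and the smoothed IDF formula is just one common choice.

```python
import math
from collections import Counter

# Toy 3-dimensional "word vectors" (made-up numbers, illustration only).
TOY_VECTORS = {
    "give": [0.2, 0.1, 0.0],
    "me": [0.1, 0.3, 0.1],
    "an": [0.0, 0.2, 0.2],
    "example": [0.9, 0.0, 0.4],
}

# Toy training corpus (hypothetical sentences across several intents).
TOY_CORPUS = [
    "give me an example",
    "give me an answer",
    "give me the weather",
    "can you give me an update",
    "one example please",
]


def idf_weights(sentences):
    """Smoothed inverse document frequency: ln((1 + n) / (1 + df)) + 1."""
    n = len(sentences)
    doc_freq = Counter()
    for sentence in sentences:
        doc_freq.update(set(sentence.lower().split()))
    return {w: math.log((1 + n) / (1 + df)) + 1.0 for w, df in doc_freq.items()}


def weighted_sentence_vector(sentence, vectors, idf):
    """Weighted average of word vectors; words missing from idf get weight 1."""
    dim = len(next(iter(vectors.values())))
    total = [0.0] * dim
    weight_sum = 0.0
    for token in sentence.lower().split():
        if token not in vectors:
            continue  # skip out-of-vocabulary words
        weight = idf.get(token, 1.0)
        weight_sum += weight
        for i, value in enumerate(vectors[token]):
            total[i] += weight * value
    if weight_sum == 0.0:
        return total
    return [value / weight_sum for value in total]


idf = idf_weights(TOY_CORPUS)
plain = weighted_sentence_vector("Give me an example", TOY_VECTORS, {})
weighted = weighted_sentence_vector("Give me an example", TOY_VECTORS, idf)
print("plain average:   ", plain)
print("tf-idf weighted: ", weighted)
```

Passing an empty dict as `idf` gives the plain average, so both variants can be printed side by side when debugging. In this toy setup the rarer word "example" pulls the weighted vector further in its own direction than the frequent words "give" and "me" do.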

Ghostvv · November 18, 2019, 11:07am

You need to hack into SpacyFeaturizer to see the word vectors. For stop-word removal, if you use the spaCy pipeline, you'd need to write a custom component.
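
A rough sketch of the core logic such a custom component might contain: print each token's vector for debugging, then average only the non-stop-word vectors. It is written against any token object exposing `text`, `vector` and `is_stop` (spaCy tokens have these attributes). The `ToyToken` class, the stop-word flags and the function names are hypothetical stand-ins so the snippet runs without spaCy, and the wiring into Rasa's component interface is not shown.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class ToyToken:
    """Hypothetical stand-in for a spaCy token (text, vector, is_stop)."""
    text: str
    vector: List[float]
    is_stop: bool


def debug_vectors(tokens: Sequence) -> None:
    """Print every token with its stop-word flag and vector."""
    for token in tokens:
        print(f"{token.text!r:12} stop={token.is_stop!s:5} vector={list(token.vector)}")


def mean_vector_without_stop_words(tokens: Sequence) -> List[float]:
    """Average the vectors of non-stop tokens; fall back to all tokens."""
    if not tokens:
        return []
    kept = [t for t in tokens if not t.is_stop] or list(tokens)
    dim = len(kept[0].vector)
    return [sum(t.vector[i] for t in kept) / len(kept) for i in range(dim)]


# Made-up vectors and stop-word flags, for illustration only.
toy_sentence = [
    ToyToken("give", [0.2, 0.1, 0.0], is_stop=False),
    ToyToken("me", [0.1, 0.3, 0.1], is_stop=True),
    ToyToken("an", [0.0, 0.2, 0.2], is_stop=True),
    ToyToken("example", [0.9, 0.0, 0.4], is_stop=False),
]

debug_vectors(toy_sentence)
sentence_vector = mean_vector_without_stop_words(toy_sentence)
print("sentence vector:", sentence_vector)
```

The fallback to all tokens avoids an empty feature vector for messages that consist only of stop words.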

rashmi.metri · June 5, 2020, 7:36pm

Hey, how did you finally achieve this?
