Entity not extracted, if particular value not used in the training data

shota · November 9, 2018, 1:05am

Hi,

I have an intent, with one entity. The entity itself has around 20 possible values and each value has approximately 20 synonyms. Using tensorflow_embedding, I trained the model on ~500 examples and it identifies intent/entity with high confidence if it’s seen the value of the entity in the examples, but otherwise, it misses. Example (below is entity/value/synonyms hierarchy)

spending_category 
- auto_and_transport
    -- auto and transport
    -- auto
    -- transport
- food_and_groceries
    -- food
    -- groceries
    -- food and groceries

If I provide at least one example with each entity value (not necessarily with every synonym), everything works, but if I provide examples with only auto_and_transport values, but not with food_and_groceries, than rasa does not extract correct entity value from the user input, when it has not seen it in the example. Do I miss something?

znat · November 9, 2018, 1:49am

Have you tried removing the low parameter from the ner_crf config? From the array in the middle

shota · November 9, 2018, 2:04am

Hi @znat according to documentation, the default config looks like this:

features: [["low", "title"], ["bias", "suffix3"], ["upper", "pos", "pos2"]]

so I suppose it’s already removed, right?

shota · November 13, 2018, 9:21pm

OK, so turned out I had a bug in my training file. Everything works like a charm.

Topic		Replies	Views
Cannot get entity extraction to work with Rasa NLU Rasa Open Source	4	2178	October 15, 2019
Intent classification failing when entity extraction is performed Getting Started with Rasa	4	173	December 19, 2018
NLU gets one-word entity right, misses extraction Rasa Open Source	2	315	October 20, 2020
Entity Extraction ner_crf Rasa Open Source	1	822	August 13, 2019
Separate training data for crf_entity_extractor Rasa Open Source	1	484	November 28, 2019

Entity not extracted, if particular value not used in the training data

Related topics