Different intent confidence depending on unfeaturized entities

Cekir · February 21, 2020, 10:22am

Hi, I need to detect contact details of the employees from the database. For detecting the entities I am using lookup tables. The name and surname are unfeaturized:

entities:
  - employee_first_name
  - employee_surname
  
slots:
  employee_first_name:
    type: unfeaturized
  employee_surname:
    type: unfeaturized

Now, I am getting very different confidence levels on detected intents depending on the names provided, so for example:

User: What is phone number to John Smith
intent: ask_phone, confidence 0,8
User: What is phone number to Janet Jackson
intent: ask_phone, confidence 0,26

My guess is that the name and surname get features that are used for intent detection. If so, how to avoid it?

I am attaching the config file config.yml (705 Bytes)

Ghostvv · February 26, 2020, 11:27am

It’s not that simple to do. You would need to create your custom tokenizer that removes them from calculation of sentence features

Cekir · February 26, 2020, 1:16pm

I have actually added a custom nlu component that anonimizes the names in the message after the entities are detected but before the featurizer starts, but it does not work. Is this because the features are set in tokens in the tokenizer, and they are just extracted from the tokens in the featurizer?

Would that be then a good approach to remove the tokens Related to the entities and add new, anonimized tokens? Or Maybe just change the vector value in the tokens? Can you point me to some sources that could help me doing that?

Or maybe there is a completely different solution to my problem?

Topic		Replies	Views
Confusion in recogninzing the entity Rasa Open Source	4	890	August 6, 2019
Named Entity Mentions as they relate to Intents Rasa Open Source	6	1999	December 18, 2019
Confusion with intent-entity-featurizer-regex Rasa Open Source	2	1927	December 5, 2019
Problem intent/entities Rasa Open Source	0	203	January 8, 2021
Setting intent if spacy entity is recognized Rasa Open Source	3	1378	November 6, 2018

Different intent confidence depending on unfeaturized entities

Related topics