Extracting entity separated by space

Shreya · July 8, 2021, 6:16am

Hi suppose I have entity name like - Space Warriors
I want the entity to be detected as [Space Warriors] and not just [Space]

What would be the best way to achieve this?

Any help would be appreciated. Thank you.

souvikg10 · July 8, 2021, 7:18am

As long as I remember, DIET should be able to handle spaces between tokens. You need provide training data with tagged entities in your nlu.md

If it is a limited list then perhaps best to also use lookup tables.

Shreya · July 8, 2021, 8:19am

I tried using look up tables, it doesnt work as expected.

souvikg10 · July 8, 2021, 8:19am

what goes wrong? Usually lookup tables uses Regex Featurizer so you should provide example of the pattern of sentences that might have items mentioned in your lookup table

Shreya · July 8, 2021, 12:45pm

My NLU:

intent: intent_lookup examples: |
- details for [Google]{“entity”: “company”}
- [Google]{“entity”: “company”}
lookup: company examples: |
- Das capitals
- J P Morgan
- Your Story
- Linked In
- Infineon Technologies
- Tata Steel
- Morgan Stanley
- App developer studio
- Moonfrog Labs
- Hacker Earth

STORIES

story: company_lookup steps:
- intent: intent_lookup
- action: utter_lookup

I have mentioned the intent and entities in Domain file.

When I type the names in lookup table I am either getting the first work or second word.

souvikg10 · July 8, 2021, 1:16pm

you need to provide examples from the lookup table into your training data. Google is quite simple, perhaps use companies with more than one token as an example. i know you don’t have to give the list of all companies as examples but some would help DIET learn better the token positions

you might be using Lexical Features

Take a look at this to understand how

Shreya · July 8, 2021, 1:57pm

If I am using Lookups, do I need to make any changes to config.yml? If yes then what would those changes be?

souvikg10 · July 8, 2021, 3:58pm

it is sort of fine tuning. usually the default params in the config should work for you. but if you have to fine tune your models to get better in extracting entities, perhaps it is best to tune lexical features and see how it impacts your model.

I would say for lookups, provide more examples to help learn the regex features because it creates patterns.

Topic		Replies	Views
Look up table in rasa Tutorials, Resources & Videos	3	1762	May 7, 2020
Lookup Table not working for DIET Classifier + RegexFeaturizer Rasa Open Source	10	2123	June 29, 2021
Improving Extraction of Alphanumeric Entity Rasa Open Source	8	1836	June 30, 2019
Lookup table for language that has no space to separate words Rasa Open Source	9	1305	April 6, 2020
The "two entity extractor" problem - do I really need to write custom code for stories? Rasa Open Source	3	1801	July 30, 2021

Extracting entity separated by space

Related topics