Which is the best entity extractor for random names

Vishwas · April 21, 2022, 12:40pm

Hi,

I am trying to match entities that can contain any character [A-Za-z0-9_.,-"\s()’] etc. Can someone please tell me the best entity extractor to use

Examples:

Universal Plug N’ Play Event
Point-to-Point Tunnelling Protocol,ms-content-repl-srv
DirectInternet,VPN,ServiceLink,PrivateWAN,PrivateVPN
US-0402_ABC Airport Store
UT_St. Louis Sales
Hi-Tech Hley (NSW),Hy-Tech Bangalore (NSW)

stephens · April 29, 2022, 5:10am

There’s a blog post on this here

Vishwas · May 4, 2022, 10:35am

@stephens i had gone through the post. The question i had was would DIET suffice or would i need to include ner_crf for custom and random entities

stephens · May 4, 2022, 4:16pm

I think DIET is fine. I’m assuming you can’t come up with a reasonable list of what the user could enter in this case. I think you should try to prompt the user for the value in such a way that you can use from_text to extract the entity?

Vishwas · May 23, 2022, 1:01pm

Yes. Thats is an option. But in our case the user enters the entire command which first reaches our backend and then to the nlp server. So that is actually not a useful option for us In most cases i am seeing that it is failing as some strings are not in training data and model is not able to generalize

Topic		Replies	Views
Rasa NLU in Depth - Part 2: Entity Recognition Tutorials, Resources & Videos	11	4043	February 13, 2023
Entities can't get extracted with regex Rasa Open Source	18	1213	January 18, 2022
The "two entity extractor" problem - do I really need to write custom code for stories? Rasa Open Source	3	1802	July 30, 2021
Has anyone successfully implemented strict regex patterns for entity extraction? Rasa Open Source	1	251	July 3, 2023
Double entity extraction using DIETClassifier & RegexEntityExtractor Rasa Open Source	4	1146	May 7, 2021

Which is the best entity extractor for random names

Related topics