Need help with identifying multiple entities within a single token

Ajith-Shenoy · April 9, 2019, 12:46pm

Ex. " Attendance for R15CS019"

I’ve been trying to mark “R15cs019” as entity SRN and only “CS” as entity Branch.

Using JSON format, with rasa_nlu trainer.

{ “text”: “What is my attendance r15cs019 ?”, “intent”: “getAttendance”, “entities”: [ { “start”: 22, “end”: 30, “value”: “r15cs019”, “entity”: “SRN” }, { “start”: 25, “end”: 27, “value”: “CSE”, “entity”: “Branch” } ] },

Using spacy_sklearn pipeline, Only SRN entity is recognized. Is there any way to make it recognize both entities simultaneously.

PatrickDS · April 9, 2019, 5:29pm

Is this a standard format? If you can recognize the whole thing using a regex featurizer for example, I think it would be best if you would keep using regexes to extract the middle part that you want as something else.

As a general note, it is my personal opinion that you should only let the entity extractor do the hard work of extracting the substring you want out of the sentence if that substring follows a certain logic, and then use some other thing to do logic on that substring (such as extracting the “CS” for example). Entities should be only used to extract stuff out of user messages. If you need the bot to use those values to drive its logic, use slots, custom actions, the Keras policy, etc.

Ajith-Shenoy · May 1, 2019, 4:52pm

Thanks for the suggestion. I saved it in a slot and indexed it. Thank you.

Topic		Replies	Views
Only recognizing one entity Rasa Open Source varsha	0	531	March 7, 2019
Trouble extracting entities Rasa Open Source	2	396	September 6, 2018
Detecting multiple regexes as separate entities Rasa Open Source	2	228	March 9, 2023
Extract multiple occurrence of an entity in a single statement Rasa Open Source	6	1126	October 14, 2019
How to extract two entities simultaneously using regex Rasa Open Source	0	158	January 31, 2024

Need help with identifying multiple entities within a single token

Related topics