Extracting pre-defined entities with PhraseMatcher

Amotz · November 12, 2018, 2:22pm

I saw that phrase-matching NER was addressed in PR #822, but it was never merged. It’s said to be implemented as part of PR #1312.

However, the latter doesn’t include entity_phrases. Instead, it uses the lookup table to generate additional features for the CRF entity extractor.

Has anyone here tried working with such an extractor? And do you know whether phrase-matching NER is on the roadmap?

akelad · November 20, 2018, 10:46am

Hey, no, the lookup table is a replacement for the phrase matcher. It should work just as well.

Amotz · November 20, 2018, 11:41am

Thanks @akelad. According to the lookup table documentation, “For lookup tables to be effective, there must be a few examples of matches in your training data. Otherwise the model will not learn to use the lookup table match features.” This means that lookup tables only add regex-based features for the CRF model. Phrase matching, on the other hand, doesn’t require any training data; it simply matches the listed phrases. So the two extractors will not produce the same output.
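
To make the distinction concrete, here is a minimal sketch using only the Python standard library. It is not Rasa's or spaCy's implementation: the `LOOKUP` table, the function names, and the whitespace tokenizer are all hypothetical, chosen for illustration. The first function turns every match directly into an entity; the second only marks tokens with a boolean flag, which a CRF would still have to learn to use from annotated examples.

```python
import re

# Hypothetical lookup table; the entries are made up for illustration.
LOOKUP = {"city": ["new york", "tel aviv", "berlin"]}


def compile_lookup(phrases):
    """Build one case-insensitive regex that matches any phrase in the table."""
    # Longest phrases first, so a longer entry wins over one of its prefixes.
    ordered = sorted(phrases, key=len, reverse=True)
    pattern = r"\b(?:" + "|".join(re.escape(p) for p in ordered) + r")\b"
    return re.compile(pattern, re.IGNORECASE)


def phrase_match_entities(text, lookup):
    """Direct matching: every hit becomes an entity, no training data needed."""
    entities = []
    for label, phrases in lookup.items():
        for m in compile_lookup(phrases).finditer(text):
            entities.append(
                {"entity": label, "value": m.group(0),
                 "start": m.start(), "end": m.end()}
            )
    return entities


def lookup_features(text, lookup):
    """Feature generation: flag each token that overlaps a lookup match.

    The flags are only inputs to a sequence model; on their own they do not
    decide whether a token is an entity.
    """
    # Simplified whitespace tokenizer (an assumption, not Rasa's tokenizer).
    tokens = [(m.group(0), m.start(), m.end()) for m in re.finditer(r"\S+", text)]
    spans = {
        label: [(m.start(), m.end()) for m in compile_lookup(phrases).finditer(text)]
        for label, phrases in lookup.items()
    }
    features = []
    for token, start, end in tokens:
        flags = {
            label: any(s < end and start < e for s, e in label_spans)
            for label, label_spans in spans.items()
        }
        features.append((token, flags))
    return features


text = "Flights from Tel Aviv to Berlin"
entities = phrase_match_entities(text, LOOKUP)
features = lookup_features(text, LOOKUP)
```

With this input, `entities` contains "Tel Aviv" and "Berlin" as `city` entities straight away, whereas `features` only says that the tokens "Tel", "Aviv", and "Berlin" overlap a lookup match. Whether those tokens end up tagged as entities then depends on what the trained model has learned.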

Amotz · December 2, 2018, 3:06pm

Hi @akelad,

Any update on this one? Thanks!

akelad · December 2, 2018, 8:55pm

But they do serve the same purpose. We decided against merging the NER phrase matcher and created the lookup tables instead, so please use those.
