Why doesn't CRF generalize well?

twittmin · August 5, 2019, 4:39pm

I have a Japanese robot trained with CRF NE model. I works well on evaluation of a split of training-testing data. However, I tested a few cases with new names to replace the existing names in the training data, it generalize little. Examples:

[福原愛]に電話をかけてください。// I replace '福原愛' with another name '三口百惠'，and it didn't work

[消防署]に電話。// I replace '消防署' with '消防', and it didn't work.

Does the ‘name’ has to occur at least once in the training data?

Any feature manipulation can improve this?

IgNoRaNt23 · August 7, 2019, 10:38am

No, not every entity has to appear to in the training data. The question how will it generalizes depends on how strict your pattern is, if the same keywords appear and so on.

This might be really helpfull

twittmin · August 7, 2019, 4:55pm

@IgNoRaNt23, in the blog, it says " To use regular expressions and / or lookup tables add the intent_entity_featurizer_regex component before the ner_crf component in your pipeline."

What is “intent_entity_featurizer_regex”? Is it the same as ‘RegexFeaturizer’? Is it necessary when you lookup table?

Juste · August 8, 2019, 9:44am

Hey @twittmin. Yes, intent_entity_featurizer_regex was renamed to RegexFeaturizer. Yep, you should have this component if you are using lookup tables, because it’s one of the components which are used to extract the patterns.

Topic		Replies	Views
NER_CRF generalizes very badly Rasa Open Source	9	924	November 27, 2019
Lookup table is supposed to classify entities, but does it influence intent prediction? Rasa Open Source	4	1127	April 15, 2021
NER_CRF model is not generalizing Rasa Open Source	3	838	December 2, 2019
My Pipeline is not using the component ner-crf when I use look up tables Rasa Open Source	25	2224	November 7, 2018
Lookup Table or Multiple Examples? Rasa Open Source	12	3547	December 18, 2023

Why doesn't CRF generalize well?

Related topics