Are the lemmatized words used as features for NER_CRF?

datistiquo · August 13, 2018, 3:48pm

For german you have a lot of possible forms which makes training difficult. Like training

ein/eine/einer/eines in front of the entity

Are the trained words used as training or some lemmatized form?

So I would just like train

ein Hotel instead together with

eines Hotels einem Hotel

If just used the pure word you need for german train all posible grammar form, because otherwise entity will not extracted?

Where would I have to implement my own Stemmer?

Topic		Replies	Views
Remove Stop words for NER_CRF? Rasa Open Source	0	625	August 13, 2018
NER_CRF generalizes very badly Rasa Open Source	9	820	November 27, 2019
Using NER as a Feature for CRFEntityExtractor Rasa Open Source	6	1474	June 28, 2021
Rasa NLU 0.13.0 is released! Release Announcements	7	829	August 7, 2018
Crf_entity_extractor with ner_features Rasa Open Source	1	575	February 25, 2020