Is a TF-IDF featurizer beneficial in Rasa as a custom featurizer?

lokkrish · June 22, 2021, 9:35am

Hi,

I am wondering why a TF-IDF featurizer is not part of the Rasa components. It is easy to implement and similar to the CountVectorsFeaturizer, but I don't understand why it is not provided. Is there a reason for that, such as this featurizer performing poorly or degrading the model?
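For readers unfamiliar with the difference, here is a minimal pure-Python sketch of how TF-IDF relates to plain counts: it takes the same bag-of-words counts and multiplies each term by an inverse-document-frequency weight. The function names (`count_features`, `tfidf_features`) and the example sentences are made up for illustration, and the smoothed IDF formula used here is one common variant, not something taken from Rasa.

```python
import math
from collections import Counter


def count_features(docs):
    """Bag-of-words counts per document over a shared, sorted vocabulary."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    counts = []
    for toks in tokenized:
        term_counts = Counter(toks)
        counts.append([term_counts[term] for term in vocab])
    return vocab, counts


def tfidf_features(docs):
    """Same counts, reweighted by a smoothed inverse document frequency."""
    vocab, counts = count_features(docs)
    n_docs = len(docs)
    # Document frequency: in how many documents does each term appear?
    doc_freq = [sum(1 for row in counts if row[j] > 0) for j in range(len(vocab))]
    # Smoothed IDF (assumed variant): rarer terms receive a larger weight.
    idf = [math.log((1 + n_docs) / (1 + df)) + 1.0 for df in doc_freq]
    weighted = [[c * w for c, w in zip(row, idf)] for row in counts]
    return vocab, weighted


# Hypothetical training examples for illustration only.
docs = [
    "i want to book a flight",
    "i want to cancel a flight",
    "what is the weather",
]
vocab, counts = count_features(docs)
_, tfidf = tfidf_features(docs)

book, flight = vocab.index("book"), vocab.index("flight")
print("counts:", counts[0][book], counts[0][flight])
print("tf-idf:", round(tfidf[0][book], 3), round(tfidf[0][flight], 3))
```

In the first sentence, "book" and "flight" have the same count, but "book" gets the higher TF-IDF weight because it appears in fewer examples. The only change relative to counts is a fixed per-term rescaling.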

harloc · June 23, 2021, 7:41am

There is just no need to provide the frequency weighting, because the transformer itself learns to predict the intent from the occurrence and combination of certain words. The embeddings are also passed through a feed-forward network first. Personally, I usually don’t use featurizers based on the occurrence of whole words; in my opinion they are too sensitive to misspellings, so I prefer subword or n-gram-based methods. In the end, you can give it a shot and try it out with a custom featurizer; maybe you will find something interesting.
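The point about misspellings can be illustrated with a small pure-Python sketch that compares whole-word features with character trigram features for a clean sentence and a misspelled one. The helper names and the example sentences are hypothetical, and Jaccard overlap is used only as a simple stand-in for "how much of the feature set survives the typo"; it is not how Rasa compares features internally.

```python
def word_features(text):
    """Set of whole-word tokens (lowercased, whitespace split)."""
    return set(text.lower().split())


def char_ngram_features(text, n=3):
    """Set of character n-grams, with one space of padding on each side."""
    padded = f" {text.lower()} "
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def jaccard(a, b):
    """Share of features two sets have in common (1.0 means identical)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


# Hypothetical user messages: the second contains two typos.
clean = "cancel my subscription"
typo = "cancle my subscripton"

word_sim = jaccard(word_features(clean), word_features(typo))
char_sim = jaccard(char_ngram_features(clean), char_ngram_features(typo))

print(f"word overlap:         {word_sim:.2f}")
print(f"char trigram overlap: {char_sim:.2f}")
```

With whole words, only "my" survives the typos, so the two messages share few features. Most character trigrams are unaffected, which is why subword features tend to degrade more gracefully. As far as I know, the CountVectorsFeaturizer in Rasa can be configured to work on character n-grams instead of words, so a custom component may not be needed for this part.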
