DIETClassifier: Where do pretrained embeddings come from?

John · July 24, 2020, 10:58am

Hi,

I watched the video about the DIETClassifier. I don't understand how tokens can be processed by pretrained embeddings. In the default (non-English) DIET pipeline, there are no pretrained embeddings like spaCy or BERT configured for use.

Is the output of the pretrained embeddings step an empty vector in such a pipeline?

noman · July 27, 2020, 9:06pm

DIET is a plug-and-play model: you can use any custom pretrained embeddings. Rasa provides ConveRT and BERT embeddings, as described in the blog post "How to Use BERT in Rasa NLU". And I think that if no pretrained embeddings are defined in the pipeline, there wouldn’t be an empty vector either.

mloubser · July 28, 2020, 7:34am

Yup, if there are none provided they simply won’t be used; if they are provided, the token will be “looked up” in the pretrained embeddings to get a representation (vector) which will be added to the sparse features.
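
To make that concrete, here is a minimal sketch in plain Python. It is not Rasa's actual code: the toy vocabulary, the `PRETRAINED` table, and the `featurize` helper are all hypothetical, and the real model processes the features further before combining them. It only illustrates that without pretrained embeddings the dense part is absent rather than an empty or zero vector.

```python
from typing import Dict, List, Optional

# Hypothetical toy vocabulary and pretrained table, for illustration only.
VOCAB = ["hello", "world"]
PRETRAINED: Dict[str, List[float]] = {
    "hello": [0.1, 0.2, 0.3],
    "world": [0.4, 0.5, 0.6],
}


def sparse_features(token: str) -> List[float]:
    """One-hot vector over the toy vocabulary (stand-in for sparse features)."""
    return [1.0 if token == word else 0.0 for word in VOCAB]


def featurize(
    token: str, pretrained: Optional[Dict[str, List[float]]] = None
) -> List[float]:
    """Return sparse features, plus a dense lookup only if a table is given."""
    features = sparse_features(token)
    if pretrained is not None:
        # Look the token up in the pretrained table. Falling back to zeros for
        # unknown tokens is an assumption of this sketch.
        dim = len(next(iter(pretrained.values())))
        features = features + pretrained.get(token, [0.0] * dim)
    return features


# No pretrained embeddings configured: only the sparse part exists.
sparse_only = featurize("hello")
# Pretrained embeddings configured: the looked-up vector is appended.
with_dense = featurize("hello", PRETRAINED)
print(len(sparse_only), len(with_dense))
```

The feature vector is simply shorter when no pretrained embeddings are configured; nothing is padded in their place.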
