How to access DIET embedding vectors?

bayesianwannabe · December 24, 2020, 3:05pm

Hey all,

I want to access the dense vector obtained after convergence from the part marked with a red circle (this architecture picture without this mark is from the Rasa Whiteboard video: Rasa Algorithm Whiteboard - Diet Architecture 1: How it Works - YouTube).

I want to use the message vectors to analyze through some data mining or ML algorithms and obtain some general insights.

Now, I do understand that from a Message object applied to an NLU interpreter like presented on this blog post might offer me some sparse and dense features, probably the ones constructed before the DIETClassifier call.

Below is what I tried:

featurized_msg = nlu_interpreter.featurize_message(train_data.intent_examples[0])
featurized_msg.get_dense_features('text')[0].features # Seems to offer me the embedding for each token
featurized_msg.get_dense_features('text')[1].features # Seems to offer me the average of the vectors from the previous element as an embedding of the phrase

But then again, I think those are the vectors I obtain from my pre-trained word embedding from spaCy. Does anyone knows how can I access the word embeddings obtained from the DIET model?

Any help on this would be very appreciated!

Cheers

koaning · January 6, 2021, 3:21pm

That’s a great question and we’re actually working on a feature that does this. You can find the PR here.

Once the feature is ready I’ll likely also add it as an easy to use the component in whatlies so that you may inspect the “DIET”-embeddings from Jupyter.

bayesianwannabe · January 20, 2021, 12:07pm

That’s nice! Thank you, Vincent. I really enjoy your Rasa Whiteboard videos.

It’s good to know that this is going to be addressed as I barely have experience using TensorFlow directly.

I also didn’t know about the whatlies lib, going to keep an eye on it.

Topic		Replies	Views
how to see the word embedding representation used by rasa given a model? Rasa Open Source	2	697	January 22, 2021
Featurizer for DIET Rasa Open Source	7	1469	May 15, 2020
Access to the input generated for the DIET Classifier Rasa Open Source	2	323	July 24, 2020
DIETClassifier: Where do pretrained embeddings come from? Rasa Open Source	2	1257	July 28, 2020
Question on Algorithms Whiteboard How DIET works Rasa Open Source	1	298	October 7, 2021

How to access DIET embedding vectors?

Related topics